Package org.apache.any23.filter
This package defines specific
TripleHandler
s able to filter document and triples that
don't respect particular conditions.-
Class Summary Class Description ExtractionContextBlocker A wrapper around aTripleHandler
that can block and unblock calls to the handler, either for the entire document, or for individualExtractionContext
s.IgnoreAccidentalRDFa ATripleHandler
that suppresses output of the RDFa parser if the document only contains "accidental" RDFa, like stylesheet links and other non-RDFa uses of HTML'sIgnoreTitlesOfEmptyDocuments ATripleHandler
that suppresses output of theTitleExtractor
unless some other triples could be parsed from the document.