Class MicrodataExtractor
- java.lang.Object
-
- org.apache.any23.extractor.microdata.MicrodataExtractor
-
- All Implemented Interfaces:
Extractor<Document>,Extractor.TagSoupDOMExtractor
public class MicrodataExtractor extends Object implements Extractor.TagSoupDOMExtractor
Default implementation of Microdata extractor, based onExtractor.TagSoupDOMExtractor.- Author:
- Michele Mostarda (mostarda@fbk.eu), Davide Palmisano ( dpalmisano@gmail.com ), Hans Brende (hansbrende@apache.org)
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from interface org.apache.any23.extractor.Extractor
Extractor.BlindExtractor, Extractor.ContentExtractor, Extractor.TagSoupDOMExtractor
-
-
Constructor Summary
Constructors Constructor Description MicrodataExtractor()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description ExtractorDescriptiongetDescription()Returns aExtractorDescriptionof this extractor.voidrun(ExtractionParameters extractionParameters, ExtractionContext extractionContext, Document in, ExtractionResult out)This extraction performs the Microdata to RDF conversion algorithm.
-
-
-
Method Detail
-
getDescription
public ExtractorDescription getDescription()
Description copied from interface:ExtractorReturns aExtractorDescriptionof this extractor.- Specified by:
getDescriptionin interfaceExtractor<Document>- Returns:
- the object representing the extractor description.
-
run
public void run(ExtractionParameters extractionParameters, ExtractionContext extractionContext, Document in, ExtractionResult out) throws IOException, ExtractionException
This extraction performs the Microdata to RDF conversion algorithm.- Specified by:
runin interfaceExtractor<Document>- Parameters:
extractionParameters- the parameters to be applied during the extraction.extractionContext- The document context.in- The extractor input data.out- the collector for the extracted data.- Throws:
IOException- On error while reading from the input stream.ExtractionException- On other error, such as parse errors.
-
-