Package org.apache.any23.extractor.csv
Class CSVExtractor
- java.lang.Object
-
- org.apache.any23.extractor.csv.CSVExtractor
-
- All Implemented Interfaces:
Extractor<InputStream>
,Extractor.ContentExtractor
public class CSVExtractor extends Object implements Extractor.ContentExtractor
This extractor produces RDF from a CSV file . It automatically detects fields delimiter. If not able uses the one provided in the Any23 configuration.- Author:
- Davide Palmisano ( dpalmisano@gmail.com )
- See Also:
CSVReaderBuilder
-
-
Nested Class Summary
-
Nested classes/interfaces inherited from interface org.apache.any23.extractor.Extractor
Extractor.BlindExtractor, Extractor.ContentExtractor, Extractor.TagSoupDOMExtractor
-
-
Constructor Summary
Constructors Constructor Description CSVExtractor()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description ExtractorDescription
getDescription()
Returns aExtractorDescription
of this extractor.void
run(ExtractionParameters extractionParameters, ExtractionContext extractionContext, InputStream in, ExtractionResult out)
Executes the extractor.void
setStopAtFirstError(boolean f)
Iftrue
, the extractor will stop at first parsing error, iffalse
the extractor will attempt to ignore all parsing errors.
-
-
-
Method Detail
-
setStopAtFirstError
public void setStopAtFirstError(boolean f)
Iftrue
, the extractor will stop at first parsing error, iffalse
the extractor will attempt to ignore all parsing errors.- Specified by:
setStopAtFirstError
in interfaceExtractor.ContentExtractor
- Parameters:
f
- tolerance flag.
-
run
public void run(ExtractionParameters extractionParameters, ExtractionContext extractionContext, InputStream in, ExtractionResult out) throws IOException, ExtractionException
Executes the extractor. Will be invoked only once, extractors are not reusable.- Specified by:
run
in interfaceExtractor<InputStream>
- Parameters:
extractionParameters
- the parameters to be applied during the extraction.extractionContext
- The document context.in
- The extractor input data.out
- the collector for the extracted data.- Throws:
IOException
- On error while reading from the input stream.ExtractionException
- On other error, such as parse errors.
-
getDescription
public ExtractorDescription getDescription()
Returns aExtractorDescription
of this extractor.- Specified by:
getDescription
in interfaceExtractor<InputStream>
- Returns:
- the object representing the extractor description.
-
-