public class ExcelExtractor extends Object implements Extractor.ContentExtractor
An Extractor.ContentExtractor that processes MS Excel 97-2007+ files (.xls/.xlsx) and
converts the detected content to triples.
This extractor is based on the Apache POI-HSSF and POI-XSSF Java APIs.

Nested interfaces inherited from Extractor: Extractor.BlindExtractor, Extractor.ContentExtractor, Extractor.TagSoupDOMExtractor

| Constructor and Description |
|---|
| ExcelExtractor() |

| Modifier and Type | Method and Description |
|---|---|
| ExtractorDescription | getDescription() Returns the ExtractorDescription of this extractor. |
| boolean | isStopAtFirstError() |
| void | run(ExtractionParameters extractionParameters, ExtractionContext context, InputStream in, ExtractionResult er) Executes the extractor. |
| void | setStopAtFirstError(boolean f) If true, the extractor will stop at the first parsing error; if false, the extractor will attempt to ignore all parsing errors. |
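
The tolerance flag described for setStopAtFirstError(boolean f) can be illustrated with a minimal, self-contained sketch. It does not use Any23 or POI, and it is not the extractor's actual implementation: the class TolerantParsingSketch, its Result type, and the parseAll method are hypothetical names introduced only to show the two behaviours (fail on the first parsing error, or record the error and continue).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration of a "stop at first error" flag.
// Not part of Any23; all names here are assumptions made for this sketch.
class TolerantParsingSketch {

    /** Values parsed successfully, plus the errors skipped in tolerant mode. */
    static final class Result {
        final List<Integer> values = new ArrayList<>();
        final List<String> skippedErrors = new ArrayList<>();
    }

    /**
     * Parses each cell as an integer.
     * If stopAtFirstError is true, the first unparsable cell aborts the run;
     * if false, the error is recorded and parsing continues with the next cell.
     */
    static Result parseAll(List<String> cells, boolean stopAtFirstError) {
        Result result = new Result();
        for (int i = 0; i < cells.size(); i++) {
            String cell = cells.get(i);
            try {
                result.values.add(Integer.parseInt(cell.trim()));
            } catch (NumberFormatException e) {
                if (stopAtFirstError) {
                    throw new IllegalArgumentException(
                            "Cannot parse cell " + i + ": " + cell, e);
                }
                result.skippedErrors.add("cell " + i + ": " + cell);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> cells = Arrays.asList("1", "oops", "3");

        // Tolerant mode: the bad cell is skipped.
        Result tolerant = parseAll(cells, false);
        System.out.println(tolerant.values);        // prints [1, 3]
        System.out.println(tolerant.skippedErrors); // prints [cell 1: oops]

        // Strict mode: the first bad cell stops processing.
        try {
            parseAll(cells, true);
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());     // prints Cannot parse cell 1: oops
        }
    }
}
```

In this sketch, strict mode surfaces the failure as an exception, which loosely mirrors how run(...) declares ExtractionException for parse errors; whether the real extractor reports skipped errors in tolerant mode is not stated in this page.
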
public boolean isStopAtFirstError()

public void setStopAtFirstError(boolean f)
Description copied from interface: Extractor.ContentExtractor
If true, the extractor will stop at the first parsing error; if false, the extractor will attempt to ignore all parsing errors.
Specified by: setStopAtFirstError in interface Extractor.ContentExtractor
Parameters:
f - tolerance flag.

public ExtractorDescription getDescription()
Description copied from interface: Extractor
Specified by: getDescription in interface Extractor<InputStream>
Returns: the ExtractorDescription of this extractor.

public void run(ExtractionParameters extractionParameters, ExtractionContext context, InputStream in, ExtractionResult er) throws IOException, ExtractionException
Description copied from interface: Extractor
Executes the extractor.
Specified by: run in interface Extractor<InputStream>
Parameters:
extractionParameters - the parameters to be applied during the extraction.
context - the document context.
in - the extractor input data.
er - the collector for the extracted data.
Throws:
IOException - on error while reading from the input stream.
ExtractionException - on other errors, such as parse errors.

Copyright © 2010–2019 The Apache Software Foundation. All rights reserved.