public class ExcelExtractor extends Object implements Extractor.ContentExtractor
An Extractor.ContentExtractor able to process MS Excel 97-2007+ files (.xls/.xlsx) and convert the detected content to triples. This extractor is based on the Apache POI-HSSF and POI-XSSF Java APIs.
Nested classes/interfaces inherited from interface Extractor: Extractor.BlindExtractor, Extractor.ContentExtractor, Extractor.TagSoupDOMExtractor
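
To make "convert the detected content to triples" more concrete, here is a minimal sketch of one way spreadsheet cells could be mapped to subject/predicate/object statements. It does not use Any23 or Apache POI and is not the extractor's actual mapping: the class SheetToTriplesSketch, the Triple type, the example.org IRIs, and the cell IRI layout are all assumptions made up for illustration.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only: not part of Any23 or Apache POI.
class SheetToTriplesSketch {

    // Assumed predicate IRI, invented for this example.
    static final String VALUE_PREDICATE = "http://example.org/vocab#cellValue";

    // Minimal subject/predicate/object holder with a literal object.
    static final class Triple {
        final String subject;
        final String predicate;
        final String object;

        Triple(String subject, String predicate, String object) {
            this.subject = subject;
            this.predicate = predicate;
            this.object = object;
        }

        @Override
        public String toString() {
            return "<" + subject + "> <" + predicate + "> \"" + object + "\" .";
        }
    }

    // Emits one triple per non-empty cell; the cell IRI encodes row and column.
    static List<Triple> toTriples(String documentIri, String[][] rows) {
        List<Triple> triples = new ArrayList<>();
        for (int r = 0; r < rows.length; r++) {
            for (int c = 0; c < rows[r].length; c++) {
                String value = rows[r][c];
                if (value == null || value.isEmpty()) {
                    continue; // skip blank cells
                }
                String cellIri = documentIri + "/row/" + r + "/cell/" + c;
                triples.add(new Triple(cellIri, VALUE_PREDICATE, value));
            }
        }
        return triples;
    }

    public static void main(String[] args) {
        String[][] rows = { {"name", "qty"}, {"widget", ""} };
        for (Triple t : toTriples("http://example.org/book.xlsx", rows)) {
            System.out.println(t);
        }
    }
}
```

The sketch takes an in-memory grid of strings so that it stays self-contained; a real extractor would read cells from the workbook through POI instead.
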
Constructor and Description |
---|
ExcelExtractor() |
Modifier and Type | Method and Description |
---|---|
ExtractorDescription | getDescription(): Returns the ExtractorDescription of this extractor. |
boolean | isStopAtFirstError() |
void | run(ExtractionParameters extractionParameters, ExtractionContext context, InputStream in, ExtractionResult er): Executes the extractor. |
void | setStopAtFirstError(boolean f): If true, the extractor will stop at the first parsing error; if false, the extractor will attempt to ignore all parsing errors. |
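
The stop-at-first-error flag described for setStopAtFirstError can be pictured with a small stand-in. The sketch below is not Any23 code: the class StopAtFirstErrorSketch, its parseCells method, and the choice of IllegalStateException are hypothetical, and only the getter/setter names mirror the methods listed in the table.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only: not part of Any23 or Apache POI.
class StopAtFirstErrorSketch {

    // Mirrors the tolerance flag: true aborts on the first parsing error.
    private boolean stopAtFirstError;

    public boolean isStopAtFirstError() {
        return stopAtFirstError;
    }

    public void setStopAtFirstError(boolean f) {
        this.stopAtFirstError = f;
    }

    // Parses each raw cell as a number; bad cells abort or are skipped per the flag.
    public List<Double> parseCells(List<String> rawCells) {
        List<Double> parsed = new ArrayList<>();
        for (String raw : rawCells) {
            try {
                parsed.add(Double.parseDouble(raw.trim()));
            } catch (NumberFormatException e) {
                if (stopAtFirstError) {
                    // Strict mode: surface the first parsing error to the caller.
                    throw new IllegalStateException("Parsing error on cell: " + raw, e);
                }
                // Tolerant mode: ignore the bad cell and keep going.
            }
        }
        return parsed;
    }

    public static void main(String[] args) {
        StopAtFirstErrorSketch sketch = new StopAtFirstErrorSketch();
        List<String> cells = Arrays.asList("1.5", "oops", "2");

        sketch.setStopAtFirstError(false);
        System.out.println("tolerant: " + sketch.parseCells(cells));

        sketch.setStopAtFirstError(true);
        try {
            sketch.parseCells(cells);
        } catch (IllegalStateException e) {
            System.out.println("strict: " + e.getMessage());
        }
    }
}
```

In tolerant mode the caller receives whatever could be parsed; in strict mode the caller learns about the first bad cell and receives no partial result.
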
public boolean isStopAtFirstError()
public void setStopAtFirstError(boolean f)
Description copied from interface: Extractor.ContentExtractor
If true, the extractor will stop at the first parsing error; if false, the extractor will attempt to ignore all parsing errors.
Specified by: setStopAtFirstError in interface Extractor.ContentExtractor
Parameters:
f - tolerance flag.

public ExtractorDescription getDescription()
Description copied from interface: Extractor
Returns the ExtractorDescription of this extractor.
Specified by: getDescription in interface Extractor<InputStream>

public void run(ExtractionParameters extractionParameters, ExtractionContext context, InputStream in, ExtractionResult er) throws IOException, ExtractionException
Description copied from interface: Extractor
Executes the extractor.
Specified by: run in interface Extractor<InputStream>
Parameters:
extractionParameters - the parameters to be applied during the extraction.
context - the document context.
in - the extractor input data.
er - the collector for the extracted data.
Throws:
IOException - on error while reading from the input stream.
ExtractionException - on other errors, such as parse errors.

Copyright © 2010–2019 The Apache Software Foundation. All rights reserved.