|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.apache.any23.extractor.rdf.BaseRDFExtractor
public abstract class BaseRDFExtractor
Base class for a generic RDF
Extractor.ContentExtractor.
| Nested Class Summary |
|---|
| Nested classes/interfaces inherited from interface org.apache.any23.extractor.Extractor |
|---|
Extractor.BlindExtractor, Extractor.ContentExtractor, Extractor.TagSoupDOMExtractor |
| Constructor Summary | |
|---|---|
BaseRDFExtractor()
|
|
BaseRDFExtractor(boolean verifyDataType,
boolean stopAtFirstError)
Constructor, allows to specify the validation and error handling policies. |
|
| Method Summary | |
|---|---|
abstract ExtractorDescription |
getDescription()
Returns a ExtractorDescription of this extractor. |
protected abstract org.openrdf.rio.helpers.RDFParserBase |
getParser(ExtractionContext extractionContext,
ExtractionResult extractionResult)
|
boolean |
isStopAtFirstError()
|
boolean |
isVerifyDataType()
|
void |
run(ExtractionParameters extractionParameters,
ExtractionContext extractionContext,
InputStream in,
ExtractionResult extractionResult)
Executes the extractor. |
void |
setStopAtFirstError(boolean b)
If true, the extractor will stop at first parsing error,
iffalse the extractor will attempt to ignore all parsing errors. |
void |
setVerifyDataType(boolean verifyDataType)
|
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public BaseRDFExtractor(boolean verifyDataType,
boolean stopAtFirstError)
verifyDataType - if true the data types will be verified,
if false will be ignored.stopAtFirstError - if true the parser will stop at first parsing error,
if false will ignore non blocking errors.public BaseRDFExtractor()
| Method Detail |
|---|
public abstract ExtractorDescription getDescription()
ExtractorExtractorDescription of this extractor.
getDescription in interface Extractor<InputStream>
protected abstract org.openrdf.rio.helpers.RDFParserBase getParser(ExtractionContext extractionContext,
ExtractionResult extractionResult)
public boolean isVerifyDataType()
public void setVerifyDataType(boolean verifyDataType)
public boolean isStopAtFirstError()
public void setStopAtFirstError(boolean b)
Extractor.ContentExtractortrue, the extractor will stop at first parsing error,
iffalse the extractor will attempt to ignore all parsing errors.
setStopAtFirstError in interface Extractor.ContentExtractorb - tolerance flag.
public void run(ExtractionParameters extractionParameters,
ExtractionContext extractionContext,
InputStream in,
ExtractionResult extractionResult)
throws IOException,
ExtractionException
Extractor
run in interface Extractor<InputStream>extractionParameters - the parameters to be applied during the extraction.extractionContext - The document context.in - The extractor input data.extractionResult - the collector for the extracted data.
IOException - On error while reading from the input stream.
ExtractionException - On other error, such as parse errors.
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||