Package org.apache.any23.extractor
Interface TagSoupExtractionResult
-
- All Superinterfaces:
ExtractionResult,IssueReport
- All Known Implementing Classes:
ExtractionResultImpl
public interface TagSoupExtractionResult extends ExtractionResult
This interface models a specificExtractionResultable to collect property roots generated by HTML Microformat extractions.- Author:
- Michele Mostarda (mostarda@fbk.eu)
-
-
Nested Class Summary
Nested Classes Modifier and Type Interface Description static classTagSoupExtractionResult.PropertyPathDefines a property path object.static classTagSoupExtractionResult.ResourceRootDefines a property root object.-
Nested classes/interfaces inherited from interface org.apache.any23.extractor.IssueReport
IssueReport.Issue, IssueReport.IssueLevel
-
-
Method Summary
All Methods Instance Methods Abstract Methods Modifier and Type Method Description voidaddPropertyPath(Class<? extends MicroformatExtractor> extractor, org.eclipse.rdf4j.model.Resource propertySubject, org.eclipse.rdf4j.model.Resource property, org.eclipse.rdf4j.model.BNode object, String[] path)Adds a property path to the list of the extracted data.voidaddResourceRoot(String[] path, org.eclipse.rdf4j.model.Resource root, Class<? extends MicroformatExtractor> extractor)Adds a root property to the extraction result, specifying also the path corresponding to the root of data which generated the property and the extractor responsible for such addition.List<TagSoupExtractionResult.PropertyPath>getPropertyPaths()Returns all the collected property paths.List<TagSoupExtractionResult.ResourceRoot>getResourceRoots()Returns all the collected property roots.-
Methods inherited from interface org.apache.any23.extractor.ExtractionResult
close, openSubResult, writeNamespace, writeTriple, writeTriple
-
Methods inherited from interface org.apache.any23.extractor.IssueReport
getIssues, notifyIssue, printReport
-
-
-
-
Method Detail
-
addResourceRoot
void addResourceRoot(String[] path, org.eclipse.rdf4j.model.Resource root, Class<? extends MicroformatExtractor> extractor)
Adds a root property to the extraction result, specifying also the path corresponding to the root of data which generated the property and the extractor responsible for such addition.- Parameters:
path- the path from the document root to the local root of the data generating the property.root- the property root node.extractor- the extractor responsible of such extraction.
-
getResourceRoots
List<TagSoupExtractionResult.ResourceRoot> getResourceRoots()
Returns all the collected property roots.- Returns:
- an unmodifiable list of
TagSoupExtractionResult.ResourceRoots.
-
addPropertyPath
void addPropertyPath(Class<? extends MicroformatExtractor> extractor, org.eclipse.rdf4j.model.Resource propertySubject, org.eclipse.rdf4j.model.Resource property, org.eclipse.rdf4j.model.BNode object, String[] path)
Adds a property path to the list of the extracted data.- Parameters:
extractor- the identifier of the extractor responsible for retrieving such property.propertySubject- the subject of the property.property- the property IRI.object- the property object if any,nullotherwise.path- the path of the HTML node from which the property literal has been extracted.
-
getPropertyPaths
List<TagSoupExtractionResult.PropertyPath> getPropertyPaths()
Returns all the collected property paths.- Returns:
- a valid list of property paths.
-
-