QuadTemplate to the rule.
Configuration.
ValueFactory.
Extractor.ContentExtractor.
TripleHandler decorator useful to
perform benchmarking.
BNode.
null CSVParser guessing
from the provided CSV file.
DocumentSource.
TripleHandler multi decorator that wraps zero or more
other triple handlers and dispatches all events to each of them.
initialExtractorGroup
and also the extractors detected in classpath specified by pluginLocations.
initialExtractorGroup
and also the extractors detected in classpath specified by the default configuration.
TripleHandler that merely counts the number
of triples it has received.
NL in the given in string.
content appears within container
without string overlapping.
src to the dest.
ExtractorFactory serving concrete implementation
instances of Extractor.
Vocabulary.classes map.
DocumentSource for the given documentURI.
Vocabulary.properties map.
DefaultConfiguration.
Configuration.
InputStream on an HTTP URI.
ModifiableConfiguration.
RuleContext.
ValidationReport.
ValidationReportBuilder.
Validator.
TagSoupParser when a document
is retrieved and validated.
Validator.
ValidationReportBuilder with no data.
in string as JSON string
to make it embeddable within a string field.
in document source, sending the generated events
to the specified outputHandler.
in string
associated to the documentURI URI, declaring
contentType and encoding.
in string
associated to the documentURI URI, sending the generated
events to the specified outputHandler.
file
sending the generated events to the specified outputHandler.
documentURI
sending the generated events to the specified outputHandler.
documentURI
sending the generated events to the specified outputHandler.
in document source, sending the generated events
to the specified outputHandler.
in document source, sending the generated events
to the specified outputHandler.
in document source, sending the generated events
to the specified outputHandler.
Extractor.
TripleHandler that can block and unblock
calls to the handler, either for the entire document, or for
individual ExtractionContexts.
SingleDocumentExtraction.METADATA_NESTING_FLAG.
ExtractionResult; it receives
extraction output from one Extractor working on one document,
and passes the output on to a TripleHandler.
Extractor able to handle
URI as input format.
Extractor able to handle
InputStream as input format.
Extractor able to handle
Document as input format.
Extractor description.
Extractor.
ExtractorFactory providing
simple accessing methods.
Any23
extractor plugin that can be detected and registered from the library classpath.
Extractor.
prefixID:prefixURL strings
normalizing blanks where present.
rel tag nodes.
DocumentSource.
ExtractorGroup with a set of Extractor able to
process the provided mime type.
ExtractorRegistry and the ExtractorPlugins.
BNode.
packageName satisfying the given class
filter.
iface.
ExtractionResult associated
to the extraction session.
ExtractorDescription of this extractor.
ExtractorDescription of this extractor.
DocumentSource.
ExtractorGroup containing the
ExtractorFactory matching the names provided as input.
ExtractorFactory given its name
Configuration.
HTTPClient implementation.
FormatWriter.
ItemScope instance described within the specified node.
document.
document,
works in full report mode.
FormatWriter.
NQuadsParser.
NTriplesParser.
ExtractorPlugin classes declared within the classpath.
Configuration.
defaultValue will be returned.
Integer value of the specified propertyName or raises an exception
if propertyName is not defined.
propertyName or raises an exception
if propertyName is not defined.
RDFParser instance.
RDFXMLParser.
null otherwise.
Tool classes declared within the classpath.
TriXParser.
TurtleParser.
true.
Term
given the varMapping, the #isVar and #internalValue
parameters.
FormatWriter identified by id.
FormatWriter ready to write on the given os
OutputStream.
mimeType.
XSLTStylesheet able to distill RDFa from
HTML pages.
MIME type of the content of input file.
MIME type of the content of input file.
node.
FormatWriter with given id identifier has been registered.
DocumentSource content activates or not at least one extractor.
Extractor.TagSoupDOMExtractor implementation
retrieves the LINKs declared within the HTML/HEAD page header.
href property.
InputStream on an
HTTP URI.
HTTPClient configuration.
DocumentSource.
ical.rdf
TripleHandler that suppresses output of the RDFa
parser if the document only contains "accidental" RDFa,
like stylesheet links and other non-RDFa uses of HTML's
TripleHandler that suppresses output of the
TitleExtractor unless some other triples could
be parsed from the document.
og:image.
MicroformatExtractor to
represent which of the other Microformats it could nest.
MicroformatExtractor.
href is absolute or not.
true then the produced RDF is annotated with
the extractors used to generate the specific statements.
true then the produced RDF is annotated with
the extractors used to generate the specific statements.
candidatePrefix is prefix of string container.
candidateSuffix is suffix of string container.
data list
using the specified delimiter.
Literal.
Literal.
Literal.
Literal.
Literal.
Literal.
Literal.
Literal.
Literal.
Literal.
DocumentSources.
DocumentSource by
reading them into an in-memory buffer.
TagSoupDOMExtractor.
MicrodataParser.
MIME type of
some given raw data.
Configuration.
c n times.
- muse -
Variable in class org.apache.any23.vocab.XFN
-
- mv(File, File) -
Static method in class org.apache.any23.util.FileUtils
- Moves a
target file to a new dest location.
- myersBriggs -
Variable in class org.apache.any23.vocab.FOAF
-
MIMETypeDetector based
on file extensions.
DefaultConfiguration.
Extractor.ContentExtractor
handling N-Quads format.
RDFParser interface.
RDFWriter.
Extractor.ContentExtractor
handling NTriples format.
InputStream from a given URI.
MIMEType.
is input stream with the
specified parser p using baseURI.
is input stream with the
specified parser p using '' as base URI.
in string with the
specified parser p using '' as base URI.
resource file
guessing the content format from the extension.
ExtractionResult, using
the passed variableAssignment to expand variables.
TikaMIMETypeDetector could
call in order to clean the input before performing the MIME type
detection.
InputStream.
InputStream.
Statement.
Extractor implementation for
RDFa 1.1 specification.
RDFa11Parser.
ExtractionResult.
TripleHandler that writes
triples to a Sesame RDFWriter,
e.g. for serialization using one of Sesame's writers.
Extractor.ContentExtractor
able to perform the extraction on RDF/XML
documents.
Reader, returning the same value.
Reader into a byte array
attribute, returning the
defaultValue string if not present.
attribute, returning the
empty string if not present.
attributePrefix.
Reader as an InputStream.
ReaderInputStream
for the specified Reader.
ReaderInputStream
for the specified Reader,
with the specified encoding.
ExtractorFactory.
FormatWriter to the registry.
Fixes.
Fix.
TripleHandler that collects
various information about the extraction process, such as
the extractors used and the total number of triples.
Rule.
Rule
to a Fix.
Extractor
registered to this class using the specified extraction parameters.
Extractor
registered to this class using the default extraction parameters.
ValidationReportSerializer.
Node produces the XML serialization
omitting the XML declaration.
namespace,
resources and properties.
vocabulary to triples over the given writer.
vocabulary to NQuads over the given output stream.
vocabulary to NQuads and return them as string.
LocalCopyFactory instance.
HTTPClient implementation
used to retrieve contents.
null the MemCopyFactory will be used.
MIMETypeDetector.
null mimetype detection will
be skipped and all extractors will be activated.
propertyValue for
property whose name is propertyName.
true, the extractor will stop at first parsing error,
if false the extractor will attempt to ignore all parsing errors.
true, the extractor will stop at first parsing error,
if false the extractor will attempt to ignore all parsing errors.
ExtractorFactory.
DefaultConfiguration.
SingleDocumentExtraction run.
InputStream meant to
detect and replace any occurrence of inline span:src property.
DocumentSource.
ExtractionResult
able to collect property roots generated by HTML Microformat extractions.
InputStream
into an XPathExtractionRule able
to expand a QuadTemplate set based
on a given template.
XPathExtractionRule.
EncodingDetector based on
Apache Tika.
MIMETypeDetector based on
Apache Tika.
Rover.
Value.
toRDF(s), toRDF(p), toRDF(o)
date to the XSD datetime format.
Statement.
TripleHandler class.
ContentExtractor
to perform extraction on TriX documents.
Extractor.ContentExtractor able to perform the
extraction on Turtle documents.
video.movie.
URI.
URI.
Validator class.
Document instances.
Validator.
TemplateXPathExtractionRule.
Purifier that removes any blank
characters at the head of a file that might prevent its MIME type detection.
FormatWriter metadata.
FormatWriters.
ValidationReportSerializer
for XML.
XPathExtractor.
Extractor.TagSoupDOMExtractor able to
apply XPathExtractionRules and generate quads.
Writer.
XSLTStylesheet class.