All Classes Interface Summary Class Summary Enum Summary Exception Summary Annotation Types Summary
| Class |
Description |
| AboutNotURIRule |
This rule is able to detect whether an about value is a valid URL or otherwise is a valid relative URL.
|
| AcceptHeaderBuilder |
Concatenates a collection of MIME specs in "type/subtype;q=x.x" notation into an HTTP Accept header value, and
removes duplicates and types covered by wildcards.
|
| AdrExtractor |
Extractor for the adr microformat.
|
| AdrExtractorFactory |
|
| Any23 |
A facade with convenience methods for typical Any23 extraction operations.
|
| Any23PluginManager |
The Any23PluginManager is responsible for inspecting dynamically the classpath and retrieving useful classes.
|
| Any23ValueFactoryWrapper |
Any23 specialization of the ValueFactory.
|
| Author |
Describes the author of the plugin.
|
| BaseRDFExtractor |
|
| BenchmarkTripleHandler |
|
| ByteArrayDocumentSource |
|
| CompositeTripleHandler |
A TripleHandler multi decorator, that wraps zero or more other triple handlers and dispatches all events to
each of them.
|
| Configuration |
Defines the main Any23 configuration.
|
| CountingTripleHandler |
A simple TripleHandler that merely counts the number of triples it has received.
|
| CSV |
This vocabulary models the structure of a CSV file according the
RFC 4180.
|
| CSVExtractor |
This extractor produces RDF from a CSV file .
|
| CSVExtractorFactory |
|
| CSVReaderBuilder |
This class is responsible to build a reader first guessing the configuration from the file it self and then, if not
successful, from the DefaultConfiguration.
|
| DCTerms |
The DCTERMS vocabulary.
|
| DecoratingWriterFactory |
Base interface used for constructors of decorating TripleHandler implementations.
|
| DefaultConfiguration |
|
| DefaultDOMDocument |
This class wraps the DOM document.
|
| DefaultHTTPClient |
|
| DefaultHTTPClientConfiguration |
|
| DefaultModifiableConfiguration |
|
| DefaultRuleContext |
|
| DefaultValidationReport |
|
| DefaultValidationReportBuilder |
|
| DefaultValidator |
|
| DiscoveryUtils |
This class provides utility methods for discovering classes in packages.
|
| DOAC |
|
| DOAP |
|
| DocumentReport |
Represents the validationReportBuilder generated by a the TagSoupParser when a document is retrieved and
validated.
|
| DocumentSource |
A source of input streams.
|
| DOMDocument |
This interface models a document to be processed by the Validator.
|
| DomUtils |
This class provides utility methods for DOM manipulation.
|
| ElementsProcessor |
|
| ElementsProcessor.ModelHolder |
A model holder describes the two required parameters which makes a model useful in further processing: a root
node and model itself.
|
| EmbeddedJSONLDExtractor |
This extractor represents the HTML script tags used to embed blocks of data in documents.
|
| EmbeddedJSONLDExtractorFactory |
|
| EmptyValidationReport |
|
| EncodingDetector |
Defines a detector for charset encoding.
|
| EntityBasedMicroformatExtractor |
Base class for microformat extractors based on entities.
|
| ExampleInputOutput |
A reporter for example input and output of an extractor.
|
| ExtractionContext |
This class provides the context for the processing of a single Extractor.
|
| ExtractionContextBlocker |
A wrapper around a TripleHandler that can block and unblock calls to the handler, either for the entire
document, or for individual ExtractionContexts.
|
| ExtractionException |
Defines a specific exception raised during the metadata extraction phase.
|
| ExtractionParameters |
This class models the parameters to be used to perform an extraction.
|
| ExtractionParameters.ValidationMode |
Declares the supported validation actions.
|
| ExtractionReport |
This class contains some statistics and general information about an extraction.
|
| ExtractionResult |
Interface defining the methods that a representation of an extraction result must have.
|
| ExtractionResultImpl |
|
| Extractor<Input> |
It defines the signature of a generic Extractor.
|
| Extractor.BlindExtractor |
This interface specializes an Extractor able to handle URI as input format.
|
| Extractor.ContentExtractor |
|
| Extractor.TagSoupDOMExtractor |
|
| ExtractorDescription |
It defines a minimal signature for an Extractor description.
|
| ExtractorDocumentation |
This class provides some command-line documentation about available extractors and their usage.
|
| ExtractorFactory<T extends Extractor<?>> |
|
| ExtractorGroup |
It simple models a group of ExtractorFactory providing simple accessing methods.
|
| ExtractorPlugin<T extends Extractor<?>> |
Deprecated.
|
| ExtractorRegistry |
An interface to the enable a registry for extractors to be implemented by different implementors of this API.
|
| ExtractorRegistryImpl |
Singleton class acting as a register for all the various Extractor.
|
| FileDocumentSource |
|
| FileUtils |
Utility class for handling files.
|
| Fix |
This interface models a fix for an issue.
|
| FOAF |
|
| FormatWriter |
Base interface used for the definition of RDF format writers.
|
| FunctionalSyntaxExtractor |
|
| FunctionalSyntaxExtractorFactory |
|
| GeoExtractor |
Extractor for the Geo microformat.
|
| GeoExtractorFactory |
|
| GeoNames |
|
| HAdrExtractor |
Extractor for the h-adr microformat.
|
| HAdrExtractorFactory |
|
| HCalendarExtractor |
|
| HCalendarExtractorFactory |
|
| HCard |
Vocabulary to map the h-card microformat.
|
| HCardExtractor |
Extractor for the hCard microformat.
|
| HCardExtractor |
Extractor for the h-Card microformat.
|
| HCardExtractorFactory |
|
| HCardExtractorFactory |
|
| HCardName |
An HCard name, consisting of various parts.
|
| HeadLinkExtractor |
|
| HeadLinkExtractorFactory |
|
| HEntry |
Vocabulary to map the h-entry microformat.
|
| HEntryExtractor |
Extractor for the h-entry microformat.
|
| HEntryExtractorFactory |
Extractor for the h-entry microformat.
|
| HEvent |
Vocabulary to map the h-event microformat.
|
| HEventExtractor |
Extractor for the h-event microformat.
|
| HEventExtractorFactory |
|
| HGeoExtractor |
Extractor for the h-geo microformat.
|
| HGeoExtractorFactory |
|
| HItem |
Vocabulary to map the h-item microformat.
|
| HItemExtractor |
Extractor for the h-item microformat.
|
| HItemExtractorFactory |
|
| HListing |
|
| HListingExtractor |
|
| HListingExtractorFactory |
|
| HProduct |
|
| HProductExtractor |
|
| HProductExtractorFactory |
|
| HRecipe |
Vocabulary to map the hRecipe microformat.
|
| HRecipeExtractor |
Extractor for the hRecipe microformat.
|
| HRecipeExtractor |
Extractor for the hRecipe microformat.
|
| HRecipeExtractorFactory |
|
| HRecipeExtractorFactory |
|
| HResume |
|
| HResumeExtractor |
Extractor for the hResume microformat.
|
| HResumeExtractor |
Extractor for the hResume microformat.
|
| HResumeExtractorFactory |
|
| HResumeExtractorFactory |
|
| HReviewAggregateExtractor |
|
| HReviewAggregateExtractorFactory |
|
| HReviewExtractor |
Extractor for the hReview microformat.
|
| HReviewExtractorFactory |
|
| HTMLDocument |
A wrapper around the DOM representation of an HTML document.
|
| HTMLDocument.TextField |
This class represents a text extracted from the HTML DOM related to the node from which such test has been
retrieved.
|
| HTMLMetaExtractor |
|
| HTMLMetaExtractorFactory |
|
| HTTPClient |
|
| HTTPClientConfiguration |
|
| HTTPDocumentSource |
|
| ICAL |
Vocabulary definitions from ical.rdf
|
| ICalExtractor |
|
| ICalExtractorFactory |
|
| ICBMExtractor |
Extractor for "ICBM coordinates" provided as META headers in the head of an HTML page.
|
| ICBMExtractorFactory |
|
| IgnoreAccidentalRDFa |
A TripleHandler that suppresses output of the RDFa parser if the document only contains "accidental" RDFa,
like stylesheet links and other non-RDFa uses of HTML's
|
| IgnoreTitlesOfEmptyDocuments |
|
| Includes |
This annotation could be used to decorate a MicroformatExtractor to represent which of the other
Microformats could it nest.
|
| Includes |
This annotation could be used to decorate a MicroformatExtractor to represent which of the other
Microformats could it nest.
|
| IssueReport |
This interface models an issue reporter.
|
| IssueReport.Issue |
This class defines a generic issue traced by this extraction result.
|
| IssueReport.IssueLevel |
Possible issue levels.
|
| Item |
Base class for Microdata items.
|
| ItemProp |
Describes a Microdata item property.
|
| ItemPropValue |
Describes a possible value for a Microdata item property.
|
| ItemPropValue.Type |
Supported types.
|
| ItemScope |
This class describes a Microdata itemscope.
|
| JCalExtractor |
|
| JCalExtractorFactory |
|
| JSONLDExtractor |
|
| JSONLDExtractorFactory |
|
| JSONLDWriter |
|
| JSONLDWriterFactory |
|
| JSONWriter |
Deprecated.
|
| JSONWriterFactory |
Deprecated.
|
| JsoupUtils |
|
| LicenseExtractor |
|
| LicenseExtractorFactory |
|
| LKIFCoreAction |
An implementation of the lkif-core vocabulary which is a
library of ontologies relevant for the legal domain.
|
| LKIFCoreExpression |
An implementation of the lkif-core vocabulary which is a
library of ontologies relevant for the legal domain.
|
| LKIFCoreLegalAction |
An implementation of the lkif-core vocabulary which is a
library of ontologies relevant for the legal domain.
|
| LKIFCoreLegalRole |
An implementation of the lkif-core vocabulary which is a
library of ontologies relevant for the legal domain.
|
| LKIFCoreMereology |
An implementation of the lkif-core vocabulary which is a
library of ontologies relevant for the legal domain.
|
| LKIFCoreNorm |
An implementation of the lkif-core vocabulary which is a
library of ontologies relevant for the legal domain.
|
| LKIFCoreProcess |
An implementation of the lkif-core vocabulary which is a
library of ontologies relevant for the legal domain.
|
| LKIFCoreRelativePlaces |
An implementation of the lkif-core vocabulary which is a
library of ontologies relevant for the legal domain.
|
| LKIFCoreRole |
An implementation of the lkif-core vocabulary which is a
library of ontologies relevant for the legal domain.
|
| LKIFCoreRules |
An implementation of the lkif-core vocabulary which is a
library of ontologies relevant for the legal domain.
|
| LKIFCoreTime |
An implementation of the lkif-core vocabulary which is a
library of ontologies relevant for the legal domain.
|
| LKIFCoreTimeModification |
An implementation of the lkif-core vocabulary which is a
library of ontologies relevant for the legal domain.
|
| LKIFCoreTop |
An implementation of the lkif-core vocabulary which is a
library of ontologies relevant for the legal domain.
|
| LocalCopyFactory |
|
| LoggingTripleHandler |
Triple handler decorator useful for logging purposes.
|
| LogUtils |
Deprecated.
|
| ManchesterSyntaxExtractor |
|
| ManchesterSyntaxExtractorFactory |
|
| MathUtils |
Mathematical utility functions.
|
| MemCopyFactory |
Creates local copies of DocumentSource by reading them into an in-memory buffer.
|
| MetaNameMisuseFix |
Fixes the misuse of the meta name attribute.
|
| MetaNameMisuseRule |
Checks whether the meta attribute name is used to contain a property.
|
| MicrodataExtractor |
|
| MicrodataExtractorFactory |
|
| MicrodataParser |
Command line Microdata parser, accepting both files and URLs and returing a JSON representation of the
extracted metadata as described at Microdata JSON Specification.
|
| MicrodataParser |
This class provides utility methods for handling Microdata nodes contained within a DOM document.
|
| MicrodataParser.MicrodataParserDocumentSourceConverter |
|
| MicrodataParserException |
Defines an exception occurring while parsing Microdata.
|
| MicrodataParserReport |
|
| MicroformatExtractor |
|
| Microformats2Prefixes |
|
| MimeDetector |
Commandline tool to detect MIME Types from file, HTTP and direct input sources.
|
| MimeDetector.MimeDetectorDocumentSourceConverter |
|
| MIMEType |
A MIME type with an optional q (quality) value.
|
| MIMETypeDetector |
This detector is able to estimate the MIME type of some given raw data.
|
| MissingItemscopeAttributeValueFix |
|
| MissingItemscopeAttributeValueRule |
This fixes missing attribute values for the 'itemscope' attribute Typically when such a snippet of XHTML is fed
through the RDFa11Extractor, and subsequently to Sesame's SesameRDFaParser,
it will result in the following behavior.
|
| MissingOpenGraphNamespaceRule |
This rule detects the issue of missing Open Graph namespace.
|
| ModifiableConfiguration |
|
| NaiveMIMETypeDetector |
|
| NQuadsExtractor |
|
| NQuadsExtractorFactory |
|
| NQuadsWriter |
|
| NQuadsWriterFactory |
|
| NTriplesExtractor |
|
| NTriplesExtractorFactory |
|
| NTriplesWriter |
|
| NTriplesWriterFactory |
|
| OGP |
|
| OGPArticle |
|
| OGPBook |
|
| OGPMusic |
|
| OGPProfile |
|
| OGPVideo |
|
| OpenGraphNamespaceFix |
This fixes the missing Open Graph protocol.
|
| PluginVerifier |
Commandline utility to verify the Any23 plugins and extract basic information.
|
| PopularPrefixes |
This class act as a container for various well-known and adopted RDF Vocabulary prefixes.
|
| Prefixes |
A mapping from prefixes to namespace IRIs.
|
| Programme |
|
| Purifier |
This interface defines a minimum set of methods that a TikaMIMETypeDetector could call in order to clean the input
before performing the MIME type detection.
|
| QuadTemplate |
This class models a NQuads template, that is a quadruple in which any component can be a variable.
|
| RDFa11Extractor |
|
| RDFa11ExtractorFactory |
|
| RDFa11Parser |
Deprecated.
|
| RDFa11ParserException |
Deprecated.
|
| RDFaExtractor |
|
| RDFaExtractorFactory |
|
| RDFHandlerAdapter |
An RDFHandler that relays statements and prefix definitions to an ExtractionResult.
|
| RDFParserFactory |
This factory provides a common logic for creating and configuring correctly any RDF parser used within the
library.
|
| RDFSchemaUtils |
This class provides a set of methods for generating RDF Schema.
|
| RDFUtils |
Basic class providing a set of utility methods when dealing with RDF.
|
| RDFWriterTripleHandler |
A TripleHandler that writes triples to a Sesame RDFWriter, eg for serialization
using one of Sesame's writers.
|
| RDFXMLExtractor |
|
| RDFXMLExtractorFactory |
|
| RDFXMLWriter |
|
| RDFXMLWriterFactory |
|
| ReaderInputStream |
Adapts a Reader as an InputStream.
|
| ReportingTripleHandler |
A TripleHandler that collects various information about the extraction process, such as the extractors used
and the total number of triples.
|
| RepositoryWriter |
An RDF4J repository triple writer.
|
| Review |
Vocabulary definitions from vocabularies/review.rdf
|
| ReviewAggregate |
|
| Rover |
A default rover implementation.
|
| Rover.ArgumentToIRIConverter |
|
| Rover.PrintStreamConverter |
|
| Rule |
Models a rule for an issue detection.
|
| RuleContext<T> |
This context is used to pass data from a Rule to a Fix.
|
| SchemaOrg |
|
| SerializationException |
|
| Setting<V> |
Represents a setting key paired with a compatible value.
|
| Settings |
This class represents an immutable Set of Setting objects, with the additional property that
no two settings having the same identifier can be simultaneously present in a
Settings object.
|
| SimpleExtractorFactory<T extends Extractor<?>> |
|
| SINDICE |
This class models an internal Sindice Vocabulary to describe resource domains and Microformat nesting
relationships.
|
| SingleDocumentExtraction |
This class acts as a facade where all extractors (for a given MIMEType) can be called on a single document.
|
| SingleDocumentExtractionReport |
|
| SpanCloserInputStream |
Extension of InputStream meant to detect and replace any occurrence of inline span:
|
| SpeciesExtractor |
|
| SpeciesExtractorFactory |
|
| StreamUtils |
Contains general utility functions for handling streams.
|
| StringDocumentSource |
|
| StringUtils |
This class provides a set of string utility methods.
|
| TagSoupExtractionResult |
This interface models a specific ExtractionResult able to collect property roots generated by HTML
Microformat extractions.
|
| TagSoupExtractionResult.PropertyPath |
Defines a property path object.
|
| TagSoupExtractionResult.ResourceRoot |
Defines a property root object.
|
| TagSoupParser |
|
| TagSoupParser.ElementLocation |
Describes a DOM Element location.
|
| TemplateGraph |
Represents an Quad graph IRI template.
|
| TemplateObject |
Represents a Quad object template.
|
| TemplateObject.Type |
Supported object types.
|
| TemplatePredicate |
Represents a Quad predicate template.
|
| TemplateSubject |
Represents a Quad subject template.
|
| TemplateSubject.Type |
Supported subject types.
|
| TemplateXPathExtractionRule |
|
| TemplateXPathExtractionRuleImpl |
|
| Term<T extends org.eclipse.rdf4j.model.Value> |
Represents a generic template term.
|
| TikaEncodingDetector |
|
| TikaMIMETypeDetector |
|
| TitleExtractor |
Extracts the value of the <title> element of an HTML or XHTML page.
|
| TitleExtractorFactory |
|
| Tool |
Defines a runnable CLI tool.
|
| ToolRunner |
This class is the main class responsible to provide a uniform command-line access points to all the others tools like
Rover.
|
| TripleFormat |
|
| TripleFormat.Capabilities |
|
| TripleHandler |
Defines a document based triple handler.
|
| TripleHandlerException |
Defines a generic exception raised when accessing to TripleHandler class.
|
| TripleWriter |
Base interface for triple writers that don't need an extraction context to write triples
|
| TripleWriterFactory |
Base interface for constructors of TripleHandler implementations that write to an OutputStream using
a particular FileFormat.
|
| TripleWriterHandler |
|
| TriXExtractor |
|
| TriXExtractorFactory |
|
| TriXWriter |
|
| TriXWriterFactory |
|
| TurtleExtractor |
|
| TurtleExtractorFactory |
|
| TurtleHTMLExtractor |
Extractor for Turtle/N3 format embedded within HTML script tags.
|
| TurtleHTMLExtractorFactory |
|
| TurtleWriter |
|
| TurtleWriterFactory |
|
| URIListWriter |
This writer simply produces a list of unique IRI present in the subject or in the object of every single
extracted RDF Statement.
|
| URIListWriterFactory |
|
| URLUtils |
Contains general utility functions for handling URLs.
|
| ValidationReport |
This class contains the report of a validation performed by the Validator class.
|
| ValidationReport.Error |
An error occurred while performing the validation process.
|
| ValidationReport.FixError |
An error occurred while executing a fix.
|
| ValidationReport.Issue |
An issue found during the validation process.
|
| ValidationReport.IssueLevel |
Defines the different issue levels.
|
| ValidationReport.RuleActivation |
This class describes the activation of a rule.
|
| ValidationReport.RuleError |
An error occurred while executing a rule.
|
| ValidationReportBuilder |
The report interface is used to generate diagnostics about validation.
|
| ValidationReportSerializer |
Defines a serializer for validation reports.
|
| Validator |
The validator class allows to perform validation - correction of related to HTML Document
instances.
|
| ValidatorException |
Models any exception raised by the Validator.
|
| Variable |
|
| VCard |
Vocabulary definitions from vcard.owl
|
| VocabPrinter |
Prints out the vocabulary RDFSchema as NQuads.
|
| VocabPrinter.RDFFormatConverter |
|
| Vocabulary |
Base class for the definition of a vocabulary.
|
| WhiteSpacesPurifier |
Implementation of Purifier that removes all the eventual blank characters at the header of a file that might
prevents its MIME Type detection.
|
| WO |
|
| WriterFactory |
|
| WriterFactoryRegistry |
|
| WriterSettings |
This class encapsulates commonly supported settings for TripleWriter implementations.
|
| XCalExtractor |
|
| XCalExtractorFactory |
|
| XFN |
|
| XFNExtractor |
Extractor for the XFN microformat.
|
| XFNExtractorFactory |
|
| XHTML |
|
| XMLValidationReportSerializer |
|
| XMLValidationReportSerializer.NodeName |
Allows to specify a custom node name.
|
| XPathExtractionRule |
|
| XPathExtractor |
|
| XPathExtractorFactory |
|
| XSLTStylesheet |
An XSLT stylesheet loaded from an InputStream, can be applied to DOM trees and writes the result to a Writer.
|
| XSLTStylesheetException |
|
| YAML |
This vocabulary describes model of the yaml file.
|
| YAMLExtractor |
|
| YAMLExtractorFactory |
|
| YAMLProcessorException |
|