Introduction to Apache Any23


Anything To Triples (any23) is a library, a web service and a command line tool that extracts structured data in RDF format from a variety of Web documents. Currently it supports the following input formats:

A detailed description of available extractors is here.

Apache Any23 is written in Java and licensed under the Apache License v2.0. Apache Any23 can be used in various ways: * As a library in Java applications that consume structured data from the Web. * As a command-line tool for extracting and converting between the supported formats. * As online service API available at

You can download the latest release from our Apache Mirrors.

Previous versions are available from the Apache Archives site.

Documentation Content

Introduction: this page.

Install: how to install Apache Any23 library and service.

Getting Started: start using Apache Any23 command-line tools.

Supported Formats: complete list of Semantic Web formats supported by Apache Any23.

Configuration: learn how to change default library and service configuration.

REST Service: discover how to use the Apache Any23 REST Service.

Plugins: read how to install and configure the Apache Any23 plugins.

Developers: understand the Apache Any23 code internals, how to write plugins, fixing rules and customize the code.


Questions, comments? Get in touch on the mailing lists! Bugs, feature requests, patches? Please submit to the issue tracker. You can access the source through Git, see the Installation Guide for details.