Class TagSoupParserTest


  • public class TagSoupParserTest
    extends Object
    Reference Test class for TagSoupParser parser.
    Author:
    Davide Palmisano (dpalmisano@gmail.com), Michele Mostarda (michele.mostarda@gmail.com)
    • Constructor Detail

      • TagSoupParserTest

        public TagSoupParserTest()
    • Method Detail

      • tearDown

        public void tearDown()
                      throws org.eclipse.rdf4j.repository.RepositoryException
        Throws:
        org.eclipse.rdf4j.repository.RepositoryException
      • testImplicitEncodingBehavior

        public void testImplicitEncodingBehavior()
                                          throws IOException,
                                                 ExtractionException,
                                                 org.eclipse.rdf4j.repository.RepositoryException
        This tests the Neko HTML parser without forcing it on using a specific encoding charset. We expect that this test may fail if something changes in the Neko library, as an auto-detection of the encoding.
        Throws:
        IOException - if there is an error interpreting the input data
        ExtractionException - if there is an exception during extraction
        org.eclipse.rdf4j.repository.RepositoryException - if an error is encountered whilst loading content from a storage connection
      • testEmptySpanElements

        public void testEmptySpanElements()
                                   throws IOException
        Test related to the issue 78 and disabled until the underlying NekoHTML bug has been fixed.
        Throws:
        IOException - if there is an error interpreting the input data