com.norconex.importer.parser.impl
Class HTMLParser

java.lang.Object
  extended by com.norconex.importer.parser.impl.AbstractTikaParser
      extended by com.norconex.importer.parser.impl.HTMLParser
All Implemented Interfaces:
IDocumentParser, Serializable

public class HTMLParser
extends AbstractTikaParser

HTML parser based on Apache Tika HtmlParser.

Author:
Pascal Essiembre
See Also:
Serialized Form

Nested Class Summary
 
Nested classes/interfaces inherited from class com.norconex.importer.parser.impl.AbstractTikaParser
AbstractTikaParser.RecursiveMetadataParser
 
Field Summary
 
Fields inherited from interface com.norconex.importer.parser.IDocumentParser
RDF_BASE_URI, RDF_SUBJECT_CONTENT
 
Constructor Summary
HTMLParser(String format)
           
 
Method Summary
 
Methods inherited from class com.norconex.importer.parser.impl.AbstractTikaParser
addTikaMetadata, parseDocument
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

HTMLParser

public HTMLParser(String format)


Copyright © 2009-2013 Norconex Inc.. All Rights Reserved.