com.norconex.importer.transformer
Interface IDocumentTransformer
- All Superinterfaces:
- IImportHandler, Serializable
- All Known Implementing Classes:
- AbstractCharStreamTransformer, AbstractRestrictiveTransformer, AbstractStringTransformer, StripAfterTransformer, StripBeforeTransformer, StripBetweenTransformer
public interface IDocumentTransformer
extends IImportHandler
Transformers let you manipulate and convert extracted text and
save the modified text back.
- Author:
- Pascal Essiembre
transformDocument
void transformDocument(String reference,
InputStream input,
OutputStream output,
Properties metadata,
boolean parsed)
throws IOException
- Transforms document content and metadata.
- Parameters:
reference - document reference (e.g. URL)
input - document to transform
output - transformed document
metadata - document metadata
parsed - whether the document has already been parsed (a
parsed document should normally be text-based)
- Throws:
IOException
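As an illustration of the contract described above, the following is a minimal sketch of a transformer that upper-cases text content. It does not depend on the Norconex library: `DocumentTransformer` is a hypothetical local stand-in that only mirrors the `transformDocument` signature, `java.util.Properties` stands in for the metadata type (this page does not say which `Properties` class the real interface expects), and the UTF-8 encoding, the `sketch.transformed` metadata key, and the `transform` helper are assumptions made for the example.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.Locale;
import java.util.Properties;

public class UpperCaseTransformerSketch {

    // Hypothetical stand-in that mirrors the documented method signature.
    // It is NOT the real IDocumentTransformer interface.
    interface DocumentTransformer {
        void transformDocument(String reference, InputStream input,
                OutputStream output, Properties metadata, boolean parsed)
                throws IOException;
    }

    // Upper-cases parsed (text-based) content; copies unparsed content as-is.
    static class UpperCaseTransformer implements DocumentTransformer {
        @Override
        public void transformDocument(String reference, InputStream input,
                OutputStream output, Properties metadata, boolean parsed)
                throws IOException {
            byte[] content = readAll(input);
            if (parsed) {
                // Assumption: parsed content is UTF-8 encoded text.
                String text = new String(content, StandardCharsets.UTF_8);
                content = text.toUpperCase(Locale.ROOT)
                        .getBytes(StandardCharsets.UTF_8);
                // Hypothetical metadata key, used only by this sketch.
                metadata.setProperty("sketch.transformed", "true");
            }
            output.write(content);
            output.flush();
        }
    }

    // Reads the whole stream into memory; fine for a small example.
    static byte[] readAll(InputStream in) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        byte[] chunk = new byte[4096];
        int n;
        while ((n = in.read(chunk)) != -1) {
            buffer.write(chunk, 0, n);
        }
        return buffer.toByteArray();
    }

    // Convenience helper: runs the transformer over an in-memory string.
    public static String transform(String text, boolean parsed) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try {
            new UpperCaseTransformer().transformDocument(
                    "http://example.com/doc",
                    new ByteArrayInputStream(
                            text.getBytes(StandardCharsets.UTF_8)),
                    out, new Properties(), parsed);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return new String(out.toByteArray(), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        System.out.println(transform("Hello world", true));
    }
}
```

The sketch checks the `parsed` flag before treating the bytes as text, since an unparsed document may be binary and should not be decoded as characters. A real implementation would implement `IDocumentTransformer` itself, or extend one of the abstract classes listed under All Known Implementing Classes.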
Copyright © 2009-2013 Norconex Inc. All Rights Reserved.