public class TikaEntityProcessor extends EntityProcessorBase
An implementation of EntityProcessor that reads data from rich documents
using Apache Tika.
To index latitude/longitude data that might
be extracted from a file's metadata, identify
the geo field for this information with this attribute:
spatialMetadataField
Fields inherited from class EntityProcessorBase: ABORT, cacheSupport, context, CONTINUE, entityName, isFirstInit, ON_ERROR, onError, query, rowIterator, SKIP, TRANSFORM_ROW, TRANSFORMER

| Constructor and Description |
|---|
| TikaEntityProcessor() |

| Modifier and Type | Method and Description |
|---|---|
| protected void | firstInit(Context context) First-time init call; do one-time operations here. An overriding method must call this method, otherwise nextRow() throws an NPE when it accesses zipper. |
| void | init(Context context) Called when the processor starts processing an entity. |
| Map<String,Object> | nextRow() For a simple implementation, this is the only method that the subclass should implement. |
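
The contract described for firstInit(Context context) can be illustrated with a minimal, self-contained sketch. It does not use the Solr classes: BaseProcessor, GoodProcessor, BrokenProcessor, drain, and the Map-based context are hypothetical stand-ins invented for this example, and the "url" and "format" keys are assumed attribute names. The sketch only shows the general pattern: the base class sets up state in firstInit, an overriding method has to call super.firstInit, and a caller reads rows until nextRow() returns null.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class FirstInitSketch {

    // Hypothetical stand-in for a base processor (not the Solr class).
    // firstInit() creates state that nextRow() depends on.
    static class BaseProcessor {
        private List<String> pending;      // stays null unless firstInit() runs
        private boolean isFirstInit = true;

        protected void firstInit(Map<String, String> context) {
            pending = new ArrayList<>();
            pending.add(context.get("url"));
        }

        public void init(Map<String, String> context) {
            if (isFirstInit) {             // one-time setup on the first call only
                firstInit(context);
                isFirstInit = false;
            }
        }

        public Map<String, Object> nextRow() {
            if (pending.isEmpty()) {       // NullPointerException if firstInit was skipped
                return null;               // null signals that no rows remain
            }
            Map<String, Object> row = new HashMap<>();
            row.put("source", pending.remove(0));
            return row;
        }
    }

    // Overrides firstInit correctly: base setup first, then its own setup.
    static class GoodProcessor extends BaseProcessor {
        String format;

        @Override
        protected void firstInit(Map<String, String> context) {
            super.firstInit(context);
            format = context.getOrDefault("format", "text");
        }
    }

    // Overrides firstInit without calling super, so base state is never created.
    static class BrokenProcessor extends BaseProcessor {
        @Override
        protected void firstInit(Map<String, String> context) {
            // super.firstInit(context) is missing on purpose
        }
    }

    // Initializes the processor and collects rows until nextRow() returns null.
    static List<Map<String, Object>> drain(BaseProcessor p, Map<String, String> context) {
        p.init(context);
        List<Map<String, Object>> rows = new ArrayList<>();
        Map<String, Object> row;
        while ((row = p.nextRow()) != null) {
            rows.add(row);
        }
        return rows;
    }

    public static void main(String[] args) {
        Map<String, String> context = new HashMap<>();
        context.put("url", "example.pdf");

        System.out.println("rows: " + drain(new GoodProcessor(), context).size());
        try {
            drain(new BrokenProcessor(), context);
        } catch (NullPointerException e) {
            System.out.println("NPE because super.firstInit was not called");
        }
    }
}
```

In a real subclass the same rule applies to the actual signature: the first statement of an overriding firstInit(Context context) would be super.firstInit(context).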

Methods inherited from class EntityProcessorBase: destroy, getNext, initCache, nextDeletedRowKey, nextModifiedParentRowKey, nextModifiedRowKey

Methods inherited from class EntityProcessor: close, postTransform

public void init(Context context)
Description copied from class: EntityProcessor
Overrides: init in class EntityProcessorBase
Parameters: context - The current context

protected void firstInit(Context context)
Description copied from class: EntityProcessorBase
Overrides: firstInit in class EntityProcessorBase

public Map<String,Object> nextRow()
Description copied from class: EntityProcessorBase
Overrides: nextRow in class EntityProcessorBase

Copyright © 2000–2015 The Apache Software Foundation. All rights reserved.