...
- Feed extractor
- Web extractor
- File extractor
- Database extractor
- Logstash extractor
- Federated Query Source
- TODO post processing Post Processing (enterprise only) (ROADMAP)
Global processing
- Harvest control settings
- Javascript globals
- Lookup tablesTables
- Aliasing (not currently supported) (ROADMAP)
Secondary extractors
...
- Custom Inputs: Which data to bring in, what filtering and transforms to apply
- Custom Control: Scheduling and other controls
- Custom Processing: Generic scripting or customizable templates, single or chained (roadmap)
- Custom Outputs: Where the results of the data end up (documents, records, custom tables)
Custom Inputs
...
- Distributed File Input
- Process Existing Docs, simple query
- Process Existing Docs, complex (Infinit.e) query
- Process Existing Records
- Process Custom Results
- Process Entity and Association Features
Custom Control
Custom Processing
- TODORun Built-in/Custom Hadoop Module
- Run Distributed Scripting Engine
- Run Custom Hadoop Mapper/Combiner/Reducer
Custom Outputs
...
- Table Output
- Record Output (ROADMAP)
- Document Output (ROADMAP)