Overview of the Infinit.e Data Harvesting Process
Source Documents
Source documents define the following basic pieces of information for Infinit.e's Harvester:
- The type of data harvest (RSS feed, database record, XML document, etc)
- Location of source data to harvest
- Authentication information required to access source information
- Basic source information including title, description, and tags
- The extractor to use on unstructured sources to extract entity and events with
- How to extract entities and events from structured sources
Creating Source Documents
Using JavaScript to Perform Data Transformations
Sample Source Documents
The following pages have sample Source documents demonstrating difference features of the Infinit.e harvester:
- Unstructured Sources
- Structured Sources