Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Overview of the Infinit.e Data Harvesting Process

The Infinit.e platform features a robust set of data harvesters that give Infinit.e a powerful data extraction and transformation (enrichment) capabilitie. Infinit.e's harvesters are designed to consume data from a variety of sources and media types including:

...

  1. Extract data from source
  2. Create feed document from source data
  3. Enrich source data by extracting entities, events, geographic/location data, etc.
  4. Update entity counts/aggregates
  5. Store finished within Infinit.e's MongoDB data store

Creating a Source

The following WIKI pages describe detail the steps involved with creating sources:

  1. Specifying a data source
    How to specify the mechanics required to extract data from a source system:
    1. Using the Feed Harvester
    2. Using the Database Harvester
    3. Using the File Harvester
  2. Structured Analysis - Overview
    An introduction to the Structured Analysis Harvester and how to specify the methods for enriching structured data sources with geographic information, entities, and events.
    1. Specifying Document Level Geographical Location
    2. Specifying Entities
    3. Specifying Events
    4. Transforming Data with JavaScript
  3. Unstructured Analysis - Overview
    An introduction to the Unstructured Analysis Harvester.

Source Reference Documents

Source Document Specification

The following links provide detailed information regarding the objects that make up a Source document and the individual fields within each object to support the introductory materials above.

Sample Source Documents

The following sample source documents are provided as an aid to learning how to create your own sources:

Source APIs: