Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Overview

You can use the Advanced source builder to edit raw JSON directly in the browser.  There is also a list of JSON templates that you can choose from to get started. 

About Source Configuration

Access to the Advanced Source Builder is for power users and Developers.  Editing JSON provides direct access to the source pipeline elements, a series of JSON objects that are configured to ingest, transform, enrich, and output data. Data ingested into the ISA platform is stored in a platform-specific JSON format to enable flexible transformation and enrichment.

For more information, see section Source Pipeline Documentation.

Using the Advanced Source Builder

To use the Advanced Source Builder

  1. From the Treat Analytics Dashboard, click on Data Sources (top right).
  2. Click on Add New Source.
  3. Under "What kind of source would you like to create?" specify "Advanced Source Builder."
  4. Click on Next.
  5. Specify the the fields as defined in the table below.

    FieldDescription
    Source TitleProvide a title for the source.
    TagsComma-separated list of tags
    DescriptionDescription of the source.
    Data GroupData Group where the source data will be saved.
    Media TypePick from the list of Media Types.
    Data OriginInternal or External. For more information, see Data Sources.
  6. Click on Next.

Configuring and Testing

As part of configuring and testing, you will edit the raw JSON in the browser, either starting from an empty template or specifying one of the templates from the dropdown.

To configure and test

  1. Choose an optional source template or specify empty.
  2. Edit the JSON as required
  3. Specify the Test Options.  See below.
  4. Click on Test Source.

About Testing

If the source has been configured properly testing with return test results, and you will be able to move forward with Publishing the new source.  Otherwise, a failure message is generated which can be used for troubleshooting (currently it only says FAIL).  You can always Save your source and come back to fix any testing errors later.

Testing Options:

FieldDescription
Test:

Select the amount of documents under test. 1, 10 or 20.

For testing purposes, it is often convenient to limit the mount of documents under test to an easily manageable number.

Full textBy default, the full text of a document is not returned (it can be quite long). For testing text extractors (eg "boilerpipe" vs "none" vs "AlchemyAPI"), or for testing "unstructured analysis" transformations, the Full text maybe useful or essential though. In these instances, it is recommended to enable the checkbox.
Update test mode

Setting which enables/disables test mode to check the database to see if documents already exist.

This enables additional logic to be setup elsewhere in the source pipeline for testing eg. compare old and new sources, ignore sources older than N days etc.

 

Saving or Publishing

Saving 

To save the source after testing

  • Click on Save.

The source is saved and you are re-directed to the Source Manager.

Publishing

To publish the source after testing

  • Click on Publish.

The source is published and you are re-directed to the list Source Manager.

Panel

In this section:

Table of Contents
maxLevel2
indent16px