RSS Source

A common Source Editor task is to process an RSS feed and configure the IKANOW extractors to produce the desired results.

This use case has the following steps:

  • Create a new RSS Source using the Source Manager
  • Edit the source using the Source Builder
  • Publish the Source

Another way to quickly create an RSS source is to use the Google Chrome plugin.

 

Creating a New RSS Source

To create an RSS feed source using the Source Manager:

1. Navigate to the Infinit.e Manager

2. Select Source Editor from the menu

 

3. Click the ‘New Source’ button at the top right of the screen 

 

4. Under the Source Template drop-down on the left side, select ‘RSS Source Template’ and click ‘Select’


5. Under the New Source template on the right side:

  • Enter a Title (e.g. NY Times Front Page) and Description (e.g. NY Times RSS)

  • Select a Community for your source (e.g. General News)

  • Select ‘Save Source’

6. When saved, the source template reloads with:

  • Unique ‘Share ID’ for your source

  • Title, Description, and Community ID entered in Step 5 are added to the corresponding fields in the JSON

 

7.  Enter tags directly into the field provided by the GUI.

  • Note: Do not put a comma after the last tag inside the brackets; see the example below

  • e.g. "tags": ["NY Times", "Front Page", "RSS"],

TODO: determine how to get "New York Times" added as a tag

8.  In the "url": field, replace the template URL with the .rss or .xml URL of the desired RSS feed. Keep the URL inside the quotation marks and keep the comma after the closing quotation mark.
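
Steps 7 and 8 involve hand-edited JSON, where a curly quote or a misplaced comma is easy to introduce. As a rough sketch, the fragment can be generated with Python's standard `json` module and compared against what is in the editor. The key names `title` and `description`, the placeholder feed URL, and the helper name `build_source_fragment` are illustrative assumptions; only `tags` and `url` are named on this page.

```python
import json


def build_source_fragment(title, description, tags, url):
    """Return the hand-edited part of a source as a JSON string.

    The "title" and "description" key names are assumed for illustration;
    "tags" and "url" are the fields named in steps 7 and 8.
    """
    fragment = {
        "title": title,
        "description": description,
        "tags": list(tags),
        "url": url,
    }
    # json.dumps always emits straight double quotes and never leaves a
    # trailing comma after the last list item or the last field.
    return json.dumps(fragment, indent=4)


if __name__ == "__main__":
    print(build_source_fragment(
        "NY Times Front Page",
        "NY Times RSS",
        ["NY Times", "Front Page", "RSS"],
        "http://example.com/feed.xml",  # placeholder, not a real feed
    ))
```

The printed text is only a reference for comparison; the template loaded in the Source Editor contains additional fields that should be left in place.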

 



Test the Source

Once you have correctly configured the URL, you can test the source.

To test the source:

  1. Click on Test Source.

If the source has been correctly configured, Infinit.e will return newly generated documents in a pop-up window.

Configuring the Extractors

You will likely want to tailor your extraction settings to properly generate entities and associations.

 

Two fields must be added to the end of the JSON to select the text and metadata extractors. Which values to use depends on the content and the desired results:

  • For extraction of entities and associations, when sentiment scoring is NOT desired, copy and paste the following:

"useExtractor": "OpenCalais",

"useTextExtractor": "boilerpipe"

  • For keyword extraction AND sentiment scoring, or for foreign-language sources, copy and paste the following:

"useExtractor": "AlchemyAPI-metadata",

"useTextExtractor": "AlchemyAPI"

10. Next to Test Parameters, change the ‘Number of Documents:’ from ‘10’ to ‘2’, and select ‘Test Source’

11. Depending on your settings, you may need to accept pop-ups in order to receive the success or error message
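
Because the extractor fields in this section are pasted in by hand, a misspelled key, a curly quote, or a trailing comma can slip through before the test is run. A minimal sketch, assuming the complete source JSON has been copied out as text, is to parse it with Python's `json` module and look for the two keys `useExtractor` and `useTextExtractor`. The function name `check_extractor_fields` and the sample input are hypothetical, and this check is not part of Infinit.e.

```python
import json

# The two field names used in the extractor snippets in this section.
EXPECTED_KEYS = ("useExtractor", "useTextExtractor")


def check_extractor_fields(source_text):
    """Return a list of problems found in the pasted source JSON text."""
    try:
        source = json.loads(source_text)
    except json.JSONDecodeError as err:
        # Curly quotes and trailing commas both end up here.
        return ["not valid JSON: %s" % err.msg]
    if not isinstance(source, dict):
        return ["top-level JSON value is not an object"]
    return ["missing field: %s" % key
            for key in EXPECTED_KEYS if key not in source]


if __name__ == "__main__":
    sample = '{"useExtractor": "OpenCalais", "useTextExtractor": "boilerpipe"}'
    # An empty list means no problems were found.
    print(check_extractor_fields(sample))
```

The check only confirms that the text parses and that both keys are present; it does not verify that the extractor names are ones your Infinit.e installation supports.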

 

Publishing the Source


 

12. If the test is successful, select ‘Publish Source’ and select ‘OK’ when the success message appears

13.  If the test results in an error, double-check all fields and open the RSS URL in a separate browser window to confirm it is accurate

13a.  If a second test also results in an error, copy the error message and send it to your IKANOW POC

13b.  If the second test is successful, select ‘Publish Source’ to begin harvesting
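
Step 13 suggests opening the RSS URL in a separate window to confirm it is accurate. The same check can be roughly scripted. The sketch below uses only Python's standard library, assumes an RSS 2.0 style feed whose entries are `<item>` elements, and uses hypothetical helper names (`count_feed_items`, `fetch_feed`). Atom feeds use `<entry>` elements instead and would need a different lookup.

```python
import urllib.request
import xml.etree.ElementTree as ET


def count_feed_items(xml_text):
    """Parse feed text and return the number of <item> elements found.

    Raises xml.etree.ElementTree.ParseError if the text is not
    well-formed XML.
    """
    root = ET.fromstring(xml_text)
    return len(root.findall(".//item"))


def fetch_feed(url, timeout=10):
    """Download the feed body as bytes."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


if __name__ == "__main__":
    sample = (
        "<rss version='2.0'><channel><title>Example</title>"
        "<item><title>First</title></item>"
        "<item><title>Second</title></item>"
        "</channel></rss>"
    )
    print(count_feed_items(sample))  # prints 2
```

For a live feed, `count_feed_items(fetch_feed(url))` with the URL from the "url": field does the same check. A parse error or a count of zero suggests the URL does not point at an RSS 2.0 feed.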

 




 

 

 