Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

It uses feedType to specify that the data source is RSS.  It connects to the specified urls and can either include or exclude specfied specified urls via regex using regexInclude or regexExclude.

extraUrls

complex Complex type that enables urls to be manually specified, overriding settings that would be provided by the RSS feed.

Example:

In the example, the manually harvested urls to be harvested once are specified.  The  title, description, publisheddate, and fullText parameters can be used to manually specify strings for the specified urls, rather than using the harvested RSS data.

Code Block
"extraUrls": [ // This array allows for manually specified URLs to be harvested once        {
            "url": string, // The URL 
            "title": string, // The title that the document will be given (ie the equivalent to the RSS title)
            "description": string, // (Optional) The description that the document will be given (ie the equivalent to the RSS description)
            "publishedData": string, // (Optional) The date that will be assigned to the document (default: now) - this can be overridden from "structuredAnalysis"
            "fullText": string //
 (Optional) If present and "useTextExtractor" is "none", then uses the 
specified string instead of the URL contents (mainly for debugging)

 

userAgent

This parameter, if present, will override the system default user agent string.  This can be used for emulating a specific browser such as Firefox.

Example:

 

proxyOverride

 

httpFields

 

QUESTION:

-what happened to searchConfig?

 

 

Legacy documentation:

...