Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

JSON format

Note that there is a separate overview of using the Unstructured Analysis Harvester. This page is reference information.

...

Info

By default XPath and Regex fields are deduplicated, ie if the string "apple" is found twice for the same field name, then it is not added to the field array. In cases where multiple fields are being correlated based on index, this is obviously not desirable, and it can be turned off by setting the flag 'DU' ("Don't DeDuplicate""u" for "Unique", capitalization in flags typically denotes negation).

Javascript "meta" fields

...

  • 't': provides the full text of the document to the script, in the field "text"
  • 'd': provides the entire document object to the script, int he field "_doc"
  • 'm': provides the entire metadata object ("_doc.metadata") to the script, in the field "_metadata".
XPath "meta" fields

The following flags are supported for XPath (and regex, except for "O"):

...