...

The harvester first evaluates the strings specified for headerRegEx and footerRegEx. Each is a regular expression that matches the document's header or footer, if it has one. This makes it possible to discard the formatting that normally occurs in headers and footers, which would otherwise lead to incorrectly extracted entities or events. Note that the header and footer patterns are matched in DOTALL mode (other regexes are not, unless the flag "d" is specified).

For example:

Sample Document

Address: 123 Sample Dr. Woodbridge, VA 22191

To: John Doe

********************

The quick brown fox jumps over the lazy dog. The quick brown fox jumps over the lazy dog. The quick brown fox jumps over the lazy dog. The quick brown fox jumps over the lazy dog.

Sincerely,

Sample Person

_________________________

Sample Person

Address

Company
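
As a rough illustration of why DOTALL mode matters for a document like this one, the sketch below strips a header and footer from a shortened copy of the sample using Python's re module. It is not the harvester's own code: the two patterns, the function name strip_header_footer, and the assumption that the header ends at the row of asterisks and the footer starts at the row of underscores are all hypothetical choices made for this example.

```python
import re

# Shortened copy of the sample document above.
SAMPLE = """Address: 123 Sample Dr. Woodbridge, VA 22191

To: John Doe

********************

The quick brown fox jumps over the lazy dog.

Sincerely,

Sample Person

_________________________

Sample Person

Address

Company
"""

# Hypothetical patterns, chosen for this example only.
# Header: everything from the start up to and including the row of asterisks.
HEADER_REGEX = r"^.*?\*{5,}\s*"
# Footer: everything from the row of underscores to the end of the document.
FOOTER_REGEX = r"\s*_{5,}.*$"


def strip_header_footer(text, header_regex, footer_regex, flags=re.DOTALL):
    """Remove the first header match and the first footer match from text.

    With re.DOTALL, "." also matches newlines, so a single pattern can
    span the several lines that make up a header or footer.
    """
    body = re.sub(header_regex, "", text, count=1, flags=flags)
    body = re.sub(footer_regex, "", body, count=1, flags=flags)
    return body


if __name__ == "__main__":
    print(strip_header_footer(SAMPLE, HEADER_REGEX, FOOTER_REGEX))
```

With the default DOTALL flag, only the letter body and sign-off remain. Calling the same function with flags=0 leaves the text unchanged, because "." then stops at the first line break and neither pattern can reach across the blank lines.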

...