...
The harvester first evaluates the strings specified for headerRegEx and footerRegEx. Each is a regular expression that identifies whether a document has a header or a footer, respectively. This makes it possible to exclude the formatting that typically appears in headers and footers, which would otherwise lead to incorrectly extracted entities or events. Note that the header and footer patterns are matched in DOTALL mode (other regexes are not, unless the flag "d" is specified).
For example:
Address: 123 Sample Dr. Woodbridge, VA 22191
To: John Doe
********************
The quick brown fox jumps over the lazy dog. The quick brown fox jumps over the lazy dog. The quick brown fox jumps over the lazy dog. The quick brown fox jumps over the lazy dog.
Sincerely,
Sample Person
_________________________
Sample Person Address Company
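
As a rough illustration of why DOTALL mode matters for a document like the one above, the following Python sketch strips a header and a footer with two patterns. The patterns, the function name `strip_header_footer`, the line breaks in the sample text, and the assumption that matched header and footer text is simply removed are all illustrative and not taken from the harvester itself.

```python
import re

# Shortened copy of the sample document; the line breaks are assumed.
SAMPLE = (
    "Address: 123 Sample Dr. Woodbridge, VA 22191\n"
    "To: John Doe\n"
    "********************\n"
    "The quick brown fox jumps over the lazy dog.\n"
    "The quick brown fox jumps over the lazy dog.\n"
    "Sincerely,\n"
    "Sample Person\n"
    "_________________________\n"
    "Sample Person Address Company\n"
)

# Hypothetical patterns, chosen only for this sample.
HEADER_REGEX = r"^.*?\*{5,}\s*"  # everything up to and including the asterisk row
FOOTER_REGEX = r"Sincerely,.*$"  # from the sign-off to the end of the document


def strip_header_footer(text, header_regex, footer_regex):
    """Remove the first header match and the first footer match (DOTALL mode)."""
    # re.DOTALL lets "." match newlines, so one pattern can span several lines.
    text = re.sub(header_regex, "", text, count=1, flags=re.DOTALL)
    text = re.sub(footer_regex, "", text, count=1, flags=re.DOTALL)
    return text.strip()


body = strip_header_footer(SAMPLE, HEADER_REGEX, FOOTER_REGEX)
print(body)
```

Without the DOTALL flag, the header pattern in this sketch would not match at all, because `.` would stop at the first line break before reaching the row of asterisks.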
...