Page | Reviewed by Alex | Andrew Comments | Alex Comments | Status |
---|
File extractor | | | | |
Feed extractor | | | | |
Web extractor | | | | |
Database extractor | | Updated now and ready for review | Missing authentication piece (moved into the db object from legacy format). Note encryption requirements described here: from v0.3 we now accept decrypted passwords. Would be good to show the example input table (from the source gallery) in your example. | |
Follow Web links | | | | |
Automated text extraction | | Alex will you convert to JSON for the TODO? | | |
Manual text transformation | | | | |
Document metadata | | Alex will you convert to JSON for the TODO? | | |
Content metadata | | requires new examples in source gallery for regex and xpath (see IN PROGRESS) | | |
Manual entities | | | | |
Manual association of entities | | | | |
Document storage settings | | Additional examples for onUpdateScript, and metadataFieldStorage would be beneficial. | | |
Feature extraction | | | | |
Aliasing | | Not supported | | |
Harvest control settings | | Require more examples for the following: - duplicateExistingUrls
- maxDocs_global
- throttleDocs_perCycle
- maxDocs_perCycle
- distributionFactor
| | |
Search index settings | | More examples in the source for searchIndex parameters would be beneficial. | | |
Lookup tables | | I tried to edit an existing example from the old source, as I could not find any new examples. Please verify the changes I made to the example source and scripts. | | |
Javascript globals | | | | |