Page | Reviewed by Alex | Andrew Comments | Alex Comments | Status
File extractor
  •   
  
Status: Review (yellow)
Feed extractor
  •   
  
Status: Review (yellow)
Web extractor
  •   
  
Status: Review (yellow)
Database extractor
  •   
I need the initial content to get started.
Status: On Hold (red)
Follow Web links
  •   
  
Status: Review (yellow)
Automated text extraction
  •   
Alex, will you convert this to JSON for the TODO?
Status: Review (yellow)
Manual text transformation
  •   
  
Status: Review (yellow)
Document metadata
  •   
Alex, will you convert this to JSON for the TODO?
Status: Review (yellow)
Content metadata
  •   
Requires new examples in the source gallery for regex and XPath (see IN PROGRESS).
Status: Review (yellow)
Manual entities
  •   
  
Status: Review (yellow)
Manual association of entities
  •   
  
Status: Review (yellow)
Document storage settings
  •   
Additional examples for onUpdateScript and metadataFieldStorage would be beneficial.
Status: Review (yellow)
Feature extraction
  •   
  
Status: Review (yellow)
Aliasing
  •   
Not supported 
Status: On Hold (red)
Harvest control settings
  •   

More examples are required for the following:

  • duplicateExistingUrls
  • maxDocs_global
  • throttleDocs_perCycle
  • maxDocs_perCycle
  • distributionFactor
  
Status: Review (yellow)