Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

PageReviewed by AlexAndrew CommentsAlex CommentsStatus
File extractor
  •   
  
Status
colourYellow
titleReview
Feed extractor
  •   
  
Status
colourYellow
titleReview
Web extractor
  •   
  
Status
colourYellow
titleReview
Database extractor
  •   
I need the initial content to get started
Updated now and ready for review 
Status
colourRedYellow
titleOn HoldReview
Follow Web links
  •   
  
Status
colourYellow
titleReview
Automated text extraction
  •   
Alex will you convert to JSON for the TODO? 
Status
colourYellow
titleReview
Manual text transformation
  •   
  
Status
colourYellow
titleReview
Document metadata
  •   
Alex will you convert to JSON for the TODO? 
Status
colourYellow
titleReview
Content metadata
  •   
requires new examples in source gallery for regex and xpath (see IN PROGRESS) 
Status
colourYellow
titleReview
Manual entities
  •   
  
Status
colourYellow
titleReview
Manual association of entities
  •   
  
Status
colourYellow
titleReview
Document storage settings
  •   
Additional examples for onUpdateScript, and metadataFieldStorage would be beneficial. 
Status
colourYellow
titlereview
Feature extraction
  •   
  
Status
colourYellow
titlereview
Aliasing
  •   
Not supported 
Status
colourRed
titleon hold
Harvest control settings
  •   

Require more examples for the following:

  • duplicateExistingUrls
  • maxDocs_global
  • throttleDocs_perCycle
  • maxDocs_perCycle
  • distributionFactor
 
Status
colourYellow
titlereview
Search index settings More examples in the source for searchIndex parameters would be beneficial. 
Status
colourYellow
titlereview
Lookup tables I tried to edit an existing example from the old source, as I could not find any new examples.  Please verify the changes I made to the example source and scripts. 
Status
colourYellow
titlereview
Javascript globals   
Status
colourYellow
titlereview

...