Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Info

Updates a hadoop map reduce job. Returns the output collection id in the data field of the response if successfully queued to run. If you change the timeToRun a job can be rescheduled.

Note

Return the word "null" for any of the update fields to not change a field, it will remain whatever the field was previously.

 A detailed guide to creating plugins.

A simple web-based utility is available for uploading JARs and managing jobs.

...

frequencyToRun (required)
How often the job should be ran, either: NONE, HOURLY, DAILY, WEEKLY, MONTHLY. This will cause the job to get resubmitted after running, use NONE if you only want the job to run once.

...

query (required)
The mongo or JSON query to use to get the jobs data. {} is a blank query or you can submit null.   Also you can submit any post-processing you want by passing in an array of the form [{mongodb query},{postproc}] where postproc is a json object following the form:

Code Block
{
	"limit":int,
	"sortField":"field.field.field",
	"sortDirection":-1|1,
	"limitAllData":true|false
}

See See Custom - Schedule Job for more information about this query's format, or see the Hadoop Plugin Guide for more information.

inputcollection (required)
The mongo collection you want to use as input. You can submit DOC_METADATA to get the documents table the documents metadata, DOC_CONTENT to get the document contents, or grab a previous map reduce jobs results table in your communities by submitting its id or title (must be a member of that community).

outputKey (required)
The classpath for the map reduce output format key usually org.apache.hadoop.io.Text

...