Infinit.e's entity extractors take harvested documents, ie URLs (RSS/HTML), text (files), or metadata objects (XML, databases), and add meaning in the form of entities and associations between entities.

Examples of the built-in entity extractors (JSON field "useExtractor") include:

In addition to the above entity extractors, Infinit.e has three options for "text extractors", which convert URLs into text (eg Advert removal, HTML tag cleansing etc):

In either of the above cases ("useExtractor" or "useTextExtractor"), the field can be set to "none".

Per-source configuration for extractor engines

The "extractorOptions" field of the source JSON object allows for custom configuration of text and entity extractors.

The format of the object is in the form:

{
	//"app.<EXTRACTOR_NAME>.<PARAMETER_NAME>": "<PARAMETER_VALUE>"
	//...
	//eg:
	"app.alchemyapi-metadata.sentiment": "true"
}

Currently the following configuration options are available: