Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

When a query is issued, often a large number of documents will satisfy the query criteria (particularly for a common query like "obama"), these are called matching documents. These documents are not directly available to the widget (apart from "top documents", see below), except

However there are normally too many documents for a person to analyze directly (see below, under aggregations). As a result, a ranked subset of these matching documents (according to a configurable scoring method) is retrieved and only these are returned directly to the GUI. The default number of these "top documents" returned is 100.

The IWidgetContext API allows for further filtering of these "top documents" within the GUI framework, eg based on "top documents" containing a specific set of entities (eg click on one of the bars in the graph in the "Significance" widget). This sub-set is called the "filtered documents".

Aggregations

Obviously all the matching documents can contribute to the "knowledge" that a query can provide, and the documents themselves are not the only objects returned from a query. Instead, relevant information to the analysis is summed/averaged/etc ("aggregated") across all matching documents, and these are referred to as the "aggregations". Examples include:

...