Advanced Options
Advanced Options
The Advanced Options appear when you click on Options>Advanced Options.
Description
Use the Advanced Options to set document analysis thresholds, scoring weights, geo decay, and other settings which impact the overall functionality of the platform and the widgets.
| Field | Description | Notes |
|---|---|---|
| Enable/Disable Scoring | When disabled, documents & entities are returned without significance or relevance scoring. Default setting is enabled. | |
| Number of documents to analyze | The maximum number of documents to be returned from the Lucene (/ElasticSearch) query and analyzed according to the significance algorithm. Default is 1,000 Documents. | |
| Scoring Weights | Ratio of Significance vs. Relevance scoring weights Default setting is 2:1 (Significance:Relevance) | |
| Weight by | Applies half-life principle to results ranking, based on desired time interval. Time decay date: Hour: Half Life:
| Example: 1m (one month) time decay - results within 1 month of the entered date are promoted to top of results; results between 1 to 2 months from decay time are halved; results 2 to 3 months from decay time are quartered, etc. |
| Geo Decay | Applies half-life principle to results ranking based on distance (kilometers) from lat/long centerpoint. Default: None | Example: a geo decay of 10k (10 kilometers) means that results 10-20k from the centerpoint have their scores reduced by 50%, results 20-30k from centerpoint have scores reduced 25%, etc. |
| Aggregate Significance Weighting | If true, aggregated entities are weighted by relevance to ensure entities occurring in more relevant documents are weighted up Automatic: The platform attempts to decide for itself based on the query. Always Weight: Entities in more relevant documents are always weighted up. Never Weight: Entities in more relevant documents are never weighted up. Default: Automatic | |
| Manual Weightings (in order of precedence) | Source Weights: Documents from weighted source are promoted to top of query results Example: "www.google.com.search.56.6.":1.25","www.google.com.search.532.23":0.75 Type Weighs: Documents matching weighted source type are promoted to top of query results Example: "News":1.1,"Social":4.0 Tag Weights: Documents matching weighted source tag are promoted to top of query results Example: "mysql":2.0,"katrina":1.1 Default: None | The weights are applied as follows:
|
| Return documents | If disabled, documents are excluded from query results and therefore no results displayed in Doc Viewer. Only entities, events, facts, geo-tags, Default: Enabled | |
| Return Standalone Events | Aggregates events, facts and summaries while retaining the temporal element (unlike event/fact aggregation) Standalone events are only viewable in the event timeline widget (vs event/fact aggregations in the event graph) Default: Disabled | |
| Max Documents to Return | Max # of documents to return in Doc Viewer results list Default: 100 Documents | |
| Documents to Skip | Discards the first X documents. If set to 100, skips docs 1-100 Default: None | |
| Include Entities | If disabled, entities and entity scoring are not returned with queries - will result in slightly improved performance and query time. Default: Enabled | |
| Score Entities | If disabled, entities are returned with queries, however significance/relevance scoring is not performed - will result in slightly improved performance and query time. Default: Enabled | |
| Include Geotags | If disabled, geotags are excluded from query results. Map Widget will not display docs. Default: Enabled | |
| Include Metadata | If disabled, source-specific metadata is not returned with query results. Default: Enabled | |
| Include Summaries | If disabled, document summaries are excluded front query results. Default: Enabled | |
| Include Events | If disabled, events are excluded from query results. Default: Enabled | |
| Include Facts | If disabled, facts are excluded from query results. Default: Enabled | |
| Aggregate Geotags | If disabled, most common geotags are not aggregated. Default: Enabled Max Geotags to Return - Default: 1,000 | |
| Aggregate Times | If disabled, document counts of query results are not aggregated. Default: Enabled Aggregation Interval - Default: 1w (one week) | |
| Aggregate Entities | If disabled, top ranking entities are not aggregated. Default: Enabled Max Entities to Return - Default: 250 (recommend setting to 3,000+) | |
| Aggregate Events | If disabled, top ranking events are not aggregated. Default: Enabled Max Events to Return - Default: 100 | |
| Aggregate Facts | If disabled, top ranking facts are not aggregated. Default: Enabled Max Facts to Return - Default: 100 | |
| Aggregate Sources | Default: Disabled | |
| Aggregate Source Metadata | Default: Disabled | |
| Entity Filters | Docs not containing an entity of that type will be discarded Other entities types will be discarded from docs that are promoted Negative filters - entering a minus (-) before an entity type will discard that entity type from all results, no effect on query (i.e. -keyword will omit all keywords | The entity type filter or association verb category filter can be specified in one of two ways (in Advanced Options):
|
| Association Filters | Docs not containing an association of that type will be discarded Other association types will be discarded from docs that are promoted Negative filters - entering a minus (-) before an association type will discard that association type from all results, no effect on query (i.e. -generic relations) | The entity type filter or association verb category filter can be specified in one of two ways (in Advanced Options):
|