Advanced Options
- Josh Liss (Unlicensed)
- andrew johnston (Unlicensed)
The Advanced Options appear when you click Options > Advanced Options.
Description
Use the Advanced Options to set document analysis thresholds, scoring weights, geo decay, and other settings that affect the overall behavior of the platform and the widgets.
Field | Description | Notes |
---|---|---|
Enable/Disable Scoring | When disabled, documents & entities are returned without significance or relevance scoring. Default setting is enabled. | |
Number of documents to analyze | The maximum number of documents returned from the Lucene/Elasticsearch query and analyzed by the significance algorithm. Default: 1,000 documents. | |
Scoring Weights | Ratio of significance to relevance scoring weights. Default: 2:1 (Significance:Relevance). | |
Weight by | Applies a half-life principle to results ranking, based on the desired time interval. Fields: Time decay date, Hour, Half Life. | Example: with a 1m (one month) time decay, results within 1 month of the entered date are promoted to the top of the results; scores of results 1 to 2 months from the decay time are halved; scores of results 2 to 3 months from the decay time are quartered, and so on. |
Geo Decay | Applies a half-life principle to results ranking, based on distance (in kilometers) from a lat/long centerpoint. Default: None. | Example: a geo decay of 10k (10 kilometers) means that results 10-20 km from the centerpoint have their scores halved (reduced to 50%), results 20-30 km from the centerpoint have their scores quartered (reduced to 25%), and so on. |
Aggregate Significance Weighting | Controls whether aggregated entities are weighted by relevance, so that entities occurring in more relevant documents are weighted up. Automatic: the platform attempts to decide for itself based on the query. Always Weight: entities in more relevant documents are always weighted up. Never Weight: entities in more relevant documents are never weighted up. Default: Automatic. | |
Manual Weightings (in order of precedence) | Source Weights: documents from a weighted source are promoted to the top of the query results. Example: "www.google.com.search.56.6.":1.25,"www.google.com.search.532.23":0.75. Type Weights: documents matching a weighted source type are promoted to the top of the query results. Example: "News":1.1,"Social":4.0. Tag Weights: documents matching a weighted source tag are promoted to the top of the query results. Example: "mysql":2.0,"katrina":1.1. Default: None. | The weights are applied as follows: |
Return documents | If disabled, documents are excluded from query results, so no results are displayed in the Doc Viewer. Only entities, events, facts, and geotags are returned. Default: Enabled. | |
Return Standalone Events | Aggregates events, facts and summaries while retaining the temporal element (unlike event/fact aggregation). Standalone events are only viewable in the event timeline widget, whereas event/fact aggregations appear in the event graph. Default: Disabled. | |
Max Documents to Return | Maximum number of documents to return in the Doc Viewer results list. Default: 100 documents. | |
Documents to Skip | Discards the first X documents. If set to 100, skips docs 1-100. Default: None. | |
Include Entities | If disabled, entities and entity scoring are not returned with queries; this slightly improves performance and query time. Default: Enabled. | |
Score Entities | If disabled, entities are still returned with queries, but significance/relevance scoring is not performed; this slightly improves performance and query time. Default: Enabled. | |
Include Geotags | If disabled, geotags are excluded from query results. Map Widget will not display docs. Default: Enabled | |
Include Metadata | If disabled, source-specific metadata is not returned with query results. Default: Enabled | |
Include Summaries | If disabled, document summaries are excluded from query results. Default: Enabled | |
Include Events | If disabled, events are excluded from query results. Default: Enabled | |
Include Facts | If disabled, facts are excluded from query results. Default: Enabled | |
Aggregate Geotags | If disabled, most common geotags are not aggregated. Default: Enabled. Max Geotags to Return - Default: 1,000. | |
Aggregate Times | If disabled, document counts of query results are not aggregated. Default: Enabled. Aggregation Interval - Default: 1w (one week). | |
Aggregate Entities | If disabled, top ranking entities are not aggregated. Default: Enabled. Max Entities to Return - Default: 250 (recommend setting to 3,000+). | |
Aggregate Events | If disabled, top ranking events are not aggregated. Default: Enabled. Max Events to Return - Default: 100. | |
Aggregate Facts | If disabled, top ranking facts are not aggregated. Default: Enabled. Max Facts to Return - Default: 100. | |
Aggregate Sources | Default: Disabled | |
Aggregate Source Metadata | Default: Disabled | |
Entity Filters | Docs not containing an entity of the specified type are discarded. Other entity types are discarded from the docs that pass the filter. Negative filters: entering a minus (-) before an entity type discards that entity type from all results, with no effect on the query (e.g. -keyword omits all keywords). | The entity type filter or association verb category filter can be specified in one of two ways (in Advanced Options): |
Association Filters | Docs not containing an association of the specified type are discarded. Other association types are discarded from the docs that pass the filter. Negative filters: entering a minus (-) before an association type discards that association type from all results, with no effect on the query (e.g. -generic relations). | The entity type filter or association verb category filter can be specified in one of two ways (in Advanced Options): |
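
The Weight by and Geo Decay examples in the table describe a stepped half-life: full score inside the first interval, half in the second, a quarter in the third. The following is a minimal sketch of that arithmetic only, assuming the stepped reading of the examples; the function names are hypothetical and the platform's actual decay function may well be continuous rather than stepped.

```python
import math


def stepped_decay_factor(distance: float, half_life: float) -> float:
    """Return the score multiplier for a result `distance` units from the
    decay origin (months from the decay date, or km from the centerpoint).

    Assumption: scores halve once per full half-life interval, as in the
    table's examples. This is not taken from the platform's source code.
    """
    if half_life <= 0:
        raise ValueError("half_life must be positive")
    intervals = math.floor(abs(distance) / half_life)
    return 0.5 ** intervals


def decayed_score(score: float, distance: float, half_life: float) -> float:
    """Apply the stepped decay multiplier to a raw score."""
    return score * stepped_decay_factor(distance, half_life)


if __name__ == "__main__":
    # Geo decay of 10 km: results at 5 km, 15 km and 25 km from the centerpoint.
    for km in (5, 15, 25):
        print(km, stepped_decay_factor(km, 10))
```

With a half-life of 10, a result 15 units away keeps half of its score and a result 25 units away keeps a quarter, which matches the corrected Geo Decay example.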
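
The Entity Filters and Association Filters rows describe two behaviors: a positive filter discards documents lacking the type and strips other types from the remaining documents, while a negative filter only strips the named type and leaves the document set untouched. The sketch below illustrates those semantics on in-memory data. The comma-separated filter string, the document layout (`entities` list with a `type` key), and the function names are all assumptions made for illustration, not the platform's API.

```python
def parse_filter(spec: str) -> tuple[set[str], set[str]]:
    """Split a filter string into positive and negative type sets.

    Assumption: types are comma-separated and a leading minus marks a
    negative filter, e.g. "person,-keyword".
    """
    positive, negative = set(), set()
    for term in spec.split(","):
        term = term.strip().lower()
        if not term:
            continue
        if term.startswith("-"):
            negative.add(term[1:].strip())
        else:
            positive.add(term)
    return positive, negative


def apply_entity_filter(docs: list[dict], spec: str) -> list[dict]:
    """Filter documents and their entities as the table rows describe."""
    positive, negative = parse_filter(spec)
    kept_docs = []
    for doc in docs:
        entities = doc.get("entities", [])
        if positive:
            # Keep only the requested types; drop docs that have none of them.
            entities = [e for e in entities if e["type"].lower() in positive]
            if not entities:
                continue
        # Negative filters strip a type but never discard the document.
        entities = [e for e in entities if e["type"].lower() not in negative]
        kept_docs.append({**doc, "entities": entities})
    return kept_docs
```

Under these assumptions, a spec of `person` keeps only documents that mention a person, while `-keyword` returns every document with its keyword entities removed.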