Entity Significance Interface

The Entity Significance widget appears when the widget is added to the workspace.

Description:

Use the Entity Significance widget to view the entities across all documents, ranked by score or frequency.

FieldDescriptionNotes
Widget label. 
Drag and drop control. For more information, see section Common Functionality. 
Ignore/Apply workspace filtering. See section Common Functionality for more information. 
Add selected entities to the query. 
Type an entity name to filter the results of the widget. 
Expand control for more options. 
EntitiesDropdown menu that can be used to select a specific entity from the returned results. You can then click on the plus button to add the entity to the complex query string, or perform a google search. 
googlePerforms Google search on selected entity. 
Rank By

Dropdown which determines if entities are ranked (sorted by order of importance) using Significance, Coverage or Frequency.

Rank by Significance:

Rank by Coverage:

Rank by Frequency:

 

 

 

Graph

This section defines the fields that are displayed on the graph.

Legend

The legend explains the color coding of the bar chart.  The entities are returned to the bar chart using their pertinent metrics.  For example, the orange part of the chart displays the entities query coverage metric.  Hovering over the grey portion reveals the max Doc Frequency metric.

Query Coverage: % of matching docs in which the entity occurs.

Query Significance:  TF-IDF score of entity for the query.   Entities with low document counts have their significance suppressed by 33% (well below a dynamically calculated "noise floor") or 66% (just below/at the "noise floor").  When only a subset of the matching documents are returned (eg > 1000 documents), the significance is adjusted to estimate the TF-IDF across the entire matching dataset, not just the returned subset.

MaxDoc Significance: % of times the entity occurs in the documents in comparison to the other entities taken together as an average.

MaxDoc Frequency: The most number of times within a single document that the entity occurred.

Bar Chart

When you hover over the bar charts, a variety of data is displayed depending on the applicable metric.  For example if you hover over the orange portion, you will view information pertaining to Query Coverage.  For each metric, the specific metric data is returned as well as Entity and Entity Type.

 


 

 

Related User Documentation:

Entity Significance