...
Widget Connector | ||
---|---|---|
|
TODO very quick documentation, also link to blog post
More complex analytics and visualization
VIDEO6
TODOThis video covers 3 topics:
- Using the entity alias builder to "merge" multiple different entities that actually represent the same person/place/etc.
- (ie dragging an entity from the table on the left to the top right table to make it a master alias, then dragging entities to merge from the left table to the bottom right one - those entities will be replaced with the master when the query is refreshed)
- Using the entity alias builder to "discard" unwanted entities.
- (ie dragging entities from the table on the left to the bottom right table, with "DISCARD" selected - those entities will disappear when the query is refreshed)
- Use the type/verb filters in the advanced options to remove entire classes of entities and associations.
These are straightforward and will not be covered again in this section.
It is worth re-iterating one of the
There are a few additional useful functions that are not covered in the video:
- Selecting different alias sets
- Creating new alias masters (that aren't present in the data)
- "Text" aliases
- Manually setting alias configurations
- Positive and negative selection in the entity and association filters
Selecting different alias sets
As was noted in the video, all of the sets of aliases from the different configurations across the different communities that are being searched are combined. If you do not search over a community then any aliases saved in that community are not applied.
As a result, if you place an alias set in a community with no data in it then you can choose whether or not to apply it at query time just by either including the community or not from the source manager.
In fact there is a built in community that lets you accomplish this: the personal community (referred to as the "Personal Sandbox" in the Entity Alias Builder widget), see screenshot below:
TODO screenshot
TODO more of this dullness
Creating synthetic alias masters
It will be often the case that the desired "master" entity will not actually be present in the data.
For example, if you have a reddit and a twitter author who you believe to be the same person, you will have their handles as TwitterUser and RedditUser respectively (eg "joeOnTwitter" and "blogs_the_blogger"). If you can infer their name from the posts then you might want to make that the master entity, eg "Joe Blogs/Person" with "joeOnTwitter/TwitterUser" and "blogs_the_bloggetr/RedditUser" as aliases, even though "Joe Blogs" never appears in the content.
This is easily accomplished: in the widget, type the entity name, "/" then the desired entity type in the "Filte/Add Master" text box (see screencap below; ignore the fact that it will temporarily filter the other masters out), and then press the "+" button.
Select the created master entity and then drag aliases from the table on the left, and save as normal.
"Text" aliases
Creating master:alias sets actually does two things:
- Aliases are merged into the master entity during the query
- Queries involving the master entity are expanded to include the master's aliases
Sometimes this expansion is not sufficient. For example, some of the data might not have had any entities extracted at all (eg it presents the data in bullets that the NLP cannot parse). To address this sort of issue, the Entity Alias Builder widget also allows you to add arbitrary text to the expansion, which is converted to full text searches, ie will bring back documents regardless of the entity extraction quality/completeness.
There are two easy ways of doing this (see screenshots below):
- (red) type the desired text into the "Filter/Add Entities" text box on the right (ignore that it filters as you type), and press the "+" button next to it.
- (orange) tick the "create exact text terms for aliases" and then drag aliases across as normal, with the checkbox ticked 2 aliases are created: the normal entity and also the text of the entity name.
Manually setting alias configurations
TODO
Positive and negative selection in the entity and association filters
TODO
Further reading:
More complex analytics and visualization
COMING SOON! Note the functionality is already present.
Further reading:
- Plugin manager documentation
- Information about the built-in Javascript engine
- Developer information about building Java Hadoop plugins
- An IKANOW blog post discussing using jsfiddle to visualize custom analytics
- (contains links to some other relevant blog posts about running analytics on Infinit.e datasets, including this one about doing temporal/sentiment analytics on emails)
Exporting the data (and alerting)
...
Importing other sources
TODO
Adding communities
VIDEO7
TODO
Updating the software
...
TODO