The Community Edition platform enables you to manage sources (data connectors pulling in data from databases, RSS feeds, fileshares etc.), and to visualize them using visualization widgets, in order to gain insights.
Source data in the platform is stored in JSON format as a document and the document format contains elements such as metadata, entities, and associations.
Sources are managed as part of Communities (see diagram on this page). All data ingested into the platform is segmented into a Community for access control, statistical integrity, and/or convenience (i.e. Financial News). Users are only able to access the communities (data) they've been added to in their user profile.
Sources are the data connectors pulling data from a database, feed (RSS), or fileshares (i.e. directories, single files (pdf/csv/xml), or ZIP). Each Source is assigned a Title (Fox News RSS), Tags (News, Politics, Conservative, Republican, US) and Type (News). Sources are then made up of documents harvested over time
Series of metadata fields (title, description, source ID, date/time, etc.)
Entities (person, IP-internal)
Associations: hard (subject - verb - object) vs soft
Entities are the who, what, and where extracted from a document
Who: Person, Company, Organization
What: IndustryTerm, Product, Facility
Where: City, ProvinceorState, Country
For more information, see section Entities.
An association is an activity or relationship between entities. It can be thought of as "subject / verb / object / at location / over time", where the subjects and objects can be free text and/or point to entities within the document.
For more information, see section Associations.
For more information, see section Scoring.
All matching documents contribute to the "knowledge" that a query can provide, however the documents themselves are not the only objects returned from a query. Instead, relevant information to the analysis is summed/averaged/etc ("aggregated") across all matching documents, and these are referred to as the "aggregations". Examples include:
Geo: lat/longs and their frequency in the document set
Times: number of documents per period (day, week, etc) in the document set
Entities: entity objects found in the document set, ranked by significance.
Events: event objects found in the document set, ranked by frequency.
Visualizations are where sources and documents come to life.
You can view complex geo-spatial and temporal document aggregations as intuitive graphs, and easily filter queries and results directly from the widgets. Furthermore, you can view document scoring parameters as metrics, including significance, relevance and frequency.
Sentiment Widget:
Map Widget:
For more information, see section Visualization.
Related Documentation: |
Related Visualization Documentation: |