...

Info

Note that you can also get data directly from the API; see the examples in the query documentation. The only difference is that the API returns the documents in a field called "data" instead of "documents", so the field has to be renamed by hand before running the test harness.
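The rename can also be scripted. The following is a minimal sketch, assuming the saved API response is a JSON object with "data" as a top-level field and that the test harness only needs that field renamed; the function names and file paths are hypothetical and not part of Infinit.e.

```python
import json


def rename_data_to_documents(response):
    """Return a copy of an API response with "data" renamed to "documents".

    Assumption: "data" is a top-level field of the response object.
    """
    if "data" not in response:
        raise KeyError('expected a top-level "data" field')
    if "documents" in response:
        raise ValueError('response already has a "documents" field')
    renamed = dict(response)  # shallow copy, the input is left untouched
    renamed["documents"] = renamed.pop("data")
    return renamed


def convert_file(src_path, dst_path):
    """Read a saved API response and write the renamed version (hypothetical helper)."""
    with open(src_path, encoding="utf-8") as src:
        response = json.load(src)
    with open(dst_path, "w", encoding="utf-8") as dst:
        json.dump(rename_data_to_documents(response), dst, indent=2)
```

For example, `convert_file("api_response.json", "test_input.json")` would produce a file to feed to the test harness, with both file names chosen purely for illustration.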

...

Warning

When selecting a small percentage of the documents in a large community, it is recommended to query on an indexed field. The indexed fields are:

  • Document metadata:
    • "sourceKey"
    • "_id"
    • "url"
    • "entities.index"
  • Document content:
    • "sourceKey"
    • "url"
  • Entity features:
    • "index"
    • "disambiguated_name"
    • "alias"
  • Association features:
    • "index"
  • Custom metadata tables:
    • "_id"
    • Any fields specified for sorting

For example, in the example above, "source" is not indexed, so the query would perform a full scan within the community, which is sub-optimal for larger communities. A future release will allow use of the full query API to provide more flexible queries at high speeds.
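A quick way to catch this before submitting a job is to compare the query's field names against the indexed fields listed above. The sketch below assumes the query is a flat mapping of field names to conditions; the input-type labels (such as "document_metadata") and the function name are hypothetical, and nested operators are not inspected.

```python
# Indexed fields per input type, copied from the list above.
# The dictionary keys are illustrative labels, not Infinit.e identifiers.
INDEXED_FIELDS = {
    "document_metadata": {"sourceKey", "_id", "url", "entities.index"},
    "document_content": {"sourceKey", "url"},
    "entity_features": {"index", "disambiguated_name", "alias"},
    "association_features": {"index"},
    "custom_tables": {"_id"},  # plus any fields specified for sorting
}


def unindexed_fields(query, input_type, extra_indexed=()):
    """Return the top-level query fields that are not indexed, sorted by name.

    Keys starting with "$" (modifiers such as "$srctags") are skipped.
    extra_indexed lets the caller add the sort fields of a custom table.
    """
    indexed = INDEXED_FIELDS[input_type] | set(extra_indexed)
    return sorted(
        field
        for field in query
        if not field.startswith("$") and field not in indexed
    )
```

Calling `unindexed_fields({"source": "example"}, "document_metadata")` returns `["source"]`, flagging the full-scan case described above, whereas a query on "sourceKey" returns an empty list.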

Also note: in standalone mode you cannot use Infinit.e-style queries or the "$srctags" and "$caches" modifiers.

...

  1. Pull the project infinit.e.processing.custom.library (from ikanow_infinit.e_community repo)
  2. Pull the project infinit.e.data_model (from ikanow_infinit.e_community repo)
  3. Create a system environment variable named "HADOOP_HOME" pointing at infinit.e.processing.custom.library/win_hadoop_home (or copy those files somewhere on your machine and point at that location)
  4. Add "%HADOOP_HOME%\bin" to the Windows PATH (and ensure the change has been picked up, e.g. restart Eclipse or relaunch the run configuration)
  5. Copy the jars in "infinit.e.processing.custom.library/standalone_libs" into your event project and add them to the build path
  6. Add the data_model project to your build path
  7. Also copy the jar "hadoop-common-2.5.0-cdh5.3.1-IKANOW_LOCAL_NOCRC_STUBS.jar" from the data_model project into your project

Info

(In order to run custom source tests via the Tomcat API/UI, only steps 3 and 4 are needed.)
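Steps 3 and 4 are easy to get wrong, since a running process keeps the environment it was started with. The following is a minimal sketch of a check, assuming Python is available on the machine; the function name is hypothetical, and it only inspects the environment variables, not the contents of the win_hadoop_home directory.

```python
import os


def check_hadoop_env(env=None):
    """Return a list of problems with the HADOOP_HOME/PATH setup.

    An empty list means both settings look consistent.
    env defaults to the environment of the current process.
    """
    env = os.environ if env is None else env
    hadoop_home = env.get("HADOOP_HOME")
    if not hadoop_home:
        return ["HADOOP_HOME is not set"]

    def normalise(path):
        # normcase makes the comparison case-insensitive on Windows.
        return os.path.normcase(os.path.normpath(path))

    expected_bin = normalise(os.path.join(hadoop_home, "bin"))
    path_entries = [
        normalise(entry)
        for entry in env.get("PATH", "").split(os.pathsep)
        if entry
    ]
    if expected_bin not in path_entries:
        return ["PATH does not contain " + expected_bin]
    return []


if __name__ == "__main__":
    for problem in check_hadoop_env():
        print("Problem:", problem)
```

Run it from a freshly opened command prompt. If it reports no problems there but the job still fails inside Eclipse, Eclipse was probably started before the variables were changed and needs restarting.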