Introduction

Welcome to the Infinit.e landing page! This sub-site is intended for integrators, developers, IT staff, technical analysts, researchers and similar roles, who want a technical overview of the platform and technical information about how to install, configure, exploit, integrate, or extend it.

For a higher level view, check out the following links:

In the spirit of the sort of analysis we would like to support, we will provide the remainder of this overview using the "5 Ws".

  • Who

  • What

  • Why

  • When

  • How

TODO link the Ws

Who?

We are Ikanow, the developers of Infinit.e, the first Open Source document analysis platform. Our vision is XXX. 

What?

Infinit.e is a framework for collecting, storing, processing, retrieving, and visualizing unstructured documents. 

Let's provide some clarification on each of the often overloaded terms used in that previous sentence:

  • By "processing" we mean TODO
  • By "retrieving" we mean the ability to search documents and return them in ranking order, but also to be able to retrieve "knowledge" aggregated over all documents matching the analyst's query.

(TODO confusion between framework and platform to be resolved here)

(TODO links to diagrams from presentations with pretty pictures of unstructured analysis)

TODO PIC internet, internal data, question, documents, cloud/embedded enrichment, knowledge

TODO thumbnails

Why...

... did we build it?

While supporting information analysts for the military and Government, we observed that the landscape of professional analysis tools is dominated by expensive proprietary products with limited flexibility, vendor lock-in, and requiring extensive and continuing  customization by expensive consultants.

Further, these tools had often originally been designed to analyze and mine structured records, whereas increasingly data is generated in a mix of unstructured documents and traditional structured records. Usually, unstructured documents dominate structured records in terms of readily available intelligence to be gleaned.

So we believed there was a gap in the market, if it could be filled.

We also observed that the Open Source community was developing tools that provided many of the core functions needed for an unstructured document-centric analysis tool (storage, search, aggregation, analytic frameworks). This provided some exciting new opportunities:

 Based on these needs and opportunities, we built Infinit.e, the first Open Source document analysis platform, using great OSS projects like Lucene, elasticsearch, Hadoop, MongoDB, tomcat, and many others. Our objectives are:

TODO some images of the OSS logos

... might you want to use it?

TODO

When?

Starting with prototypes developed by MTCSC (since acquired by Mantech), we started development of the tool in November 2010, focusing on adopting the best OSS tools for the tool's different functions, productionizing their use, and glueing them together in a logical way.

We released the first Open Source version in March 2012, mostly under the Affero GPL license (with plugins and utilities mostly released under the Apache 2.0 license).

Development continues: check out our TODO LINK roadmap.

How...