As a starting point for the analysis, we had the system explained to us by the QA staff, collected all the components and linked them. This analysis should help to understand all components when it comes to re-sorting and expansion.
Breaking down and defining the currently available content
Definition of content and networking components
The analysis confirmed our initial idea to subdivide the content into configuration and exploration.
Configuration
Allows the import of data from external systems or other users. The data can be linked to various NLP algorithms, combined and represented in a component named “Thinker”.
Some of standard algorithms are trained to recognise e.g. people, organisations or the relation between them.
Dataset
By linking the thinkers to the imported data, the algorithms can access and analyze the data.
To provide maximum configurability, multiple data sources and Thinkers can be created and linked. Results to be shown are determined by the inputs/thinkers connected to the Dataset.
Exploration
An exploration mode can be used to query the Dataset to result in different representations. A possible form of representation is the display of an imported document with the extracted entities that the algorithms have found in the text.
Content splitting into Configuration and Exploration - both parts have access to the Dataset
Configuration mode: Consisting of data input and Thinkers using NLP algorithms to extract information
Visualization: Each Dataset can be displayed in a wide set of output methods and compared with each other
Exploration Mode: In addition to comparing different Datasets, it is also possible to explore the data for different representations
Sharing & Export: Results can be shared with other users