This pattern is part of the Get started with natural language processing learning path.
|100||An introduction to Watson natural language processing||Article|
|101||Look deeper into the Syntax API feature within Watson Natural Language Understanding||Article|
|201||Visualize unstructured data using Watson Natural Language Understanding||Code pattern|
In this code pattern, we will create a web app for visualizing unstructured data using Watson™ Natural Understanding, Apache Tika, and D3.js. After a user uploads a local file of choice, the application leverages Apache Tika to extract text from the unstructured data file. The text is then passed through Watson Natural Language Understanding, where entities and concepts are extracted. Finally, the application uses the D3.js library as a visualization tool to display the results to the user.
The main benefit of using the Watson Natural Understanding Service is its powerful analytics engine that provides cognitive enrichments and insights into the data. The key enrichments that are extracted include:
- Entities – People, companies, organizations, cities, and more
- Keywords – Important topics typically used to index or search the data
- Concepts – Identified general concepts that aren’t necessarily referenced in the data
- Sentiment – The overall positive or negative sentiment of the data
When you have completed this code pattern, you will understand how to:
- Create and use an instance of Watson Natural Language Understanding
- Leverage Apache Tika to extract text from unstructured files
- Use D3.js for displaying the visuals
- User configures credentials for the Watson Natural Language Understanding service and starts the app.
- User selects data file to process and load.
- Apache Tika extracts text from the data file.
- Extracted text is passed to Watson NLU for enrichment.
- Enriched data is visualized in the UI using the D3.js library.
Ready to get started? Please see the README for detailed instructions.
This pattern showed how to create a web app for visualizing unstructured data using the Watson Natural Understanding service, Apache Tika, and D3.js. The pattern is part of the Get started with natural language processing learning path.