This article highlights some new features of the Text toolkit in Streams 4.2 and shows how to update resources used by the TextExtract operator within a running Streams application. Continue reading Real time text analysis using Streams, Part 2: Updating dictionaries and tables
The goal of statistical classification is to use an object's characteristics to identify which class (or group) it belongs to. Such classifiers work well for practical problems such as document classification. The LinearClassification operator identifies the category of text from streaming data according to a model. It is part of the IBM Streams NLP Toolkit... Continue reading How to classify text using the IBM Streams Natural Language Processing (NLP) Toolkit LinearClassification operator?
Text extraction is one way to gain insight into unstructured data such as text or speech transcribed into text. There are different methods for writing text extraction rules. One of them is the UIMA Ruta language. The RutaText operator extracts data from streaming text according to predefined UIMA Ruta rules. It is part of the IBM Streams... Continue reading How to extract text using the IBM Streams Natural Language Processing (NLP) Toolkit RutaText operator?