The recent upgrade of the Streaming Analytics service on Bluemix also includes integration with the IBM Data Science Experience (DSX) platform.  This allows you to create Streaming analytics applications in Python right in your browser!

What is the Data Science Experience?

The Data Science Experience (DSX) is an interactive, collaborative environment that allows you to quickly extract insights from your data. You can create your applications using popular languages like Python, R, and Scala, Jupyter Notebooks, Spark, RStudio and more.  DSX has built in tutorials and notebooks, and even sample datasets! You can also import data from files, on-prem or cloud databases including dashDB, Cloudant, Object Storage, and the list is growing.  It also makes it easy to build appealing visualizations using PixieDust and share content with others via GitHub.


Data Science Experience and Streaming Analytics

With the integration of DSx and Streaming Analytics, you can now create Python notebooks to analyze real-time data without having to store the data first or install any software.  Using the Data Science Experience platform allows you to  gain insights from data at rest, and now, you can gain insights from real time data by integrating with the Streaming Analytics service. You can now build and your applications in Python right from the DSX environment, without having to install Streams!

  • Use your models and algorithms on real time data:
    For example, you could create a Python application to predict the likelihood that a given engine will fail based on its temperature. After creating a model and training the model on historic failure data, you can submit the application to the service to run the model on real time data readings. You could even configure the application to alert you if a failure is imminent.
  • Retrieve the analysis results from the service and visualize them within the notebook:
    Create useful visualizations for your streaming data using Bokeh, Matplot, PixieDust, and more.  More on this below.


Sample Notebooks

With the launch of this feature, we have included 3 Python notebooks to help you get started.  More notebooks will be added in the future, so be sure to check the DSX Community page or the IBM Streams samples catalog for the latest.

Included in the samples is a basic “Hello, World” notebook, which is a great way to verify that your connection to Streams from DSX is working properly.

We also released the Healthcare Analytics demo notebook, which uses the Streams Healthcare Analytics platform.  In this notebook, you create a Streams application that monitors and analyzes patient vital signs.  The notebook also shows how to retrieve  the analysis results from a  Streams view and visualize them in the notebook, as you can see here. The visualization is done using Bokeh.

The last notebook we included is based on the example we referred to earlier.  It creates a neural network to determine the probability that an engine will fail based on its temperature. It also demonstrates how to create a sample data set to train the model.  It uses PyBrain, NumPy, and Matplot.

Try it out

Ready to experiment? If you haven’t already, sign up for DSX  and for the Streaming Analytics service.  Then, choose a notebook below and click “Copy”  to import the notebook into your project.




3 Comments on "NEW: Streaming Analytics now integrates with the Data Science Experience!"

  1. Richard Sun April 25, 2017

    I use DSX desktop to do this demo. there was following error. JAVA_HOME not set
    then I used export to set. but it didn’t take effect. the error was the same.
    !export JAVA_HOME=’/usr/lib/jvm/java-8-openjdk-amd64′
    !echo $JAVA_HOME
    ValueError Traceback (most recent call last)
    in ()
    32 patients_vital = patients_data.view(name=’patients_vital’)
    —> 34 rc = submit(‘STREAMING_ANALYTICS_SERVICE’, topo, config)
    35 print (“DONE”)

    /opt/conda/lib/python3.5/site-packages/streamsx/topology/ in submit(ctxtype, graph, config, username, password)
    71 context_submitter = _SubmitContextFactory(graph, config, username, password).get_submit_context(ctxtype)
    —> 72 return SubmissionResult(context_submitter.submit())

    /opt/conda/lib/python3.5/site-packages/streamsx/topology/ in submit(self)
    116 java_home = os.environ.get(‘JAVA_HOME’)
    117 if java_home is None:
    –> 118 raise ValueError(“JAVA_HOME not found. Please set the JAVA_HOME system variable”)
    120 jvm = os.path.join(java_home, “bin”, “java”)

    ValueError: JAVA_HOME not found. Please set the JAVA_HOME system variable

  2. Richard Sun April 27, 2017

    Hi, this time I used DSX(cloud) to exercise this demo. but there is no data showing in the graph.
    the four rectangles are all blank , just like in this article.
    there was no any error before display the graph (4.3)

Join The Discussion