In this developer code pattern, we will show you how to leverage the development and use of analytic algorithms to perform research or other business-related activities using Netezza® Performance Server. The Netezza Performance Server enables data mining tasks on large data sets using the computational power and parallelization mechanisms provided by the Netezza appliance. The parallel architecture of the Netezza database environment enables high-performance computation on large data sets, making it the ideal platform for large-scale data mining applications.
Netezza has in-database analytics packages for mining the spectrum of data set sizes. IBM Netezza Analytics is an in-database data mining application that applies key techniques and popular real-world algorithms.
In this code pattern, we will use a Jupyter Notebook using the IBM Watson® Studio service IBM Cloud Pak for Data®. The notebook has steps to connect to Netezza and use in-database analytic functions to analyze the data and run machine learning algorithms, allowing you to predict and forecast data. In order to access the analytical functions of Netezza, you should install the Netezza Analytics module into the Netezza server.
We will use an energy price dataset to analyze the data with Jupyter Notebook using IBM Cloud Pak for Data. We will walk you through:
- Analyzing data using Netezza in-database analytic functions
- Creating machine learning models using Netezza in-database machine learning algorithms
- User loads Jupyter Notebook into IBM Cloud Pak for Data.
- User connect to Netezza using Python connector.
- User loads and analyzes data from Netezza Performance Server.
- Netezza creates models using in-database analytics functions.
- User forecasts and predicts energy price using the model.
Detailed instructions are in the README, where you will learn how to:
- Create a new project in IBM Cloud Pak for Data
- Add connection to Netezza server
- Upload data assets
- Load notebook to your project
- Install the nzpy Python library
- Configure Netezza Performance Server connection in notebook
- Load data into Netezza
- Visualize energy price data
- Analyze energy price data
- Create machine learning model using time series algorithm
See the learning path summary for further steps.