
Predict energy prices with in-database analytics

This code pattern is part of the Get started with data science using Netezza learning path.

Summary

In this developer code pattern, we show you how to develop and use analytic algorithms for research or other business-related activities with Netezza® Performance Server. Netezza Performance Server runs data mining tasks on large data sets using the computational power and parallelization mechanisms of the Netezza appliance. This parallel database architecture supports high-performance computation, which makes it well suited to large-scale data mining applications.

Description

Netezza includes in-database analytics packages for mining data sets across a range of sizes. IBM Netezza Analytics is an in-database data mining application that applies key techniques and popular real-world algorithms.

In this code pattern, we will use a Jupyter Notebook in the IBM Watson® Studio service on IBM Cloud Pak for Data®. The notebook has steps to connect to Netezza and use in-database analytic functions to analyze the data and run machine learning algorithms, allowing you to predict and forecast data. To access the analytic functions of Netezza, you must install the Netezza Analytics module on the Netezza server.

We will analyze an energy price dataset in a Jupyter Notebook on IBM Cloud Pak for Data. We will walk you through:

  • Analyzing data using Netezza in-database analytic functions
  • Creating machine learning models using Netezza in-database machine learning algorithms

Flow

Flow diagram

  1. User loads Jupyter Notebook into IBM Cloud Pak for Data.
  2. User connects to Netezza using the Python connector.
  3. User loads and analyzes data from Netezza Performance Server.
  4. Netezza creates models using in-database analytics functions.
  5. User forecasts and predicts energy price using the model.
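
Steps 2 and 3 of the flow can be sketched in Python roughly as follows. This is a minimal illustration, not the notebook's actual code: it assumes the nzpy driver is installed, and the environment variable names (NZ_HOST, NZ_USER, and so on), the default values, and the helper function names are hypothetical placeholders.

```python
import os


def netezza_connection_args(env=os.environ):
    """Collect keyword arguments for nzpy.connect.

    The NZ_* variable names and the defaults are hypothetical;
    substitute the values for your own Netezza server.
    """
    return {
        "user": env.get("NZ_USER", "admin"),
        "password": env.get("NZ_PASSWORD", ""),
        "host": env.get("NZ_HOST", "localhost"),
        "port": int(env.get("NZ_PORT", "5480")),
        "database": env.get("NZ_DATABASE", "system"),
    }


def fetch_rows(query, args=None):
    """Run a query on Netezza and return all result rows."""
    # Imported lazily so the helper above can be used without the driver.
    import nzpy

    conn = nzpy.connect(**(args or netezza_connection_args()))
    try:
        cursor = conn.cursor()
        cursor.execute(query)
        return cursor.fetchall()
    finally:
        conn.close()
```

With a reachable server, a call such as fetch_rows("SELECT COUNT(*) FROM ENERGY_PRICE") would return the row count of a table, where ENERGY_PRICE is again only an illustrative name.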

Instructions

Detailed instructions are in the README, where you will learn how to:

  1. Create a new project in IBM Cloud Pak for Data
  2. Add connection to Netezza server
  3. Upload data assets
  4. Load notebook to your project
  5. Install the nzpy Python library
  6. Configure Netezza Performance Server connection in notebook
  7. Load data into Netezza
  8. Visualize energy price data
  9. Analyze energy price data
  10. Create machine learning model using time series algorithm
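
As a rough idea of what step 10 might involve, the sketch below builds the SQL statement that invokes the Netezza Analytics time series stored procedure. It is an assumption-laden illustration rather than the README's code: the table, column, and model names are hypothetical, and the parameter names (model, intable, by, time, target, outtable) should be checked against the Netezza Analytics documentation for your installed version.

```python
def timeseries_call(intable, by_col, time_col, target_col, model, outtable):
    """Build a CALL statement for the nza..TIMESERIES stored procedure.

    Netezza Analytics procedures take a single string of
    comma-separated name=value pairs. The parameter names used
    here are assumptions to verify against the product documentation.
    """
    params = {
        "model": model,
        "intable": intable,
        "by": by_col,
        "time": time_col,
        "target": target_col,
        "outtable": outtable,
    }
    param_string = ", ".join(f"{name}={value}" for name, value in params.items())
    return f"CALL nza..TIMESERIES('{param_string}');"


# Hypothetical table, column, and model names, for illustration only.
sql = timeseries_call(
    intable="ENERGY_PRICE",
    by_col="REGION",
    time_col="DAY",
    target_col="PRICE",
    model="PRICE_MODEL",
    outtable="PRICE_FORECAST",
)
print(sql)
```

The resulting statement would then be passed to cursor.execute on an open Netezza connection, so that model training runs inside the database rather than in the notebook.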

Next steps

Check out Netezza Performance Server to learn even more.