2021 Call for Code Awards: Live from New York, with SNL’s Colin Jost! Learn more

Taranaki Basin Curated Well Logs


This dataset contains details about a set of oil wells located in the Taranaki Basin. The Taranaki basin comprises an area of about 330,000 square kilometers, located broadly onshore and offshore the New Zealand west coast. This basin is the main region for oil exploration and production in New Zealand, with over 400 wells drilled to date. The basin consists of sedimentary rocks dated from Late Cretaceous to present, covering the Paleozoic and Mesozoic basement rocks.

The data was curated from two sources, the New Zealand Petroleum & Minerals Online Exploration Database (data.nzpam.govt.nz), and the Petlab (pet.gns.cri.nz), which served to characterize the Taranaki basin. In particular, the data served to map important tectonic regions in the basin and the various formations in these regions. We used geological reports to identify formation markers, spreadsheets to find well header and drilling deviation information, and finally, LAS files to characterize a reasonable set of well log annotations. The curated dataset consists of a set with 407 wells containing the main geophysical well logs and reported geological formations in true vertical depth.

The data was then prepared, processed, and cleaned from various files into a final CSV file containing the well logs, the coordinates of the wells, and the corresponding labels.

Dataset Metadata

Field Value
Format CSV
License CDLA – Sharing
Domain Text
Number of Records 6,427,379 data points (407 wells)
Data Split NA
Size 873 MB
Authors Breno W.S.R. de Carvalho, Matheus Oliveira, Maiana Avalone, Júlio Hoffimann, Daniela Szwarcman, Jorge Guevara Diaz, Bianca Zadrozny
Dataset Origin IBM Research
Dataset Version Version 1 – May 10, 2020
Data Coverage Location: New Zealand
Cover: An area of about 330,000 square kilometers with 407 wells drilled to date
Business Use Case Map important tectonic regions in the basin and the various formations in the regions

Dataset Archive Contents

File or Folder Description
coords.csv This data file provides the coordinates of the wells.
logs.csv This data file provides the well logs information.
LICENSE.txt Terms of Use
README.md Explains data collection, processing details, and steps for splitting dataset

Data Glossary and Preview

Click here to explore the data glossary, sample records, and additional dataset metadata.

Use the Dataset

This dataset is complemented by starter notebooks that will help you get started:

Quick access in Python (requires the pardata pypi package):

$ pip install pardata

import pardata
data = pardata.load_dataset('taranaki_basin_curated_well_logs')