
NOAA Weather Data – JFK Airport


The NOAA JFK dataset contains 114,546 hourly observations of various local climatological variables (including visibility, temperature, wind speed and direction, humidity, dew point, and pressure). The data was collected by a NOAA weather station located at the John F. Kennedy International Airport in Queens, New York.

Get this Dataset

Data Description | Zipped File Name
Full (Original) Dataset, 3.5 MB | noaa-weather-data-jfk-airport.tar.gz
Sample Dataset, 80 KB | noaa-weather-sample-data.tar.gz
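
Once an archive has been downloaded, it can be unpacked with Python's standard library. The following is a minimal sketch, assuming the archive sits in the current working directory; the helper name extract_archive and the destination folder noaa_jfk are illustrative choices, not part of the dataset.

```python
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir="."):
    """Extract a .tar.gz archive into dest_dir and return its member names."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, mode="r:gz") as archive:
        names = archive.getnames()
        # Only extract archives obtained from a source you trust.
        archive.extractall(dest)
    return names


# Assumed local path; the block does nothing if the file is absent.
archive = Path("noaa-weather-data-jfk-airport.tar.gz")
if archive.exists():
    for name in extract_archive(archive, "noaa_jfk"):
        print(name)
```

The printed names should correspond to the files described under Dataset Archive Contents.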

Dataset Metadata

Field | Value
Format | CSV
License | CDLA-Sharing
Domain | Time Series
Number of Records | 114,546 hourly observations
Data Split | NA
Size | 3.2 MB
Dataset Origin | National Oceanic and Atmospheric Administration
Dataset Version Update | Version 2 (September 12, 2019); Version 1 (July 16, 2019)
Data Coverage | Location: New York City. Dates: 2010-01-01 through 2018-07-27. Note: To download raw data from NOAA for a different region or date span, follow the steps outlined in the data archive’s README.txt.
Business Use Case | Agriculture: Detect unseasonal temperature change and alert farmers about potential damage to plants. Construction: Integrate hourly temperature into building model simulations to test structural integrity.
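
The agriculture use case can be illustrated with a small sketch. It is not the method used by any NOAA or MAX tooling. It substitutes a simple trailing-mean comparison for a true seasonal baseline: an hourly reading is flagged when it differs from the mean of the preceding observations by more than a threshold. The function name flag_temperature_jumps, the 24-observation window, the 10-degree threshold, and the sample values are all assumptions made for illustration.

```python
from collections import deque
from statistics import mean


def flag_temperature_jumps(readings, window=24, threshold=10.0):
    """Return readings that deviate from the trailing mean by more than threshold.

    readings: iterable of (timestamp, temperature) pairs in time order.
    window: number of preceding observations used for the trailing mean.
    threshold: absolute deviation, in the same unit as temperature.
    """
    recent = deque(maxlen=window)
    flagged = []
    for timestamp, temperature in readings:
        # Only compare once a full window of earlier readings is available.
        if len(recent) == window:
            baseline = mean(recent)
            if abs(temperature - baseline) > threshold:
                flagged.append((timestamp, temperature, baseline))
        recent.append(temperature)
    return flagged


# Synthetic values for illustration only, not taken from the dataset.
sample = [(f"hour-{i}", 50.0) for i in range(24)]
sample += [("hour-24", 30.0), ("hour-25", 50.0)]
alerts = flag_temperature_jumps(sample)
print(alerts)
```

In this synthetic run only the sudden drop at hour-24 is flagged. A production alerting system would more likely compare against multi-year norms for the same calendar period, which the 2010 to 2018 coverage of this dataset could supply.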

Dataset Archive Contents

File or Folder | Description
jfk_weather.csv | Raw data obtained directly from NOAA.
jfk_weather_cleaned.csv | Cleaned version of the raw dataset. Used to train MAX model.
LICENSE.txt | Terms of Use
clean_data.py | Script used to generate jfk_weather_cleaned.csv
README.txt | Explains dataset information and steps for extracting similar data from NOAA

Data Glossary and Preview

Click here to explore the data glossary, sample records, and additional dataset metadata.

Use the Dataset

This dataset is complemented by Python notebooks for data exploration, data analysis, and modeling to help you get started.

Quick access in Python (requires the pardata PyPI package):

$ pip install pardata

import pardata
data = pardata.load_dataset('noaa_jfk')
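
As an alternative to pardata, the extracted CSV files can be read with the standard library alone. The sketch below computes a basic summary of one numeric column. The helper name column_summary is hypothetical, and the header names timestamp and temperature in the in-memory sample are placeholders; the real column names are listed in the data glossary.

```python
import csv
import io
from statistics import mean


def column_summary(csv_file, column):
    """Return (count, minimum, mean, maximum) for a numeric CSV column.

    Rows whose value is missing, empty, or not numeric are skipped.
    """
    values = []
    for row in csv.DictReader(csv_file):
        try:
            values.append(float(row[column]))
        except (KeyError, TypeError, ValueError):
            continue
    if not values:
        raise ValueError(f"no numeric values found in column {column!r}")
    return len(values), min(values), mean(values), max(values)


# Hypothetical header and values standing in for the real file.
sample_csv = io.StringIO(
    "timestamp,temperature\n"
    "2010-01-01 01:00,33.0\n"
    "2010-01-01 02:00,\n"
    "2010-01-01 03:00,35.0\n"
)
summary = column_summary(sample_csv, "temperature")
print(summary)
```

To run the same summary on the real data, open jfk_weather_cleaned.csv with open(path, newline="") and pass the file object together with a column name from the glossary.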