This dataset is used for solving complex ML problems. This dataset was created and prepared by:

Dataset Metadata

Field               Value
Format              text
License             CC BY 4.0
Domain              text
Number of Records   1000
Data Split          80-20
Size                1.49 GB
Dataset Origin      IBM
Dataset Version     Version 1 – Oct 1, 2020
Data Coverage       Wikipedia
Business Use Case   lorem ipsum
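
The metadata lists 1000 records and an 80-20 data split but does not say how the split was produced. As a minimal sketch, assuming the split means 80% training and 20% test records chosen at random, the arithmetic works out as below. The function name `split_indices`, the seed, and the random shuffling are illustrative assumptions, not the procedure used to prepare this dataset.

```python
import random


def split_indices(n_records, train_fraction=0.8, seed=0):
    """Shuffle record indices and cut them into train and test lists.

    Hypothetical helper: the dataset's real splitting steps are not
    described in the metadata, so a seeded random shuffle is assumed.
    """
    indices = list(range(n_records))
    random.Random(seed).shuffle(indices)  # seeded, so the split is reproducible
    cut = int(round(n_records * train_fraction))
    return indices[:cut], indices[cut:]


# 1000 records at 80-20 gives 800 training and 200 test records.
train_idx, test_idx = split_indices(1000)
print(len(train_idx), len(test_idx))  # prints "800 200"
```

Fixing the seed keeps the two subsets stable between runs, which matters when results are compared across experiments.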

Contents of Dataset Archive

File Description
LICENSE.txt Terms of Use Explains data collection, processing details, and steps for splitting dataset

Data Glossary and Preview

For a full view of this dataset’s metadata, data glossary, and a set of sample records, click the "Preview the dataset" button above or follow the link here.

Use the Dataset

Starter notebooks complement this dataset and help you begin working with it:

  • LINK TO RESEARCH PAPER: Describes how the data was collected and verified, what it contains, and its previous versions and properties.