Now available! Red Hat OpenShift Container Platform for Linux on IBM Z and LinuxONE Learn more

VizWiz

Overview

The VizWiz dataset contains 20,000 training images, 8,000 test images, and 3,173 validation images. Each image has a question about that image and 10 associated answers to the question.

Dataset Metadata

Format License Domain Number of Records Size
JSON
CC BY 4.0 Visual Question Answering 20,000 image/question pairs
15.3 GB

Citation

@inproceedings{vizwiz,
author="Danna Gurari and Qing Li and Chi Lin and Yinan Zhao and Anhong Guo and Abigale J. Stangl and Jeffrey P. Bigham",
title="{VizWiz-Priv}: {A} Dataset for Recognizing the Presence and Purpose of Private Visual Information in Images Taken by Blind People",
year=2019,
booktitle={IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}
}