VizWiz – Visual Question Answering


The VizWiz-VQA (Visual Question Answering) dataset contains 20,523 training images, 4,319 validation images, and 8,000 test images. Each image in the training and validation sets is paired with one question about that image and 10 answers to that question.
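As an illustration of the record layout described above (one image, one question, 10 answers), here is a minimal Python sketch. The class name `VqaRecord`, its field names, and the sample values are hypothetical and are not taken from the dataset's actual annotation files, so check the real schema before relying on them.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import List

# Stated in the description: each annotated image has 10 answers.
ANSWERS_PER_QUESTION = 10


@dataclass
class VqaRecord:
    """Hypothetical in-memory view of one annotated image (names are assumptions)."""
    image: str                      # image file name
    question: str                   # the single question asked about the image
    answers: List[str] = field(default_factory=list)  # the associated answers

    def is_complete(self) -> bool:
        # True when the record carries exactly the expected number of answers.
        return len(self.answers) == ANSWERS_PER_QUESTION

    def answer_counts(self) -> Counter:
        # Tally answers after trimming whitespace and lowercasing.
        return Counter(a.strip().lower() for a in self.answers)


# Made-up sample values, for illustration only.
record = VqaRecord(
    image="example.jpg",
    question="What is this?",
    answers=["a can of soup"] * 6 + ["soup"] * 3 + ["unanswerable"],
)
print(record.is_complete())
print(record.answer_counts().most_common(1))
```

Tallying the 10 answers is one simple way to see how much they agree for a given question; it is not presented here as the dataset's official evaluation procedure.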

Dataset Metadata

Format:
License: CC BY 4.0
Domain: Visual Question Answering
Number of Records: 32,842 images; 248,420 question-answer pairs
Size: 17.5 GB
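The record counts in the metadata follow from the split sizes given in the description. The short sketch below reproduces that arithmetic; the dictionary keys and function names are illustrative assumptions, and only the numbers come from this page.

```python
# Split sizes as stated in the dataset description.
SPLIT_IMAGES = {"train": 20_523, "val": 4_319, "test": 8_000}

# Only the training and validation splits are described as having answers.
ANNOTATED_SPLITS = ("train", "val")
ANSWERS_PER_QUESTION = 10


def total_images(splits):
    """Sum the image counts across all splits."""
    return sum(splits.values())


def question_answer_pairs(splits, annotated=ANNOTATED_SPLITS,
                          answers_per_question=ANSWERS_PER_QUESTION):
    """One question per annotated image, times the answers per question."""
    annotated_images = sum(splits[name] for name in annotated)
    return annotated_images * answers_per_question


images = total_images(SPLIT_IMAGES)          # 20,523 + 4,319 + 8,000
pairs = question_answer_pairs(SPLIT_IMAGES)  # (20,523 + 4,319) * 10
print(images, pairs)  # 32842 248420
```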


author="Danna Gurari and Qing Li and Chi Lin and Yinan Zhao and Anhong Guo and Abigale J. Stangl and Jeffrey P. Bigham",
title="{VizWiz-Priv}: {A} Dataset for Recognizing the Presence and Purpose of Private Visual Information in Images Taken by Blind People",
booktitle="IEEE Conference on Computer Vision and Pattern Recognition (CVPR)"