VQA: Visual Question Answering: www.visualqa.org

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_miscellaneous_1904247196

VQA: Visual Question Answering: www.visualqa.org

About this item

Full title

Author / Creator

Agrawal, Aishwarya , Lu, Jiasen , Antol, Stanislaw , Mitchell, Margaret , Zitnick, C. Lawrence , Parikh, Devi and Batra, Dhruv

Publisher

New York: Springer US

Journal title

International journal of computer vision, 2017-05, Vol.123 (1), p.4-31

Language

English

Formats

Articles

Publication information

Publisher

New York: Springer US

More information

Scope and Contents

Contents

We propose the task of
free-form
and
open-ended
Visual Question Answering (VQA). Given an image and a natural language question about the image, the task is to provide an accurate natural language answer. Mirroring real-world scenarios, such as helping the visually impaired, both the questions and answers are open-ended. Visual questions selectively target different areas of an image, including background details and underlying context. As a result, a system that succeeds at VQA typically needs a more detailed understanding of the image and complex reasoning than a system producing generic image captions. Moreover, VQA is amenable to automatic evaluation, since many open-ended answers contain only a few words or a closed set of answers that can be provided in a multiple-choice format. We provide a dataset containing
∼
0.25 M images,
∼
0.76 M questions, and
∼
10 M answers (
www.visualqa.org
), and discuss the information it provides. Numerous baselines and methods for VQA are provided and compared with human performance. Our VQA demo is available on CloudCV (
http://cloudcv.org/vqa
)....

Alternative Titles

Full title

VQA: Visual Question Answering: www.visualqa.org

Authors, Artists and Contributors

Author / Creator

Agrawal, Aishwarya
Lu, Jiasen
Antol, Stanislaw
Mitchell, Margaret
Zitnick, C. Lawrence
Parikh, Devi
Batra, Dhruv

Identifiers

Primary Identifiers

Record Identifier

TN_cdi_proquest_miscellaneous_1904247196

Permalink

https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_proquest_miscellaneous_1904247196

Other Identifiers

ISSN

0920-5691

E-ISSN

1573-1405

DOI

10.1007/s11263-016-0966-6

How to access this item

Full text available

View in old catalogue

VQA: Visual Question Answering: www.visualqa.org

VQA: Visual Question Answering: www.visualqa.org

VQA: Visual Question Answering: www.visualqa.org

About this item

Publication information

Subjects

More information

Scope and Contents

Alternative Titles

Authors, Artists and Contributors

Identifiers

Primary Identifiers

Other Identifiers

How to access this item

Connecting people and collections

Indigenous engagement

Learning

Stories