An Effective Med-VQA Method Using a Transformer with Weights Fusion of Multiple Fine-Tuned Models
An Effective Med-VQA Method Using a Transformer with Weights Fusion of Multiple Fine-Tuned Models
About this item
Full title
Author / Creator
Publisher
Basel: MDPI AG
Journal title
Language
English
Formats
Publication information
Publisher
Basel: MDPI AG
Subjects
More information
Scope and Contents
Contents
Visual question answering (VQA) is a task that generates or predicts an answer to a question in human language about visual images. VQA is an active field combining two AI branches: NLP and computer vision. VQA in the medical field is still at an early stage, and it needs vast efforts and exploration to reach practical usage. This paper proposes tw...
Alternative Titles
Full title
An Effective Med-VQA Method Using a Transformer with Weights Fusion of Multiple Fine-Tuned Models
Authors, Artists and Contributors
Author / Creator
Identifiers
Primary Identifiers
Record Identifier
TN_cdi_doaj_primary_oai_doaj_org_article_ff0490f17ed942fba346050566dbece1
Permalink
https://devfeature-collection.sl.nsw.gov.au/record/TN_cdi_doaj_primary_oai_doaj_org_article_ff0490f17ed942fba346050566dbece1
Other Identifiers
ISSN
2076-3417
E-ISSN
2076-3417
DOI
10.3390/app13179735