Carnegie Mellon University
Browse
multimodal.pdf (1.14 MB)

Multihop and Multimodal Question Answering on WebQA

Download (1.14 MB)
poster
posted on 2024-01-19, 21:23 authored by Dheeraj PaiDheeraj Pai, Deigant Yadava, Vinay Nair, João Monteiro

Multihop and Multimodal Question Answering (MMQA) is the cognitive process of retrieving and combining relevant information from diverse knowledge sources (e.g., textual, visual, and audio) to answer a given question.


In our work we propose a model for MMQA for the webQA dataset. Our model is based on the transformer architecture where we jointly alight and learn both the questions and parts of images that are relevant in answering the question.

History

Date

2023-12-06

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC