CompTIA 1001 Performance-Based Questions

Efficient Multimodal Selection for Retrieval in Knowledge-Based Visual Question Answering

Abstract: Retrieval plays an important role in knowledge-based visual question answering (KB-VQA), which relies on external knowledge to answer questions related to an image. However, not all ...

IEEE

Prompting Large Language Models with Knowledge-Injection for Knowledge-Based Visual Question Answering

Previous works employ the Large Language Model (LLM) like GPT-3 for knowledge-based Visual Question Answering (VQA). We argue that the inferential capacity of LLM can be enhanced through knowledge ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Efficient Multimodal Selection for Retrieval in Knowledge-Based Visual Question Answering

Prompting Large Language Models with Knowledge-Injection for Knowledge-Based Visual Question Answering

Trending now