ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper Comparative Evaluation of Visual Question Answering with Scene Graph Utilization
Cited 0 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Choulsoo Jang, Yoon-Seok Choi, Kwang-Yong Kim, Jaewan Kim, Sungwoo Jun, Chang Eun Lee
Issue Date
2024-10
Citation
International Conference on Information and Communication Technology Convergence (ICTC) 2024, pp.1980-1981
Publisher
IEEE
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/ICTC62082.2024.10827158
Abstract
Integrating scene graphs enhances Visual Question Answering (VQA) systems' performance in dynamic and diverse real-world applications. This study examines the impact of scene graphs and external knowledge on VQA systems. Using the LLM-based model, LLaVA-v1.5, we trained and evaluated models with and without scene graphs. Additionally, we compared models trained solely with scene graphs to those trained with both scene graphs and external knowledge.
KSP Keywords
Comparative Evaluation, External knowledge, Real-world applications, Scene graph, Visual Question Answering