ETRI-Knowledge Sharing Platform

Multimodal 3D Object Retrieval System based on Text and Generated image
Cited 0 times in Scopus
Authors
Jong Gook Ko, Su Woong Lee, Seungjae Lee
Issue Date
2024-10
Citation
International Conference on Information and Communication Technology Convergence (ICTC) 2024, pp.1695-1698
Publisher
IEEE
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/ICTC62082.2024.10826940
Abstract
This paper presents a novel multimodal 3D object retrieval technique that utilizes both text and 2D images generated from the text as inputs. The demand for efficient and accurate 3D object retrieval systems has grown significantly across various domains, including virtual reality, augmented reality, game development, and industrial design. Traditional 3D object retrieval methods typically rely on single-modal approaches, such as text-based or image-based searches, which often struggle to fully capture the complex visual and spatial characteristics of 3D objects. This limitation is particularly pronounced when textual descriptions alone cannot adequately express intricate visual features or when appropriate reference images are unavailable. To address these challenges, we propose a novel approach that integrates the descriptive capabilities of text with the detailed visual information provided by 2D images generated directly from those text descriptions. Our research demonstrates that this multimodal approach significantly enhances retrieval accuracy by combining the complementary strengths of text and image modalities. In conclusion, the multimodal 3D object retrieval system proposed in this paper, which utilizes text-generated 2D images as supplementary input, offers substantial improvements in search accuracy and user satisfaction.
KSP Keywords
3D Object retrieval, Augmented reality (AR), Game Development, Image-based, Multimodal approach, Novel approach, Search accuracy, Spatial characteristics, Virtual Reality, Visual Features, Industrial design