ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

논문 검색
구분 SCI
연도 ~ 키워드

상세정보

학술대회 Seeing is Smelling: Localizing Odor-Related Objects in Images
Cited 1 time in scopus Download 8 time Share share facebook twitter linkedin kakaostory
저자
김상윤, 박준석, 방준성, 이해룡
발행일
201802
출처
Augmented Human International Conference (AH) 2018, pp.1-9
DOI
https://dx.doi.org/10.1145/3174910.3174922
협약과제
17HS3100, 후각 바이오 정보 기반 감성증강 인터랙티브 콘텐츠 기술 개발, 이해룡
초록
Research on cross-modal associations between vision and olfaction has shown that odor perception can be strongly influenced by vision including odor-evoked objects. Recently, convolutional neural networks (CNNs), which is mainly applied to image recognition by learning image features, is driving the development of a visual recognition method with promising results. However, there is no attempts to recognition and localization of odor-related objects which allow us to facilitate detection of objects on scene when a congruent odor stimuli is released from scent device. The existing object localization methods require bounding-box annotation indicating the presence of object and location information in an image which is too costly because it is a time-consuming process. Image-level annotation, which indicates the presence of objects in an image, is easier to obtain than bounding-box annotation. In this work, we perform weakly supervised object localization using features of CNNs with only image-level annotation data to detect odor-related objects. We propose a method to classify and localize odor-related objects, and describe its implementation in detail. The experimental results indicate that its performance is comparable to previous CNNs for the classification of odor-related objects.
키워드
Convolutional neural networks, Olfaction and vision cross-modal, Olfaction-enhanced multimedia, Weakly supervised object localization
KSP 제안 키워드
Annotation data, Bounding Box, Convolution neural network(CNN), Image feature, Localization method, Location information(GPS), Recognition method, Weakly supervised, cross-modal, image recognition, object localization