ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

논문 검색
구분 SCI
연도 ~ 키워드

상세정보

학술지 Character Detection in Animated Movies Using Multi-Style Adaptation and Visual Attention
Cited 0 time in scopus Download 5 time Share share facebook twitter linkedin kakaostory
저자
김하연, 이은철, 서용석, 임동혁, 이인권
발행일
202106
출처
IEEE Transactions on Multimedia, v.23, pp.1990-2004
ISSN
1520-9210
출판사
IEEE
DOI
https://dx.doi.org/10.1109/TMM.2020.3006372
협약과제
20IH2400, 음악 및 동영상 모니터링을 위한 지능형 마이크로 식별 기술 개발, 박지현
초록
Automatic identification of fictional characters is one of the primary analysis techniques for video content. A common approach to detect characters in live-action movies involves detecting human faces; however, this approach cannot be used in non-realistic domains, such as animated movies. Detection of characters in animated movies presents two major challenges: the same subject of character can be expressed in various unique styles, and there are no stylistic or other restrictions on the nature and design of character objects. To address these challenges, we introduce the 'animation adaptive region-based convolutional neural network' model to detect characters in animated movies and determine whether the detected characters are human or non-human types. Our model extends the Faster R-CNN model, which is a two-stage object detector, in the following manner: 1) we add a hierarchical animation adaptation module to learn the variety of unique styles from animated movies using a single model; 2) we incorporate a double-detector architecture to focus on the regions that are visually important in determining the character class. We build a new dataset for the animated character detection task. Experiments on this dataset show that our model outperforms other existing representative object detector models in terms of character detection. Furthermore, our model achieves significant performance improvements compared with previous state-of-the-art methods used for the character dictionary generation task. Our model is robust for a variety of animation styles and can find common visual representations of all types of characters, providing an effective way to detect animated characters.
키워드
Image analysis, multi-domain learning, object detection, video content analysis, visual attention
KSP 제안 키워드
Animated characters, CNN model, Character detection, Convolution neural network(CNN), Detection task, Faster r-cnn, Image Analysis, Multi-Domain, Object detection, Region-based, Two-Stage