ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Journal Article Modeling Long-Term Multimodal Representations for Active Speaker Detection with Spatio-Positional Encoder
Cited 0 time in scopus Download 70 time Share share facebook twitter linkedin kakaostory
Authors
Minyoung Kyoung, Hwa Jeon Song
Issue Date
2023-10
Citation
IEEE Access, v.11, pp.116561-116569
ISSN
2169-3536
Publisher
Institute of Electrical and Electronics Engineers Inc.
Language
English
Type
Journal Article
DOI
https://dx.doi.org/10.1109/ACCESS.2023.3325474
Project Code
23HS3800, Development of Multi-speaker Dialog Modeling and Summarization Technology, Hwa Jeon Song
KSP Keywords
Active speaker detection, Multimodal representation
This work is distributed under the term of Creative Commons License (CCL)
(CC BY)
CC BY