ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

논문 검색
구분 SCI
연도 ~ 키워드

상세정보

학술지 Interpolating the Directional Room Impulse Response for Dynamic Spatial Audio Reproduction
Cited 9 time in scopus Download 78 time Share share facebook twitter linkedin kakaostory
저자
Jiahong Zhao, Xiguang Zheng, Christian Ritz, 장대영
발행일
202202
출처
Applied Sciences, v.12 no.4, pp.1-24
ISSN
2076-3417
출판사
MDPI
DOI
https://dx.doi.org/10.3390/app12042061
협약과제
22ZH1200, 초실감 입체공간 미디어·콘텐츠 원천기술연구, 이태진
초록
Virtual reality (VR) is increasingly important for exploring the real world, which has partially moved to virtual workplaces. In order to create immersive presence in a simulated scene for humans, VR needs to reproduce spatial audio that describes three-dimensional acoustic characteristics in the counterpart physical environment. When the user moves, this reproduction should be dynamically updated, which provides practical challenges because the bandwidth for continuously transmitting audio and video scene data may be limited. This paper proposes an interpolation approach for dynamic spatial audio reproduction using acoustic characteristics of direction and reverberation at limited numbers of positions, which are represented using a first order Ambisonics encoding of the room impulse response (RIR), called the directional RIR (DRIR). We decompose two known DRIRs into reflection components, before interpolating early dominant components for DRIR synthesis and utilizing DRIR recordings for accuracy evaluation. Results indicate that the most accurate interpolation is obtained by the proposed method over two comparative approaches, particularly in a simulated small room where most direction of arrival estimation errors of early components are below five degrees. These findings suggest precise interpolated DRIRs with limited data using the proposed approach, which is vital for dynamic spatial audio reproduction for VR applications.
KSP 제안 키워드
Accuracy evaluation, Acoustic characteristics, Audio and video, Direction of arrival(DoA), Direction of arrival estimation, Limited data, Physical environment, Real-world, Room Impulse Response(RIR), Spatial audio, Three dimensional(3D)
본 저작물은 크리에이티브 커먼즈 저작자 표시 (CC BY) 조건에 따라 이용할 수 있습니다.
저작자 표시 (CC BY)