ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper A Metric for Predicting the Quality of Ambisonic Spatial Audio Reproduced Using Spatially Interpolated or Extrapolated Room Impulse Responses
Cited 0 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Hualin Ren, Christian Ritz, Jiahong Zhao, Xiguang Zheng, Daeyoung Jang
Issue Date
2025-04
Citation
International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025, pp.1-5
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/ICASSP49660.2025.10889495
Abstract
In virtual reality (VR), sound sources are convolved with room impulse responses (RIRs) to create immersive and dynamic audio experiences. Assessing the quality of spatial audio synthesis in VR is challenging. Subjective listening tests are accurate, but they are time-consuming and costly. This paper introduces a novel objective quality metric to predict the listening quality (LQ) and localization accuracy (LA) of Ambisonic spatial audio at new positions using spatially interpolated or extrapolated first-order Ambisonic (FOA) RIRs based on known nearby FOA RIRs. The LQ and LA scores are computed based on the Kolmogorov-Smirnov test, to measure the similarity between segments of direct sound and reflections of reference and synthesized FOA RIRs. Results show that these scores strongly correlate with subjective test results, proving the reliability of the proposed method. A major advantage is that it predicts spatial audio quality directly from synthesized FOA RIRs, without requiring convolved signals or being affected by specific sound sources, providing a more practical and efficient solution for development.
KSP Keywords
Audio quality, Audio synthesis, Efficient solution, First-order, Kolmogorov-Smirnov test, Localization Accuracy, Sound source, Spatial audio, Subjective test, Virtual Reality, measure the similarity