ETRI-Knowledge Sharing Platform



Modified unrestricted polar quantization with the psychoacoustic parameter for audio coding
Cited 2 times in Scopus
Authors
Byeongho Jo, Seungkwon Beack, Taejin Lee
Issue Date
2022-10
Citation
International Congress on Acoustics (ICA) 2022, pp.1-8
Language
English
Type
Conference Paper
Abstract
In audio coding, frameworks based on the modified discrete cosine transform (MDCT) dominate both the literature and audio coding standards. The MDCT allows perfect reconstruction without blocking effects and, despite its 50% overlapped window, does not increase the data rate. However, when temporal noise shaping is used to code transient signals, undesired aliasing may occur, and the MDCT window must be designed under the time-domain aliasing cancellation constraint. These limitations can be avoided by using the discrete Fourier transform (DFT), but the complex-valued representation of the DFT coefficients doubles the data rate. In this study, we propose a scheme that quantizes the complex-valued signal effectively at a data rate comparable to that of a real-valued coding scheme. The proposed coding scheme is based on unrestricted polar quantization for complex variables combined with a psychoacoustic model. The study demonstrates the feasibility of quantizing complex-valued coefficients at the same data rate as conventional MDCT-based audio coding, with comparable perceptual distortion.
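As a rough illustration of the idea named in the abstract, the sketch below shows a generic unrestricted polar quantizer for one complex coefficient: the magnitude is quantized uniformly, and the number of phase levels depends on the magnitude index, so outer rings receive finer phase resolution. This is not the authors' modified scheme and it includes no psychoacoustic model. The function names (`phase_levels`, `upq_quantize`, `upq_dequantize`), the `step` parameter, and the rule for choosing the number of phase levels are all assumptions made for this example.

```python
import cmath
import math


def phase_levels(m):
    """Hypothetical rule: number of phase cells on magnitude ring m.

    The ring is reconstructed at radius (m + 0.5) * step, so choosing about
    2*pi*(m + 0.5) cells keeps each cell's arc length close to one step.
    """
    return max(1, round(2 * math.pi * (m + 0.5)))


def upq_quantize(z, step=0.5):
    """Map a complex value to (magnitude index, phase index)."""
    r = abs(z)
    theta = cmath.phase(z) % (2 * math.pi)  # wrap phase into [0, 2*pi]
    m = int(r / step)                       # ring m covers [m*step, (m+1)*step)
    n = phase_levels(m)                     # phase resolution depends on m
    k = round(theta / (2 * math.pi) * n) % n
    return m, k


def upq_dequantize(m, k, step=0.5):
    """Reconstruct a complex value from its two indices."""
    r_hat = (m + 0.5) * step                # midpoint of the magnitude cell
    theta_hat = 2 * math.pi * k / phase_levels(m)
    return cmath.rect(r_hat, theta_hat)


if __name__ == "__main__":
    z = 3 + 4j
    m, k = upq_quantize(z)
    z_hat = upq_dequantize(m, k)
    print(m, k, z_hat, abs(z - z_hat))
```

Because the phase index is coded conditionally on the magnitude index, small coefficients cost few phase levels and large ones cost more. A real codec would additionally set `step` per band from a perceptual model and entropy-code the two indices, neither of which is shown here.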
KSP Keywords
Audio coding, Coding standard, Complex-valued, DFT coefficients, Modified Discrete Cosine Transform (MDCT), Perfect Reconstruction, Psychoacoustic Model, Real-valued coding, Temporal noise shaping, Unrestricted polar quantization, Blocking effect