ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

논문 검색
구분 SCI
연도 ~ 키워드

상세정보

학술지 Highly Efficient Audio Coding with Blind Spectral Recovery Based on Machine Learning
Cited 2 time in scopus Download 4 time Share share facebook twitter linkedin kakaostory
저자
김재원, 백승권, 임우택, 박호종
발행일
202205
출처
IEEE Signal Processing Letters, v.29, pp.1212-1216
ISSN
1070-9908
출판사
IEEE
DOI
https://dx.doi.org/10.1109/LSP.2022.3172853
협약과제
21HH7300, [통합과제] 초실감 테라미디어를 위한 AV부호화 및 LF미디어 원천기술 개발, 최진수
초록
This letter proposes a new method for audio coding that utilizes blind spectral recovery to improve the coding efficiency without compromising performance. The proposed method transmits only a fraction of the spectral coefficients, thereby reducing the coding bit rate. Then, it recovers the remaining coefficients in the decoder using the transmitted coefficients as input. The proposed method is differentiated from conventional spectral recovery in that the coefficients to be recovered are interleaved with the transmitted coefficients to obtain the most data correlation. Further, it enhances the transmitted coefficients, which are degraded by quantization errors, to deliver better information to the recovery process. The spectral recovery is conducted recursively on a band basis such that information recovered in one band is used for the recovery in subsequent bands. An improved level correction for the recovered coefficients and a new sign coding are also developed. A subjective performance evaluation confirms that the proposed method at 40 kbps provides statistically equivalent sound quality to a state-of-the-art coding method at 48 kbps for speech and music categories.
KSP 제안 키워드
Audio coding, Bit Rate, Coding efficiency, Coding method, Performance evaluation, Quantization error, Recovery process, data correlation, highly efficient, machine Learning, new method