ETRI-Knowledge Sharing Plaform

KOREAN
논문 검색
Type SCI
Year ~ Keyword

Detail

Conference Paper Progressive Multi-Stage Neural Audio Coding with Guided References
Cited 9 time in scopus Share share facebook twitter linkedin kakaostory
Authors
Chanwoo Lee, Hyungseob Lim, Jihyun Lee, Inseon Jang, Hong-Goo Kang
Issue Date
2022-05
Citation
International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022, pp.876-880
Publisher
IEEE
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/ICASSP43922.2022.9747527
Abstract
In this paper, we propose an effective multi-stage neural audio coding algorithm that encodes full-band audio signals (up to 20 kHz) using an end-to-end training criterion. By predefining several dyadic subband signals as training targets, we progressively encode input audio signals in each stage such that deeper stages of the network encode the residual error terms from the previous encoding stage. Our proposed audio codec successfully decodes full-band audio signals by using an effective multi-stage vector quantization scheme to represent key encoding features extracted in the latent space. Subjective listening tests show that the decoded outputs of the proposed audio codec achieve almost transparent quality at an average bitrate of 132 kbps.
KSP Keywords
Audio codec, Audio coding, Audio signal, End to End(E2E), Full-band, Latent space, Multi-stage vector quantization, Residual error, coding algorithm, end-to-end training, subjective listening tests