Registered
Method and apparatus for applying a psychoacoustic model to the loss function for efficient end-to-end neural-network audio coding
- Inventors
-
Minje Kim (김민제), Kai Zhen (카이 젠), Mi Suk Lee (이미숙), Jongmo Sung (성종모), Seung Kwon Beack (백승권), Tae Jin Lee (이태진), Jin Soo Choi (최진수)
- Application No.
-
17156006 (2021.01.22)
- Publication No.
-
20210233547 (2021.07.29)
- Registration No.
- 11790926 (2023.10.17)
- Country of filing
- United States
- Contract project
-
18HR2300, [Integrated Project] Development of AV Coding and LF Media Fundamental Technologies for Ultra-Realistic Tera-Media,
Jin Soo Choi (최진수)
- Abstract
- A method and apparatus for processing an audio signal are disclosed. According to an example embodiment, a method of processing an audio signal may include acquiring a final audio signal for an initial audio signal using a plurality of neural network models generating output audio signals by encoding and decoding input audio signals, calculating a difference between the initial audio signal and the final audio signal in a time domain, converting the initial audio signal and the final audio signal into Mel-spectra, calculating a difference between the Mel-spectra of the initial audio signal and the final audio signal in a frequency domain, training the plurality of neural network models based on results calculated in the time domain and the frequency domain, and generating a new final audio signal distinguished from the final audio signal from the initial audio signal using the trained neural network models.
- KSP suggested keywords
- Audio signal, Encoding and decoding, Network model, frequency domain(FD), neural network, neural network model, time-domain
- Family
-
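
The abstract describes a training objective that combines a time-domain difference with a difference between Mel-spectra. The following is a minimal sketch of such a combined loss for a single frame, not the patented implementation. Everything specific here is an assumption made for illustration: the function names, the use of mean squared error for both terms, the log compression of the Mel-spectra, the Hann window, the `mel_weight` parameter, and the frame and filterbank sizes. The neural network codec itself is omitted; `decoded` simply stands in for its output.

```python
import numpy as np


def hz_to_mel(f):
    # Common HTK-style Hz-to-mel conversion (an assumed choice).
    return 2595.0 * np.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_filterbank(n_mels, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale, shape (n_mels, n_fft // 2 + 1)."""
    n_bins = n_fft // 2 + 1
    fft_freqs = np.linspace(0.0, sample_rate / 2.0, n_bins)
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_mels + 2)
    hz_pts = mel_to_hz(mel_pts)
    fb = np.zeros((n_mels, n_bins))
    for i in range(n_mels):
        lo, ctr, hi = hz_pts[i], hz_pts[i + 1], hz_pts[i + 2]
        rising = (fft_freqs - lo) / (ctr - lo)
        falling = (hi - fft_freqs) / (hi - ctr)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb


def mel_spectrum(frame, fb):
    """Mel-weighted power spectrum of one Hann-windowed frame."""
    power = np.abs(np.fft.rfft(frame * np.hanning(len(frame)))) ** 2
    return fb @ power


def combined_loss(original, decoded, fb, mel_weight=1.0, eps=1e-8):
    """Time-domain MSE plus weighted MSE between log Mel-spectra (assumed forms)."""
    time_term = np.mean((original - decoded) ** 2)
    mel_orig = np.log(mel_spectrum(original, fb) + eps)
    mel_dec = np.log(mel_spectrum(decoded, fb) + eps)
    mel_term = np.mean((mel_orig - mel_dec) ** 2)
    return time_term + mel_weight * mel_term


# Usage: compare a 512-sample frame with a slightly distorted copy of itself.
rng = np.random.default_rng(0)
frame = rng.standard_normal(512)
fb = mel_filterbank(n_mels=40, n_fft=512, sample_rate=16000)
loss = combined_loss(frame, frame + 0.05 * rng.standard_normal(512), fb)
```

In an actual training setup the two terms would be computed with a differentiable framework so that gradients can flow back into the encoder and decoder networks; NumPy is used here only to keep the arithmetic easy to follow.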