ETRI-Knowledge Sharing Platform

Details

Journal article: Phonetically balanced text corpus design using a similarity measure for a stereo super wideband speech database
Cited 4 times in Scopus
Authors
오유리, 김용국, 김미나, 김홍국, 이미숙, 배현주
Publication date
2011-07
Source
IEICE Transactions on Information and Systems, v.E94.D no.7, pp.1459-1466
ISSN
0916-8532
Publisher
Japan, The Institute of Electronics, Information and Communication Engineers (IEICE)
DOI
https://dx.doi.org/10.1587/transinf.E94.D.1459
Funded project
11PI1500, Research on FMC Acoustic Convergence Codec and Control Technology, 이병선
Abstract
In this paper, we propose a text corpus design method for a Korean stereo super-wideband speech database. Since speech coding generally requires a small-sized text corpus, the corpus should be designed to comply with the pronunciation behavior of natural conversation in order to ensure efficient speech quality tests. To this end, the proposed design method utilizes a similarity measure between the phoneme distribution of natural conversation and that of the designed text corpus. We first collect and refine text data from textbooks and websites. Next, a corpus is designed from the refined text data, using the similarity measure to compare phoneme distributions. We then construct a Korean stereo super-wideband speech (K-SW) database using the designed text corpus, where the recording environment is set to meet the conditions defined by ITU-T. Finally, the subjective quality of the K-SW database is evaluated using an ITU-T super-wideband codec in order to demonstrate that the K-SW database is useful for developing and evaluating super-wideband codecs. Copyright © 2011 The Institute of Electronics, Information and Communication Engineers.
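The corpus selection step described in the abstract could be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' algorithm: the abstract does not say which similarity measure or search strategy was used, so cosine similarity and a greedy one-sentence-at-a-time selection are assumed here, and the function names and toy phoneme sequences are hypothetical.

```python
from collections import Counter
from math import sqrt


def distribution(counts):
    """Normalize phoneme counts into relative frequencies."""
    total = sum(counts.values())
    return {p: c / total for p, c in counts.items()} if total else {}


def cosine_similarity(p, q):
    """Cosine similarity between two sparse distributions.
    Assumption: the record does not name the paper's actual measure."""
    dot = sum(v * q.get(k, 0.0) for k, v in p.items())
    norm_p = sqrt(sum(v * v for v in p.values()))
    norm_q = sqrt(sum(v * v for v in q.values()))
    return dot / (norm_p * norm_q) if norm_p and norm_q else 0.0


def select_corpus(candidates, target, size):
    """Greedily pick up to `size` sentences whose pooled phoneme
    distribution is most similar to `target`.
    Each candidate is a list of phoneme symbols."""
    chosen, pooled = [], Counter()
    remaining = list(range(len(candidates)))
    for _ in range(min(size, len(remaining))):
        # Try adding each remaining sentence and keep the best one.
        best = max(
            remaining,
            key=lambda i: cosine_similarity(
                distribution(pooled + Counter(candidates[i])), target
            ),
        )
        chosen.append(best)
        pooled += Counter(candidates[best])
        remaining.remove(best)
    return chosen, cosine_similarity(distribution(pooled), target)


# Toy data: single letters stand in for phonemes (hypothetical).
candidates = [list("aab"), list("ccc"), list("abc"), list("bbc")]
target = {"a": 0.4, "b": 0.4, "c": 0.2}  # stand-in for conversational speech
chosen, score = select_corpus(candidates, target, 2)
print(chosen, round(score, 3))
```

A greedy search keeps the sketch short. It does not guarantee the best possible subset, and a real design would work on phoneme transcriptions of the refined text rather than on letters.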
KSP suggested keywords
Design method, ITU-T, Information and communication, Small-sized, Speech Database, Speech coding, Subjective quality, Text Corpus, similarity measure, speech quality, super wideband (SWB)