ETRI-Knowledge Sharing Platform

Semi-Automatic Filtering of Translation Errors in Triangle Corpus
Authors
Sung-Kwon Choi, Jong-Hun Shin, Young-Gil Kim
Issue Date
2015-10
Citation
Pacific Asia Conference on Language, Information and Computation (PACLIC) 2015, pp.72-79
Language
English
Type
Conference Paper
Abstract
We are developing a multilingual machine translation system to provide foreign tourists with a multilingual speech translation service at the Winter Olympic Games to be held in Korea in 2018. Knowledge learning that makes multilingual expansion possible requires large bilingual corpora. In Korea there were many Korean-English bilingual corpora, but Korean-French and Korean-Spanish bilingual corpora were severely lacking. Korean-English-French and Korean-English-Spanish triangle corpora were therefore constructed by crowdsourced translation from the existing large Korean-English corpus. However, we found many translation errors in the triangle corpora. This paper aims to filter translation errors from large triangle corpora constructed by crowdsourced translation, in order to reduce the translation loss of a triangle corpus that uses English as the pivot language. Experiments show that our method improves on the baseline system by 0.34 BLEU points.
KSP Keywords
Baseline system, Knowledge learning, Machine Translation (MT), Multilingual speech, Semi-Automatic, Speech translation, Translation errors, Translation system, Winter Olympic Games, Bilingual corpus, Pivot language