ETRI-Knowledge Sharing Platform

Customizing an English-Korean Machine Translation System for Patent/Technical Documents Translation
Authors
Oh Woog Kwon, Sung Kwon Choi, Ki Young Lee, Yoon Hyung Roh, Young Gil Kim
Issue Date
2009-12
Citation
Pacific Asia Conference on Language, Information and Computation (PACLIC) 2009, pp.718-725
Language
English
Type
Conference Paper
Abstract
This paper presents a method for customizing an English-Korean machine translation system from the general domain to the patent or technical document domain. The customization method comprises the following: (1) adapting the probabilities of a POS tagger trained on the general domain to the specific domain, (2) syntactically analyzing long and complex sentences by recognizing coordinate structures, and (3) selecting a proper target word using a domain-specific bilingual dictionary and collocation knowledge extracted from a patent or technical document corpus. The translation accuracy of the customized English-Korean patent translation system averages 82.43% across 5 patent categories, as evaluated by 7 professional patent translators. The translation accuracy of the customized English-Korean technical document translation system is 81.10%, and its BLEU score is 0.5185 on an evaluation test set for which the average BLEU score of cross-evaluation between references is 0.6615.
KSP Keywords
Domain-specific, Machine Translation (MT), POS tagger, Test set, Translation accuracy, Translation system, Bilingual dictionary, BLEU score, Document translation, Technical documents