ETRI-Knowledge Sharing Plaform

KOREAN
특허 검색
Status Country
Year ~ Keyword

Detail

Registered APPARATUS AND METHOD FOR CONSTRUCTING LEARNING DATA

학습 데이터 반자동 구축 장치 및 그 방법
이미지 확대
Inventors
Changki Lee, Kim Hyeon Jin, Lee Chung Hee, Wang Ji Hyun, Oh Hyo-Jung, Jang Myung Gil, Young Jik Lee
Application No.
11633190 (2006.12.04)
Publication No.
20070143284 (2007.06.21)
Registration No.
7725408 (2010.05.25)
Country
UNITED STATES
Project Code
05MF1100, Language Information Processing Technology Development, Young Jik Lee
Abstract
An apparatus and method for efficiently constructing learning data required in statistical methodology used in information retrieval, information extraction, translation, natural language processing, etc. are provided. The method includes the steps of: generating learning models by performing machine learning with respect to learning data; attaching tags to a raw corpus automatically by using the generated learning models to thereby generate learning data candidates; calculating confidence scores of the generated learning data candidates, and then selecting a learning data candidate using the confidence scores; and allowing a user to correct an error in the selected learning data candidate through an interface and adding the error-corrected learning data candidate to the learning data, thereby adding new learning models incrementally.
KSP Keywords
Information retrieval(IR), Language processing, Learning data, Learning model, Natural Language Processing, Statistical methodology, information extraction, machine Learning, natural language