ETRI-Knowledge Sharing Platform


Details

Registered: Apparatus and Method for Semi-Automatic Construction of Learning Data

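Inventors
이창기, 김현진, 오효정, 왕지현, 이충희, 장명길, 이영직
Application No.
11633190 (2006.12.04)
Publication No.
20070143284 (2007.06.21)
Registration No.
7725408 (2010.05.25)
Country of Application
United States
Contracted Project
05MF1100, Language Information Processing Technology Development, 이영직
Abstract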
An apparatus and method for efficiently constructing learning data required in statistical methodology used in information retrieval, information extraction, translation, natural language processing, etc. are provided. The method includes the steps of: generating learning models by performing machine learning with respect to learning data; attaching tags to a raw corpus automatically by using the generated learning models to thereby generate learning data candidates; calculating confidence scores of the generated learning data candidates, and then selecting a learning data candidate using the confidence scores; and allowing a user to correct an error in the selected learning data candidate through an interface and adding the error-corrected learning data candidate to the learning data, thereby adding new learning models incrementally.
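The loop described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the per-token tag-frequency model, the minimum-over-tokens confidence score, the choice to show the user the least confident candidates, and the names `train`, `auto_tag`, `build_round`, and `correct` are all assumptions made for illustration. The abstract does not specify the learning algorithm, the scoring formula, or the selection rule.

```python
from collections import Counter, defaultdict


def train(learning_data):
    """Toy learning model: count how often each token carries each tag."""
    model = defaultdict(Counter)
    for tokens, tags in learning_data:
        for token, tag in zip(tokens, tags):
            model[token][tag] += 1
    return model


def auto_tag(model, tokens, default_tag="O"):
    """Tag one raw sentence and return (tags, confidence score)."""
    tags, scores = [], []
    for token in tokens:
        counts = model.get(token)
        if counts:
            tag, n = counts.most_common(1)[0]
            tags.append(tag)
            scores.append(n / sum(counts.values()))
        else:
            # Unseen token: fall back to the default tag with zero confidence.
            tags.append(default_tag)
            scores.append(0.0)
    # Assumption: a sentence is only as reliable as its weakest token.
    return tags, (min(scores) if scores else 0.0)


def build_round(learning_data, raw_corpus, correct, k=1):
    """One round: train, auto-tag, score, select, correct, add, retrain."""
    model = train(learning_data)
    candidates = []
    for tokens in raw_corpus:
        tags, confidence = auto_tag(model, tokens)
        candidates.append((confidence, tokens, tags))
    # Assumption: the k least confident candidates are shown to the user.
    selected = sorted(candidates, key=lambda c: c[0])[:k]
    # `correct` stands in for the user interface: it returns the fixed tags.
    corrected = [(tokens, correct(tokens, tags)) for _, tokens, tags in selected]
    new_learning_data = list(learning_data) + corrected
    return new_learning_data, train(new_learning_data)
```

Here `correct` is a hypothetical callback that takes the tokens and the automatically assigned tags and returns the corrected tags, playing the role of the user working through the interface. Calling `build_round` repeatedly, each time passing in the learning data returned by the previous call, grows the learning data and retrains the model incrementally.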
KSP Suggested Keywords
Information retrieval (IR), Language processing, Learning data, Learning model, Natural language processing, Statistical methodology, Information extraction, Machine learning, Natural language