Registered
APPARATUS AND METHOD FOR CONSTRUCTING LEARNING DATA
- Inventors
-
Changki Lee, Kim Hyeon Jin, Wang Ji Hyun, Lee Chung Hee, Oh Hyo-Jung, Jang Myung Gil, Young Jik Lee
- Application No.
-
11633190 (2006.12.04)
- Publication No.
-
20070143284 (2007.06.21)
- Registration No.
- 7725408 (2010.05.25)
- Country
- UNITED STATES
- Project Code
-
05MF1100, Language Information Processing Technology Development,
Young Jik Lee
- Abstract
- An apparatus and method for efficiently constructing learning data required in statistical methodology used in information retrieval, information extraction, translation, natural language processing, etc. are provided. The method includes the steps of: generating learning models by performing machine learning with respect to learning data; attaching tags to a raw corpus automatically by using the generated learning models to thereby generate learning data candidates; calculating confidence scores of the generated learning data candidates, and then selecting a learning data candidate using the confidence scores; and allowing a user to correct an error in the selected learning data candidate through an interface and adding the error-corrected learning data candidate to the learning data, thereby adding new learning models incrementally.
- KSP Keywords
- Information Extraction(IE), Language Processing, Learning data, Natural Language Processing(NLP), Natural language, Statistical methodology, information retrieval, learning models, machine Learning