Registered
APPARATUS AND METHOD FOR CONSTRUCTING LEARNING DATA
- Inventors
- Changki Lee, Kim Hyeon Jin, Lee Chung Hee, Wang Ji Hyun, Oh Hyo-Jung, Jang Myung Gil, Young Jik Lee
- Application No.
- 11633190 (2006.12.04)
- Publication No.
- 20070143284 (2007.06.21)
- Registration No.
- 7725408 (2010.05.25)
- Country
- UNITED STATES
- Project Code
- 05MF1100, Language Information Processing Technology Development, Young Jik Lee
- Abstract
- Provided are an apparatus and method for efficiently constructing the learning data required by statistical methods used in information retrieval, information extraction, translation, natural language processing, and similar fields. The method includes the steps of: generating learning models by performing machine learning on existing learning data; automatically attaching tags to a raw corpus using the generated learning models, thereby generating learning data candidates; calculating confidence scores for the generated candidates and selecting a candidate based on those scores; and allowing a user to correct errors in the selected candidate through an interface and adding the corrected candidate to the learning data, so that new learning models are added incrementally.
- KSP Keywords
- Information retrieval (IR), Language processing, Learning data, Learning model, Natural language processing, Statistical methodology, Information extraction, Machine learning, Natural language
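
The loop described in the abstract (train, auto-tag, score, select, correct, add) can be illustrated with a minimal Python sketch. It is not the patented implementation: the per-word frequency tagger, the minimum-over-words confidence score, the choice to select the lowest-confidence candidates, and every name here (`train`, `tag_with_confidence`, `construct_learning_data`, the `correct` callback standing in for the user interface) are assumptions made only for illustration.

```python
from collections import Counter, defaultdict


def train(learning_data):
    """Toy 'learning model' (assumption): per-word tag counts."""
    counts = defaultdict(Counter)
    for sentence in learning_data:
        for word, tag in sentence:
            counts[word][tag] += 1
    return counts


def tag_with_confidence(model, words, default_tag="O"):
    """Tag a raw sentence and return (candidate, confidence score).

    Confidence is the lowest per-word relative tag frequency; an
    unseen word scores 0.0. This scoring rule is an assumption.
    """
    tagged, scores = [], []
    for word in words:
        tag_counts = model.get(word)
        if tag_counts:
            tag, freq = tag_counts.most_common(1)[0]
            scores.append(freq / sum(tag_counts.values()))
        else:
            tag = default_tag
            scores.append(0.0)
        tagged.append((word, tag))
    return tagged, (min(scores) if scores else 0.0)


def construct_learning_data(learning_data, raw_corpus, correct,
                            rounds=1, batch=1):
    """Grow the learning data incrementally.

    `correct` is a hypothetical stand-in for the user interface: it
    receives a tagged candidate and returns the corrected version.
    """
    learning_data = list(learning_data)
    raw_corpus = list(raw_corpus)
    for _ in range(rounds):
        if not raw_corpus:
            break
        model = train(learning_data)
        candidates = [tag_with_confidence(model, w) for w in raw_corpus]
        # Assumption: hand the least confident candidates to the user.
        order = sorted(range(len(candidates)),
                       key=lambda i: candidates[i][1])
        chosen = set(order[:batch])
        for i in order[:batch]:
            learning_data.append(correct(candidates[i][0]))
        raw_corpus = [w for i, w in enumerate(raw_corpus)
                      if i not in chosen]
    # Retrain on the enlarged learning data.
    return learning_data, train(learning_data)
```

In this sketch each round retrains on the enlarged learning data, so user corrections feed into the next round of automatic tagging. Selecting high-confidence candidates instead would only require reversing the sort order.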