서브워드 임베딩 기반 스킵서트 문장 임베딩 기술
정의석, 정호영, 김현우, 송화전, 박전규, 이윤근, 오유리, 강병옥
- 11423238 (2022.08.23)
18ZS1100, 자율성장형 AI 핵심원천기술 연구,
- Provided are sentence embedding method and apparatus based on subword embedding and skip-thoughts. To integrate skip-thought sentence embedding learning methodology with a subword embedding technique, a skip-thought sentence embedding learning method based on subword embedding and methodology for simultaneously learning subword embedding learning and skip-thought sentence embedding learning, that is, multitask learning methodology, are provided as methodology for applying intra-sentence contextual information to subword embedding in the case of subword embedding learning. This makes it possible to apply a sentence embedding approach to agglutinative languages such as Korean in a bag-of-words form. Also, skip-thought sentence embedding learning methodology is integrated with a subword embedding technique such that intra-sentence contextual information can be used in the case of subword embedding learning. A proposed model minimizes additional training parameters based on sentence embedding such that most training results may be accumulated in a subword embedding parameter.
- KSP 제안 키워드
- Agglutinative languages, Bag-of-words, Contextual information, Embedding Technique, Learning methods, Proposed model, Training results, embedding learning, embedding method, learning methodologies, multi-task learning