ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

특허 검색
구분 출원국
출원년도 ~ 키워드

상세정보

등록 서브워드 임베딩 기반 스킵서트 문장 임베딩 기술

발명자
정의석, 정호영, 김현우, 송화전, 박전규, 이윤근, 오유리, 강병옥
출원번호
16671773 (2019.11.01)
공개번호
20200175119 (2020.06.04)
등록번호
11423238 (2022.08.23)
출원국
미국
협약과제
18ZS1100, 자율성장형 AI 핵심원천기술 연구, 이윤근
초록
Provided are sentence embedding method and apparatus based on subword embedding and skip-thoughts. To integrate skip-thought sentence embedding learning methodology with a subword embedding technique, a skip-thought sentence embedding learning method based on subword embedding and methodology for simultaneously learning subword embedding learning and skip-thought sentence embedding learning, that is, multitask learning methodology, are provided as methodology for applying intra-sentence contextual information to subword embedding in the case of subword embedding learning. This makes it possible to apply a sentence embedding approach to agglutinative languages such as Korean in a bag-of-words form. Also, skip-thought sentence embedding learning methodology is integrated with a subword embedding technique such that intra-sentence contextual information can be used in the case of subword embedding learning. A proposed model minimizes additional training parameters based on sentence embedding such that most training results may be accumulated in a subword embedding parameter.
KSP 제안 키워드
Agglutinative languages, Bag-of-words, Contextual information, Embedding Technique, Learning methods, Proposed model, Training results, embedding learning, embedding method, learning methodologies, multi-task learning