ETRI-Knowledge Sharing Plaform

ENGLISH

성과물

논문 검색
구분 SCI
연도 ~ 키워드

상세정보

학술대회 Extracting Protein-Protein Interactions in Biomedical Literature Using an Existing Syntactic Parser
Cited 3 time in scopus Download 0 time Share share facebook twitter linkedin kakaostory
저자
장현철, 임재수, 임준호, 박수준, 박선희, 이규철
발행일
200604
출처
International Workshop on Knowledge Discovery in Life Science LIterature (KDLL) 2006 (LNCS 3886), v.3886, pp.78-90
DOI
https://dx.doi.org/10.1007/11683568_7
협약과제
06MB1700, 바이오 데이터 마이닝 통합관리 핵심 S/W 컴포넌트 개발, 박선희
초록
We are developing an information extraction system for life science literature, We are currently focusing on PubMed abstracts and trying to extract named entities and their relationships, especially protein names and protein-protein interactions, We are adopting methods including natural language processing, machine learning, and text processing. But we are not developing a new tagging or parsing technique, Developing a new tagger or a new parser specialized in life science literature is a very complex job, And it is not easy to get a good result by tuning an existing parser or by training it without a sufficient corpus. These all are another research topics and we are trying to extract information, not to develop something to help the extracting job or else, In this paper, we introduce our method to use an existing full parser without training or tuning. After tagging sentences and extracting proteins, we make sentences simple by substituting some words like named entities, nouns into one word. Then parsing errors are reduced and parsing precision is increased by this sentence simplification. We parse the simplified sentences syntactically with an existing syntactic parser and extract protein-protein interactions from its results. We show the effects of sentence simplification and syntactic parsing. © Springer-Verlag Berlin Heidelberg 2006.
KSP 제안 키워드
Biomedical literature, Extraction system, Full parser, Natural Language Processing, Research topics, Syntactic parser, Syntactic parsing, Text processing, information extraction, life science, machine Learning