ETRI-Knowledge Sharing Platform

Normalization of Gene/Protein Names in Biological Literatures using Vector-Space Model
Cited 4 times in Scopus
Authors
Joon-Ho Lim, Hyun Chul Jang, Jae Soo Lim, Soo-Jun Park
Issue Date
2007-08
Citation
International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS) 2007, pp.390-393
Publisher
IEEE
Language
English
Type
Conference Paper
DOI
https://dx.doi.org/10.1109/IEMBS.2007.4352306
Abstract
As the volume of biological literature grows exponentially, the need for text mining systems increases. In text mining, normalization is the task of mapping gene/protein names to identifiers in a database. It is necessary for combining information extracted from multiple documents and for curating a database or an ontology from the literature. Previous normalization research relied on direct comparison between database entries and the names found in the literature, an approach that is weak against the highly variable forms gene/protein names take in text. Therefore, in this paper, we propose a normalization method using the Vector-Space Model. For each gene/protein name, we rank identifiers using the Vector-Space Model and select the identifier most similar to the name. Experimental results show that the proposed method achieves an F-measure of 70.7%. © 2007 IEEE.
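The ranking step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not specify the tokenization, term weighting, or similarity measure, so the sketch assumes simple letter/digit tokens, smoothed TF-IDF weights, and cosine similarity. The lexicon, its `GENE:000x` identifiers, and all function names are hypothetical and exist only for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical toy lexicon: identifier -> known names and synonyms.
# The identifiers are made up and do not refer to any real database.
LEXICON = {
    "GENE:0001": ["tumor protein p53", "TP53", "p53"],
    "GENE:0002": ["breast cancer 1", "BRCA1"],
    "GENE:0003": ["epidermal growth factor receptor", "EGFR", "ERBB1"],
}

def tokenize(text):
    """Lowercase the text and split it into runs of letters or digits."""
    return re.findall(r"[a-z]+|[0-9]+", text.lower())

def weight(counts, idf):
    """Turn raw term counts into TF-IDF weights, dropping unknown terms."""
    return {t: tf * idf[t] for t, tf in counts.items() if t in idf}

def build_index(lexicon):
    """Build one TF-IDF vector per identifier from all of its synonyms."""
    docs = {
        ident: Counter(t for name in names for t in tokenize(name))
        for ident, names in lexicon.items()
    }
    df = Counter(t for counts in docs.values() for t in counts)
    n = len(docs)
    # Smoothed IDF (an assumption; other weighting schemes would also work).
    idf = {t: math.log((1 + n) / (1 + d)) + 1.0 for t, d in df.items()}
    vectors = {ident: weight(counts, idf) for ident, counts in docs.items()}
    return idf, vectors

def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def rank_identifiers(name, idf, vectors):
    """Return (score, identifier) pairs sorted from most to least similar."""
    query = weight(Counter(tokenize(name)), idf)
    scored = [(cosine(query, vec), ident) for ident, vec in vectors.items()]
    return sorted(scored, key=lambda pair: (-pair[0], pair[1]))

def normalize(name, idf, vectors):
    """Map a name to its best-ranked identifier, or None if nothing matches."""
    score, ident = rank_identifiers(name, idf, vectors)[0]
    return ident if score > 0 else None

IDF, VECTORS = build_index(LEXICON)
for mention in ["p-53 tumour protein", "BRCA-1", "EGF receptor"]:
    print(mention, "->", normalize(mention, IDF, VECTORS))
```

Because the comparison is made over weighted token overlap rather than exact string equality, a variant such as "BRCA-1" can still be ranked against the lexicon entry "BRCA1", which is the kind of name variation the abstract says direct comparison handles poorly.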
KSP Keywords
Experimental result, F-measure, Mining area, Mining system, Normalization method, Comparison method, Direct comparison, Space model, Text mining