Extraction of key-phrases from biomedical full-text with supervised learning techniques
Document Type
Conference Proceeding
Publication Date
12-1-2009
Abstract
Key-phrase extraction plays useful a role in the research area of Information Systems (IS) such as digital libraries. Short metadata like key phrases could be beneficial for searchers to understand the concepts of documents' concept. This paper evaluates the effectiveness of different supervised learning techniques on biomedical full-text: Naïve Bayes, linear regression, SVMs (reg1/2), all of which could be embedded inside an IS for document search. We use these techniques to extract key phrases from PubMed. We evaluate the performance of these systems using the well-established holdout validation method. The contributions of the paper are comparison among different classifier techniques, and a comparison of performance differences between full-text and abstract. We conducted experiments and found that SVMreg-1 improves the performance of key-phrase extraction from full-text while Naïve Bayes improves from the abstracts. These techniques should be considered for use in information system search functionality. Additional research issues also are identified. © (2009) by the AIS/ICIS Administrative Office All rights reserved.
Identifier
79954623508 (Scopus)
ISBN
[9781615675814]
Publication Title
15th Americas Conference on Information Systems 2009 Amcis 2009
First Page
2992
Last Page
3000
Volume
5
Recommended Citation
Qi, Yanliang; Yagci, Artun I.; and Song, Min, "Extraction of key-phrases from biomedical full-text with supervised learning techniques" (2009). Faculty Publications. 11691.
https://digitalcommons.njit.edu/fac_pubs/11691
