Automatically finding significant topical terms from documents
Document Type
Conference Proceeding
Publication Date
12-1-2005
Abstract
With the pervasion of digital textual data, text mining is becoming more and more important to deriving competitive advantages. One factor for successful text mining applications is the ability of finding significant topical terms for discovering interesting patterns or relationships. Document keyphrases are phrases carrying the most important topical concepts for a given document. In many applications, keyphrases as textual elements are better suited for text mining and could provide more discriminating power than single words. This paper describes an automatic keyphrase identification program (KIP). KIP's algorithm examines the composition of noun phrases and calculates their scores by looking up a domain-specific glossary database; the ones with higher scores are extracted as keyphrases. KIP's learning function can enrich its glossary database by automatically adding new identified keyphrases. KIP's personalization feature allows the user build a glossary database specifically suitable for the area of his/her interest.
Identifier
84869789448 (Scopus)
ISBN
[9781604235531]
Publication Title
Association for Information Systems 11th Americas Conference on Information Systems Amcis 2005 A Conference on A Human Scale
First Page
452
Last Page
459
Volume
1
Recommended Citation
Li, Quanzhi; Wu, Yi Fang Brook; Bot, Razvan Stefan; and Chen, Xin, "Automatically finding significant topical terms from documents" (2005). Faculty Publications. 19330.
https://digitalcommons.njit.edu/fac_pubs/19330
