KIP: A keyphrase identification program with learning functions
Document Type
Conference Proceeding
Publication Date
1-1-2004
Abstract
In this paper, we report on a keyphrase identification program (KIP) that uses sample human-identified keyphrases to learn to identify new keyphrases. KIP first populates its database with manually identified keyphrases; each keyphrase is pre-processed and assigned an initial weight. It then extracts noun phrases from documents. Each noun phrase is assigned a score based on the weights of the words it contains, and those with a score higher than the threshold are selected as keyphrases. Newly learned keyphrases are inserted into the database and the weights are updated, which triggers a new keyphrase identification iteration. The process stops when no new keyphrases were identified during the previous iteration. In the evaluation, the base KIP system achieved an average recall of 0.7 and precision of 0.44. The augmented KIP with learning functions produced new keyphrases that the base system had not identified.
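The iterative loop described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how weights are initialised or combined, so the scoring rule here (the mean of per-word weights), the weight update (adding a fixed amount per occurrence in a known keyphrase), the parameter values, and the function name `learn_keyphrases` are all assumptions made for illustration. Noun phrase extraction is also assumed to have been done already, so the function takes candidate phrases as plain strings.

```python
def learn_keyphrases(seed_keyphrases, noun_phrases, threshold=0.4, initial_weight=1.0):
    """Return newly learned keyphrases in the order they were identified.

    Hypothetical sketch: scoring and weight updates are assumptions,
    not the method described in the paper.
    """
    weights = {}   # word -> accumulated weight (assumed additive)
    known = set()  # keyphrases currently in the "database"

    def add_to_database(phrase):
        # Store the phrase and raise the weight of each word it contains.
        known.add(phrase)
        for word in phrase.split():
            weights[word] = weights.get(word, 0.0) + initial_weight

    # Populate the database with the manually identified keyphrases.
    for phrase in seed_keyphrases:
        add_to_database(phrase.lower())

    learned = []
    while True:
        # One identification iteration: score every candidate not yet known.
        new_phrases = []
        for candidate in noun_phrases:
            phrase = candidate.lower()
            words = phrase.split()
            if not words or phrase in known or phrase in new_phrases:
                continue
            # Assumed score: mean weight of the words in the phrase.
            score = sum(weights.get(w, 0.0) for w in words) / len(words)
            if score > threshold:
                new_phrases.append(phrase)

        # Stop when an iteration identifies no new keyphrases.
        if not new_phrases:
            break

        # Insert the learned keyphrases and update the weights,
        # which triggers the next iteration.
        for phrase in new_phrases:
            add_to_database(phrase)
            learned.append(phrase)

    return learned


seeds = ["information retrieval", "digital library"]
candidates = [
    "information system",
    "digital information retrieval",
    "system design",
    "library system",
    "coffee break",
]
print(learn_keyphrases(seeds, candidates))
```

With these toy inputs, "system design" scores zero in the first iteration and is selected only in the second, after "system" has gained weight from the phrases learned in the first pass. This mirrors the abstract's claim that the learning step can surface keyphrases the base pass misses. The loop terminates because each iteration must add at least one previously unknown phrase from a finite candidate list.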
Identifier
3042555842 (Scopus)
ISBN
[0769521088, 9780769521084]
Publication Title
International Conference on Information Technology: Coding and Computing (ITCC)
External Full Text Location
https://doi.org/10.1109/itcc.2004.1286694
First Page
450
Last Page
454
Volume
2
Recommended Citation
Wu, Yi Fang Brook; Li, Quanzhi; Bot, Razvan Stefan; and Chen, Xin, "KIP: A keyphrase identification program with learning functions" (2004). Faculty Publications. 20469.
https://digitalcommons.njit.edu/fac_pubs/20469
