Learning by examples: Identifying key concepts from text using pre-defined inputs

Document Type

Conference Proceeding

Publication Date

12-1-2005

Abstract

This article describes a keyphrase identification program (KIP) which extracts document key concepts by using sample human keyphrases. KIP considers the composition of a keyphrase. The more keywords a phrase contains and the more significant these keywords are, the more likely this phrase is a keyphrase. KIP first populates its database using manually identified keyphrases and keywords; it then checks the composition of all identified noun phrases, looks up the database and calculates scores for all these noun phrases; the ones having higher scores will be extracted as keyphrases. KIP's learning function can enrich the database by automatically adding new keyphrases to the database. Consequently, the database will grow gradually and the system performance will be improved. The results from our small-scale preliminary experiments show that KIP is effective in extracting document keyphrases and its learning function is useful.

Identifier

60749126296 (Scopus)

ISBN

[9781932415667, 193241567X, 9781932415674]

Publication Title

Proceedings of the 2005 International Conference on Artificial Intelligence Icai 05

First Page

826

Last Page

832

Volume

2

This document is currently not available here.

Share

COinS