Document clustering by semantic smoothing and Dynamic Growing Cell Structure (DynGCS) for biomedical literature
Document Type
Conference Proceeding
Publication Date
10-6-2008
Abstract
The general goal of clustering is to group data elements such that the intra-group similarities are high and the inter-group similarities are low. In this paper, we propose a novel hybrid clustering technique that incorporates semantic smoothing of document models into a neural network framework. Recently it has been reported that the semantic smoothing model enhances the retrieval quality in Information Retrieval (IR). Inspired by that, we apply the context-sensitive semantic smoothing model to boost accuracy of clustering that is generated by a dynamic growing cell structure algorithm, a variation of the neural network technique. We evaluated the proposed technique on article sets from MEDLINE, the largest biomedical digital library in Biomedicine. Our experimental evaluations show that the proposed algorithm significantly improves the clustering quality over the traditional clustering techniques. © 2008 Springer-Verlag Berlin Heidelberg.
Identifier
52949098384 (Scopus)
ISBN
[3540858350, 9783540858355]
Publication Title
Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics
External Full Text Location
https://doi.org/10.1007/978-3-540-85836-2_21
e-ISSN
16113349
ISSN
03029743
First Page
217
Last Page
226
Volume
5182 LNCS
Recommended Citation
Song, Min; Hu, Xiaohua; Yoo, Illhoi; and Koppel, Eric, "Document clustering by semantic smoothing and Dynamic Growing Cell Structure (DynGCS) for biomedical literature" (2008). Faculty Publications. 12630.
https://digitalcommons.njit.edu/fac_pubs/12630
