Auditing SNOMED Integration into the UMLS for Duplicate Concepts
Document Type
Article
Publication Date
1-1-2010
Abstract
The UMLS contains terms from many sources. Every update of a source requires reintegration. Each new term needs to be assigned to a preexisting UMLS concept, or a new concept must be created. Whenever the integration process unnecessarily creates a new concept, this is undesirable. We report on a method to detect such undesirable duplicate concepts. Terms are removed from the UMLS and reintegrated using "piecewise synonym generation." The concept of the reintegrated term is programmatically compared to the initial concept of the term (before removal). If they are different, this indicates an error, either in the integration process or in the initial concept. Thus, such a term-concept pair is deemed suspicious. A study of five hierarchies of the SNOMED found 7.7% suspicious matches. A human expert needs to evaluate the correctness of suspicious concepts. In a sample of 149 of those, 19% of concepts were found to be duplicates.
Identifier
84964994928 (Scopus)
Publication Title
AMIA Annual Symposium Proceedings AMIA Symposium AMIA Symposium
e-ISSN
1942597X
PubMed ID
21346993
First Page
321
Last Page
325
Volume
2010
Recommended Citation
Huang, Kuo Chuan; Geller, James; Elhanan, Gai; Perl, Yehoshua; and Halper, Michael, "Auditing SNOMED Integration into the UMLS for Duplicate Concepts" (2010). Faculty Publications. 6435.
https://digitalcommons.njit.edu/fac_pubs/6435
