Preliminary analysis of difficulty of importing pattern-based concepts into the National Cancer Institute Thesaurus
Document Type
Conference Proceeding
Publication Date
1-1-2017
Abstract
Maintenance of biomedical ontologies is difficult. We have developed a pattern-based method for identifying missing concepts in the National Cancer Institute Thesaurus (NCIt). Specifically, we mine patterns that connect NCIt concepts with concepts in other ontologies to identify candidate missing concepts. However, the final decision about inserting a concept always rests with a human ontology curator. In this paper, we estimate the difficulty of this task for a domain expert by counting the possible choices for a pattern-based insertion. We conclude that, even with the support of our mining algorithm, the insertion task is challenging.
Identifier
85021308986 (Scopus)
Publication Title
Studies in Health Technology and Informatics
External Full Text Location
https://doi.org/10.3233/978-1-61499-678-1-389
e-ISSN
1879-8365
ISSN
0926-9630
PubMed ID
27577410
First Page
389
Last Page
393
Volume
228
Grant
R01CA190779
Fund Ref
National Institutes of Health
Recommended Citation
He, Zhe and Geller, James, "Preliminary analysis of difficulty of importing pattern-based concepts into the National Cancer Institute Thesaurus" (2017). Faculty Publications. 9976.
https://digitalcommons.njit.edu/fac_pubs/9976
