A quality assurance methodology for ChEBI ontology focusing on uncommonly modeled concepts
Document Type
Conference Proceeding
Publication Date
1-1-2018
Abstract
The Chemical Entities of Biological Interest (ChEBI) ontology is an important knowledge source of chemical entities in a biological context. ChEBI is large and complex, making it almost impossible to be error-free, given the scarce resources for quality assurance (QA). We present a methodology to locate concepts in ChEBI with a high probability of being erroneous. An Abstraction Network, which provides a compact summarization of an ontology, supports the methodology. By investigating a sample of ChEBI concepts, we show that uncommonly modeled concepts residing in small units of the Abstraction Network of ChEBI are statistically significantly more likely to have errors than other concepts. The finding may guide ChEBI ontology curators to focus their limited QA resources on such concepts to achieve a better QA yield. Furthermore, this study, combined with previous work, contributes to progress in showing that this methodology can be applied to a whole family of similar ontologies.
Identifier
85059841943 (Scopus)
Publication Title
Ceur Workshop Proceedings
ISSN
16130073
Volume
2285
Grant
R01CA190779
Fund Ref
National Institutes of Health
Recommended Citation
Liu, Hao; Chen, Ling; Zheng, Ling; Perl, Yehoshua; and Geller, James, "A quality assurance methodology for ChEBI ontology focusing on uncommonly modeled concepts" (2018). Faculty Publications. 9001.
https://digitalcommons.njit.edu/fac_pubs/9001
