A quality assurance methodology for ChEBI ontology focusing on uncommonly modeled concepts

Document Type

Conference Proceeding

Publication Date

1-1-2018

Abstract

The Chemical Entities of Biological Interest (ChEBI) ontology is an important knowledge source of chemical entities in a biological context. ChEBI is large and complex, making it almost impossible to be error-free, given the scarce resources for quality assurance (QA). We present a methodology to locate concepts in ChEBI with a high probability of being erroneous. An Abstraction Network, which provides a compact summarization of an ontology, supports the methodology. By investigating a sample of ChEBI concepts, we show that uncommonly modeled concepts residing in small units of the Abstraction Network of ChEBI are statistically significantly more likely to have errors than other concepts. The finding may guide ChEBI ontology curators to focus their limited QA resources on such concepts to achieve a better QA yield. Furthermore, this study, combined with previous work, contributes to progress in showing that this methodology can be applied to a whole family of similar ontologies.

Identifier

85059841943 (Scopus)

Publication Title

Ceur Workshop Proceedings

ISSN

16130073

Volume

2285

Grant

R01CA190779

Fund Ref

National Institutes of Health

This document is currently not available here.

Share

COinS