Faculty Publications

Discovering additional complex NCIt gene concepts with high error rate

Ling Zheng, Department of Computer Science
Hua Min, George Mason University
Yehoshua Perl, Department of Computer Science
James Geller, Department of Computer Science

Document Type

Conference Proceeding

Publication Date

12-15-2017

Abstract

The Gene hierarchy of the National Cancer Institute (NCI) Thesaurus (NCIt) is of high priority for NCI. It is important to have quality assurance (QA) techniques to improve its content quality. We present a two-step methodology concentrating on auditing the modeling of complex concepts, which are shown to have a higher error rate compared to control concepts. In the first step, we test whether concepts that appear complex in a so called 'partial-area taxonomy' have a higher error rate than control concepts. In the second step, we introduce an innovative technique based on a 'partial-area sub-taxonomy' (constructed with a subset of roles) to discover additional complex concepts. The results of the QA study show that these concepts are indeed statistically significantly more likely to have more errors than control concepts. This makes it easier for NCI staff to improve the modeling quality of gene concepts in NCIt.

Identifier

85045984627 (Scopus)

ISBN

[9781509030491]

Publication Title

Proceedings 2017 IEEE International Conference on Bioinformatics and Biomedicine Bibm 2017

External Full Text Location

https://doi.org/10.1109/BIBM.2017.8217731

First Page

653

Last Page

657

Volume

2017-January

Grant

R01CA190779

Fund Ref

National Institutes of Health

Recommended Citation

Zheng, Ling; Min, Hua; Perl, Yehoshua; and Geller, James, "Discovering additional complex NCIt gene concepts with high error rate" (2017). Faculty Publications. 9123.
https://digitalcommons.njit.edu/fac_pubs/9123

This document is currently not available here.

COinS

DOI

10.1109/BIBM.2017.8217731

Faculty Publications

Discovering additional complex NCIt gene concepts with high error rate

Document Type

Publication Date

Abstract

Identifier

ISBN

Publication Title

External Full Text Location

First Page

Last Page

Volume

Grant

Fund Ref

Recommended Citation

DOI

Search

Browse

Author Corner

Links

Faculty Publications

Discovering additional complex NCIt gene concepts with high error rate

Authors

Document Type

Publication Date

Abstract

Identifier

ISBN

Publication Title

External Full Text Location

First Page

Last Page

Volume

Grant

Fund Ref

Recommended Citation

Share

DOI

Search

Browse

Author Corner

Links