Multiple testing for pattern identification, with applications to microarray time-course experiments
Document Type
Article
Publication Date
3-1-2011
Abstract
In time-course experiments, it is often desirable to identify genes that exhibit a specific pattern of differential expression over time and thus gain insights into the mechanisms of the underlying biological processes. Two challenging issues in the pattern identification problem are: (i) how to combine the simultaneous inferences across multiple time points and (ii) how to control the multiplicity while accounting for the strong dependence. We formulate a compound decision-theoretic framework for set-wise multiple testing and propose a data-driven procedure that aims to minimize the missed set rate subject to a constraint on the false set rate. The hidden Markov model proposed in Yuan and Kendziorski (2006) is generalized to capture the temporal correlation in the gene expression data. Both theoretical and numerical results are presented to show that our data-driven procedure controls the multiplicity, provides an optimal way of combining simultaneous inferences across multiple time points, and greatly improves the conventional combined p-value methods. In particular, we demonstrate our method in an application to a study of systemic inflammation in humans for detecting early and late response genes. © 2011 American Statistical Association.
Identifier
79954483579 (Scopus)
Publication Title
Journal of the American Statistical Association
External Full Text Location
https://doi.org/10.1198/jasa.2011.ap09587
ISSN
01621459
First Page
73
Last Page
88
Issue
493
Volume
106
Grant
DMS-10-07675
Fund Ref
National Science Foundation
Recommended Citation
Sun, Wenguang and Wei, Zhi, "Multiple testing for pattern identification, with applications to microarray time-course experiments" (2011). Faculty Publications. 11438.
https://digitalcommons.njit.edu/fac_pubs/11438
