Clustering single-cell RNA-seq data with a model-based deep learning approach
Document Type
Article
Publication Date
4-1-2019
Abstract
Single-cell RNA sequencing (scRNA-seq) promises to provide higher resolution of cellular differences than bulk RNA sequencing. Clustering transcriptomes profiled by scRNA-seq has been routinely conducted to reveal cell heterogeneity and diversity. However, clustering analysis of scRNA-seq data remains a statistical and computational challenge, due to the pervasive dropout events obscuring the data matrix with prevailing ‘false’ zero count observations. Here, we have developed scDeepCluster, a single-cell model-based deep embedded clustering method, which simultaneously learns feature representation and clustering via explicit modelling of scRNA-seq data generation. Based on testing extensive simulated data and real datasets from four representative single-cell sequencing platforms, scDeepCluster outperformed state-of-the-art methods under various clustering performance metrics and exhibited improved scalability, with running time increasing linearly with sample size. Its accuracy and efficiency make scDeepCluster a promising algorithm for clustering large-scale scRNA-seq data.
Identifier
85073831292 (Scopus)
Publication Title
Nature Machine Intelligence
External Full Text Location
https://doi.org/10.1038/s42256-019-0037-0
e-ISSN
25225839
First Page
191
Last Page
198
Issue
4
Volume
1
Recommended Citation
Tian, Tian; Wan, Ji; Song, Qi; and Wei, Zhi, "Clustering single-cell RNA-seq data with a model-based deep learning approach" (2019). Faculty Publications. 7690.
https://digitalcommons.njit.edu/fac_pubs/7690
