Document Type
Thesis
Date of Award
5-31-2020
Degree Name
Master of Science in Computer Science - (M.S.)
Department
Computer Science
First Advisor
Zhi Wei
Second Advisor
Usman W. Roshan
Third Advisor
Wenge Guoe
Fourth Advisor
Nan Gao
Abstract
In the biological field, the smallest unit of organisms in most biological systems is the single cell, and the classification of cells is an everlasting problem. A central task for analysis of single-cell RNA-seq data is to identify and characterize novel cell types. Currently, there are several classical methods, such as K-means algorithm, spectral clustering, and Gaussian Mixture Models (GMMs), which are widely used to cluster the cells. Furthermore, typical dimensional reduction methods such as PCA, t-SNE, and ZIDA have been introduced to overcome “the curse of dimensionality”. A more recent method scDeepCluster has demonstrated improved and promising performances in clustering single-cell data. In this study, a clustering method is proposed to optimize scDeepCluster with Siamese networks, which will learn more reliable functions for mapping inputs to the latent space. Also, the spectral clustering based on the SpectralNet algorithm is employed to improve clustering performances. Extensive experiments are conducted to demonstrate its superior performance in comparison with the current state-of-art methods.
Recommended Citation
Meng, Zixia, "Model-based deep siamese autoencoder for clustering single cell rna-seq data" (2020). Theses. 1794.
https://digitalcommons.njit.edu/theses/1794
Included in
Artificial Intelligence and Robotics Commons, Biomedical Engineering and Bioengineering Commons