A. Regev, S. A. Teichmann, E. S. Lander, I. Amit, C. Benoist, E. Birney, B. Bodenmiller, P. Campbell, P. Carninci, M. Clatworthy, et al., Science forum: The human cell atlas, eLife, vol. 6, p. e27041, 2017.
A. Regev, S. Teichmann, O. Rozenblatt-Rosen, M. Stubbington, K. Ardlie, I. Amit, P. Arlotta, G. Bader, C. Benoist, M. Biton, et al., The human cell atlas white paper, arXiv preprint arXiv: 1810.05192, 2018.
O. Rozenblatt-Rosen, M. J. T. Stubbington, A. Regev, and S. A. Teichmann, The human cell atlas: from vision to reality, Nature, vol. 550, no. 7677, pp. 451–453, 2017.
Y. H. Choi and J. K. Kim, Dissecting cellular heterogeneity using single-cell RNA sequencing, Molecules and Cells, vol. 42, no. 3, p. 189, 2019.
E. A. A. Alaoui, S. C. K. Tekouabou, S. Hartini, Z. Rustam, H. Silkan, and S Agoujil, Improvement in automated diagnosis of soft tissues tumors using machine learning, Big Data Mining and Analytics, vol. 4, no. 1, pp. 33–46, 2021.
A. E. Saliba, A. J. Westermann, S. A. Gorski, and J. Vogel, Single-cell RNA-seq: Advances and future challenges, Nucleic Acids Research, vol. 42, no. 14, pp. 8845–8860, 2014.
V. Y. Kiselev, T. S. Andrews, and M. Hemberg, Challenges in unsupervised clustering of single-cell RNA-seq data, Nature Reviews Genetics, vol. 20, no. 5, pp. 273–282, 2019.
O. Stegle, S. A. Teichmann, and J. C. Marioni, Computational and analytical challenges in single-cell transcriptomics, Nature Reviews Genetics, vol. 16, no. 3, pp. 133–145, 2015.
B. Wang, J. J. Zhu, E. Pierson, D. Ramazzotti, and S. Batzoglou, Visualization and analysis of single-cell RNA-seq data by kernel-based similarity learning, Nature Methods, vol. 14, no. 4, pp. 414–416, 2017.
P. J. Lin, M. Troup, and J. W. K. Ho, CIDR: Ultrafast and accurate clustering through imputation for single-cell RNA-seq data, Genome Biology, vol. 18, pp. 59, 2017.
E. Pierson and C. Yau, ZIFA: Dimensionality reduction for zero-inflated single-cell gene expression analysis, Genome Biology, vol. 16, pp. 241, 2015.
W. V. Li and J. J. Li, An accurate and robust imputation method scImpute for single-cell RNA-seq data, Nature Communications, vol. 9, pp. 997, 2018.
Z. G. Wang, X. Xiao, and S. Rajasekaran, Novel and efficient randomized algorithms for feature selection, Big Data Mining and Analytics, vol. 3, no. 3, pp. 208–224, 2020.
E. Becht, L. McInnes, J. Healy, C. A. Dutertre, I. W. H. Kwok, L. G. Ng, F. Ginhoux, and E. W. Newell, Dimensionality reduction for visualizing single-cell data using UMAP, Nature Biotechnology, vol. 37, pp. 38–44, 2019.
M. Z. Guo, H. Wang, S. S. Potter, J. A. Whitsett, and Y. Xu, SINCERA: A pipeline for single-cell RNA-Seq profiling analysis, PLoS Computational Biology, vol. 11, no. 11, p. e1004575, 2015.
H. Jiang, L. L. Sohn, H. Y. Huang, and L. N. Chen, Single cell clustering based on cell-pair differentiability correlation and variance analysis, Bioinformatics, vol. 34, no. 21, pp. 3684–3694, 2018.
V. Ntranos, G. M. Kamath, J. M. Zhang, L. Pachter, and D. N. Tse, Fast and accurate single-cell RNA-seq analysis by clustering of transcript-compatibility counts, Genome Biology, vol. 17, pp. 112, 2016.
M. B. Pouyan and D. Kostka, Random forest based similarity learning for single cell RNA sequencing data, Bioinformatics, vol. 34, no. 13, pp. i79–i88, 2018.
G. C. Liu, Z. C. Lin, and Y. Yu, Robust subspace segmentation by low-rank representation, in Proc. 27th Int. Conf. Machine Learning, Madison, WI, USA, 2010, pp. 663–670.
R. Vidal and P. Favaro, Low rank subspace clustering (LRSC), Pattern Recognition Letters, vol. 43, pp. 47–61, 2014.
R. Q. Zheng, M. Li, Z. L. Liang, F. X. Wu, Y. Pan, and J. X. Wang, SinNLRR: A robust subspace clustering method for cell type detection by non-negative and low-rank representation, Bioinformatics, vol. 35, no. 19, pp. 3642–3650, 2019.
R. Q. Zheng, Z. L. Liang, X. Chen, Y. Tian, C. Cao, and M. Li, An adaptive sparse subspace clustering for cell type identification, Frontiers in Genetics, vol. 11, pp. 407, 2020.
A. Butler, P. Hoffman, P. Smibert, E. Papalexi, and R. Satija, Integrating single-cell transcriptomic data across different conditions, technologies, and species, Nature Biotechnology, vol. 36, no. 5, pp. 411–420, 2018.
S. Park and H. Y. Zhao, Spectral clustering based on learning similarity matrix, Bioinformatics, vol. 34, no. 12, pp. 2069–2076, 2018.
V. Y. Kiselev, K. Kirschner, M. T. Schaub, T. Andrews, A. Yiu, T. Chandra, K. N. Natarajan, W. Reik, M. Barahona, A. R. Green, et al., SC3: Consensus clustering of single-cell RNA-seq data, Nature Methods, vol. 14, no. 5, pp. 483–486, 2017.
R. Huh, Y. C. Yang, Y. C. Jiang, Y. Shen, and Y. Li, SAME-clustering: Single-cell aggregated clustering via mixture model ensemble, Nucleic Acids Research, vol. 48, no. 1, pp. 86–95, 2020.
A. Zeisel, A. B. Muñoz-Manchado, S. Codeluppi, P. Lönnerberg, G. La Manno, A. Juréus, S. Marques, H. Munguba, L. Q. He, C. Betsholtz, et al., Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq, Science, vol. 347, no. 6226, pp. 1138–1142, 2015.
J. Žurauskienė and C. Yau, pcaReduce: Hierarchical clustering of single cell transcriptional profiles, BMC Bioinformatics, vol. 17, p. 140, 2016.
J. M. Zhang, J. Fan, H. C. Fan, D. Rosenfeld, and D. N. Tse, An interpretable framework for clustering single-cell RNA-Seq datasets, BMC Bioinformatics, vol. 19, pp. 93, 2018.
L. U. Von, A tutorial on spectral clustering, Statistics and Computing, vol. 17, no. 4, pp. 395–416, 2007.
V. D. Blondel, J. L. Guillaume, R. Lambiotte, and E. Lefebvre, Fast unfolding of communities in large networks, Journal of Statistical Mechanics: Theory and Experiment, vol. 2008, no. 10, p. P10008, 2008.
Y. Zhang, B. Wu, Y. Liu and J. Lv, Local community detection based on network motifs, Tsinghua Science and Technology, vol. 24, no. 6, pp. 716–727, 2019.
B. Zhao, J. Wang, M. Li, F. Wu, and Y. Pan, Detecting protein complexes based on uncertain graph model, IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 11, no. 3, pp. 486–497, 2014.
X. Meng, J. Xiang, R. Zheng, F. Wu, and M. Li, DPCMNE: Detecting protein complexes from protein-protein interaction networks via multi-level network embedding, IEEE/ACM Transactions on Computational Biology and Bioinformatics, .
Z. Liang, M. Li, R. Zheng, Y. Tian, X. Yan, J. Chen, F. X. Wu, and J. Wang, Cell type detection based on sparse subspace representation and similarity enhancement, Genomics, Proteomics & Bioinformatics, https://doi.org/10.1016/j.gpb.2020.09.004.
L Jiang, H Chen, L Pinello and GC Yuan, GiniClust: Detecting rare cell types from single-cell gene expression data with Gini index, Genome Biology, vol. 17, no. 1, pp. 1–13, 2016.
C. Trapnell, D. Cacchiarelli, J. Grimsby, P. Pokharel, S. Li, M. Morse, N. J Lennon, K. J. Livak, T. S. Mikkelsen, and J. L. Rinn, The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells, Nature Biotechnology, vol. 32, no. 4, p. 381, 2014.
A. Brazma, H. Parkinson, U. Sarkans, M. Shojatalab, J. Vilo, N. Abeygunawardena, E. Holloway, M. Kapushesky, P. Kemmeren, G. G. Lara, et al., ArrayExpress—A public repository for microarray gene expression data at the EBI, Nucleic Acids Research, vol. 31, no. 1, pp. 68–71, 2013.
R. Edgar, M. Domrachev, and A. E. Lash, Gene Expression Omnibus: NCBI gene expression and hybridization array data repository, Nucleic Acids Research, vol. 30, no. 1, pp. 207–210, 2002.
J. Zhao, S. Zhang, Y. Liu, X. He, M. Qu, G. Xu, H. Wang, M. Huang, J. Pan, Z. Liu, Z. Li, L. Liu, and Z. Zhang, Single-cell RNA sequencing reveals the heterogeneity of liver-resident immune cells in human, Cell Discovery, vol. 6, no. 1, pp. 1–19, 2020.
S. L. Goldman, M. MacKay, E. Afshinnekoo, A. M. Melnick, S. Wu, and C. E. Mason, The impact of heterogeneity on single-cell sequencing, Frontiers in Genetics, vol. 10, p. 8, 2019.
D. T. Ting, B. S. Wittner, M. Ligorio, N. V. Jordan, A. M. Shah, D. T. Miyamoto, N. Aceto, F. Bersani, B. W. Brannigan, K. Xega, et al., Single-cell rna sequencing identifies extracellular matrix gene expression by pancreatic circulating tumor cells, Cell Reports, vol. 8, no. 6, pp. 1905–1918, 2014.
F. Buettner, K. N. Natarajan, F. P. Casale, V. Proserpoi, A. Scialdone, F. J. Theis, S. A. Teichmann, J. C. Marioni, and O. Stegle, Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells, Nature Biotechnology, vol. 33, no. 2, pp. 155–160, 2015.
A. A. Pollen and T. J. Nowakowski, Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex, Nature Biotechnology, vol. 32, no. 10, p. 1053, 2014.
A. Schlitzer, V. Sivakamasundari, J. Chen, H. R. B. Sumatoh, J. Schreuder, J. Lum, B. Malleret, S. Zhang, A. Larbi, F. Zolezzi, et al., Identification of cdc1- and cdc2- committed dc progenitors reveals early lineage priming at the common dc progenitor stage in the bone marrow, Nature Immunology, vol. 16, no. 7, pp. 718–728, 2015.
G. La Manno, D. Gyllborg, S. Codeluppi, K. Nishimura, C. Salto, A. Zeisel, L. E. Borrm, S. R. W. Stott, E. M. Toledo, et al, Molecular diversity of midbrain development in mouse, Human, and Stem Cells, Cell, vol. 167, no. 2, pp. 566–580, 2016.
S. Darmanis, S. A. Sloan, Y. Zhang, M. Enge, C. Caneda, L. M. Shuer, M. G. H. Gephart, B. A. Barres, and S. R. Quake, A survey of human brain transcriptome diversity at the single cell level, PNAS, vol. 112, no. 23, pp. 7285–7290, 2015.
N. Leng, L. F. Chu, C. Barry, Y. Li, J. Choi, P. Jiang, R. M. Stewart, J. Thomson, and C Kendziorski, Oscope identifies oscillatory genes in unsynchronized single-cell RNA-seq experiments, Nature Methods, vol. 12, no. 10, p. 947, 2015.
J. G. Camp, K. Sekine, T. Gerber, H. Loeffler-Wirth, H. Binder, M. Gac, S. Kanton, J. Kageyama, G. Damm, D. Seehofer, L. Belicova, et al., Multilineage communication regulates human liver bud development from pluripotency, Nature, vol. 546, no. 7659, pp. 533–538, 2017.
D. Gokie, G. M. Stanley, B. Treutlein, N. F. Neff, J. G. Camp, R. C. Malenka, P. E. Rothwell, M. V. Fuccillo, T. C. Südhof, and S. R. Quake, Cellular Taxonomy of the Mouse Striatum as Revealed by Single Cell RNA Sequencing, Biophysical Journal, vol. 16, no. 4, pp. 1126–1137, 2016.
S. Nestorowa, F. K. Hamey, S. B. Pijuan, E. Diamanti, M. Shepherd, E. Laurenti, N. K. Wilson, D. G. Kent, and B. Gottgens, A single-cell resolution map of mouse hematopoietic stem and progenitor cell differentiation, Blood, vol. 128, no. 8, pp. e20-e31, 2016.
J. L. Close, Z. Z. Yao, B. P. Levi, J. A. Miller, T. E. Bakken, V. Menon, J. T. Ting, A. Wall, A. R. Krostag, E. R. Thomsen, et al., Single-cell profiling of an in vitro model of human interneuron development reveals temporal dynamics of cell type production and maturation, Neuron, vol. 93, no. 5, pp. 1035–1048, 2017.
M. Ester, H. P. Kriegel, J. Sander, and X. W. Xu, A density-based algorithm for discovering clusters in large spatial databases with noise, in Proc. 2nd Int. Conf. Knowledge Discovery and Data Mining, Portland, OR, USA, 1996, pp. 226–231.
A. Smoliński, B. Walczak, and J. W. Einax, Hierarchical clustering extended with visual complements of environmental data set, Chemometrics and Intelligent Laboratory Systems, vol. 64, no. 1, pp. 45–54, 2002.
A. Strehl and J. Ghosh, Cluster ensembles—A knowledge reuse framework for combining multiple partitions, Journal of Machine Learning Research, vol. 3, pp. 583–617, 2003.
S. Wagner and D. Wagner, Comparing Clusterings—AnOverview. Karlsruhe, Germany: University at Karlsruhe, 2017.
H. Cho, B. Berger, and J. Peng, Generalizable and scalable visualization of single-cell data using neural networks, Cell Systems, vol. 7, no. 2, pp. 185–191, 2018.
L. V. der Maaten and G Hinton, Visualizing data using tSNE, Journal of machine learning research, vol. 9, no. 11, pp. 2579-2605, 2008.
L. McInnes, J. Healy, and J. Melville, UMAP: Uniform manifold approximation and projection for dimension reduction, arXiv preprint arXiv: 1802.03426, 2018.
T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein, Introduction to Algorithms, 2nd ed. Cambridge, MA, USA: MIT Press, 2001, pp. 561–579.
W. B. March, P. Ram, and A. G. Gray, Fast euclidean minimum spanning tree: Algorithm, analysis, and applications, in Proc. 16th ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining, Washington, DC, USA, 2010, pp. 603–612.
R. R. Curtin, J. R. Cline, N. P. Slagle, W. B. March, P. Ram, N. A. Mehta, and A. G. Gray, MLPACK: A scalable C++ machine learning library, Journal of Machine Learning Research, vol. 14, pp. 801–805, 2013.
T. Nakamura, I. Okamoto, K. Sasaki, Y. Yabuta, C. Iwatani, H. Tsuchiya, Y. Seita, S. Nakamura, T. Yamamoto, and M. Saitou, A developmental coordinate of pluripotency among mice, monkeys and humans, Nature, vol. 537, no. 7618, pp. 57–62, 2016.
C. Lin, S. Jain, H. Kim, and Z. Bar-Joseph, Using neural networks for reducing the dimensions of single-cell RNA-seq data, Nucleic Acids Research, vol. 45, no. 17, p. e156, 2017.
H. J. Li, F. Horns, B. Wu, Q. J. Xie, J. F. Li, T. C. Li, D. J. Luginbuhl, S. R. Quake, and L. Q. Luo, Classifying Drosophila olfactory projection neuron subtypes by single-cell RNA sequencing, Cell, vol. 171, no. 5, pp. 1206–1220, 2017.
D. Usoskin, A. Furlan, S. Islam, H. Abdo, P. Lönnerberg, D. H. Lou, J. Hjerling-Leffler, J. Haeggström, O. Kharchenko, P. V. Kharchenko, et al., Unbiased classification of sensory neuron types by large-scale single-cell RNA sequencing, Nature Neuroscience, vol. 18, no. 1, pp. 145–153, 2015.
L. F. Chu, N. Leng, J. Zhang, Z. Hou, D. Manott, D. T. Vereide, J. Choi, C. Kendziorski, R. Stewart, and J. A. Thomson, Singlecell RNA-seq reveals novel regulators of human embryonic stem cell differentiation to definitive endoderm, Genome Biology, vol. 17, no. 1, pp. 1–20, 2016.
S. Petropoulos, D. Edsgärd, B. Reinius, Q. L. Deng, S. P. Panula, S. Codeluppi, A. P. Reyes, S. Linnarsson, R. Sandberg, and F. Lanner, Single-cell RNA-seq reveals lineage and X chromosome dynamics in human preimplantation embryos, Cell, vol. 165, no. 4, pp. 1012–1026, 2016.
M. Baron, A. Veres, S. L. Wolock, A. L. Faust, R. Gaujoux, A. Vetere, J. H. Ryu, B. K. Wagner, S. S. Shen-Orr, and A. M. Klein, A single-cell transcriptomic map of the human and mouse pancreas reveals inter- and intra-cell population structure, Cell Systems, vol. 3, no. 4, pp. 346–360, 2016.
J. Park, R. Shrestha, C. X. Qiu, A. Kondo, S. Z. Huang, M. Werth, M. Y. Li, J. Barasch, and K. Suszták, Single-cell transcriptomics of the mouse kidney reveals potential cellular targets of kidney disease, Science, vol. 360, no. 6390, pp. 758–763, 2018.
A. T. L. Lun, D. J. McCarthy, and J. C. Marioni, A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor, F1000 Research, vol. 5, pp. 2122, 2016.
D. J. McCarthy, K. R. Campbell, A. T. L. Lun, and Q. F. Wills, Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R, Bioinformatics, vol. 33, no. 8, pp. 1179–1186, 2017.
W. Saelens, R. Cannoodt, H. Todorov, and Y. Saeys, A comparison of single-cell trajectory inference methods, Nature Biotechnology, vol. 37, no. 5, pp. 547–554, 2019.
R. Q. Zheng, M. Li, X. Chen, S. Y. Zhao, F. X. Wu, Y. Pan, and J. X. Wang, An ensemble method to reconstruct gene regulatory networks based on multivariate adaptive regression splines, IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 18, no. 1, pp. 347–354, 2021.
X. Chen, M. Li, R. Q. Zheng, S. Y. Zhao, F. X. Wu, Y. H. Li, and J. X. Wang, A novel method of gene regulatory network structure inference from gene knock-out expression data, Tsinghua Science and Technology, vol. 24, no. 4, pp. 446–455, 2019.
M. S. Mahmud, J. Z. Huang, S. Salloum, T. Z. Emara, and K. Sadatdiynov, A survey of data partitioning and sampling methods to support big data analysis, Big Data Mining and Analytics, vol. 3, no. 2, pp. 85–101, 2020.
L. Wang and W. Fan, A multilevel splitting algorithm for quick sampling, Tsinghua Science and Technology, vol. 26, no. 4, pp. 417–425, 2021.