Abstract

Abstract Motivation: The recent advance of single-cell technologies has brought new insights into complex biological phenomena. In particular, genome-wide single-cell measurements such as transcriptome sequencing enable the characterization of cellular composition as well as functional variation in homogenic cell populations. An important step in the single-cell transcriptome analysis is to group cells that belong to the same cell types based on gene expression patterns. The corresponding computational problem is to cluster a noisy high dimensional dataset with substantially fewer objects (cells) than the number of variables (genes). Results: In this article, we describe a novel algorithm named shared nearest neighbor (SNN)-Cliq that clusters single-cell transcriptomes. SNN-Cliq utilizes the concept of shared nearest neighbor that shows advantages in handling high-dimensional data. When evaluated on a variety of synthetic and real experimental datasets, SNN-Cliq outperformed the state-of-the-art methods tested. More importantly, the clustering results of SNN-Cliq reflect the cell types or origins with high accuracy. Availability and implementation: The algorithm is implemented in MATLAB and Python. The source code can be downloaded at http://bioinfo.uncc.edu/SNNCliq. Contact: zcsu@uncc.edu Supplementary information: Supplementary data are available at Bioinformatics online.

Keywords

Cluster analysisComputer scienceTranscriptomePython (programming language)Computational biologyIdentification (biology)k-nearest neighbors algorithmData miningGeneBiologyArtificial intelligenceGeneticsGene expression

MeSH Terms

AlgorithmsAnimalsCell LineageCluster AnalysisEmbryoMammalianGene Expression RegulationHigh-Throughput Nucleotide SequencingHumansMiceNeoplasmsProgramming LanguagesSingle-Cell AnalysisTranscriptome

Affiliated Institutions

Related Publications

Publication Info

Year
2015
Type
article
Volume
31
Issue
12
Pages
1974-1980
Citations
594
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

594
OpenAlex
27
Influential
524
CrossRef

Cite This

Chen Xu, Zhengchang Su (2015). Identification of cell types from single-cell transcriptomes using a novel clustering method. Bioinformatics , 31 (12) , 1974-1980. https://doi.org/10.1093/bioinformatics/btv088

Identifiers

DOI
10.1093/bioinformatics/btv088
PMID
25805722
PMCID
PMC6280782

Data Quality

Data completeness: 86%