File Download

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)
Related Researcher

남덕우

Nam, Dougu
Bioinformatics Lab.
Read More

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Benchmarking integration of single-cell differential expression

Author(s)
Nguyen, Hai C. T.Baik, BukyungYoon, SoraPark, TaesungNam, Dougu
Issued Date
2023-03
DOI
10.1038/s41467-023-37126-3
URI
https://scholarworks.unist.ac.kr/handle/201301/65212
Citation
NATURE COMMUNICATIONS, v.14, no.1, pp.1570
Abstract
Integration of single-cell RNA sequencing data between different samples has been a major challenge for analyzing cell populations. Here the authors benchmark 46 workflows for differential expression analysis of single-cell data with multiple batches and suggest several high-performance methods under different conditions based on simulation and real data analyses. Integration of single-cell RNA sequencing data between different samples has been a major challenge for analyzing cell populations. However, strategies to integrate differential expression analysis of single-cell data remain underinvestigated. Here, we benchmark 46 workflows for differential expression analysis of single-cell data with multiple batches. We show that batch effects, sequencing depth and data sparsity substantially impact their performances. Notably, we find that the use of batch-corrected data rarely improves the analysis for sparse data, whereas batch covariate modeling improves the analysis for substantial batch effects. We show that for low depth data, single-cell techniques based on zero-inflation model deteriorate the performance, whereas the analysis of uncorrected data using limmatrend, Wilcoxon test and fixed effects model performs well. We suggest several high-performance methods under different conditions based on various simulation and real data analyses. Additionally, we demonstrate that differential expression analysis for a specific cell type outperforms that of large-scale bulk sample data in prioritizing disease-related genes.
Publisher
NATURE PORTFOLIO
ISSN
2041-1723
Keyword
RNA-SEQ EXPERIMENTSQUALITY-CONTROLINFECTIONPACKAGECANCER

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.