An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data

Research output: Contribution to journal › Article › peer-review

18 Scopus citations

Abstract

Estimating cell-type compositions in complex diseases is an important step in investigating cellular heterogeneity, understanding disease etiology, and potentially facilitating early disease diagnosis and prevention. Here, we developed a computational statistical method, referred to as Multi-Omics Matrix Factorization (MOMF), to estimate the cell-type compositions of bulk RNA sequencing (RNA-seq) data by leveraging cell type-specific gene expression levels from single-cell RNA sequencing (scRNA-seq) data. MOMF not only directly models the count nature of gene expression data but also accounts for the uncertainty in cell type-specific mean gene expression levels. We demonstrate the benefits of MOMF through three real data applications: glioblastoma (GBM), colorectal cancer (CRC), and type II diabetes (T2D) studies. MOMF accurately estimates disease-related cell-type proportions, namely those of oligodendrocyte progenitor cells and macrophages, which are strongly associated with survival in GBM and CRC, respectively.
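To make the deconvolution idea concrete, the following is a minimal sketch and not the MOMF implementation. It assumes a fixed reference matrix of cell type-specific mean expression (genes x cell types), as might be derived from scRNA-seq data, and estimates per-sample proportions from bulk counts with the standard multiplicative update for the Poisson (Kullback-Leibler) objective. Unlike MOMF as described above, it ignores uncertainty in the reference. The function name `estimate_proportions`, the simulated data, and all parameter values are illustrative assumptions.

```python
import numpy as np


def estimate_proportions(bulk, reference, n_iter=2000, eps=1e-12):
    """Estimate cell-type proportions with a fixed reference (hypothetical helper).

    bulk:      genes x samples matrix of non-negative counts
    reference: genes x cell_types matrix of non-negative mean expression
    Returns a cell_types x samples matrix whose columns sum to 1.
    """
    bulk = np.asarray(bulk, dtype=float)
    reference = np.asarray(reference, dtype=float)
    n_types = reference.shape[1]
    n_samples = bulk.shape[1]

    # Start from equal weights for every cell type in every sample.
    weights = np.full((n_types, n_samples), 1.0 / n_types)
    col_sums = reference.sum(axis=0)[:, None] + eps

    for _ in range(n_iter):
        fitted = reference @ weights + eps
        # Multiplicative update for the Poisson / KL objective; it keeps
        # the weights non-negative without an explicit projection step.
        weights *= (reference.T @ (bulk / fitted)) / col_sums

    # Rescale each sample's weights so they can be read as proportions.
    return weights / (weights.sum(axis=0, keepdims=True) + eps)


# Simulated example (all values are made up for illustration).
rng = np.random.default_rng(0)
reference = rng.gamma(shape=0.5, scale=10.0, size=(200, 3))  # 200 genes, 3 cell types
true_props = rng.dirichlet(np.ones(3), size=5).T             # 3 cell types x 5 samples
bulk = rng.poisson(reference @ true_props * 50.0)            # bulk counts with Poisson noise

estimated = estimate_proportions(bulk, reference)
print(np.round(estimated, 2))
print(np.round(true_props, 2))
```

Working on the count scale with a Poisson objective, rather than least squares on transformed values, mirrors the abstract's point that the count nature of expression data is modeled directly. Treating the reference as known is the main simplification here.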

Original language: English
Article number: 1161
Journal: Cells
Volume: 8
Issue number: 10
State: Published - Oct 2019
Externally published: Yes

Keywords

  • Cell-type compositions
  • Deconvolution
  • Gene expression
  • Nonnegative matrix factorization
  • Single-cell RNA-seq
