B-Lightning: using bait genes for marker gene hunting in single-cell data with complex heterogeneity.

Publication date: Nov 22, 2024

In single-cell studies, cells can be characterized with multiple sources of heterogeneity (SOH) such as cell type, developmental stage, cell cycle phase, activation state, and so on. In some studies, many nuisance SOH are of no interest, but may confound the identification of the SOH of interest, and thus affect the accurate annotate the corresponding cell subpopulations. In this paper, we develop B-Lightning, a novel and robust method designed to identify marker genes and cell subpopulations corresponding to an SOH (e. g. cell activation status), isolating it from other SOH (e. g. cell type, cell cycle phase). B-Lightning uses an iterative approach to enrich a small set of trustworthy marker genes to more reliable marker genes and boost the signals of the SOH of interest. Multiple numerical and experimental studies showed that B-Lightning outperforms existing methods in terms of sensitivity and robustness in identifying marker genes. Moreover, it increases the power to differentiate cell subpopulations of interest from other heterogeneous cohorts. B-Lightning successfully identified new senescence markers in ciliated cells from human idiopathic pulmonary fibrosis lung tissues, new T-cell memory and effector markers in the context of SARS-COV-2 infections, and their synchronized patterns that were previously neglected, new AD markers that can better differentiate AD severity, and new dendritic cell functioning markers with differential transcriptomics profiles across breast cancer subtypes. This paper highlights B-Lightning’s potential as a powerful tool for single-cell data analysis, particularly in complex data sets where SOH of interest are entangled with numerous nuisance factors.

Open Access PDF

Concepts Keywords
Fibrosis Biomarkers
Hunting Biomarkers
Lightning COVID-19
Transcriptomics Genetic Markers
Trustworthy Genetic Markers
Humans
marker gene identification
multi-source heterogeneity
SARS-CoV-2
Single-Cell Analysis
single-cell RNA sequencing

Original Article

(Visited 1 times, 1 visits today)