Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wang N, Hoffman EP, Chen L, Chen L, Zhang Z, Liu C, Yu G, Herrington DM, Clarke R, Wang Y. Mathematical modelling of transcriptional heterogeneity identifies novel markers and subpopulations in complex tissues. Sci Rep 2016;6:18909. [PMID: 26739359 DOI: 10.1038/srep18909] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2015] [Accepted: 11/23/2015] [Indexed: 01/18/2023] Open

Cited by Other Article(s)

Liu T, Liu C, Li Q, Zheng X, Zou F. Adaptive Regularized Tri-Factor Non-Negative Matrix Factorization for Cell Type Deconvolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.12.07.570631. [PMID: 38106220 PMCID: PMC10723472 DOI: 10.1101/2023.12.07.570631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]

Tiong KL, Luzhbin D, Yeang CH. Assessing transcriptomic heterogeneity of single-cell RNASeq data by bulk-level gene expression data. BMC Bioinformatics 2024;25:209. [PMID: 38867193 PMCID: PMC11167951 DOI: 10.1186/s12859-024-05825-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 06/03/2024] [Indexed: 06/14/2024] Open

Abstract

BACKGROUND

Single-cell RNA sequencing (sc-RNASeq) data illuminate transcriptomic heterogeneity but also possess a high level of noise, abundant missing entries and sometimes inadequate or no cell type annotations at all. Bulk-level gene expression data lack direct information of cell population composition but are more robust and complete and often better annotated. We propose a modeling framework to integrate bulk-level and single-cell RNASeq data to address the deficiencies and leverage the mutual strengths of each type of data and enable a more comprehensive inference of their transcriptomic heterogeneity. Contrary to the standard approaches of factorizing the bulk-level data with one algorithm and (for some methods) treating single-cell RNASeq data as references to decompose bulk-level data, we employed multiple deconvolution algorithms to factorize the bulk-level data, constructed the probabilistic graphical models of cell-level gene expressions from the decomposition outcomes, and compared the log-likelihood scores of these models in single-cell data. We term this framework backward deconvolution as inference operates from coarse-grained bulk-level data to fine-grained single-cell data. As the abundant missing entries in sc-RNASeq data have a significant effect on log-likelihood scores, we also developed a criterion for inclusion or exclusion of zero entries in log-likelihood score computation.

RESULTS

We selected nine deconvolution algorithms and validated backward deconvolution in five datasets. In the in-silico mixtures of mouse sc-RNASeq data, the log-likelihood scores of the deconvolution algorithms were strongly anticorrelated with their errors of mixture coefficients and cell type specific gene expression signatures. In the true bulk-level mouse data, the sample mixture coefficients were unknown but the log-likelihood scores were strongly correlated with accuracy rates of inferred cell types. In the data of autism spectrum disorder (ASD) and normal controls, we found that ASD brains possessed higher fractions of astrocytes and lower fractions of NRGN-expressing neurons than normal controls. In datasets of breast cancer and low-grade gliomas (LGG), we compared the log-likelihood scores of three simple hypotheses about the gene expression patterns of the cell types underlying the tumor subtypes. The model that tumors of each subtype were dominated by one cell type persistently outperformed an alternative model that each cell type had elevated expression in one gene group and tumors were mixtures of those cell types. Superiority of the former model is also supported by comparing the real breast cancer sc-RNASeq clusters with those generated by simulated sc-RNASeq data.

CONCLUSIONS

The results indicate that backward deconvolution serves as a sensible model selection tool for deconvolution algorithms and facilitates discerning hypotheses about cell type compositions underlying heterogeneous specimens such as tumors.

Wu CT, Du D, Chen L, Dai R, Liu C, Yu G, Bhardwaj S, Parker SJ, Zhang Z, Clarke R, Herrington DM, Wang Y. CAM3.0: determining cell type composition and expression from bulk tissues with fully unsupervised deconvolution. Bioinformatics 2024;40:btae107. [PMID: 38407991 PMCID: PMC10924278 DOI: 10.1093/bioinformatics/btae107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 01/13/2024] [Accepted: 02/25/2024] [Indexed: 02/28/2024] Open

Herrington D, Wang Y. CLINICAL HETEROGENEITY IN THE AGE OF BIG DATA, ADVANCED ANALYTICS, AND COMPLEXITY THEORY. TRANSACTIONS OF THE AMERICAN CLINICAL AND CLIMATOLOGICAL ASSOCIATION 2023;133:56-68. [PMID: 37701617 PMCID: PMC10493739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 09/14/2023]

MRI Radiogenomics in Precision Oncology: New Diagnosis and Treatment Method. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022;2022:2703350. [PMID: 35845886 PMCID: PMC9282990 DOI: 10.1155/2022/2703350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 05/04/2022] [Accepted: 05/25/2022] [Indexed: 11/21/2022]

Predicting Algorithm of Tissue Cell Ratio Based on Deep Learning Using Single-Cell RNA Sequencing. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12125790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]

Wang Y, Gao J, Xuan C, Guan T, Wang Y, Zhou G, Ding T. FSCAM: CAM-Based Feature Selection for Clustering scRNA-seq. Interdiscip Sci 2022;14:394-408. [PMID: 35028910 DOI: 10.1007/s12539-021-00495-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 11/22/2021] [Accepted: 11/23/2021] [Indexed: 06/14/2023]

Lu Y, Wu CT, Parker SJ, Cheng Z, Saylor G, Van Eyk JE, Yu G, Clarke R, Herrington DM, Wang Y. COT: an efficient and accurate method for detecting marker genes among many subtypes. BIOINFORMATICS ADVANCES 2022;2:vbac037. [PMID: 35673616 PMCID: PMC9163574 DOI: 10.1093/bioadv/vbac037] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 04/10/2022] [Accepted: 05/16/2022] [Indexed: 01/27/2023]

Comprehensive evaluation of deconvolution methods for human brain gene expression. Nat Commun 2022;13:1358. [PMID: 35292647 PMCID: PMC8924248 DOI: 10.1038/s41467-022-28655-4] [Citation(s) in RCA: 42] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Accepted: 01/28/2022] [Indexed: 11/08/2022] Open

Chen L, Wu CT, Lin CH, Dai R, Liu C, Clarke R, Yu G, Van Eyk JE, Herrington DM, Wang Y. swCAM: estimation of subtype-specific expressions in individual samples with unsupervised sample-wise deconvolution. Bioinformatics 2022;38:1403-1410. [PMID: 34904628 PMCID: PMC8826012 DOI: 10.1093/bioinformatics/btab839] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Revised: 10/30/2021] [Accepted: 12/10/2021] [Indexed: 02/04/2023] Open

Abstract

MOTIVATION

Complex biological tissues are often a heterogeneous mixture of several molecularly distinct cell subtypes. Both subtype compositions and subtype-specific (STS) expressions can vary across biological conditions. Computational deconvolution aims to dissect patterns of bulk tissue data into subtype compositions and STS expressions. Existing deconvolution methods can only estimate averaged STS expressions in a population, while many downstream analyses such as inferring co-expression networks in particular subtypes require subtype expression estimates in individual samples. However, individual-level deconvolution is a mathematically underdetermined problem because there are more variables than observations.
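The "more variables than observations" point can be made concrete with a simple count. The notation below is assumed for illustration and is not taken from the swCAM paper: N bulk samples, M genes, K subtypes, x_i the bulk profile of sample i, a_ik the proportion of subtype k in sample i, and s_k (or s_ik) the subtype-specific expression profile.

```latex
% Illustrative notation only (assumed, not from the cited paper):
% N samples, M genes, K subtypes; x_i, s_k, s_{ik} are vectors of length M.
\begin{aligned}
\text{population-level:}\quad & x_i = \sum_{k=1}^{K} a_{ik}\, s_k,
  && \text{unknowns: } NK + KM, \quad \text{observations: } NM,\\
\text{sample-wise:}\quad & x_i = \sum_{k=1}^{K} a_{ik}\, s_{ik},
  && \text{unknowns: } NK + NKM, \quad \text{observations: } NM.
\end{aligned}
```

Under these assumptions the sample-wise unknowns exceed the NM observed values for any K of 1 or more (NK + NKM > NM), so the factorization cannot be pinned down by the data alone and needs additional constraints or regularization.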

RESULTS

We report a sample-wise Convex Analysis of Mixtures (swCAM) method that can estimate subtype proportions and STS expressions in individual samples from bulk tissue transcriptomes. We extend our previous CAM framework to include a new term accounting for between-sample variations and formulate swCAM as a nuclear-norm and ℓ2,1-norm regularized matrix factorization problem. We determine hyperparameter values using cross-validation with random entry exclusion and obtain a swCAM solution using an efficient alternating direction method of multipliers. Experimental results on realistic simulation data show that swCAM can accurately estimate STS expressions in individual samples and successfully extract co-expression networks in particular subtypes that are otherwise unobtainable using bulk data. In two real-world applications, swCAM analysis of bulk RNASeq data from brain tissue of cases and controls with bipolar disorder or Alzheimer's disease identified significant changes in cell proportion, expression pattern and co-expression module in patient neurons. Comparative evaluation of swCAM versus peer methods is also provided.

AVAILABILITY AND IMPLEMENTATION

The R Scripts of swCAM are freely available at https://github.com/Lululuella/swCAM. A user's guide and a vignette are provided.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Boldina G, Fogel P, Rocher C, Bettembourg C, Luta G, Augé F. A2Sign: Agnostic Algorithms for Signatures-a universal method for identifying molecular signatures from transcriptomic datasets prior to cell-type deconvolution. Bioinformatics 2022;38:1015-1021. [PMID: 34788798 DOI: 10.1093/bioinformatics/btab773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 09/17/2021] [Accepted: 11/09/2021] [Indexed: 02/03/2023] Open

Comparative assessment and novel strategy on methods for imputing proteomics data. Sci Rep 2022;12:1067. [PMID: 35058491 PMCID: PMC8776850 DOI: 10.1038/s41598-022-04938-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Accepted: 01/04/2022] [Indexed: 11/09/2022] Open

Saddic L, Orosco A, Guo D, Milewicz DM, Troxlair D, Heide RV, Herrington D, Wang Y, Azizzadeh A, Parker SJ. Proteomic analysis of descending thoracic aorta identifies unique and universal signatures of aneurysm and dissection. JVS Vasc Sci 2022;3:85-181. [PMID: 35280433 PMCID: PMC8914561 DOI: 10.1016/j.jvssci.2022.01.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Accepted: 01/05/2022] [Indexed: 01/05/2023] Open

Ahmed M, Lai TH, Kim DR. A Small Fraction of Progenitors Differentiate Into Mature Adipocytes by Escaping the Constraints on the Cell Structure. Front Cell Dev Biol 2021;9:753042. [PMID: 34708046 PMCID: PMC8542793 DOI: 10.3389/fcell.2021.753042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Accepted: 09/10/2021] [Indexed: 11/13/2022] Open

Xie Y, Zhao J, Zhang P. A multicompartment model for intratumor tissue-specific analysis of DCE-MRI using non-negative matrix factorization. Med Phys 2021;48:2400-2411. [PMID: 33608885 DOI: 10.1002/mp.14793] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Revised: 12/22/2020] [Accepted: 01/29/2021] [Indexed: 11/12/2022] Open

Amrhein L, Fuchs C. stochprofML: stochastic profiling using maximum likelihood estimation in R. BMC Bioinformatics 2021;22:123. [PMID: 33722188 PMCID: PMC7958472 DOI: 10.1186/s12859-021-03970-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Accepted: 01/15/2021] [Indexed: 11/10/2022] Open

Hunt GJ, Gagnon-Bartsch JA. The role of scale in the estimation of cell-type proportions. Ann Appl Stat 2021. [DOI: 10.1214/20-aoas1395] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Data-driven detection of subtype-specific differentially expressed genes. Sci Rep 2021;11:332. [PMID: 33432005 PMCID: PMC7801594 DOI: 10.1038/s41598-020-79704-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Accepted: 12/11/2020] [Indexed: 11/08/2022] Open

Chen Z, Wu A. Progress and challenge for computational quantification of tissue immune cells. Brief Bioinform 2021;22:6065002. [PMID: 33401306 DOI: 10.1093/bib/bbaa358] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Revised: 10/23/2020] [Accepted: 11/07/2020] [Indexed: 12/28/2022] Open

Chen L, Wu CT, Wang N, Herrington DM, Clarke R, Wang Y. debCAM: a bioconductor R package for fully unsupervised deconvolution of complex tissues. Bioinformatics 2020;36:3927-3929. [PMID: 32219387 DOI: 10.1093/bioinformatics/btaa205] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Revised: 03/05/2020] [Accepted: 03/23/2020] [Indexed: 11/14/2022] Open

Zhou T, Sengupta S, Müller P, Ji Y. RNDClone: Tumor subclone reconstruction based on integrating DNA and RNA sequence data. Ann Appl Stat 2020. [DOI: 10.1214/20-aoas1368] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Integrative analyses prioritize GNL3 as a risk gene for bipolar disorder. Mol Psychiatry 2020;25:2672-2684. [PMID: 32826963 DOI: 10.1038/s41380-020-00866-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Revised: 07/30/2020] [Accepted: 08/06/2020] [Indexed: 12/14/2022]

Radiogenomic signatures reveal multiscale intratumour heterogeneity associated with biological functions and survival in breast cancer. Nat Commun 2020;11:4861. [PMID: 32978398 PMCID: PMC7519071 DOI: 10.1038/s41467-020-18703-2] [Citation(s) in RCA: 49] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Accepted: 09/08/2020] [Indexed: 12/24/2022] Open

Clarke R, Kraikivski P, Jones BC, Sevigny CM, Sengupta S, Wang Y. A systems biology approach to discovering pathway signaling dysregulation in metastasis. Cancer Metastasis Rev 2020;39:903-918. [PMID: 32776157 PMCID: PMC7487029 DOI: 10.1007/s10555-020-09921-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/11/2020] [Accepted: 07/13/2020] [Indexed: 02/07/2023]

Parker SJ, Chen L, Spivia W, Saylor G, Mao C, Venkatraman V, Holewinski RJ, Mastali M, Pandey R, Athas G, Yu G, Fu Q, Troxlair D, Vander Heide R, Herrington D, Van Eyk JE, Wang Y. Identification of Putative Early Atherosclerosis Biomarkers by Unsupervised Deconvolution of Heterogeneous Vascular Proteomes. J Proteome Res 2020;19:2794-2806. [PMID: 32202800 DOI: 10.1021/acs.jproteome.0c00118] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Abstract

Coronary artery disease remains a leading cause of death in industrialized nations, and early detection of disease is a critical intervention target to effectively treat patients and manage risk. Proteomic analysis of mixed tissue homogenates may obscure subtle protein changes that occur uniquely in underlying tissue subtypes. The unsupervised 'convex analysis of mixtures' (CAM) tool has previously been shown to effectively segregate cellular subtypes from mixed expression data. In this study, we hypothesized that CAM would identify proteomic information specifically informative to early atherosclerosis lesion involvement that could lead to potential markers of early disease detection. We quantified the proteome of 99 paired abdominal aorta (AA) and left anterior descending coronary artery (LAD) specimens (N = 198 specimens total) acquired during autopsy of young adults free of diagnosed cardiac disease. The CAM tool was then used to segregate protein subsets uniquely associated with different underlying tissue types, yielding markers of normal and fibrous plaque (FP) tissues in LAD and AA (N = 62 lesion markers). CAM-derived FP marker expression was validated against pathologist-estimated luminal surface involvement of FP, as well as in an orthogonal cohort of "pure" fibrous plaque, fatty streak, and normal vascular specimens. A targeted mass spectrometry (MS) assay quantified 39 of 62 CAM-FP markers in plasma from women with angiographically verified coronary artery disease (CAD, N = 46) or free from apparent CAD (control, N = 40). Elastic net variable selection with logistic regression reduced this list to 10 proteins capable of classifying CAD status in this cohort with <6% misclassification error, and a mean area under the receiver operating characteristic curve of 0.992 (confidence interval 0.968-0.998) after cross-validation. The proteomics-CAM workflow identified lesion-specific molecular biomarker candidates by distilling the most representative molecules from heterogeneous tissue types.

Affiliation(s)

Sarah J Parker: Heart Institute & Advanced Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
Lulu Chen: Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Arlington, Virginia 24061, United States
Weston Spivia: Heart Institute & Advanced Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
Georgia Saylor: Department of Cardiovascular Medicine, Wake Forest University, Winston-Salem, North Carolina 27101, United States
Chunhong Mao: Biocomplexity Institute & Initiative, University of Virginia, Charlottesville, Virginia 22904, United States
Vidya Venkatraman: Heart Institute & Advanced Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
Ronald J Holewinski: Heart Institute & Advanced Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
Mitra Mastali: Heart Institute & Advanced Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
Rakhi Pandey: Heart Institute & Advanced Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
Grace Athas: Department of Pathology, Louisiana State University, New Orleans, Louisiana 70112, United States
Guoqiang Yu: Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Arlington, Virginia 24061, United States
Qin Fu: Heart Institute & Advanced Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
Dana Troxlair: Department of Pathology, Louisiana State University, New Orleans, Louisiana 70112, United States
Richard Vander Heide: Department of Pathology, Louisiana State University, New Orleans, Louisiana 70112, United States
David Herrington: Department of Cardiovascular Medicine, Wake Forest University, Winston-Salem, North Carolina 27101, United States
Jennifer E Van Eyk: Heart Institute & Advanced Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
Yue Wang: Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Arlington, Virginia 24061, United States

Psychiatric Genetics, Epigenetics, and Cellular Models in Coming Years. JOURNAL OF PSYCHIATRY AND BRAIN SCIENCE 2019;4. [PMID: 31608310 PMCID: PMC6788748 DOI: 10.20900/jpbs.20190012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Sun X, Sun S, Yang S. An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data. Cells 2019;8:E1161. [PMID: 31569701 PMCID: PMC6830085 DOI: 10.3390/cells8101161] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2019] [Revised: 09/23/2019] [Accepted: 09/26/2019] [Indexed: 12/25/2022] Open

Sompairac N, Nazarov PV, Czerwinska U, Cantini L, Biton A, Molkenov A, Zhumadilov Z, Barillot E, Radvanyi F, Gorban A, Kairov U, Zinovyev A. Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets. Int J Mol Sci 2019;20:E4414. [PMID: 31500324 PMCID: PMC6771121 DOI: 10.3390/ijms20184414] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2019] [Revised: 09/02/2019] [Accepted: 09/04/2019] [Indexed: 12/13/2022] Open

Affiliation(s)

Nicolas Sompairac: Institut Curie, PSL Research University, 75005 Paris, France. INSERM U900, 75248 Paris, France. CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France. Centre de Recherches Interdisciplinaires, Université Paris Descartes, 75004 Paris, France.
Petr V Nazarov: Multiomics Data Science Research Group, Quantitative Biology Unit, Luxembourg Institute of Health (LIH), L-1445 Strassen, Luxembourg.
Urszula Czerwinska: Institut Curie, PSL Research University, 75005 Paris, France. INSERM U900, 75248 Paris, France. CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France.
Laura Cantini: Computational Systems Biology Team, Institut de Biologie de l'Ecole Normale Supérieure, CNRS UMR8197, INSERM U1024, Ecole Normale Supérieure, PSL Research University, 75005 Paris, France.
Anne Biton: Centre de Bioinformatique, Biostatistique et Biologie Intégrative (C3BI, USR 3756 Institut Pasteur et CNRS), 75015 Paris, France.
Askhat Molkenov: Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, 010000 Nur-Sultan, Kazakhstan.
Zhaxybay Zhumadilov: Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, 010000 Nur-Sultan, Kazakhstan. University Medical Center, Nazarbayev University, 010000 Nur-Sultan, Kazakhstan.
Emmanuel Barillot: Institut Curie, PSL Research University, 75005 Paris, France. INSERM U900, 75248 Paris, France. CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France.
Francois Radvanyi: Institut Curie, PSL Research University, 75005 Paris, France. CNRS, UMR 144, 75248 Paris, France.
Alexander Gorban: Center for Mathematical Modeling, University of Leicester, Leicester LE1 7RH, UK. Lobachevsky University, 603022 Nizhny Novgorod, Russia.
Ulykbek Kairov: Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, 010000 Nur-Sultan, Kazakhstan.
Andrei Zinovyev: Institut Curie, PSL Research University, 75005 Paris, France. INSERM U900, 75248 Paris, France. CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France.

Avila Cobos F, Vandesompele J, Mestdagh P, De Preter K. Computational deconvolution of transcriptomics data from mixed cell populations. Bioinformatics 2018;34:1969-1979. [PMID: 29351586 DOI: 10.1093/bioinformatics/bty019] [Citation(s) in RCA: 130] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 01/10/2018] [Indexed: 12/22/2022] Open

Clarke R, Tyson JJ, Tan M, Baumann WT, Jin L, Xuan J, Wang Y. Systems biology: perspectives on multiscale modeling in research on endocrine-related cancers. Endocr Relat Cancer 2019;26:R345-R368. [PMID: 30965282 PMCID: PMC7045974 DOI: 10.1530/erc-18-0309] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/25/2019] [Accepted: 04/08/2019] [Indexed: 12/12/2022]

Complete deconvolution of cellular mixtures based on linearity of transcriptional signatures. Nat Commun 2019;10:2209. [PMID: 31101809 PMCID: PMC6525259 DOI: 10.1038/s41467-019-09990-5] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2018] [Accepted: 04/11/2019] [Indexed: 11/08/2022] Open

Radiomic analysis of imaging heterogeneity in tumours and the surrounding parenchyma based on unsupervised decomposition of DCE-MRI for predicting molecular subtypes of breast cancer. Eur Radiol 2019;29:4456-4467. [PMID: 30617495 DOI: 10.1007/s00330-018-5891-3] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2018] [Revised: 10/02/2018] [Accepted: 11/13/2018] [Indexed: 10/27/2022]

Abstract

OBJECTIVES

This study aimed to predict the molecular subtypes of breast cancer via intratumoural and peritumoural radiomic analysis with subregion identification based on the decomposition of dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI).

METHODS

The study included 211 women with histopathologically confirmed breast cancer. We utilised a completely unsupervised convex analysis of mixtures (CAM) method by unmixing dynamic imaging series from heterogeneous tissues. Each tumour and the surrounding parenchyma were thus decomposed into multiple subregions, representing different vascular characterisations, from which radiomic features were extracted. A random forest model was trained and tested using a leave-one-out cross-validation (LOOCV) method to predict breast cancer subtypes. The predictive models from tumoural and peritumoural subregions were fused for classification.
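The leave-one-out cross-validation (LOOCV) loop mentioned above can be sketched as follows. This is a minimal illustration, not the study's pipeline: a nearest-centroid rule is swapped in for the random forest so the example needs only the standard library, and the function names and toy data are hypothetical.

```python
from collections import defaultdict


def nearest_centroid_predict(train_x, train_y, x):
    """Stand-in classifier (NOT a random forest): pick the closest class centroid."""
    sums = {}
    counts = defaultdict(int)
    for features, label in zip(train_x, train_y):
        if label not in sums:
            sums[label] = [0.0] * len(features)
        sums[label] = [s + f for s, f in zip(sums[label], features)]
        counts[label] += 1
    best_label, best_dist = None, float("inf")
    for label, total in sums.items():
        centroid = [s / counts[label] for s in total]
        dist = sum((c - v) ** 2 for c, v in zip(centroid, x))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def loocv_accuracy(xs, ys, predict=nearest_centroid_predict):
    """Hold out each sample once, fit on the rest, and return the hit rate."""
    hits = 0
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        hits += predict(train_x, train_y, xs[i]) == ys[i]
    return hits / len(xs)


# Toy, made-up feature vectors for two well-separated classes.
toy_x = [[0.0, 0.1], [0.2, 0.0], [0.1, 0.2], [1.0, 0.9], [0.9, 1.1], [1.1, 1.0]]
toy_y = ["A", "A", "A", "B", "B", "B"]
print(loocv_accuracy(toy_x, toy_y))
```

Any classifier with the same `predict(train_x, train_y, x)` shape could be passed in place of the stand-in; with n samples the loop fits n models, each tested on the single sample it never saw.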

RESULTS

Tumour and peritumour DCE-MR images were decomposed into three compartments, representing plasma input, fast-flow kinetics, and slow-flow kinetics. The tumour subregion related to fast-flow kinetics showed the best performance among the subregions for differentiating between patients with four molecular subtypes (area under the receiver operating characteristic curve (AUC) = 0.832), exhibiting an AUC value significantly (p < 0.0001) higher than that obtained with the entire tumour (AUC = 0.719). When the tumour- and parenchyma-based predictive models were fused, the performance, measured as the AUC, increased to 0.897; this value was significantly higher than that obtained with other tumour partition methods.

CONCLUSIONS

Radiomic analysis of intratumoural and peritumoural heterogeneity based on the decomposition of image time-series signals has the potential to more accurately identify tumour kinetic features and serve as a valuable clinical marker to enhance the prediction of breast cancer subtypes.

KEY POINTS

• Decomposition of image time-series signals has the potential to more accurately identify tumour kinetic features.
• Fusion of intratumoural- and peritumoural-based predictive models improves the prediction of breast cancer subtypes.

Hunt GJ, Freytag S, Bahlo M, Gagnon-Bartsch JA. dtangle: accurate and robust cell type deconvolution. Bioinformatics 2018;35:2093-2099. [DOI: 10.1093/bioinformatics/bty926] [Citation(s) in RCA: 55] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2017] [Revised: 10/20/2018] [Accepted: 11/06/2018] [Indexed: 11/14/2022] Open

Dimitrakopoulou K, Wik E, Akslen LA, Jonassen I. Deblender: a semi-/unsupervised multi-operational computational method for complete deconvolution of expression data from heterogeneous samples. BMC Bioinformatics 2018;19:408. [PMID: 30404611 PMCID: PMC6223087 DOI: 10.1186/s12859-018-2442-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2018] [Accepted: 10/22/2018] [Indexed: 12/24/2022] Open

Abstract

BACKGROUND

Towards discovering robust cancer biomarkers, it is imperative to unravel the cellular heterogeneity of patient samples and comprehend the interactions between cancer cells and the various cell types in the tumor microenvironment. The first generation of 'partial' computational deconvolution methods required prior information either on the cell/tissue type proportions or the cell/tissue type-specific expression signatures and the number of involved cell/tissue types. The second generation of 'complete' approaches allowed estimating both of the cell/tissue type proportions and cell/tissue type-specific expression profiles directly from the mixed gene expression data, based on known (or automatically identified) cell/tissue type-specific marker genes.

RESULTS

We present Deblender, a flexible complete deconvolution tool operating in semi-/unsupervised mode based on the user's access to known marker gene lists and information about cell/tissue composition. In case of no prior knowledge, global gene expression variability is used in clustering the mixed data to substitute marker sets with cluster sets. In addition, we integrate a model selection criterion to predict the number of constituent cell/tissue types. Moreover, we provide a tailored algorithmic scheme to estimate mixture proportions for realistic experimental cases where the number of involved cell/tissue types exceeds the number of mixed samples. We assess the performance of Deblender and a set of state-of-the-art existing tools on a comprehensive set of benchmark and patient cancer mixture expression datasets (including TCGA).

CONCLUSION

Our results corroborate that Deblender can be a valuable tool to improve understanding of gene expression datasets with implications for prediction and clinical utilization. Deblender is implemented in MATLAB and is available from https://github.com/kondim1983/Deblender/.

Xie F, Zhou M, Xu Y. BayCount: A Bayesian decomposition method for inferring tumor heterogeneity using RNA-Seq counts. Ann Appl Stat 2018. [DOI: 10.1214/17-aoas1123] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Indexed: 11/19/2022]

Lin CH, Chi CY, Chen L, Miller DJ, Wang Y. Detection of Sources in Non-Negative Blind Source Separation by Minimum Description Length Criterion. IEEE Transactions on Neural Networks and Learning Systems 2018;29:4022-4037. [PMID: 28981430 DOI: 10.1109/tnnls.2017.2749279] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 06/07/2023]

Herrington DM, Mao C, Parker SJ, Fu Z, Yu G, Chen L, Venkatraman V, Fu Y, Wang Y, Howard TD, Jun G, Zhao CF, Liu Y, Saylor G, Spivia WR, Athas GB, Troxclair D, Hixson JE, Vander Heide RS, Wang Y, Van Eyk JE. Proteomic Architecture of Human Coronary and Aortic Atherosclerosis. Circulation 2018;137:2741-2756. [PMID: 29915101 PMCID: PMC6011234 DOI: 10.1161/circulationaha.118.034365] [Citation(s) in RCA: 90] [Impact Index Per Article: 15.0] [Received: 02/14/2018] [Accepted: 04/12/2018] [Indexed: 12/26/2022]

Abstract

BACKGROUND

The inability to detect premature atherosclerosis significantly hinders implementation of personalized therapy to prevent coronary heart disease. A comprehensive understanding of arterial protein networks and how they change in early atherosclerosis could identify new biomarkers for disease detection and improved therapeutic targets.

METHODS

Here we describe the human arterial proteome and proteomic features strongly associated with early atherosclerosis based on mass spectrometry analysis of coronary artery and aortic specimens from 100 autopsied young adults (200 arterial specimens). Convex analysis of mixtures, differential dependent network modeling, and bioinformatic analyses defined the composition, network rewiring, and likely regulatory features of the protein networks associated with early atherosclerosis and how they vary across 2 anatomic distributions.

RESULTS

The data document significant differences in mitochondrial protein abundance between coronary and aortic samples (coronary>>aortic), and between atherosclerotic and normal tissues (atherosclerotic<

CONCLUSIONS

The human arterial proteome can be viewed as a complex network whose architectural features vary considerably as a function of anatomic location and the presence or absence of atherosclerosis. The data suggest important reductions in mitochondrial protein abundance in early atherosclerosis and also identify a subset of plasma proteins that are highly predictive of angiographically defined coronary disease.

Affiliation(s)

David M Herrington Section on Cardiovascular Medicine, Department of Internal Medicine (D.M.H., C.F.Z., G.S.)
Chunhong Mao Biocomplexity Institute of Virginia Tech, Virginia Tech, Blacksburg (C.M.)
Sarah J Parker Advanced Clinical Biosystems Research Institute, Cedars-Sinai Heart Institute, and Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, CA (S.T.P., V.V., W.R.S., J.E.V.E.)
Zongming Fu Johns Hopkins Medical Institute, Baltimore, MD (Z.F.)
Guoqiang Yu Department of Electrical and Computer Engineering, Virginia Tech, Arlington (G.Y., L.C., Y.F., Yizhi Wang, Yue Wang)
Lulu Chen Department of Electrical and Computer Engineering, Virginia Tech, Arlington (G.Y., L.C., Y.F., Yizhi Wang, Yue Wang)
Vidya Venkatraman Advanced Clinical Biosystems Research Institute, Cedars-Sinai Heart Institute, and Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, CA (S.T.P., V.V., W.R.S., J.E.V.E.)
Yi Fu Department of Electrical and Computer Engineering, Virginia Tech, Arlington (G.Y., L.C., Y.F., Yizhi Wang, Yue Wang)
Yizhi Wang Department of Electrical and Computer Engineering, Virginia Tech, Arlington (G.Y., L.C., Y.F., Yizhi Wang, Yue Wang)
Timothy D Howard Department of Biochemistry (T.D.H.)
Goo Jun Department of Epidemiology, Human Genetics and Environmental Sciences, Human Genetics Center, School of Public Health, University of Texas Health Science Center at Houston (G.J., J.E.H.)
Caroline F Zhao Section on Cardiovascular Medicine, Department of Internal Medicine (D.M.H., C.F.Z., G.S.)
Yongmei Liu Department of Epidemiology, Division of Public Health Sciences (Y.L.), Wake Forest School of Medicine, Winston-Salem, NC
Georgia Saylor Section on Cardiovascular Medicine, Department of Internal Medicine (D.M.H., C.F.Z., G.S.)
Weston R Spivia Advanced Clinical Biosystems Research Institute, Cedars-Sinai Heart Institute, and Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, CA (S.T.P., V.V., W.R.S., J.E.V.E.)
Grace B Athas Department of Pathology, Louisiana State Health Science Center, New Orleans (G.B.A., D.T., R.C.V.H.)
Dana Troxclair Department of Pathology, Louisiana State Health Science Center, New Orleans (G.B.A., D.T., R.C.V.H.)
James E Hixson Department of Epidemiology, Human Genetics and Environmental Sciences, Human Genetics Center, School of Public Health, University of Texas Health Science Center at Houston (G.J., J.E.H.)
Richard S Vander Heide Department of Pathology, Louisiana State Health Science Center, New Orleans (G.B.A., D.T., R.C.V.H.)
Yue Wang Department of Electrical and Computer Engineering, Virginia Tech, Arlington (G.Y., L.C., Y.F., Yizhi Wang, Yue Wang)
Jennifer E Van Eyk Advanced Clinical Biosystems Research Institute, Cedars-Sinai Heart Institute, and Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, CA (S.T.P., V.V., W.R.S., J.E.V.E.)

Computational de novo discovery of distinguishing genes for biological processes and cell types in complex tissues. PLoS One 2018;13:e0193067. [PMID: 29494600 PMCID: PMC5832224 DOI: 10.1371/journal.pone.0193067] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Received: 11/14/2017] [Accepted: 02/02/2018] [Indexed: 11/30/2022] Open

Houseman EA, Kile ML, Christiani DC, Ince TA, Kelsey KT, Marsit CJ. Reference-free deconvolution of DNA methylation data and mediation by cell composition effects. BMC Bioinformatics 2016;17:259. [PMID: 27358049 PMCID: PMC4928286 DOI: 10.1186/s12859-016-1140-4] [Citation(s) in RCA: 160] [Impact Index Per Article: 20.0] [Received: 04/04/2016] [Accepted: 06/19/2016] [Indexed: 12/28/2022] Open

Teschendorff AE, Jones A, Widschwendter M. Stochastic epigenetic outliers can define field defects in cancer. BMC Bioinformatics 2016;17:178. [PMID: 27103033 PMCID: PMC4840974 DOI: 10.1186/s12859-016-1056-z] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Received: 12/19/2015] [Accepted: 04/16/2016] [Indexed: 12/14/2022] Open