Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zhou W, Altman RB. Data-driven human transcriptomic modules determined by independent component analysis. BMC Bioinformatics 2018;19:327. [PMID: 30223787 PMCID: PMC6142401 DOI: 10.1186/s12859-018-2338-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2017] [Accepted: 08/28/2018] [Indexed: 12/20/2022] Open

For:	Zhou W, Altman RB. Data-driven human transcriptomic modules determined by independent component analysis. BMC Bioinformatics 2018;19:327. [PMID: 30223787 PMCID: PMC6142401 DOI: 10.1186/s12859-018-2338-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2017] [Accepted: 08/28/2018] [Indexed: 12/20/2022] Open

Number

Cited by Other Article(s)

Oshternian SR, Loipfinger S, Bhattacharya A, Fehrmann RSN. Exploring combinations of dimensionality reduction, transfer learning, and regularization methods for predicting binary phenotypes with transcriptomic data. BMC Bioinformatics 2024;25:167. [PMID: 38671342 PMCID: PMC11046904 DOI: 10.1186/s12859-024-05795-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 04/22/2024] [Indexed: 04/28/2024] Open

Abstract

BACKGROUND

Numerous transcriptomic-based models have been developed to predict or understand the fundamental mechanisms driving biological phenotypes. However, few models have successfully transitioned into clinical practice due to challenges associated with generalizability and interpretability. To address these issues, researchers have turned to dimensionality reduction methods and have begun implementing transfer learning approaches.

METHODS

In this study, we aimed to determine the optimal combination of dimensionality reduction and regularization methods for predictive modeling. We applied seven dimensionality reduction methods to various datasets, including two supervised methods (linear optimal low-rank projection and low-rank canonical correlation analysis), two unsupervised methods [principal component analysis and consensus independent component analysis (c-ICA)], and three methods [autoencoder (AE), adversarial variational autoencoder, and c-ICA] within a transfer learning framework, trained on > 140,000 transcriptomic profiles. To assess the performance of the different combinations, we used a cross-validation setup encapsulated within a permutation testing framework, analyzing 30 different transcriptomic datasets with binary phenotypes. Furthermore, we included datasets with small sample sizes and phenotypes of varying degrees of predictability, and we employed independent datasets for validation.

RESULTS

Our findings revealed that regularized models without dimensionality reduction achieved the highest predictive performance, challenging the necessity of dimensionality reduction when the primary goal is to achieve optimal predictive performance. However, models using AE and c-ICA with transfer learning for dimensionality reduction showed comparable performance, with enhanced interpretability and robustness of predictors, compared to models using non-dimensionality-reduced data.

CONCLUSION

These findings offer valuable insights into the optimal combination of strategies for enhancing the predictive performance, interpretability, and generalizability of transcriptomic-based models.

Collapse

Wang J, Wan YW, Al-Ouran R, Huang M, Liu Z. CoRegNet: unraveling gene co-regulation networks from public RNA-Seq repositories using a beta-binomial statistical model. Brief Bioinform 2023;25:bbad380. [PMID: 38113079 PMCID: PMC10729864 DOI: 10.1093/bib/bbad380] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Revised: 09/13/2023] [Indexed: 12/21/2023] Open

Chow YL, Singh S, Carpenter AE, Way GP. Predicting drug polypharmacology from cell morphology readouts using variational autoencoder latent space arithmetic. PLoS Comput Biol 2022;18:e1009888. [PMID: 35213530 PMCID: PMC8906577 DOI: 10.1371/journal.pcbi.1009888] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Revised: 03/09/2022] [Accepted: 02/01/2022] [Indexed: 01/13/2023] Open

Ashenova A, Daniyarov A, Molkenov A, Sharip A, Zinovyev A, Kairov U. Meta-Analysis of Esophageal Cancer Transcriptomes Using Independent Component Analysis. Front Genet 2021;12:683632. [PMID: 34795689 PMCID: PMC8594933 DOI: 10.3389/fgene.2021.683632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2021] [Accepted: 10/05/2021] [Indexed: 11/17/2022] Open

Altman MC, Rinchai D, Baldwin N, Toufiq M, Whalen E, Garand M, Syed Ahamed Kabeer B, Alfaki M, Presnell SR, Khaenam P, Ayllón-Benítez A, Mougin F, Thébault P, Chiche L, Jourde-Chiche N, Phillips JT, Klintmalm G, O'Garra A, Berry M, Bloom C, Wilkinson RJ, Graham CM, Lipman M, Lertmemongkolchai G, Bedognetti D, Thiebaut R, Kheradmand F, Mejias A, Ramilo O, Palucka K, Pascual V, Banchereau J, Chaussabel D. Development of a fixed module repertoire for the analysis and interpretation of blood transcriptome data. Nat Commun 2021;12:4385. [PMID: 34282143 PMCID: PMC8289976 DOI: 10.1038/s41467-021-24584-w] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Accepted: 06/21/2021] [Indexed: 01/21/2023] Open

Affiliation(s)

Matthew C Altman Systems Immunology, Benaroya Research Institute, Seattle, WA, USA. Division of Allergy and Infectious Diseases, University of Washington, Seattle, WA, USA.
Darawan Rinchai Research Branch, Sidra Medicine, Doha, Qatar.
Nicole Baldwin Baylor Institute for Immunology Research, Baylor Research Institute, Dallas, TX, USA
Mohammed Toufiq Research Branch, Sidra Medicine, Doha, Qatar
Elizabeth Whalen Systems Immunology, Benaroya Research Institute, Seattle, WA, USA
Mathieu Garand Research Branch, Sidra Medicine, Doha, Qatar
Basirudeen Syed Ahamed Kabeer Research Branch, Sidra Medicine, Doha, Qatar
Mohamed Alfaki Research Branch, Sidra Medicine, Doha, Qatar
Scott R Presnell Systems Immunology, Benaroya Research Institute, Seattle, WA, USA
Prasong Khaenam Systems Immunology, Benaroya Research Institute, Seattle, WA, USA
Aaron Ayllón-Benítez Inserm U1219 Bordeaux Population Health Research Center, Bordeaux University, Bordeaux, France
Fleur Mougin Inserm U1219 Bordeaux Population Health Research Center, Bordeaux University, Bordeaux, France
Patricia Thébault LaBRI, CNRS UMR5800, Bordeaux University, Bordeaux, France
Laurent Chiche Department of Internal Medicine, Hopital Européen, Marseille, France
Noemie Jourde-Chiche Aix-Marseille University, C2VN, INSERM 1263, INRA 1260, Marseille, France
J Theodore Phillips Baylor Institute for Immunology Research, Baylor Research Institute, Dallas, TX, USA
Goran Klintmalm Baylor Institute for Immunology Research, Baylor Research Institute, Dallas, TX, USA
Anne O'Garra Laboratory of Immunoregulation and Infection, The Francis Crick Institute, London, UK National Heart and Lung Institute, Imperial College London, London, UK
Matthew Berry Royal Cornwall Hospitals NHS Trust, Truro, UK
Chloe Bloom National Heart and Lung Institute, Imperial College London, London, UK
Robert J Wilkinson The Francis Crick Institute, London, UK Department of Infectious Disease, Imperial College, London, UK Wellcome Center for Infectious Diseases Research in Africa and Department of Medicine, Institute of Infectious Diseases and Molecular Medicine, University of Cape Town Observatory, 7925, Cape Town, Republic of South Africa
Christine M Graham Laboratory of Immunoregulation and Infection, The Francis Crick Institute, London, UK
Marc Lipman UCL Respiratory, Division of Medicine, University College London, London, UK
Ganjana Lertmemongkolchai Centre for Research and Development of Medical Diagnostic Laboratories, Faculty of Associated Medical Sciences, Khon Kaen University, Khon Kaen, Thailand
Davide Bedognetti Research Branch, Sidra Medicine, Doha, Qatar
Rodolphe Thiebaut Inserm U1219 Bordeaux Population Health Research Center, Bordeaux University, Bordeaux, France
Farrah Kheradmand Baylor College of Medicine & Center for Translational Research on Inflammatory Diseases, Michael E. DeBakey VAMC, Houston, TX, USA
Asuncion Mejias Abigail Wexner Research Institute at Nationwide Children's Hospital and the Ohio State University School of Medicine, Columbus, OH, USA
Octavio Ramilo Abigail Wexner Research Institute at Nationwide Children's Hospital and the Ohio State University School of Medicine, Columbus, OH, USA
Karolina Palucka Baylor Institute for Immunology Research, Baylor Research Institute, Dallas, TX, USA The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
Virginia Pascual Baylor Institute for Immunology Research, Baylor Research Institute, Dallas, TX, USA Weill Cornell Medicine, New York, NY, USA
Jacques Banchereau Baylor Institute for Immunology Research, Baylor Research Institute, Dallas, TX, USA The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
Damien Chaussabel Systems Immunology, Benaroya Research Institute, Seattle, WA, USA. Research Branch, Sidra Medicine, Doha, Qatar.

Collapse

Rinchai D, Roelands J, Toufiq M, Hendrickx W, Altman MC, Bedognetti D, Chaussabel D. BloodGen3Module: Blood transcriptional module repertoire analysis and visualization using R. Bioinformatics 2021;37:2382-2389. [PMID: 33624743 PMCID: PMC8388021 DOI: 10.1093/bioinformatics/btab121] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Revised: 01/14/2021] [Accepted: 02/23/2021] [Indexed: 11/28/2022] Open

Lee AJ, Park Y, Doing G, Hogan DA, Greene CS. Correcting for experiment-specific variability in expression compendia can remove underlying signals. Gigascience 2020;9:giaa117. [PMID: 33140829 PMCID: PMC7607552 DOI: 10.1093/gigascience/giaa117] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Revised: 08/28/2020] [Accepted: 09/29/2020] [Indexed: 11/14/2022] Open

Application of Transcriptional Gene Modules to Analysis of Caenorhabditis elegans' Gene Expression Data. G3-GENES GENOMES GENETICS 2020;10:3623-3638. [PMID: 32759329 PMCID: PMC7534440 DOI: 10.1534/g3.120.401270] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]

Byrd JB, Greene AC, Prasad DV, Jiang X, Greene CS. Responsible, practical genomic data sharing that accelerates research. Nat Rev Genet 2020;21:615-629. [PMID: 32694666 PMCID: PMC7974070 DOI: 10.1038/s41576-020-0257-5] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/08/2020] [Indexed: 12/13/2022]

Rinchai D, Syed Ahamed Kabeer B, Toufiq M, Tatari-Calderone Z, Deola S, Brummaier T, Garand M, Branco R, Baldwin N, Alfaki M, Altman MC, Ballestrero A, Bassetti M, Zoppoli G, De Maria A, Tang B, Bedognetti D, Chaussabel D. A modular framework for the development of targeted Covid-19 blood transcript profiling panels. J Transl Med 2020;18:291. [PMID: 32736569 PMCID: PMC7393249 DOI: 10.1186/s12967-020-02456-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Accepted: 07/21/2020] [Indexed: 02/07/2023] Open

Affiliation(s)

Darawan Rinchai Sidra Medicine, Doha, Qatar
Basirudeen Syed Ahamed Kabeer Sidra Medicine, Doha, Qatar
Mohammed Toufiq Sidra Medicine, Doha, Qatar
Zohreh Tatari-Calderone Sidra Medicine, Doha, Qatar
Sara Deola Sidra Medicine, Doha, Qatar
Tobias Brummaier Shoklo Malaria Research Unit, Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, Thailand Centre for Tropical Medicine and Global Health, Nuffield Department of Medicine, University of Oxford, Oxford, UK Swiss Tropical and Public Health Institute, Basel, Switzerland University of Basel, Basel, Switzerland
Mathieu Garand Sidra Medicine, Doha, Qatar
Ricardo Branco Sidra Medicine, Doha, Qatar
Nicole Baldwin Baylor Institute for Immunology Research and Baylor Research Institute, Dallas, TX, USA
Mohamed Alfaki Sidra Medicine, Doha, Qatar
Matthew C Altman Division of Allergy and Infectious Diseases, University of Washington, Seattle, WA, USA Systems Immunology, Benaroya Research Institute, Seattle, WA, USA
Alberto Ballestrero Department of Internal Medicine, Università degli Studi di Genova, Genoa, Italy IRCCS Ospedale Policlinico San Martino, Genoa, Italy
Matteo Bassetti Division of Infectious and Tropical Diseases, IRCCS Ospedale Policlinico San Martino, Genoa, Italy Department of Health Sciences, University of Genoa, Genoa, Italy
Gabriele Zoppoli Department of Internal Medicine, Università degli Studi di Genova, Genoa, Italy IRCCS Ospedale Policlinico San Martino, Genoa, Italy
Andrea De Maria Division of Infectious and Tropical Diseases, IRCCS Ospedale Policlinico San Martino, Genoa, Italy Department of Health Sciences, University of Genoa, Genoa, Italy
Benjamin Tang Nepean Clinical School, University of Sydney, Sydney, NSW, Australia
Davide Bedognetti Sidra Medicine, Doha, Qatar Department of Internal Medicine, Università degli Studi di Genova, Genoa, Italy
Damien Chaussabel Sidra Medicine, Doha, Qatar.

Collapse

Pineda S, Bunis DG, Kosti I, Sirota M. Data Integration for Immunology. Annu Rev Biomed Data Sci 2020. [DOI: 10.1146/annurev-biodatasci-012420-122454] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Way GP, Zietz M, Rubinetti V, Himmelstein DS, Greene CS. Compressing gene expression data using multiple latent space dimensionalities learns complementary biological representations. Genome Biol 2020;21:109. [PMID: 32393369 PMCID: PMC7212571 DOI: 10.1186/s13059-020-02021-3] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Accepted: 04/16/2020] [Indexed: 12/27/2022] Open

Abstract

BACKGROUND

Unsupervised compression algorithms applied to gene expression data extract latent or hidden signals representing technical and biological sources of variation. However, these algorithms require a user to select a biologically appropriate latent space dimensionality. In practice, most researchers fit a single algorithm and latent dimensionality. We sought to determine the extent by which selecting only one fit limits the biological features captured in the latent representations and, consequently, limits what can be discovered with subsequent analyses.

RESULTS

We compress gene expression data from three large datasets consisting of adult normal tissue, adult cancer tissue, and pediatric cancer tissue. We train many different models across a large range of latent space dimensionalities and observe various performance differences. We identify more curated pathway gene sets significantly associated with individual dimensions in denoising autoencoder and variational autoencoder models trained using an intermediate number of latent dimensionalities. Combining compressed features across algorithms and dimensionalities captures the most pathway-associated representations. When trained with different latent dimensionalities, models learn strongly associated and generalizable biological representations including sex, neuroblastoma MYCN amplification, and cell types. Stronger signals, such as tumor type, are best captured in models trained at lower dimensionalities, while more subtle signals such as pathway activity are best identified in models trained with more latent dimensionalities.

CONCLUSIONS

There is no single best latent dimensionality or compression algorithm for analyzing gene expression data. Instead, using features derived from different compression models across multiple latent space dimensionalities enhances biological representations.

Collapse

Sompairac N, Nazarov PV, Czerwinska U, Cantini L, Biton A, Molkenov A, Zhumadilov Z, Barillot E, Radvanyi F, Gorban A, Kairov U, Zinovyev A. Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets. Int J Mol Sci 2019;20:E4414. [PMID: 31500324 PMCID: PMC6771121 DOI: 10.3390/ijms20184414] [Citation(s) in RCA: 44] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2019] [Revised: 09/02/2019] [Accepted: 09/04/2019] [Indexed: 12/13/2022] Open

Affiliation(s)

Nicolas Sompairac Institut Curie, PSL Research University, 75005 Paris, France. INSERM U900, 75248 Paris, France. CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France. Centre de Recherches Interdisciplinaires, Université Paris Descartes, 75004 Paris, France.
Petr V Nazarov Multiomics Data Science Research Group, Quantitative Biology Unit, Luxembourg Institute of Health (LIH), L-1445 Strassen, Luxembourg.
Urszula Czerwinska Institut Curie, PSL Research University, 75005 Paris, France. INSERM U900, 75248 Paris, France. CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France.
Laura Cantini Computational Systems Biology Team, Institut de Biologie de l'Ecole Normale Supérieure, CNRS UMR8197, INSERM U1024, Ecole Normale Supérieure, PSL Research University, 75005 Paris, France.
Anne Biton Centre de Bioinformatique, Biostatistique et Biologie Intégrative (C3BI, USR 3756 Institut Pasteur et CNRS), 75015 Paris, France.
Askhat Molkenov Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, 010000 Nur-Sultan, Kazakhstan.
Zhaxybay Zhumadilov Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, 010000 Nur-Sultan, Kazakhstan. University Medical Center, Nazarbayev University, 010000 Nur-Sultan, Kazakhstan.
Emmanuel Barillot Institut Curie, PSL Research University, 75005 Paris, France. INSERM U900, 75248 Paris, France. CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France.
Francois Radvanyi Institut Curie, PSL Research University, 75005 Paris, France. CNRS, UMR 144, 75248 Paris, France.
Alexander Gorban Center for Mathematical Modeling, University of Leicester, Leicester LE1 7RH, UK. Lobachevsky University, 603022 Nizhny Novgorod, Russia.
Ulykbek Kairov Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, 010000 Nur-Sultan, Kazakhstan.
Andrei Zinovyev Institut Curie, PSL Research University, 75005 Paris, France. INSERM U900, 75248 Paris, France. CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France.

Collapse

Way GP, Greene CS. Discovering Pathway and Cell Type Signatures in Transcriptomic Compendia with Machine Learning. Annu Rev Biomed Data Sci 2019. [DOI: 10.1146/annurev-biodatasci-072018-021348] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Pal A, Chiu HY, Taneja R. Genetics, epigenetics and redox homeostasis in rhabdomyosarcoma: Emerging targets and therapeutics. Redox Biol 2019;25:101124. [PMID: 30709791 PMCID: PMC6859585 DOI: 10.1016/j.redox.2019.101124] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Revised: 01/20/2019] [Accepted: 01/24/2019] [Indexed: 12/16/2022] Open