1
Pérez-González AP, García-Kroepfly AL, Pérez-Fuentes KA, García-Reyes RI, Solis-Roldan FF, Alba-González JA, Hernández-Lemus E, de Anda-Jáuregui G. The ROSMAP project: aging and neurodegenerative diseases through omic sciences. Front Neuroinform 2024; 18:1443865. [PMID: 39351424 PMCID: PMC11439699 DOI: 10.3389/fninf.2024.1443865] [Received: 06/04/2024] [Accepted: 08/26/2024] [Indexed: 10/04/2024]
Abstract
The Religious Order Study and Memory and Aging Project (ROSMAP) is an initiative that integrates two longitudinal cohort studies, which have been collecting clinicopathological and molecular data since the early 1990s. This extensive dataset includes a wide array of omic data, revealing the complex interactions between molecular levels in neurodegenerative diseases (ND) and aging. ND are frequently associated with morbidity and cognitive decline in older adults. Omics research, in conjunction with clinical variables, is crucial for advancing our understanding of the diagnosis and treatment of ND. This summary reviews the extensive omics research (encompassing genomics, transcriptomics, proteomics, metabolomics, epigenomics, and multiomics) conducted through the ROSMAP study. It highlights the significant advancements in understanding the mechanisms underlying ND, with a particular focus on Alzheimer's disease.
Affiliation(s)
- Alejandra P Pérez-González
  - División de Genómica Computacional, Instituto Nacional de Medicina Genómica, Mexico City, Mexico
  - Programa de Doctorado en Ciencias Biomedicas, Unidad de Posgrado Edificio B Primer Piso, Ciudad Universitaria, Mexico City, Mexico
  - Facultad de Estudios Superiores Iztacala UNAM, Mexico City, Mexico
- Enrique Hernández-Lemus
  - División de Genómica Computacional, Instituto Nacional de Medicina Genómica, Mexico City, Mexico
  - Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Mexico City, Mexico
- Guillermo de Anda-Jáuregui
  - División de Genómica Computacional, Instituto Nacional de Medicina Genómica, Mexico City, Mexico
  - Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Mexico City, Mexico
  - Programa de Investigadoras e Investigadores por México Consejo Nacional de Humanidades, Ciencias y Tecnologías (CONAHCYT), Mexico City, Mexico
2
Emani PS, Liu JJ, Clarke D, Jensen M, Warrell J, Gupta C, Meng R, Lee CY, Xu S, Dursun C, Lou S, Chen Y, Chu Z, Galeev T, Hwang A, Li Y, Ni P, Zhou X, Bakken TE, Bendl J, Bicks L, Chatterjee T, Cheng L, Cheng Y, Dai Y, Duan Z, Flaherty M, Fullard JF, Gancz M, Garrido-Martín D, Gaynor-Gillett S, Grundman J, Hawken N, Henry E, Hoffman GE, Huang A, Jiang Y, Jin T, Jorstad NL, Kawaguchi R, Khullar S, Liu J, Liu J, Liu S, Ma S, Margolis M, Mazariegos S, Moore J, Moran JR, Nguyen E, Phalke N, Pjanic M, Pratt H, Quintero D, Rajagopalan AS, Riesenmy TR, Shedd N, Shi M, Spector M, Terwilliger R, Travaglini KJ, Wamsley B, Wang G, Xia Y, Xiao S, Yang AC, Zheng S, Gandal MJ, Lee D, Lein ES, Roussos P, Sestan N, Weng Z, White KP, Won H, Girgenti MJ, Zhang J, Wang D, Geschwind D, Gerstein M. Single-cell genomics and regulatory networks for 388 human brains. Science 2024; 384:eadi5199. [PMID: 38781369 PMCID: PMC11365579 DOI: 10.1126/science.adi5199] [Received: 05/06/2023] [Accepted: 04/05/2024] [Indexed: 05/25/2024]
Abstract
Single-cell genomics is a powerful tool for studying heterogeneous tissues such as the brain. Yet little is understood about how genetic variants influence cell-level gene expression. Addressing this, we uniformly processed single-nuclei, multiomics datasets into a resource comprising >2.8 million nuclei from the prefrontal cortex across 388 individuals. For 28 cell types, we assessed population-level variation in expression and chromatin across gene families and drug targets. We identified >550,000 cell type-specific regulatory elements and >1.4 million single-cell expression quantitative trait loci, which we used to build cell-type regulatory and cell-to-cell communication networks. These networks manifest cellular changes in aging and neuropsychiatric disorders. We further constructed an integrative model accurately imputing single-cell expression and simulating perturbations; the model prioritized ~250 disease-risk genes and drug targets with associated cell types.
Affiliation(s)
- Prashant S Emani
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Jason J Liu
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Declan Clarke
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Matthew Jensen
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Jonathan Warrell
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Chirag Gupta
  - Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53706, USA
  - Waisman Center, University of Wisconsin-Madison, Madison, WI 53705, USA
- Ran Meng
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Che Yu Lee
  - Department of Computer Science, University of California, Irvine, CA 92697, USA
- Siwei Xu
  - Department of Computer Science, University of California, Irvine, CA 92697, USA
- Cagatay Dursun
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Shaoke Lou
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Yuhang Chen
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Zhiyuan Chu
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
- Timur Galeev
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Ahyeon Hwang
  - Department of Computer Science, University of California, Irvine, CA 92697, USA
  - Mathematical, Computational and Systems Biology, University of California, Irvine, CA 92697, USA
- Yunyang Li
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
  - Department of Computer Science, Yale University, New Haven, CT 06520, USA
- Pengyu Ni
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Xiao Zhou
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Jaroslav Bendl
  - Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Department of Genetics and Genomic Science, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Lucy Bicks
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
- Tanima Chatterjee
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Yuyan Cheng
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
  - Department of Ophthalmology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Yi Dai
  - Department of Computer Science, University of California, Irvine, CA 92697, USA
- Ziheng Duan
  - Department of Computer Science, University of California, Irvine, CA 92697, USA
- John F Fullard
  - Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Department of Genetics and Genomic Science, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Michael Gancz
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Diego Garrido-Martín
  - Department of Genetics, Microbiology and Statistics, Universitat de Barcelona, Barcelona 08028, Spain
- Sophia Gaynor-Gillett
  - Tempus Labs, Chicago, IL 60654, USA
  - Department of Biology, Cornell College, Mount Vernon, IA 52314, USA
- Jennifer Grundman
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
- Natalie Hawken
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
- Ella Henry
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Gabriel E Hoffman
  - Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Department of Genetics and Genomic Science, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Mental Illness Research Education and Clinical Center, James J. Peters VA Medical Center, Bronx, NY 10468, USA
  - Center for Precision Medicine and Translational Therapeutics, James J. Peters VA Medical Center, Bronx, NY 10468, USA
- Ao Huang
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
- Yunzhe Jiang
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Ting Jin
  - Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53706, USA
  - Waisman Center, University of Wisconsin-Madison, Madison, WI 53705, USA
- Riki Kawaguchi
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
  - Center for Autism Research and Treatment, Semel Institute, University of California, Los Angeles, CA 90095, USA
- Saniya Khullar
  - Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53706, USA
  - Waisman Center, University of Wisconsin-Madison, Madison, WI 53705, USA
- Jianyin Liu
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
- Junhao Liu
  - Department of Computer Science, University of California, Irvine, CA 92697, USA
- Shuang Liu
  - Waisman Center, University of Wisconsin-Madison, Madison, WI 53705, USA
- Shaojie Ma
  - Department of Neuroscience, Yale University, New Haven, CT 06510, USA
  - Institute of Neuroscience, CAS Center for Excellence in Brain Science and Intelligence Technology, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- Samantha Mazariegos
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
- Jill Moore
  - Department of Genomics and Computational Biology, UMass Chan Medical School, Worcester, MA 01605, USA
- Eric Nguyen
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Nishigandha Phalke
  - Department of Genomics and Computational Biology, UMass Chan Medical School, Worcester, MA 01605, USA
- Milos Pjanic
  - Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Department of Genetics and Genomic Science, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Henry Pratt
  - Department of Genomics and Computational Biology, UMass Chan Medical School, Worcester, MA 01605, USA
- Diana Quintero
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
- Tiernon R Riesenmy
  - Department of Statistics and Data Science, Yale University, New Haven, CT 06520, USA
- Nicole Shedd
  - Department of Genomics and Computational Biology, UMass Chan Medical School, Worcester, MA 01605, USA
- Rosemarie Terwilliger
  - Department of Psychiatry, Yale University School of Medicine, New Haven, CT 06520, USA
- Brie Wamsley
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
- Gaoyuan Wang
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Yan Xia
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Shaohua Xiao
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
- Andrew C Yang
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Suchen Zheng
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
- Michael J Gandal
  - Interdepartmental Program in Bioinformatics, University of California, Los Angeles, Los Angeles, CA 90095, USA
  - Department of Psychiatry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
  - Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
  - Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
  - Lifespan Brain Institute, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Donghoon Lee
  - Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Department of Genetics and Genomic Science, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Ed S Lein
  - Allen Institute for Brain Science, Seattle, WA 98109, USA
  - Department of Neurological Surgery, University of Washington, Seattle, WA 98195, USA
  - Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, USA
- Panos Roussos
  - Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Department of Genetics and Genomic Science, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
  - Mental Illness Research Education and Clinical Center, James J. Peters VA Medical Center, Bronx, NY 10468, USA
  - Center for Precision Medicine and Translational Therapeutics, James J. Peters VA Medical Center, Bronx, NY 10468, USA
- Nenad Sestan
  - Department of Neuroscience, Yale University, New Haven, CT 06510, USA
- Zhiping Weng
  - Department of Genomics and Computational Biology, UMass Chan Medical School, Worcester, MA 01605, USA
- Kevin P White
  - Yong Loo Lin School of Medicine, National University of Singapore, 117597 Singapore
- Hyejung Won
  - Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Matthew J Girgenti
  - Department of Psychiatry, Yale University School of Medicine, New Haven, CT 06520, USA
  - Wu Tsai Institute, Yale University, New Haven, CT 06520, USA
  - Clinical Neuroscience Division, National Center for Posttraumatic Stress Disorder, Veterans Affairs Connecticut Healthcare System, West Haven, CT 06516, USA
- Jing Zhang
  - Department of Computer Science, University of California, Irvine, CA 92697, USA
- Daifeng Wang
  - Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53706, USA
  - Waisman Center, University of Wisconsin-Madison, Madison, WI 53705, USA
  - Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI 53706, USA
- Daniel Geschwind
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
  - Center for Autism Research and Treatment, Semel Institute, University of California, Los Angeles, CA 90095, USA
  - Department of Psychiatry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
  - Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
  - Institute for Precision Health, David Geffen School of Medicine, University of California, Los Angeles, CA 90095, USA
- Mark Gerstein
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
  - Department of Computer Science, Yale University, New Haven, CT 06520, USA
  - Department of Statistics and Data Science, Yale University, New Haven, CT 06520, USA
  - Department of Biomedical Informatics & Data Science, Yale University, New Haven, CT 06520, USA
3
Emani PS, Liu JJ, Clarke D, Jensen M, Warrell J, Gupta C, Meng R, Lee CY, Xu S, Dursun C, Lou S, Chen Y, Chu Z, Galeev T, Hwang A, Li Y, Ni P, Zhou X, Bakken TE, Bendl J, Bicks L, Chatterjee T, Cheng L, Cheng Y, Dai Y, Duan Z, Flaherty M, Fullard JF, Gancz M, Garrido-Martín D, Gaynor-Gillett S, Grundman J, Hawken N, Henry E, Hoffman GE, Huang A, Jiang Y, Jin T, Jorstad NL, Kawaguchi R, Khullar S, Liu J, Liu J, Liu S, Ma S, Margolis M, Mazariegos S, Moore J, Moran JR, Nguyen E, Phalke N, Pjanic M, Pratt H, Quintero D, Rajagopalan AS, Riesenmy TR, Shedd N, Shi M, Spector M, Terwilliger R, Travaglini KJ, Wamsley B, Wang G, Xia Y, Xiao S, Yang AC, Zheng S, Gandal MJ, Lee D, Lein ES, Roussos P, Sestan N, Weng Z, White KP, Won H, Girgenti MJ, Zhang J, Wang D, Geschwind D, Gerstein M. Single-cell genomics and regulatory networks for 388 human brains. bioRxiv: The Preprint Server for Biology 2024:2024.03.18.585576. [PMID: 38562822 PMCID: PMC10983939 DOI: 10.1101/2024.03.18.585576] [Indexed: 04/04/2024]
Abstract
Single-cell genomics is a powerful tool for studying heterogeneous tissues such as the brain. Yet, little is understood about how genetic variants influence cell-level gene expression. Addressing this, we uniformly processed single-nuclei, multi-omics datasets into a resource comprising >2.8M nuclei from the prefrontal cortex across 388 individuals. For 28 cell types, we assessed population-level variation in expression and chromatin across gene families and drug targets. We identified >550K cell-type-specific regulatory elements and >1.4M single-cell expression-quantitative-trait loci, which we used to build cell-type regulatory and cell-to-cell communication networks. These networks manifest cellular changes in aging and neuropsychiatric disorders. We further constructed an integrative model accurately imputing single-cell expression and simulating perturbations; the model prioritized ~250 disease-risk genes and drug targets with associated cell types.
Affiliation(s)
- Prashant S Emani
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Jason J Liu
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Declan Clarke
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Matthew Jensen
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Jonathan Warrell
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Chirag Gupta
  - Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
  - Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Ran Meng
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Che Yu Lee
  - Department of Computer Science, University of California, Irvine, CA, 92697, USA
- Siwei Xu
  - Department of Computer Science, University of California, Irvine, CA, 92697, USA
- Cagatay Dursun
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Shaoke Lou
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Yuhang Chen
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Zhiyuan Chu
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
- Timur Galeev
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Ahyeon Hwang
  - Department of Computer Science, University of California, Irvine, CA, 92697, USA
  - Mathematical, Computational and Systems Biology, University of California, Irvine, CA, 92697, USA
- Yunyang Li
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
  - Department of Computer Science, Yale University, New Haven, CT, 06520, USA
- Pengyu Ni
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Xiao Zhou
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Jaroslav Bendl
  - Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
  - Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
  - Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
  - Department of Genetics and Genomic Science, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Lucy Bicks
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
- Tanima Chatterjee
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Yuyan Cheng
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
  - Department of Ophthalmology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Yi Dai
  - Department of Computer Science, University of California, Irvine, CA, 92697, USA
- Ziheng Duan
  - Department of Computer Science, University of California, Irvine, CA, 92697, USA
- John F Fullard
  - Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
  - Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
  - Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
  - Department of Genetics and Genomic Science, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Michael Gancz
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Diego Garrido-Martín
  - Department of Genetics, Microbiology and Statistics, Universitat de Barcelona, Barcelona, 08028, Spain
- Sophia Gaynor-Gillett
  - Tempus Labs, Inc., Chicago, IL, 60654, USA
  - Department of Biology, Cornell College, Mount Vernon, IA, 52314, USA
- Jennifer Grundman
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
- Natalie Hawken
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
- Ella Henry
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Gabriel E Hoffman
  - Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
  - Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
  - Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
  - Department of Genetics and Genomic Science, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
  - Mental Illness Research Education and Clinical Center, James J. Peters VA Medical Center, Bronx, NY, 10468, USA
  - Center for Precision Medicine and Translational Therapeutics, James J. Peters VA Medical Center, Bronx, NY, 10468, USA
- Ao Huang
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
- Yunzhe Jiang
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Ting Jin
  - Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
  - Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Riki Kawaguchi
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
  - Center for Autism Research and Treatment, Semel Institute, University of California, Los Angeles, CA, 90095, USA
- Saniya Khullar
  - Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
  - Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Jianyin Liu
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
- Junhao Liu
  - Department of Computer Science, University of California, Irvine, CA, 92697, USA
- Shuang Liu
  - Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Shaojie Ma
  - Department of Neuroscience, Yale University, New Haven, CT, 06510, USA
  - Institute of Neuroscience, CAS Center for Excellence in Brain Science and Intelligence Technology, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
- Michael Margolis
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
- Samantha Mazariegos
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
- Jill Moore
  - Department of Genomics and Computational Biology, UMass Chan Medical School, Worcester, MA, 01605, USA
- Eric Nguyen
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Nishigandha Phalke
  - Department of Genomics and Computational Biology, UMass Chan Medical School, Worcester, MA, 01605, USA
- Milos Pjanic
  - Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
  - Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
  - Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
  - Department of Genetics and Genomic Science, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Henry Pratt
  - Department of Genomics and Computational Biology, UMass Chan Medical School, Worcester, MA, 01605, USA
- Diana Quintero
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
- Tiernon R Riesenmy
  - Department of Statistics & Data Science, Yale University, New Haven, CT, 06520, USA
- Nicole Shedd
  - Department of Genomics and Computational Biology, UMass Chan Medical School, Worcester, MA, 01605, USA
- Manman Shi
  - Tempus Labs, Inc., Chicago, IL, 60654, USA
- Rosemarie Terwilliger
  - Department of Psychiatry, Yale University School of Medicine, New Haven, CT, 06520, USA
- Brie Wamsley
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
- Gaoyuan Wang
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Yan Xia
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Shaohua Xiao
  - Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
- Andrew C Yang
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
  - Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Suchen Zheng
  - Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
| | - Michael J Gandal
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, Los Angeles, CA, 90095, USA
- Department of Psychiatry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
- Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Lifespan Brain Institute, The Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA
| | - Donghoon Lee
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Department of Genetics and Genomic Science, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Ed S Lein
- Allen Institute for Brain Science, Seattle, WA, 98109, USA
- Department of Neurological Surgery, University of Washington, Seattle, WA, 98195, USA
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA, 98195, USA
| | - Panos Roussos
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Department of Genetics and Genomic Science, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Mental Illness Research Education and Clinical Center, James J. Peters VA Medical Center, Bronx, NY, 10468, USA
- Center for Precision Medicine and Translational Therapeutics, James J. Peters VA Medical Center, Bronx, NY, 10468, USA
| | - Nenad Sestan
- Department of Neuroscience, Yale University, New Haven, CT, 06510, USA
| | - Zhiping Weng
- Department of Genomics and Computational Biology, UMass Chan Medical School, Worcester, MA, 01605, USA
| | - Kevin P White
- Yong Loo Lin School of Medicine, National University of Singapore, 117597, Singapore
| | - Hyejung Won
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Matthew J Girgenti
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, 06520, USA
- Wu Tsai Institute, Yale University, New Haven, CT, 06520, USA
- Clinical Neuroscience Division, National Center for Posttraumatic Stress Disorder, Veterans Affairs Connecticut Healthcare System, West Haven, CT, 06516, USA
| | - Jing Zhang
- Department of Computer Science, University of California, Irvine, CA, 92697, USA
| | - Daifeng Wang
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Daniel Geschwind
- Program in Neurogenetics, Department of Neurology, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
- Center for Autism Research and Treatment, Semel Institute, University of California, Los Angeles, CA, 90095, USA
- Department of Psychiatry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
- Institute for Precision Health, David Geffen School of Medicine, University of California, Los Angeles, CA, 90095, USA
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT, 06520, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06520, USA
- Department of Computer Science, Yale University, New Haven, CT, 06520, USA
- Department of Statistics & Data Science, Yale University, New Haven, CT, 06520, USA
- Department of Biomedical Informatics & Data Science, Yale University, New Haven, CT, 06520, USA
|
4
|
Browning JL, Wilson KA, Shandra O, Wei X, Mahmutovic D, Maharathi B, Robel S, VandeVord PJ, Olsen ML. Applying Proteomics and Computational Approaches to Identify Novel Targets in Blast-Associated Post-Traumatic Epilepsy. Int J Mol Sci 2024; 25:2880. [PMID: 38474127 DOI: 10.3390/ijms25052880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/18/2024] [Revised: 02/20/2024] [Accepted: 02/22/2024] [Indexed: 03/14/2024] Open
Abstract
Traumatic brain injury (TBI) can lead to post-traumatic epilepsy (PTE). Blast TBI (bTBI) found in Veterans presents with several complications, including cognitive and behavioral disturbances and PTE; however, the underlying mechanisms that drive the long-term sequelae are not well understood. Using an unbiased proteomics approach in a mouse model of repeated bTBI (rbTBI), this study addresses this gap in the knowledge. After rbTBI, mice were monitored using continuous, uninterrupted video-EEG for up to four months. Following this period, we collected cortex and hippocampus tissues from three groups of mice: those with post-traumatic epilepsy (PTE+), those without epilepsy (PTE-), and the control group (sham). Hundreds of differentially expressed proteins were identified in the cortex and hippocampus of PTE+ and PTE- relative to sham. Focusing on protein pathways unique to PTE+, pathways related to mitochondrial function, post-translational modifications, and transport were disrupted. Computational metabolic modeling using dysregulated protein expression predicted mitochondrial proton pump dysregulation, suggesting electron transport chain dysregulation in the epileptic tissue relative to PTE-. Finally, data mining enabled the identification of several novel and previously validated TBI and epilepsy biomarkers in our data set, many of which were found to already be targeted by drugs in various phases of clinical testing. These findings highlight novel proteins and protein pathways that may drive the chronic PTE sequelae following rbTBI.
Affiliation(s)
- Jack L Browning
- School of Neuroscience, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA
- Genetics, Bioinformatics and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA
| | - Kelsey A Wilson
- Department of Biomedical Engineering and Mechanics, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA
| | - Oleksii Shandra
- Department of Biomedical Engineering, Florida International University, Miami, FL 33174, USA
| | - Xiaoran Wei
- Virginia-Maryland College of Veterinary Medicine, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA
| | - Dzenis Mahmutovic
- Department of Cell Developmental and Integrative Biology, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Biswajit Maharathi
- Neurology & Rehabilitation, University of Illinois, Chicago, IL 60612, USA
| | - Stefanie Robel
- Department of Cell Developmental and Integrative Biology, University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Pamela J VandeVord
- Department of Biomedical Engineering and Mechanics, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA
- Salem Veteran Affairs Medical Center, Salem, VA 24153, USA
| | - Michelle L Olsen
- School of Neuroscience, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061, USA
|
5
|
Mahendran N, Vincent P M DR. Deep belief network-based approach for detecting Alzheimer's disease using the multi-omics data. Comput Struct Biotechnol J 2023; 21:1651-1660. [PMID: 36874164 PMCID: PMC9978469 DOI: 10.1016/j.csbj.2023.02.021] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 11/28/2022] [Revised: 02/10/2023] [Accepted: 02/11/2023] [Indexed: 02/15/2023] Open
Abstract
Alzheimer's disease (AD) is the form of dementia whose underlying mechanism remains the most uncertain. AD has no decisive genetic factor to which it can be attributed. In the past, there were no reliable techniques for identifying the genetic risk factors associated with AD, and most of the available data came from brain images. Recently, however, drastic advances in high-throughput techniques in bioinformatics have led to focused research on discovering the genetic risk factors that cause AD. Recent analysis has produced considerable prefrontal cortex data with which classification and prediction models can be developed for AD. We have developed a Deep Belief Network-based prediction model using DNA methylation and gene expression microarray data, which suffer from High Dimension Low Sample Size (HDLSS) issues. To overcome the HDLSS challenge, we performed a two-layer feature selection that also considers the biological aspects of the features. In the first layer, the differentially expressed genes and differentially methylated positions are identified, and the two datasets are then combined using the Jaccard similarity measure. In the second layer, an ensemble-based feature selection approach is applied to further narrow down the gene selection. The results show that the proposed feature selection technique outperforms commonly used feature selection techniques such as Support Vector Machine Recursive Feature Elimination (SVM-RFE) and Correlation-based Feature Selection (CBS). Furthermore, the Deep Belief Network-based prediction model performs better than widely used machine learning models, and the multi-omics dataset shows promising results compared with single omics.
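The Jaccard similarity measure named in this abstract can be illustrated with a minimal sketch. The abstract does not say exactly how the measure is used to combine the two datasets, so the code below only assumes that each dataset has been reduced to a set of gene symbols; the gene lists and the helper name `jaccard` are hypothetical and are not taken from the paper.

```python
def jaccard(a, b):
    """Return the Jaccard similarity |A ∩ B| / |A ∪ B| of two collections."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        # Convention chosen here: two empty sets have similarity 0.0.
        return 0.0
    return len(a & b) / len(union)


# Hypothetical inputs: genes flagged as differentially expressed (DEG) and
# genes mapped from differentially methylated positions (DMP).
deg_genes = {"APP", "PSEN1", "MAPT", "APOE"}
dmp_genes = {"APOE", "MAPT", "BIN1"}

similarity = jaccard(deg_genes, dmp_genes)  # 2 shared genes out of 5 distinct
shared = sorted(deg_genes & dmp_genes)      # genes supported by both data types
```

A value near 1 would mean the two omics layers point to nearly the same genes, while a value near 0 would mean they are largely disjoint.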
Affiliation(s)
- Nivedhitha Mahendran
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India
| | - Durai Raj Vincent P M
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India
|
6
|
Liu L, Zhai W, Wang F, Yu L, Zhou F, Xiang Y, Huang S, Zheng C, Yuan Z, He Y, Yu Z, Ji J. Using machine learning to identify gene interaction networks associated with breast cancer. BMC Cancer 2022; 22:1070. [PMID: 36253742 PMCID: PMC9575346 DOI: 10.1186/s12885-022-10170-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 04/16/2022] [Accepted: 10/10/2022] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Breast cancer (BC) is one of the most prevalent cancers worldwide but its etiology remains unclear. Obesity is recognized as a risk factor for BC, and many obesity-related genes may be involved in its occurrence and development. Research assessing the complex genetic mechanisms of BC should not only consider the effect of a single gene on the disease, but also focus on the interaction between genes. This study sought to construct a gene interaction network to identify potential pathogenic BC genes. METHODS The study included 953 BC patients and 963 control individuals. Chi-square analysis was used to assess the correlation between demographic characteristics and BC. The joint density-based non-parametric differential interaction network analysis and classification (JDINAC) was used to build a BC gene interaction network using single nucleotide polymorphisms (SNP). The odds ratio (OR) and 95% confidence interval (95% CI) of hub gene SNPs were evaluated using a logistic regression model. To assess reliability, the hub genes were quantified with the edgeR program using BC RNA-seq data from The Cancer Genome Atlas (TCGA), and identical edges were verified by logistic regression using UK Biobank datasets. GO and KEGG enrichment analyses were used to explore the biological functions of interactive genes. RESULTS Body mass index (BMI) and menopause are important risk factors for BC. After adjusting for potential confounding factors, the BC gene interaction network was identified using JDINAC. LEP, LEPR, XRCC6, and RETN were identified as hub genes and both hub genes and edges were verified. LEPR genetic polymorphisms (rs1137101 and rs4655555) were also significantly associated with BC. Enrichment analysis showed that the identified genes were mainly involved in energy regulation and fat-related signaling pathways. CONCLUSION We explored the interaction network of genes derived from SNP data in BC progression.
Gene interaction networks provide new insight into the underlying mechanisms of BC.
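As an illustration of the odds ratio and 95% confidence interval reported in this abstract, the sketch below computes an unadjusted odds ratio with a Wald interval from a 2x2 table. This is a simpler technique than the adjusted logistic regression used in the study, although a logistic regression with a single binary predictor and no covariates yields the same point estimate. The counts and the function name `odds_ratio_wald_ci` are hypothetical.

```python
import math


def odds_ratio_wald_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    a: cases carrying the variant      b: cases without the variant
    c: controls carrying the variant   d: controls without the variant
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) for a 2x2 table.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, not taken from the study.
or_value, ci_low, ci_high = odds_ratio_wald_ci(a=30, b=70, c=20, d=80)
```

An interval that excludes 1 is the usual indication that the association is significant at the corresponding level; adjusting for confounders such as BMI requires a regression model rather than this table-based shortcut.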
Affiliation(s)
- Liyuan Liu
- Department of Breast Surgery, The Second Hospital, Cheeloo College of Medicine, Shandong University, 250033, Jinan, China.,School of Mathematics, Shandong University, Jinan, 250100, China
| | - Wenli Zhai
- Institute for Financial Studies, Shandong University, Jinan, 250100, China
| | - Fei Wang
- Department of Breast Surgery, The Second Hospital, Cheeloo College of Medicine, Shandong University, 250033, Jinan, China.,Institute of Translational Medicine of Breast Disease Prevention and Treatment, Shandong University, Jinan, 250100, China
| | - Lixiang Yu
- Department of Breast Surgery, The Second Hospital, Cheeloo College of Medicine, Shandong University, 250033, Jinan, China.,Institute of Translational Medicine of Breast Disease Prevention and Treatment, Shandong University, Jinan, 250100, China
| | - Fei Zhou
- Department of Breast Surgery, The Second Hospital, Cheeloo College of Medicine, Shandong University, 250033, Jinan, China.,Institute of Translational Medicine of Breast Disease Prevention and Treatment, Shandong University, Jinan, 250100, China
| | - Yujuan Xiang
- Department of Breast Surgery, The Second Hospital, Cheeloo College of Medicine, Shandong University, 250033, Jinan, China.,Institute of Translational Medicine of Breast Disease Prevention and Treatment, Shandong University, Jinan, 250100, China
| | - Shuya Huang
- Department of Breast Surgery, The Second Hospital, Cheeloo College of Medicine, Shandong University, 250033, Jinan, China.,Institute of Translational Medicine of Breast Disease Prevention and Treatment, Shandong University, Jinan, 250100, China
| | - Chao Zheng
- Department of Breast Surgery, The Second Hospital, Cheeloo College of Medicine, Shandong University, 250033, Jinan, China.,Institute of Translational Medicine of Breast Disease Prevention and Treatment, Shandong University, Jinan, 250100, China
| | - Zhongshang Yuan
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan, 250012, China
| | - Yong He
- Institute for Financial Studies, Shandong University, Jinan, 250100, China
| | - Zhigang Yu
- Department of Breast Surgery, The Second Hospital, Cheeloo College of Medicine, Shandong University, 250033, Jinan, China.,Institute of Translational Medicine of Breast Disease Prevention and Treatment, Shandong University, Jinan, 250100, China
| | - Jiadong Ji
- Institute for Financial Studies, Shandong University, Jinan, 250100, China
|
7
|
Machine Learning Framework for the Prediction of Alzheimer’s Disease Using Gene Expression Data Based on Efficient Gene Selection. Symmetry (Basel) 2022. [DOI: 10.3390/sym14030491] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 12/30/2022] Open
Abstract
In recent years, much research has focused on using machine learning (ML) for disease prediction based on gene expression (GE) data. However, many diseases have received considerable attention, whereas some, including Alzheimer’s disease (AD), have not, perhaps due to data shortage. The present work is intended to fill this gap by introducing a symmetric framework to predict AD from GE data, with the aim to produce the most accurate prediction using the smallest number of genes. The framework works in four stages after it receives a training dataset: pre-processing, gene selection (GS), classification, and AD prediction. The symmetry of the model is manifested in all of its stages. In the pre-processing stage gene columns in the training dataset are pre-processed identically. In the GS stage, the same user-defined filter metrics are invoked on every gene individually, and so are the same user-defined wrapper metrics. In the classification stage, a number of user-defined ML models are applied identically using the minimal set of genes selected in the preceding stage. The core of the proposed framework is a meticulous GS algorithm which we have designed to nominate eight subsets of the original set of genes provided in the training dataset. Exploring the eight subsets, the algorithm selects the best one to describe AD, and also the best ML model to predict the disease using this subset. For credible results, the framework calculates performance metrics using repeated stratified k-fold cross validation. To evaluate the framework, we used an AD dataset of 1157 cases and 39,280 genes, obtained by combining a number of smaller public datasets. The cases were split in two partitions, 1000 for training/testing, using 10-fold CV repeated 30 times, and 157 for validation. From the testing/training phase, the framework identified only 1058 genes to be the most relevant and the support vector machine (SVM) model to be the most accurate with these genes. 
In the final validation, we used the 157 cases that were never seen by the SVM classifier. For credible performance evaluation, we evaluated the classifier via six metrics, for which we obtained impressive values. Specifically, we obtained 0.97, 0.97, 0.98, 0.945, 0.972, and 0.975 for the sensitivity (recall), specificity, precision, kappa index, AUC, and accuracy, respectively.
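Five of the six metrics listed in this abstract (sensitivity, specificity, precision, kappa index, and accuracy) can be derived from a binary confusion matrix; the sketch below shows the standard formulas. AUC is omitted because it needs continuous scores rather than counts. The counts and the function name `binary_metrics` are hypothetical and do not reproduce the paper's validation set.

```python
def binary_metrics(tp, fp, tn, fn):
    """Compute common metrics from binary confusion-matrix counts.

    Assumes every denominator is non-zero (both classes are present and at
    least one positive prediction was made).
    """
    n = tp + fp + tn + fn
    sensitivity = tp / (tp + fn)   # recall: true positives among actual positives
    specificity = tn / (tn + fp)   # true negatives among actual negatives
    precision = tp / (tp + fp)     # true positives among predicted positives
    accuracy = (tp + tn) / n
    # Cohen's kappa: agreement beyond what chance alone would produce.
    p_chance = ((tp + fp) / n) * ((tp + fn) / n) + ((tn + fn) / n) * ((tn + fp) / n)
    kappa = (accuracy - p_chance) / (1 - p_chance)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "accuracy": accuracy,
        "kappa": kappa,
    }


# Hypothetical counts for a 100-case hold-out set.
metrics = binary_metrics(tp=40, fp=10, tn=45, fn=5)
```

Reporting kappa alongside accuracy helps when classes are imbalanced, because kappa discounts the agreement a classifier would achieve by chance.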
|
8
|
Wang Q, Chen K, Su Y, Reiman EM, Dudley JT, Readhead B. Deep learning-based brain transcriptomic signatures associated with the neuropathological and clinical severity of Alzheimer's disease. Brain Commun 2022; 4:fcab293. [PMID: 34993477 PMCID: PMC8728025 DOI: 10.1093/braincomms/fcab293] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 06/21/2021] [Revised: 11/05/2021] [Accepted: 11/09/2021] [Indexed: 01/20/2023] Open
Abstract
Brain tissue gene expression from donors with and without Alzheimer's disease has been used to help inform the molecular changes associated with the development and potential treatment of this disorder. Here, we use a deep learning method to analyse RNA-seq data from 1114 brain donors from the Accelerating Medicines Project for Alzheimer's Disease consortium to characterize post-mortem brain transcriptome signatures associated with amyloid-β plaque, tau neurofibrillary tangles and clinical severity in multiple Alzheimer's disease dementia populations. Starting from the cross-sectional data in the Religious Orders Study and Memory and Aging Project cohort (n = 634), a deep learning framework was built to obtain a trajectory that mirrors Alzheimer's disease progression. A severity index was defined to quantitatively measure the progression based on the trajectory. Network analysis was then carried out to identify key gene (index gene) modules present in the model underlying the progression. Within this data set, severity indexes were found to be very closely correlated with all Alzheimer's disease neuropathology biomarkers (R ∼ 0.5, P < 1e-11) and global cognitive function (R = -0.68, P < 2.2e-16). We then applied the model to additional transcriptomic data sets from different brain regions (MAYO, n = 266; Mount Sinai Brain Bank, n = 214), and observed that the model remained significantly predictive (P < 1e-3) of neuropathology and clinical severity. The index genes that significantly contributed to the model were integrated with Alzheimer's disease co-expression regulatory networks, resolving four discrete gene modules that are implicated in vascular and metabolic dysfunction in different cell types, respectively. 
Our work demonstrates the generalizability of this signature to frontal and temporal cortex measurements and additional brain donors with Alzheimer's disease, other age-related neurological disorders and controls, and revealed that the transcriptomic network modules contribute to neuropathological and clinical disease severity. This study illustrates the promise of using deep learning methods to analyse heterogeneous omics data and discover potentially targetable molecular networks that can inform the development, treatment and prevention of neurodegenerative diseases like Alzheimer's disease.
Affiliation(s)
- Qi Wang
- ASU-Banner Neurodegenerative Disease Research Center, Arizona State University, Tempe, AZ 85281, USA
| | - Kewei Chen
- Banner Alzheimer's Institute, Phoenix, AZ 85006, USA
| | - Yi Su
- Banner Alzheimer's Institute, Phoenix, AZ 85006, USA
| | - Eric M Reiman
- ASU-Banner Neurodegenerative Disease Research Center, Arizona State University, Tempe, AZ 85281, USA.,Banner Alzheimer's Institute, Phoenix, AZ 85006, USA
| | - Joel T Dudley
- ASU-Banner Neurodegenerative Disease Research Center, Arizona State University, Tempe, AZ 85281, USA.,Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Benjamin Readhead
- ASU-Banner Neurodegenerative Disease Research Center, Arizona State University, Tempe, AZ 85281, USA
|
9
|
Mahendran N, Vincent PMDR, Srinivasan K, Chang CY. Improving the Classification of Alzheimer's Disease Using Hybrid Gene Selection Pipeline and Deep Learning. Front Genet 2021; 12:784814. [PMID: 34868275 PMCID: PMC8632950 DOI: 10.3389/fgene.2021.784814] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 09/28/2021] [Accepted: 10/20/2021] [Indexed: 11/13/2022] Open
Abstract
Alzheimer’s is a progressive, irreversible, neurodegenerative brain disease. Even with prominent symptoms, it takes years to notice, decode, and reveal Alzheimer’s. However, advancements in technologies, such as imaging techniques, help in early diagnosis. Still, sometimes the results are inaccurate, which delays the treatment. Thus, recent research has focused on identifying the molecular biomarkers that differentiate the genotype and phenotype characteristics. However, gene expression datasets generate a huge number of features, from 1,000 to more than 10,000. To overcome this curse of dimensionality, feature selection techniques are introduced. We designed a gene selection pipeline combining a filter, a wrapper, and an unsupervised method to select the relevant genes. We combined minimum Redundancy and maximum Relevance (mRmR), Wrapper-based Particle Swarm Optimization (WPSO), and an autoencoder to select the relevant features. We used the GSE5281 Alzheimer’s dataset from the Gene Expression Omnibus. We implemented an Improved Deep Belief Network (IDBN) with simple stopping criteria after choosing the relevant genes. We used a Bayesian Optimization technique to tune the hyperparameters in the Improved Deep Belief Network. The tabulated results show that the proposed pipeline shows promising results.
Affiliation(s)
- Nivedhitha Mahendran
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India
| | - P M Durai Raj Vincent
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India
| | - Kathiravan Srinivasan
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, India
| | - Chuan-Yu Chang
- Department of Computer Science and Information Engineering, National Yunlin University of Science and Technology, Yunlin, Taiwan
|
10
|
Abd El Hamid MM, Shaheen M, Mabrouk MS, Omar YMK. Machine Learning for Detecting Epistasis Interactions and Its Relevance to Personalized Medicine in Alzheimer’s Disease: Systematic Review. Biomedical Engineering: Applications, Basis and Communications 2021; 33. [DOI: 10.4015/s1016237221500472] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 09/02/2023]
Abstract
Alzheimer’s disease (AD) is a progressive disease that attacks the brain’s neurons and causes problems in memory, thinking, and reasoning skills. Personalized Medicine (PM) needs a better and more accurate understanding of the relationship between human genetic data and complex diseases like AD. The goal of PM is to tailor treatment to each patient’s individual characteristics. PM requires the prediction of a person’s disease from genetic data, and its success depends on the accurate detection of genetic biomarkers. Single nucleotide polymorphisms (SNPs) are considered the most prevalent type of variation in the human genome. Epistasis has a biological relevance to complex diseases and has an important impact on PM. Detecting the most significant epistasis interactions associated with complex diseases is a big challenge. This paper reviews several machine learning techniques and algorithms to detect the most significant epistasis interactions in Alzheimer’s disease. We discuss many machine learning techniques that can be used for detecting combinations of SNPs, such as Random Forests, Support Vector Machines, Multifactor Dimensionality Reduction, Neural Networks, and Deep Learning. This review paper highlights the pros and cons of these techniques and explains how they can be applied in an efficient framework for knowledge discovery and data mining in AD.
Affiliation(s)
- Marwa M. Abd El Hamid
- The Higher Institute of Computer Science & Information Technology, El-Shorouk Academy, El Shorouk City, Cairo, Egypt
- College of Computing and Information Technology AASTMT, Egypt
| | - Mohamed Shaheen
- College of Computing and Information Technology AASTMT, Egypt
| | - Mai S. Mabrouk
- Biomedical Engineering Department Misr University for Science and Technology 6th of October City, Egypt
|
11
|
Potential associations between immune signaling genes, deactivated microglia, and oligodendrocytes and cortical gray matter loss in patients with long-term remitted Cushing's disease. Psychoneuroendocrinology 2021; 132:105334. [PMID: 34225183 DOI: 10.1016/j.psyneuen.2021.105334] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 12/21/2020] [Revised: 04/30/2021] [Accepted: 06/15/2021] [Indexed: 12/19/2022]
Abstract
INTRODUCTION Cushing's disease (CD) is a rare and severe endocrine disease characterized by hypercortisolemia. Previous studies have found structural brain alterations in remitted CD patients compared to healthy controls, specifically in the anterior cingulate cortex (ACC). However, potential mechanisms through which these persistent alterations may have occurred are currently unknown. METHODS Structural 3T MRI's from 25 remitted CD patients were linked with gene expression data from neurotypical donors, derived from the Allen Human Brain Atlas. Differences in gene expression between the ACC and an unaffected control cortical region were examined, followed by a Gene Ontology (GO) enrichment analysis. A cell type enrichment analysis was conducted on the differentially expressed genes, and a disease association enrichment analysis was conducted to determine possible associations between differentially expressed genes and specific diseases. Subsequently, cortisol sensitivity of these genes in existing datasets was examined. RESULTS The gene expression analysis identified 300 differentially expressed genes in the ACC compared to the cortical control region. GO analyses found underexpressed genes to represent immune function. The cell type specificity analysis indicated that underexpressed genes were enriched for deactivated microglia and oligodendrocytes. Neither significant associations with diseases, nor evidence of cortisol sensitivity with the differentially expressed genes were found. DISCUSSION Underexpressed genes in the ACC, the area vulnerable to permanent changes in remitted CD patients, were often associated with immune functioning. The specific lack of deactivated microglia and oligodendrocytes implicates protective effects of these cell types against the long-term effects of cortisol overexposure.
|
12
|
Chen K, Xu H, Lei Y, Lio P, Li Y, Guo H, Ali Moni M. Integration and interplay of machine learning and bioinformatics approach to identify genetic interaction related to ovarian cancer chemoresistance. Brief Bioinform 2021; 22:6272796. [PMID: 33971668 DOI: 10.1093/bib/bbab100] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 01/13/2021] [Revised: 03/04/2021] [Accepted: 03/06/2021] [Indexed: 11/15/2022] Open
Abstract
Although chemotherapy is the first-line treatment for ovarian cancer (OCa) patients, chemoresistance (CR) decreases their progression-free survival. This paper investigates the genetic interaction (GI) related to OCa-CR. To reduce the complexity of establishing gene networks, individual signature genes related to OCa-CR are identified using a gradient boosting decision tree algorithm. Additionally, the genetic interaction coefficient (GIC) is proposed to quantitatively measure the correlation between two signature genes and explain their joint influence on OCa-CR. A gene pair with a high GIC is identified as a signature pair. A total of 24 signature gene pairs, comprising 10 individual signature genes, are selected, and the influence of signature gene pairs on OCa-CR is explored. Finally, a signature gene pair-based prediction of OCa-CR is presented. The area under the curve (AUC) is a widely used performance measure for machine learning prediction. The AUC of the signature gene pair-based prediction reaches 0.9658, whereas that of the individual signature gene-based prediction is only 0.6823. The identified signature gene pairs not only build an efficient GI network of OCa-CR but also provide an interesting way to predict OCa-CR. This improvement shows that the proposed method is a useful tool for investigating GI related to OCa-CR.
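For readers unfamiliar with the AUC reported in this abstract, the following is a minimal sketch of how the measure can be computed from predicted scores. It uses the standard pairwise-ranking definition (the probability that a randomly chosen positive sample scores higher than a randomly chosen negative one). The function name `auc_from_scores` and the toy data are hypothetical and are not taken from the paper; the authors' own pipeline is not described here.

```python
def auc_from_scores(labels, scores):
    """AUC via pairwise ranking: fraction of (positive, negative) pairs
    where the positive sample gets the higher score; ties count as 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = chemoresistant, 0 = chemosensitive.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
toy_auc = auc_from_scores(labels, scores)
print(round(toy_auc, 4))
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, which gives context for the gap between the two values quoted in the abstract.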
Affiliation(s)
- Kexin Chen
- School of Electronics Engineering and Computer Science, Peking University, 100871, Beijing, China
- Haoming Xu
- Department of Biomedical Engineering, Duke University, 27708, Durham, United States
- Yiming Lei
- School of Electronics Engineering and Computer Science, Peking University, 100871, Beijing, China
- Pietro Lio
- Computer Laboratory, University of Cambridge, CB3-0FD, Cambridge, United Kingdom
- Yuan Li
- Department of Obstetrics and Gynecology, Peking University Third Hospital, 100083, Beijing, China
- Hongyan Guo
- Department of Obstetrics and Gynecology, Peking University Third Hospital, 100083, Beijing, China
- Mohammad Ali Moni
- School of Public Health and Community Medicine, University of New South Wales, 2052, Sydney, Australia
|
13
|
Zhao X, Yao H, Li X. Unearthing of Key Genes Driving the Pathogenesis of Alzheimer's Disease via Bioinformatics. Front Genet 2021; 12:641100. [PMID: 33936168 PMCID: PMC8085575 DOI: 10.3389/fgene.2021.641100] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 12/13/2020] [Accepted: 03/15/2021] [Indexed: 01/23/2023] Open
Abstract
Alzheimer’s disease (AD) is a neurodegenerative disease with unelucidated molecular pathogenesis. Herein, we aimed to identify potential hub genes governing the pathogenesis of AD. The AD datasets GSE118553 and GSE131617 were collected from the NCBI GEO database. Weighted gene coexpression network analysis (WGCNA), differential gene expression analysis, and functional enrichment analysis were performed to reveal the hub genes and verify their role in AD. Hub genes were validated by machine learning algorithms. We identified modules and their corresponding hub genes from the temporal cortex (TC), frontal cortex (FC), entorhinal cortex (EC), and cerebellum (CE). We obtained 33, 42, 42, and 41 hub genes in modules associated with AD in TC, FC, EC, and CE tissues, respectively. Significant differences were recorded in the expression levels of hub genes between the AD and control groups in the TC and EC tissues (P < 0.05). The differences in the expression of FCGRT, SLC1A3, PTN, PTPRZ1, and PON2 in the FC and CE tissues between the AD and control groups were significant (P < 0.05). The expression levels of PLXNB1, GRAMD3, and GJA1 differed significantly between the Braak NFT stages of AD. Overall, our study uncovered genes that may be involved in AD pathogenesis and revealed their potential for the development of AD biomarkers and appropriate AD therapeutic targets.
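As background on the hub-gene idea used in this abstract, the sketch below illustrates the core of an unsigned weighted coexpression network: pairwise Pearson correlations are raised to a soft-threshold power to form adjacencies, and each gene's connectivity is the sum of its adjacencies. The gene names, expression values, and the choice of `beta=6` are hypothetical illustrations, and the sketch omits module detection and the other steps of the full WGCNA workflow; it is not the authors' analysis.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(vx * vy)


def connectivity(expr, beta=6):
    """Unsigned soft-threshold adjacency a_ij = |cor(i, j)| ** beta,
    summed over all other genes j to give each gene's connectivity."""
    genes = list(expr)
    return {
        g: sum(abs(pearson(expr[g], expr[h])) ** beta for h in genes if h != g)
        for g in genes
    }


# Hypothetical expression profiles across five samples.
expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0, 5.0],
    "geneB": [1.1, 2.1, 2.9, 4.2, 5.0],
    "geneC": [2.0, 4.1, 6.0, 8.1, 9.9],
    "geneD": [3.0, 1.0, 4.0, 1.0, 5.0],
}
k = connectivity(expr)
hub = max(k, key=k.get)  # the most connected gene is the hub candidate
print(hub, {g: round(v, 3) for g, v in k.items()})
```

Raising correlations to a power suppresses weak, noisy correlations while keeping strong ones, so a poorly correlated gene such as the toy `geneD` ends up with near-zero connectivity.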
Affiliation(s)
- Xingxing Zhao
- Department of Neurology, Bethune Hospital Affiliated to Shanxi Medical University, Taiyuan, China
- Department of Cardiology, First Hospital of Shanxi Medical University, Taiyuan, China
- Hongmei Yao
- Department of Cardiology, First Hospital of Shanxi Medical University, Taiyuan, China
- Xinyi Li
- Department of Neurology, Bethune Hospital Affiliated to Shanxi Medical University, Taiyuan, China
|
14
|
He Y, Chen H, Sun H, Ji J, Shi Y, Zhang X, Liu L. High-dimensional integrative copula discriminant analysis for multiomics data. Stat Med 2020; 39:4869-4884. [PMID: 33617001 DOI: 10.1002/sim.8758] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 01/19/2020] [Revised: 08/30/2020] [Accepted: 09/04/2020] [Indexed: 11/08/2022]
Abstract
Multiomics or integrative omics data have become increasingly common in biomedical studies, holding promise for a better understanding of human health and disease. In this article, we propose an integrative copula discriminant analysis classifier in the context of two-class classification, which relaxes the common Gaussian assumption and gains power by borrowing information from multiple omics data types in discriminant analysis. Numerical studies are conducted to assess the finite sample performance of the new classifier. We apply our model to the Religious Orders Study and Memory and Aging Project (ROSMAP) Study, integrating gene expression and DNA methylation data for better prediction.
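To illustrate what relaxing the Gaussian assumption can look like in a copula setting, the sketch below shows a generic rank-based normal-scores transform: each feature is mapped through its empirical ranks and the standard normal quantile function, so that only the ordering of the values matters. This is a common preprocessing idea in Gaussian copula models and is offered as an assumption-laden illustration; the abstract does not specify the authors' estimator, and the function name `normal_scores` and the toy values are hypothetical.

```python
from statistics import NormalDist


def normal_scores(values):
    """Map values to standard normal quantiles of their scaled ranks,
    rank / (n + 1), using average ranks for tied values."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        # Extend j over a run of tied values.
        while j + 1 < n and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # average 1-based rank of the tied run
        for pos in range(i, j + 1):
            ranks[order[pos]] = avg_rank
        i = j + 1
    std_normal = NormalDist()
    return [std_normal.inv_cdf(r / (n + 1)) for r in ranks]


# Hypothetical, heavily skewed feature (for example, one omics measurement).
skewed = [1.0, 2.0, 4.0, 8.0, 1000.0]
z = normal_scores(skewed)
print([round(v, 3) for v in z])
```

Because the transform depends only on ranks, the extreme value 1000.0 is treated the same as any other largest observation, which is one reason rank-based transforms are robust to skewed marginal distributions.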
Affiliation(s)
- Yong He
- Shandong University, Jinan, China
- Hao Chen
- School of Statistics, Shandong University of Finance and Economics, Jinan, China
- Hao Sun
- School of Statistics, Shandong University of Finance and Economics, Jinan, China
- Yufeng Shi
- Shandong University, Jinan, China
- School of Statistics, Shandong University of Finance and Economics, Jinan, China
- Lei Liu
- Division of Biostatistics, Washington University in St. Louis, St. Louis, Missouri, USA
|
15
|
Chen H, He Y, Ji J, Shi Y. The sparse group lasso for high-dimensional integrative linear discriminant analysis with application to Alzheimer's disease prediction. J STAT COMPUT SIM 2020. [DOI: 10.1080/00949655.2020.1800011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 10/23/2022]
Affiliation(s)
- Hao Chen
- School of Statistics, Shandong University of Finance and Economics, Jinan, People's Republic of China
- Yong He
- Institute for Financial Studies, Shandong University, Jinan, People's Republic of China
- Jiadong Ji
- School of Statistics, Shandong University of Finance and Economics, Jinan, People's Republic of China
- Yufeng Shi
- School of Statistics, Shandong University of Finance and Economics, Jinan, People's Republic of China
- Institute for Financial Studies, Shandong University, Jinan, People's Republic of China
|