1
|
Woo BJ, Moussavi-Baygi R, Karner H, Karimzadeh M, Yousefi H, Lee S, Garcia K, Joshi T, Yin K, Navickas A, Gilbert LA, Wang B, Asgharian H, Feng FY, Goodarzi H. Integrative identification of non-coding regulatory regions driving metastatic prostate cancer. Cell Rep 2024; 43:114764. [PMID: 39276353 PMCID: PMC11466230 DOI: 10.1016/j.celrep.2024.114764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 07/08/2024] [Accepted: 08/29/2024] [Indexed: 09/17/2024] Open
Abstract
Large-scale sequencing efforts have been undertaken to understand the mutational landscape of the coding genome. However, the vast majority of variants occur within non-coding genomic regions. We designed an integrative computational and experimental framework to identify recurrently mutated non-coding regulatory regions that drive tumor progression. Applying this framework to sequencing data from a large prostate cancer patient cohort revealed a large set of candidate drivers. We used (1) in silico analyses, (2) massively parallel reporter assays, and (3) in vivo CRISPR interference screens to systematically validate metastatic castration-resistant prostate cancer (mCRPC) drivers. One identified enhancer region, GH22I030351, acts on a bidirectional promoter to simultaneously modulate expression of the U2-associated splicing factor SF3A1 and chromosomal protein CCDC157. SF3A1 and CCDC157 promote tumor growth in vivo. We nominated a number of transcription factors, notably SOX6, to regulate expression of SF3A1 and CCDC157. Our integrative approach enables the systematic detection of non-coding regulatory regions that drive human cancers.
Collapse
Affiliation(s)
- Brian J Woo
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Arc Institute, Palo Alto, CA 94305, USA
| | - Ruhollah Moussavi-Baygi
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA
| | - Heather Karner
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Arc Institute, Palo Alto, CA 94305, USA
| | - Mehran Karimzadeh
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Vector Institute, Toronto, ON, Canada; Peter Munk Cardiac Centre, University Health Network, Toronto, ON, Canada; Arc Institute, Palo Alto, CA 94305, USA
| | - Hassan Yousefi
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Arc Institute, Palo Alto, CA 94305, USA
| | - Sean Lee
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Arc Institute, Palo Alto, CA 94305, USA
| | - Kristle Garcia
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA
| | - Tanvi Joshi
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA
| | - Keyi Yin
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA
| | - Albertas Navickas
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA
| | - Luke A Gilbert
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Arc Institute, Palo Alto, CA 94305, USA
| | - Bo Wang
- Vector Institute, Toronto, ON, Canada; Peter Munk Cardiac Centre, University Health Network, Toronto, ON, Canada
| | - Hosseinali Asgharian
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA, USA.
| | - Felix Y Feng
- Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Department of Radiation Oncology, University of California, San Francisco, San Francisco, CA, USA.
| | - Hani Goodarzi
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, CA, USA; Department of Urology, University of California, San Francisco, San Francisco, CA, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, USA; Arc Institute, Palo Alto, CA 94305, USA; Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA, USA.
| |
Collapse
|
2
|
Wang C, Wang YJ, Ying L, Wong RJ, Quaintance CC, Hong X, Neff N, Wang X, Biggio JR, Mesiano S, Quake SR, Alvira CM, Cornfield DN, Stevenson DK, Shaw GM, Li J. Integrative analysis of noncoding mutations identifies the druggable genome in preterm birth. SCIENCE ADVANCES 2024; 10:eadk1057. [PMID: 38241369 PMCID: PMC10798565 DOI: 10.1126/sciadv.adk1057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Accepted: 12/21/2023] [Indexed: 01/21/2024]
Abstract
Preterm birth affects ~10% of pregnancies in the US. Despite familial associations, identifying at-risk genetic loci has been challenging. We built deep learning and graphical models to score mutational effects at base resolution via integrating the pregnant myometrial epigenome and large-scale patient genomes with spontaneous preterm birth (sPTB) from European and African American cohorts. We uncovered previously unidentified sPTB genes that are involved in myometrial muscle relaxation and inflammatory responses and that are regulated by the progesterone receptor near labor onset. We studied genomic variants in these genes in our recruited pregnant women administered progestin prophylaxis. We observed that mutation burden in these genes was predictive of responses to progestin treatment for preterm birth. To advance therapeutic development, we screened ~4000 compounds, identified candidate molecules that affect our identified genes, and experimentally validated their therapeutic effects on regulating labor. Together, our integrative approach revealed the druggable genome in preterm birth and provided a generalizable framework for studying complex diseases.
Collapse
Affiliation(s)
- Cheng Wang
- Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, Bakar Computational Health Sciences Institute, Parker Institute for Cancer Immunotherapy, and Department of Neurology, School of Medicine, University of California, San Francisco, CA, USA
| | - Yuejun Jessie Wang
- Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, Bakar Computational Health Sciences Institute, Parker Institute for Cancer Immunotherapy, and Department of Neurology, School of Medicine, University of California, San Francisco, CA, USA
| | - Lihua Ying
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - Ronald J. Wong
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - Cecele C. Quaintance
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - Xiumei Hong
- Center on the Early Life Origins of Disease, Department of Population Family and Reproductive Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Norma Neff
- Chan Zuckerberg Biohub, San Francisco, CA, USA
| | - Xiaobin Wang
- Center on the Early Life Origins of Disease, Department of Population Family and Reproductive Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Joseph R. Biggio
- Department of Obstetrics and Gynecology, Division of Maternal Fetal Medicine, University of Alabama at Birmingham, Birmingham, AL, USA
- Department of Obstetrics and Gynecology, Ochsner Health, New Orleans, LA, USA
| | - Sam Mesiano
- Department of Reproductive Biology, Case Western Reserve University and Department of Obstetrics and Gynecology, University Hospitals of Cleveland, Cleveland, OH, USA
| | - Stephen R. Quake
- Chan Zuckerberg Biohub, San Francisco, CA, USA
- Department of Bioengineering, Stanford University School of Medicine, Stanford, CA, USA
| | - Cristina M. Alvira
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - David N. Cornfield
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - David K. Stevenson
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - Gary M. Shaw
- Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
| | - Jingjing Li
- Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, Bakar Computational Health Sciences Institute, Parker Institute for Cancer Immunotherapy, and Department of Neurology, School of Medicine, University of California, San Francisco, CA, USA
| |
Collapse
|
3
|
Bazarkin A, Morozov A, Androsov A, Fajkovic H, Rivas JG, Singla N, Koroleva S, Teoh JYC, Zvyagin AV, Shariat SF, Somani B, Enikeev D. Assessment of Prostate and Bladder Cancer Genomic Biomarkers Using Artificial Intelligence: a Systematic Review. Curr Urol Rep 2024; 25:19-35. [PMID: 38099997 DOI: 10.1007/s11934-023-01193-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/01/2023] [Indexed: 01/14/2024]
Abstract
PURPOSE OF REVIEW The aim of the systematic review is to assess AI's capabilities in the genetics of prostate cancer (PCa) and bladder cancer (BCa) to evaluate target groups for such analysis as well as to assess its prospects in daily practice. RECENT FINDINGS In total, our analysis included 27 articles: 10 articles have reported on PCa and 17 on BCa, respectively. The AI algorithms added clinical value and demonstrated promising results in several fields, including cancer detection, assessment of cancer development risk, risk stratification in terms of survival and relapse, and prediction of response to a specific therapy. Besides clinical applications, genetic analysis aided by the AI shed light on the basic urologic cancer biology. We believe, our results of the AI application to the analysis of PCa, BCa data sets will help to identify new targets for urological cancer therapy. The integration of AI in genomic research for screening and clinical applications will evolve with time to help personalizing chemotherapy, prediction of survival and relapse, aid treatment strategies such as reducing frequency of diagnostic cystoscopies, and clinical decision support, e.g., by predicting immunotherapy response. These factors will ultimately lead to personalized and precision medicine thereby improving patient outcomes.
Collapse
Affiliation(s)
- Andrey Bazarkin
- Institute for Urology and Reproductive Health, Sechenov University, Moscow, Russia
| | - Andrey Morozov
- Institute for Urology and Reproductive Health, Sechenov University, Moscow, Russia
| | - Alexander Androsov
- Department of Pediatric Surgery, Division of Pediatric Urology and Andrology, Sechenov University, Moscow, Russia
| | - Harun Fajkovic
- Department of Urology and Comprehensive Cancer Center, Medical University of Vienna, Währinger Gürtel 18-20, 1090, Vienna, Austria
- Karl Landsteiner Institute of Urology and Andrology, Vienna, Austria
| | - Juan Gomez Rivas
- Department of Urology, Clinico San Carlos University Hospital, Madrid, Spain
| | - Nirmish Singla
- School of Medicine, Brady Urological Institute, Johns Hopkins Medicine, Baltimore, MD, USA
| | - Svetlana Koroleva
- Clinical Institute for Children Health Named After N.F. Filatov, Sechenov University, Moscow, Russia
| | - Jeremy Yuen-Chun Teoh
- Department of Surgery, S.H. Ho Urology Centre, The Chinese University of Hong Kong, Hong Kong, China
| | - Andrei V Zvyagin
- Institute of Molecular Theranostics, Sechenov University, Moscow, Russia
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 117997, Moscow, Russia
| | - Shahrokh François Shariat
- Department of Urology and Comprehensive Cancer Center, Medical University of Vienna, Währinger Gürtel 18-20, 1090, Vienna, Austria
- Karl Landsteiner Institute of Urology and Andrology, Vienna, Austria
- Department of Urology, Weill Cornell Medical College, New York, NY, USA
- Department of Urology, University of Texas Southwestern, Dallas, TX, USA
- Department of Urology, Second Faculty of Medicine, Charles University, Prague, Czech Republic
- Division of Urology, Department of Special Surgery, Jordan University Hospital, The University of Jordan, Amman, Jordan
| | - Bhaskar Somani
- Department of Urology, University Hospital Southampton, Southampton, United Kingdom
| | - Dmitry Enikeev
- Institute for Urology and Reproductive Health, Sechenov University, Moscow, Russia.
- Department of Urology and Comprehensive Cancer Center, Medical University of Vienna, Währinger Gürtel 18-20, 1090, Vienna, Austria.
- Karl Landsteiner Institute of Urology and Andrology, Vienna, Austria.
- Division of Urology, Rabin Medical Center, Petah Tikva, Israel.
| |
Collapse
|
4
|
Waung MW, Ma F, Wheeler AG, Zai CC, So J. The Diagnostic Landscape of Adult Neurogenetic Disorders. BIOLOGY 2023; 12:1459. [PMID: 38132285 PMCID: PMC10740572 DOI: 10.3390/biology12121459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Revised: 11/11/2023] [Accepted: 11/16/2023] [Indexed: 12/23/2023]
Abstract
Neurogenetic diseases affect individuals across the lifespan, but accurate diagnosis remains elusive for many patients. Adults with neurogenetic disorders often undergo a long diagnostic odyssey, with multiple specialist evaluations and countless investigations without a satisfactory diagnostic outcome. Reasons for these diagnostic challenges include: (1) clinical features of neurogenetic syndromes are diverse and under-recognized, particularly those of adult-onset, (2) neurogenetic syndromes may manifest with symptoms that span multiple neurological and medical subspecialties, and (3) a positive family history may not be present or readily apparent. Furthermore, there is a large gap in the understanding of how to apply genetic diagnostic tools in adult patients, as most of the published literature focuses on the pediatric population. Despite these challenges, accurate genetic diagnosis is imperative to provide affected individuals and their families guidance on prognosis, recurrence risk, and, for an increasing number of disorders, offer targeted treatment. Here, we provide a framework for recognizing adult neurogenetic syndromes, describe the current diagnostic approach, and highlight studies using next-generation sequencing in different neurological disease cohorts. We also discuss diagnostic pitfalls, barriers to achieving a definitive diagnosis, and emerging technology that may increase the diagnostic yield of testing.
Collapse
Affiliation(s)
- Maggie W. Waung
- Division of General Neurology, Department of Neurology, Weill Institute for Neurosciences, University of California, San Francisco, CA 94158, USA
| | - Fion Ma
- Institute for Human Genetics, University of California San Francisco School of Medicine, San Francisco, CA 94143, USA
| | - Allison G. Wheeler
- Institute for Human Genetics, University of California San Francisco School of Medicine, San Francisco, CA 94143, USA
- Department of Neurology, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Clement C. Zai
- Tanenbaum Centre for Pharmacogenetics, Molecular Brain Science, Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, ON M5T 1R8, Canada
- Department of Psychiatry, Institute of Medical Science, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Joyce So
- Division of Medical Genetics, Department of Pediatrics, University of California, San Francisco, CA 94158, USA
| |
Collapse
|
5
|
Shi M, Li X, Li M, Si Y. Attention-based generative adversarial networks improve prognostic outcome prediction of cancer from multimodal data. Brief Bioinform 2023; 24:bbad329. [PMID: 37756592 DOI: 10.1093/bib/bbad329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 08/20/2023] [Accepted: 08/28/2023] [Indexed: 09/29/2023] Open
Abstract
The prediction of prognostic outcome is critical for the development of efficient cancer therapeutics and potential personalized medicine. However, due to the heterogeneity and diversity of multimodal data of cancer, data integration and feature selection remain a challenge for prognostic outcome prediction. We proposed a deep learning method with generative adversarial network based on sequential channel-spatial attention modules (CSAM-GAN), a multimodal data integration and feature selection approach, for accomplishing prognostic stratification tasks in cancer. Sequential channel-spatial attention modules equipped with an encoder-decoder are applied for the input features of multimodal data to accurately refine selected features. A discriminator network was proposed to make the generator and discriminator learning in an adversarial way to accurately describe the complex heterogeneous information of multiple modal data. We conducted extensive experiments with various feature selection and classification methods and confirmed that the CSAM-GAN via the multilayer deep neural network (DNN) classifier outperformed these baseline methods on two different multimodal data sets with miRNA expression, mRNA expression and histopathological image data: lower-grade glioma and kidney renal clear cell carcinoma. The CSAM-GAN via the multilayer DNN classifier bridges the gap between heterogenous multimodal data and prognostic outcome prediction.
Collapse
Affiliation(s)
- Mingguang Shi
- School of Electrical Engineering and Automation, Hefei University of Technology, Hefei, Anhui 230009, China
| | - Xuefeng Li
- School of Electrical Engineering and Automation, Hefei University of Technology, Hefei, Anhui 230009, China
| | - Mingna Li
- School of Electrical Engineering and Automation, Hefei University of Technology, Hefei, Anhui 230009, China
| | - Yichong Si
- School of Electrical Engineering and Automation, Hefei University of Technology, Hefei, Anhui 230009, China
| |
Collapse
|
6
|
Agarwal I, Fuller ZL, Myers SR, Przeworski M. Relating pathogenic loss-of-function mutations in humans to their evolutionary fitness costs. eLife 2023; 12:e83172. [PMID: 36648429 PMCID: PMC9937649 DOI: 10.7554/elife.83172] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 01/16/2023] [Indexed: 01/18/2023] Open
Abstract
Causal loss-of-function (LOF) variants for Mendelian and severe complex diseases are enriched in 'mutation intolerant' genes. We show how such observations can be interpreted in light of a model of mutation-selection balance and use the model to relate the pathogenic consequences of LOF mutations at present to their evolutionary fitness effects. To this end, we first infer posterior distributions for the fitness costs of LOF mutations in 17,318 autosomal and 679 X-linked genes from exome sequences in 56,855 individuals. Estimated fitness costs for the loss of a gene copy are typically above 1%; they tend to be largest for X-linked genes, whether or not they have a Y homolog, followed by autosomal genes and genes in the pseudoautosomal region. We compare inferred fitness effects for all possible de novo LOF mutations to those of de novo mutations identified in individuals diagnosed with one of six severe, complex diseases or developmental disorders. Probands carry an excess of mutations with estimated fitness effects above 10%; as we show by simulation, when sampled in the population, such highly deleterious mutations are typically only a couple of generations old. Moreover, the proportion of highly deleterious mutations carried by probands reflects the typical age of onset of the disease. The study design also has a discernible influence: a greater proportion of highly deleterious mutations is detected in pedigree than case-control studies, and for autism, in simplex than multiplex families and in female versus male probands. Thus, anchoring observations in human genetics to a population genetic model allows us to learn about the fitness effects of mutations identified by different mapping strategies and for different traits.
Collapse
Affiliation(s)
- Ipsita Agarwal
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
- Department of Statistics, University of OxfordOxfordUnited Kingdom
| | - Zachary L Fuller
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
| | - Simon R Myers
- Department of Statistics, University of OxfordOxfordUnited Kingdom
- The Wellcome Centre for Human Genetics, University of OxfordOxfordUnited Kingdom
| | - Molly Przeworski
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
- Department of Systems Biology, Columbia UniversityNew YorkUnited States
| |
Collapse
|
7
|
Porras-Quesada P, González-Cabezuelo JM, Sánchez-Conde V, Puche-Sanz I, Arenas-Rodríguez V, García-López C, Flores-Martín JF, Molina-Hernández JM, Álvarez-Cubero MJ, Martínez-González LJ, Vázquez-Alonso F. Role of IGF2 in the Study of Development and Evolution of Prostate Cancer. Front Genet 2022; 12:740641. [PMID: 35095996 PMCID: PMC8790605 DOI: 10.3389/fgene.2021.740641] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 11/29/2021] [Indexed: 11/13/2022] Open
Abstract
Prostate Cancer (PC) is commonly known as one of the most frequent tumors among males. A significant problem of this tumor is that in early stages most of the cases course as indolent forms, so an active surveillance will anticipate the appearance of aggressive stages. One of the main strategies in medical and biomedical research is to find non-invasive biomarkers for improving monitoring and performing a more precise follow-up of diseases like PC. Here we report the relevant role of IGF2 and miR-93-5p as non-invasive biomarker for PC. This event could improve current medical strategies in PC.
Collapse
Affiliation(s)
- P Porras-Quesada
- Centre for Genomics and Oncological Research: Pfizer, University of Granada, Andalusian Regional Government (GENYO), Granada, Spain
| | | | - V Sánchez-Conde
- Urology Department, University Hospital Virgen de las Nieves, Granada, Spain
| | - I Puche-Sanz
- Urology Department, University Hospital Virgen de las Nieves, Granada, Spain
| | - V Arenas-Rodríguez
- Centre for Genomics and Oncological Research: Pfizer, University of Granada, Andalusian Regional Government (GENYO), Granada, Spain
| | - C García-López
- Pathological Anatomy Service, University Hospital Virgen de las Nieves, Granada, Spain
| | | | | | - M J Álvarez-Cubero
- Centre for Genomics and Oncological Research: Pfizer, University of Granada, Andalusian Regional Government (GENYO), Granada, Spain.,Department of Biochemistry and Molecular Biology III, Faculty of Medicine, University of Granada, Granada, Spain.,Biosanitary Research Institute (ibs. GRANADA), University of Granada, Granada, Spain
| | - L J Martínez-González
- Centre for Genomics and Oncological Research: Pfizer, University of Granada, Andalusian Regional Government (GENYO), Granada, Spain
| | - F Vázquez-Alonso
- Urology Department, University Hospital Virgen de las Nieves, Granada, Spain
| |
Collapse
|
8
|
Zhao Y, Dong Y, Hong W, Jiang C, Yao K, Cheng C. Computational modeling of chromatin accessibility identified important epigenomic regulators. BMC Genomics 2022; 23:19. [PMID: 34996354 PMCID: PMC8742372 DOI: 10.1186/s12864-021-08234-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 12/03/2021] [Indexed: 11/28/2022] Open
Abstract
Chromatin accessibility is essential for transcriptional activation of genomic regions. It is well established that transcription factors (TFs) and histone modifications (HMs) play critical roles in chromatin accessibility regulation. However, there is a lack of studies that quantify these relationships. Here we constructed a two-layer model to predict chromatin accessibility by integrating DNA sequence, TF binding, and HM signals. By applying the model to two human cell lines (GM12878 and HepG2), we found that DNA sequences had limited power for accessibility prediction, while both TF binding and HM signals predicted chromatin accessibility with high accuracy. According to the HM model, HM features determined chromatin accessibility in a cell line shared manner, with the prediction power attributing to five core HM types. Results from the TF model indicated that chromatin accessibility was determined by a subset of informative TFs including both cell line-specific and generic TFs. The combined model of both TF and HM signals did not further improve the prediction accuracy, indicating that they provide redundant information in terms of chromatin accessibility prediction. The TFs and HM models can also distinguish the chromatin accessibility of proximal versus distal transcription start sites with high accuracy.
Collapse
Affiliation(s)
- Yanding Zhao
- Department of Medicine, Baylor College of Medicine, Room ICTR 100D, One Baylor Plaza, Baylor College of Medicine, Houston, TX, 77030, USA
- The Institute for Clinical and Translational Research, Baylor College of Medicine, Room ICTR 100D, One Baylor Plaza, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Yadong Dong
- Department of Medicine, Baylor College of Medicine, Room ICTR 100D, One Baylor Plaza, Baylor College of Medicine, Houston, TX, 77030, USA
- The Institute for Clinical and Translational Research, Baylor College of Medicine, Room ICTR 100D, One Baylor Plaza, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Wei Hong
- Department of Medicine, Baylor College of Medicine, Room ICTR 100D, One Baylor Plaza, Baylor College of Medicine, Houston, TX, 77030, USA
- The Institute for Clinical and Translational Research, Baylor College of Medicine, Room ICTR 100D, One Baylor Plaza, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Chongming Jiang
- Department of Medicine, Baylor College of Medicine, Room ICTR 100D, One Baylor Plaza, Baylor College of Medicine, Houston, TX, 77030, USA
- The Institute for Clinical and Translational Research, Baylor College of Medicine, Room ICTR 100D, One Baylor Plaza, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Kevin Yao
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, 77843, USA
| | - Chao Cheng
- Department of Medicine, Baylor College of Medicine, Room ICTR 100D, One Baylor Plaza, Baylor College of Medicine, Houston, TX, 77030, USA.
- The Institute for Clinical and Translational Research, Baylor College of Medicine, Room ICTR 100D, One Baylor Plaza, Baylor College of Medicine, Houston, TX, 77030, USA.
| |
Collapse
|
9
|
Deep Learning for Human Disease Detection, Subtype Classification, and Treatment Response Prediction Using Epigenomic Data. Biomedicines 2021; 9:biomedicines9111733. [PMID: 34829962 PMCID: PMC8615388 DOI: 10.3390/biomedicines9111733] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Revised: 10/26/2021] [Accepted: 11/17/2021] [Indexed: 12/25/2022] Open
Abstract
Deep learning (DL) is a distinct class of machine learning that has achieved first-class performance in many fields of study. For epigenomics, the application of DL to assist physicians and scientists in human disease-relevant prediction tasks has been relatively unexplored until very recently. In this article, we critically review published studies that employed DL models to predict disease detection, subtype classification, and treatment responses, using epigenomic data. A comprehensive search on PubMed, Scopus, Web of Science, Google Scholar, and arXiv.org was performed following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. Among 1140 initially identified publications, we included 22 articles in our review. DNA methylation and RNA-sequencing data are most frequently used to train the predictive models. The reviewed models achieved a high accuracy ranged from 88.3% to 100.0% for disease detection tasks, from 69.5% to 97.8% for subtype classification tasks, and from 80.0% to 93.0% for treatment response prediction tasks. We generated a workflow to develop a predictive model that encompasses all steps from first defining human disease-related tasks to finally evaluating model performance. DL holds promise for transforming epigenomic big data into valuable knowledge that will enhance the development of translational epigenomics.
Collapse
|
10
|
Abstract
Health is often qualitatively defined as a status free from disease and its quantitative definition requires finding the boundary separating health from pathological conditions. Since many complex diseases have a strong genetic component, substantial efforts have been made to sequence large-scale personal genomes; however, we are not yet able to effectively quantify health status from personal genomes. Since mutational impacts are ultimately manifested at the protein level, we envision that introducing a panoramic proteomic view of complex diseases will allow us to mechanistically understand the molecular etiologies of human diseases. In this perspective article, we will highlight key proteomic approaches to identify pathogenic mutations and map their convergent pathways underlying disease pathogenesis and the integration of omics data at multiple levels to define the borderline between health and disease.
Collapse
Affiliation(s)
- Mara Zilocchi
- Department of Biochemistry, University of Regina, Regina, Saskatchewan S4S 0A2, Canada
| | - Cheng Wang
- The Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, the Bakar Computational Health Sciences Institute, the Parker Institute for Cancer Immunotherapy, and the Department of Neurology, School of Medicine, University of California, San Francisco, CA, USA
| | - Mohan Babu
- Department of Biochemistry, University of Regina, Regina, Saskatchewan S4S 0A2, Canada
| | - Jingjing Li
- The Eli and Edythe Broad Center of Regeneration Medicine and Stem Cell Research, the Bakar Computational Health Sciences Institute, the Parker Institute for Cancer Immunotherapy, and the Department of Neurology, School of Medicine, University of California, San Francisco, CA, USA
| |
Collapse
|