1
|
Aktaş E, Özdemir Özgentürk N. A comprehensive examination of ACE2 receptor and prediction of spike glycoprotein and ACE2 interaction based on in silico analysis of ACE2 receptor. J Biomol Struct Dyn 2024; 42:4412-4428. [PMID: 37349943 DOI: 10.1080/07391102.2023.2220814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2023] [Accepted: 05/28/2023] [Indexed: 06/24/2023]
Abstract
The ACE2 receptor plays a vital role not only in the SARS-CoV-induced epidemic but also in various other diseases, including cardiovascular diseases and ARDS. While studies have explored the interactions between ACE2 and SARS-CoV proteins, comprehensive research utilizing bioinformatic tools on the ACE2 protein has been lacking. The one aim of present study was to extensively analyze the regions of the ACE2 protein. After utilizing all bioinformatics tools especially G104 and L108 regions on ACE2 were come forward. The results of our analysis revealed that possible mutations or deletions in the G104 and L108 regions play a critical role in both the biological functioning and the determination of the chemical-physical properties of ACE2. Additionally, these regions were found to be more susceptible to mutations or deletions compared to other regions of the ACE2 protein. Notably, the randomly selected peptide, LQQNGSSVLS (100-109), which includes G104 and L108, exhibited a crucial role in binding the RBD of the spike protein, as supported by docking scores. Furthermore, both MDs and iMODs results provided evidence that G104 and L108 influence the dynamics of ACE2-spike complexes. This study is expected to offer a new perspective on the ACE2-SARS-CoV interaction and other research areas where ACE2 plays a significant role, such as biotechnology (protein engineering, enzyme optimization), medicine (RAS, pulmonary and cardiac diseases), and basic research (structural motifs, stabilizing protein folds, or facilitating important inter molecular contacts, protein's proper structure and function).Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Emre Aktaş
- Faculty of Art and Science, Molecular Biology and Genetics, Yıldız Technical University, Istanbul, Turkey
| | - Nehir Özdemir Özgentürk
- Faculty of Art and Science, Molecular Biology and Genetics, Yıldız Technical University, Istanbul, Turkey
| |
Collapse
|
2
|
Ibrahim MB, Flanagan J, Ibrahim T, Rouleau E. Unraveling noncoding DNA variants and epimutations: a paradigm shift in hereditary cancer research. Future Oncol 2024:1-10. [PMID: 38722139 DOI: 10.2217/fon-2023-0665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 03/11/2024] [Indexed: 06/12/2024] Open
Abstract
Exhaustive efforts have been dedicated to uncovering genomic aberrations linked to cancer susceptibility. Noncoding sequence variants and epigenetic alterations significantly influence gene regulation and could contribute to cancer development. However, exploring noncoding regions in hereditary cancer susceptibility demands cutting-edge methodologies for functionally characterizing genomic discoveries. Additionally, comprehending the impact on cancer development of variants in noncoding DNA and the epigenome necessitates integrating diverse data through bioinformatic analyses. As novel technologies and analytical methods continue to advance, this realm of research is rapidly gaining traction. Within this mini-review, we delve into future research domains concerning aberrations in noncoding DNA regions, such as pseudoexons, promoter variants and cis-epimutations.
Collapse
Affiliation(s)
- Maria Baz Ibrahim
- Department of Oncogenetics & Tumor Biology, Paul Brousse Hospital, Villejuif, France
| | - James Flanagan
- Department of Surgery & Cancer, Ovarian Cancer Action Research Centre, Imperial College London, London, W12 8EE, UK
| | - Tony Ibrahim
- International Department of Medical Oncology, Gustave Roussy, 94805, Villejuif, France
| | - Etienne Rouleau
- Department of Biology & Pathology-Cancer Genetics Laboratory, Gustave Roussy, 94805, Villejuif, France
| |
Collapse
|
3
|
Rospo G, Chilà R, Matafora V, Basso V, Lamba S, Bartolini A, Bachi A, Di Nicolantonio F, Mondino A, Germano G, Bardelli A. Non-canonical antigens are the largest fraction of peptides presented by MHC class I in mismatch repair deficient murine colorectal cancer. Genome Med 2024; 16:15. [PMID: 38243308 PMCID: PMC10797964 DOI: 10.1186/s13073-023-01275-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 12/12/2023] [Indexed: 01/21/2024] Open
Abstract
BACKGROUND Immunotherapy based on checkpoint inhibitors is highly effective in mismatch repair deficient (MMRd) colorectal cancer (CRC). These tumors carry a high number of mutations, which are predicted to translate into a wide array of neoepitopes; however, a systematic classification of the neoantigen repertoire in MMRd CRC is lacking. Mass spectrometry peptidomics has demonstrated the existence of MHC class I associated peptides (MAPs) originating from non-coding DNA regions. Based on these premises we investigated DNA genomic regions responsible for generating MMRd-induced peptides. METHODS We exploited mouse CRC models in which the MMR gene Mlh1 was genetically inactivated. Isogenic cell lines CT26 Mlh1+/+ and Mlh1-/- were inoculated in immunocompromised and immunocompetent mice. Whole genome and RNA sequencing data were generated from samples obtained before and after injection in murine hosts. First, peptide databases were built from transcriptomes of isogenic cell lines. We then compiled a database of peptides lost after tumor cells injection in immunocompetent mice, likely due to immune editing. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) and matched next-generation sequencing databases were employed to identify the DNA regions from which the immune-targeted MAPs originated. Finally, we adopted in vitro T cell assays to verify whether MAP-specific T cells were part of the in vivo immune response against Mlh1-/- cells. RESULTS Whole genome sequencing analyses revealed an unbalanced distribution of immune edited alterations across the genome in Mlh1-/- cells grown in immunocompetent mice. Specifically, untranslated (UTR) and coding regions exhibited the largest fraction of mutations leading to highly immunogenic peptides. Moreover, the integrated computational and LC-MS/MS analyses revealed that MAPs originate mainly from atypical translational events in both Mlh1+/+ and Mlh1-/- tumor cells. In addition, mutated MAPs-derived from UTRs and out-of-frame translation of coding regions-were highly enriched in Mlh1-/- cells. The MAPs trigger T-cell activation in mice primed with Mlh1-/- cells. CONCLUSIONS Our results suggest that-in comparison to MMR proficient CRC-MMRd tumors generate a significantly higher number of non-canonical mutated peptides able to elicit T cell responses. These results reveal the importance of evaluating the diversity of neoepitope repertoire in MMRd tumors.
Collapse
Affiliation(s)
- Giuseppe Rospo
- Department of Oncology, Molecular Biotechnology Center, University of Torino, Turin, Italy
- Present address: Boehringer Ingelheim RCV GmbH & Co KG, Vienna, Austria
| | - Rosaria Chilà
- IFOM ETS - The AIRC Institute of Molecular Oncology, 20139, Milan, Italy
| | - Vittoria Matafora
- IFOM ETS - The AIRC Institute of Molecular Oncology, 20139, Milan, Italy
| | - Veronica Basso
- Lymphocyte Activation Unit, Division of Immunology, Transplantation and Infectious Diseases, IRCCS San Raffaele Scientific Institute Via Olgettina, 58, 20132, Milan, Italy
| | - Simona Lamba
- Department of Oncology, Molecular Biotechnology Center, University of Torino, Turin, Italy
| | - Alice Bartolini
- Candiolo Cancer Institute, FPO-IRCCS, 10060, Candiolo, TO, Italy
| | - Angela Bachi
- IFOM ETS - The AIRC Institute of Molecular Oncology, 20139, Milan, Italy
| | - Federica Di Nicolantonio
- Department of Oncology, Molecular Biotechnology Center, University of Torino, Turin, Italy
- Candiolo Cancer Institute, FPO-IRCCS, 10060, Candiolo, TO, Italy
| | - Anna Mondino
- Lymphocyte Activation Unit, Division of Immunology, Transplantation and Infectious Diseases, IRCCS San Raffaele Scientific Institute Via Olgettina, 58, 20132, Milan, Italy
| | - Giovanni Germano
- Department of Oncology, Molecular Biotechnology Center, University of Torino, Turin, Italy.
- IFOM ETS - The AIRC Institute of Molecular Oncology, 20139, Milan, Italy.
| | - Alberto Bardelli
- Department of Oncology, Molecular Biotechnology Center, University of Torino, Turin, Italy.
- IFOM ETS - The AIRC Institute of Molecular Oncology, 20139, Milan, Italy.
| |
Collapse
|
4
|
Louarn M, Collet G, Barré È, Fest T, Dameron O, Siegel A, Chatonnet F. Regulus infers signed regulatory relations from few samples' information using discretization and likelihood constraints. PLoS Comput Biol 2024; 20:e1011816. [PMID: 38252636 PMCID: PMC10833539 DOI: 10.1371/journal.pcbi.1011816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 02/01/2024] [Accepted: 01/08/2024] [Indexed: 01/24/2024] Open
Abstract
MOTIVATION Transcriptional regulation is performed by transcription factors (TF) binding to DNA in context-dependent regulatory regions and determines the activation or inhibition of gene expression. Current methods of transcriptional regulatory circuits inference, based on one or all of TF, regions and genes activity measurements require a large number of samples for ranking the candidate TF-gene regulation relations and rarely predict whether they are activations or inhibitions. We hypothesize that transcriptional regulatory circuits can be inferred from fewer samples by (1) fully integrating information on TF binding, gene expression and regulatory regions accessibility, (2) reducing data complexity and (3) using biology-based likelihood constraints to determine the global consistency between a candidate TF-gene relation and patterns of genes expressions and region activations, as well as qualify regulations as activations or inhibitions. RESULTS We introduce Regulus, a method which computes TF-gene relations from gene expressions, regulatory region activities and TF binding sites data, together with the genomic locations of all entities. After aggregating gene expressions and region activities into patterns, data are integrated into a RDF (Resource Description Framework) endpoint. A dedicated SPARQL (SPARQL Protocol and RDF Query Language) query retrieves all potential relations between expressed TF and genes involving active regulatory regions. These TF-region-gene relations are then filtered using biological likelihood constraints allowing to qualify them as activation or inhibition. Regulus provides signed relations consistent with public databases and, when applied to biological data, identifies both known and potential new regulators. Regulus is devoted to context-specific transcriptional circuits inference in human settings where samples are scarce and cell populations are closely related, using discretization into patterns and likelihood reasoning to decipher the most robust regulatory relations.
Collapse
Affiliation(s)
- Marine Louarn
- Univ Rennes, CNRS, Inria, IRISA - UMR 6074, Rennes, France
- UMR_S 1236, Université Rennes 1, INSERM, Etablissement Français du Sang, Rennes, France
| | | | - Ève Barré
- Univ Rennes, CNRS, Inria, IRISA - UMR 6074, Rennes, France
| | - Thierry Fest
- UMR_S 1236, Université Rennes 1, INSERM, Etablissement Français du Sang, Rennes, France
- Laboratoire d’Hématologie, Pôle de Biologie, CHU de Rennes, Rennes, France
| | | | - Anne Siegel
- Univ Rennes, CNRS, Inria, IRISA - UMR 6074, Rennes, France
| | - Fabrice Chatonnet
- UMR_S 1236, Université Rennes 1, INSERM, Etablissement Français du Sang, Rennes, France
- Laboratoire d’Hématologie, Pôle de Biologie, CHU de Rennes, Rennes, France
| |
Collapse
|
5
|
Li R, Wan D, Liang J, Liang H, Huang H, Li G. Pan-cancer analysis of promoter activity quantitative trait loci. NAR Cancer 2023; 5:zcad053. [PMID: 38023732 PMCID: PMC10644876 DOI: 10.1093/narcan/zcad053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 09/29/2023] [Accepted: 10/31/2023] [Indexed: 12/01/2023] Open
Abstract
Altered promoter activity has been generally observed in diverse biological processes, including tumorigenesis. Accumulating evidence suggests that employing a quantitative trait locus mapping approach is effective in comprehending the genetic basis of promoter activity. By utilizing genotype data from The Cancer Genome Atlas and calculating corresponding promoter activity values using proActiv, we systematically evaluated the impact of genetic variants on promoter activity and identified >1.0 million promoter activity quantitative trait loci (paQTLs) as both cis- and trans-acting. Additionally, leveraging data from the genome-wide association study (GWAS) catalog, we discovered >1.3 million paQTLs that overlap with known GWAS linkage disequilibrium regions. Remarkably, ∼9324 paQTLs exhibited significant associations with patient prognosis. Moreover, investigating the impact of promoter activity on >1000 imputed antitumor therapy responses among pan-cancer patients revealed >43 000 million significant associations. Furthermore, ∼25 000 significant associations were identified between promoter activity and immune cell abundance. Finally, a user-friendly data portal, Pancan-paQTL (https://www.hbpding.com/PancanPaQTL/), was constructed for users to browse, search and download data of interest. Pancan-paQTL serves as a comprehensive multidimensional database, enabling functional and clinical investigations into genetic variants associated with promoter activity, drug responses and immune infiltration across multiple cancer types.
Collapse
Affiliation(s)
- Ran Li
- Department of Neurosurgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, 430000, China
| | - Dongyi Wan
- Department of Nuclear Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, 430000, China
| | - Junnan Liang
- Hepatic Surgery Center and Hubei Key Laboratory of Hepato-Biliary-Pancreatic Diseases, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, 430000, China
| | - Huifang Liang
- Hepatic Surgery Center and Hubei Key Laboratory of Hepato-Biliary-Pancreatic Diseases, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, 430000, China
| | - Haohao Huang
- Department of Neurosurgery, General Hospital of Central Theatre Command of People’s Liberation Army, Wuhan, Hubei, 430000, China
| | - Ganxun Li
- Hepatic Surgery Center and Hubei Key Laboratory of Hepato-Biliary-Pancreatic Diseases, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, 430000, China
| |
Collapse
|
6
|
Peña-Martínez EG, Pomales-Matos DA, Rivera-Madera A, Messon-Bird JL, Medina-Feliciano JG, Sanabria-Alberto L, Barreiro-Rosario AC, Rivera-Del Valle J, Rodríguez-Ríos JM, Rodríguez-Martínez JA. Prioritizing cardiovascular disease-associated variants altering NKX2-5 and TBX5 binding through an integrative computational approach. J Biol Chem 2023; 299:105423. [PMID: 37926287 PMCID: PMC10750078 DOI: 10.1016/j.jbc.2023.105423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 10/18/2023] [Accepted: 10/26/2023] [Indexed: 11/07/2023] Open
Abstract
Cardiovascular diseases (CVDs) are the leading cause of death worldwide and are heavily influenced by genetic factors. Genome-wide association studies have mapped >90% of CVD-associated variants within the noncoding genome, which can alter the function of regulatory proteins, such as transcription factors (TFs). However, due to the overwhelming number of single-nucleotide polymorphisms (SNPs) (>500,000) in genome-wide association studies, prioritizing variants for in vitro analysis remains challenging. In this work, we implemented a computational approach that considers support vector machine (SVM)-based TF binding site classification and cardiac expression quantitative trait loci (eQTL) analysis to identify and prioritize potential CVD-causing SNPs. We identified 1535 CVD-associated SNPs within TF footprints and putative cardiac enhancers plus 14,218 variants in linkage disequilibrium with genotype-dependent gene expression in cardiac tissues. Using ChIP-seq data from two cardiac TFs (NKX2-5 and TBX5) in human-induced pluripotent stem cell-derived cardiomyocytes, we trained a large-scale gapped k-mer SVM model to identify CVD-associated SNPs that altered NKX2-5 and TBX5 binding. The model was tested by scoring human heart TF genomic footprints within putative enhancers and measuring in vitro binding through electrophoretic mobility shift assay. Five variants predicted to alter NKX2-5 (rs59310144, rs6715570, and rs61872084) and TBX5 (rs7612445 and rs7790964) binding were prioritized for in vitro validation based on the magnitude of the predicted change in binding and are in cardiac tissue eQTLs. All five variants altered NKX2-5 and TBX5 DNA binding. We present a bioinformatic approach that considers tissue-specific eQTL analysis and SVM-based TF binding site classification to prioritize CVD-associated variants for in vitro analysis.
Collapse
Affiliation(s)
- Edwin G Peña-Martínez
- Department of Biology, University of Puerto Rico Río Piedras Campus, San Juan, Puerto Rico
| | - Diego A Pomales-Matos
- Department of Biology, University of Puerto Rico Río Piedras Campus, San Juan, Puerto Rico
| | | | - Jean L Messon-Bird
- Department of Biology, University of Puerto Rico Río Piedras Campus, San Juan, Puerto Rico
| | | | | | | | | | | | | |
Collapse
|
7
|
Jiang Y, Zhang Y, Ju C, Zhang R, Li H, Chen F, Zhu Y, Shen S, Wei Y. A cross-disorder study to identify causal relationships, shared genetic variants, and genes across 21 digestive disorders. iScience 2023; 26:108238. [PMID: 37965154 PMCID: PMC10641500 DOI: 10.1016/j.isci.2023.108238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Revised: 06/26/2023] [Accepted: 10/13/2023] [Indexed: 11/16/2023] Open
Abstract
Digestive disorders are a significant contributor to the global burden of disease and seriously affect human quality of life. Research has already confirmed the presence of pleiotropic genetic loci among digestive disorders, and studies have explored shared genetic factors among pan-cancers, including various malignant digestive disorders. However, most cross-phenotype studies within the digestive tract system have been limited to a few traits, with no systematic coverage of common benign and malignant digestive disorders. Here, we analyzed data from the UK Biobank to investigate 21 digestive disorders, exploring the genetic correlations and causal relationships between diseases, as well as the common genetic factors and potential biological pathways driving these relationships. Our findings confirmed the extensive genetic correlation and causal relationship between digestive disorders, providing important insights into the genetic etiology, causality, disease prevention, and clinical treatment of diseases.
Collapse
Affiliation(s)
- Yue Jiang
- Clinical Stem Cell Center, The Affiliated Drum Tower Hospital of Nanjing University Medical School, Nanjing 210008, China
| | - Yihong Zhang
- Laboratory Medicine Center, The Second Affiliated Hospital of Nanjing Medical University, Nanjing 210011, China
| | - Can Ju
- Department of Biostatistics, School of Public Health, Center of Global Health, Nanjing Medical University, Nanjing 211166, China
| | - Ruyang Zhang
- Department of Biostatistics, School of Public Health, Center of Global Health, Nanjing Medical University, Nanjing 211166, China
| | - Hui Li
- Department of Gastroenterology, The Affiliated Hospital of Integrated Traditional Chinese and Western Medicine, Nanjing University of Chinese Medicine, Nanjing 210028, China
| | - Feng Chen
- Department of Biostatistics, School of Public Health, Center of Global Health, Nanjing Medical University, Nanjing 211166, China
| | - Yefei Zhu
- Laboratory Medicine Center, The Second Affiliated Hospital of Nanjing Medical University, Nanjing 210011, China
| | - Sipeng Shen
- Department of Biostatistics, School of Public Health, Center of Global Health, Nanjing Medical University, Nanjing 211166, China
| | - Yongyue Wei
- Peking University Center for Public Health and Epidemic Preparedness and Response, Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing 100191, China
| |
Collapse
|
8
|
Dong ZB, Xiang HT, Wu HM, Cai XL, Chen ZW, Chen SS, He YC, Li H, Yu WM, Liang C. LncRNA expression signature identified using genome-wide transcriptomic profiling to predict lymph node metastasis in patients with stage T1 and T2 gastric cancer. Gastric Cancer 2023; 26:947-957. [PMID: 37691031 PMCID: PMC10640531 DOI: 10.1007/s10120-023-01428-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 08/28/2023] [Indexed: 09/12/2023]
Abstract
BACKGROUND Lymph node (LN) status is vital to evaluate the curative potential of relatively early gastric cancer (GC; T1-T2) treatment (endoscopic or surgery). Currently, there is a lack of robust and convenient methods to identify LN metastasis before therapeutic decision-making. METHODS Genome-wide expression profiles of long noncoding RNA (lncRNA) in primary T1 gastric cancer data from The Cancer Genome Atlas (TCGA) was used to identify lncRNA expression signature capable of detecting LN metastasis of GC and establish a 10-lncRNA risk-prediction model based on deep learning. The performance of the lncRNA panel in diagnosing LN metastasis was evaluated both in silico and clinical validation methods. In silico validation was conducted using TCGA and Asian Cancer Research Group (ACRG) datasets. Clinical validation was performed on T1 and T2 patients, and the panel's efficacy was compared with that of traditional tumor markers and computed tomography (CT) scans. RESULTS Profiling of genome-wide RNA expression identified a panel of lncRNA to predict LN metastasis in T1 stage gastric cancer (AUC = 0.961). A 10-lncRNA risk-prediction model was then constructed, which was validated successfully in T1 and T2 datasets (TCGA, AUC = 0.852; ACRG, AUC = 0.834). Thereafter, the clinical performance of the lncRNA panel was validated in clinical cohorts (T1, AUC = 0.812; T2, AUC = 0.805; T1 + T2, AUC = 0.764). Notably, the panel demonstrated significantly better performance compared with CT and traditional tumor markers. CONCLUSIONS The novel 10-lncRNA could diagnose LN metastasis robustly in relatively early gastric cancer (T1-T2), with promising clinical potential.
Collapse
Affiliation(s)
- Zhe-Bin Dong
- Department of General Surgery, The Affiliated Lihuili Hospital, Ningbo University, 57 Xingning Road, Ningbo, 315000, People's Republic of China
| | - Han-Ting Xiang
- Department of General Surgery, The Affiliated Lihuili Hospital, Ningbo University, 57 Xingning Road, Ningbo, 315000, People's Republic of China
| | - Heng-Miao Wu
- Department of General Surgery, The Affiliated Lihuili Hospital, Ningbo University, 57 Xingning Road, Ningbo, 315000, People's Republic of China
| | - Xian-Lei Cai
- Department of General Surgery, The Affiliated Lihuili Hospital, Ningbo University, 57 Xingning Road, Ningbo, 315000, People's Republic of China
| | - Zheng-Wei Chen
- Department of General Surgery, The Affiliated Lihuili Hospital, Ningbo University, 57 Xingning Road, Ningbo, 315000, People's Republic of China
| | - Sang-Sang Chen
- Department of General Surgery, The Affiliated Lihuili Hospital, Ningbo University, 57 Xingning Road, Ningbo, 315000, People's Republic of China
| | - Yi-Chen He
- Department of General Surgery, The Affiliated Lihuili Hospital, Ningbo University, 57 Xingning Road, Ningbo, 315000, People's Republic of China
| | - Hong Li
- Department of General Surgery, The Affiliated Lihuili Hospital, Ningbo University, 57 Xingning Road, Ningbo, 315000, People's Republic of China
| | - Wei-Ming Yu
- Department of General Surgery, The Affiliated Lihuili Hospital, Ningbo University, 57 Xingning Road, Ningbo, 315000, People's Republic of China
| | - Chao Liang
- Department of General Surgery, The Affiliated Lihuili Hospital, Ningbo University, 57 Xingning Road, Ningbo, 315000, People's Republic of China.
| |
Collapse
|
9
|
Wang Z, Luo M, Liang Q, Zhao K, Hu Y, Wang W, Feng X, Hu B, Teng J, You T, Li R, Bao Z, Pan W, Yang T, Zhang C, Li T, Dong X, Yi X, Liu B, Zhao L, Li M, Chen K, Song W, Yang J, Li MJ. Landscape of enhancer disruption and functional screen in melanoma cells. Genome Biol 2023; 24:248. [PMID: 37904237 PMCID: PMC10614365 DOI: 10.1186/s13059-023-03087-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Accepted: 10/12/2023] [Indexed: 11/01/2023] Open
Abstract
BACKGROUND The high mutation rate throughout the entire melanoma genome presents a major challenge in stratifying true driver events from the background mutations. Numerous recurrent non-coding alterations, such as those in enhancers, can shape tumor evolution, thereby emphasizing the importance in systematically deciphering enhancer disruptions in melanoma. RESULTS Here, we leveraged 297 melanoma whole-genome sequencing samples to prioritize highly recurrent regions. By performing a genome-scale CRISPR interference (CRISPRi) screen on highly recurrent region-associated enhancers in melanoma cells, we identified 66 significant hits which could have tumor-suppressive roles. These functional enhancers show unique mutational patterns independent of classical significantly mutated genes in melanoma. Target gene analysis for the essential enhancers reveal many known and hidden mechanisms underlying melanoma growth. Utilizing extensive functional validation experiments, we demonstrate that a super enhancer element could modulate melanoma cell proliferation by targeting MEF2A, and another distal enhancer is able to sustain PTEN tumor-suppressive potential via long-range interactions. CONCLUSIONS Our study establishes a catalogue of crucial enhancers and their target genes in melanoma growth and progression, and illuminates the identification of novel mechanisms of dysregulation for melanoma driver genes and new therapeutic targeting strategies.
Collapse
Affiliation(s)
- Zhao Wang
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Institute of Aging, Key Laboratory of Alzheimer's Disease of Zhejiang Province, The Second Affiliated Hospital, Wenzhou Medical University, Wenzhou, China.
- Department of Epidemiology and Biostatistics, Tianjin Key Laboratory of Molecular Cancer Epidemiology, The Province and Ministry Co-Sponsored Collaborative Innovation Center for Medical Epigenetics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China.
| | - Menghan Luo
- Department of Epidemiology and Biostatistics, Tianjin Key Laboratory of Molecular Cancer Epidemiology, The Province and Ministry Co-Sponsored Collaborative Innovation Center for Medical Epigenetics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
- Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Qian Liang
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Institute of Aging, Key Laboratory of Alzheimer's Disease of Zhejiang Province, The Second Affiliated Hospital, Wenzhou Medical University, Wenzhou, China
- Department of Epidemiology and Biostatistics, Tianjin Key Laboratory of Molecular Cancer Epidemiology, The Province and Ministry Co-Sponsored Collaborative Innovation Center for Medical Epigenetics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
- Scientific Research Center, Wenzhou Medical University, Wenzhou, China
| | - Ke Zhao
- Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Yuelin Hu
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Institute of Aging, Key Laboratory of Alzheimer's Disease of Zhejiang Province, The Second Affiliated Hospital, Wenzhou Medical University, Wenzhou, China
| | - Wei Wang
- Department of Epidemiology and Biostatistics, Tianjin Key Laboratory of Molecular Cancer Epidemiology, The Province and Ministry Co-Sponsored Collaborative Innovation Center for Medical Epigenetics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| | - Xiangling Feng
- Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Bolang Hu
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Institute of Aging, Key Laboratory of Alzheimer's Disease of Zhejiang Province, The Second Affiliated Hospital, Wenzhou Medical University, Wenzhou, China
| | - Jianjin Teng
- Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Tianyi You
- Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Ran Li
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Institute of Aging, Key Laboratory of Alzheimer's Disease of Zhejiang Province, The Second Affiliated Hospital, Wenzhou Medical University, Wenzhou, China
| | - Zhengkai Bao
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Institute of Aging, Key Laboratory of Alzheimer's Disease of Zhejiang Province, The Second Affiliated Hospital, Wenzhou Medical University, Wenzhou, China
| | - Wenhao Pan
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Institute of Aging, Key Laboratory of Alzheimer's Disease of Zhejiang Province, The Second Affiliated Hospital, Wenzhou Medical University, Wenzhou, China
| | - Tielong Yang
- Department of Bone and Soft Tissue Tumor, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| | - Chao Zhang
- Department of Bone and Soft Tissue Tumor, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| | - Ting Li
- Department of Bone and Soft Tissue Tumor, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| | - Xiaobao Dong
- Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Xianfu Yi
- Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Ben Liu
- Department of Epidemiology and Biostatistics, Tianjin Key Laboratory of Molecular Cancer Epidemiology, The Province and Ministry Co-Sponsored Collaborative Innovation Center for Medical Epigenetics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| | - Li Zhao
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Miaoxin Li
- Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, China
| | - Kexin Chen
- Department of Epidemiology and Biostatistics, Tianjin Key Laboratory of Molecular Cancer Epidemiology, The Province and Ministry Co-Sponsored Collaborative Innovation Center for Medical Epigenetics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| | - Weihong Song
- Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Institute of Aging, Key Laboratory of Alzheimer's Disease of Zhejiang Province, The Second Affiliated Hospital, Wenzhou Medical University, Wenzhou, China.
| | - Jilong Yang
- Department of Bone and Soft Tissue Tumor, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China.
| | - Mulin Jun Li
- Department of Epidemiology and Biostatistics, Tianjin Key Laboratory of Molecular Cancer Epidemiology, The Province and Ministry Co-Sponsored Collaborative Innovation Center for Medical Epigenetics, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China.
- Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China.
| |
Collapse
|
10
|
Teh SSK, Bowland K, Bennett A, Halper-Stromberg E, Skaist A, Tang J, Cai F, Macoretta A, Liang H, Kamiyama H, Wheelan S, Lin MT, Hruban RH, Scharpf RB, Roberts NJ, Eshleman JR. CRISPR-Cas9 for selective targeting of somatic mutations in pancreatic cancers. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.15.537042. [PMID: 37131822 PMCID: PMC10153132 DOI: 10.1101/2023.04.15.537042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Somatic mutations are desirable targets for selective elimination of cancer, yet most are found within the noncoding regions. We propose a novel, cancer-specific killing approach using CRISPR-Cas9 which exploits the requirement of a protospacer adjacent motif (PAM) for Cas9 activity. Through whole genome sequencing (WGS) of paired tumor minus normal (T-N) samples from three pancreatic cancer patients (Panc480, Panc504, and Panc1002), we identified an average of 417 somatic PAMs per tumor produced from single base substitutions. We analyzed 591 paired T-N samples from The International Cancer Genome Consortium and discovered medians of ~455 somatic PAMs per tumor in pancreatic, ~2800 in lung, and ~3200 in esophageal cancer cohorts. Finally, we demonstrated >80% selective cell death of two targeted pancreatic cancer cell lines in co-cultures using 4-9 sgRNAs, targeting noncoding regions, designed from the somatic PAM discovery approach. We also showed no off-target activity from these tumor-specific sgRNAs through WGS.
Collapse
Affiliation(s)
- Selina Shiqing K. Teh
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Kirsten Bowland
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Alexis Bennett
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Eitan Halper-Stromberg
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Alyza Skaist
- Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Jacqueline Tang
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Fidel Cai
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Antonella Macoretta
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Hong Liang
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | | | - Sarah Wheelan
- Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Scientific Review Branch, National Human Genome Research Institute, Bethesda, MD, USA
| | - Ming-Tseh Lin
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Ralph H. Hruban
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Robert B. Scharpf
- Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Nicholas J. Roberts
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - James R. Eshleman
- Department of Pathology, The Sol Goldman Pancreatic Cancer Research Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Oncology, Sidney Kimmel Comprehensive Cancer Center, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
| |
Collapse
|
11
|
Zheng R, Gao F, Mao Z, Xiao Y, Yuan L, Huang Z, Lv Q, Qin C, Du M, Zhang Z, Wang M. LncRNA BCCE4 Genetically Enhances the PD-L1/PD-1 Interaction in Smoking-Related Bladder Cancer by Modulating miR-328-3p-USP18 Signaling. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2023; 10:e2303473. [PMID: 37705121 PMCID: PMC10602555 DOI: 10.1002/advs.202303473] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 07/28/2023] [Indexed: 09/15/2023]
Abstract
Identification of cancer-associated variants, especially those in functional regions of long noncoding RNAs (lncRNAs), has become an essential task in tumor etiology. However, the genetic function of lncRNA variants involved in bladder cancer susceptibility remains poorly understood. Herein, it is identified that the rs62483508 G > A variant in microRNA response elements (MREs) of lncRNA Bladder cancer Cell Cytoplasm-Enriched abundant transcript 4 (BCCE4) is significantly associated with decreased bladder cancer risk (odds ratio = 0.84, P = 7.33 × 10-8 ) in the Chinese population (3603 cases and 4986 controls) but not in the European population. The protective genetic effect of the rs62483508 A allele is found in smokers or cigarette smoke-related carcinogen 4-aminobiphenyl (4-ABP) exposure. Subsequent biological experiments reveal that the A allele of rs62483508 disrupts the binding affinity of miR-328-3p to facilitate USP18 from miRNA-mediated degradation and thus specifically attenuates the downstream PD-L1/PD-1 interaction. LncRNA BCCE4 is also enriched in exosomes from bladder cancer plasma, tissues, and cells. This comprehensive study clarifies the genetic mechanism of lncRNA BCCE4 in bladder cancer susceptibility and its role in the regulation of the immune response in tumorigenesis. The findings provide a valuable predictor of bladder cancer risk that can facilitate diagnosis and prevention.
Collapse
Affiliation(s)
- Rui Zheng
- Department of Environmental GenomicsJiangsu Key Laboratory of Cancer BiomarkersPrevention and TreatmentCollaborative Innovation Center for Cancer Personalized MedicineNanjing Medical UniversityNanjing211166China
- Department of Genetic ToxicologyThe Key Laboratory of Modern Toxicology of Ministry of EducationCenter for Global HealthSchool of Public HealthNanjing Medical UniversityNanjing211166China
| | - Fang Gao
- Department of Environmental GenomicsJiangsu Key Laboratory of Cancer BiomarkersPrevention and TreatmentCollaborative Innovation Center for Cancer Personalized MedicineNanjing Medical UniversityNanjing211166China
- Key Laboratory of Environmental Medicine EngineeringMinistry of Education of ChinaSchool of Public HealthSoutheast UniversityNanjing210009China
| | - Zhenguang Mao
- Department of Environmental GenomicsJiangsu Key Laboratory of Cancer BiomarkersPrevention and TreatmentCollaborative Innovation Center for Cancer Personalized MedicineNanjing Medical UniversityNanjing211166China
- Department of Genetic ToxicologyThe Key Laboratory of Modern Toxicology of Ministry of EducationCenter for Global HealthSchool of Public HealthNanjing Medical UniversityNanjing211166China
| | - Yanping Xiao
- Department of Environmental GenomicsJiangsu Key Laboratory of Cancer BiomarkersPrevention and TreatmentCollaborative Innovation Center for Cancer Personalized MedicineNanjing Medical UniversityNanjing211166China
- Department of Genetic ToxicologyThe Key Laboratory of Modern Toxicology of Ministry of EducationCenter for Global HealthSchool of Public HealthNanjing Medical UniversityNanjing211166China
| | - Lin Yuan
- Department of UrologyJiangsu Province Hospital of TCMNanjing210029China
- Department of Integrated Traditional Chinese and Western Medicine Tumor Research LabNanjing210028China
| | - Zhengkai Huang
- Department of UrologyThe First Affiliated Hospital of Nanjing Medical UniversityNanjing210029China
| | - Qiang Lv
- Department of UrologyThe First Affiliated Hospital of Nanjing Medical UniversityNanjing210029China
| | - Chao Qin
- Department of UrologyThe First Affiliated Hospital of Nanjing Medical UniversityNanjing210029China
| | - Mulong Du
- Department of Environmental GenomicsJiangsu Key Laboratory of Cancer BiomarkersPrevention and TreatmentCollaborative Innovation Center for Cancer Personalized MedicineNanjing Medical UniversityNanjing211166China
- Department of BiostatisticsCenter for Global HealthSchool of Public HealthNanjing Medical UniversityNanjing211166China
| | - Zhengdong Zhang
- Department of Environmental GenomicsJiangsu Key Laboratory of Cancer BiomarkersPrevention and TreatmentCollaborative Innovation Center for Cancer Personalized MedicineNanjing Medical UniversityNanjing211166China
- Department of Genetic ToxicologyThe Key Laboratory of Modern Toxicology of Ministry of EducationCenter for Global HealthSchool of Public HealthNanjing Medical UniversityNanjing211166China
- Institute of Clinical ResearchThe Affiliated Taizhou People's Hospital of NanjingMedical UniversityTaizhou225300China
| | - Meilin Wang
- Department of Environmental GenomicsJiangsu Key Laboratory of Cancer BiomarkersPrevention and TreatmentCollaborative Innovation Center for Cancer Personalized MedicineNanjing Medical UniversityNanjing211166China
- Department of Genetic ToxicologyThe Key Laboratory of Modern Toxicology of Ministry of EducationCenter for Global HealthSchool of Public HealthNanjing Medical UniversityNanjing211166China
- The Affiliated Suzhou Hospital of Nanjing Medical UniversitySuzhou Municipal HospitalGusu SchoolNanjing Medical UniversitySuzhou215008China
| |
Collapse
|
12
|
Aldea M, Friboulet L, Apcher S, Jaulin F, Mosele F, Sourisseau T, Soria JC, Nikolaev S, André F. Precision medicine in the era of multi-omics: can the data tsunami guide rational treatment decision? ESMO Open 2023; 8:101642. [PMID: 37769400 PMCID: PMC10539962 DOI: 10.1016/j.esmoop.2023.101642] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 08/14/2023] [Indexed: 09/30/2023] Open
Abstract
Precision medicine for cancer is rapidly moving to an approach that integrates multiple dimensions of the biology in order to model mechanisms of cancer progression in each patient. The discovery of multiple drivers per tumor challenges medical decision that faces several treatment options. Drug sensitivity depends on the actionability of the target, its clonal or subclonal origin and coexisting genomic alterations. Sequencing has revealed a large diversity of drivers emerging at treatment failure, which are potential targets for clinical trials or drug repurposing. To effectively prioritize therapies, it is essential to rank genomic alterations based on their proven actionability. Moving beyond primary drivers, the future of precision medicine necessitates acknowledging the intricate spatial and temporal heterogeneity inherent in cancer. The advent of abundant complex biological data will make artificial intelligence algorithms indispensable for thorough analysis. Here, we will discuss the advancements brought by the use of high-throughput genomics, the advantages and limitations of precision medicine studies and future perspectives in this field.
Collapse
Affiliation(s)
- M Aldea
- Department of Medical Oncology, Gustave Roussy, Villejuif; PRISM, INSERM, Gustave Roussy, Villejuif.
| | | | - S Apcher
- PRISM, INSERM, Gustave Roussy, Villejuif
| | - F Jaulin
- PRISM, INSERM, Gustave Roussy, Villejuif
| | - F Mosele
- Department of Medical Oncology, Gustave Roussy, Villejuif; PRISM, INSERM, Gustave Roussy, Villejuif
| | | | - J-C Soria
- Paris Saclay University, Orsay; Drug Development Department, Gustave Roussy, Villejuif, France
| | - S Nikolaev
- PRISM, INSERM, Gustave Roussy, Villejuif
| | - F André
- Department of Medical Oncology, Gustave Roussy, Villejuif; PRISM, INSERM, Gustave Roussy, Villejuif; Paris Saclay University, Orsay
| |
Collapse
|
13
|
Xu D, Forbes AN, Cohen S, Palladino A, Karadimitriou T, Khurana E. Recapitulation of patient-specific 3D chromatin conformation using machine learning. CELL REPORTS METHODS 2023; 3:100578. [PMID: 37673071 PMCID: PMC10545938 DOI: 10.1016/j.crmeth.2023.100578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 04/05/2023] [Accepted: 08/10/2023] [Indexed: 09/08/2023]
Abstract
Regulatory networks containing enhancer-gene edges define cellular states. Multiple efforts have revealed these networks for reference tissues and cell lines by integrating multi-omics data. However, the methods developed cannot be applied for large patient cohorts due to the infeasibility of chromatin immunoprecipitation sequencing (ChIP-seq) for limited biopsy material. We trained machine-learning models using chromatin interaction analysis with paired-end tag sequencing (ChIA-PET) and high-throughput chromosome conformation capture combined with chromatin immunoprecipitation (HiChIP) data that can predict connections using only assay for transposase-accessible chromatin using sequencing (ATAC-seq) and RNA-seq data as input, which can be generated from biopsies. Our method overcomes limitations of correlation-based approaches that cannot distinguish between distinct target genes of given enhancers or between active vs. poised states in different samples, a hallmark of network rewiring in cancer. Application of our model on 371 samples across 22 cancer types revealed 1,780 enhancer-gene connections for 602 cancer genes. Using CRISPR interference (CRISPRi), we validated enhancers predicted to regulate ESR1 in estrogen receptor (ER)+ breast cancer and A1CF in liver hepatocellular carcinoma.
Collapse
Affiliation(s)
- Duo Xu
- Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA; Institute for Computational Biomedicine, Weill Cornell Medical College, New York, NY, USA; Department of Physiology and Biophysics, Weill Cornell Medical College, New York, NY, USA; Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Andre Neil Forbes
- Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA; Institute for Computational Biomedicine, Weill Cornell Medical College, New York, NY, USA; Weill Cornell Graduate School of Medical Sciences, Weill Cornell Medicine, New York, NY, USA
| | - Sandra Cohen
- Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Ann Palladino
- Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA; Institute for Computational Biomedicine, Weill Cornell Medical College, New York, NY, USA
| | | | - Ekta Khurana
- Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA; Institute for Computational Biomedicine, Weill Cornell Medical College, New York, NY, USA; Department of Physiology and Biophysics, Weill Cornell Medical College, New York, NY, USA; Englander Institute for Precision Medicine, Weill Cornell Medicine, New York, NY, USA.
| |
Collapse
|
14
|
Ying P, Chen C, Lu Z, Chen S, Zhang M, Cai Y, Zhang F, Huang J, Fan L, Ning C, Li Y, Wang W, Geng H, Liu Y, Tian W, Yang Z, Liu J, Huang C, Yang X, Xu B, Li H, Zhu X, Li N, Li B, Wei Y, Zhu Y, Tian J, Miao X. Genome-wide enhancer-gene regulatory maps link causal variants to target genes underlying human cancer risk. Nat Commun 2023; 14:5958. [PMID: 37749132 PMCID: PMC10520073 DOI: 10.1038/s41467-023-41690-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 09/14/2023] [Indexed: 09/27/2023] Open
Abstract
Genome-wide association studies have identified numerous variants associated with human complex traits, most of which reside in the non-coding regions, but biological mechanisms remain unclear. However, assigning function to the non-coding elements is still challenging. Here we apply Activity-by-Contact (ABC) model to evaluate enhancer-gene regulation effect by integrating multi-omics data and identified 544,849 connections across 20 cancer types. ABC model outperforms previous approaches in linking regulatory variants to target genes. Furthermore, we identify over 30,000 enhancer-gene connections in colorectal cancer (CRC) tissues. By integrating large-scale population cohorts (23,813 cases and 29,973 controls) and multipronged functional assays, we demonstrate an ABC regulatory variant rs4810856 associated with CRC risk (Odds Ratio = 1.11, 95%CI = 1.05-1.16, P = 4.02 × 10-5) by acting as an allele-specific enhancer to distally facilitate PREX1, CSE1L and STAU1 expression, which synergistically activate p-AKT signaling. Our study provides comprehensive regulation maps and illuminates a single variant regulating multiple genes, providing insights into cancer etiology.
Collapse
Grants
- Distinguished Young Scholars of China (NSFC-81925032), Key Program of National Natural Science Foundation of China (NSFC-82130098), the Fundamental Research Funds for the Central Universities (2042022rc0026, 2042023kf1005),Knowledge Innovation Program of Wuhan (2023020201010060).
- Youth Program of National Natural Science Foundation of China (NSFC-82003547), Program of Health Commission of Hubei Province (WJ2023M045) and Fundamental Research Funds for the Central Universities (WHU: 2042022kf1031).
- The National Science Fund for Excellent Young Scholars (NSFC-82322058), Program of National Natural Science Foundation of China (NSFC-82103929, NSFC-82273713), Young Elite Scientists Sponsorship Program by cst(2022QNRC001), National Science Fund for Distinguished Young Scholars of Hubei Province of China (2023AFA046), Fundamental Research Funds for the Central Universities (WHU:2042022kf1205) and Knowledge Innovation Program of Wuhan (whkxjsj011, 2023020201010073).
Collapse
Affiliation(s)
- Pingting Ying
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
- Department of Gastrointestinal Oncology, Zhongnan Hospital of Wuhan University, Wuhan, 430071, China
- Department of Radiation Oncology, Renmin Hospital of Wuhan University, Wuhan, 430071, China
| | - Can Chen
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
- Department of Gastrointestinal Oncology, Zhongnan Hospital of Wuhan University, Wuhan, 430071, China
- Department of Radiation Oncology, Renmin Hospital of Wuhan University, Wuhan, 430071, China
| | - Zequn Lu
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
- Department of Gastrointestinal Oncology, Zhongnan Hospital of Wuhan University, Wuhan, 430071, China
- Department of Radiation Oncology, Renmin Hospital of Wuhan University, Wuhan, 430071, China
| | - Shuoni Chen
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Ming Zhang
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Yimin Cai
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Fuwei Zhang
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Jinyu Huang
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Linyun Fan
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Caibo Ning
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Yanmin Li
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Wenzhuo Wang
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Hui Geng
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Yizhuo Liu
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Wen Tian
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Zhiyong Yang
- Department of Hepatobiliary and Pancreatic Surgery, Zhongnan Hospital of Wuhan University, Wuhan, 430071, China
| | - Jiuyang Liu
- Department of Gastrointestinal Surgery, Zhongnan Hospital of Wuhan University, Wuhan University, Wuhan, 430071, China
| | - Chaoqun Huang
- Department of Gastrointestinal Surgery, Zhongnan Hospital of Wuhan University, Wuhan University, Wuhan, 430071, China
| | - Xiaojun Yang
- Department of Gastrointestinal Surgery, Zhongnan Hospital of Wuhan University, Wuhan University, Wuhan, 430071, China
| | - Bin Xu
- Cancer Center, Renmin Hospital of Wuhan University, Wuhan University, Wuhan, 430060, China
| | - Heng Li
- Department of Urology, Tongji Hospital of Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China
| | - Xu Zhu
- Department of Gastrointestinal Surgery, Renmin Hospital of Wuhan University, Wuhan, 430071, China
| | - Ni Li
- Office of Cancer Screening, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100021, China
| | - Bin Li
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Yongchang Wei
- Department of Gastrointestinal Oncology, Hubei Cancer Clinical Study Center, Zhongnan Hospital of Wuhan University, Wuhan, 430071, China
| | - Ying Zhu
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China
| | - Jianbo Tian
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China.
- Department of Gastrointestinal Oncology, Zhongnan Hospital of Wuhan University, Wuhan, 430071, China.
- Department of Radiation Oncology, Renmin Hospital of Wuhan University, Wuhan, 430071, China.
| | - Xiaoping Miao
- Department of Epidemiology and Biostatistics, School of Public Health, Wuhan University, Wuhan, 430071, China.
- Department of Gastrointestinal Oncology, Zhongnan Hospital of Wuhan University, Wuhan, 430071, China.
- Department of Radiation Oncology, Renmin Hospital of Wuhan University, Wuhan, 430071, China.
- Department of Epidemiology and Biostatistics, School of Public Health, Tongji Medical College, Huazhong University of Sciences and Technology, Wuhan, 430030, China.
| |
Collapse
|
15
|
Cankara F, Doğan T. ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations. Comput Struct Biotechnol J 2023; 21:4743-4758. [PMID: 37822561 PMCID: PMC10562615 DOI: 10.1016/j.csbj.2023.09.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Revised: 09/15/2023] [Accepted: 09/15/2023] [Indexed: 10/13/2023] Open
Abstract
Background Genomic variations may cause deleterious effects on protein functionality and perturb biological processes. Elucidating the effects of variations is critical for developing novel treatment strategies for diseases of genetic origin. Computational approaches have been aiding the work in this field by modeling and analyzing the mutational landscape. However, new approaches are required, especially for accurate representation and data-centric analysis of sequence variations. Method In this study, we propose ASCARIS (Annotation and StruCture-bAsed RepresentatIon of Single amino acid variations), a method for the featurization (i.e., quantitative representation) of single amino acid variations (SAVs), which could be used for a variety of purposes, such as predicting their functional effects or building multi-omics-based integrative models. ASCARIS utilizes the direct and spatial correspondence between the location of the SAV on the sequence/structure and 30 different types of positional feature annotations (e.g., active/lipidation/glycosylation sites; calcium/metal/DNA binding, inter/transmembrane regions, etc.), along with structural features and physicochemical properties. The main novelty of this method lies in constructing reusable numerical representations of SAVs via functional annotations. Results We statistically analyzed the relationship between these features and the consequences of variations and found that each carries information in this regard. To investigate potential applications of ASCARIS, we trained variant effect prediction models that utilize our SAV representations as input. We carried out an ablation study and a comparison against the state-of-the-art methods and observed that ASCARIS has a competing and complementary performance against widely-used predictors. ASCARIS can be used alone or in combination with other approaches to represent SAVs from a functional perspective. ASCARIS is available as a programmatic tool at https://github.com/HUBioDataLab/ASCARIS and as a web-service at https://huggingface.co/spaces/HUBioDataLab/ASCARIS.
Collapse
Affiliation(s)
- Fatma Cankara
- Biological Data Science Laboratory, Dept. of Computer Engineering, Hacettepe University, Ankara, Turkey
- Department of Health Informatics, Graduate School of Informatics, METU, Ankara, Turkey
- Department of Computational Sciences and Engineering, Koc University, Istanbul, Turkey
| | - Tunca Doğan
- Biological Data Science Laboratory, Dept. of Computer Engineering, Hacettepe University, Ankara, Turkey
- Institute of Informatics, Hacettepe University, Ankara, Turkey
- Department of Bioinformatics, Graduate School of Health Sciences, Hacettepe University, Ankara, Turkey
| |
Collapse
|
16
|
Wu P, Wang W. Distinct 3D contacts and phenotypic consequences of adjacent non-coding loci in the epigenetically quiescent regions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.11.557110. [PMID: 37745584 PMCID: PMC10515877 DOI: 10.1101/2023.09.11.557110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Non-coding regions of the human genome are important for functional regulations, but their mechanisms remain elusive. We used machine learning to guide a CRISPR screening on hubs (i.e. non-coding loci forming many 3D contacts) and significantly increased the discovery rate of hubs essential for cell growth. We found no clear genetic or epigenetic differences between essential and nonessential hubs, but we observed that some neighboring hubs in the linear genome have distinct spatial contacts and opposite effects on cell growth. One such pair in an epigenetically quiescent region showed different impacts on gene expression, chromatin accessibility and chromatin organization. We also found that deleting the essential hub altered the genetic network activity and increased the entropy of chromatin accessibility, more severe than that caused by deletion of the nonessential hub, suggesting that they are critical for maintaining an ordered chromatin structure. Our study reveals new insights into the system-level roles of non-coding regions in the human genome.
Collapse
Affiliation(s)
- Peiyao Wu
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA 92093-0359
| | - Wei Wang
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA 92093-0359
- Bioinformatics and Systems Biology program, University of California, San Diego, La Jolla, CA 92093-0359
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA 92093-0359
| |
Collapse
|
17
|
Peña-Martínez EG, Pomales-Matos DA, Rivera-Madera A, Messon-Bird JL, Medina-Feliciano JG, Sanabria-Alberto L, Barreiro-Rosario AC, Rodriguez-Rios JM, Rodríguez-Martínez JA. Prioritizing Cardiovascular Disease-Associated Variants Altering NKX2-5 Binding through an Integrative Computational Approach. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.09.01.23294951. [PMID: 37693486 PMCID: PMC10491373 DOI: 10.1101/2023.09.01.23294951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Cardiovascular diseases (CVDs) are the leading cause of death worldwide and are heavily influenced by genetic factors. Genome-wide association studies (GWAS) have mapped > 90% of CVD-associated variants within the non-coding genome, which can alter the function of regulatory proteins, like transcription factors (TFs). However, due to the overwhelming number of GWAS single nucleotide polymorphisms (SNPs) (>500,000), prioritizing variants for in vitro analysis remains challenging. In this work, we implemented a computational approach that considers support vector machine (SVM)-based TF binding site classification and cardiac expression quantitative trait loci (eQTL) analysis to identify and prioritize potential CVD-causing SNPs. We identified 1,535 CVD-associated SNPs that occur within human heart footprints/enhancers and 9,309 variants in linkage disequilibrium (LD) with differential gene expression profiles in cardiac tissue. Using hiPSC-CM ChIP-seq data from NKX2-5 and TBX5, two cardiac TFs essential for proper heart development, we trained a large-scale gapped k-mer SVM (LS-GKM-SVM) predictive model that can identify binding sites altered by CVD-associated SNPs. The computational predictive model was tested by scoring human heart footprints and enhancers in vitro through electrophoretic mobility shift assay (EMSA). Three variants (rs59310144, rs6715570, and rs61872084) were prioritized for in vitro validation based on their eQTL in cardiac tissue and LS-GKM-SVM prediction to alter NKX2-5 DNA binding. All three variants altered NKX2-5 DNA binding. In summary, we present a bioinformatic approach that considers tissue-specific eQTL analysis and SVM-based TF binding site classification to prioritize CVD-associated variants for in vitro experimental analysis.
Collapse
|
18
|
Ruan Y, Xie L, Zou A. Association of CDKN2A/B mutations, PD-1, and PD-L1 with the risk of acute lymphoblastic leukemia in children. J Cancer Res Clin Oncol 2023; 149:10841-10850. [PMID: 37314514 PMCID: PMC10423156 DOI: 10.1007/s00432-023-04974-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Accepted: 06/04/2023] [Indexed: 06/15/2023]
Abstract
PURPOSE Currently, the significance of CDKN2A/B mutations in the pathogenesis and prognosis of acute lymphoblastic leukemia (ALL) is inconclusive. In this study, we analyzed the genetic and clinical features of children with CDKN2A/B mutations in ALL. In addition, we evaluated the expression and significance of programmed cell death protein 1 (PD-1) and programmed cell death ligand 1 (PD-L1) in serum and explored their role in the susceptibility of childhood ALL. METHODS We sequenced CDKN2A/B in the peripheral blood of 120 children with ALL and 100 healthy children with physical examination. The levels of CD4+ T, CD8+ T, and NK cells were measured by flow cytometry (FCM). Furthermore, the expression of PD-1 and PD-L1 was detected by ELISA. RESULTS We found 32 cases of CDKN2A rs3088440 and 11 of CDKN2B rs2069426 in 120 ALL children. Children with ALL in the CDKN2A rs3088440 were more likely to have hepatosplenomegaly (P = 0.019) and high risk (P = 0.014) than the wild group. In contrast, CDKN2B rs2069426 was more likely to develop lymph node metastasis (P = 0.017). The level of PD-L1 in the serum of ALL children was significantly higher than that of the control group, and there was no significant difference in PD-1 (P < 0.001). Additionally, children with CDKN2A rs3088440 had reduced CD8+ T cell counts than the wild group (P = 0.039). CONCLUSION CDKN2A rs3088440 and CDKN2B rs2069426 may be related to the occurrence and development of ALL in Chinese children. Additionally, PD-1/PD-L1 may be involved in the immune escape process of ALL, which is expected to become a new target for the treatment of the disease.
Collapse
Affiliation(s)
- Yang Ruan
- Department of Laboratory Medicine, Hunan Children's Hospital, Changsha, 410007, China.
| | - Longlong Xie
- Pediatrics Research Institute of Hunan Province, Hunan Children's Hospital, Changsha, 410007, China
| | - Aijun Zou
- Department of Laboratory Medicine, Hunan Children's Hospital, Changsha, 410007, China
| |
Collapse
|
19
|
Yang M, Ali O, Bjørås M, Wang J. Identifying functional regulatory mutation blocks by integrating genome sequencing and transcriptome data. iScience 2023; 26:107266. [PMID: 37520692 PMCID: PMC10371843 DOI: 10.1016/j.isci.2023.107266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 04/05/2023] [Accepted: 06/28/2023] [Indexed: 08/01/2023] Open
Abstract
Millions of single nucleotide variants (SNVs) exist in the human genome; however, it remains challenging to identify functional SNVs associated with diseases. We propose a non-encoding SNVs analysis tool bpb3, BayesPI-BAR version 3, aiming to identify the functional mutation blocks (FMBs) by integrating genome sequencing and transcriptome data. The identified FMBs display high frequency SNVs, significant changes in transcription factors (TFs) binding affinity and are nearby the regulatory regions of differentially expressed genes. A two-level Bayesian approach with a biophysical model for protein-DNA interactions is implemented, to compute TF-DNA binding affinity changes based on clustered position weight matrices (PWMs) from over 1700 TF-motifs. The epigenetic data, such as the DNA methylome can also be integrated to scan FMBs. By testing the datasets from follicular lymphoma and melanoma, bpb3 automatically and robustly identifies FMBs, demonstrating that bpb3 can provide insight into patho-mechanisms, and therapeutic targets from transcriptomic and genomic data.
Collapse
Affiliation(s)
- Mingyi Yang
- Department of Microbiology, Oslo University Hospital and University of Oslo, Oslo, Norway
- Department of Medical Biochemistry, Oslo University Hospital and University of Oslo, Oslo, Norway
| | - Omer Ali
- Department of Pathology, Oslo University Hospital - Norwegian Radium Hospital, Oslo, Norway
- Faculty of Medicine, University of Oslo, Oslo, Norway
| | - Magnar Bjørås
- Department of Microbiology, Oslo University Hospital and University of Oslo, Oslo, Norway
- Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway
| | - Junbai Wang
- Department of Clinical Molecular Biology (EpiGen), Akershus University Hospital and University of Oslo, Lørenskog, Norway
| |
Collapse
|
20
|
Sinkala M. Mutational landscape of cancer-driver genes across human cancers. Sci Rep 2023; 13:12742. [PMID: 37550388 PMCID: PMC10406856 DOI: 10.1038/s41598-023-39608-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 07/27/2023] [Indexed: 08/09/2023] Open
Abstract
The genetic mutations that contribute to the transformation of healthy cells into cancerous cells have been the subject of extensive research. The molecular aberrations that lead to cancer development are often characterised by gain-of-function or loss-of-function mutations in a variety of oncogenes and tumour suppressor genes. In this study, we investigate the genomic sequences of 20,331 primary tumours representing 41 distinct human cancer types to identify and catalogue the driver mutations present in 727 known cancer genes. Our findings reveal significant variations in the frequency of cancer gene mutations across different cancer types and highlight the frequent involvement of tumour suppressor genes (94%), oncogenes (93%), transcription factors (72%), kinases (64%), cell surface receptors (63%), and phosphatases (22%), in cancer. Additionally, our analysis reveals that cancer gene mutations are predominantly co-occurring rather than exclusive in all types of cancer. Notably, we discover that patients with tumours displaying different combinations of gene mutation patterns tend to exhibit variable survival outcomes. These findings provide new insights into the genetic landscape of cancer and bring us closer to a comprehensive understanding of the underlying mechanisms driving the development of various forms of cancer.
Collapse
Affiliation(s)
- Musalula Sinkala
- Department of Biomedical Sciences, School of Health Sciences, University of Zambia, Lusaka, Zambia.
- Computational Biology Division, Faculty of Health Sciences, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Cape Town, South Africa.
| |
Collapse
|
21
|
Sakellaropoulos T, Do C, Jiang G, Cova G, Meyn P, Dimartino D, Ramaswami S, Heguy A, Tsirigos A, Skok JA. MethNet: a robust approach to identify regulatory hubs and their distal targets in cancer. RESEARCH SQUARE 2023:rs.3.rs-3150386. [PMID: 37577603 PMCID: PMC10418566 DOI: 10.21203/rs.3.rs-3150386/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
Aberrations in the capacity of DNA/chromatin modifiers and transcription factors to bind non-coding regions can lead to changes in gene regulation and impact disease phenotypes. However, identifying distal regulatory elements and connecting them with their target genes remains challenging. Here, we present MethNet, a pipeline that integrates large-scale DNA methylation and gene expression data across multiple cancers, to uncover novel cis regulatory elements (CREs) in a 1Mb region around every promoter in the genome. MethNet identifies clusters of highly ranked CREs, referred to as 'hubs', which contribute to the regulation of multiple genes and significantly affect patient survival. Promoter-capture Hi-C confirmed that highly ranked associations involve physical interactions between CREs and their gene targets, and CRISPRi based scRNA Perturb-seq validated the functional impact of CREs. Thus, MethNet-identified CREs represent a valuable resource for unraveling complex mechanisms underlying gene expression, and for prioritizing the verification of predicted non-coding disease hotspots.
Collapse
Affiliation(s)
- Theodore Sakellaropoulos
- Department of Pathology, NYU Grossman School of Medicine, New York, NY, USA
- Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Catherine Do
- Department of Pathology, NYU Grossman School of Medicine, New York, NY, USA
- Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Guimei Jiang
- Department of Pathology, NYU Grossman School of Medicine, New York, NY, USA
- Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Giulia Cova
- Department of Pathology, NYU Grossman School of Medicine, New York, NY, USA
- Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| | - Peter Meyn
- Genome Technology Center, NYU Grossman School of Medicine, New York, NY, USA
| | - Dacia Dimartino
- Genome Technology Center, NYU Grossman School of Medicine, New York, NY, USA
| | - Sitharam Ramaswami
- Genome Technology Center, NYU Grossman School of Medicine, New York, NY, USA
| | - Adriana Heguy
- Genome Technology Center, NYU Grossman School of Medicine, New York, NY, USA
| | - Aristotelis Tsirigos
- Department of Pathology, NYU Grossman School of Medicine, New York, NY, USA
- Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
- Applied Bioinformatics Laboratories, Office of Science & Research, NYU Grossman School of Medicine, New York, NY, USA
| | - Jane A Skok
- Department of Pathology, NYU Grossman School of Medicine, New York, NY, USA
- Perlmutter Cancer Center, NYU Langone Health, New York, NY, USA
| |
Collapse
|
22
|
Chandra O, Sharma M, Pandey N, Jha IP, Mishra S, Kong SL, Kumar V. Patterns of transcription factor binding and epigenome at promoters allow interpretable predictability of multiple functions of non-coding and coding genes. Comput Struct Biotechnol J 2023; 21:3590-3603. [PMID: 37520281 PMCID: PMC10371796 DOI: 10.1016/j.csbj.2023.07.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 07/05/2023] [Accepted: 07/11/2023] [Indexed: 08/01/2023] Open
Abstract
Understanding the biological roles of all genes only through experimental methods is challenging. A computational approach with reliable interpretability is needed to infer the function of genes, particularly for non-coding RNAs. We have analyzed genomic features that are present across both coding and non-coding genes like transcription factor (TF) and cofactor ChIP-seq (823), histone modifications ChIP-seq (n = 621), cap analysis gene expression (CAGE) tags (n = 255), and DNase hypersensitivity profiles (n = 255) to predict ontology-based functions of genes. Our approach for gene function prediction was reliable (>90% balanced accuracy) for 486 gene-sets. PubMed abstract mining and CRISPR screens supported the inferred association of genes with biological functions, for which our method had high accuracy. Further analysis revealed that TF-binding patterns at promoters have high predictive strength for multiple functions. TF-binding patterns at the promoter add an unexplored dimension of explainable regulatory aspects of genes and their functions. Therefore, we performed a comprehensive analysis for the functional-specificity of TF-binding patterns at promoters and used them for clustering functions to reveal many latent groups of gene-sets involved in common major cellular processes. We also showed how our approach could be used to infer the functions of non-coding genes using the CRISPR screens of coding genes, which were validated using a long non-coding RNA CRISPR screen. Thus our results demonstrated the generality of our approach by using gene-sets from CRISPR screens. Overall, our approach opens an avenue for predicting the involvement of non-coding genes in various functions.
Collapse
Affiliation(s)
- Omkar Chandra
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Ph-III, New Delhi, India
| | - Madhu Sharma
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Ph-III, New Delhi, India
| | - Neetesh Pandey
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Ph-III, New Delhi, India
| | - Indra Prakash Jha
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Ph-III, New Delhi, India
| | - Shreya Mishra
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Ph-III, New Delhi, India
| | - Say Li Kong
- Genome Institute of Singapore, Agency for Science Technology and Research, Singapore, Singapore
| | - Vibhor Kumar
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Ph-III, New Delhi, India
| |
Collapse
|
23
|
Kin K, Bhogale S, Zhu L, Thomas D, Bertol J, Zheng WJ, Sinha S, Fakhouri WD. Sequence-to-expression approach to identify etiological non-coding DNA variations in P53 and cMYC-driven diseases. RESEARCH SQUARE 2023:rs.3.rs-3037310. [PMID: 37503250 PMCID: PMC10371153 DOI: 10.21203/rs.3.rs-3037310/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Background and methods Disease risk prediction based on DNA sequence and transcriptional profile can improve disease screening, prevention, and potential therapeutic approaches by revealing contributing genetic factors and altered regulatory networks. Despite identifying many disease-associated DNA variants through genome-wide association studies, distinguishing deleterious non-coding DNA variations remains poor for most common diseases. We previously reported that non-coding variations disrupting cis-overlapping motifs (CisOMs) of opposing transcription factors significantly affect enhancer activity. We designed in vitro experiments to uncover the significance of the co-occupancy and competitive binding and inhibition between P53 and cMYC on common target gene expression. Results Analyzing publicly available ChIP-seq data for P53 and cMYC in human embryonic stem cells and mouse embryonic cells showed that ~ 344-366 genomic regions are co-occupied by P53 and cMYC. We identified, on average, two CisOMs per region, suggesting that co-occupancy is evolutionarily conserved in vertebrates. Our data showed that treating U2OS cells with doxorubicin increased P53 protein level while reducing cMYC level. In contrast, no change in protein levels was observed in Raji cells. ChIP-seq analysis illustrated that 16-922 genomic regions were co-occupied by P53 and cMYC before and after treatment, and substitutions of cMYC signals by P53 were detected after doxorubicin treatment in U2OS. Around 187 expressed genes near co-occupied regions were altered at mRNA level according to RNA-seq data. We utilized a computational motif-matching approach to determine that changes in predicted P53 binding affinity by DNA variations in CisOMs of co-occupied elements significantly correlate with alterations in reporter gene expression. We performed a similar analysis using SNPs mapped in CisOMs for P53 and cMYC from ChIP-seq data in U2OS and Raji, and expression of target genes from the GTEx portal. Conclusions We found a significant correlation between change in motif-predicted cMYC binding affinity by SNPs in CisOMs and altered gene expression. Our study brings us closer to developing a generally applicable approach to filter etiological non-coding variations associated with P53 and cMYC-dependent diseases.
Collapse
Affiliation(s)
- Katherine Kin
- Department of Diagnostic and Biomedical Sciences, Center for Craniofacial Research, School of Dentistry, University of Texas Health Science Center at Houston
| | | | - Lisha Zhu
- School of Biomedical Informatics, University of Texas Health Science Center at Houston
| | - Derrick Thomas
- Department of Diagnostic and Biomedical Sciences, Center for Craniofacial Research, School of Dentistry, University of Texas Health Science Center at Houston
| | - Jessica Bertol
- Department of Diagnostic and Biomedical Sciences, Center for Craniofacial Research, School of Dentistry, University of Texas Health Science Center at Houston
| | - W Jim Zheng
- School of Biomedical Informatics, University of Texas Health Science Center at Houston
| | - Saurabh Sinha
- The Wallace H. Coulter Department of Biomedical Engineering
| | - Walid D Fakhouri
- Department of Diagnostic and Biomedical Sciences, Center for Craniofacial Research, School of Dentistry, University of Texas Health Science Center at Houston
| |
Collapse
|
24
|
Esposito R, Lanzós A, Uroda T, Ramnarayanan S, Büchi I, Polidori T, Guillen-Ramirez H, Mihaljevic A, Merlin BM, Mela L, Zoni E, Hovhannisyan L, McCluggage F, Medo M, Basile G, Meise DF, Zwyssig S, Wenger C, Schwarz K, Vancura A, Bosch-Guiteras N, Andrades Á, Tham AM, Roemmele M, Medina PP, Ochsenbein AF, Riether C, Kruithof-de Julio M, Zimmer Y, Medová M, Stroka D, Fox A, Johnson R. Tumour mutations in long noncoding RNAs enhance cell fitness. Nat Commun 2023; 14:3342. [PMID: 37291246 PMCID: PMC10250536 DOI: 10.1038/s41467-023-39160-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 06/01/2023] [Indexed: 06/10/2023] Open
Abstract
Long noncoding RNAs (lncRNAs) are linked to cancer via pathogenic changes in their expression levels. Yet, it remains unclear whether lncRNAs can also impact tumour cell fitness via function-altering somatic "driver" mutations. To search for such driver-lncRNAs, we here perform a genome-wide analysis of fitness-altering single nucleotide variants (SNVs) across a cohort of 2583 primary and 3527 metastatic tumours. The resulting 54 mutated and positively-selected lncRNAs are significantly enriched for previously-reported cancer genes and a range of clinical and genomic features. A number of these lncRNAs promote tumour cell proliferation when overexpressed in in vitro models. Our results also highlight a dense SNV hotspot in the widely-studied NEAT1 oncogene. To directly evaluate the functional significance of NEAT1 SNVs, we use in cellulo mutagenesis to introduce tumour-like mutations in the gene and observe a significant and reproducible increase in cell fitness, both in vitro and in a mouse model. Mechanistic studies reveal that SNVs remodel the NEAT1 ribonucleoprotein and boost subnuclear paraspeckles. In summary, this work demonstrates the utility of driver analysis for mapping cancer-promoting lncRNAs, and provides experimental evidence that somatic mutations can act through lncRNAs to enhance pathological cancer cell fitness.
Collapse
Affiliation(s)
- Roberta Esposito
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland.
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland.
- Institute of Genetics and Biophysics "Adriano Buzzati-Traverso", CNR, 80131, Naples, Italy.
| | - Andrés Lanzós
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
- Graduate School of Cellular and Biomedical Sciences, University of Bern, 3012, Bern, Switzerland
| | - Tina Uroda
- School of Biology and Environmental Science, University College Dublin, Dublin, D04 V1W8, Ireland
- Conway Institute for Biomolecular and Biomedical Research, University College Dublin, Dublin, D04 V1W8, Ireland
| | - Sunandini Ramnarayanan
- School of Biology and Environmental Science, University College Dublin, Dublin, D04 V1W8, Ireland
- Conway Institute for Biomolecular and Biomedical Research, University College Dublin, Dublin, D04 V1W8, Ireland
- The SFI Centre for Research Training in Genomics Data Science, Dublin, Ireland
| | - Isabel Büchi
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
- Department of Visceral Surgery and Medicine, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
| | - Taisia Polidori
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
| | - Hugo Guillen-Ramirez
- School of Biology and Environmental Science, University College Dublin, Dublin, D04 V1W8, Ireland
- Conway Institute for Biomolecular and Biomedical Research, University College Dublin, Dublin, D04 V1W8, Ireland
| | - Ante Mihaljevic
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
| | - Bernard Mefi Merlin
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
| | - Lia Mela
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
| | - Eugenio Zoni
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
- Department of Urology, Inselspital, Bern University Hospital, Bern, Switzerland
| | - Lusine Hovhannisyan
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
- Department of Radiation Oncology, Inselspital, Bern University Hospital and University of Bern, Bern, Switzerland
| | - Finn McCluggage
- School of Molecular Sciences, University of Western Australia, Crawley, WA, Australia
- School of Human Sciences, University of Western Australia, Crawley, WA, Australia
| | - Matúš Medo
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
- Department of Radiation Oncology, Inselspital, Bern University Hospital and University of Bern, Bern, Switzerland
| | - Giulia Basile
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
| | - Dominik F Meise
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
| | - Sandra Zwyssig
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
| | - Corina Wenger
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
| | - Kyriakos Schwarz
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
| | - Adrienne Vancura
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
| | - Núria Bosch-Guiteras
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
- Graduate School of Cellular and Biomedical Sciences, University of Bern, 3012, Bern, Switzerland
| | - Álvaro Andrades
- GENYO, Centre for Genomics and Oncological Research, Pfizer/University of Granada/Andalusian Regional Government, Granada, 18016, Spain
- Instituto de Investigación Biosanitaria, Granada, 18014, Spain
- Department of Biochemistry and Molecular Biology I, University of Granada, Granada, 18071, Spain
| | - Ai Ming Tham
- School of Biology and Environmental Science, University College Dublin, Dublin, D04 V1W8, Ireland
- Conway Institute for Biomolecular and Biomedical Research, University College Dublin, Dublin, D04 V1W8, Ireland
| | - Michaela Roemmele
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
| | - Pedro P Medina
- GENYO, Centre for Genomics and Oncological Research, Pfizer/University of Granada/Andalusian Regional Government, Granada, 18016, Spain
- Instituto de Investigación Biosanitaria, Granada, 18014, Spain
- Department of Biochemistry and Molecular Biology I, University of Granada, Granada, 18071, Spain
| | - Adrian F Ochsenbein
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
| | - Carsten Riether
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
| | - Marianna Kruithof-de Julio
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
- Department of Urology, Inselspital, Bern University Hospital, Bern, Switzerland
| | - Yitzhak Zimmer
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
- Department of Radiation Oncology, Inselspital, Bern University Hospital and University of Bern, Bern, Switzerland
| | - Michaela Medová
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
- Department of Radiation Oncology, Inselspital, Bern University Hospital and University of Bern, Bern, Switzerland
| | - Deborah Stroka
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland
- Department of Visceral Surgery and Medicine, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
| | - Archa Fox
- School of Molecular Sciences, University of Western Australia, Crawley, WA, Australia
- School of Human Sciences, University of Western Australia, Crawley, WA, Australia
| | - Rory Johnson
- Department of Medical Oncology, Inselspital, Bern University Hospital, University of Bern, 3010, Bern, Switzerland.
- Department for BioMedical Research, University of Bern, 3008, Bern, Switzerland.
- School of Biology and Environmental Science, University College Dublin, Dublin, D04 V1W8, Ireland.
- Conway Institute for Biomolecular and Biomedical Research, University College Dublin, Dublin, D04 V1W8, Ireland.
| |
Collapse
|
25
|
Woo BJ, Moussavi-Baygi R, Karner H, Karimzadeh M, Garcia K, Joshi T, Yin K, Navickas A, Gilbert LA, Wang B, Asgharian H, Feng FY, Goodarzi H. Integrative identification of non-coding regulatory regions driving metastatic prostate cancer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.14.535921. [PMID: 37398273 PMCID: PMC10312451 DOI: 10.1101/2023.04.14.535921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Large-scale sequencing efforts of thousands of tumor samples have been undertaken to understand the mutational landscape of the coding genome. However, the vast majority of germline and somatic variants occur within non-coding portions of the genome. These genomic regions do not directly encode for specific proteins, but can play key roles in cancer progression, for example by driving aberrant gene expression control. Here, we designed an integrative computational and experimental framework to identify recurrently mutated non-coding regulatory regions that drive tumor progression. Application of this approach to whole-genome sequencing (WGS) data from a large cohort of metastatic castration-resistant prostate cancer (mCRPC) revealed a large set of recurrently mutated regions. We used (i) in silico prioritization of functional non-coding mutations, (ii) massively parallel reporter assays, and (iii) in vivo CRISPR-interference (CRISPRi) screens in xenografted mice to systematically identify and validate driver regulatory regions that drive mCRPC. We discovered that one of these enhancer regions, GH22I030351, acts on a bidirectional promoter to simultaneously modulate expression of U2-associated splicing factor SF3A1 and chromosomal protein CCDC157. We found that both SF3A1 and CCDC157 are promoters of tumor growth in xenograft models of prostate cancer. We nominated a number of transcription factors, including SOX6, to be responsible for higher expression of SF3A1 and CCDC157. Collectively, we have established and confirmed an integrative computational and experimental approach that enables the systematic detection of non-coding regulatory regions that drive the progression of human cancers.
Collapse
Affiliation(s)
- Brian J Woo
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA
- Department of Urology, University of California, San Francisco, San Francisco, California, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
| | - Ruhollah Moussavi-Baygi
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA
- Department of Urology, University of California, San Francisco, San Francisco, California, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
| | - Heather Karner
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA
- Department of Urology, University of California, San Francisco, San Francisco, California, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
| | - Mehran Karimzadeh
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA
- Department of Urology, University of California, San Francisco, San Francisco, California, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
- Vector Institute, Toronto, ON, Canada
- Peter Munk Cardiac Centre, University Health Network, Toronto, ON, Canada
- Arc Institute, Palo Alto 94305, USA
| | - Kristle Garcia
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA
- Department of Urology, University of California, San Francisco, San Francisco, California, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
| | - Tanvi Joshi
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA
- Department of Urology, University of California, San Francisco, San Francisco, California, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
| | - Keyi Yin
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA
- Department of Urology, University of California, San Francisco, San Francisco, California, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
| | - Albertas Navickas
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA
- Department of Urology, University of California, San Francisco, San Francisco, California, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
| | - Luke A. Gilbert
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA
- Department of Urology, University of California, San Francisco, San Francisco, California, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
- Arc Institute, Palo Alto 94305, USA
| | - Bo Wang
- Vector Institute, Toronto, ON, Canada
- Peter Munk Cardiac Centre, University Health Network, Toronto, ON, Canada
| | - Hosseinali Asgharian
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA
- Department of Urology, University of California, San Francisco, San Francisco, California, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, US
| | - Felix Y. Feng
- Department of Urology, University of California, San Francisco, San Francisco, California, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
- Department of Radiation Oncology, University of California, San Francisco, San Francisco, California, USA
| | - Hani Goodarzi
- Department of Biochemistry & Biophysics, University of California, San Francisco, San Francisco, California, USA
- Department of Urology, University of California, San Francisco, San Francisco, California, USA
- Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, California, USA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, CA, US
| |
Collapse
|
26
|
Kumar S, Gerstein M. Unified views on variant impact across many diseases. Trends Genet 2023; 39:442-450. [PMID: 36858880 PMCID: PMC10192142 DOI: 10.1016/j.tig.2023.02.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 02/02/2023] [Accepted: 02/02/2023] [Indexed: 03/03/2023]
Abstract
Genomic studies of human disorders are often performed by distinct research communities (i.e., focused on rare diseases, common diseases, or cancer). Despite underlying differences in the mechanistic origin of different disease categories, these studies share the goal of identifying causal genomic events that are critical for the clinical manifestation of the disease phenotype. Moreover, these studies face common challenges, including understanding the complex genetic architecture of the disease, deciphering the impact of variants on multiple scales, and interpreting noncoding mutations. Here, we highlight these challenges in depth and argue that properly addressing them will require a more unified vocabulary and approach across disease communities. Toward this goal, we present a unified perspective on relating variant impact to various genomic disorders.
Collapse
Affiliation(s)
- Sushant Kumar
- Department of Medical Biophysics, University of Toronto, Toronto, Ontario, Canada; Princess Margaret Cancer Centre, University Health Network, Toronto, Ontario, Canada.
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA; Department of Computer Science, Yale University, New Haven, CT 06520, USA; Department of Statistics & Data Science, Yale University, New Haven, CT 06520, USA.
| |
Collapse
|
27
|
Lu H, Ma L, Quan C, Li L, Lu Y, Zhou G, Zhang C. RegVar: Tissue-specific Prioritization of Non-coding Regulatory Variants. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023; 21:385-395. [PMID: 34973416 PMCID: PMC10626172 DOI: 10.1016/j.gpb.2021.08.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Revised: 06/11/2021] [Accepted: 09/27/2021] [Indexed: 06/14/2023]
Abstract
Non-coding genomic variants constitute the majority of trait-associated genome variations; however, the identification of functional non-coding variants is still a challenge in human genetics, and a method for systematically assessing the impact of regulatory variants on gene expression and linking these regulatory variants to potential target genes is still lacking. Here, we introduce a deep neural network (DNN)-based computational framework, RegVar, which can accurately predict the tissue-specific impact of non-coding regulatory variants on target genes. We show that by robustly learning the genomic characteristics of massive variant-gene expression associations in a variety of human tissues, RegVar vastly surpasses all current non-coding variant prioritization methods in predicting regulatory variants under different circumstances. The unique features of RegVar make it an excellent framework for assessing the regulatory impact of any variant on its putative target genes in a variety of tissues. RegVar is available as a web server at https://regvar.omic.tech/.
Collapse
Affiliation(s)
- Hao Lu
- Beijing Institute of Radiation Medicine, State Key Laboratory of Proteomics, Beijing 100850, China
| | - Luyu Ma
- Beijing Institute of Radiation Medicine, State Key Laboratory of Proteomics, Beijing 100850, China
| | - Cheng Quan
- Beijing Institute of Radiation Medicine, State Key Laboratory of Proteomics, Beijing 100850, China
| | - Lei Li
- Beijing Institute of Radiation Medicine, State Key Laboratory of Proteomics, Beijing 100850, China
| | - Yiming Lu
- Beijing Institute of Radiation Medicine, State Key Laboratory of Proteomics, Beijing 100850, China.
| | - Gangqiao Zhou
- Beijing Institute of Radiation Medicine, State Key Laboratory of Proteomics, Beijing 100850, China.
| | - Chenggang Zhang
- Beijing Institute of Radiation Medicine, State Key Laboratory of Proteomics, Beijing 100850, China.
| |
Collapse
|
28
|
Association between Preoperative 18-FDG PET-CT SUVmax and Next-Generation Sequencing Results in Postoperative Ovarian Malignant Tissue in Patients with Advanced Ovarian Cancer. J Clin Med 2023; 12:jcm12062287. [PMID: 36983295 PMCID: PMC10057491 DOI: 10.3390/jcm12062287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2023] [Revised: 03/07/2023] [Accepted: 03/07/2023] [Indexed: 03/18/2023] Open
Abstract
This study investigated the association between maximum standardized uptake values (SUVmax) on preoperative 18-FDG PET-CT and next-generation sequencing (NGS) results in post-surgical ovarian malignant tissue in patients with advanced ovarian cancer. Twenty-five patients with stage IIIC or IV ovarian cancer who underwent both preoperative 18-FDG PET-CT and postoperative NGS for ovarian malignancies were retrospectively enrolled. Two patients had no detected variants, 21 of the 23 patients with any somatic variant had at least one single nucleotide variant (SNV) or insertion/deletion (indel), 10 patients showed copy number variation (CNV), and two patients had a fusion variant. SUVmax differed according to the presence of SNVs/indels, with an SUVmax of 13.06 for patients with ≥ 1 SNV/indel and 6.28 for patients without (p = 0.003). Seventeen of 20 patients with Tier 2 variants had TP53 variants, and there was a statistically significant association between SUVmax and the presence of TP53 variants (13.21 vs. 9.35, p = 0.041). Analysis of the correlation between the sum of the Tier 1 and Tier 2 numbers and SUVmax showed a statistically significant correlation (p = 0.002; Pearson’s r = 0.588). In conclusion, patients with advanced ovarian cancer with SNVs/indels on NGS, especially those with TP53 Tier 2 variants, showed a proportional association with tumor SUVmax on preoperative PET-CT.
Collapse
|
29
|
Fischer A, Lersch R, de Andrade Krätzig N, Strong A, Friedrich MJ, Weber J, Engleitner T, Öllinger R, Yen HY, Kohlhofer U, Gonzalez-Menendez I, Sailer D, Kogan L, Lahnalampi M, Laukkanen S, Kaltenbacher T, Klement C, Rezaei M, Ammon T, Montero JJ, Schneider G, Mayerle J, Heikenwälder M, Schmidt-Supprian M, Quintanilla-Martinez L, Steiger K, Liu P, Cadiñanos J, Vassiliou GS, Saur D, Lohi O, Heinäniemi M, Conte N, Bradley A, Rad L, Rad R. In vivo interrogation of regulatory genomes reveals extensive quasi-insufficiency in cancer evolution. CELL GENOMICS 2023; 3:100276. [PMID: 36950387 PMCID: PMC10025556 DOI: 10.1016/j.xgen.2023.100276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 09/05/2022] [Accepted: 02/08/2023] [Indexed: 03/10/2023]
Abstract
In contrast to mono- or biallelic loss of tumor-suppressor function, effects of discrete gene dysregulations, as caused by non-coding (epi)genome alterations, are poorly understood. Here, by perturbing the regulatory genome in mice, we uncover pervasive roles of subtle gene expression variation in cancer evolution. Genome-wide screens characterizing 1,450 tumors revealed that such quasi-insufficiency is extensive across entities and displays diverse context dependencies, such as distinct cell-of-origin associations in T-ALL subtypes. We compile catalogs of non-coding regions linked to quasi-insufficiency, show their enrichment with human cancer risk variants, and provide functional insights by engineering regulatory alterations in mice. As such, kilo-/megabase deletions in a Bcl11b-linked non-coding region triggered aggressive malignancies, with allele-specific tumor spectra reflecting gradual gene dysregulations through modular and cell-type-specific enhancer activities. Our study constitutes a first survey toward a systems-level understanding of quasi-insufficiency in cancer and gives multifaceted insights into tumor evolution and the tissue-specific effects of non-coding mutations.
Collapse
Affiliation(s)
- Anja Fischer
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
- German Cancer Consortium (DKTK), Heidelberg, Germany
| | - Robert Lersch
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Niklas de Andrade Krätzig
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Alexander Strong
- The Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, UK
| | - Mathias J. Friedrich
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
- Department of Medicine II, Klinikum rechts der Isar, School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Julia Weber
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Thomas Engleitner
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Rupert Öllinger
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Hsi-Yu Yen
- German Cancer Consortium (DKTK), Heidelberg, Germany
- Comparative Experimental Pathology, School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Ursula Kohlhofer
- Institute of Pathology and Comprehensive Cancer Center, Eberhard Karls Universität Tübingen, 72076 Tübingen, Germany
| | - Irene Gonzalez-Menendez
- Institute of Pathology and Comprehensive Cancer Center, Eberhard Karls Universität Tübingen, 72076 Tübingen, Germany
| | - David Sailer
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Liz Kogan
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Mari Lahnalampi
- Institute of Biomedicine, School of Medicine, University of Eastern Finland, Kuopio, Finland
| | - Saara Laukkanen
- Faculty of Medicine and Health Technology, Tampere Center for Child, Adolescent and Maternal Health Research and Tays Cancer Center, Tampere University, Tampere, Finland
| | - Thorsten Kaltenbacher
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Christine Klement
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Majdaddin Rezaei
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Tim Ammon
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
- Institute of Experimental Hematology, TUM School of Medicine, Technical University of Munich, 81675 Munich, Germany
| | - Juan J. Montero
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Günter Schneider
- Department of Medicine II, Klinikum rechts der Isar, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Department of General, Visceral and Pediatric Surgery, University Medical Center Göttingen, 37075 Göttingen, Germany
| | - Julia Mayerle
- Medical Department II, University Hospital, LMU Munich, Munich, Germany
| | - Mathias Heikenwälder
- German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
- Division of Chronic Inflammation and Cancer, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany
| | - Marc Schmidt-Supprian
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
- German Cancer Consortium (DKTK), Heidelberg, Germany
- Institute of Experimental Hematology, TUM School of Medicine, Technical University of Munich, 81675 Munich, Germany
| | - Leticia Quintanilla-Martinez
- Institute of Pathology and Comprehensive Cancer Center, Eberhard Karls Universität Tübingen, 72076 Tübingen, Germany
| | - Katja Steiger
- German Cancer Consortium (DKTK), Heidelberg, Germany
- Comparative Experimental Pathology, School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Pentao Liu
- The Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, UK
- Li Ka Shing Faculty of Medicine, Stem Cell and Regenerative Medicine Consortium, School of Biomedical Sciences, University of Hong Kong, Hong Kong, China
| | - Juan Cadiñanos
- Instituto de Medicina Oncológica y Molecular de Asturias (IMOMA), 33193 Oviedo, Spain
| | - George S. Vassiliou
- The Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, UK
- Wellcome Trust-MRC Stem Cell Institute, Cambridge Biomedical Campus, University of Cambridge, Cambridge CB2 0XY, UK
- Department of Haematology, Cambridge University Hospitals NHS Trust, Cambridge CB2 0PT, UK
| | - Dieter Saur
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
- Department of Medicine II, Klinikum rechts der Isar, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Institute for Experimental Cancer Therapy, School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Olli Lohi
- Faculty of Medicine and Health Technology, Tampere Center for Child, Adolescent and Maternal Health Research and Tays Cancer Center, Tampere University, Tampere, Finland
| | - Merja Heinäniemi
- Institute of Biomedicine, School of Medicine, University of Eastern Finland, Kuopio, Finland
| | - Nathalie Conte
- The Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, UK
| | - Allan Bradley
- The Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, UK
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), University of Cambridge, Puddicombe Way, Cambridge CB2 0AW, UK
| | - Lena Rad
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
- Institute for Experimental Cancer Therapy, School of Medicine, Technische Universität München, 81675 Munich, Germany
| | - Roland Rad
- Institute of Molecular Oncology and Functional Genomics, School of Medicine, Technische Universität München, 81675 Munich, Germany
- Center for Translational Cancer Research (TranslaTUM), School of Medicine, Technische Universität München, 81675 Munich, Germany
- German Cancer Consortium (DKTK), Heidelberg, Germany
- Department of Medicine II, Klinikum rechts der Isar, School of Medicine, Technische Universität München, 81675 Munich, Germany
| |
Collapse
|
30
|
Tan X, Xu L, Jian X, Ouyang J, Hu B, Yang X, Wang T, Xie L. PGNneo: A Proteogenomics-Based Neoantigen Prediction Pipeline in Noncoding Regions. Cells 2023; 12:cells12050782. [PMID: 36899918 PMCID: PMC10000440 DOI: 10.3390/cells12050782] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Revised: 02/26/2023] [Accepted: 02/27/2023] [Indexed: 03/06/2023] Open
Abstract
The development of a neoantigen-based personalized vaccine has promise in the hunt for cancer immunotherapy. The challenge in neoantigen vaccine design is the need to rapidly and accurately identify, in patients, those neoantigens with vaccine potential. Evidence shows that neoantigens can be derived from noncoding sequences, but there are few specific tools for identifying neoantigens in noncoding regions. In this work, we describe a proteogenomics-based pipeline, namely PGNneo, for use in discovering neoantigens derived from the noncoding region of the human genome with reliability. In PGNneo, four modules are included: (1) noncoding somatic variant calling and HLA typing; (2) peptide extraction and customized database construction; (3) variant peptide identification; (4) neoantigen prediction and selection. We have demonstrated the effectiveness of PGNneo and applied and validated our methodology in two real-world hepatocellular carcinoma (HCC) cohorts. TP53, WWP1, ATM, KMT2C, and NFE2L2, which are frequently mutating genes associated with HCC, were identified in two cohorts and corresponded to 107 neoantigens from non-coding regions. In addition, we applied PGNneo to a colorectal cancer (CRC) cohort, demonstrating that the tool can be extended and verified in other tumor types. In summary, PGNneo can specifically detect neoantigens generated by noncoding regions in tumors, providing additional immune targets for cancer types with a low tumor mutational burden (TMB) in coding regions. PGNneo, together with our previous tool, can identify coding and noncoding region-derived neoantigens and, thus, will contribute to a complete understanding of the tumor immune target landscape. PGNneo source code and documentation are available at Github. To facilitate the installation and use of PGNneo, we provide a Docker container and a GUI.
Collapse
Affiliation(s)
- Xiaoxiu Tan
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
- Shanghai-MOST Key Laboratory of Health and Disease Genomics & Institute of Genome and Bioinformatics, Shanghai Institute for Biomedical and Pharmaceutical Technologies, Shanghai 200237, China
| | - Linfeng Xu
- Shanghai-MOST Key Laboratory of Health and Disease Genomics & Institute of Genome and Bioinformatics, Shanghai Institute for Biomedical and Pharmaceutical Technologies, Shanghai 200237, China
| | - Xingxing Jian
- Shanghai-MOST Key Laboratory of Health and Disease Genomics & Institute of Genome and Bioinformatics, Shanghai Institute for Biomedical and Pharmaceutical Technologies, Shanghai 200237, China
| | - Jian Ouyang
- Shanghai-MOST Key Laboratory of Health and Disease Genomics & Institute of Genome and Bioinformatics, Shanghai Institute for Biomedical and Pharmaceutical Technologies, Shanghai 200237, China
| | - Bo Hu
- Liver Cancer Institute, Fudan University, Shanghai 200032, China
| | - Xinrong Yang
- Liver Cancer Institute, Fudan University, Shanghai 200032, China
| | - Tao Wang
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
- Correspondence: (T.W.); (L.X.)
| | - Lu Xie
- Shanghai-MOST Key Laboratory of Health and Disease Genomics & Institute of Genome and Bioinformatics, Shanghai Institute for Biomedical and Pharmaceutical Technologies, Shanghai 200237, China
- Correspondence: (T.W.); (L.X.)
| |
Collapse
|
31
|
Quan L, Chu X, Sun X, Wu T, Lyu Q. How Deepbics Quantifies Intensities of Transcription Factor-DNA Binding and Facilitates Prediction of Single Nucleotide Variant Pathogenicity With a Deep Learning Model Trained On ChIP-Seq Data Sets. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:1594-1599. [PMID: 35471887 DOI: 10.1109/tcbb.2022.3170343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
The binding of DNA sequences to cell type-specific transcription factors is essential for regulating gene expression in all organisms. Many variants occurring in these binding regions play crucial roles in human disease by disrupting the cis-regulation of gene expression. We first implemented a sequence-based deep learning model called deepBICS to quantify the intensity of transcription factors-DNA binding. The experimental results not only showed the superiority of deepBICS on ChIP-seq data sets but also suggested deepBICS as a language model could help the classification of disease-related and neutral variants. We then built a language model-based method called deepBICS4SNV to predict the pathogenicity of single nucleotide variants. The good performance of deepBICS4SNV on 2 tests related to Mendelian disorders and viral diseases shows the sequence contextual information derived from language models can improve prediction accuracy and generalization capability.
Collapse
|
32
|
Peña-Martínez EG, Rivera-Madera A, Pomales-Matos DA, Sanabria-Alberto L, Rosario-Cañuelas BM, Rodríguez-Ríos JM, Carrasquillo-Dones EA, Rodríguez-Martínez JA. Disease-associated non-coding variants alter NKX2-5 DNA-binding affinity. BIOCHIMICA ET BIOPHYSICA ACTA. GENE REGULATORY MECHANISMS 2023; 1866:194906. [PMID: 36690178 PMCID: PMC10013089 DOI: 10.1016/j.bbagrm.2023.194906] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Revised: 12/30/2022] [Accepted: 01/02/2023] [Indexed: 01/22/2023]
Abstract
Genome-wide association studies (GWAS) have mapped over 90 % of disease- or trait-associated variants within the non-coding genome, like cis-regulatory elements (CREs). Non-coding single nucleotide polymorphisms (SNPs) are genomic variants that can change how DNA-binding regulatory proteins, like transcription factors (TFs), interact with the genome and regulate gene expression. NKX2-5 is a TF essential for proper heart development, and mutations affecting its function have been associated with congenital heart diseases (CHDs). However, establishing a causal mechanism between non-coding genomic variants and human disease remains challenging. To address this challenge, we identified 8475 SNPs predicted to alter NKX2-5 DNA-binding using a position weight matrix (PWM)-based predictive model. Five variants were prioritized for in vitro validation; four of them are associated with traits and diseases that impact cardiovascular health. The impact of these variants on NKX2-5 binding was evaluated with electrophoretic mobility shift assay (EMSA) using purified recombinant NKX2-5 homeodomain. Binding curves were constructed to determine changes in binding between variant and reference alleles. Variants rs7350789, rs7719885, rs747334, and rs3892630 increased binding affinity, whereas rs61216514 decreased binding by NKX2-5 when compared to the reference genome. Our findings suggest that differential TF-DNA binding affinity can be key in establishing a causal mechanism of pathogenic variants.
Collapse
|
33
|
Carrasco Pro S, Hook H, Bray D, Berenzy D, Moyer D, Yin M, Labadorf AT, Tewhey R, Siggers T, Fuxman Bass JI. Widespread perturbation of ETS factor binding sites in cancer. Nat Commun 2023; 14:913. [PMID: 36808133 PMCID: PMC9938127 DOI: 10.1038/s41467-023-36535-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Accepted: 02/03/2023] [Indexed: 02/19/2023] Open
Abstract
Although >90% of somatic mutations reside in non-coding regions, few have been reported as cancer drivers. To predict driver non-coding variants (NCVs), we present a transcription factor (TF)-aware burden test based on a model of coherent TF function in promoters. We apply this test to NCVs from the Pan-Cancer Analysis of Whole Genomes cohort and predict 2555 driver NCVs in the promoters of 813 genes across 20 cancer types. These genes are enriched in cancer-related gene ontologies, essential genes, and genes associated with cancer prognosis. We find that 765 candidate driver NCVs alter transcriptional activity, 510 lead to differential binding of TF-cofactor regulatory complexes, and that they primarily impact the binding of ETS factors. Finally, we show that different NCVs within a promoter often affect transcriptional activity through shared mechanisms. Our integrated computational and experimental approach shows that cancer NCVs are widespread and that ETS factors are commonly disrupted.
Collapse
Affiliation(s)
| | - Heather Hook
- Department of Biology, Boston University, Boston, MA, USA
| | - David Bray
- Bioinformatics Program, Boston University, Boston, MA, USA
| | | | - Devlin Moyer
- Bioinformatics Program, Boston University, Boston, MA, USA
| | - Meimei Yin
- Department of Biology, Boston University, Boston, MA, USA
| | - Adam Thomas Labadorf
- Bioinformatics Hub, Boston University, Boston, MA, USA
- Boston University School of Medicine, Department of Neurology, Boston, MA, USA
| | | | - Trevor Siggers
- Bioinformatics Program, Boston University, Boston, MA, USA.
- Department of Biology, Boston University, Boston, MA, USA.
- Biological Design Center, Boston University, Boston, MA, USA.
| | - Juan Ignacio Fuxman Bass
- Bioinformatics Program, Boston University, Boston, MA, USA.
- Department of Biology, Boston University, Boston, MA, USA.
| |
Collapse
|
34
|
Lin ZT, Chen GH, Peng X, Zhang ZH, Li T, Lin HX, Liang SS, Zheng YB, Yao ZP, Luo W. A 2-bp deletion in intron 1 of TMEM182 is associated with TMEM182 mRNA expression and chicken body weight. Br Poult Sci 2023; 64:11-18. [PMID: 35759289 DOI: 10.1080/00071668.2022.2094217] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
1. Searching for molecular markers related to growth and carcase traits plays a critical role in improvement of the production performance of broilers. Previous studies found that transmembrane protein 182 (TMEM182) inhibits skeletal muscle development, growth, and regeneration, implying that the TMEM182 gene plays an important role during the development process of skeletal muscle.2. A novel 2-bp indel in intron 1 of TMEM182 was detected in a yellow chicken population derived from the cross of White Recessive Rock chickens with Xinghua chickens, and three genotypes II (inserted homozygote), ID (inserted and deleted heterozygote) and DD (deleted homozygote) were observed. Association analyses indicated that the indel was significantly associated with the body weight, muscle fibre area, breast muscle weight and wing weight in the F2 population.3. The expression of TMEM182 in leg muscle of chickens with II genotype was higher than that with DD genotype, with the 2-bp indel located in one of the putative PAX4 binding sites. Further research through luciferase assays revealed that the PAX4 could bind to the putative binding site and increase the TMEM182 transcription, with the 2-bp deletion disrupting the binding of PAX4.4. The present study provides evidence for the association of the novel 2-bp indel in intron 1 of TMEM182 with the growth and carcase traits of chickens. This 2-bp indel could be used as a genetic marker in broiler breeding.
Collapse
Affiliation(s)
- Z T Lin
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, Guangdong, China
| | - G H Chen
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, Guangdong, China
| | - X Peng
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, Guangdong, China
| | - Z H Zhang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, Guangdong, China
| | - T Li
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, Guangdong, China
| | - H X Lin
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, Guangdong, China
| | - S S Liang
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, Guangdong, China
| | - Y B Zheng
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, Guangdong, China
| | - Z P Yao
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, Guangdong, China
| | - W Luo
- Department of Animal Genetics, Breeding and Reproduction, College of Animal Science, South China Agricultural University, Guangzhou, Guangdong, China
- Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry of Agriculture, Guangzhou, Guangdong, China
| |
Collapse
|
35
|
Neoantigens: promising targets for cancer therapy. Signal Transduct Target Ther 2023; 8:9. [PMID: 36604431 PMCID: PMC9816309 DOI: 10.1038/s41392-022-01270-x] [Citation(s) in RCA: 145] [Impact Index Per Article: 145.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 11/14/2022] [Accepted: 11/27/2022] [Indexed: 01/07/2023] Open
Abstract
Recent advances in neoantigen research have accelerated the development and regulatory approval of tumor immunotherapies, including cancer vaccines, adoptive cell therapy and antibody-based therapies, especially for solid tumors. Neoantigens are newly formed antigens generated by tumor cells as a result of various tumor-specific alterations, such as genomic mutation, dysregulated RNA splicing, disordered post-translational modification, and integrated viral open reading frames. Neoantigens are recognized as non-self and trigger an immune response that is not subject to central and peripheral tolerance. The quick identification and prediction of tumor-specific neoantigens have been made possible by the advanced development of next-generation sequencing and bioinformatic technologies. Compared to tumor-associated antigens, the highly immunogenic and tumor-specific neoantigens provide emerging targets for personalized cancer immunotherapies, and serve as prospective predictors for tumor survival prognosis and immune checkpoint blockade responses. The development of cancer therapies will be aided by understanding the mechanism underlying neoantigen-induced anti-tumor immune response and by streamlining the process of neoantigen-based immunotherapies. This review provides an overview on the identification and characterization of neoantigens and outlines the clinical applications of prospective immunotherapeutic strategies based on neoantigens. We also explore their current status, inherent challenges, and clinical translation potential.
Collapse
|
36
|
Zhou X, Zheng H, Fu H, Dillehay McKillip KL, Pinney SM, Liu Y. CRAG: de novo characterization of cell-free DNA fragmentation hotspots in plasma whole-genome sequencing. Genome Med 2022; 14:138. [PMID: 36482487 PMCID: PMC9733064 DOI: 10.1186/s13073-022-01141-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 11/14/2022] [Indexed: 12/13/2022] Open
Abstract
The fine-scale cell-free DNA fragmentation patterns in early-stage cancers are poorly understood. We developed a de novo approach to characterize the cell-free DNA fragmentation hotspots from plasma whole-genome sequencing. Hotspots are enriched in open chromatin regions, and, interestingly, 3'end of transposons. Hotspots showed global hypo-fragmentation in early-stage liver cancers and are associated with genes involved in the initiation of hepatocellular carcinoma and associated with cancer stem cells. The hotspots varied across multiple early-stage cancers and demonstrated high performance for the diagnosis and identification of tissue-of-origin in early-stage cancers. We further validated the performance with a small number of independent case-control-matched early-stage cancer samples.
Collapse
Affiliation(s)
- Xionghui Zhou
- grid.239573.90000 0000 9025 8099Division of Human Genetics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229 USA ,grid.35155.370000 0004 1790 4137Present address: Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070 China
| | - Haizi Zheng
- grid.239573.90000 0000 9025 8099Division of Human Genetics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229 USA
| | - Hailu Fu
- grid.239573.90000 0000 9025 8099Division of Human Genetics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229 USA
| | - Kelsey L. Dillehay McKillip
- grid.24827.3b0000 0001 2179 9593University of Cincinnati Cancer Center, Cincinnati, OH 45229 USA ,grid.24827.3b0000 0001 2179 9593Department of Pathology & Laboratory Medicine, University of Cincinnati College of Medicine, Cincinnati, OH 45229 USA
| | - Susan M. Pinney
- grid.24827.3b0000 0001 2179 9593University of Cincinnati Cancer Center, Cincinnati, OH 45229 USA ,grid.24827.3b0000 0001 2179 9593Department of Environmental and Public Health Sciences, University of Cincinnati College of Medicine, Cincinnati, OH 45229 USA
| | - Yaping Liu
- grid.239573.90000 0000 9025 8099Division of Human Genetics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229 USA ,grid.24827.3b0000 0001 2179 9593University of Cincinnati Cancer Center, Cincinnati, OH 45229 USA ,grid.239573.90000 0000 9025 8099Division of Biomedical Informatics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229 USA ,grid.24827.3b0000 0001 2179 9593Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45229 USA ,grid.24827.3b0000 0001 2179 9593Department of Electrical Engineering and Computing Sciences, University of Cincinnati College of Engineering and Applied Science, Cincinnati, OH 45229 USA
| |
Collapse
|
37
|
Castro-Mondragon JA, Aure M, Lingjærde O, Langerød A, Martens JWM, Børresen-Dale AL, Kristensen V, Mathelier A. Cis-regulatory mutations associate with transcriptional and post-transcriptional deregulation of gene regulatory programs in cancers. Nucleic Acids Res 2022; 50:12131-12148. [PMID: 36477895 PMCID: PMC9757053 DOI: 10.1093/nar/gkac1143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Revised: 11/03/2022] [Accepted: 11/17/2022] [Indexed: 12/13/2022] Open
Abstract
Most cancer alterations occur in the noncoding portion of the human genome, where regulatory regions control gene expression. The discovery of noncoding mutations altering the cells' regulatory programs has been limited to few examples with high recurrence or high functional impact. Here, we show that transcription factor binding sites (TFBSs) have similar mutation loads to those in protein-coding exons. By combining cancer somatic mutations in TFBSs and expression data for protein-coding and miRNA genes, we evaluate the combined effects of transcriptional and post-transcriptional alterations on the regulatory programs in cancers. The analysis of seven TCGA cohorts culminates with the identification of protein-coding and miRNA genes linked to mutations at TFBSs that are associated with a cascading trans-effect deregulation on the cells' regulatory programs. Our analyses of cis-regulatory mutations associated with miRNAs recurrently predict 12 mature miRNAs (derived from 7 precursors) associated with the deregulation of their target gene networks. The predictions are enriched for cancer-associated protein-coding and miRNA genes and highlight cis-regulatory mutations associated with the dysregulation of key pathways associated with carcinogenesis. By combining transcriptional and post-transcriptional regulation of gene expression, our method predicts cis-regulatory mutations related to the dysregulation of key gene regulatory networks in cancer patients.
Collapse
Affiliation(s)
- Jaime A Castro-Mondragon
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, 0318 Oslo, Norway
| | - Miriam Ragle Aure
- Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital Radiumhospitalet, 0310 Oslo, Norway,Department of Medical Genetics, Institute of Clinical Medicine, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Ole Christian Lingjærde
- Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital Radiumhospitalet, 0310 Oslo, Norway,Centre for Bioinformatics, Department of Informatics, University of Oslo, Gaustadalléen 23 B, N-0373 Oslo, Norway,KG Jebsen Centre for B-cell malignancies, Institute for Clinical Medicine, University of Oslo, Ullernchausseen 70, N-0372 Oslo, Norway
| | - Anita Langerød
- Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital Radiumhospitalet, 0310 Oslo, Norway
| | - John W M Martens
- Erasmus MC Cancer Institute and Cancer Genomics Netherlands, University Medical Center Rotterdam, Department of Medical Oncology, 3015GD Rotterdam, The Netherlands
| | - Anne-Lise Børresen-Dale
- Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital Radiumhospitalet, 0310 Oslo, Norway
| | - Vessela N Kristensen
- Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital Radiumhospitalet, 0310 Oslo, Norway,Department of Medical Genetics, Institute of Clinical Medicine, University of Oslo and Oslo University Hospital, Oslo, Norway
| | | |
Collapse
|
38
|
Xu Z, Lee DS, Chandran S, Le VT, Bump R, Yasis J, Dallarda S, Marcotte S, Clock B, Haghani N, Cho CY, Akdemir K, Tyndale S, Futreal PA, McVicker G, Wahl GM, Dixon JR. Structural variants drive context-dependent oncogene activation in cancer. Nature 2022; 612:564-572. [PMID: 36477537 PMCID: PMC9810360 DOI: 10.1038/s41586-022-05504-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Accepted: 11/01/2022] [Indexed: 12/12/2022]
Abstract
Higher-order chromatin structure is important for the regulation of genes by distal regulatory sequences1,2. Structural variants (SVs) that alter three-dimensional (3D) genome organization can lead to enhancer-promoter rewiring and human disease, particularly in the context of cancer3. However, only a small minority of SVs are associated with altered gene expression4,5, and it remains unclear why certain SVs lead to changes in distal gene expression and others do not. To address these questions, we used a combination of genomic profiling and genome engineering to identify sites of recurrent changes in 3D genome structure in cancer and determine the effects of specific rearrangements on oncogene activation. By analysing Hi-C data from 92 cancer cell lines and patient samples, we identified loci affected by recurrent alterations to 3D genome structure, including oncogenes such as MYC, TERT and CCND1. By using CRISPR-Cas9 genome engineering to generate de novo SVs, we show that oncogene activity can be predicted by using 'activity-by-contact' models that consider partner region chromatin contacts and enhancer activity. However, activity-by-contact models are only predictive of specific subsets of genes in the genome, suggesting that different classes of genes engage in distinct modes of regulation by distal regulatory elements. These results indicate that SVs that alter 3D genome organization are widespread in cancer genomes and begin to illustrate predictive rules for the consequences of SVs on oncogene activation.
Collapse
Affiliation(s)
- Zhichao Xu
- Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA,These authors contributed equally
| | - Dong-Sung Lee
- Department of Life Sciences, University of Seoul, Seoul, South Korea,These authors contributed equally
| | - Sahaana Chandran
- Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
| | - Victoria T. Le
- Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
| | - Rosalind Bump
- Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
| | - Jean Yasis
- Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
| | - Sofia Dallarda
- Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
| | - Samantha Marcotte
- Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
| | - Benjamin Clock
- Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
| | - Nicholas Haghani
- Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
| | - Chae Yun Cho
- Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
| | - Kadir Akdemir
- Department of Genomic Medicine; UT MD Anderson Cancer Center; Houston, TX, 77030; USA
| | - Selene Tyndale
- Integrative Biology Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
| | - P. Andrew Futreal
- Department of Genomic Medicine; UT MD Anderson Cancer Center; Houston, TX, 77030; USA
| | - Graham McVicker
- Integrative Biology Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
| | - Geoffrey M. Wahl
- Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA
| | - Jesse R. Dixon
- Gene Expression Laboratory; Salk Institute for Biological Studies; La Jolla, CA, 92037; USA,Correspondence:
| |
Collapse
|
39
|
Li Z, Li X, Zhou H, Gaynor SM, Selvaraj MS, Arapoglou T, Quick C, Liu Y, Chen H, Sun R, Dey R, Arnett DK, Auer PL, Bielak LF, Bis JC, Blackwell TW, Blangero J, Boerwinkle E, Bowden DW, Brody JA, Cade BE, Conomos MP, Correa A, Cupples LA, Curran JE, de Vries PS, Duggirala R, Franceschini N, Freedman BI, Göring HHH, Guo X, Kalyani RR, Kooperberg C, Kral BG, Lange LA, Lin BM, Manichaikul A, Manning AK, Martin LW, Mathias RA, Meigs JB, Mitchell BD, Montasser ME, Morrison AC, Naseri T, O'Connell JR, Palmer ND, Peyser PA, Psaty BM, Raffield LM, Redline S, Reiner AP, Reupena MS, Rice KM, Rich SS, Smith JA, Taylor KD, Taub MA, Vasan RS, Weeks DE, Wilson JG, Yanek LR, Zhao W, Rotter JI, Willer CJ, Natarajan P, Peloso GM, Lin X. A framework for detecting noncoding rare-variant associations of large-scale whole-genome sequencing studies. Nat Methods 2022; 19:1599-1611. [PMID: 36303018 PMCID: PMC10008172 DOI: 10.1038/s41592-022-01640-x] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2021] [Accepted: 09/06/2022] [Indexed: 02/07/2023]
Abstract
Large-scale whole-genome sequencing studies have enabled analysis of noncoding rare-variant (RV) associations with complex human diseases and traits. Variant-set analysis is a powerful approach to study RV association. However, existing methods have limited ability in analyzing the noncoding genome. We propose a computationally efficient and robust noncoding RV association detection framework, STAARpipeline, to automatically annotate a whole-genome sequencing study and perform flexible noncoding RV association analysis, including gene-centric analysis and fixed window-based and dynamic window-based non-gene-centric analysis by incorporating variant functional annotations. In gene-centric analysis, STAARpipeline uses STAAR to group noncoding variants based on functional categories of genes and incorporate multiple functional annotations. In non-gene-centric analysis, STAARpipeline uses SCANG-STAAR to incorporate dynamic window sizes and multiple functional annotations. We apply STAARpipeline to identify noncoding RV sets associated with four lipid traits in 21,015 discovery samples from the Trans-Omics for Precision Medicine (TOPMed) program and replicate several of them in an additional 9,123 TOPMed samples. We also analyze five non-lipid TOPMed traits.
Collapse
Affiliation(s)
- Zilin Li
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
- Department of Biostatistics and Health Data Science, Indiana University School of Medicine, Indianapolis, IN, USA.
| | - Xihao Li
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Hufeng Zhou
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Sheila M Gaynor
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Margaret Sunitha Selvaraj
- Center for Genomic Medicine and Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Theodore Arapoglou
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Corbin Quick
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Yaowu Liu
- School of Statistics, Southwestern University of Finance and Economics, Chengdu, China
| | - Han Chen
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Ryan Sun
- Department of Biostatistics, University of Texas MD Anderson Cancer Center, Houston, TX, USA
| | - Rounak Dey
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Donna K Arnett
- Dean's Office, University of Kentucky, College of Public Health, Lexington, KY, USA
| | - Paul L Auer
- Division of Biostatistics, Institute for Health & Equity and Cancer Center, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Lawrence F Bielak
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Joshua C Bis
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Thomas W Blackwell
- Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA
| | - John Blangero
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, School of Medicine, The University of Texas Rio Grande Valley, Brownsville, TX, USA
| | - Eric Boerwinkle
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Donald W Bowden
- Department of Biochemistry, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Jennifer A Brody
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
| | - Brian E Cade
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA
- Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA
| | - Matthew P Conomos
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - Adolfo Correa
- Jackson Heart Study, Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
| | - L Adrienne Cupples
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
- Framingham Heart Study, National Heart, Lung, and Blood Institute and Boston University, Framingham, MA, USA
| | - Joanne E Curran
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, School of Medicine, The University of Texas Rio Grande Valley, Brownsville, TX, USA
| | - Paul S de Vries
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Ravindranath Duggirala
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, School of Medicine, The University of Texas Rio Grande Valley, Brownsville, TX, USA
| | - Nora Franceschini
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, USA
| | - Barry I Freedman
- Department of Internal Medicine, Nephrology, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Harald H H Göring
- Department of Human Genetics and South Texas Diabetes and Obesity Institute, School of Medicine, The University of Texas Rio Grande Valley, Brownsville, TX, USA
| | - Xiuqing Guo
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Rita R Kalyani
- GeneSTAR Research Program, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Charles Kooperberg
- Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Brian G Kral
- GeneSTAR Research Program, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Leslie A Lange
- Division of Biomedical Informatics and Personalized Medicine, Department of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Bridget M Lin
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC, USA
| | - Ani Manichaikul
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Alisa K Manning
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Metabolism Program, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Clinical and Translational Epidemiology Unit, Mongan Institute, Massachusetts General Hospital, Boston, MA, USA
| | - Lisa W Martin
- Division in Cardiology, George Washington School of Medicine and Health Sciences, Washington, DC, USA
| | - Rasika A Mathias
- GeneSTAR Research Program, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - James B Meigs
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Division of General Internal Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Braxton D Mitchell
- Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
- Geriatrics Research and Education Clinical Center, Baltimore VA Medical Center, Baltimore, MD, USA
| | - May E Montasser
- Division of Endocrinology, Diabetes, and Nutrition, Program for Personalized and Genomic Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Alanna C Morrison
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Take Naseri
- Ministry of Health, Government of Samoa, Apia, Samoa
| | - Jeffrey R O'Connell
- Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Nicholette D Palmer
- Department of Biochemistry, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Patricia A Peyser
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Bruce M Psaty
- Cardiovascular Health Research Unit, Department of Medicine, University of Washington, Seattle, WA, USA
- Department of Epidemiology, University of Washington, Seattle, WA, USA
- Departments of Health Systems and Population Health, University of Washington, Seattle, WA, USA
| | - Laura M Raffield
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Susan Redline
- Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA
- Division of Sleep Medicine, Harvard Medical School, Boston, MA, USA
- Division of Pulmonary, Critical Care, and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Alexander P Reiner
- Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
- Department of Epidemiology, University of Washington, Seattle, WA, USA
| | | | - Kenneth M Rice
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Jennifer A Smith
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
| | - Kent D Taylor
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Margaret A Taub
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Ramachandran S Vasan
- Framingham Heart Study, National Heart, Lung, and Blood Institute and Boston University, Framingham, MA, USA
- Department of Medicine, Boston University School of Medicine, Boston, MA, USA
| | - Daniel E Weeks
- Department of Human Genetics and Biostatistics, University of Pittsburgh, Pittsburgh, PA, USA
| | - James G Wilson
- Division of Cardiology, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Lisa R Yanek
- GeneSTAR Research Program, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Wei Zhao
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Jerome I Rotter
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - Cristen J Willer
- Department of Internal Medicine, University of Michigan, Ann Arbor, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA
| | - Pradeep Natarajan
- Center for Genomic Medicine and Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Gina M Peloso
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
- Framingham Heart Study, National Heart, Lung, and Blood Institute and Boston University, Framingham, MA, USA
| | - Xihong Lin
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA.
- Department of Statistics, Harvard University, Cambridge, MA, USA.
| |
Collapse
|
40
|
Giannareas N, Zhang Q, Yang X, Na R, Tian Y, Yang Y, Ruan X, Huang D, Yang X, Wang C, Zhang P, Manninen A, Wang L, Wei GH. Extensive germline-somatic interplay contributes to prostate cancer progression through HNF1B co-option of TMPRSS2-ERG. Nat Commun 2022; 13:7320. [PMID: 36443337 PMCID: PMC9705428 DOI: 10.1038/s41467-022-34994-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 11/15/2022] [Indexed: 11/29/2022] Open
Abstract
Genome-wide association studies have identified 270 loci conferring risk for prostate cancer (PCa), yet the underlying biology and clinical impact remain to be investigated. Here we observe an enrichment of transcription factor genes including HNF1B within PCa risk-associated regions. While focused on the 17q12/HNF1B locus, we find a strong eQTL for HNF1B and multiple potential causal variants involved in the regulation of HNF1B expression in PCa. An unbiased genome-wide co-expression analysis reveals PCa-specific somatic TMPRSS2-ERG fusion as a transcriptional mediator of this locus and the HNF1B eQTL signal is ERG fusion status dependent. We investigate the role of HNF1B and find its involvement in several pathways related to cell cycle progression and PCa severity. Furthermore, HNF1B interacts with TMPRSS2-ERG to co-occupy large proportion of genomic regions with a remarkable enrichment of additional PCa risk alleles. We finally show that HNF1B co-opts ERG fusion to mediate mechanistic and biological effects of the PCa risk-associated locus 17p13.3/VPS53/FAM57A/GEMIN4. Taken together, we report an extensive germline-somatic interaction between TMPRSS2-ERG fusion and genetic variations underpinning PCa risk association and progression.
Collapse
Affiliation(s)
- Nikolaos Giannareas
- Disease Networks Research Unit, Faculty of Biochemistry and Molecular Medicine & Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Qin Zhang
- Disease Networks Research Unit, Faculty of Biochemistry and Molecular Medicine & Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Xiayun Yang
- Disease Networks Research Unit, Faculty of Biochemistry and Molecular Medicine & Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Rong Na
- Division of Urology, Department of Surgery, Li Ka Shing Faculty of Medicine, the University of Hong Kong, Hong Kong, China
| | - Yijun Tian
- Department of Tumour Biology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, USA
| | - Yuehong Yang
- Disease Networks Research Unit, Faculty of Biochemistry and Molecular Medicine & Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Xiaohao Ruan
- Department of Urology, Ruijin Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Da Huang
- Department of Urology, Ruijin Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Xiaoqun Yang
- Department of Pathology, Ruijin Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Chaofu Wang
- Department of Pathology, Ruijin Hospital, Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Peng Zhang
- Fudan University Shanghai Cancer Center & MOE Key Laboratory of Metabolism and Molecular Medicine and Department of Biochemistry and Molecular Biology of School of Basic Medical Sciences, Shanghai Medical College of Fudan University, Shanghai, China
| | - Aki Manninen
- Disease Networks Research Unit, Faculty of Biochemistry and Molecular Medicine & Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Liang Wang
- Department of Tumour Biology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, FL, USA
| | - Gong-Hong Wei
- Disease Networks Research Unit, Faculty of Biochemistry and Molecular Medicine & Biocenter Oulu, University of Oulu, Oulu, Finland.
- Fudan University Shanghai Cancer Center & MOE Key Laboratory of Metabolism and Molecular Medicine and Department of Biochemistry and Molecular Biology of School of Basic Medical Sciences, Shanghai Medical College of Fudan University, Shanghai, China.
| |
Collapse
|
41
|
Yang Y, Wang D, Miao YR, Wu X, Luo H, Cao W, Yang W, Yang J, Guo AY, Gong J. lncRNASNP v3: an updated database for functional variants in long non-coding RNAs. Nucleic Acids Res 2022; 51:D192-D198. [PMID: 36350671 PMCID: PMC9825536 DOI: 10.1093/nar/gkac981] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 10/12/2022] [Accepted: 11/03/2022] [Indexed: 11/11/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) act as versatile regulators of many biological processes and play vital roles in various diseases. lncRNASNP is dedicated to providing a comprehensive repository of single nucleotide polymorphisms (SNPs) and somatic mutations in lncRNAs and their impacts on lncRNA structure and function. Since the last release in 2018, there has been a huge increase in the number of variants and lncRNAs. Thus, we updated the lncRNASNP to version 3 by expanding the species to eight eukaryotic species (human, chimpanzee, pig, mouse, rat, chicken, zebrafish, and fruitfly), updating the data and adding several new features. SNPs in lncRNASNP have increased from 11 181 387 to 67 513 785. The human mutations have increased from 1 174 768 to 2 387 685, including 1 031 639 TCGA mutations and 1 356 046 CosmicNCVs. Compared with the last release, updated and new features in lncRNASNP v3 include (i) SNPs in lncRNAs and their impacts on lncRNAs for eight species, (ii) SNP effects on miRNA-lncRNA interactions for eight species, (iii) lncRNA expression profiles for six species, (iv) disease & GWAS-associated lncRNAs and variants, (v) experimental & predicted lncRNAs and drug target associations and (vi) SNP effects on lncRNA expression (eQTL) across tumor & normal tissues. The lncRNASNP v3 is freely available at http://gong_lab.hzau.edu.cn/lncRNASNP3/.
Collapse
Affiliation(s)
| | | | - Ya-Ru Miao
- Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Xiaohong Wu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Haohui Luo
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Wen Cao
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Wenqian Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Jianye Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - An-Yuan Guo
- Correspondence may also be addressed to An-Yuan Guo. Tel: +86 27 8779 3177; Fax: +86 27 8779 3177;
| | - Jing Gong
- To whom correspondence should be addressed. Tel: +86 27 8728 5085; Fax: +86 27 8728 5085;
| |
Collapse
|
42
|
Pang B, van Weerd JH, Hamoen FL, Snyder MP. Identification of non-coding silencer elements and their regulation of gene expression. Nat Rev Mol Cell Biol 2022; 24:383-395. [DOI: 10.1038/s41580-022-00549-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/29/2022] [Indexed: 11/09/2022]
|
43
|
Multi-omics approach dissects cis-regulatory mechanisms underlying North Carolina macular dystrophy, a retinal enhanceropathy. Am J Hum Genet 2022; 109:2029-2048. [PMID: 36243009 PMCID: PMC9674966 DOI: 10.1016/j.ajhg.2022.09.013] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Accepted: 09/28/2022] [Indexed: 01/26/2023] Open
Abstract
North Carolina macular dystrophy (NCMD) is a rare autosomal-dominant disease affecting macular development. The disease is caused by non-coding single-nucleotide variants (SNVs) in two hotspot regions near PRDM13 and by duplications in two distinct chromosomal loci, overlapping DNase I hypersensitive sites near either PRDM13 or IRX1. To unravel the mechanisms by which these variants cause disease, we first established a genome-wide multi-omics retinal database, RegRet. Integration of UMI-4C profiles we generated on adult human retina then allowed fine-mapping of the interactions of the PRDM13 and IRX1 promoters and the identification of eighteen candidate cis-regulatory elements (cCREs), the activity of which was investigated by luciferase and Xenopus enhancer assays. Next, luciferase assays showed that the non-coding SNVs located in the two hotspot regions of PRDM13 affect cCRE activity, including two NCMD-associated non-coding SNVs that we identified herein. Interestingly, the cCRE containing one of these SNVs was shown to interact with the PRDM13 promoter, demonstrated in vivo activity in Xenopus, and is active at the developmental stage when progenitor cells of the central retina exit mitosis, suggesting that this region is a PRDM13 enhancer. Finally, mining of single-cell transcriptional data of embryonic and adult retina revealed the highest expression of PRDM13 and IRX1 when amacrine cells start to synapse with retinal ganglion cells, supporting the hypothesis that altered PRDM13 or IRX1 expression impairs interactions between these cells during retinogenesis. Overall, this study provides insight into the cis-regulatory mechanisms of NCMD and supports that this condition is a retinal enhanceropathy.
Collapse
|
44
|
Huang T, Li J, Zhao H, Ngamphiw C, Tongsima S, Kantaputra P, Kittitharaphan W, Wang SM. Core promoter in TNBC is highly mutated with rich ethnic signature. Brief Funct Genomics 2022; 22:9-19. [PMID: 36307127 PMCID: PMC9853936 DOI: 10.1093/bfgp/elac035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 09/23/2022] [Accepted: 09/28/2022] [Indexed: 01/25/2023] Open
Abstract
The core promoter plays an essential role in regulating transcription initiation by controlling the interaction between transcriptional factors and sequence motifs in the core promoter. Although mutation in core promoter sequences is expected to cause abnormal gene expression leading to pathogenic consequences, limited supporting evidence showed the involvement of core promoter mutation in diseases. Our previous study showed that the core promoter is highly polymorphic in worldwide human ethnic populations in reflecting human history and adaptation. Our recent characterization of the core promoter in triple-negative breast cancer (TNBC), a subtype of breast cancer, in a Chinese TNBC cohort revealed the wide presence of core promoter mutation in TNBC. In the current study, we analyzed the core promoter in a Thai TNBC cohort. We also observed rich core promoter mutation in the Thai TNBC patients. We compared the core promoter mutations between Chinese and Thai TNBC cohorts. We observed substantial differences of core promoter mutation in TNBC between the two cohorts, as reflected by the mutation spectrum, mutation-effected gene and functional category, and altered gene expression. Our study confirmed that the core promoter in TNBC is highly mutable, and is highly ethnic-specific.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - San Ming Wang
- Corresponding author: S.M. Wang, Faculty of Health Sciences, University of Macau, Taipa, Macau 999078, China. Tel.: +(853) 8822-4836; E-mail:
| |
Collapse
|
45
|
Ni P, Wilson D, Su Z. A map of cis-regulatory modules and constituent transcription factor binding sites in 80% of the mouse genome. BMC Genomics 2022; 23:714. [PMID: 36261804 PMCID: PMC9583556 DOI: 10.1186/s12864-022-08933-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Accepted: 10/11/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Mouse is probably the most important model organism to study mammal biology and human diseases. A better understanding of the mouse genome will help understand the human genome, biology and diseases. However, despite the recent progress, the characterization of the regulatory sequences in the mouse genome is still far from complete, limiting its use to understand the regulatory sequences in the human genome. RESULTS Here, by integrating binding peaks in ~ 9,000 transcription factor (TF) ChIP-seq datasets that cover 79.9% of the mouse mappable genome using an efficient pipeline, we were able to partition these binding peak-covered genome regions into a cis-regulatory module (CRM) candidate (CRMC) set and a non-CRMC set. The CRMCs contain 912,197 putative CRMs and 38,554,729 TF binding sites (TFBSs) islands, covering 55.5% and 24.4% of the mappable genome, respectively. The CRMCs tend to be under strong evolutionary constraints, indicating that they are likely cis-regulatory; while the non-CRMCs are largely selectively neutral, indicating that they are unlikely cis-regulatory. Based on evolutionary profiles of the genome positions, we further estimated that 63.8% and 27.4% of the mouse genome might code for CRMs and TFBSs, respectively. CONCLUSIONS Validation using experimental data suggests that at least most of the CRMCs are authentic. Thus, this unprecedentedly comprehensive map of CRMs and TFBSs can be a good resource to guide experimental studies of regulatory genomes in mice and humans.
Collapse
Affiliation(s)
- Pengyu Ni
- Department of Bioinformatics and Genomics, the University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - David Wilson
- Department of Bioinformatics and Genomics, the University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| | - Zhengchang Su
- Department of Bioinformatics and Genomics, the University of North Carolina at Charlotte, Charlotte, NC, 28223, USA.
| |
Collapse
|
46
|
Wang D, Wu X, Jiang G, Yang J, Yu Z, Yang Y, Yang W, Niu X, Tang K, Gong J. Systematic analysis of the effects of genetic variants on chromatin accessibility to decipher functional variants in non-coding regions. Front Oncol 2022; 12:1035855. [PMID: 36330496 PMCID: PMC9623183 DOI: 10.3389/fonc.2022.1035855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2022] [Accepted: 10/03/2022] [Indexed: 11/18/2022] Open
Abstract
Genome-wide association study (GWAS) has identified thousands of single nucleotide polymorphisms (SNPs) associated with complex diseases and traits. However, deciphering the functions of these SNPs still faces challenges. Recent studies have shown that SNPs could alter chromatin accessibility and result in differences in tumor susceptibility between individuals. Therefore, systematically analyzing the effects of SNPs on chromatin accessibility could help decipher the functions of SNPs, especially those in non-coding regions. Using data from The Cancer Genome Atlas (TCGA), chromatin accessibility quantitative trait locus (caQTL) analysis was conducted to estimate the associations between genetic variants and chromatin accessibility. We analyzed caQTLs in 23 human cancer types and identified 9,478 caQTLs in breast carcinoma (BRCA). In BRCA, these caQTLs tend to alter the binding affinity of transcription factors, and open chromatin regions regulated by these caQTLs are enriched in regulatory elements. By integrating with eQTL data, we identified 141 caQTLs showing a strong signal for colocalization with eQTLs. We also identified 173 caQTLs in genome-wide association studies (GWAS) loci and inferred several possible target genes of these caQTLs. By performing survival analysis, we found that ~10% caQTLs potentially influence the prognosis of patients. To facilitate access to relevant data, we developed a user-friendly data portal, BCaQTL (http://gong_lab.hzau.edu.cn/caqtl_database), for data searching and downloading. Our work may facilitate fine-map regulatory mechanisms underlying risk loci of cancer and discover the biomarkers or therapeutic targets for cancer prognosis. The BCaQTL database will be an important resource for genetic and epigenetic studies.
Collapse
Affiliation(s)
- Dongyang Wang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Xiaohong Wu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Guanghui Jiang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Jianye Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Zhanhui Yu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Yanbo Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Wenqian Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Xiaohui Niu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Ke Tang
- Department of Biochemistry and Molecular Biology, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- *Correspondence: Jing Gong, ; Ke Tang,
| | - Jing Gong
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
- College of Biomedicine and Health, Huazhong Agricultural University, Wuhan, China
- *Correspondence: Jing Gong, ; Ke Tang,
| |
Collapse
|
47
|
Pudjihartono M, Perry JK, Print C, O'Sullivan JM, Schierding W. Interpretation of the role of germline and somatic non-coding mutations in cancer: expression and chromatin conformation informed analysis. Clin Epigenetics 2022; 14:120. [PMID: 36171609 PMCID: PMC9520844 DOI: 10.1186/s13148-022-01342-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 09/21/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND There has been extensive scrutiny of cancer driving mutations within the exome (especially amino acid altering mutations) as these are more likely to have a clear impact on protein functions, and thus on cell biology. However, this has come at the neglect of systematic identification of regulatory (non-coding) variants, which have recently been identified as putative somatic drivers and key germline risk factors for cancer development. Comprehensive understanding of non-coding mutations requires understanding their role in the disruption of regulatory elements, which then disrupt key biological functions such as gene expression. MAIN BODY We describe how advancements in sequencing technologies have led to the identification of a large number of non-coding mutations with uncharacterized biological significance. We summarize the strategies that have been developed to interpret and prioritize the biological mechanisms impacted by non-coding mutations, focusing on recent annotation of cancer non-coding variants utilizing chromatin states, eQTLs, and chromatin conformation data. CONCLUSION We believe that a better understanding of how to apply different regulatory data types into the study of non-coding mutations will enhance the discovery of novel mechanisms driving cancer.
Collapse
Affiliation(s)
| | - Jo K Perry
- Liggins Institute, The University of Auckland, Auckland, New Zealand
- The Maurice Wilkins Centre, The University of Auckland, Auckland, New Zealand
| | - Cris Print
- The Maurice Wilkins Centre, The University of Auckland, Auckland, New Zealand
- Department of Molecular Medicine and Pathology, School of Medical Sciences, University of Auckland, Auckland, 1142, New Zealand
| | - Justin M O'Sullivan
- Liggins Institute, The University of Auckland, Auckland, New Zealand
- The Maurice Wilkins Centre, The University of Auckland, Auckland, New Zealand
- Australian Parkinson's Mission, Garvan Institute of Medical Research, Sydney, NSW, Australia
- MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton, UK
| | - William Schierding
- Liggins Institute, The University of Auckland, Auckland, New Zealand.
- The Maurice Wilkins Centre, The University of Auckland, Auckland, New Zealand.
| |
Collapse
|
48
|
Modeling tissue-specific breakpoint proximity of structural variations from whole-genomes to identify cancer drivers. Nat Commun 2022; 13:5640. [PMID: 36163358 PMCID: PMC9512825 DOI: 10.1038/s41467-022-32945-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Accepted: 08/24/2022] [Indexed: 11/11/2022] Open
Abstract
Structural variations (SVs) in cancer cells often impact large genomic regions with functional consequences. However, identification of SVs under positive selection is a challenging task because little is known about the genomic features related to the background breakpoint distribution in different cancers. We report a method that uses a generalized additive model to investigate the breakpoint proximity curves from 2,382 whole-genomes of 32 cancer types. We find that a multivariate model, which includes linear and nonlinear partial contributions of various tissue-specific features and their interaction terms, can explain up to 57% of the observed deviance of breakpoint proximity. In particular, three-dimensional genomic features such as topologically associating domains (TADs), TAD-boundaries and their interaction with other features show significant contributions. The model is validated by identification of known cancer genes and revealed putative drivers in cancers different than those with previous evidence of positive selection. Identifying structural variants (SVs) under positive selection in cancer is challenging. Here, the authors develop CSVDriver, a method that computes SV breakpoint proximity and the contribution of elements such as topologically associating domains, and identifies loci that show signs of positive selection and contain known and putative drivers.
Collapse
|
49
|
Nisar A, Kayani MA, Nasir W, Mehmood A, Ahmed MW, Parvez A, Mahjabeen I. Fyn and Lyn gene polymorphisms impact the risk of thyroid cancer. Mol Genet Genomics 2022; 297:1649-1659. [PMID: 36058999 DOI: 10.1007/s00438-022-01946-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2021] [Accepted: 08/11/2022] [Indexed: 10/14/2022]
Abstract
Thyroid cancer is the most common malignancy of the endocrine glands, and during last couple of decades, its incidence has risen alarmingly, across the globe. Etiology of thyroid cancer is still debatable. There are a few worth mentioning risk factors which contribute to initiation of abnormalities in thyroid gland leading to cancer. Genetic instability is major risk factors in thyroid carcinogenesis. Among the genetic factors, the Src family of genes (Src, Yes1, Fyn and Lyn) have been implicated in many cancers but there is little data regarding the association of these (Src, Yes1, Fyn and Lyn) genes with thyroid carcinogenesis. Fyn and Lyn genes of Src family found engaged in proliferation, migration, invasion, angiogenesis, and metastasis in different cancers. This study was planned to examine the effect of Fyn and Lyn SNPs on thyroid cancer risk in Pakistani population in 500 patients and 500 controls. Three polymorphisms of Fyn gene (rs6916861, rs2182644 and rs12910) and three polymorphisms of Lyn gene (rs2668011, rs45587541 and rs45489500) were analyzed using Tetra-primer ARMS-PCR followed by DNA sequencing. SNP rs6916861 of Fyn gene mutant genotype (CC) showed statistically significant threefold increased risk of thyroid cancer (P < 0.0001). In case of rs2182644 of Fyn gene, mutant genotype (AA) indicated statistically significant 17-fold increased risk of thyroid cancer (P < 0.0001). Statistically significant threefold increased risk of thyroid cancer was observed in genotype AC (P < 0.0001) of Fyn gene polymorphism rs12910. In SNP rs2668011 of Lyn gene, TT genotype showed statistically significant threefold increased risk of thyroid cancer (P < 0.0001). In case of rs45587541 of Lyn gene, GA genotypes showed statistically significant 11-fold increased risk in thyroid cancer (P < 0.0001). Haplotype analysis revealed that AAATAG*, AGACAG*, AGCCAA*, AGCCAG*, CAATAG*, CGCCAG* and CGCCGA* haplotypes of Fyn and Lyn polymorphisms are associated with increased thyroid cancer risk. These results showed that genotypes and allele distribution of Fyn and Lyn are significantly linked with increased thyroid cancer risk and could be genetic adjuster for said disease.
Collapse
Affiliation(s)
- Asif Nisar
- Cancer Genetics and Epigenetics Lab, Department of Biosciences, COMSATS University Islamabad, Park Road Tarlai Kalan, Islamabad, Pakistan
| | - Mahmood Akhtar Kayani
- Cancer Genetics and Epigenetics Lab, Department of Biosciences, COMSATS University Islamabad, Park Road Tarlai Kalan, Islamabad, Pakistan
| | - Wajiha Nasir
- Department of Radiation, Nuclear Oncology Radiation Institute, Islamabad, Pakistan
| | - Azhar Mehmood
- Cancer Genetics and Epigenetics Lab, Department of Biosciences, COMSATS University Islamabad, Park Road Tarlai Kalan, Islamabad, Pakistan
| | - Malik Waqar Ahmed
- Cancer Genetics and Epigenetics Lab, Department of Biosciences, COMSATS University Islamabad, Park Road Tarlai Kalan, Islamabad, Pakistan.,Pakistan Institute of Rehabilitation Sciences (PIRS), Isra University Islamabad Campus, Islamabad, Pakistan
| | - Aamir Parvez
- Cancer Genetics and Epigenetics Lab, Department of Biosciences, COMSATS University Islamabad, Park Road Tarlai Kalan, Islamabad, Pakistan
| | - Ishrat Mahjabeen
- Cancer Genetics and Epigenetics Lab, Department of Biosciences, COMSATS University Islamabad, Park Road Tarlai Kalan, Islamabad, Pakistan.
| |
Collapse
|
50
|
Identification of cell cycle-associated and -unassociated regulators for expression of a hepatocellular carcinoma oncogene cyclin-dependent kinase inhibitor 3. Biochem Biophys Res Commun 2022; 625:46-52. [DOI: 10.1016/j.bbrc.2022.07.088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Accepted: 07/22/2022] [Indexed: 11/23/2022]
|