1
|
Abdelhameed AA, Ali M, Darwish DBE, AlShaqhaa MA, Selim DAFH, Nagah A, Zayed M. Induced genetic diversity through mutagenesis in wheat gene pool and significant use of SCoT markers to underpin key agronomic traits. BMC PLANT BIOLOGY 2024; 24:673. [PMID: 39004709 PMCID: PMC11247860 DOI: 10.1186/s12870-024-05345-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 06/26/2024] [Indexed: 07/16/2024]
Abstract
BACKGROUND This research explores the efficacy of mutagenesis, specifically using sodium azide (SA) and hydrazine hydrate (HZ) treatments, to introduce genetic diversity and enhance traits in three wheat (Triticum aestivum L.) genotypes. The experiment entails subjecting the seeds to different doses of SA and HZ and cultivating them in the field for two consecutive generations: M1 (first generation) and M2 (second generation). We then employed selective breeding techniques with Start Codon Targeted (SCoT) markers to select traits within the wheat gene pool. Also, the correlation between SCoT markers and specific agronomic traits provides insights into the genetic mechanisms underlying mutagenesis-induced changes in wheat. RESULTS In the study, eleven genotypes were derived from parent varieties Sids1, Sids12, and Giza 168, and eight mutant genotypes were selected from the M1 generation and further cultivated to establish the M2 generation. The results revealed that various morphological and agronomical characteristics, such as plant height, spikes per plant, spike length, spikelet per spike, grains per spikelet, and 100-grain weight, showed increases in different genotypes from M1 to M2. SCoT markers were employed to assess genetic diversity among the eleven genotypes. The bioinformatics analysis identified a correlation between SCoT markers and the transcription factors ABSCISIC ACID INSENSITIVE3 (ABI3) and VIVIPAROUS1 (VP1), crucial for plant development, growth, and stress adaptation. A comprehensive examination of genetic distance and the function identification of gene-associated SCoT markers may provide valuable insights into the mechanisms by which SA and HZ act as mutagens, enhancing wheat agronomic qualities. CONCLUSIONS This study demonstrates the effective use of SA and HZ treatments to induce gene diversity through mutagenesis in the wheat gene pool, resulting in the enhancement of agronomic traits, as revealed by SCoT markers. The significant improvements in morphological and agronomical characteristics highlight the potential of mutagenesis techniques for crop improvement. These findings offer valuable information for breeders to develop effective breeding programs to enhance wheat quality and resilience through increased genetic diversity.
Collapse
Affiliation(s)
- Ahmed Ali Abdelhameed
- Agricultural Botany Department (Genetics), Faculty of Agriculture, Al-Azhar University, Assuit Branch, Assuit, 71524, Egypt
| | - Mohammed Ali
- Maryout Research Station, Genetic Resources Department, Desert Research Center, 1 Mathaf El-Matarya St., El-Matareya, Cairo, 11753, Egypt
| | | | | | - Dalia Abdel-Fattah H Selim
- Department of Agricultural Botany, Faculty of Agricultural, Menoufia University, Shebin El-Kom, 32511, Egypt
| | - Aziza Nagah
- Botany and Microbiology Department, Faculty of Science, Benha University, Benha, 13518, Egypt
| | - Muhammad Zayed
- Department of Botany and Microbiology, Faculty of Science, Menoufia University, Shebin El-Kom, 32511, Egypt.
| |
Collapse
|
2
|
Ali M, Abdelkawy AM, Darwish DBE, Alatawi HA, Alshehri D, Al-Amrah H, Soudy FA. Changes in Metabolite Profiling and Expression Levels of Key Genes Involved in the Terpenoid Biosynthesis Pathway in Garden Sage ( Salvia officinalis) under the Effect of Hydrazine Hydrate. Metabolites 2023; 13:807. [PMID: 37512514 PMCID: PMC10385164 DOI: 10.3390/metabo13070807] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Revised: 06/20/2023] [Accepted: 06/26/2023] [Indexed: 07/30/2023] Open
Abstract
Mutagenesis is a highly efficient tool for establishing genetic variation and is widely used for genetic enhancement in various plants. The key benefit of mutation breeding is the prospect of enhancing one or several characteristics of a variety without altering the genetic background. In this study, we exposed the seeds of Salvia officinalis to four concentrations of hydrazine hydrate (HZ), i.e., (0%, 0.1%, 0.2%, and 0.3%) for 6 h. The contents of terpenoid compounds in the S. officinalis plantlets driven from the HZ-treated seeds were determined by GC-MS, which resulted in the identification of a total of 340 phytochemical compounds; 163 (87.48%), 145 (84.49%), 65 (97.45%), and 62 (98.32%), from the four concentrations of HZ (0%, 0.1%, 0.2%, and 0.3%), respectively. Furthermore, we used the qRT-PCR system to disclose the "transcriptional control" for twelve TPS genes related to terpenoid and terpene biosynthesis, namely, SoGPS, SoMYRS, SoNEOD, SoCINS, SoSABS, SoLINS, SoFPPS, SoHUMS, SoTPS6, SoSQUS, SoGGPS, and SoGA2. Altogether, results are likely to ensure some positive relationship between the concentrations of the chemical mutagen HZ used for treating the seeds, the type and amount of the produced terpenes, and the expression of their corresponding genes.
Collapse
Affiliation(s)
- Mohammed Ali
- Maryout Research Station, Genetic Resources Department, Desert Research Center, 1 Mathaf El-Matarya St., El-Matareya, Cairo 11753, Egypt
| | - Aisha M Abdelkawy
- Botany and Microbiology Department, Faculty of Science, Al-Azhar University (Girls Branch), Cairo 11751, Egypt
| | - Doaa Bahaa Eldin Darwish
- Biology Department, Faculty of Science, University of Tabuk, Tabuk 71491, Saudi Arabia
- Botany Department, Faculty of Science, Mansoura University, Mansoura 35511, Egypt
| | - Hanan Ali Alatawi
- Department of Biological Sciences, University Collage of Haqel, University of Tabuk, Tabuk 47512, Saudi Arabia
| | - Dikhnah Alshehri
- Biology Department, Faculty of Science, University of Tabuk, Tabuk 71491, Saudi Arabia
| | - Hadba Al-Amrah
- Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - Fathia A Soudy
- Genetics and Genetic Engineering Department, Faculty of Agriculture, Benha University, Moshtohor 13736, Egypt
| |
Collapse
|
3
|
Chikhale V, Goswami N, Khan MA, Borah P, Varma AK. Evaluation of Pathogenicity and Structural Alterations for the Mutations Identified in the Conserved Region of the C-Terminal Kinase Domain of Human-Ribosomal S6 Kinase 1. ACS OMEGA 2023; 8:16273-16283. [PMID: 37179615 PMCID: PMC10173430 DOI: 10.1021/acsomega.3c00722] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Accepted: 03/23/2023] [Indexed: 05/15/2023]
Abstract
Human-ribosomal s6 kinase 1 (h-RSK1) is an effector kinase of the Ras/MAPK signaling pathway, which is involved in the regulation of the cell cycle, proliferation, and survival. RSKs comprise two functionally distinct kinase domains at the N-terminal (NTKD) and C-terminal (CTKD) separated by a linker region. The mutations in RSK1 may have the potential to provide an extra benefit to the cancer cell to proliferate, migrate, and survive. The present study focuses on evaluating the structural basis for the missense mutations identified at the C-terminal kinase domain of human-RSK1. A total of 139 mutations reported on RSK1 were retrieved from cBioPortal, where 62 were located at the CTKD region. Furthermore, 10 missense mutations Arg434Pro, Thr701Met, Ala704Thr, Arg725Trp, Arg726Gln, His533Asn, Pro613Leu, Ser720Cys, Arg725Gln, and Ser732Phe were predicted to be deleterious using in silico tools. To our observation, these mutations are located in the evolutionarily conserved region of RSK1 and shown to alter the inter- and intramolecular interactions and also the conformational stability of RSK1-CTKD. The molecular dynamics (MD) simulation study further revealed that the five mutations Arg434Pro, Thr701Met, Ala704Thr, Arg725Trp, and Arg726Gln showed maximum structural alterations in RSK1-CTKD. Thus, based on the in silico and MD simulation analysis, it can be concluded that the reported mutations may serve as potential candidates for further functional studies.
Collapse
Affiliation(s)
- Vaishnvee Chikhale
- Advanced
Centre for Treatment, Research and Education in Cancer, Navi Mumbai, Maharashtra 410210, India
- Training
School Complex, Homi Bhabha National Institute, Anushaktinagar, Mumbai, Maharashtra 400094, India
| | - Nabajyoti Goswami
- Advanced
Centre for Treatment, Research and Education in Cancer, Navi Mumbai, Maharashtra 410210, India
| | - Mudassar Ali Khan
- Advanced
Centre for Treatment, Research and Education in Cancer, Navi Mumbai, Maharashtra 410210, India
- Training
School Complex, Homi Bhabha National Institute, Anushaktinagar, Mumbai, Maharashtra 400094, India
| | - Probodh Borah
- Bioinformatics
Infrastructure Facility, Department of Animal Biotechnology, Assam Agricultural University, Khanapara, Guwahati, Assam 781022, India
| | - Ashok K. Varma
- Advanced
Centre for Treatment, Research and Education in Cancer, Navi Mumbai, Maharashtra 410210, India
- Training
School Complex, Homi Bhabha National Institute, Anushaktinagar, Mumbai, Maharashtra 400094, India
| |
Collapse
|
4
|
Ostroverkhova D, Przytycka TM, Panchenko AR. Cancer driver mutations: predictions and reality. Trends Mol Med 2023:S1471-4914(23)00067-9. [PMID: 37076339 DOI: 10.1016/j.molmed.2023.03.007] [Citation(s) in RCA: 27] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 03/17/2023] [Accepted: 03/23/2023] [Indexed: 04/21/2023]
Abstract
Cancer cells accumulate many genetic alterations throughout their lifetime, but only a few of them drive cancer progression, termed driver mutations. Driver mutations may vary between cancer types and patients, can remain latent for a long time and become drivers at particular cancer stages, or may drive oncogenesis only in conjunction with other mutations. The high mutational, biochemical, and histological tumor heterogeneity makes driver mutation identification very challenging. In this review we summarize recent efforts to identify driver mutations in cancer and annotate their effects. We underline the success of computational methods to predict driver mutations in finding novel cancer biomarkers, including in circulating tumor DNA (ctDNA). We also report on the boundaries of their applicability in clinical research.
Collapse
Affiliation(s)
- Daria Ostroverkhova
- Department of Pathology and Molecular Medicine, Queen's University, Kingston, ON, Canada
| | - Teresa M Przytycka
- National Library of Medicine, National Institutes of Health (NIH), Bethesda, MD, USA.
| | - Anna R Panchenko
- Department of Pathology and Molecular Medicine, Queen's University, Kingston, ON, Canada; Department of Biology and Molecular Sciences, Queen's University, Kingston, ON, Canada; School of Computing, Queen's University, Kingston, ON, Canada; Ontario Institute of Cancer Research, Toronto, ON, Canada.
| |
Collapse
|
5
|
Computational Strategy for Minimizing Mycotoxins in Cereal Crops: Assessment of the Biological Activity of Compounds Resulting from Virtual Screening. MOLECULES (BASEL, SWITZERLAND) 2022; 27:molecules27082582. [PMID: 35458779 PMCID: PMC9025057 DOI: 10.3390/molecules27082582] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 04/12/2022] [Accepted: 04/14/2022] [Indexed: 11/17/2022]
Abstract
Cereal crops are frequently affected by toxigenic Fusarium species, among which the most common and worrying in Europe are Fusarium graminearum and Fusarium culmorum. These species are the causal agents of grain contamination with type B trichothecene (TCTB) mycotoxins. To help reduce the use of synthetic fungicides while guaranteeing low mycotoxin levels, there is an urgent need to develop new, efficient and environmentally-friendly plant protection solutions. Previously, F. graminearum proteins that could serve as putative targets to block the fungal spread and toxin production were identified and a virtual screening undertaken. Here, two selected compounds, M1 and M2, predicted, respectively, as the top compounds acting on the trichodiene synthase, a key enzyme of TCTB biosynthesis, and the 24-sterol-C-methyltransferase, a protein involved in ergosterol biosynthesis, were submitted for biological tests. Corroborating in silico predictions, M1 was shown to significantly inhibit TCTB yield by a panel of strains. Results were less obvious with M2 that induced only a slight reduction in fungal biomass. To go further, seven M1 analogs were assessed, which allowed evidencing of the physicochemical properties crucial for the anti-mycotoxin activity. Altogether, our results provide the first evidence of the promising potential of computational approaches to discover new anti-mycotoxin solutions
Collapse
|
6
|
SWAAT Bioinformatics Workflow for Protein Structure-Based Annotation of ADME Gene Variants. J Pers Med 2022; 12:jpm12020263. [PMID: 35207751 PMCID: PMC8875676 DOI: 10.3390/jpm12020263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2021] [Revised: 01/26/2022] [Accepted: 02/01/2022] [Indexed: 02/01/2023] Open
Abstract
Recent genomic studies have revealed the critical impact of genetic diversity within small population groups in determining the way individuals respond to drugs. One of the biggest challenges is to accurately predict the effect of single nucleotide variants and to get the relevant information that allows for a better functional interpretation of genetic data. Different conformational scenarios upon the changing in amino acid sequences of pharmacologically important proteins might impact their stability and plasticity, which in turn might alter the interaction with the drug. Current sequence-based annotation methods have limited power to access this type of information. Motivated by these calls, we have developed the Structural Workflow for Annotating ADME Targets (SWAAT) that allows for the prediction of the variant effect based on structural properties. SWAAT annotates a panel of 36 ADME genes including 22 out of the 23 clinically important members identified by the PharmVar consortium. The workflow consists of a set of Python codes of which the execution is managed within Nextflow to annotate coding variants based on 37 criteria. SWAAT also includes an auxiliary workflow allowing a versatile use for genes other than ADME members. Our tool also includes a machine learning random forest binary classifier that showed an accuracy of 73%. Moreover, SWAAT outperformed six commonly used sequence-based variant prediction tools (PROVEAN, SIFT, PolyPhen-2, CADD, MetaSVM, and FATHMM) in terms of sensitivity and has comparable specificity. SWAAT is available as an open-source tool.
Collapse
|
7
|
Ghadie M, Xia Y. Mutation Edgotype Drives Fitness Effect in Human. FRONTIERS IN BIOINFORMATICS 2021; 1:690769. [PMID: 36303776 PMCID: PMC9581054 DOI: 10.3389/fbinf.2021.690769] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2021] [Accepted: 08/18/2021] [Indexed: 11/24/2022] Open
Abstract
Missense mutations are known to perturb protein-protein interaction networks (known as interactome networks) in different ways. However, it remains unknown how different interactome perturbation patterns (“edgotypes”) impact organismal fitness. Here, we estimate the fitness effect of missense mutations with different interactome perturbation patterns in human, by calculating the fractions of neutral and deleterious mutations that do not disrupt PPIs (“quasi-wild-type”), or disrupt PPIs either by disrupting the binding interface (“edgetic”) or by disrupting overall protein stability (“quasi-null”). We first map pathogenic mutations and common non-pathogenic mutations onto homology-based three-dimensional structural models of proteins and protein-protein interactions in human. Next, we perform structure-based calculations to classify each mutation as either quasi-wild-type, edgetic, or quasi-null. Using our predicted as well as experimentally determined interactome perturbation patterns, we estimate that >∼40% of quasi-wild-type mutations are effectively neutral and the remaining are mostly mildly deleterious, that >∼75% of edgetic mutations are only mildly deleterious, and that up to ∼75% of quasi-null mutations may be strongly detrimental. These estimates are the first such estimates of fitness effect for different network perturbation patterns in any interactome. Our results suggest that while mutations that do not disrupt the interactome tend to be effectively neutral, the majority of human PPIs are under strong purifying selection and the stability of most human proteins is essential to human life.
Collapse
|
8
|
Ng JF, Fraternali F. Understanding the structural details of APOBEC3-DNA interactions using graph-based representations. Curr Res Struct Biol 2020; 2:130-143. [PMID: 34235473 PMCID: PMC8244423 DOI: 10.1016/j.crstbi.2020.07.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Revised: 07/17/2020] [Accepted: 07/21/2020] [Indexed: 12/22/2022] Open
Abstract
Human APOBEC3 (A3; apolipoprotein B mRNA editing catalytic polypeptide-like 3) is a family of seven enzymes involved in generating mutations in nascent reverse transcripts of many retroviruses, as well as the human genome in a range of cancer types. The structural details of the interaction between A3 proteins and DNA molecules are only available for a few family members. Here we use homology modelling techniques to address the difference in structural coverage of human A3 enzymes interacting with different DNA substrates. A3-DNA interfaces are represented as residue networks ("graphs"), based on which features at these interfaces are compared and quantified. We demonstrate that graph-based representations are effective in highlighting structural features of A3-DNA interfaces. By large-scale in silico mutagenesis of the bound DNA chain, we predicted the preference of substrate DNA sequence for multiple A3 domains. These data suggested that computational modelling approaches could contribute in the exploration of the structural basis for sequence specificity in A3 substrate selection, and demonstrated the utility of graph-based approaches in evaluating a large number of structural models generated in silico. APOBEC3(A3)-DNA structures have been resolved with modified deaminase domains. Structural modelling of interaction between wild-type A3 domains and DNA substrates. Graph-based representations reveal structural differences across A3-DNA interfaces. Using in silico mutagenesis we compared substrate preference of multiple A3 domains. Graph-based approaches can efficiently compare a large number of structural models.
Collapse
|
9
|
Nussinov R, Tsai C, Jang H. Autoinhibition can identify rare driver mutations and advise pharmacology. FASEB J 2019; 34:16-29. [DOI: 10.1096/fj.201901341r] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Revised: 09/18/2019] [Accepted: 10/09/2019] [Indexed: 12/16/2022]
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section Basic Science Program Frederick National Laboratory for Cancer Research Frederick MD USA
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine Tel Aviv University Tel Aviv Israel
| | - Chung‐Jung Tsai
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine Tel Aviv University Tel Aviv Israel
| | - Hyunbum Jang
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine Tel Aviv University Tel Aviv Israel
| |
Collapse
|
10
|
Nussinov R, Tsai CJ, Jang H. Why Are Some Driver Mutations Rare? Trends Pharmacol Sci 2019; 40:919-929. [PMID: 31699406 DOI: 10.1016/j.tips.2019.10.003] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 10/09/2019] [Accepted: 10/10/2019] [Indexed: 12/13/2022]
Abstract
Understanding why driver mutations that promote cancer are sometimes rare is important for precision medicine since it would help in their identification. Driver mutations are largely discovered through their frequencies. Thus, rare mutations often escape detection. Unlike high-frequency drivers, low-frequency drivers can be tissue specific; rare drivers have extremely low frequencies. Here, we discuss rare drivers and strategies to discover them. We suggest that allosteric driver mutations shift the protein ensemble from the inactive to the active state. Rare allosteric drivers are statistically rare since, to switch the protein functional state, they cooperate with additional mutations, and these are not considered in the patient cancer-specific protein sequence analysis. A complete landscape of mutations that drive cancer will reveal tumor-specific therapeutic vulnerabilities.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA; Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
| | - Chung-Jung Tsai
- Computational Structural Biology Section, Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
| | - Hyunbum Jang
- Computational Structural Biology Section, Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
| |
Collapse
|
11
|
Malhotra S, Alsulami AF, Heiyun Y, Ochoa BM, Jubb H, Forbes S, Blundell TL. Understanding the impacts of missense mutations on structures and functions of human cancer-related genes: A preliminary computational analysis of the COSMIC Cancer Gene Census. PLoS One 2019; 14:e0219935. [PMID: 31323058 PMCID: PMC6641202 DOI: 10.1371/journal.pone.0219935] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2019] [Accepted: 07/03/2019] [Indexed: 12/12/2022] Open
Abstract
Genomics and genome screening are proving central to the study of cancer. However, a good appreciation of the protein structures coded by cancer genes is also invaluable, especially for the understanding of functions, for assessing ligandability of potential targets, and for designing new drugs. To complement the wealth of information on the genetics of cancer in COSMIC, the most comprehensive database for cancer somatic mutations available, structural information obtained experimentally has been brought together recently in COSMIC-3D. Even where structural information is available for a gene in the Cancer Gene Census, a list of genes in COSMIC with substantial evidence supporting their impacts in cancer, this information is quite often for a single domain in a larger protein or for a single protomer in a multiprotein assembly. Here, we show that over 60% of the genes included in the Cancer Gene Census are predicted to possess multiple domains. Many are also multicomponent and membrane-associated molecular assemblies, with mutations recorded in COSMIC affecting such assemblies. However, only 469 of the gene products have a structure represented in the PDB, and of these only 87 structures have 90-100% coverage over the sequence and 69 have less than 10% coverage. As a first step to bridging gaps in our knowledge in the many cases where individual protein structures and domains are lacking, we discuss our attempts of protein structure modelling using our pipeline and investigating the effects of mutations using two of our in-house methods (SDM2 and mCSM) and identifying potential driver mutations. This allows us to begin to understand the effects of mutations not only on protein stability but also on protein-protein, protein-ligand and protein-nucleic acid interactions. In addition, we consider ways to combine the structural information with the wealth of mutation data available in COSMIC. We discuss the impacts of COSMIC missense mutations on protein structure in order to identify and assess the molecular consequences of cancer-driving mutations.
Collapse
Affiliation(s)
- Sony Malhotra
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Ali F. Alsulami
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Yang Heiyun
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | | | - Harry Jubb
- Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
| | - Simon Forbes
- Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
| | - Tom L. Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
12
|
Goncearenco A, Rager SL, Li M, Sang QX, Rogozin IB, Panchenko AR. Exploring background mutational processes to decipher cancer genetic heterogeneity. Nucleic Acids Res 2019; 45:W514-W522. [PMID: 28472504 PMCID: PMC5793731 DOI: 10.1093/nar/gkx367] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 04/21/2017] [Indexed: 01/08/2023] Open
Abstract
Much remains unknown about the progression and heterogeneity of mutational processes in different cancers and their diagnostic and clinical potential. A growing body of evidence supports mutation rate dependence on the local DNA sequence context for various types of mutations. We propose several tools for the analysis of cancer context-dependent mutations, which are implemented in an online computational framework MutaGene. The framework explores DNA context-dependent mutational patterns and underlying somatic cancer mutagenesis, analyzes mutational profiles of cancer samples, identifies the combinations of underlying mutagenic processes including those related to infidelity of DNA replication and repair machinery, and various other endogenous and exogenous mutagenic factors. As a result, the combination of mutagenic processes can be identified in any query sample with subsequent comparison to mutational profiles derived from malignant and benign samples. In addition, mutagen or cancer-specific mutational background models are applied to calculate expected DNA and protein site mutability to decouple relative contributions of mutagenesis and selection in carcinogenesis, thus elucidating the site-specific driving events in cancer. MutaGene is freely available at https://www.ncbi.nlm.nih.gov/projects/mutagene/.
Collapse
Affiliation(s)
| | - Stephanie L Rager
- National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA.,Columbia University, School of Engineering and Applied Science, New York, NY 10027, USA
| | - Minghui Li
- National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA
| | - Qing-Xiang Sang
- Department of Chemistry and Biochemistry, Florida State University, Tallahassee, Florida 32306, USA
| | - Igor B Rogozin
- National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA
| | - Anna R Panchenko
- National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA
| |
Collapse
|
13
|
Pagel KA, Antaki D, Lian A, Mort M, Cooper DN, Sebat J, Iakoucheva LM, Mooney SD, Radivojac P. Pathogenicity and functional impact of non-frameshifting insertion/deletion variation in the human genome. PLoS Comput Biol 2019; 15:e1007112. [PMID: 31199787 PMCID: PMC6594643 DOI: 10.1371/journal.pcbi.1007112] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Revised: 06/26/2019] [Accepted: 05/17/2019] [Indexed: 11/19/2022] Open
Abstract
Differentiation between phenotypically neutral and disease-causing genetic variation remains an open and relevant problem. Among different types of variation, non-frameshifting insertions and deletions (indels) represent an understudied group with widespread phenotypic consequences. To address this challenge, we present a machine learning method, MutPred-Indel, that predicts pathogenicity and identifies types of functional residues impacted by non-frameshifting insertion/deletion variation. The model shows good predictive performance as well as the ability to identify impacted structural and functional residues including secondary structure, intrinsic disorder, metal and macromolecular binding, post-translational modifications, allosteric sites, and catalytic residues. We identify structural and functional mechanisms impacted preferentially by germline variation from the Human Gene Mutation Database, recurrent somatic variation from COSMIC in the context of different cancers, as well as de novo variants from families with autism spectrum disorder. Further, the distributions of pathogenicity prediction scores generated by MutPred-Indel are shown to differentiate highly recurrent from non-recurrent somatic variation. Collectively, we present a framework to facilitate the interrogation of both pathogenicity and the functional effects of non-frameshifting insertion/deletion variants. The MutPred-Indel webserver is available at http://mutpred.mutdb.org/.
Collapse
Affiliation(s)
- Kymberleigh A. Pagel
- School of Informatics, Computing, and Engineering, Indiana University, Bloomington, Indiana, United States of America
| | - Danny Antaki
- Department of Psychiatry, University of California San Diego, La Jolla, California, United States of America
| | - AoJie Lian
- Department of Psychiatry, University of California San Diego, La Jolla, California, United States of America
- Center for Medical Genetics, School of Life Sciences, Central South University, Changsha, China
| | - Matthew Mort
- Institute of Medical Genetics, Cardiff University, Cardiff, United Kingdom
| | - David N. Cooper
- Institute of Medical Genetics, Cardiff University, Cardiff, United Kingdom
| | - Jonathan Sebat
- Department of Psychiatry, University of California San Diego, La Jolla, California, United States of America
| | - Lilia M. Iakoucheva
- Department of Psychiatry, University of California San Diego, La Jolla, California, United States of America
| | - Sean D. Mooney
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, Washington, United States of America
| | - Predrag Radivojac
- School of Informatics, Computing, and Engineering, Indiana University, Bloomington, Indiana, United States of America
- Khoury College of Computer Sciences, Northeastern University, Boston, Massachusetts, United States of America
| |
Collapse
|
14
|
Computational Approaches to Prioritize Cancer Driver Missense Mutations. Int J Mol Sci 2018; 19:ijms19072113. [PMID: 30037003 PMCID: PMC6073793 DOI: 10.3390/ijms19072113] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Revised: 07/02/2018] [Accepted: 07/05/2018] [Indexed: 12/31/2022] Open
Abstract
Cancer is a complex disease that is driven by genetic alterations. There has been a rapid development of genome-wide techniques during the last decade along with a significant lowering of the cost of gene sequencing, which has generated widely available cancer genomic data. However, the interpretation of genomic data and the prediction of the association of genetic variations with cancer and disease phenotypes still requires significant improvement. Missense mutations, which can render proteins non-functional and provide a selective growth advantage to cancer cells, are frequently detected in cancer. Effects caused by missense mutations can be pinpointed by in silico modeling, which makes it more feasible to find a treatment and reverse the effect. Specific human phenotypes are largely determined by stability, activity, and interactions between proteins and other biomolecules that work together to execute specific cellular functions. Therefore, analysis of missense mutations’ effects on proteins and their complexes would provide important clues for identifying functionally important missense mutations, understanding the molecular mechanisms of cancer progression and facilitating treatment and prevention. Herein, we summarize the major computational approaches and tools that provide not only the classification of missense mutations as cancer drivers or passengers but also the molecular mechanisms induced by driver mutations. This review focuses on the discussion of annotation and prediction methods based on structural and biophysical data, analysis of somatic cancer missense mutations in 3D structures of proteins and their complexes, predictions of the effects of missense mutations on protein stability, protein-protein and protein-nucleic acid interactions, and assessment of conformational changes in protein conformations induced by mutations.
Collapse
|
15
|
Exploring Protein-Protein Interactions as Drug Targets for Anti-cancer Therapy with In Silico Workflows. Methods Mol Biol 2017; 1647:221-236. [PMID: 28809006 DOI: 10.1007/978-1-4939-7201-2_15] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
We describe a computational protocol to aid the design of small molecule and peptide drugs that target protein-protein interactions, particularly for anti-cancer therapy. To achieve this goal, we explore multiple strategies, including finding binding hot spots, incorporating chemical similarity and bioactivity data, and sampling similar binding sites from homologous protein complexes. We demonstrate how to combine existing interdisciplinary resources with examples of semi-automated workflows. Finally, we discuss several major problems, including the occurrence of drug-resistant mutations, drug promiscuity, and the design of dual-effect inhibitors.
Collapse
|