1
|
Duraisamy AJ, Liu R, Sureshkumar S, Rose R, Jagannathan L, da Silva C, Coovadia A, Ramachander V, Chandrasekar S, Raja I, Sajnani M, Selvaraj SM, Narang B, Darvishi K, Bhayal AC, Katikala L, Guo F, Chen-Deutsch X, Balciuniene J, Ma Z, Nallamilli BRR, Bean L, Collins C, Hegde M. Focused Exome Sequencing Gives a High Diagnostic Yield in the Indian Subcontinent. J Mol Diagn 2024; 26:510-519. [PMID: 38582400 DOI: 10.1016/j.jmoldx.2024.03.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 09/11/2023] [Accepted: 03/01/2024] [Indexed: 04/08/2024] Open
Abstract
The genetically isolated yet heterogeneous and highly consanguineous Indian population has shown a higher prevalence of rare genetic disorders. However, there is a significant socioeconomic burden for genetic testing to be accessible to the general population. In the current study, we analyzed next-generation sequencing data generated through focused exome sequencing from individuals with different phenotypic manifestations referred for genetic testing to achieve a molecular diagnosis. Pathogenic or likely pathogenic variants are reported in 280 of 833 cases with a diagnostic yield of 33.6%. Homozygous sequence and copy number variants were found as positive diagnostic findings in 131 cases (15.7%) because of the high consanguinity in the Indian population. No relevant findings related to reported phenotype were identified in 6.2% of the cases. Patients referred for testing due to metabolic disorder and neuromuscular disorder had higher diagnostic yields. Carrier testing of asymptomatic individuals with a family history of the disease, through focused exome sequencing, achieved positive diagnosis in 54 of 118 cases tested. Copy number variants were also found in trans with single-nucleotide variants and mitochondrial variants in a few of the cases. The diagnostic yield and the findings from this study signify that a focused exome test is a good lower-cost alternative for whole-exome and whole-genome sequencing and as a first-tier approach to genetic testing.
Collapse
Affiliation(s)
| | - Ruby Liu
- Revvity Omics, Pittsburgh, Pennsylvania
| | | | - Rajiv Rose
- PerkinElmer Genomics, Revvity Omics, Chennai, India
| | | | | | | | | | | | - Indu Raja
- PerkinElmer Genomics, Revvity Omics, Chennai, India
| | | | | | | | | | | | | | - Fen Guo
- Revvity Omics, Pittsburgh, Pennsylvania
| | | | | | | | | | - Lora Bean
- Revvity Omics, Pittsburgh, Pennsylvania
| | | | | |
Collapse
|
2
|
Nussinov R, Liu Y, Zhang W, Jang H. Protein conformational ensembles in function: roles and mechanisms. RSC Chem Biol 2023; 4:850-864. [PMID: 37920394 PMCID: PMC10619138 DOI: 10.1039/d3cb00114h] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 09/02/2023] [Indexed: 11/04/2023] Open
Abstract
The sequence-structure-function paradigm has dominated twentieth century molecular biology. The paradigm tacitly stipulated that for each sequence there exists a single, well-organized protein structure. Yet, to sustain cell life, function requires (i) that there be more than a single structure, (ii) that there be switching between the structures, and (iii) that the structures be incompletely organized. These fundamental tenets called for an updated sequence-conformational ensemble-function paradigm. The powerful energy landscape idea, which is the foundation of modernized molecular biology, imported the conformational ensemble framework from physics and chemistry. This framework embraces the recognition that proteins are dynamic and are always interconverting between conformational states with varying energies. The more stable the conformation the more populated it is. The changes in the populations of the states are required for cell life. As an example, in vivo, under physiological conditions, wild type kinases commonly populate their more stable "closed", inactive, conformations. However, there are minor populations of the "open", ligand-free states. Upon their stabilization, e.g., by high affinity interactions or mutations, their ensembles shift to occupy the active states. Here we discuss the role of conformational propensities in function. We provide multiple examples of diverse systems, including protein kinases, lipid kinases, and Ras GTPases, discuss diverse conformational mechanisms, and provide a broad outlook on protein ensembles in the cell. We propose that the number of molecules in the active state (inactive for repressors), determine protein function, and that the dynamic, relative conformational propensities, rather than the rigid structures, are the hallmark of cell life.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research Frederick MD 21702 USA
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University Tel Aviv 69978 Israel
- Cancer Innovation Laboratory, National Cancer Institute Frederick MD 21702 USA
| | - Yonglan Liu
- Cancer Innovation Laboratory, National Cancer Institute Frederick MD 21702 USA
| | - Wengang Zhang
- Cancer Innovation Laboratory, National Cancer Institute Frederick MD 21702 USA
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research Frederick MD 21702 USA
- Cancer Innovation Laboratory, National Cancer Institute Frederick MD 21702 USA
| |
Collapse
|
3
|
Fumagalli SE, Padhiar NH, Meyer D, Katneni U, Bar H, DiCuccio M, Komar AA, Kimchi-Sarfaty C. Analysis of 3.5 million SARS-CoV-2 sequences reveals unique mutational trends with consistent nucleotide and codon frequencies. Virol J 2023; 20:31. [PMID: 36812119 PMCID: PMC9936480 DOI: 10.1186/s12985-023-01982-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Accepted: 02/02/2023] [Indexed: 02/19/2023] Open
Abstract
BACKGROUND Since the onset of the SARS-CoV-2 pandemic, bioinformatic analyses have been performed to understand the nucleotide and synonymous codon usage features and mutational patterns of the virus. However, comparatively few have attempted to perform such analyses on a considerably large cohort of viral genomes while organizing the plethora of available sequence data for a month-by-month analysis to observe changes over time. Here, we aimed to perform sequence composition and mutation analysis of SARS-CoV-2, separating sequences by gene, clade, and timepoints, and contrast the mutational profile of SARS-CoV-2 to other comparable RNA viruses. METHODS Using a cleaned, filtered, and pre-aligned dataset of over 3.5 million sequences downloaded from the GISAID database, we computed nucleotide and codon usage statistics, including calculation of relative synonymous codon usage values. We then calculated codon adaptation index (CAI) changes and a nonsynonymous/synonymous mutation ratio (dN/dS) over time for our dataset. Finally, we compiled information on the types of mutations occurring for SARS-CoV-2 and other comparable RNA viruses, and generated heatmaps showing codon and nucleotide composition at high entropy positions along the Spike sequence. RESULTS We show that nucleotide and codon usage metrics remain relatively consistent over the 32-month span, though there are significant differences between clades within each gene at various timepoints. CAI and dN/dS values vary substantially between different timepoints and different genes, with Spike gene on average showing both the highest CAI and dN/dS values. Mutational analysis showed that SARS-CoV-2 Spike has a higher proportion of nonsynonymous mutations than analogous genes in other RNA viruses, with nonsynonymous mutations outnumbering synonymous ones by up to 20:1. However, at several specific positions, synonymous mutations were overwhelmingly predominant. CONCLUSIONS Our multifaceted analysis covering both the composition and mutation signature of SARS-CoV-2 gives valuable insight into the nucleotide frequency and codon usage heterogeneity of SARS-CoV-2 over time, and its unique mutational profile compared to other RNA viruses.
Collapse
Affiliation(s)
- Sarah E Fumagalli
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
| | - Nigam H Padhiar
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
| | - Douglas Meyer
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
| | - Upendra Katneni
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
| | - Haim Bar
- Department of Statistics, University of Connecticut, Storrs, CT, USA
| | | | - Anton A Komar
- Department of Biological, Geological and Environmental Sciences, Center for Gene Regulation in Health and Disease, Cleveland State University, Cleveland, OH, USA
| | - Chava Kimchi-Sarfaty
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA.
| |
Collapse
|
4
|
Implementing computational methods in tandem with synonymous gene recoding for therapeutic development. Trends Pharmacol Sci 2023; 44:73-84. [PMID: 36307252 DOI: 10.1016/j.tips.2022.09.008] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 09/26/2022] [Accepted: 09/27/2022] [Indexed: 12/24/2022]
Abstract
Synonymous gene recoding, the substitution of synonymous variants into the genetic sequence, has been used to overcome many production limitations in therapeutic development. However, the safety and efficacy of recoded therapeutics can be difficult to evaluate because synonymous codon substitutions can result in subtle, yet impactful changes in protein features and require sensitive methods for detection. Given that computational approaches have made significant leaps in recent years, we propose that machine-learning (ML) tools may be leveraged to assess gene-recoded therapeutics and foresee an opportunity to adapt codon contexts to enhance some powerful existing tools. Here, we examine how synonymous gene recoding has been used to address challenges in therapeutic development, explain the biological mechanisms underlying its effects, and explore the application of computational platforms to improve the surveillance of functional variants in therapeutic design.
Collapse
|
5
|
Dhindsa RS, Wang Q, Vitsios D, Burren OS, Hu F, DiCarlo JE, Kruglyak L, MacArthur DG, Hurles ME, Petrovski S. A minimal role for synonymous variation in human disease. Am J Hum Genet 2022; 109:2105-2109. [PMID: 36459978 PMCID: PMC9808499 DOI: 10.1016/j.ajhg.2022.10.016] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Synonymous mutations change the DNA sequence of a gene without affecting the amino acid sequence of the encoded protein. Although some synonymous mutations can affect RNA splicing, translational efficiency, and mRNA stability, studies in human genetics, mutagenesis screens, and other experiments and evolutionary analyses have repeatedly shown that most synonymous variants are neutral or only weakly deleterious, with some notable exceptions. Based on a recent study in yeast, there have been claims that synonymous mutations could be as important as nonsynonymous mutations in causing disease, assuming the yeast findings hold up and translate to humans. Here, we argue that there is insufficient evidence to overturn the large, coherent body of knowledge establishing the predominant neutrality of synonymous variants in the human genome.
Collapse
Affiliation(s)
- Ryan S. Dhindsa
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA,Jan and Dan Duncan Neurological Research Institute at Texas Children’s Hospital, Houston, TX, USA,Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Waltham, MA, USA,Corresponding author
| | - Quanli Wang
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Waltham, MA, USA
| | - Dimitrios Vitsios
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK
| | - Oliver S. Burren
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK
| | - Fengyuan Hu
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK
| | - James E. DiCarlo
- Department of Pathology, Brigham and Women’s Hospital, Boston, MA, USA
| | - Leonid Kruglyak
- Department of Human Genetics and Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, USA,Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Daniel G. MacArthur
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA,Centre for Population Genomics, Garvan Institute of Medical Research, and UNSW Sydney, Sydney, NSW, Australia,Centre for Population Genomics, Murdoch Children’s Research Institute, Melbourne, VIC, Australia
| | | | - Slavé Petrovski
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK,Department of Medicine, University of Melbourne, Austin Health, Melbourne, VIC, Australia,Corresponding author
| |
Collapse
|