1
|
Dutheil JY, Hamidi D, Pajot B. The Site/Group Extended Data Format and Tools. Genome Biol Evol 2024; 16:evae011. [PMID: 38252924 PMCID: PMC10849175 DOI: 10.1093/gbe/evae011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 01/12/2024] [Indexed: 01/24/2024] Open
Abstract
Comparative sequence analysis permits unraveling the molecular processes underlying gene evolution. Many statistical methods generate candidate positions within genes, such as fast or slowly evolving sites, coevolving groups of residues, sites undergoing positive selection, or changes in evolutionary rates. Understanding the functional causes of these evolutionary patterns requires combining the results of these analyses and mapping them onto molecular structures, a complex task involving distinct coordinate referential systems. To ease this task, we introduce the site/group extended data format, a simple text format to store (groups of) site annotations. We developed a toolset, the SgedTools, which permits site/group extended data file manipulation, creating them from various software outputs and translating coordinates between individual sequences, alignments, and three-dimensional structures. The package also includes a Monte-Carlo procedure to generate random site samples, possibly conditioning on site-specific features. This eases the statistical testing of evolutionary hypotheses, accounting for the structural properties of the encoded molecules.
Collapse
Affiliation(s)
- Julien Y Dutheil
- Research Group “Molecular Systems Evolution,” Department of Theoretical Biology, Max Planck Institute for Evolutionary Biology, Plön 24306, Germany
| | - Diyar Hamidi
- Research Group “Molecular Systems Evolution,” Department of Theoretical Biology, Max Planck Institute for Evolutionary Biology, Plön 24306, Germany
| | - Basile Pajot
- Research Group “Molecular Systems Evolution,” Department of Theoretical Biology, Max Planck Institute for Evolutionary Biology, Plön 24306, Germany
| |
Collapse
|
2
|
Liénard MA, Valencia-Montoya WA, Pierce NE. Molecular advances to study the function, evolution and spectral tuning of arthropod visual opsins. Philos Trans R Soc Lond B Biol Sci 2022; 377:20210279. [PMID: 36058235 PMCID: PMC9450095 DOI: 10.1098/rstb.2021.0279] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Visual opsins of vertebrates and invertebrates diversified independently and converged to detect ultraviolet to long wavelengths (LW) of green or red light. In both groups, colour vision largely derives from opsin number, expression patterns and changes in amino acids interacting with the chromophore. Functional insights regarding invertebrate opsin evolution have lagged behind those for vertebrates because of the disparity in genomic resources and the lack of robust in vitro systems to characterize spectral sensitivities. Here, we review bioinformatic approaches to identify and model functional variation in opsins as well as recently developed assays to measure spectral phenotypes. In particular, we discuss how transgenic lines, cAMP-spectroscopy and sensitive heterologous expression platforms are starting to decouple genotype–phenotype relationships of LW opsins to complement the classical physiological-behavioural-phylogenetic toolbox of invertebrate visual sensory studies. We illustrate the use of one heterologous method by characterizing novel LW Gq opsins from 10 species, including diurnal and nocturnal Lepidoptera, a terrestrial dragonfly and an aquatic crustacean, expressing them in HEK293T cells, and showing that their maximum absorbance spectra (λmax) range from 518 to 611 nm. We discuss the advantages of molecular approaches for arthropods with complications such as restricted availability, lateral filters, specialized photochemistry and/or electrophysiological constraints. This article is part of the theme issue ‘Understanding colour vision: molecular, physiological, neuronal and behavioural studies in arthropods’.
Collapse
Affiliation(s)
- Marjorie A Liénard
- Department of Biology, Lund University, 22362 Lund, Sweden.,Department of Organismic and Evolutionary Biology and Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
| | - Wendy A Valencia-Montoya
- Department of Organismic and Evolutionary Biology and Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
| | - Naomi E Pierce
- Department of Organismic and Evolutionary Biology and Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
| |
Collapse
|
3
|
Gupta SK, Minocha R, Thapa PJ, Srivastava M, Dandekar T. Role of the Pangolin in Origin of SARS-CoV-2: An Evolutionary Perspective. Int J Mol Sci 2022; 23:9115. [PMID: 36012377 DOI: 10.3390/ijms23169115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 08/10/2022] [Accepted: 08/12/2022] [Indexed: 11/30/2022] Open
Abstract
After the recent emergence of SARS-CoV-2 infection, unanswered questions remain related to its evolutionary history, path of transmission or divergence and role of recombination. There is emerging evidence on amino acid substitutions occurring in key residues of the receptor-binding domain of the spike glycoprotein in coronavirus isolates from bat and pangolins. In this article, we summarize our current knowledge on the origin of SARS-CoV-2. We also analyze the host ACE2-interacting residues of the receptor-binding domain of spike glycoprotein in SARS-CoV-2 isolates from bats, and compare it to pangolin SARS-CoV-2 isolates collected from Guangdong province (GD Pangolin-CoV) and Guangxi autonomous regions (GX Pangolin-CoV) of South China. Based on our comparative analysis, we support the view that the Guangdong Pangolins are the intermediate hosts that adapted the SARS-CoV-2 and represented a significant evolutionary link in the path of transmission of SARS-CoV-2 virus. We also discuss the role of intermediate hosts in the origin of Omicron.
Collapse
|
4
|
Abstract
Comparative sequence analyses have been used to discover numerous classes of structured noncoding RNAs, some of which are riboswitches that specifically recognize small-molecule or elemental ion ligands and influence expression of adjacent downstream genes. Determining the correct identity of the ligand for a riboswitch candidate typically is aided by an understanding of the genes under its regulatory control. Riboswitches whose ligands were straightforward to identify have largely been associated with well-characterized metabolic pathways, such as coenzyme or amino acid biosynthesis. Riboswitch candidates whose ligands resist identification, collectively known as orphan riboswitches, are often associated with genes coding for proteins of unknown function, or genes for various proteins with no established link to one another. The cognate ligands for 16 former orphan riboswitch motifs have been identified to date. The successful pursuit of the ligands for these classes has provided insight into areas of biology that are not yet fully explored, such as ion homeostasis, signaling networks, and other previously underappreciated biochemical or physiological processes. Herein we discuss the strategies and methods used to match ligands with orphan riboswitch classes, and overview the lessons learned to inform and motivate ongoing efforts to identify ligands for the many remaining candidates.
Collapse
Affiliation(s)
- Madeline E Sherlock
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Ronald R Breaker
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520, USA
- Howard Hughes Medical Institute, Yale University, New Haven, Connecticut 06520, USA
| |
Collapse
|
5
|
Greenlee EB, Stav S, Atilho RM, Brewer KI, Harris KA, Malkowski SN, Mirihana Arachchilage G, Perkins KR, Sherlock ME, Breaker RR. Challenges of ligand identification for the second wave of orphan riboswitch candidates. RNA Biol 2018; 15:377-390. [PMID: 29135333 PMCID: PMC5927730 DOI: 10.1080/15476286.2017.1403002] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2017] [Revised: 10/25/2017] [Accepted: 10/30/2017] [Indexed: 12/18/2022] Open
Abstract
Orphan riboswitch candidates are noncoding RNA motifs whose representatives are believed to function as genetic regulatory elements, but whose target ligands have yet to be identified. The study of certain orphans, particularly classes that have resisted experimental validation for many years, has led to the discovery of important biological pathways and processes once their ligands were identified. Previously, we highlighted details for four of the most common and intriguing orphan riboswitch candidates. This facilitated the validation of riboswitches for the signaling molecules c-di-AMP, ZTP, and ppGpp, the metal ion Mn2+, and the metabolites guanidine and PRPP. Such studies also yield useful linkages between the ligands sensed by the riboswitches and numerous biochemical pathways. In the current report, we describe the known characteristics of 30 distinct classes of orphan riboswitch candidates - some of which have remained unsolved for over a decade. We also discuss the prospects for uncovering novel biological insights via focused studies on these RNAs. Lastly, we make recommendations for experimental objectives along the path to finding ligands for these mysterious RNAs.
Collapse
Affiliation(s)
- Etienne B. Greenlee
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT, USA
| | - Shira Stav
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT, USA
| | - Ruben M. Atilho
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Kenneth I. Brewer
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Kimberly A. Harris
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT, USA
| | | | | | - Kevin R. Perkins
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT, USA
| | - Madeline E. Sherlock
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
| | - Ronald R. Breaker
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT, USA
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
- Howard Hughes Medical Institute, Yale University, New Haven, CT, USA
| |
Collapse
|
6
|
Sharma VK, Abbat S, Bharatam PV. Pharmacoinformatic Study on the Selective Inhibition of the Protozoan Dihydrofolate Reductase Enzymes. Mol Inform 2017; 36. [PMID: 28605138 DOI: 10.1002/minf.201600156] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2016] [Accepted: 05/16/2017] [Indexed: 12/27/2022]
Abstract
Dihydrofolate reductase (DHFR) is an essential enzyme of the folate metabolic pathway in protozoa and it is a validated, potential drug target in many infectious diseases. Information about unique conserved residues of the DHFR enzyme is required to understand residual selectivity of the protozoan DHFR enzyme. The three dimensional crystal structures are not available for all the protozoan DHFR enzymes. Enzyme-substrate/inhibitor interaction information is required for the binding mode characterization in protozoan DHFR for selective inhibitor design. In this work, multiple sequence analysis was carried out in all the studied species. Homology models were built for protozoan DHFR enzymes, for which 3D structures are not available in PDB. The molecular docking and Prime-MMGBSA calculations of the natural substrate (dihydrofolate, DHF) and classical DHFR inhibitor (methotrexate, MTX) were performed in protozoan DHFR enzymes. Comparative sequence analysis showed that an overall sequence identity between the studied species ranging from 22.94 % (CfDHFR-BgDHFR) to 94.61 % (LdDHFR-LmDHFR). Interestingly, it was observed that most of the active site residues were conserved in all the cases and all the enzymes exhibit similar key binding interactions with DHF and MTX in molecular docking analysis, but there are a few key binding residues which differ in protozoan species that makes it suitable for target selectivity. This information can be used to design selective and potent protozoan DHFR enzyme inhibitors.
Collapse
Affiliation(s)
- Vishnu K Sharma
- Department of Pharmacoinformatics, National Institute of Pharmaceutical Education and Research (NIPER), S.A.S Nagar-, 160 062, Punjab, India
| | - Sheenu Abbat
- Department of Pharmacoinformatics, National Institute of Pharmaceutical Education and Research (NIPER), S.A.S Nagar-, 160 062, Punjab, India
| | - P V Bharatam
- Department of Medicinal Chemistry, National Institute of Pharmaceutical Education and Research (NIPER), S.A.S Nagar-160 062, Punjab, India
| |
Collapse
|
7
|
Harris KA, Lünse CE, Li S, Brewer KI, Breaker RR. Biochemical analysis of pistol self-cleaving ribozymes. RNA 2015; 21:1852-8. [PMID: 26385507 PMCID: PMC4604425 DOI: 10.1261/rna.052514.115] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2015] [Accepted: 06/22/2015] [Indexed: 05/27/2023]
Abstract
Pistol RNAs are members of a distinct class of self-cleaving ribozymes that was recently discovered by using a bioinformatics search strategy. Several hundred pistol ribozymes share a consensus sequence including 10 highly conserved nucleotides and many other modestly conserved nucleotides associated with specific secondary structure features, including three base-paired stems and a pseudoknot. A representative pistol ribozyme from the bacterium Lysinibacillus sphaericus was found to promote RNA strand scission with a rate constant of ∼10 min(-1) under physiological Mg(2+) and pH conditions. The reaction proceeds via the nucleophilic attack of a 2'-oxygen atom on the adjacent phosphorus center, and thus adheres to the same general catalytic mechanism of internal phosphoester transfer as found with all other classes of natural self-cleaving ribozymes discovered to date. Analyses of the kinetic characteristics and the metal ion requirements of the cleavage reaction reveal that members of this ribozyme class likely use several catalytic strategies to promote the rapid cleavage of RNA.
Collapse
Affiliation(s)
- Kimberly A Harris
- Howard Hughes Medical Institute, Yale University, New Haven, Connecticut 06520-8103, USA Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8103, USA
| | - Christina E Lünse
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8103, USA
| | - Sanshu Li
- Howard Hughes Medical Institute, Yale University, New Haven, Connecticut 06520-8103, USA Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8103, USA
| | - Kenneth I Brewer
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520-8103, USA
| | - Ronald R Breaker
- Howard Hughes Medical Institute, Yale University, New Haven, Connecticut 06520-8103, USA Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8103, USA Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520-8103, USA
| |
Collapse
|
8
|
Abstract
Hatchet RNAs are members of a novel self-cleaving ribozyme class that was recently discovered by using a bioinformatics search strategy. The consensus sequence and secondary structure of this class includes 13 highly conserved and numerous other modestly conserved nucleotides interspersed among bulges linking four base-paired substructures. A representative hatchet ribozyme from a metagenomic source requires divalent ions such as Mg(2+) to promote RNA strand scission with a maximum rate constant of ∼4 min(-1). As with all other small self-cleaving ribozymes discovered to date, hatchet ribozymes employ a general mechanism for catalysis involving the nucleophilic attack of a ribose 2'-oxygen atom on an adjacent phosphorus center. Kinetic characteristics of the reaction demonstrate that members of this ribozyme class have an essential requirement for divalent metal ions and that they might have a complex active site that employs multiple catalytic strategies to accelerate RNA cleavage by internal phosphoester transfer.
Collapse
Affiliation(s)
- Sanshu Li
- Howard Hughes Medical Institute, Yale University, New Haven, Connecticut 06520-8103, USA Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8103, USA
| | - Christina E Lünse
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8103, USA
| | - Kimberly A Harris
- Howard Hughes Medical Institute, Yale University, New Haven, Connecticut 06520-8103, USA Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8103, USA
| | - Ronald R Breaker
- Howard Hughes Medical Institute, Yale University, New Haven, Connecticut 06520-8103, USA Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520-8103, USA Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520-8103, USA
| |
Collapse
|
9
|
Aflitos S, Schijlen E, de Jong H, de Ridder D, Smit S, Finkers R, Wang J, Zhang G, Li N, Mao L, Bakker F, Dirks R, Breit T, Gravendeel B, Huits H, Struss D, Swanson-Wagner R, van Leeuwen H, van Ham RCHJ, Fito L, Guignier L, Sevilla M, Ellul P, Ganko E, Kapur A, Reclus E, de Geus B, van de Geest H, Te Lintel Hekkert B, van Haarst J, Smits L, Koops A, Sanchez-Perez G, van Heusden AW, Visser R, Quan Z, Min J, Liao L, Wang X, Wang G, Yue Z, Yang X, Xu N, Schranz E, Smets E, Vos R, Rauwerda J, Ursem R, Schuit C, Kerns M, van den Berg J, Vriezen W, Janssen A, Datema E, Jahrman T, Moquet F, Bonnet J, Peters S. Exploring genetic variation in the tomato (Solanum section Lycopersicon) clade by whole-genome sequencing. Plant J 2014; 80:136-48. [PMID: 25039268 DOI: 10.1111/tpj.12616] [Citation(s) in RCA: 222] [Impact Index Per Article: 22.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/24/2014] [Revised: 06/23/2014] [Accepted: 07/01/2014] [Indexed: 05/20/2023]
Abstract
We explored genetic variation by sequencing a selection of 84 tomato accessions and related wild species representative of the Lycopersicon, Arcanum, Eriopersicon and Neolycopersicon groups, which has yielded a huge amount of precious data on sequence diversity in the tomato clade. Three new reference genomes were reconstructed to support our comparative genome analyses. Comparative sequence alignment revealed group-, species- and accession-specific polymorphisms, explaining characteristic fruit traits and growth habits in the various cultivars. Using gene models from the annotated Heinz 1706 reference genome, we observed differences in the ratio between non-synonymous and synonymous SNPs (dN/dS) in fruit diversification and plant growth genes compared to a random set of genes, indicating positive selection and differences in selection pressure between crop accessions and wild species. In wild species, the number of single-nucleotide polymorphisms (SNPs) exceeds 10 million, i.e. 20-fold higher than found in most of the crop accessions, indicating dramatic genetic erosion of crop and heirloom tomatoes. In addition, the highest levels of heterozygosity were found for allogamous self-incompatible wild species, while facultative and autogamous self-compatible species display a lower heterozygosity level. Using whole-genome SNP information for maximum-likelihood analysis, we achieved complete tree resolution, whereas maximum-likelihood trees based on SNPs from ten fruit and growth genes show incomplete resolution for the crop accessions, partly due to the effect of heterozygous SNPs. Finally, results suggest that phylogenetic relationships are correlated with habitat, indicating the occurrence of geographical races within these groups, which is of practical importance for Solanum genome evolution studies.
Collapse
|
10
|
Abstract
The signal recognition particle (SRP) is a cytosolic ribonucleoprotein complex that guides secretory proteins to biological membranes in all organisms. The SRP RNA is at the center of the structure and function of the SRP. The comparison of the growing number of SRP RNA sequences provides a rich source for gaining valuable insight into the composition, assembly, and phylogeny of the SRP. In order to assist in the continuation of these studies, we propose an SRP RNA nomenclature applicable to the three divisions of life.
Collapse
Affiliation(s)
- Christian Zwieb
- Department of Molecular Biology, The University of Texas Health Science Center at Tyler, 11937 US Highway 271, Tyler, TX 75708-3154, USA.
| | | | | | | | | |
Collapse
|
11
|
Abstract
The 3'-portion of the genome of a U.S. isolate of the porcine reproductive and respiratory syndrome (PRRS) virus, ATCCVR-2332, was cloned and sequenced. The resultant 3358 nucleotides contain 6 open reading frames (ORFs) with homologies to ORFs 2 through 7 of the European strain of the PRRS virus and other members of the free-standing genus of arteriviruses. Both VR-2332 and the European isolate (called the Lelystad virus) have been identified as infectious agents responsible for the swine disease called PRRS. Comparative sequence analysis indicates that there are degrees of amino acid identity to the Lelystad virus open reading frames ranging from 55% in ORF 5 to 79% in ORF 6. Hydropathy profiles indicate that the ORFs of VR-2332 and Lelystad virus correspond structurally despite significant sequence differences. These results are consistent with the biological similarities but distinct serological properties of North American and European isolates of the virus.
Collapse
Affiliation(s)
- M P Murtaugh
- Department of Veterinary PathoBiology, University of Minnesota, St. Paul, USA
| | | | | |
Collapse
|
12
|
Abstract
Complementary oligonucleotide primers which flank a 1146-nucleotide gene fragment (S1B: nt 1185 to 2333) encompassing a polymorphic region (nt 1368 to 1776) of the S1 subunit of bovine coronavirus spike glycoprotein were used for enzymatic amplification by PCR. We chose four clinical isolates, recovered from cases of epidemic diarrhea in neonatal calves in Québec dairy herds between 1987-1990, to specifically amplify and analyze their sequences in the selected genomic area. Nucleotide sequence analysis of the four clinical isolates indicated that their S1B gene fragments were highly conserved. We also compared the S1B gene sequences of the Québec BCV isolates to the published corresponding sequences from BCV-L9 [37], BCV-MEB [1], and BCV-F15 [3] reference strains. A high degree of similarity was demonstrated for all viruses, no deletions or insertions were observed, and the only variations that were identified consisted of nucleotide substitutions. The differing nucleotides and amino acids (aa) were not distributed randomly over the entire sequence but rather were clustered in the polymorphic region. Of these, four sporadic aa changes were located in antigenic domain II (aa residues 517 to 720) of S1. This correlates with varied antigenicity observed among the BCV Québec isolates when reacting with MAbs directed against the S glycoprotein of the Mebus strain. The other mutations seem to be fixed in all Québec isolates.
Collapse
Affiliation(s)
- M R Rekik
- Centre de recherche en virologie, Institut Armand-Frappier, Université du Québec Laval, Canada
| | | |
Collapse
|