Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Bornberg-Bauer E, Albà MM. Dynamics and adaptive benefits of modular protein evolution. Curr Opin Struct Biol 2013;23:459-66. [PMID: 23562500 DOI: 10.1016/j.sbi.2013.02.012] [Citation(s) in RCA: 80] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2013] [Revised: 02/15/2013] [Accepted: 02/15/2013] [Indexed: 11/29/2022]

For:	Bornberg-Bauer E, Albà MM. Dynamics and adaptive benefits of modular protein evolution. Curr Opin Struct Biol 2013;23:459-66. [PMID: 23562500 DOI: 10.1016/j.sbi.2013.02.012] [Citation(s) in RCA: 80] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2013] [Revised: 02/15/2013] [Accepted: 02/15/2013] [Indexed: 11/29/2022]

Number

Cited by Other Article(s)

Nallathambi P, Umamaheswari C, Reddy B, Aarthy B, Javed M, Ravikumar P, Watpade S, Kashyap PL, Boopalakrishnan G, Kumar S, Sharma A, Kumar A. Deciphering the Genomic Landscape and Virulence Mechanisms of the Wheat Powdery Mildew Pathogen Blumeria graminis f. sp. tritici Wtn1: Insights from Integrated Genome Assembly and Conidial Transcriptomics. J Fungi (Basel) 2024;10:267. [PMID: 38667938 PMCID: PMC11051031 DOI: 10.3390/jof10040267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 03/16/2024] [Accepted: 03/19/2024] [Indexed: 04/28/2024] Open

Abstract

A high-quality genome sequence from an Indian isolate of Blumeria graminis f. sp. tritici Wtn1, a persistent threat in wheat farming, was obtained using a hybrid method. The assembly of over 9.24 million DNA-sequence reads resulted in 93 contigs, totaling a 140.61 Mb genome size, potentially encoding 8480 genes. Notably, more than 73.80% of the genome, spanning approximately 102.14 Mb, comprises retro-elements, LTR elements, and P elements, influencing evolution and adaptation significantly. The phylogenomic analysis placed B. graminis f. sp. tritici Wtn1 in a distinct monocot-infecting clade. A total of 583 tRNA anticodon sequences were identified from the whole genome of the native virulent strain B. graminis f. sp. tritici, which comprises distinct genome features with high counts of tRNA anticodons for leucine (70), cysteine (61), alanine (58), and arginine (45), with only two stop codons (Opal and Ochre) present and the absence of the Amber stop codon. Comparative InterProScan analysis unveiled "shared and unique" proteins in B. graminis f. sp. tritici Wtn1. Identified were 7707 protein-encoding genes, annotated to different categories such as 805 effectors, 156 CAZymes, 6102 orthologous proteins, and 3180 distinct protein families (PFAMs). Among the effectors, genes like Avra10, Avrk1, Bcg-7, BEC1005, CSEP0105, CSEP0162, BEC1016, BEC1040, and HopI1 closely linked to pathogenesis and virulence were recognized. Transcriptome analysis highlighted abundant proteins associated with RNA processing and modification, post-translational modification, protein turnover, chaperones, and signal transduction. Examining the Environmental Information Processing Pathways in B. graminis f. sp. tritici Wtn1 revealed 393 genes across 33 signal transduction pathways. The key pathways included yeast MAPK signaling (53 genes), mTOR signaling (38 genes), PI3K-Akt signaling (23 genes), and AMPK signaling (21 genes). Additionally, pathways like FoxO, Phosphatidylinositol, the two-component system, and Ras signaling showed significant gene representation, each with 15-16 genes, key SNPs, and Indels in specific chromosomes highlighting their relevance to environmental responses and pathotype evolution. The SNP and InDel analysis resulted in about 3.56 million variants, including 3.45 million SNPs, 5050 insertions, and 5651 deletions within the whole genome of B. graminis f. sp. tritici Wtn1. These comprehensive genome and transcriptome datasets serve as crucial resources for understanding the pathogenicity, virulence effectors, retro-elements, and evolutionary origins of B. graminis f. sp. tritici Wtn1, aiding in developing robust strategies for the effective management of wheat powdery mildew.

Collapse

Spirov AV, Myasnikova EM. Problem of Domain/Building Block Preservation in the Evolution of Biological Macromolecules and Evolutionary Computation. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023;20:1345-1362. [PMID: 35594219 DOI: 10.1109/tcbb.2022.3175908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Zhou H, Hwarari D, Ma H, Xu H, Yang L, Luo Y. Genomic survey of TCP transcription factors in plants: Phylogenomics, evolution and their biology. Front Genet 2022;13:1060546. [PMID: 36437962 PMCID: PMC9682074 DOI: 10.3389/fgene.2022.1060546] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 10/27/2022] [Indexed: 09/29/2023] Open

Eicholt LA, Aubel M, Berk K, Bornberg‐Bauer E, Lange A. Heterologous expression of naturally evolved putative de novo proteins with chaperones. Protein Sci 2022;31:e4371. [PMID: 35900020 PMCID: PMC9278007 DOI: 10.1002/pro.4371] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Revised: 05/03/2022] [Accepted: 05/14/2022] [Indexed: 11/23/2022]

Hatano T, Palani S, Papatziamou D, Salzer R, Souza DP, Tamarit D, Makwana M, Potter A, Haig A, Xu W, Townsend D, Rochester D, Bellini D, Hussain HMA, Ettema TJG, Löwe J, Baum B, Robinson NP, Balasubramanian M. Asgard archaea shed light on the evolutionary origins of the eukaryotic ubiquitin-ESCRT machinery. Nat Commun 2022;13:3398. [PMID: 35697693 PMCID: PMC9192718 DOI: 10.1038/s41467-022-30656-2] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2021] [Accepted: 05/10/2022] [Indexed: 11/23/2022] Open

Affiliation(s)

Tomoyuki Hatano Centre for Mechanochemical Cell Biology, Division of Biomedical Sciences, Warwick Medical School, University of Warwick, Coventry, CV4 7AL, UK
Saravanan Palani Centre for Mechanochemical Cell Biology, Division of Biomedical Sciences, Warwick Medical School, University of Warwick, Coventry, CV4 7AL, UK Department of Biochemistry, Indian Institute of Science, Bangalore, India
Dimitra Papatziamou Division of Biomedical and Life Sciences, Faculty of Health and Medicine, Lancaster University, Lancaster, LA1 4YG, UK
Ralf Salzer MRC Laboratory of Molecular Biology, Cambridge, CB2 0QH, UK
Diorge P Souza MRC Laboratory of Molecular Biology, Cambridge, CB2 0QH, UK
Daniel Tamarit Laboratory of Microbiology, Wageningen University, 6708 WE, Wageningen, The Netherlands Department of Aquatic Sciences and Assessment, Swedish University of Agricultural Sciences, SE-75007, Uppsala, Sweden
Mehul Makwana Division of Biomedical and Life Sciences, Faculty of Health and Medicine, Lancaster University, Lancaster, LA1 4YG, UK
Antonia Potter Division of Biomedical and Life Sciences, Faculty of Health and Medicine, Lancaster University, Lancaster, LA1 4YG, UK
Alexandra Haig Division of Biomedical and Life Sciences, Faculty of Health and Medicine, Lancaster University, Lancaster, LA1 4YG, UK
Wenjue Xu Division of Biomedical and Life Sciences, Faculty of Health and Medicine, Lancaster University, Lancaster, LA1 4YG, UK
David Townsend Department of Chemistry, Lancaster University, Lancaster, LA1 4YB, UK
David Rochester Department of Chemistry, Lancaster University, Lancaster, LA1 4YB, UK
Dom Bellini MRC Laboratory of Molecular Biology, Cambridge, CB2 0QH, UK
Hamdi M A Hussain Centre for Mechanochemical Cell Biology, Division of Biomedical Sciences, Warwick Medical School, University of Warwick, Coventry, CV4 7AL, UK
Thijs J G Ettema Laboratory of Microbiology, Wageningen University, 6708 WE, Wageningen, The Netherlands
Jan Löwe MRC Laboratory of Molecular Biology, Cambridge, CB2 0QH, UK
Buzz Baum MRC Laboratory of Molecular Biology, Cambridge, CB2 0QH, UK.
Nicholas P Robinson Division of Biomedical and Life Sciences, Faculty of Health and Medicine, Lancaster University, Lancaster, LA1 4YG, UK.
Mohan Balasubramanian Centre for Mechanochemical Cell Biology, Division of Biomedical Sciences, Warwick Medical School, University of Warwick, Coventry, CV4 7AL, UK.

Collapse

Martyn JE, Gomez-Valero L, Buchrieser C. The evolution and role of eukaryotic-like domains in environmental intracellular bacteria: the battle with a eukaryotic cell. FEMS Microbiol Rev 2022;46:6529235. [DOI: 10.1093/femsre/fuac012] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2021] [Revised: 02/09/2022] [Accepted: 02/14/2022] [Indexed: 11/14/2022] Open

New Genomic Signals Underlying the Emergence of Human Proto-Genes. Genes (Basel) 2022;13:genes13020284. [PMID: 35205330 PMCID: PMC8871994 DOI: 10.3390/genes13020284] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Revised: 01/20/2022] [Accepted: 01/24/2022] [Indexed: 12/04/2022] Open

Coyote-Maestas W, Nedrud D, Suma A, He Y, Matreyek KA, Fowler DM, Carnevale V, Myers CL, Schmidt D. Probing ion channel functional architecture and domain recombination compatibility by massively parallel domain insertion profiling. Nat Commun 2021;12:7114. [PMID: 34880224 PMCID: PMC8654947 DOI: 10.1038/s41467-021-27342-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 11/16/2021] [Indexed: 11/10/2022] Open

Papadopoulos C, Callebaut I, Gelly JC, Hatin I, Namy O, Renard M, Lespinet O, Lopes A. Intergenic ORFs as elementary structural modules of de novo gene birth and protein evolution. Genome Res 2021;31:2303-2315. [PMID: 34810219 PMCID: PMC8647833 DOI: 10.1101/gr.275638.121] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 09/23/2021] [Indexed: 01/08/2023]

Reddy B, Kumar A, Mehta S, Sheoran N, Chinnusamy V, Prakash G. Hybrid de novo genome-reassembly reveals new insights on pathways and pathogenicity determinants in rice blast pathogen Magnaporthe oryzae RMg_Dl. Sci Rep 2021;11:22922. [PMID: 34824307 PMCID: PMC8616942 DOI: 10.1038/s41598-021-01980-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Accepted: 11/01/2021] [Indexed: 01/20/2023] Open

Abstract

Blast disease incited by Magnaporthe oryzae is a major threat to sustain rice production in all rice growing nations. The pathogen is widely distributed in all rice paddies and displays rapid aerial transmissions, and seed-borne latent infection. In order to understand the genetic variability, host specificity, and molecular basis of the pathogenicity-associated traits, the whole genome of rice infecting Magnaporthe oryzae (Strain RMg_Dl) was sequenced using the Illumina and PacBio (RSII compatible) platforms. The high-throughput hybrid assembly of short and long reads resulted in a total of 375 scaffolds with a genome size of 42.43 Mb. Furthermore, comparative genome analysis revealed 99% average nucleotide identity (ANI) with other oryzae genomes and 83% against M. grisea, and 73% against M. poe genomes. The gene calling identified 10,553 genes with 10,539 protein-coding sequences. Among the detected transposable elements, the LTR/Gypsy and Type LINE showed high occurrence. The InterProScan of predicted protein sequences revealed that 97% protein family (PFAM), 98% superfamily, and 95% CDD were shared among RMg_Dl and reference 70-15 genome, respectively. Additionally, 550 CAZymes with high GH family content/distribution and cell wall degrading enzymes (CWDE) such endoglucanase, beta-glucosidase, and pectate lyase were also deciphered in RMg_Dl. The prevalence of virulence factors determination revealed that 51 different VFs were found in the genome. The biochemical pathway such as starch and sucrose metabolism, mTOR signaling, cAMP signaling, MAPK signaling pathways related genes were identified in the genome. The 49,065 SNPs, 3267 insertions and 3611 deletions were detected, and majority of these varinats were located on downstream and upstream region. Taken together, the generated information will be useful to develop a specific marker for diagnosis, pathogen surveillance and tracking, molecular taxonomy, and species delineation which ultimately leads to device improved management strategies for blast disease.

Collapse

Lindenburg LH, Pantelejevs T, Gielen F, Zuazua-Villar P, Butz M, Rees E, Kaminski CF, Downs JA, Hyvönen M, Hollfelder F. Improved RAD51 binders through motif shuffling based on the modularity of BRC repeats. Proc Natl Acad Sci U S A 2021;118:e2017708118. [PMID: 34772801 PMCID: PMC8727024 DOI: 10.1073/pnas.2017708118] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/10/2021] [Indexed: 01/20/2023] Open

Gilchrist CLM, Chooi YH. Synthaser: a CD-Search enabled Python toolkit for analysing domain architecture of fungal secondary metabolite megasynth(et)ases. Fungal Biol Biotechnol 2021;8:13. [PMID: 34763725 PMCID: PMC8582187 DOI: 10.1186/s40694-021-00120-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 10/29/2021] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Fungi are prolific producers of secondary metabolites (SMs), which are bioactive small molecules with important applications in medicine, agriculture and other industries. The backbones of a large proportion of fungal SMs are generated through the action of large, multi-domain megasynth(et)ases such as polyketide synthases (PKSs) and nonribosomal peptide synthetases (NRPSs). The structure of these backbones is determined by the domain architecture of the corresponding megasynth(et)ase, and thus accurate annotation and classification of these architectures is an important step in linking SMs to their biosynthetic origins in the genome.

RESULTS

Here we report synthaser, a Python package leveraging the NCBI's conserved domain search tool for remote prediction and classification of fungal megasynth(et)ase domain architectures. Synthaser is capable of batch sequence analysis, and produces rich textual output and interactive visualisations which allow for quick assessment of the megasynth(et)ase diversity of a fungal genome. Synthaser uses a hierarchical rule-based classification system, which can be extensively customised by the user through a web application ( http://gamcil.github.io/synthaser ). We show that synthaser provides more accurate domain architecture predictions than comparable tools which rely on curated profile hidden Markov model (pHMM)-based approaches; the utilisation of the NCBI conserved domain database also allows for significantly greater flexibility compared to pHMM approaches. In addition, we demonstrate how synthaser can be applied to large scale genome mining pipelines through the construction of an Aspergillus PKS similarity network.

CONCLUSIONS

Synthaser is an easy to use tool that represents a significant upgrade to previous domain architecture analysis tools. It is freely available under a MIT license from PyPI ( https://pypi.org/project/synthaser ) and GitHub ( https://github.com/gamcil/synthaser ).

Collapse

Gomes T, Martin-Malpartida P, Ruiz L, Aragón E, Cordeiro TN, Macias MJ. Conformational landscape of multidomain SMAD proteins. Comput Struct Biotechnol J 2021;19:5210-5224. [PMID: 34630939 PMCID: PMC8479633 DOI: 10.1016/j.csbj.2021.09.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Revised: 09/08/2021] [Accepted: 09/09/2021] [Indexed: 12/21/2022] Open

Carmi G, Gorohovski A, Frenkel-Morgenstern M. EvoProDom: Evolutionary modeling of protein families by assessing translocations of protein domains. FEBS Open Bio 2021;11:2507-2524. [PMID: 34196123 PMCID: PMC8409312 DOI: 10.1002/2211-5463.13245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 06/22/2021] [Accepted: 06/30/2021] [Indexed: 11/29/2022] Open

Dieci G. Removing quote marks from the RNA polymerase II CTD 'code'. Biosystems 2021;207:104468. [PMID: 34216714 DOI: 10.1016/j.biosystems.2021.104468] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Revised: 06/24/2021] [Accepted: 06/27/2021] [Indexed: 11/27/2022]

Abstract

In eukaryotes, RNA polymerase II (Pol II) is responsible for the synthesis of all mRNAs and myriads of short and long untranslated RNAs, whose fabrication involves close spatiotemporal coordination between transcription, RNA processing and chromatin modification. Crucial for such a coordination is an unusual C-terminal domain (CTD) of the Pol II largest subunit, made of tandem repetitions (26 in yeast, 52 in chordates) of the heptapeptide with the consensus sequence YSPTSPS. Although largely unstructured and with poor sequence content, the Pol II CTD derives its extraordinary functional versatility from the fact that each amino acid in the heptapeptide can be posttranslationally modified, and that different combinations of CTD covalent marks are specifically recognized by different protein binding partners. These features have led to propose the existence of a Pol II CTD code, but this expression is generally used by authors with some caution, revealed by the frequent use of quote marks for the word 'code'. Based on the theoretical framework of code biology, it is argued here that the Pol II CTD modification system meets the requirements of a true organic code, where different CTD modification states represent organic signs whose organic meanings are biological reactions contributing to the many facets of RNA biogenesis in coordination with RNA synthesis by Pol II. Importantly, the Pol II CTD code is instantiated by adaptor proteins possessing at least two distinct domains, one of which devoted to specific recognition of CTD modification profiles. Furthermore, code rules can be altered by experimental interchange of CTD recognition domains of different adaptor proteins, a fact arguing in favor of the arbitrariness, and thus bona fide character, of the Pol II CTD code. Since the growing family of CTD adaptors includes RNA binding proteins and histone modification complexes, the Pol II CTD code is by its nature integrated with other organic codes, in particular the splicing code and the histone code. These issues will be discussed taking into account fascinating developments in Pol II CTD research, like the discovery of novel modifications at non-consensus sites, the recently recognized CTD physicochemical properties favoring liquid-liquid phase separation, and the discovery that the Pol II CTD, originated before the divergence of most extant eukaryotic taxa, has expanded and diversified with developmental complexity in animals and plants.

Collapse

Lange A, Patel PH, Heames B, Damry AM, Saenger T, Jackson CJ, Findlay GD, Bornberg-Bauer E. Structural and functional characterization of a putative de novo gene in Drosophila. Nat Commun 2021;12:1667. [PMID: 33712569 PMCID: PMC7954818 DOI: 10.1038/s41467-021-21667-6] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Accepted: 02/03/2021] [Indexed: 11/26/2022] Open

Han X, Guo J, Pang E, Song H, Lin K. Ab Initio Construction and Evolutionary Analysis of Protein-Coding Gene Families with Partially Homologous Relationships: Closely Related Drosophila Genomes as a Case Study. Genome Biol Evol 2021;12:185-202. [PMID: 32108239 PMCID: PMC7144356 DOI: 10.1093/gbe/evaa041] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/18/2020] [Indexed: 01/05/2023] Open

James JE, Willis SM, Nelson PG, Weibel C, Kosinski LJ, Masel J. Universal and taxon-specific trends in protein sequences as a function of age. eLife 2021;10:e57347. [PMID: 33416492 PMCID: PMC7819706 DOI: 10.7554/elife.57347] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2020] [Accepted: 01/05/2021] [Indexed: 01/12/2023] Open

Gumerov VM, Zhulin IB. TREND: a platform for exploring protein function in prokaryotes based on phylogenetic, domain architecture and gene neighborhood analyses. Nucleic Acids Res 2020;48:W72-W76. [PMID: 32282909 PMCID: PMC7319448 DOI: 10.1093/nar/gkaa243] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 03/16/2020] [Accepted: 04/01/2020] [Indexed: 01/16/2023] Open

Liu B, Leng L, Sun X, Wang Y, Ma J, Zhu Y. ECMPride: prediction of human extracellular matrix proteins based on the ideal dataset using hybrid features with domain evidence. PeerJ 2020;8:e9066. [PMID: 32377454 PMCID: PMC7195829 DOI: 10.7717/peerj.9066] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Accepted: 04/05/2020] [Indexed: 01/28/2023] Open

Bohnert S, Antelo L, Grünewald C, Yemelin A, Andresen K, Jacob S. Rapid adaptation of signaling networks in the fungal pathogen Magnaporthe oryzae. BMC Genomics 2019;20:763. [PMID: 31640564 PMCID: PMC6805500 DOI: 10.1186/s12864-019-6113-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Accepted: 09/20/2019] [Indexed: 11/10/2022] Open

Subirana JA, Messeguer X. Satellites in the prokaryote world. BMC Evol Biol 2019;19:181. [PMID: 31533616 PMCID: PMC6749651 DOI: 10.1186/s12862-019-1504-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Accepted: 08/28/2019] [Indexed: 11/10/2022] Open

Abstract

Background

Satellites or tandem repeats are very abundant in many eukaryotic genomes. Occasionally they have been reported to be present in some prokaryotes, but to our knowledge there is no general comparative study on their occurrence. For this reason we present here an overview of the distribution and properties of satellites in a set of representative species. Our results provide novel insights into the evolutionary relationship between eukaryotes, Archaea and Bacteria.

Results

We have searched all possible satellites present in the NCBI reference group of genomes in Archaea (142 species) and in Bacteria (119 species), detecting 2735 satellites in Archaea and 1067 in Bacteria. We have found that the distribution of satellites is very variable in different organisms. The archaeal Methanosarcina class stands out for the large amount of satellites in their genomes. Satellites from a few species have similar characteristics to those in eukaryotes, but most species have very few satellites: only 21 species in Archaea and 18 in Bacteria have more than 4 satellites/Mb. The distribution of satellites in these species is reminiscent of what is found in eukaryotes, but we find two significant differences: most satellites have a short length and many of them correspond to segments of genes coding for amino acid repeats. Transposition of non-coding satellites throughout the genome occurs rarely: only in the bacteria Leptospira interrogans and the archaea Methanocella conradii we have detected satellite families of transposed satellites with long repeats.

Conclusions

Our results demonstrate that the presence of satellites in the genome is not an exclusive feature of eukaryotes. We have described a few prokaryotes which do contain satellites. We present a discussion on their eventual evolutionary significance.

Electronic supplementary material

The online version of this article (10.1186/s12862-019-1504-2) contains supplementary material, which is available to authorized users.

Collapse

Rodrigues JV, Ogbunugafor CB, Hartl DL, Shakhnovich EI. Chimeric dihydrofolate reductases display properties of modularity and biophysical diversity. Protein Sci 2019;28:1359-1367. [PMID: 31095809 DOI: 10.1002/pro.3646] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2019] [Accepted: 05/13/2019] [Indexed: 01/12/2023]

Ratcliffe LE, Asiedu EK, Pickett CJ, Warburton MA, Izzi SA, Meedel TH. The Ciona myogenic regulatory factor functions as a typical MRF but possesses a novel N-terminus that is essential for activity. Dev Biol 2019;448:210-225. [PMID: 30365920 PMCID: PMC6478573 DOI: 10.1016/j.ydbio.2018.10.010] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2018] [Revised: 08/28/2018] [Accepted: 10/16/2018] [Indexed: 11/26/2022]

Sanchez de Groot N, Torrent Burgas M, Ravarani CN, Trusina A, Ventura S, Babu MM. The fitness cost and benefit of phase-separated protein deposits. Mol Syst Biol 2019;15:e8075. [PMID: 30962358 PMCID: PMC6452874 DOI: 10.15252/msb.20178075] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open

Rapid evolution of protein diversity by de novo origination in Oryza. Nat Ecol Evol 2019;3:679-690. [PMID: 30858588 DOI: 10.1038/s41559-019-0822-5] [Citation(s) in RCA: 85] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Accepted: 01/23/2019] [Indexed: 12/22/2022]

Jiang F, Liu Q, Liu X, Wang XH, Kang L. Genomic data reveal high conservation but divergent evolutionary pattern of Polycomb/Trithorax group genes in arthropods. INSECT SCIENCE 2019;26:20-34. [PMID: 29127737 DOI: 10.1111/1744-7917.12558] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2017] [Revised: 11/04/2017] [Accepted: 11/05/2017] [Indexed: 06/07/2023]

Exaptation at the molecular genetic level. SCIENCE CHINA-LIFE SCIENCES 2018;62:437-452. [PMID: 30798493 DOI: 10.1007/s11427-018-9447-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2018] [Accepted: 12/01/2018] [Indexed: 12/22/2022]

Bitard‐Feildel T, Lamiable A, Mornon J, Callebaut I. Order in Disorder as Observed by the "Hydrophobic Cluster Analysis" of Protein Sequences. Proteomics 2018;18:e1800054. [PMID: 30299594 PMCID: PMC7168002 DOI: 10.1002/pmic.201800054] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Revised: 08/29/2018] [Indexed: 12/17/2022]

Dangwal M, Das S. Identification and Analysis of OVATE Family Members from Genome of the Early Land Plants Provide Insights into Evolutionary History of OFP Family and Function. J Mol Evol 2018;86:511-530. [PMID: 30206666 DOI: 10.1007/s00239-018-9863-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2018] [Accepted: 09/05/2018] [Indexed: 01/11/2023]

Abstract

Mosses, liverworts, hornworts and lycophytes represent transition stages between the aquatic to terrestrial/land plants. Several morphological and adaptive novelties driven by genomic components including emergence and expansion of new or existing gene families have played a critical role during and after the transition, and contributed towards successful colonization of terrestrial ecosystems. It is crucial to decipher the evolutionary transitions and natural selection on the gene structure and function to understand the emergence of phenotypic and adaptive diversity. Plants at the "transition zone", between aquatic and terrestrial ecosystem, are also the most vulnerable because of climate change and may contain clues for successful mitigation of the challenges of climate change. Identification and comparative analyses of such genetic elements and gene families are few in mosses, liverworts, hornworts and lycophytes. Ovate family proteins (OFPs) are plant-specific transcriptional repressors and are acknowledged for their roles in important growth and developmental processes in land plants, and information about the functional aspects of OFPs in early land plants is fragmentary. As a first step towards addressing this gap, a comprehensive in silico analysis was carried out utilizing publicly available genome sequences of Marchantia polymorpha (Mp), Physcomitrella patens (Pp), Selaginella moellendorffii (Sm) and Sphagnum fallax (Sf). Our analysis led to the identification of 4 MpOFPs, 19 PpOFPs, 6 SmOFPs and 3 SfOFPs. Cross-genera analysis revealed a drastic change in the structure and physiochemical properties in OFPs suggesting functional diversification and genomic plasticity during the evolutionary course. Knowledge gained from this comparative analysis will form the framework towards deciphering and dissection of their developmental and adaptive role/s in early land plants and could provide insights into evolutionary strategies adapted by land plants.

Collapse

Incipient de novo genes can evolve from frozen accidents that escaped rapid transcript turnover. Nat Ecol Evol 2018;2:1626-1632. [DOI: 10.1038/s41559-018-0639-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Accepted: 07/09/2018] [Indexed: 11/08/2022]

Willis S, Masel J. Gene Birth Contributes to Structural Disorder Encoded by Overlapping Genes. Genetics 2018;210:303-313. [PMID: 30026186 PMCID: PMC6116962 DOI: 10.1534/genetics.118.301249] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2018] [Accepted: 07/18/2018] [Indexed: 11/18/2022] Open

Jakubec D, Kratochvíl M, Vymĕtal J, Vondrášek J. Widespread evolutionary crosstalk among protein domains in the context of multi-domain proteins. PLoS One 2018;13:e0203085. [PMID: 30169546 PMCID: PMC6118372 DOI: 10.1371/journal.pone.0203085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2018] [Accepted: 08/14/2018] [Indexed: 11/20/2022] Open

Klasberg S, Bitard-Feildel T, Callebaut I, Bornberg-Bauer E. Origins and structural properties of novel and de novo protein domains during insect evolution. FEBS J 2018;285:2605-2625. [PMID: 29802682 DOI: 10.1111/febs.14504] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2017] [Revised: 04/12/2018] [Accepted: 05/11/2018] [Indexed: 12/11/2022]

Menichelli C, Gascuel O, Bréhélin L. Improving pairwise comparison of protein sequences with domain co-occurrence. PLoS Comput Biol 2018;14:e1005889. [PMID: 29293498 PMCID: PMC5766236 DOI: 10.1371/journal.pcbi.1005889] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2017] [Revised: 01/12/2018] [Accepted: 11/23/2017] [Indexed: 01/17/2023] Open

Abstract

Comparing and aligning protein sequences is an essential task in bioinformatics. More specifically, local alignment tools like BLAST are widely used for identifying conserved protein sub-sequences, which likely correspond to protein domains or functional motifs. However, to limit the number of false positives, these tools are used with stringent sequence-similarity thresholds and hence can miss several hits, especially for species that are phylogenetically distant from reference organisms. A solution to this problem is then to integrate additional contextual information to the procedure. Here, we propose to use domain co-occurrence to increase the sensitivity of pairwise sequence comparisons. Domain co-occurrence is a strong feature of proteins, since most protein domains tend to appear with a limited number of other domains on the same protein. We propose a method to take this information into account in a typical BLAST analysis and to construct new domain families on the basis of these results. We used Plasmodium falciparum as a case study to evaluate our method. The experimental findings showed an increase of 14% of the number of significant BLAST hits and an increase of 25% of the proteome area that can be covered with a domain. Our method identified 2240 new domains for which, in most cases, no model of the Pfam database could be linked. Moreover, our study of the quality of the new domains in terms of alignment and physicochemical properties show that they are close to that of standard Pfam domains. Source code of the proposed approach and supplementary data are available at: https://gite.lirmm.fr/menichelli/pairwise-comparison-with-cooccurrence

Deciphering the functions of the different proteins of an organism constitutes a first step toward the understanding of its biology. Because they provide strong clues regarding protein functions, domains occupy a key position among the relevant annotations that can be assigned to a protein. Protein domains are sequential motifs that are conserved along evolution and are found in different proteins and in different combinations. One common approach for identifying the domains of a protein is to run sequence-sequence comparisons with local alignment tools as BLAST. However these approaches sometimes miss several hits, especially for species that are phylogenetically distant from reference organisms. We propose here an approach to increase the sensitivity of pairwise sequence comparisons. This approach makes use of the fact that protein domains tend to appear with a limited number of other domains on the same protein (the domain co-occurrence property). On P. falciparum, our approach allows identifying 2240 new domains for which, in most cases, no domain of the Pfam database could be linked.

Collapse

Young Genes are Highly Disordered as Predicted by the Preadaptation Hypothesis of De Novo Gene Birth. Nat Ecol Evol 2017. [PMID: 28642936 PMCID: PMC5476217 DOI: 10.1038/s41559-017-0146] [Citation(s) in RCA: 91] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Craig EA, Marszalek J. How Do J-Proteins Get Hsp70 to Do So Many Different Things? Trends Biochem Sci 2017;42:355-368. [PMID: 28314505 DOI: 10.1016/j.tibs.2017.02.007] [Citation(s) in RCA: 130] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2016] [Revised: 02/09/2017] [Accepted: 02/16/2017] [Indexed: 01/07/2023]

Exploring the dark foldable proteome by considering hydrophobic amino acids topology. Sci Rep 2017;7:41425. [PMID: 28134276 PMCID: PMC5278394 DOI: 10.1038/srep41425] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2016] [Accepted: 12/19/2016] [Indexed: 12/18/2022] Open

Schmitz JF, Bornberg-Bauer E. Fact or fiction: updates on how protein-coding genes might emerge de novo from previously non-coding DNA. F1000Res 2017;6:57. [PMID: 28163910 PMCID: PMC5247788 DOI: 10.12688/f1000research.10079.1] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/17/2017] [Indexed: 12/31/2022] Open

Schüler A, Bornberg-Bauer E. Evolution of Protein Domain Repeats in Metazoa. Mol Biol Evol 2016;33:3170-3182. [PMID: 27671125 PMCID: PMC5100051 DOI: 10.1093/molbev/msw194] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open

Maervoet VET, Briers Y. Synthetic biology of modular proteins. Bioengineered 2016;8:196-202. [PMID: 27645260 DOI: 10.1080/21655979.2016.1222993] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open

Emergence of de novo proteins from 'dark genomic matter' by 'grow slow and moult'. Biochem Soc Trans 2016;43:867-73. [PMID: 26517896 DOI: 10.1042/bst20150089] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Creating functional sophistication from simple protein building blocks, exemplified by factor H and the regulators of complement activation. Biochem Soc Trans 2016;43:812-8. [PMID: 26517887 DOI: 10.1042/bst20150074] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]

Klasberg S, Bitard-Feildel T, Mallet L. Computational Identification of Novel Genes: Current and Future Perspectives. Bioinform Biol Insights 2016;10:121-31. [PMID: 27493475 PMCID: PMC4970615 DOI: 10.4137/bbi.s39950] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2016] [Revised: 05/31/2016] [Accepted: 06/05/2016] [Indexed: 12/31/2022] Open

McLysaght A, Hurst LD. Open questions in the study of de novo genes: what, how and why. Nat Rev Genet 2016;17:567-78. [PMID: 27452112 DOI: 10.1038/nrg.2016.78] [Citation(s) in RCA: 125] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Lees JG, Dawson NL, Sillitoe I, Orengo CA. Functional innovation from changes in protein domains and their combinations. Curr Opin Struct Biol 2016;38:44-52. [DOI: 10.1016/j.sbi.2016.05.016] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Revised: 05/17/2016] [Accepted: 05/24/2016] [Indexed: 10/21/2022]

Cromar GL, Zhao A, Xiong X, Swapna LS, Loughran N, Song H, Parkinson J. PhyloPro2.0: a database for the dynamic exploration of phylogenetically conserved proteins and their domain architectures across the Eukarya. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016;2016:baw013. [PMID: 26980519 PMCID: PMC4792532 DOI: 10.1093/database/baw013] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Accepted: 01/29/2016] [Indexed: 11/13/2022]

Neme R, Tautz D. Evolution: dynamics of de novo gene emergence. Curr Biol 2016;24:R238-40. [PMID: 24650912 DOI: 10.1016/j.cub.2014.02.016] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]

Scaiewicz A, Levitt M. The language of the protein universe. Curr Opin Genet Dev 2015;35:50-6. [PMID: 26451980 DOI: 10.1016/j.gde.2015.08.010] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Revised: 08/20/2015] [Accepted: 08/25/2015] [Indexed: 11/17/2022]

Kersting AR, Mizrachi E, Bornberg-Bauer E, Myburg AA. Protein domain evolution is associated with reproductive diversification and adaptive radiation in the genus Eucalyptus. THE NEW PHYTOLOGIST 2015;206:1328-36. [PMID: 25494981 DOI: 10.1111/nph.13211] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/15/2014] [Accepted: 11/04/2014] [Indexed: 05/04/2023]