26
|
Senkevich TG, Yutin N, Wolf YI, Koonin EV, Moss B. Ancient Gene Capture and Recent Gene Loss Shape the Evolution of Orthopoxvirus-Host Interaction Genes. mBio 2021; 12:e0149521. [PMID: 34253028 PMCID: PMC8406176 DOI: 10.1128/mbio.01495-21] [Citation(s) in RCA: 59] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Accepted: 05/24/2021] [Indexed: 01/27/2023] Open
Abstract
The survival of viruses depends on their ability to resist host defenses and, of all animal virus families, the poxviruses have the most antidefense genes. Orthopoxviruses (ORPV), a genus within the subfamily Chordopoxvirinae, infect diverse mammals and include one of the most devastating human pathogens, the now eradicated smallpox virus. ORPV encode ∼200 genes, of which roughly half are directly involved in virus genome replication and expression as well as virion morphogenesis. The remaining ∼100 "accessory" genes are responsible for virus-host interactions, particularly counter-defense of innate immunity. Complete sequences are currently available for several hundred ORPV genomes isolated from a variety of mammalian hosts, providing a rich resource for comparative genomics and reconstruction of ORPV evolution. To identify the provenance and evolutionary trends of the ORPV accessory genes, we constructed clusters including the orthologs of these genes from all chordopoxviruses. Most of the accessory genes were captured in three major waves early in chordopoxvirus evolution, prior to the divergence of ORPV and the sister genus Centapoxvirus from their common ancestor. The capture of these genes from the host was followed by extensive gene duplication, yielding several paralogous gene families. In addition, nine genes were gained during the evolution of ORPV themselves. In contrast, nearly every accessory gene was lost, some on multiple, independent occasions in numerous lineages of ORPV, so that no ORPV retains them all. A variety of functional interactions could be inferred from examination of pairs of ORPV accessory genes that were either often or rarely lost concurrently. IMPORTANCE Orthopoxviruses (ORPV) include smallpox (variola) virus, one of the most devastating human pathogens, and vaccinia virus, comprising the vaccine used for smallpox eradication. Among roughly 200 ORPV genes, about half are essential for genome replication and expression as well as virion morphogenesis, whereas the remaining half consists of accessory genes counteracting the host immune response. We reannotated the accessory genes of ORPV, predicting the functions of uncharacterized genes, and reconstructed the history of their gain and loss during the evolution of ORPV. Most of the accessory genes were acquired in three major waves antedating the origin of ORPV from chordopoxviruses. The evolution of ORPV themselves was dominated by gene loss, with numerous genes lost at the base of each major group of ORPV. Examination of pairs of ORPV accessory genes that were either often or rarely lost concurrently during ORPV evolution allows prediction of different types of functional interactions.
Collapse
|
27
|
Li X, Zhao Y, Tu X, Li C, Zhu Y, Zhong H, Liu ZJ, Wu S, Zhai J. Comparative analysis of plastomes in Oxalidaceae: Phylogenetic relationships and potential molecular markers. PLANT DIVERSITY 2021; 43:281-291. [PMID: 34485770 PMCID: PMC8390927 DOI: 10.1016/j.pld.2021.04.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/12/2020] [Revised: 04/14/2021] [Accepted: 04/17/2021] [Indexed: 06/13/2023]
Abstract
The wood sorrel family, Oxalidaceae, is mainly composed of annual or perennial herbs, a few shrubs, and trees distributed from temperate to tropical zones. Members of Oxalidaceae are of high medicinal, ornamental, and economic value. Despite the rich diversity and value of Oxalidaceae, few molecular markers or plastomes are available for phylogenetic analysis of the family. Here, we reported four new whole plastomes of Oxalidaceae and compared them with plastomes of three species in the family, as well as the plastome of Rourea microphylla in the closely related family Connaraceae. The eight plastomes ranged in length from 150,673 bp (Biophytum sensitivum) to 156,609 bp (R. microphylla). Genome annotations revealed a total of 129-131 genes, including 83-84 protein-coding genes, eight rRNA genes, 37 tRNA genes, and two to three pseudogenes. Comparative analyses showed that the plastomes of these species have minor variations at the gene level. The smaller plastomes of herbs B. sensitivum and three Oxalis species are associated with variations in IR region sizes, intergenic region variation, and gene or intron loss. We identified sequences with high variation that may serve as molecular markers in taxonomic studies of Oxalidaceae. The phylogenetic trees of selected superrosid representatives based on 76 protein-coding genes corroborated the Oxalidaceae position in Oxalidales and supported it as a sister to Connaraceae. Our research also supported the monophyly of the COM (Celastrales, Oxalidales, and Malpighiales) clade.
Collapse
|
28
|
Ogita Y, Tamura K, Mawaribuchi S, Takamatsu N, Ito M. Independent pseudogenizations and losses of sox15 during amniote diversification following asymmetric ohnolog evolution. BMC Ecol Evol 2021; 21:134. [PMID: 34193037 PMCID: PMC8244163 DOI: 10.1186/s12862-021-01864-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Accepted: 06/17/2021] [Indexed: 11/28/2022] Open
Abstract
Background Four ohnologous genes (sox1, sox2, sox3, and sox15) were generated by two rounds of whole-genome duplication in a vertebrate ancestor. In eutherian mammals, Sox1, Sox2, and Sox3 participate in central nervous system (CNS) development. Sox15 has a function in skeletal muscle regeneration and has little functional overlap with the other three ohnologs. In contrast, the frog Xenopus laevis and zebrafish orthologs of sox15 as well as sox1-3 function in CNS development. We previously reported that Sox15 is involved in mouse placental development as neofunctionalization, but is pseudogenized in the marsupial opossum. These findings suggest that sox15 might have evolved with divergent gene fates during vertebrate evolution. However, knowledge concerning sox15 in other vertebrate lineages than therian mammals, anuran amphibians, and teleost fish is scarce. Our purpose in this study was to clarify the fate and molecular evolution of sox15 during vertebrate evolution. Results We searched for sox15 orthologs in all vertebrate classes from agnathans to mammals by significant sequence similarity and synteny analyses using vertebrate genome databases. Interestingly, sox15 was independently pseudogenized at least twice during diversification of the marsupial mammals. Moreover, we observed independent gene loss of sox15 at least twice during reptile evolution in squamates and crocodile-bird diversification. Codon-based phylogenetic tree and selective analyses revealed an increased dN/dS ratio for sox15 compared to the other three ohnologs during jawed vertebrate evolution. Conclusions The findings revealed an asymmetric evolution of sox15 among the four ohnologs during vertebrate evolution, which was supported by the increased dN/dS values in cartilaginous fishes, anuran amphibians, and amniotes. The increased dN/dS value of sox15 may have been caused mainly by relaxed selection. Notably, independent pseudogenizations and losses of sox15 were observed during marsupial and reptile evolution, respectively. Both might have been caused by strong relaxed selection. The drastic gene fates of sox15, including neofunctionalization and pseudogenizations/losses during amniote diversification, might be caused by a release from evolutionary constraints. Supplementary Information The online version contains supplementary material available at 10.1186/s12862-021-01864-z.
Collapse
|
29
|
Fischer H, Tschachler E, Eckhart L. Cytosolic DNA sensing through cGAS and STING is inactivated by gene mutations in pangolins. Apoptosis 2021; 25:474-480. [PMID: 32533513 PMCID: PMC7291609 DOI: 10.1007/s10495-020-01614-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The release of DNA into the cytoplasm upon damage to the nucleus or during viral infection triggers an interferon-mediated defense response, inflammation and cell death. In human cells cytoplasmic DNA is sensed by cyclic GMP-AMP Synthase (cGAS) and Absent In Melanoma 2 (AIM2). Here, we report the identification of a “natural knockout” model of cGAS. Comparative genomics of phylogenetically diverse mammalian species showed that cGAS and its interaction partner Stimulator of Interferon Genes (STING) have been inactivated by mutations in the Malayan pangolin whereas other mammals retained intact copies of these genes. The coding sequences of CGAS and STING1 are also disrupted by premature stop codons and frame-shift mutations in Chinese and tree pangolins, suggesting that expression of these genes was lost in a common ancestor of all pangolins that lived more than 20 million years ago. AIM2 is retained in a functional form in pangolins whereas it is inactivated by mutations in carnivorans, the phylogenetic sister group of pangolins. The deficiency of cGAS and STING points to the existence of alternative mechanisms of controlling cytoplasmic DNA-associated cell damage and viral infections in pangolins.
Collapse
|
30
|
Guo YY, Yang JX, Bai MZ, Zhang GQ, Liu ZJ. The chloroplast genome evolution of Venus slipper (Paphiopedilum): IR expansion, SSC contraction, and highly rearranged SSC regions. BMC PLANT BIOLOGY 2021; 21:248. [PMID: 34058997 DOI: 10.21203/rs.3.rs-257472/v1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Accepted: 05/20/2021] [Indexed: 05/29/2023]
Abstract
BACKGROUND Paphiopedilum is the largest genus of slipper orchids. Previous studies showed that the phylogenetic relationships of this genus are not well resolved, and sparse taxon sampling documented inverted repeat (IR) expansion and small single copy (SSC) contraction of the chloroplast genomes of Paphiopedilum. RESULTS Here, we sequenced, assembled, and annotated 77 plastomes of Paphiopedilum species (size range of 152,130 - 164,092 bp). The phylogeny based on the plastome resolved the relationships of the genus except for the phylogenetic position of two unstable species. We used phylogenetic and comparative genomic approaches to elucidate the plastome evolution of Paphiopedilum. The plastomes of Paphiopedilum have a conserved genome structure and gene content except in the SSC region. The large single copy/inverted repeat (LSC/IR) boundaries are relatively stable, while the boundaries of the inverted repeat and small single copy region (IR/SSC) varied among species. Corresponding to the IR/SSC boundary shifts, the chloroplast genomes of the genus experienced IR expansion and SSC contraction. The IR region incorporated one to six genes of the SSC region. Unexpectedly, great variation in the size, gene order, and gene content of the SSC regions was found, especially in the subg. Parvisepalum. Furthermore, Paphiopedilum provides evidence for the ongoing degradation of the ndh genes in the photoautotrophic plants. The estimated substitution rates of the protein coding genes show accelerated rates of evolution in clpP, psbH, and psbZ. Genes transferred to the IR region due to the boundary shift also have higher substitution rates. CONCLUSIONS We found IR expansion and SSC contraction in the chloroplast genomes of Paphiopedilum with dense sampling, and the genus shows variation in the size, gene order, and gene content of the SSC region. This genus provides an ideal system to investigate the dynamics of plastome evolution.
Collapse
|
31
|
Guo YY, Yang JX, Bai MZ, Zhang GQ, Liu ZJ. The chloroplast genome evolution of Venus slipper (Paphiopedilum): IR expansion, SSC contraction, and highly rearranged SSC regions. BMC PLANT BIOLOGY 2021; 21:248. [PMID: 34058997 PMCID: PMC8165784 DOI: 10.1186/s12870-021-03053-y] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Accepted: 05/20/2021] [Indexed: 05/24/2023]
Abstract
BACKGROUND Paphiopedilum is the largest genus of slipper orchids. Previous studies showed that the phylogenetic relationships of this genus are not well resolved, and sparse taxon sampling documented inverted repeat (IR) expansion and small single copy (SSC) contraction of the chloroplast genomes of Paphiopedilum. RESULTS Here, we sequenced, assembled, and annotated 77 plastomes of Paphiopedilum species (size range of 152,130 - 164,092 bp). The phylogeny based on the plastome resolved the relationships of the genus except for the phylogenetic position of two unstable species. We used phylogenetic and comparative genomic approaches to elucidate the plastome evolution of Paphiopedilum. The plastomes of Paphiopedilum have a conserved genome structure and gene content except in the SSC region. The large single copy/inverted repeat (LSC/IR) boundaries are relatively stable, while the boundaries of the inverted repeat and small single copy region (IR/SSC) varied among species. Corresponding to the IR/SSC boundary shifts, the chloroplast genomes of the genus experienced IR expansion and SSC contraction. The IR region incorporated one to six genes of the SSC region. Unexpectedly, great variation in the size, gene order, and gene content of the SSC regions was found, especially in the subg. Parvisepalum. Furthermore, Paphiopedilum provides evidence for the ongoing degradation of the ndh genes in the photoautotrophic plants. The estimated substitution rates of the protein coding genes show accelerated rates of evolution in clpP, psbH, and psbZ. Genes transferred to the IR region due to the boundary shift also have higher substitution rates. CONCLUSIONS We found IR expansion and SSC contraction in the chloroplast genomes of Paphiopedilum with dense sampling, and the genus shows variation in the size, gene order, and gene content of the SSC region. This genus provides an ideal system to investigate the dynamics of plastome evolution.
Collapse
|
32
|
Guo X, Zhang G, Fan L, Liu C, Ji Y. Highly degenerate plastomes in two hemiparasitic dwarf mistletoes: Arceuthobium chinense and A. pini (Viscaceae). PLANTA 2021; 253:125. [PMID: 34028602 DOI: 10.1007/s00425-021-03643-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Accepted: 05/19/2021] [Indexed: 06/12/2023]
Abstract
The leafless and endophytic habitat may significantly relax the selection pressure on photosynthesis, and plastid transcription and translation, causing the loss/pseudogenization of several essential plastid-encoding genes in dwarf mistletoes. Dwarf mistletoes (Arceuthobium spp., Viscaceae) are the most destructive plant parasites to numerous conifer species worldwide. In this study, the plastid genomes (plastomes) of Arceuthobium chinense Lecomte and A. pini Hawksworth and Wiens were sequenced and characterized. Although dwarf mistletoes are hemiparasites capable of photosynthesis, their plastomes were highly degenerated, as indicated by the smallest plastome size, the lowest GC content, and relatively very few intact genes among the Santalales hemiparasites. Unexpectedly, several essential housekeeping genes (rpoA, rpoB, rpoC1, and rpoC2) and some core photosynthetic genes (psbZ and petL), as well as the rpl33 gene, that is indispensable for plants under stress conditions, were deleted or pseudogenized in the Arceuthobium plastomes. Our data suggest that the leafless and endophytic habit, which heavily relies on the coniferous hosts for nutrients and carbon requirement, may largely relax the selection pressure on photosynthesis, as well as plastid transcription and translation, thus resulting in the loss/pseudogenization of such essential plastid-encoding genes in dwarf mistletoes. Therefore, the higher level of plastome degradation in Arceuthobium species than other Santalales hemiparasites is likely correlated with the evolution of leafless and endophytic habit. A higher degree of plastome degradation in Arceuthobium. These findings provide new insights into the plastome degeneration associated with parasitism in Santalales and deepen our understanding of the biology of dwarf mistletoes.
Collapse
|
33
|
Li J, Yuan D, Wang P, Wang Q, Sun M, Liu Z, Si H, Xu Z, Ma Y, Zhang B, Pei L, Tu L, Zhu L, Chen LL, Lindsey K, Zhang X, Jin S, Wang M. Cotton pan-genome retrieves the lost sequences and genes during domestication and selection. Genome Biol 2021; 22:119. [PMID: 33892774 PMCID: PMC8063427 DOI: 10.1186/s13059-021-02351-w] [Citation(s) in RCA: 56] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Accepted: 04/14/2021] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Millennia of directional human selection has reshaped the genomic architecture of cultivated cotton relative to wild counterparts, but we have limited understanding of the selective retention and fractionation of genomic components. RESULTS We construct a comprehensive genomic variome based on 1961 cottons and identify 456 Mb and 357 Mb of sequence with domestication and improvement selection signals and 162 loci, 84 of which are novel, including 47 loci associated with 16 agronomic traits. Using pan-genome analyses, we identify 32,569 and 8851 non-reference genes lost from Gossypium hirsutum and Gossypium barbadense reference genomes respectively, of which 38.2% (39,278) and 14.2% (11,359) of genes exhibit presence/absence variation (PAV). We document the landscape of PAV selection accompanied by asymmetric gene gain and loss and identify 124 PAVs linked to favorable fiber quality and yield loci. CONCLUSIONS This variation repertoire points to genomic divergence during cotton domestication and improvement, which informs the characterization of favorable gene alleles for improved breeding practice using a pan-genome-based approach.
Collapse
|
34
|
Puzakova LV, Puzakov MV, Gostyukhina OL. Newly Discovered AqE Gene is Highly Conserved in Non-tetrapod Vertebrates. J Mol Evol 2021; 89:214-224. [PMID: 33604781 DOI: 10.1007/s00239-021-09997-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 01/21/2021] [Indexed: 10/22/2022]
Abstract
Studying the diversity of energy production pathways is important for understanding the evolutionary relationships between metabolic pathways and their biochemical precursors. The lactate/malate dehydrogenase (LDH/MDH) superfamily has been a model system for structural and functional evolution for a long time. Recently, the type-2 family of LDH/MDH (or LDH2/MDH2 oxidoreductase) has been identified. The LDH2/MDH2 oxidoreductase family is now known to have functionally more diverse enzymes than the LDH/MDH superfamily. In channel catfish, the gene encoding the LDH2/MDH2 oxidoreductase has been found (and was provisionally termed AqE). Homologs of this enzyme are predominantly present in organisms living in an aquatic environment. In this work, we studied the AqE gene distribution among non-tetrapod vertebrates. It was found that the AqE gene is present in the genomes of bony and cartilaginous fish and in the genomes of hagfishes and lampreys. In addition, it has been confirmed that in representatives of Cypriniformes, the AqE gene has been lost. AqE in representatives of Salmoniformes underwent significant deletions, which most likely led to its pseudogenization. In most orders of non-Tetrapoda vertebrates, the AqE gene remains highly conserved, suggesting that the AqE gene in aquatic vertebrates is an essential gene and undergoes rigorous selection. The AqE gene has the highest sequence similarity with the archaeal ComC gene that encodes sulfolactate dehydrogenase (SLDH). Based on the similarity of substrates, the enzyme encoded by the AqE gene is likely involved in the malate-aspartate shuttle mechanism or the biosynthesis of the energy coenzyme M equivalent.
Collapse
|
35
|
Patterns and tempo of PCSK9 pseudogenizations suggest an ancient divergence in mammalian cholesterol homeostasis mechanisms. Genetica 2021; 149:1-19. [PMID: 33515402 PMCID: PMC7929951 DOI: 10.1007/s10709-021-00113-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Accepted: 01/04/2021] [Indexed: 01/06/2023]
Abstract
Proprotein convertase subtilisin/kexin type 9 (PCSK9) plays a central role in cholesterol homeostasis in humans as a major regulator of LDLR levels. PCSK9 is an intriguing protease in that it does not act by proteolysis but by preventing LDLR recirculation from endosomes to the plasma membrane. This, and the inexistence of any other proteolytic substrate but itself could suggest that PCSK9 is an exquisite example of evolutionary fine-tuning. However, the gene has been lost in several mammalian species, and null alleles are present (albeit at low frequencies) in some human populations without apparently deleterious health effects, raising the possibility that the PCSK9 may have become dispensable in the mammalian lineage. To address this issue, we systematically recovered, assembled, corrected, annotated and analysed publicly available PCSK9 sequences for 420 eutherian species to determine the distribution, frequencies, mechanisms and timing of PCSK9 pseudogenization events, as well as the evolutionary pressures underlying the preservation or loss of the gene. We found a dramatic difference in the patterns of PCSK9 retention and loss between Euarchontoglires—where there is strong pressure for gene preservation—and Laurasiatheria, where multiple independent events have led to PCSK9 loss in most species. These results suggest that there is a fundamental difference in the regulation of cholesterol metabolism between Euarchontoglires and Laurasiatheria, which in turn has important implications for the use of Laurasiatheria species (e.g. pigs) as animal models of human cholesterol-related diseases.
Collapse
|
36
|
Ecological Genomics and Evolution of Trichoderma reesei. Methods Mol Biol 2021; 2234:1-21. [PMID: 33165775 DOI: 10.1007/978-1-0716-1048-0_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
The filamentous fungus Trichoderma reesei (Hypocreales, Ascomycota) is an efficient industrial cell factory for the production of cellulolytic enzymes used for biofuel and other applications. Therefore, researches addressing T. reesei are relatively advanced compared to other Trichoderma spp. because of the significant bulk of available knowledge, multiple genomic data, and gene manipulation techniques. However, the established role of T. reesei in industry has resulted in a frequently biased understanding of the biology of this fungus. Thus, the recent studies unexpectedly show that the superior cellulolytic activity of T. reesei and other Trichoderma species evolved due to multiple lateral gene transfer events, while the innate ability to parasitize other fungi (mycoparasitism) was maintained in the genus, including T. reesei. In this chapter, we will follow the concept of ecological genomics and describe the ecology, distribution, and evolution of T. reesei, as well as critically discuss several common misconceptions that originate from the success of this species in applied sciences and industry.
Collapse
|
37
|
Guo X, Liu C, Wang H, Zhang G, Yan H, Jin L, Su W, Ji Y. The complete plastomes of two flowering epiparasites (Phacellaria glomerata and P. compressa): Gene content, organization, and plastome degradation. Genomics 2020; 113:447-455. [PMID: 33370586 DOI: 10.1016/j.ygeno.2020.12.031] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Revised: 03/10/2020] [Accepted: 12/22/2020] [Indexed: 11/28/2022]
Abstract
A plant parasite obligately parasitizing another plant parasite is referred to as epiparasite, which is extremely rare in angiosperms, and their complete plastome sequences have not been characterized to date. In this study, the complete plastomes of two flowering epiparasites: Phacellaria compressa and P. glomerata (Amphorogynaceae, Santalales) were sequenced. The plastomes of both species are of similar size, structure, gene content, and arrangement of genes to other hemiparasites in Santalales. Their plastomes were characterized by the functional loss of plastid-encoded NAD(P)H-dehydrogenase and infA genes, which strongly coincides with the general pattern of plastome degradation observed in Santalales hemiparasites. Our study demonstrates that the relatively higher level of nutritional reliance on the host plants and the reduced vegetative bodies of P. compressa and P. glomerata do not appear to cause any unique plastome degradation compared with their closely related hemiparasites.
Collapse
|
38
|
Sharma S, Shinde SS, Teekas L, Vijay N. Evidence for the loss of plasminogen receptor KT gene in chicken. Immunogenetics 2020; 72:507-515. [PMID: 33247773 DOI: 10.1007/s00251-020-01186-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Accepted: 11/03/2020] [Indexed: 12/31/2022]
Abstract
The loss of conserved genes has the potential to alter phenotypes drastically. Screening of vertebrate genomes for lineage-specific gene loss events has identified numerous natural knockouts associated with specific phenotypes. We provide evidence for the loss of a multi-exonic plasminogen receptor KT (PLGRKT) protein-encoding gene located on the Z chromosome in chicken. Exons 1 and 2 are entirely missing; remnants of exon 3 and a mostly intact exon 4 are identified in an assembly gap-free region in chicken with conserved synteny across species and verified using transcriptome and genome sequencing. PLGRKT gene disrupting changes are present in representative species from all five galliform families. In contrast to this, the presence of an intact transcriptionally active PLGRKT gene in species such as mallard, swan goose, and Anolis lizard suggests that gene loss occurred in the galliform lineage sometime between 68 and 80 Mya. The presence of galliform specific chicken repeat 1 (CR1) insertion at the erstwhile exon 2 of PLGRKT gene suggests repeat insertion-mediated loss. However, at least nine other independent PLGRKT coding frame disrupting changes in other bird species are supported by genome sequencing and indicate a role for relaxed purifying selection before CR1 insertion. The recurrent loss of a conserved gene with a role in the regulation of macrophage migration, efferocytosis, and blood coagulation is intriguing. Hence, we propose potential candidate genes that might be compensating the function of PLGRKT based on the presence of a C-terminal lysine residue, transmembrane domains, and gene expression patterns.
Collapse
|
39
|
Convergent Cortistatin losses parallel modifications in circadian rhythmicity and energy homeostasis in Cetacea and other mammalian lineages. Genomics 2020; 113:1064-1070. [PMID: 33157262 DOI: 10.1016/j.ygeno.2020.11.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Revised: 10/26/2020] [Accepted: 11/01/2020] [Indexed: 12/11/2022]
Abstract
The ancestors of Cetacea underwent profound morpho-physiological alterations. By displaying an exclusive aquatic existence, cetaceans evolved unique patterns of locomotor activity, vigilant behaviour, thermoregulation and circadian rhythmicity. Deciphering the molecular landscape governing many of these adaptations is key to understand the evolution of phenotypes. Here, we investigate Cortistatin (CORT), a neuropeptide displaying an important role in mammalian biorhythm regulation. This neuropeptide is a known neuroendocrine factor, stimulating slow-wave sleep, but also involved in the regulation of energy metabolism and hypomotility inducement. We assessed the functional status of CORT in 359 mammalian genomes (25 orders), including 30 species of Cetacea. Our findings indicate that cetaceans and other mammals with atypical biorhythms, thermal constraints and/or energy metabolism, have accumulated deleterious mutations in CORT. In light of the pleiotropic action of this neuropeptide, we suggest that this inactivation contributed to a plethora of phenotypic adjustments to accommodate adaptive solutions to specific ecological niches.
Collapse
|
40
|
Górecki P, Markin A, Eulenstein O. Exact median-tree inference for unrooted reconciliation costs. BMC Evol Biol 2020; 20:136. [PMID: 33115401 PMCID: PMC7593691 DOI: 10.1186/s12862-020-01700-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Solving median tree problems under tree reconciliation costs is a classic and well-studied approach for inferring species trees from collections of discordant gene trees. These problems are NP-hard, and therefore are, in practice, typically addressed by local search heuristics. So far, however, such heuristics lack any provable correctness or precision. Further, even for small phylogenetic studies, it has been demonstrated that local search heuristics may only provide sub-optimal solutions. Obviating such heuristic uncertainties are exact dynamic programming solutions that allow solving tree reconciliation problems for smaller phylogenetic studies. Despite these promises, such exact solutions are only suitable for credibly rooted input gene trees, which constitute only a tiny fraction of the readily available gene trees. Standard gene tree inference approaches provide only unrooted gene trees and accurately rooting such trees is often difficult, if not impossible. Results Here, we describe complex dynamic programming solutions that represent the first nonnaïve exact solutions for solving the tree reconciliation problems for unrooted input gene trees. Further, we show that the asymptotic runtime of the proposed solutions does not increase when compared to the most time-efficient dynamic programming solutions for rooted input trees. Conclusions In an experimental evaluation, we demonstrate that the described solutions for unrooted gene trees are, like the solutions for rooted input gene trees, suitable for smaller phylogenetic studies. Finally, for the first time, we study the accuracy of classic local search heuristics for unrooted tree reconciliation problems.
Collapse
|
41
|
Choi K, Weng ML, Ruhlman TA, Jansen RK. Extensive variation in nucleotide substitution rate and gene/intron loss in mitochondrial genomes of Pelargonium. Mol Phylogenet Evol 2020; 155:106986. [PMID: 33059063 DOI: 10.1016/j.ympev.2020.106986] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Revised: 07/16/2020] [Accepted: 10/07/2020] [Indexed: 12/30/2022]
Abstract
Geraniaceae organelle genomes have been shown to exhibit several highly unusual features compared to most other photosynthetic angiosperms. This includes massively rearranged plastomes with considerable size variation, extensive gene and intron loss, accelerated rates of nucleotide substitutions in both mitogenomes and plastomes, and biparental inheritance and cytonuclear incompatibility of the plastome. Most previous studies have focused on plastome evolution with mitogenome comparisons limited to only a few taxa or genes. In this study, mitogenomes and transcriptomes were examined for 27 species of Geraniales, including 13 species of Pelargonium. Extensive gene and intron losses were detected across the Geraniales with Pelargonium representing the most gene depauperate lineage in the family. Plotting these events on the Geraniaceae phylogenetic tree showed that gene losses occurred multiple times, whereas intron losses more closely reflected the relationships among taxa. In addition, P. australe acquired an intron by horizontal transfer. Comparisons of nucleotide substitution rates in Pelargonium showed that synonymous changes in nuclear genes were much lower than in mitochondrial genes. This is in contrast to the previously published studies that indicated that nuclear genes have 16 fold higher rates than mitochondrial genes across angiosperms. Elevated synonymous substitutions occurred for each mitochondrial gene in Pelargonium with the highest values 783 and 324 times higher than outgroups and other Geraniaceae, respectively. Pelargonium is one of four unrelated genera of angiosperms (Ajuga, Plantago and Silene) that have experienced highly accelerated nucleotide substitutions in mitogenomes. It is distinct from most angiosperms in also having elevated substitution rates in plastid genes but the cause of rate accelerations in Pelargonium plastomes and mitogenomes may be different.
Collapse
|
42
|
Fonseca E, Machado AM, Vilas-Arrondo N, Gomes-Dos-Santos A, Veríssimo A, Esteves P, Almeida T, Themudo G, Ruivo R, Pérez M, da Fonseca R, Santos MM, Froufe E, Román-Marcote E, Venkatesh B, Castro LFC. Cartilaginous fishes offer unique insights into the evolution of the nuclear receptor gene repertoire in gnathostomes. Gen Comp Endocrinol 2020; 295:113527. [PMID: 32526329 DOI: 10.1016/j.ygcen.2020.113527] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 05/15/2020] [Accepted: 06/03/2020] [Indexed: 12/12/2022]
Abstract
Nuclear receptors (NRs) are key transcription factors that originated in the common ancestor of metazoans. The vast majority of NRs are triggered by binding to either endogenous (e.g. retinoic acid) or exogenous (e.g. xenobiotics) ligands, and their evolution and expansion is tightly linked to the function of endocrine systems. Importantly, they represent classic targets of physiological exploitation by endocrine disrupting chemicals. The NR gene repertoire in different lineages has been shaped by gene loss, duplication and mutation, denoting a dynamic evolutionary route. As the earliest diverging class of gnathostomes (jawed vertebrates), cartilaginous fishes offer an exceptional opportunity to address the early diversification of NR gene families and the evolution of the endocrine system in jawed vertebrates. Here we provide an exhaustive analysis into the NR gene composition in five elasmobranch (sharks and rays) and two holocephalan (chimaeras) species. For this purpose, we generated also a low coverage draft genome assembly of the chimaera small-eyed rabbitfish, Hydrolagus affinis. We show that cartilaginous fish retain an archetypal NR gene repertoire, similar to that of mammals and coincident with the two rounds of whole genome duplication that occurred in the gnathostome ancestor. Furthermore, novel gene members of the non-canonical NR0B receptors were found in the genomes of this lineage. Our findings provide an essential view into the early diversification of NRs in gnathostomes, paving the way for functional studies.
Collapse
|
43
|
Petters-Vandresen DAL, Rossi BJ, Groenewald JZ, Crous PW, Machado MA, Stukenbrock EH, Glienke C. Mating-type locus rearrangements and shifts in thallism states in Citrus-associated Phyllosticta species. Fungal Genet Biol 2020; 144:103444. [PMID: 32822858 DOI: 10.1016/j.fgb.2020.103444] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Revised: 07/28/2020] [Accepted: 07/31/2020] [Indexed: 11/15/2022]
Abstract
Currently, eight Phyllosticta species are known to be associated with several Citrus hosts, incorporating diverse lifestyles: while some of them are endophytic (P. capitalensis and P. citribraziliensis), others are pathogenic (P. citriasiana, P. citricarpa, P. citrichinaensis and P. paracitricarpa). Sexual reproduction plays a key role in the interaction between these Phyllosticta species and their Citrus hosts, especially for the spread and persistence of the pathogenic species in the environment. Given this, differences in sexual reproduction strategies could be related to the differences in lifestyles. To evaluate this hypothesis, we characterized the mating-type loci of six Citrus-associated Phyllosticta species from whole genome assemblies. Mating-type genes in the Citrus-associated Phyllosticta species are highly variable in their sequence content, but the genomic locations and organization of the mating-type loci are conserved. Phyllosticta citriasiana, P. citribraziliensis, P. citricarpa and P. paracitricarpa are heterothallic, while P. capitalensis and P. citrichinaensis are homothallic. In addition, the P. citrichinaensis MAT1-2 idiomorph occurs in a separate location from the mating-type locus. Ancestral state reconstruction suggests that homothallism is the ancestral thallism state in Phyllosticta, with a shift to heterothallism in Phyllosticta species that are pathogenic to Citrus. Moreover, the homothallic strategies of P. capitalensis and P. citrichinaensis result from independent evolutionary events, as P. capitalensis locus likely represents the ancestral state, and P. citrichinaensis homothallism has risen through a reversion in a heterothallic ancestor and underwent remodelling events. As the pathogenic species P. citriasiana, P. citricarpa and P. paracitricarpa are heterothallic and incapable of selfing, disease management practices focused in preventing the occurrence of sexual reproduction could assist in the control of Citrus Black Spot and Citrus Tan Spot diseases. This study emphasizes the importance of studying Citrus-Phyllosticta interactions under evolutionary and genomic perspectives, as these approaches can provide valuable information about the association between Phyllosticta species and their hosts, and also serve as guidance for the improvement of disease management practices.
Collapse
|
44
|
Improved reconstruction and comparative analysis of chromosome 12 to rectify Mis-assemblies in Gossypium arboreum. BMC Genomics 2020; 21:470. [PMID: 32640982 PMCID: PMC7346634 DOI: 10.1186/s12864-020-06814-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2019] [Accepted: 06/09/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Genome sequencing technologies have been improved at an exponential pace but precise chromosome-scale genome assembly still remains a great challenge. The draft genome of cultivated G. arboreum was sequenced and assembled with shotgun sequencing approach, however, it contains several misassemblies. To address this issue, we generated an improved reassembly of G. arboreum chromosome 12 using genetic mapping and reference-assisted approaches and evaluated this reconstruction by comparing with homologous chromosomes of G. raimondii and G. hirsutum. RESULTS In this study, we generated a high quality assembly of the 94.64 Mb length of G. arboreum chromosome 12 (A_A12) which comprised of 144 scaffolds and contained 3361 protein coding genes. Evaluation of results using syntenic and collinear analysis of reconstructed G. arboreum chromosome A_A12 with its homologous chromosomes of G. raimondii (D_D08) and G. hirsutum (AD_A12 and AD_D12) confirmed the significant improved quality of current reassembly as compared to previous one. We found major misassemblies in previously assembled chromosome 12 (A_Ca9) of G. arboreum particularly in anchoring and orienting of scaffolds into a pseudo-chromosome. Further, homologous chromosomes 12 of G. raimondii (D_D08) and G. arboreum (A_A12) contained almost equal number of transcription factor (TF) related genes, and showed good collinear relationship with each other. As well, a higher rate of gene loss was found in corresponding homologous chromosomes of tetraploid (AD_A12 and AD_D12) than diploid (A_A12 and D_D08) cotton, signifying that gene loss is likely a continuing process in chromosomal evolution of tetraploid cotton. CONCLUSION This study offers a more accurate strategy to correct misassemblies in sequenced draft genomes of cotton which will provide further insights towards its genome organization.
Collapse
|
45
|
Jasti N, Sebagh D, Riaz M, Wang X, Koripella B, Palanisamy V, Mohammad N, Chen Q, Friedrich M. Towards reconstructing the dipteran demise of an ancient essential gene: E3 ubiquitin ligase Murine double minute. Dev Genes Evol 2020; 230:279-294. [PMID: 32623522 DOI: 10.1007/s00427-020-00663-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Accepted: 06/21/2020] [Indexed: 01/09/2023]
Abstract
Genome studies have uncovered many examples of essential gene loss, raising the question of how ancient genes transition from essentiality to dispensability. We explored this process for the deeply conserved E3 ubiquitin ligase Murine double minute (Mdm), which is lacking in Drosophila despite the conservation of its main regulatory target, the cellular stress response gene p53. Conducting gene expression and knockdown experiments in the red flour beetle Tribolium castaneum, we found evidence that Mdm has remained essential in insects where it is present. Using bioinformatics approaches, we confirm the absence of the Mdm gene family in Drosophila, mapping its loss to the stem lineage of schizophoran Diptera and Pipunculidae (big-headed flies), about 95-85 million years ago. Intriguingly, this gene loss event was preceded by the de novo origin of the gene Companion of reaper (Corp), a novel p53 regulatory factor that is characterized by functional similarities to vertebrate Mdm2 despite lacking E3 ubiquitin ligase protein domains. Speaking against a 1:1 compensatory gene gain/loss scenario, however, we found that hoverflies (Syrphidae) and pointed-wing flies (Lonchopteridae) possess both Mdm and Corp. This implies that the two p53 regulators have been coexisting for ~ 150 million years in select dipteran clades and for at least 50 million years in the lineage to Schizophora and Pipunculidae. Given these extensive time spans of Mdm/Corp coexistence, we speculate that the loss of Mdm in the lineage to Drosophila involved further acquisitions of compensatory gene activities besides the emergence of Corp. Combined with the previously noted reduction of an ancestral P53 contact domain in the Mdm homologs of crustaceans and insects, we conclude that the loss of the ancient Mdm gene family in flies was the outcome of incremental functional regression over long macroevolutionary time scales.
Collapse
|
46
|
Li Z, Ma X, Wen Y, Chen S, Jiang Y, Jin X. Plastome of the mycoheterotrophic eudicot Exacum paucisquama (Gentianaceae) exhibits extensive gene loss and a highly expanded inverted repeat region. PeerJ 2020; 8:e9157. [PMID: 32551191 PMCID: PMC7292021 DOI: 10.7717/peerj.9157] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2019] [Accepted: 04/18/2020] [Indexed: 11/20/2022] Open
Abstract
Mycoheterotrophic plants are highly specialized species able to acquire organic carbon from symbiotic fungi, with relaxed dependence on photosynthesis for carbon fixation. The relaxation of the functional constraint of photosynthesis and thereby the relaxed selective pressure on functional photosynthetic genes usually lead to substantial gene loss and a highly degraded plastid genome in heterotrophs. In this study, we sequenced and analyzed the plastome of the eudicot Exacum paucisquama, providing the first plastid genome of a mycoheterotroph in the family Gentianaceae to date. The E. paucisquama plastome was 44,028 bp in length, which is much smaller than the plastomes of autotrophic eudicots. Although the E. paucisquama plastome had a quadripartite structure, a distinct boundary shift was observed in comparison with the plastomes of other eudicots. We detected extensive gene loss and only 21 putative functional genes (15 protein-coding genes, four rRNA genes and two tRNA genes). Our results provide valuable information for comparative evolutionary analyses of plastomes of heterotrophic species belonging to different phylogenetic groups.
Collapse
|
47
|
Graham-Taylor C, Kamphuis LG, Derbyshire MC. A detailed in silico analysis of secondary metabolite biosynthesis clusters in the genome of the broad host range plant pathogenic fungus Sclerotinia sclerotiorum. BMC Genomics 2020; 21:7. [PMID: 31898475 PMCID: PMC6941272 DOI: 10.1186/s12864-019-6424-4] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2019] [Accepted: 12/23/2019] [Indexed: 01/07/2023] Open
Abstract
BACKGROUND The broad host range pathogen Sclerotinia sclerotiorum infects over 400 plant species and causes substantial yield losses in crops worldwide. Secondary metabolites are known to play important roles in the virulence of plant pathogens, but little is known about the secondary metabolite repertoire of S. sclerotiorum. In this study, we predicted secondary metabolite biosynthetic gene clusters in the genome of S. sclerotiorum and analysed their expression during infection of Brassica napus using an existing transcriptome data set. We also investigated their sequence diversity among a panel of 25 previously published S. sclerotiorum isolate genomes. RESULTS We identified 80 putative secondary metabolite clusters. Over half of the clusters contained at least three transcriptionally coregulated genes. Comparative genomics revealed clusters homologous to clusters in the closely related plant pathogen Botrytis cinerea for production of carotenoids, hydroxamate siderophores, DHN melanin and botcinic acid. We also identified putative phytotoxin clusters that can potentially produce the polyketide sclerin and an epipolythiodioxopiperazine. Secondary metabolite clusters were enriched in subtelomeric genomic regions, and those containing paralogues showed a particularly strong association with repeats. The positional bias we identified was borne out by intraspecific comparisons that revealed putative secondary metabolite genes suffered more presence / absence polymorphisms and exhibited a significantly higher sequence diversity than other genes. CONCLUSIONS These data suggest that S. sclerotiorum produces numerous secondary metabolites during plant infection and that their gene clusters undergo enhanced rates of mutation, duplication and recombination in subtelomeric regions. The microevolutionary regimes leading to S. sclerotiorum secondary metabolite diversity have yet to be elucidated. Several potential phytotoxins documented in this study provide the basis for future functional analyses.
Collapse
|
48
|
Liu A, He F, Shen L, Liu R, Wang Z, Zhou J. Convergent degeneration of olfactory receptor gene repertoires in marine mammals. BMC Genomics 2019; 20:977. [PMID: 31842731 PMCID: PMC6916060 DOI: 10.1186/s12864-019-6290-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Accepted: 11/14/2019] [Indexed: 11/10/2022] Open
Abstract
Background Olfactory receptors (ORs) can bind odor molecules and play a crucial role in odor sensation. Due to the frequent gains and losses of genes during evolution, the number of OR members varies greatly among different species. However, whether the extent of gene gains/losses varies between marine mammals and related terrestrial mammals has not been clarified, and the factors that might underlie these variations are unknown. Results To address these questions, we identified more than 10,000 members of the OR family in 23 mammals and classified them into 830 orthologous gene groups (OGGs) and 281 singletons. Significant differences occurred in the number of OR repertoires and OGGs among different species. We found that all marine mammals had fewer OR genes than their related terrestrial lineages, with the fewest OR genes found in cetaceans, which may be closely related to olfactory degradation. ORs with more gene duplications or loss events tended to be under weaker purifying selection. The average gain and loss rates of OR genes in terrestrial mammals were higher than those of mammalian gene families, while the average gain and loss rates of OR genes in marine mammals were significantly lower and much higher than those of mammalian gene families, respectively. Additionally, we failed to detect any one-to-one orthologous genes in the focal species, suggesting that OR genes are not well conserved among marine mammals. Conclusions Marine mammals have experienced large numbers of OR gene losses compared with their related terrestrial lineages, which may result from the frequent birth-and-death evolution under varied functional constrains. Due to their independent degeneration, OR genes present in each lineage are not well conserved among marine mammals. Our study provides a basis for future research on the olfactory receptor function in mammals from the perspective of evolutionary trajectories.
Collapse
|
49
|
Rubanov LI, Zaraisky AG, Shilovsky GA, Seliverstov AV, Zverkov OA, Lyubetsky VA. Screening for mouse genes lost in mammals with long lifespans. BioData Min 2019; 12:20. [PMID: 31728160 PMCID: PMC6842137 DOI: 10.1186/s13040-019-0208-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Accepted: 10/25/2019] [Indexed: 12/23/2022] Open
Abstract
Background Gerontogenes include those that modulate life expectancy in various species and may be the actual longevity genes. We believe that a long (relative to body weight) lifespan in individual rodent and primate species can be due, among other things, to the loss of particular genes that are present in short-lived species of the same orders. These genes can also explain the widely different rates of aging among diverse species as well as why similarly sized rodents or primates sometimes have anomalous life expectancies (e.g., naked mole-rats and humans). Here, we consider the gene loss in the context of the prediction of Williams’ theory that concerns the reallocation of physiological resources of an organism between active reproduction (r-strategy) and self-maintenance (K-strategy). We have identified such lost genes using an original computer-aided approach; the software considers the loss of a gene as disruptions in gene orthology, local gene synteny or both. Results A method and software identifying the genes that are absent from a predefined set of species but present in another predefined set of species are suggested. Examples of such pairs of sets include long-lived vs short-lived, homeothermic vs poikilothermic, amniotic vs anamniotic, aquatic vs terrestrial, and neotenic vs nonneotenic species, among others. Species are included in one of two sets according to the property of interest, such as longevity or homeothermy. The program is universal towards these pairs, i.e., towards the underlying property, although the sets should include species with quality genome assemblies. Here, the proposed method was applied to study the longevity of Euarchontoglires species. It largely predicted genes that are highly expressed in the testis, epididymis, uterus, mammary glands, and the vomeronasal and other reproduction-related organs. This agrees with Williams’ theory that hypothesizes a species transition from r-strategy to K-strategy. For instance, the method predicts the mouse gene Smpd5, which has an expression level 20 times greater in the testis than in organs unrelated to reproduction as experimentally demonstrated elsewhere. At the same time, its paralog Smpd3 is not predicted by the program and is widely expressed in many organs not specifically related to reproduction. Conclusions The method and program, which were applied here to screen for gene losses that can accompany increased lifespan, were also applied to study reduced regenerative capacity and development of the telencephalon, neoteny, etc. Some of these results have been carefully tested experimentally. Therefore, we assume that the method is widely applicable.
Collapse
|
50
|
Choi IS, Schwarz EN, Ruhlman TA, Khiyami MA, Sabir JSM, Hajarah NH, Sabir MJ, Rabah SO, Jansen RK. Fluctuations in Fabaceae mitochondrial genome size and content are both ancient and recent. BMC PLANT BIOLOGY 2019; 19:448. [PMID: 31653201 PMCID: PMC6814987 DOI: 10.1186/s12870-019-2064-8] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Accepted: 10/02/2019] [Indexed: 05/12/2023]
Abstract
BACKGROUND Organelle genome studies of Fabaceae, an economically and ecologically important plant family, have been biased towards the plastid genome (plastome). Thus far, less than 15 mitochondrial genome (mitogenome) sequences of Fabaceae have been published, all but four of which belong to the subfamily Papilionoideae, limiting the understanding of size variation and content across the family. To address this, four mitogenomes were sequenced and assembled from three different subfamilies (Cercidoideae, Detarioideae and Caesalpinioideae). RESULTS Phylogenetic analysis based on shared mitochondrial protein coding regions produced a fully resolved and well-supported phylogeny that was completely congruent with the plastome tree. Comparative analyses suggest that two kinds of mitogenome expansions have occurred in Fabaceae. Size expansion of four genera (Tamarindus, Libidibia, Haematoxylum, and Leucaena) in two subfamilies (Detarioideae and Caesalpinioideae) occurred in relatively deep nodes, and was mainly caused by intercellular gene transfer and/or interspecific horizontal gene transfer (HGT). The second, more recent expansion occurred in the Papilionoideae as a result of duplication of native mitochondrial sequences. Family-wide gene content analysis revealed 11 gene losses, four (rps2, 7, 11 and 13) of which occurred in the ancestor of Fabaceae. Losses of the remaining seven genes (cox2, rpl2, rpl10, rps1, rps19, sdh3, sdh4) were restricted to specific lineages or occurred independently in different clades. Introns of three genes (cox2, ccmFc and rps10) showed extensive lineage-specific length variation due to large sequence insertions and deletions. Shared DNA analysis among Fabaceae mitogenomes demonstrated a substantial decay of intergenic spacers and provided further insight into HGT between the mimosoid clade of Caesalpinioideae and the holoparasitic Lophophytum (Balanophoraceae). CONCLUSION This study represents the most exhaustive analysis of Fabaceae mitogenomes so far, and extends the understanding the dynamic variation in size and gene/intron content. The four newly sequenced mitogenomes reported here expands the phylogenetic coverage to four subfamilies. The family has experienced multiple mitogenome size fluctuations in both ancient and recent times. The causes of these size variations are distinct in different lineages. Fabaceae mitogenomes experienced extensive size fluctuation by recruitment of exogenous DNA and duplication of native mitochondrial DNA.
Collapse
|