1
|
Naranjo JG, Sither CB, Conant GC. Shared single copy genes are generally reliable for inferring phylogenetic relationships among polyploid taxa. Mol Phylogenet Evol 2024; 196:108087. [PMID: 38677353 DOI: 10.1016/j.ympev.2024.108087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 03/22/2024] [Accepted: 04/24/2024] [Indexed: 04/29/2024]
Abstract
Polyploidy, or whole-genome duplication, is expected to confound the inference of species trees with phylogenetic methods for two reasons. First, the presence of retained duplicated genes requires the reconciliation of the inferred gene trees to a proposed species tree. Second, even if the analyses are restricted to shared single copy genes, the occurrence of reciprocal gene loss, where the surviving genes in different species are paralogs from the polyploidy rather than orthologs, will mean that such genes will not have evolved under the corresponding species tree and may not produce gene trees that allow inference of that species tree. Here we analyze three different ancient polyploidy events, using synteny-based inferences of orthology and paralogy to infer gene trees from nearly 17,000 sets of homologous genes. We find that the simple use of single copy genes from polyploid organisms provides reasonably robust phylogenetic signals, despite the presence of reciprocal gene losses. Such gene trees are also most often in accord with the inferred species relationships inferred from maximum likelihood models of gene loss after polyploidy: a completely distinct phylogenetic signal present in these genomes. As seen in other studies, however, we find that methods for inferring phylogenetic confidence yield high support values even in cases where the underlying data suggest meaningful conflict in the phylogenetic signals.
Collapse
Affiliation(s)
- Jaells G Naranjo
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA
| | - Charles B Sither
- Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC, USA
| | - Gavin C Conant
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA; Genetics and Genomics Academy, North Carolina State University, Raleigh, NC, USA; Department of Biological Sciences, North Carolina State University, Raleigh, NC, USA.
| |
Collapse
|
2
|
Mabry ME, Abrahams RS, Al-Shehbaz IA, Baker WJ, Barak S, Barker MS, Barrett RL, Beric A, Bhattacharya S, Carey SB, Conant GC, Conran JG, Dassanayake M, Edger PP, Hall JC, Hao Y, Hendriks KP, Hibberd JM, King GJ, Kliebenstein DJ, Koch MA, Leitch IJ, Lens F, Lysak MA, McAlvay AC, McKibben MTW, Mercati F, Moore RC, Mummenhoff K, Murphy DJ, Nikolov LA, Pisias M, Roalson EH, Schranz ME, Thomas SK, Yu Q, Yocca A, Pires JC, Harkess AE. Complementing model species with model clades. THE PLANT CELL 2024; 36:1205-1226. [PMID: 37824826 PMCID: PMC11062466 DOI: 10.1093/plcell/koad260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 09/07/2023] [Accepted: 09/22/2023] [Indexed: 10/14/2023]
Abstract
Model species continue to underpin groundbreaking plant science research. At the same time, the phylogenetic resolution of the land plant tree of life continues to improve. The intersection of these 2 research paths creates a unique opportunity to further extend the usefulness of model species across larger taxonomic groups. Here we promote the utility of the Arabidopsis thaliana model species, especially the ability to connect its genetic and functional resources, to species across the entire Brassicales order. We focus on the utility of using genomics and phylogenomics to bridge the evolution and diversification of several traits across the Brassicales to the resources in Arabidopsis, thereby extending scope from a model species by establishing a "model clade." These Brassicales-wide traits are discussed in the context of both the model species Arabidopsis and the family Brassicaceae. We promote the utility of such a "model clade" and make suggestions for building global networks to support future studies in the model order Brassicales.
Collapse
Affiliation(s)
- Makenzie E Mabry
- Florida Museum of Natural History, University of Florida, Gainesville, FL 32611, USA
| | - R Shawn Abrahams
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA
- Department of Biochemistry, Purdue University, West Lafayette, IN 47906, USA
| | | | | | - Simon Barak
- Ben-Gurion University of the Negev, French Associates Institute for Agriculture and Biotechnology of Drylands, Jacob Blaustein Institutes for Desert Research, Midreshet Ben-Gurion, 8499000, Israel
| | - Michael S Barker
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Russell L Barrett
- National Herbarium of New South Wales, Australian Botanic Garden, Locked Bag 6002, Mount Annan, NSW 2567, Australia
| | - Aleksandra Beric
- Department of Psychiatry, Washington University in Saint Louis School of Medicine, St. Louis, MO 63110, USA
- NeuroGenomics and Informatics Center, Washington University in Saint Louis School of Medicine, St. Louis, MO 63108, USA
| | - Samik Bhattacharya
- Department of Biology, Botany, University of Osnabrück, D-49076 Osnabrück, Germany
| | - Sarah B Carey
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Gavin C Conant
- Department of Biological Sciences, Bioinformatics Research Center, Program in Genetics, North Carolina State University, Raleigh, NC 27695, USA
| | - John G Conran
- ACEBB and SGC, School of Biological Sciences, The University of Adelaide, Adelaide, SA 5005, Australia
| | - Maheshi Dassanayake
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Patrick P Edger
- Department of Horticulture, Michigan State University, East Lansing, MI 48864, USA
| | - Jocelyn C Hall
- Department of Biological Sciences, University of Alberta, Edmonton, Alberta T6G 2E9, Canada
| | - Yue Hao
- Cancer and Cell Biology Division, Translational Genomics Research Institute, Phoenix, AZ 85004, USA
| | - Kasper P Hendriks
- Department of Biology, Botany, University of Osnabrück, D-49076 Osnabrück, Germany
- Functional Traits, Naturalis Biodiversity Center, PO Box 9517, Leiden 2300 RA, the Netherlands
| | - Julian M Hibberd
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 1TN, UK
| | - Graham J King
- Southern Cross Plant Science, Southern Cross University, Lismore, NSW 2480, Australia
| | | | - Marcus A Koch
- Centre for Organismal Studies (COS), Heidelberg University, 69120 Heidelberg, Germany
| | - Ilia J Leitch
- Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - Frederic Lens
- Functional Traits, Naturalis Biodiversity Center, PO Box 9517, Leiden 2300 RA, the Netherlands
- Institute of Biology Leiden, Plant Sciences, Leiden University, 2333 BE Leiden, the Netherlands
| | - Martin A Lysak
- CEITEC, and NCBR, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
| | - Alex C McAlvay
- Institute of Economic Botany, New York Botanical Garden, The Bronx, NY 10458, USA
| | - Michael T W McKibben
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Francesco Mercati
- National Research Council (CNR), Institute of Biosciences and Bioresource (IBBR), Palermo 90129, Italy
| | | | - Klaus Mummenhoff
- Department of Biology, Botany, University of Osnabrück, D-49076 Osnabrück, Germany
| | - Daniel J Murphy
- Royal Botanic Gardens Victoria, Melbourne, VIC 3004, Australia
| | | | - Michael Pisias
- Division of Plant Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Eric H Roalson
- School of Biological Sciences, Washington State University, Pullman, WA 99164-4236, USA
| | - M Eric Schranz
- Biosystematics Group, Wageningen University, 6708 PB Wageningen, the Netherlands
| | - Shawn K Thomas
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA
- Bioinformatics and Analytics Core, University of Missouri, Columbia, MO 65211, USA
| | - Qingyi Yu
- Daniel K. Inouye U.S. Pacific Basin Agricultural Research Center, Agricultural Research Service, United States Department of Agriculture, Hilo, HI 96720, USA
| | - Alan Yocca
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - J Chris Pires
- Department of Soil and Crop Sciences, Colorado State University, Fort Collins, CO 80523-1170, USA
| | - Alex E Harkess
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| |
Collapse
|
3
|
Assis R, Conant G, Holland B, Liberles DA, O'Reilly MM, Wilson AE. Models for the retention of duplicate genes and their biological underpinnings. F1000Res 2024; 12:1400. [PMID: 38173826 PMCID: PMC10762295 DOI: 10.12688/f1000research.141786.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/08/2024] [Indexed: 01/05/2024] Open
Abstract
Gene content in genomes changes through several different processes, with gene duplication being an important contributor to such changes. Gene duplication occurs over a range of scales from individual genes to whole genomes, and the dynamics of this process can be context dependent. Still, there are rules by which genes are retained or lost from genomes after duplication, and probabilistic modeling has enabled characterization of these rules, including their context-dependence. Here, we describe the biology and corresponding mathematical models that are used to understand duplicate gene retention and its contribution to the set of biochemical functions encoded in a genome.
Collapse
Affiliation(s)
- Raquel Assis
- Florida Atlantic University, Boca Raton, Florida, USA
| | - Gavin Conant
- North Carolina State University, Raleigh, North Carolina, USA
| | | | | | | | | |
Collapse
|
4
|
Chahar N, Dangwal M, Das S. Complex origin, evolution, and diversification of non-canonically organized OVATE-OFP and OVATE-Like OFP gene pair across Embryophyta. Gene 2023; 883:147685. [PMID: 37536399 DOI: 10.1016/j.gene.2023.147685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2023] [Revised: 07/21/2023] [Accepted: 07/31/2023] [Indexed: 08/05/2023]
Abstract
Ovate Family Proteins (OFP) is a plant-specific gene family of negative transcriptional regulators. Till-date, a handful of in-silico studies have provided glimpses into family size, expansion patterns, and genic features across all major plant lineages. A major lacuna exists in understanding origin of organisation complexity of members such as those arranged in a head-to-head manner which may lead to transcriptional co-regulation via a common bi-directional promoter. To address this gap, we investigated the origin, organization and evolution of two head-to-head arranged gene pairs of homologs of AtOFP2-AtOFP17, and, AtOFP4-AtOFP20 across Archaeplastida. The ancestral forms of AtOFP2, AtOFP4, AtOFP17, and AtOFP20 are likely to have evolved in last common ancestors of Embryophyta (land plants) given their complete absence in Rhodophyta and Chlorophyta. The OFP gene family originated and expanded in Bryophyta, including protein variants with complete (OVATE-OFP) or partial (OVATE-Like OFP) OVATE domain; with head-to-head organization present only in Spermatophyta (gymnosperms and angiosperms). Ancestral State Reconstruction revealed the origin of head-to-head organized gene pair in gymnosperms, with both genes being OVATE-OFP (homologs of AtOFP2/4). Phylogenetic reconstruction and copy number analysis suggests the presence of a single copy of the head-to-head arranged pair of OFP2/4 (OVATE)-OFP17/20 (OVATE-Like) in all angiosperms except Brassicaceae, and a duplication event in last common ancestor of core Brassicaceae approximately 32-54 MYA leading to origin of AtOFP2-AtOFP17 and AtOFP4-AtOFP20 as paralogs. Synteny analysis of genomic regions harbouring homologs of AtOFP2-AtOFP17, AtOFP4-AtOFP20 and AtOFP2/4-AtOFP17/20 across angiosperms suggested ancestral nature of AtOFP2-AtOFP17 gene pair. The present study thus establishes the orthology and evolutionary history of two non-canonically organised gene pairs with variation in their OVATE domain. The non-canonical organisation, atleast in Brassicaceae, has the potential of generating complex transcriptional regulation mediated via a common bi-directional promoter. The study thus lays down a framework to understand evolution of gene and protein structure, transcriptional regulation and function across a phylogenetic lineage through comparative analyses.
Collapse
Affiliation(s)
- Nishu Chahar
- Department of Botany, University of Delhi, Delhi 110 007, India.
| | | | - Sandip Das
- Department of Botany, University of Delhi, Delhi 110 007, India.
| |
Collapse
|
5
|
Yang Y, Xu T, Conant G, Kishino H, Thorne JL, Ji X. Interlocus Gene Conversion, Natural Selection, and Paralog Homogenization. Mol Biol Evol 2023; 40:msad198. [PMID: 37675606 PMCID: PMC10503786 DOI: 10.1093/molbev/msad198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 08/07/2023] [Accepted: 09/05/2023] [Indexed: 09/08/2023] Open
Abstract
Following a duplication, the resulting paralogs tend to diverge. While mutation and natural selection can accelerate this process, they can also slow it. Here, we quantify the paralog homogenization that is caused by point mutations and interlocus gene conversion (IGC). Among 164 duplicated teleost genes, the median percentage of postduplication codon substitutions that arise from IGC rather than point mutation is estimated to be between 7% and 8%. By differentiating between the nonsynonymous codon substitutions that homogenize the protein sequences of paralogs and the nonhomogenizing nonsynonymous substitutions, we estimate the homogenizing nonsynonymous rates to be higher for 163 of the 164 teleost data sets as well as for all 14 data sets of duplicated yeast ribosomal protein-coding genes that we consider. For all 14 yeast data sets, the estimated homogenizing nonsynonymous rates exceed the synonymous rates.
Collapse
Affiliation(s)
- Yixuan Yang
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA
| | - Tanchumin Xu
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA
- Department of Statistics, North Carolina State University, Raleigh, NC, USA
| | - Gavin Conant
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA
- Department of Biological Sciences, North Carolina State University, Raleigh, NC, USA
| | - Hirohisa Kishino
- AI/Data Science Social Implementation Laboratory, Chuo University, Tokyo, Japan
| | - Jeffrey L Thorne
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA
- Department of Statistics, North Carolina State University, Raleigh, NC, USA
- Department of Biological Sciences, North Carolina State University, Raleigh, NC, USA
| | - Xiang Ji
- Department of Mathematics, Tulane University, New Orleans, LA, USA
| |
Collapse
|
6
|
Siddiqui M, Conant GC. POInT browse: orthology prediction and synteny exploration for paleopolyploid genomes. BMC Bioinformatics 2023; 24:174. [PMID: 37106333 PMCID: PMC10134530 DOI: 10.1186/s12859-023-05298-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Accepted: 04/19/2023] [Indexed: 04/29/2023] Open
Abstract
We describe POInTbrowse, a web portal that gives access to the orthology inferences made for polyploid genomes with POInT, the Polyploidy Orthology Inference Tool. Ancient, or paleo-, polyploidy events are widely distributed across the eukaryotic phylogeny, and the combination of duplicated and lost duplicated genes that these polyploidies produce can confound the identification of orthologous genes between genomes. POInT uses conserved synteny and phylogenetic models to infer orthologous genes between genomes with a shared polyploidy. It also gives confidence estimates for those orthology inferences. POInTbrowse gives both graphical and query-based access to these inferences from 12 different polyploidy events, allowing users to visualize genomic regions produced by polyploidies and perform batch queries for each polyploidy event, downloading genes trees and coding sequences for orthologous genes meeting user-specified criteria. POInTbrowse and the associated data are online at https://wgd.statgen.ncsu.edu .
Collapse
Affiliation(s)
- Mustafa Siddiqui
- Department of Biological Sciences, North Carolina State University, Raleigh, NC, USA
| | - Gavin C Conant
- Department of Biological Sciences, North Carolina State University, Raleigh, NC, USA.
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA.
- Program in Genetics, North Carolina State University, Raleigh, NC, USA.
| |
Collapse
|
7
|
Conant GC. POInT: Modeling Polyploidy in the Era of Ubiquitous Genomics. Methods Mol Biol 2023; 2545:77-90. [PMID: 36720808 DOI: 10.1007/978-1-0716-2561-3_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Thirteen years ago, we described an evolutionary modeling tool that could resolve the orthology relationships among the homologous genomic regions created by a whole-genome duplication. This tool, which we subsequently named POInT (the Polyploid Orthology Inference Tool), was originally only useful for studying a genome duplication known from bakers' yeast and its relatives. Now, with hundreds of genome sequences that contain the relicts of ancient polyploidy available, POInT can be used to study dozens of different polyploidies, asking both questions about the history of individual events and about the commonalities and differences seen between those events. In this chapter, I give a brief history of the development of POInT as an illustration of the interconnected nature of computational biology research. I then further describe how POInT operates and some of the strengths and drawbacks of its structure. I close with a few examples of discoveries we have made using it.
Collapse
Affiliation(s)
- Gavin C Conant
- Department of Biological Sciences, North Carolina State University, Raleigh, NC, USA.
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA.
- Program in Genetics, North Carolina State University, Raleigh, NC, USA.
| |
Collapse
|
8
|
McRae L, Beric A, Conant GC. Hybridization order is not the driving factor behind biases in duplicate gene losses among the hexaploid Solanaceae. Proc Biol Sci 2022; 289:20221810. [PMID: 36285500 PMCID: PMC9597411 DOI: 10.1098/rspb.2022.1810] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
We model the post-hexaploidy evolution of four genomes from the Solanaceae, a group of flowering plants comprising tomatoes, potatoes and their relatives. The hexaploidy that these genomes descend from occurred through two sequential allopolyploidy events and was marked by the unequal losses of duplicated genes from the different progenitor subgenomes. In contrast with the hexaploid Brassiceae (broccoli and its relatives), where the subgenome with the most surviving genes arrived last in the hexaploidy, among the Solanaceae the most preserved subgenome descends from one of the original two tetraploid progenitors. In fact, the last-arriving subgenome in these plants actually has the fewest surviving genes in the modern genomes. We explore whether the distribution of repetitive elements (REs) in these genomes can explain the biases in gene losses, but while the signals we find are broadly consistent with a role for high RE density in driving gene losses, the REs turn over so quickly that little signal of the RE condition at the time of paleopolyploidy is extant in the modern genomes.
Collapse
Affiliation(s)
- Logan McRae
- Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695, USA
| | - Aleksandra Beric
- Department of Psychiatry, Washington University in Saint Louis School of Medicine, St. Louis, MO 63110, USA
- NeuroGenomics and Informatics Center, Washington University in Saint Louis School of Medicine, St. Louis, MO 63108, USA
| | - Gavin C. Conant
- Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695, USA
- Program in Genetics, North Carolina State University, Raleigh, NC 27695, USA
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC 27695, USA
| |
Collapse
|
9
|
Hao Y, Fleming J, Petterson J, Lyons E, Edger PP, Pires JC, Thorne JL, Conant GC. Convergent evolution of polyploid genomes from across the eukaryotic tree of life. G3 (BETHESDA, MD.) 2022; 12:jkac094. [PMID: 35451464 PMCID: PMC9157103 DOI: 10.1093/g3journal/jkac094] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Accepted: 04/15/2022] [Indexed: 11/14/2022]
Abstract
By modeling the homoeologous gene losses that occurred in 50 genomes deriving from ten distinct polyploidy events, we show that the evolutionary forces acting on polyploids are remarkably similar, regardless of whether they occur in flowering plants, ciliates, fishes, or yeasts. We show that many of the events show a relative rate of duplicate gene loss before the first postpolyploidy speciation that is significantly higher than in later phases of their evolution. The relatively weak selective constraint experienced by the single-copy genes these losses produced leads us to suggest that most of the purely selectively neutral duplicate gene losses occur in the immediate postpolyploid period. Nearly all of the events show strong evidence of biases in the duplicate losses, consistent with them being allopolyploidies, with 2 distinct progenitors contributing to the modern species. We also find ongoing and extensive reciprocal gene losses (alternative losses of duplicated ancestral genes) between these genomes. With the exception of a handful of closely related taxa, all of these polyploid organisms are separated from each other by tens to thousands of reciprocal gene losses. As a result, it is very unlikely that viable diploid hybrid species could form between these taxa, since matings between such hybrids would tend to produce offspring lacking essential genes. It is, therefore, possible that the relatively high frequency of recurrent polyploidies in some lineages may be due to the ability of new polyploidies to bypass reciprocal gene loss barriers.
Collapse
Affiliation(s)
- Yue Hao
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85281, USA
| | - Jonathon Fleming
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC 27695, USA
| | - Joanna Petterson
- Department of Biomedical Engineering, North Carolina State University, Raleigh, NC 27695, USA
| | - Eric Lyons
- School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
| | - Patrick P Edger
- Department of Horticulture, Michigan State University, East Lansing, MI 48824, USA
- Ecology, Evolutionary Biology and Behavior, Michigan State University, East Lansing, MI 48824, USA
| | - J Chris Pires
- International Plant Science Center, New York Botanical Garden, Bronx, NY 10458, USA
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, USA
- Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA
| | - Jeffrey L Thorne
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC 27695, USA
- Program in Genetics, North Carolina State University, Raleigh, NC 27695, USA
- Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA
- Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695, USA
| | - Gavin C Conant
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC 27695, USA
- Program in Genetics, North Carolina State University, Raleigh, NC 27695, USA
- Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695, USA
| |
Collapse
|
10
|
Genomic and transcriptomic analyses of the subterranean termite Reticulitermes speratus: Gene duplication facilitates social evolution. Proc Natl Acad Sci U S A 2022; 119:2110361119. [PMID: 35042774 PMCID: PMC8785959 DOI: 10.1073/pnas.2110361119] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/01/2021] [Indexed: 12/26/2022] Open
Abstract
Gene duplication is a major source of evolutionary innovation and is associated with the increases in biological complexity and adaptive radiation. Termites are model social organisms characterized by a sophisticated caste system. We analyzed the genome of the Japanese subterranean termite, an ecologically and economically important insect acting as a destructive pest. The analyses revealed the significance of gene duplication in social evolution. Gene duplication associated with caste-biased gene expression was prevalent in the termite genome. Many of the duplicated genes were related to social functions, such as chemical communication, social immunity, and defense, and they were often expressed in caste-specific organs. We propose that gene duplication facilitates social evolution through regulatory diversification leading to caste-biased expression and functional specialization. Termites are model social organisms characterized by a polyphenic caste system. Subterranean termites (Rhinotermitidae) are ecologically and economically important species, including acting as destructive pests. Rhinotermitidae occupies an important evolutionary position within the clade representing a transitional taxon between the higher (Termitidae) and lower (other families) termites. Here, we report the genome, transcriptome, and methylome of the Japanese subterranean termite Reticulitermes speratus. Our analyses highlight the significance of gene duplication in social evolution in this termite. Gene duplication associated with caste-biased gene expression was prevalent in the R. speratus genome. The duplicated genes comprised diverse categories related to social functions, including lipocalins (chemical communication), cellulases (wood digestion and social interaction), lysozymes (social immunity), geranylgeranyl diphosphate synthase (social defense), and a novel class of termite lineage–specific genes with unknown functions. Paralogous genes were often observed in tandem in the genome, but their expression patterns were highly variable, exhibiting caste biases. Some of the assayed duplicated genes were expressed in caste-specific organs, such as the accessory glands of the queen ovary and the frontal glands of soldier heads. We propose that gene duplication facilitates social evolution through regulatory diversification, leading to caste-biased expression and subfunctionalization and/or neofunctionalization conferring caste-specialized functions.
Collapse
|
11
|
Abstract
Ancient polyploidy events are widely distributed across the evolutionary history of eukaryotes. Here, we describe a likelihood-based tool, POInT (the Polyploidy Orthology Inference Tool), for modeling ancient whole genome duplications and triplications, assigning homoeologous genes to subgenomes and inferring gene losses across different parental subgenomes after polyploidy.
Collapse
Affiliation(s)
- Yue Hao
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ, USA
| | - Gavin C Conant
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA.
- Program in Genetics, North Carolina State University, Raleigh, NC, USA.
- Department of Biological Sciences, North Carolina State University, Raleigh, NC, USA.
| |
Collapse
|
12
|
Chan J, Wang L, Li L, Mu K, Bushek D, Xu Y, Guo X, Zhang G, Zhang L. Transcriptomic Response to Perkinsus marinus in Two Crassostrea Oysters Reveals Evolutionary Dynamics of Host-Parasite Interactions. Front Genet 2021; 12:795706. [PMID: 34925467 PMCID: PMC8678459 DOI: 10.3389/fgene.2021.795706] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 11/17/2021] [Indexed: 12/17/2022] Open
Abstract
Infectious disease outbreaks are causing widespread declines of marine invertebrates including corals, sea stars, shrimps, and molluscs. Dermo is a lethal infectious disease of the eastern oyster Crassostrea virginica caused by the protist Perkinsus marinus. The Pacific oyster Crassostrea gigas is resistant to Dermo due to differences in the host-parasite interaction that is not well understood. We compared transcriptomic responses to P. marinus challenge in the two oysters at early and late infection stages. Dynamic and orchestrated regulation of large sets of innate immune response genes were observed in both species with remarkably similar patterns for most orthologs, although responses in C. virginica were stronger, suggesting strong or over-reacting immune response could be a cause of host mortality. Between the two species, several key immune response gene families differed in their expansion, sequence variation and/or transcriptional response to P. marinus, reflecting evolutionary divergence in host-parasite interaction. Of note, significant upregulation of inhibitors of apoptosis (IAPs) was observed in resistant C. gigas but not in susceptible C. virginica, suggesting upregulation of IAPs is an active defense mechanism, not a passive response orchestrated by P. marinus. Compared with C. gigas, C. virginica exhibited greater expansion of toll-like receptors (TLRs) and positive selection in P. marinus responsive TLRs. The C1q domain containing proteins (C1qDCs) with the galactose-binding lectin domain that is involved in P. marinus recognition, were only present and significantly upregulated in C. virginica. These results point to previously undescribed differences in host defense genes between the two oyster species that may account for the difference in susceptibility, providing an expanded portrait of the evolutionary dynamics of host-parasite interaction in lophotrochozoans that lack adaptive immunity. Our findings suggest that C. virginica and P. marinus have a history of coevolution and the recent outbreaks may be due to increased virulence of the parasite.
Collapse
Affiliation(s)
- Jiulin Chan
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology and Center of Deep Sea Research, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Lu Wang
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology and Center of Deep Sea Research, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- University of Chinese Academy of Sciences, College of Marine Science, Beijing, China
| | - Li Li
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology and Center of Deep Sea Research, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- University of Chinese Academy of Sciences, College of Marine Science, Beijing, China
| | - Kang Mu
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology and Center of Deep Sea Research, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- University of Chinese Academy of Sciences, College of Marine Science, Beijing, China
| | - David Bushek
- Haskin Shellfish Research Laboratory, Department of Marine and Coastal Sciences, Rutgers University, Port Norris, NJ, United States
| | - Yue Xu
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology and Center of Deep Sea Research, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Ximing Guo
- Haskin Shellfish Research Laboratory, Department of Marine and Coastal Sciences, Rutgers University, Port Norris, NJ, United States
| | - Guofan Zhang
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology and Center of Deep Sea Research, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- University of Chinese Academy of Sciences, College of Marine Science, Beijing, China
| | - Linlin Zhang
- CAS and Shandong Province Key Laboratory of Experimental Marine Biology and Center of Deep Sea Research, Center for Ocean Mega-Science, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- University of Chinese Academy of Sciences, College of Marine Science, Beijing, China
| |
Collapse
|
13
|
Hao Y, Mabry ME, Edger PP, Freeling M, Zheng C, Jin L, VanBuren R, Colle M, An H, Abrahams RS, Washburn JD, Qi X, Barry K, Daum C, Shu S, Schmutz J, Sankoff D, Barker MS, Lyons E, Pires JC, Conant GC. The contributions from the progenitor genomes of the mesopolyploid Brassiceae are evolutionarily distinct but functionally compatible. Genome Res 2021; 31:799-810. [PMID: 33863805 PMCID: PMC8092008 DOI: 10.1101/gr.270033.120] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Accepted: 03/05/2021] [Indexed: 01/08/2023]
Abstract
The members of the tribe Brassiceae share a whole-genome triplication (WGT), and one proposed model for its formation is a two-step pair of hybridizations producing hexaploid descendants. However, evidence for this model is incomplete, and the evolutionary and functional constraints that drove evolution after the hexaploidy are even less understood. Here, we report a new genome sequence of Crambe hispanica, a species sister to most sequenced Brassiceae. Using this new genome and three others that share the hexaploidy, we traced the history of gene loss after the WGT using the Polyploidy Orthology Inference Tool (POInT). We confirm the two-step formation model and infer that there was a significant temporal gap between those two allopolyploidizations, with about a third of the gene losses from the first two subgenomes occurring before the arrival of the third. We also, for the 90,000 individual genes in our study, make parental subgenome assignments, inferring, with measured uncertainty, from which of the progenitor genomes of the allohexaploidy each gene derives. We further show that each subgenome has a statistically distinguishable rate of homoeolog losses. There is little indication of functional distinction between the three subgenomes: the individual subgenomes show no patterns of functional enrichment, no excess of shared protein-protein or metabolic interactions between their members, and no biases in their likelihood of having experienced a recent selective sweep. We propose a "mix and match" model of allopolyploidy, in which subgenome origin drives homoeolog loss propensities but where genes from different subgenomes function together without difficulty.
Collapse
Affiliation(s)
- Yue Hao
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina 27695, USA
| | - Makenzie E Mabry
- Division of Biological Sciences, University of Missouri-Columbia, Columbia, Missouri 65211, USA
| | - Patrick P Edger
- Department of Horticulture, Michigan State University, East Lansing, Michigan 48824, USA
- Genetics and Genome Sciences, Michigan State University, East Lansing, Michigan 48824, USA
| | - Michael Freeling
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA
| | - Chunfang Zheng
- Department of Mathematics and Statistics, University of Ottawa, Ottawa, Ontario K1N 6N5, Canada
| | - Lingling Jin
- Department of Computer Science, University of Saskatchewan, Saskatoon, Saskatchewan S7N 5C9, Canada
| | - Robert VanBuren
- Department of Horticulture, Michigan State University, East Lansing, Michigan 48824, USA
- Plant Resilience Institute, Michigan State University, East Lansing, Michigan 48824, USA
| | - Marivi Colle
- Department of Horticulture, Michigan State University, East Lansing, Michigan 48824, USA
| | - Hong An
- Division of Biological Sciences, University of Missouri-Columbia, Columbia, Missouri 65211, USA
| | - R Shawn Abrahams
- Division of Biological Sciences, University of Missouri-Columbia, Columbia, Missouri 65211, USA
| | - Jacob D Washburn
- Plant Genetics Research Unit, USDA-ARS, Columbia, Missouri 65211, USA
| | - Xinshuai Qi
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA
| | - Kerrie Barry
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Christopher Daum
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Shengqiang Shu
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Jeremy Schmutz
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - David Sankoff
- Department of Mathematics and Statistics, University of Ottawa, Ottawa, Ontario K1N 6N5, Canada
| | - Michael S Barker
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA
| | - Eric Lyons
- School of Plant Sciences, University of Arizona, Tucson, Arizona 85721, USA
- BIO5 Institute, University of Arizona, Tucson, Arizona 85721, USA
| | - J Chris Pires
- Division of Biological Sciences, University of Missouri-Columbia, Columbia, Missouri 65211, USA
- Informatics Institute, University of Missouri-Columbia, Columbia, Missouri 65211, USA
| | - Gavin C Conant
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina 27695, USA
- Program in Genetics, North Carolina State University, Raleigh, North Carolina 27695, USA
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, USA
- Division of Animal Sciences, University of Missouri-Columbia, Columbia, Missouri 65211, USA
| |
Collapse
|
14
|
Dyson CJ, Goodisman MAD. Gene Duplication in the Honeybee: Patterns of DNA Methylation, Gene Expression, and Genomic Environment. Mol Biol Evol 2021; 37:2322-2331. [PMID: 32243528 DOI: 10.1093/molbev/msaa088] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2019] [Accepted: 03/27/2020] [Indexed: 12/12/2022] Open
Abstract
Gene duplication serves a critical role in evolutionary adaptation by providing genetic raw material to the genome. The evolution of duplicated genes may be influenced by epigenetic processes such as DNA methylation, which affects gene function in some taxa. However, the manner in which DNA methylation affects duplicated genes is not well understood. We studied duplicated genes in the honeybee Apis mellifera, an insect with a highly sophisticated social structure, to investigate whether DNA methylation was associated with gene duplication and genic evolution. We found that levels of gene body methylation were significantly lower in duplicate genes than in single-copy genes, implicating a possible role of DNA methylation in postduplication gene maintenance. Additionally, we discovered associations of gene body methylation with the location, length, and time since divergence of paralogous genes. We also found that divergence in DNA methylation was associated with divergence in gene expression in paralogs, although the relationship was not completely consistent with a direct link between DNA methylation and gene expression. Overall, our results provide further insight into genic methylation and how its association with duplicate genes might facilitate evolutionary processes and adaptation.
Collapse
Affiliation(s)
- Carl J Dyson
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA
| | | |
Collapse
|
15
|
Conant GC. The lasting after-effects of an ancient polyploidy on the genomes of teleosts. PLoS One 2020; 15:e0231356. [PMID: 32298330 PMCID: PMC7161988 DOI: 10.1371/journal.pone.0231356] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Accepted: 03/20/2020] [Indexed: 12/20/2022] Open
Abstract
The ancestor of most teleost fishes underwent a whole-genome duplication event three hundred million years ago. Despite its antiquity, the effects of this event are evident both in the structure of teleost genomes and in how the surviving duplicated genes still operate to drive form and function. I inferred a set of shared syntenic regions that survive from the teleost genome duplication (TGD) using eight teleost genomes and the outgroup gar genome (which lacks the TGD). I then phylogenetically modeled the TGD's resolution via shared and independent gene losses and applied a new simulation-based statistical test for the presence of bias toward the preservation of genes from one parental subgenome. On the basis of that test, I argue that the TGD was likely an allopolyploidy. I find that duplicate genes surviving from this duplication in zebrafish are less likely to function in early embryo development than are genes that have returned to single copy at some point in this species' history. The tissues these ohnologs are expressed in, as well as their biological functions, lend support to recent suggestions that the TGD was the source of a morphological innovation in the structure of the teleost retina. Surviving duplicates also appear less likely to be essential than singletons, despite the fact that their single-copy orthologs in mouse are no less essential than other genes.
Collapse
Affiliation(s)
- Gavin C. Conant
- Department of Biological Sciences, North Carolina State University, Raleigh, NC, United States of America
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, United States of America
- Program in Genetics, North Carolina State University, Raleigh, NC, United States of America
- Division of Animal Sciences, University of Missouri, Columbia, MO, United States of America
| |
Collapse
|
16
|
Abstract
The root-knot nematodes of the genus Meloidogyne are important and damaging parasites capable of infecting most flowering plants. Within this genus, several species of the Meloidogyne incognita group show evidence of paleopolyploidy in their genomes. We used our software tool POInT, the Polyploidy Orthology Inference Tool, to phylogenetically model the gene losses that followed that polyploidy. These models, and simulations based on them, show that three of these species (M. incognita, M. arenaria and M. javanica) descend from a single common hybridization event that yielded triplicated genomes with three distinguishable subgenomes. While one of the three subgenomes shows elevated gene loss rates relative to the other two, this subgenome does not show elevated sequence divergence. In all three species, ancestral loci where two of the three gene copies have been lost are less likely to have orthologs in Caenorhabditis elegans that are lethal when knocked down than are ancestral loci with surviving duplicate copies.
Collapse
|
17
|
Abrahams RS, Pires JC, Schranz ME. Genomic Origin and Diversification of the Glucosinolate MAM Locus. FRONTIERS IN PLANT SCIENCE 2020; 11:711. [PMID: 32582245 PMCID: PMC7289053 DOI: 10.3389/fpls.2020.00711] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 05/05/2020] [Indexed: 05/06/2023]
Abstract
Glucosinolates are a diverse group of plant metabolites that characterize the order Brassicales. The MAM locus is one of the most significant QTLs for glucosinolate diversity. However, most of what we understand about evolution at the locus is focused on only a few species and not within a phylogenetic context. In this study, we utilize a micro-synteny network and phylogenetic inference to investigate the origin and diversification of the MAM/IPMS gene family. We uncover unique MAM-like genes found at the orthologous locus in the Cleomaceae that shed light on the transition from IPMS to MAM. In the Brassicaceae, we identify six distinct MAM clades across Lineages I, II, and III. We characterize the evolutionary impact and consequences of local duplications, transpositions, whole genome duplications, and gene fusion events, generating several new hypothesizes on the function and diversity of the MAM locus.
Collapse
Affiliation(s)
- R. Shawn Abrahams
- Division of Biological Sciences, University of Missouri, Columbia, MO, United States
- Biosystematics Group, Wageningen University, Wageningen, Netherlands
| | - J. Chris Pires
- Division of Biological Sciences, University of Missouri, Columbia, MO, United States
| | - M. Eric Schranz
- Biosystematics Group, Wageningen University, Wageningen, Netherlands
- *Correspondence: M. Eric Schranz,
| |
Collapse
|
18
|
Ma J, Deng S, Chen L, Jia Z, Sang Z, Zhu Z, Ma L, Chen F. Gene duplication led to divergence of expression patterns, protein-protein interaction patterns and floral development functions of AGL6-like genes in the basal angiosperm Magnolia wufengensis (Magnoliaceae). TREE PHYSIOLOGY 2019; 39:861-876. [PMID: 31034013 DOI: 10.1093/treephys/tpz010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2018] [Revised: 01/07/2019] [Accepted: 01/31/2019] [Indexed: 06/09/2023]
Abstract
The MADS-box family genes play critical roles in the regulation of growth and development of flowering plants. AGAMOUS-LIKE 6 (AGL6)-like genes are one of the most enigmatic subfamilies of the MADS-box family because of highly variable expression patterns and ambiguous functions, which have long puzzled researchers. A lot of AGL6 homologs have been identified from gymnosperms and angiosperms. However, only a few have been characterized, especially for basal angiosperm taxa. Magnolia wufengensis is a woody basal angiosperm from the family Magnoliaceae. In the current study, the phylogenesis, expression and protein-protein interaction (PPI) patterns, and functions of two AGL6 homologs from M. wufengensis, MawuAGL6-1 and MawuAGL6-2, were analyzed. Phylogenetic analysis indicated that the two AGL6 duplicates may have arisen by gene duplication before the divergence of Magnoliaceae and Lauraceae, with the diversification of their expression and PPI patterns after gene duplication. Functional analysis revealed that, in addition to common functions in accelerating flowering, MawuAGL6-1 might be responsible for flower meristem determinacy, while MawuAGL6-2 is preferentially recruited to regulate tepal morphogenesis. These findings further advance our understanding of the evolution of phylogenesis, expression, interaction and functions of AGL6 lineage genes from basal angiosperms, as well as the entire AGL6 lineage genes, and the significance of AGL6 lineage genes in the evolution and biological diversity.
Collapse
Affiliation(s)
- Jiang Ma
- Ministry of Education Key Laboratory of Silviculture and Conservation, Forestry College, Beijing Forestry University, Beijing, PR China
- Key Laboratory of Three Gorges Regional Plant Genetics & Germplasm Enhancement (CTGU)/Biotechnology Research Center, China Three Gorges University, Yichang, PR China
| | - Shixin Deng
- Ministry of Education Key Laboratory of Silviculture and Conservation, Forestry College, Beijing Forestry University, Beijing, PR China
| | - Liyuan Chen
- Ministry of Education Key Laboratory of Silviculture and Conservation, Forestry College, Beijing Forestry University, Beijing, PR China
| | - Zhongkui Jia
- Ministry of Education Key Laboratory of Silviculture and Conservation, Forestry College, Beijing Forestry University, Beijing, PR China
| | - Ziyang Sang
- Forestry Bureau of Wufeng County, Wufeng, Hubei Province, PR China
| | - Zhonglong Zhu
- Wufeng Bo Ling Magnolia Wufengensis Technology Development Co., Ltd, Wufeng, Hubei Province, PR China
| | - Lvyi Ma
- Ministry of Education Key Laboratory of Silviculture and Conservation, Forestry College, Beijing Forestry University, Beijing, PR China
| | - Faju Chen
- Key Laboratory of Three Gorges Regional Plant Genetics & Germplasm Enhancement (CTGU)/Biotechnology Research Center, China Three Gorges University, Yichang, PR China
| |
Collapse
|
19
|
Costa RA, Martins RST, Capilla E, Anjos L, Power DM. Vertebrate SLRP family evolution and the subfunctionalization of osteoglycin gene duplicates in teleost fish. BMC Evol Biol 2018; 18:191. [PMID: 30545285 PMCID: PMC6293640 DOI: 10.1186/s12862-018-1310-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Accepted: 11/27/2018] [Indexed: 02/07/2023] Open
Abstract
Background Osteoglycin (OGN, a.k.a. mimecan) belongs to cluster III of the small leucine-rich proteoglycans (SLRP) of the extracellular matrix (ECM). In vertebrates OGN is a characteristic ECM protein of bone. In the present study we explore the evolution of SLRP III and OGN in teleosts that have a skeleton adapted to an aquatic environment. Results The SLRP gene family has been conserved since the separation of chondrichthyes and osteichthyes. Few gene duplicates of the SLRP III family exist even in the teleosts that experienced a specific whole genome duplication. One exception is ogn for which duplicate copies were identified in fish genomes. The ogn promoter sequence and in vitro mesenchymal stem cell (MSC) cultures suggest the duplicate ogn genes acquired divergent functions. In gilthead sea bream (Sparus aurata) ogn1 was up-regulated during osteoblast and myocyte differentiation in vitro, while ogn2 was severely down-regulated during bone-derived MSCs differentiation into adipocytes in vitro. Conclusions Overall, the phylogenetic analysis indicates that the SLRP III family in vertebrates has been under conservative evolutionary pressure. The retention of the ogn gene duplicates in teleosts was linked with the acquisition of different functions. The acquisition by OGN of functions other than that of a bone ECM protein occurred early in the vertebrate lineage. Electronic supplementary material The online version of this article (10.1186/s12862-018-1310-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- R A Costa
- Comparative Endocrinology and Integrative Biology Group, Centre of Marine Sciences, University of Algarve, Campus of Gambelas, 8005-139, Faro, Portugal
| | - R S T Martins
- Comparative Endocrinology and Integrative Biology Group, Centre of Marine Sciences, University of Algarve, Campus of Gambelas, 8005-139, Faro, Portugal.
| | - E Capilla
- Department of Cell Biology, Physiology and Immunology, Faculty of Biology, University of Barcelona, 08028, Barcelona, Spain
| | - L Anjos
- Comparative Endocrinology and Integrative Biology Group, Centre of Marine Sciences, University of Algarve, Campus of Gambelas, 8005-139, Faro, Portugal
| | - D M Power
- Comparative Endocrinology and Integrative Biology Group, Centre of Marine Sciences, University of Algarve, Campus of Gambelas, 8005-139, Faro, Portugal.
| |
Collapse
|
20
|
Blischak PD, Mabry ME, Conant GC, Pires JC. Integrating Networks, Phylogenomics, and Population Genomics for the Study of Polyploidy. ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS 2018. [DOI: 10.1146/annurev-ecolsys-121415-032302] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Duplication events are regarded as sources of evolutionary novelty, but our understanding of general trends for the long-term trajectory of additional genomic material is still lacking. Organisms with a history of whole genome duplication (WGD) offer a unique opportunity to study potential trends in the context of gene retention and/or loss, gene and network dosage, and changes in gene expression. In this review, we discuss the prevalence of polyploidy across the tree of life, followed by an overview of studies investigating genome evolution and gene expression. We then provide an overview of methods in network biology, phylogenomics, and population genomics that are critical for advancing our understanding of evolution post-WGD, highlighting the need for models that can accommodate polyploids. Finally, we close with a brief note on the importance of random processes in the evolution of polyploids with respect to neutral versus selective forces, ancestral polymorphisms, and the formation of autopolyploids versus allopolyploids.
Collapse
Affiliation(s)
- Paul D. Blischak
- Department of Evolution, Ecology, and Organismal Biology, The Ohio State University, Columbus, Ohio 43210, USA
| | - Makenzie E. Mabry
- Division of Biological Sciences and Bond Life Sciences Center, University of Missouri, Columbia, Missouri 65211, USA
| | - Gavin C. Conant
- Division of Animal Sciences, University of Missouri, Columbia, Missouri 65211, USA
- Current affiliation: Bioinformatics Research Center, Program in Genetics and Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina 27695, USA
| | - J. Chris Pires
- Division of Biological Sciences and Bond Life Sciences Center, University of Missouri, Columbia, Missouri 65211-7310, USA
| |
Collapse
|
21
|
Emery M, Willis MMS, Hao Y, Barry K, Oakgrove K, Peng Y, Schmutz J, Lyons E, Pires JC, Edger PP, Conant GC. Preferential retention of genes from one parental genome after polyploidy illustrates the nature and scope of the genomic conflicts induced by hybridization. PLoS Genet 2018; 14:e1007267. [PMID: 29590103 PMCID: PMC5891031 DOI: 10.1371/journal.pgen.1007267] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2017] [Revised: 04/09/2018] [Accepted: 02/21/2018] [Indexed: 11/18/2022] Open
Abstract
Polyploidy is increasingly seen as a driver of both evolutionary innovation and ecological success. One source of polyploid organisms' successes may be their origins in the merging and mixing of genomes from two different species (e.g., allopolyploidy). Using POInT (the Polyploid Orthology Inference Tool), we model the resolution of three allopolyploidy events, one from the bakers' yeast (Saccharomyces cerevisiae), one from the thale cress (Arabidopsis thaliana) and one from grasses including Sorghum bicolor. Analyzing a total of 21 genomes, we assign to every gene a probability for having come from each parental subgenome (i.e., derived from the diploid progenitor species), yielding orthologous segments across all genomes. Our model detects statistically robust evidence for the existence of biased fractionation in all three lineages, whereby genes from one of the two subgenomes were more likely to be lost than those from the other subgenome. We further find that a driver of this pattern of biased losses is the co-retention of genes from the same parental genome that share functional interactions. The pattern of biased fractionation after the Arabidopsis and grass allopolyploid events was surprisingly constant in time, with the same parental genome favored throughout the lineages' history. In strong contrast, the yeast allopolyploid event shows evidence of biased fractionation only immediately after the event, with balanced gene losses more recently. The rapid loss of functionally associated genes from a single subgenome is difficult to reconcile with the action of genetic drift and suggests that selection may favor the removal of specific duplicates. Coupled to the evidence for continuing, functionally-associated biased fractionation after the A. thaliana At-α event, we suggest that, after allopolyploidy, there are functional conflicts between interacting genes encoded in different subgenomes that are ultimately resolved through preferential duplicate loss.
Collapse
Affiliation(s)
- Marianne Emery
- Division of Biological Sciences, University of Missouri-Columbia, Columbia, Missouri, United States of America
| | - M. Madeline S. Willis
- Department of Biochemistry, University of Missouri-Columbia, Columbia, Missouri, United States of America
| | - Yue Hao
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Kerrie Barry
- Department of Energy Joint Genome Institute, Walnut Creek, California, United States of America
| | - Khouanchy Oakgrove
- Department of Energy Joint Genome Institute, Walnut Creek, California, United States of America
| | - Yi Peng
- Department of Energy Joint Genome Institute, Walnut Creek, California, United States of America
| | - Jeremy Schmutz
- Department of Energy Joint Genome Institute, Walnut Creek, California, United States of America
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama, United States of America
| | - Eric Lyons
- School of Plant Sciences, University of Arizona, Tucson, Arizona, United States of America
| | - J. Chris Pires
- Division of Biological Sciences, University of Missouri-Columbia, Columbia, Missouri, United States of America
- Informatics Institute, University of Missouri-Columbia, Columbia, Missouri, United States of America
- Bond Life Sciences Center, University of Missouri-Columbia, Columbia, Missouri, United States of America
| | - Patrick P. Edger
- Department of Horticulture, Michigan State University, East Lansing, Michigan, United States of America
- Ecology, Evolutionary Biology and Behavior, Michigan State University, East Lansing, Michigan, United States of America
| | - Gavin C. Conant
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, United States of America
- Division of Animal Sciences, University of Missouri-Columbia, Columbia, Missouri, United States of America
- Program in Genetics, North Carolina State University, Raleigh, North Carolina, United States of America
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
22
|
Fortunato A, Boddy A, Mallo D, Aktipis A, Maley CC, Pepper JW. Natural Selection in Cancer Biology: From Molecular Snowflakes to Trait Hallmarks. Cold Spring Harb Perspect Med 2017; 7:cshperspect.a029652. [PMID: 28148564 DOI: 10.1101/cshperspect.a029652] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Evolution by natural selection is the conceptual foundation for nearly every branch of biology and increasingly also for biomedicine and medical research. In cancer biology, evolution explains how populations of cells in tumors change over time. It is a fundamental question whether this evolutionary process is driven primarily by natural selection and adaptation or by other evolutionary processes such as founder effects and drift. In cancer biology, as in organismal evolutionary biology, there is controversy about this question and also about the use of adaptation through natural selection as a guiding framework for research. In this review, we discuss the differences and similarities between evolution among somatic cells versus evolution among organisms. We review what is known about the parameters and rate of evolution in neoplasms, as well as evidence for adaptation. We conclude that adaptation is a useful framework that accurately explains the defining characteristics of cancer. Further, convergent evolution through natural selection provides the only satisfying explanation both for how a group of diverse pathologies have enough in common to usefully share the descriptive label of "cancer" and for why this convergent condition becomes life-threatening.
Collapse
Affiliation(s)
- Angelo Fortunato
- Biodesign Center for Personalized Diagnostics, and School of Life Sciences, Arizona State University, Tempe, Arizona 85287
| | - Amy Boddy
- Department of Psychology, Arizona State University, Tempe, Arizona 85287
| | - Diego Mallo
- Biodesign Center for Personalized Diagnostics, and School of Life Sciences, Arizona State University, Tempe, Arizona 85287
| | - Athena Aktipis
- Department of Psychology, Arizona State University, Tempe, Arizona 85287.,Biodesign Center for Evolution and Medicine, Arizona State University, Tempe, Arizona 85287
| | - Carlo C Maley
- Biodesign Center for Personalized Diagnostics, and School of Life Sciences, Arizona State University, Tempe, Arizona 85287.,Centre for Evolution and Cancer, Institute of Cancer Research, London SM2 5NG, United Kingdom
| | - John W Pepper
- Biometry Research Group, Division of Cancer Prevention, National Cancer Institute, Rockville, Maryland 20850.,Santa Fe Institute, Santa Fe, New Mexico 87501
| |
Collapse
|
23
|
Tiley GP, Ané C, Burleigh JG. Evaluating and Characterizing Ancient Whole-Genome Duplications in Plants with Gene Count Data. Genome Biol Evol 2016; 8:1023-37. [PMID: 26988251 PMCID: PMC4860690 DOI: 10.1093/gbe/evw058] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Whole-genome duplications (WGDs) have helped shape the genomes of land plants, and recent evidence suggests that the genomes of all angiosperms have experienced at least two ancient WGDs. In plants, WGDs often are followed by rapid fractionation, in which many homeologous gene copies are lost. Thus, it can be extremely difficult to identify, let alone characterize, ancient WGDs. In this study, we use a new maximum likelihood estimator to test for evidence of ancient WGDs in land plants and estimate the fraction of new genes copies that are retained following a WGD using gene count data, the number of gene copies in gene families. We identified evidence of many putative ancient WGDs in land plants and found that the genome fractionation rates vary tremendously among ancient WGDs. Analyses of WGDs within Brassicales also indicate that background gene duplication and loss rates vary across land plants, and different gene families have different probabilities of being retained following a WGD. Although our analyses are largely robust to errors in duplication and loss rates and the choice of priors, simulations indicate that this method can have trouble detecting multiple WGDs that occur on the same branch, especially when the gene retention rates for ancient WGDs are very low. They also suggest that we should carefully evaluate evidence for some ancient plant WGD hypotheses.
Collapse
Affiliation(s)
| | - Cécile Ané
- Department of Statistics, University of Wisconsin-Madison Department of Botany, University of Wisconsin-Madison
| | | |
Collapse
|
24
|
Scienski K, Fay JC, Conant GC. Patterns of Gene Conversion in Duplicated Yeast Histones Suggest Strong Selection on a Coadapted Macromolecular Complex. Genome Biol Evol 2015; 7:3249-58. [PMID: 26560339 PMCID: PMC4700949 DOI: 10.1093/gbe/evv216] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
We find evidence for interlocus gene conversion in five duplicated histone genes from six yeast species. The sequences of these duplicated genes, surviving from the ancient genome duplication, show phylogenetic patterns inconsistent with the well-resolved orthology relationships inferred from a likelihood model of gene loss after the genome duplication. Instead, these paralogous genes are more closely related to each other than any is to its nearest ortholog. In addition to simulations supporting gene conversion, we also present evidence for elevated rates of radical amino acid substitutions along the branches implicated in the conversion events. As these patterns are similar to those seen in ribosomal proteins that have undergone gene conversion, we speculate that in cases where duplicated genes code for proteins that are a part of tightly interacting complexes, selection may favor the fixation of gene conversion events in order to maintain high protein identities between duplicated copies.
Collapse
Affiliation(s)
- Kathy Scienski
- Division of Animal Sciences, University of Missouri, Columbia Present address: Genetics Graduate Program, Texas A&M University, College Station, TX
| | - Justin C Fay
- Department of Genetics, Washington University Center for Genome Sciences and Systems Biology, Washington University
| | - Gavin C Conant
- Division of Animal Sciences, University of Missouri, Columbia Informatics Institute, University of Missouri, Columbia
| |
Collapse
|
25
|
Conant GC. Comparative genomics as a time machine: how relative gene dosage and metabolic requirements shaped the time-dependent resolution of yeast polyploidy. Mol Biol Evol 2014; 31:3184-93. [PMID: 25158798 DOI: 10.1093/molbev/msu250] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
Using a phylogenetic model of evolution after genome duplication (i.e., polyploidy) and 12 yeast genomes with a shared genome duplication, I show that the loss of duplicate genes after that duplication occurred in three phases. First, losses that occurred immediately after the event were biased toward genes functioning in DNA repair and organellar functions. Then, the main group of duplicate losses appear to have been shaped by a requirement to maintain balance in protein levels: There is a strong statistical association between the number of protein interactions a gene's product is involved in and its propensity to have remained in duplicate. Moreover, when duplicated genes with interactions were lost, it was more common than expected for both members of an interaction pair to have been lost on the same branch of the phylogeny. Finally, in the third phase of the resolution process, overretention of duplicated enzymes carrying high flux and of duplicated genes involved in transcriptional regulation became dominant. I speculate that initial retention of such genes by a requirement to maintain gene dosage set the stage for the later functional changes that then maintained these duplicates for long periods.
Collapse
Affiliation(s)
- Gavin C Conant
- Informatics Institute, University of Missouri, Columbia Division of Animal Sciences, University of Missouri, Columbia
| |
Collapse
|
26
|
McGrath CL, Gout JF, Johri P, Doak TG, Lynch M. Differential retention and divergent resolution of duplicate genes following whole-genome duplication. Genome Res 2014; 24:1665-75. [PMID: 25085612 PMCID: PMC4199370 DOI: 10.1101/gr.173740.114] [Citation(s) in RCA: 86] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The Paramecium aurelia complex is a group of 15 species that share at least three past whole-genome duplications (WGDs). The macronuclear genome sequences of P. biaurelia and P. sexaurelia are presented and compared to the published sequence of P. tetraurelia. Levels of duplicate-gene retention from the recent WGD differ by >10% across species, with P. sexaurelia losing significantly more genes than P. biaurelia or P. tetraurelia. In addition, historically high rates of gene conversion have homogenized WGD paralogs, probably extending the paralogs’ lifetimes. The probability of duplicate retention is positively correlated with GC content and expression level; ribosomal proteins, transcription factors, and intracellular signaling proteins are overrepresented among maintained duplicates. Finally, multiple sources of evidence indicate that P. sexaurelia diverged from the two other lineages immediately following, or perhaps concurrent with, the recent WGD, with approximately half of gene losses between P. tetraurelia and P. sexaurelia representing divergent gene resolutions (i.e., silencing of alternative paralogs), as expected for random duplicate loss between these species. Additionally, though P. biaurelia and P. tetraurelia diverged from each other much later, there are still more than 100 cases of divergent resolution between these two species. Taken together, these results indicate that divergent resolution of duplicate genes between lineages acts to reinforce reproductive isolation between species in the Paramecium aurelia complex.
Collapse
Affiliation(s)
- Casey L McGrath
- Department of Biology, Indiana University, Bloomington, Indiana 47408, USA
| | - Jean-Francois Gout
- Department of Biology, Indiana University, Bloomington, Indiana 47408, USA
| | - Parul Johri
- Department of Biology, Indiana University, Bloomington, Indiana 47408, USA
| | - Thomas G Doak
- Department of Biology, Indiana University, Bloomington, Indiana 47408, USA; National Center for Genome Analysis Support at Indiana University, Bloomington, Indiana 47408, USA
| | - Michael Lynch
- Department of Biology, Indiana University, Bloomington, Indiana 47408, USA;
| |
Collapse
|
27
|
Dhroso A, Korkin D, Conant GC. The yeast protein interaction network has a capacity for self-organization. FEBS J 2014; 281:3420-32. [PMID: 24924781 DOI: 10.1111/febs.12870] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2014] [Revised: 05/02/2014] [Accepted: 06/06/2014] [Indexed: 12/20/2022]
Abstract
The organization of the cellular interior gives rise to properties including metabolic channeling and micro-compartmentalization of signaling. Here, we use a lattice model of molecular crowding, together with literature-derived protein interactions and abundances, to describe the molecular organization and stoichiometry of local cellular regions, showing that physical protein-protein interactions induce emergent structures not seen when random interaction networks are modeled. Specifically, we find that the lattices give rise to micro-groups of enzymes on the surfaces of protein clusters. These arrangements of proteins are also robust to protein overexpression, while still showing evidence for expression tuning. Our results indicate that some of the complex organization of the cell may derive from simple rules of molecular aggregation and interaction.
Collapse
Affiliation(s)
- Andi Dhroso
- Department of Computer Science, University of Missouri, Columbia, MO, USA; Informatics Institute, University of Missouri, Columbia, MO, USA
| | | | | |
Collapse
|
28
|
Conant GC, Birchler JA, Pires JC. Dosage, duplication, and diploidization: clarifying the interplay of multiple models for duplicate gene evolution over time. CURRENT OPINION IN PLANT BIOLOGY 2014; 19:91-8. [PMID: 24907529 DOI: 10.1016/j.pbi.2014.05.008] [Citation(s) in RCA: 172] [Impact Index Per Article: 17.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/02/2014] [Revised: 05/06/2014] [Accepted: 05/08/2014] [Indexed: 05/04/2023]
Abstract
Requirements to maintain dosage balance shape many genome-scale patterns in organisms, including the resolution of whole genome duplications (WGD), as well as the varied effects of aneuploidy, segmental duplications, tandem duplications, gene copy number variations (CNV), and epigenetic marks. Like neofunctionalization and subfunctionalization, the impact of absolute and relative dosage varies over time. These variations are of particular importance in understanding the role of dosage in the evolution of polyploid organisms. Numerous investigations have found the consequences of polyploidy remain distinct from small-scale duplications (SSD). This observation is significant as all flowering plants have experienced at least two ancient polyploid events, and many angiosperm lineages have undergone additional rounds of polyploidy. Intriguingly, recent studies indicate a link between how epigenetic marks in recent allopolyploids may induce immediate changes in gene expression and the longer-term patterns of biased fractionation and chromosomal evolution. We argue that dosage effects represent one aspect of an emerging pluralistic framework, a framework that will use biophysics, genomic technologies, and systems-level models of cells to broaden our view of how genomes evolve.
Collapse
Affiliation(s)
- Gavin C Conant
- Division of Animal Sciences, University of Missouri, Columbia, MO 65211, United States; Informatics Institute, University of Missouri, Columbia, MO 65211, United States.
| | - James A Birchler
- Division of Biological Sciences, University of Missouri, Columbia, MO 65211, United States.
| | - J Chris Pires
- Informatics Institute, University of Missouri, Columbia, MO 65211, United States; Division of Biological Sciences, University of Missouri, Columbia, MO 65211, United States.
| |
Collapse
|
29
|
Abstract
The evolutionary mechanisms underlying duplicate gene maintenance and divergence remain highly debated. Epigenetic modifications, such as DNA methylation, may contribute to duplicate gene evolution by facilitating tissue-specific regulation. However, the role of epigenetic divergence on duplicate gene evolution remains little understood. Here we show, using comprehensive data across 10 diverse human tissues, that DNA methylation plays critical roles in several aspects of duplicate gene evolution. We first demonstrate that duplicate genes are initially heavily methylated, before gradually losing DNA methylation as they age. Within each pair, DNA methylation divergence between duplicate partners increases with evolutionary age. Importantly, tissue-specific DNA methylation of duplicates correlates with tissue-specific expression, implicating DNA methylation as a causative factor for functional divergence of duplicate genes. These patterns are apparent in promoters but not in gene bodies, in accord with the complex relationship between gene-body DNA methylation and transcription. Remarkably, many duplicate gene pairs exhibit consistent division of DNA methylation across multiple, divergent tissues: For the majority (73%) of duplicate gene pairs, one partner is always hypermethylated compared with the other. This is indicative of a common underlying determinant of DNA methylation. The division of DNA methylation is also consistent with their chromatin accessibility profiles. Moreover, at least two sequence motifs known to interact with the Sp1 transcription factor mark promoters of more hypomethylated duplicate partners. These results demonstrate critical roles of DNA methylation, as well as complex interaction between genome and epigenome, on duplicate gene evolution.
Collapse
|
30
|
Watching the grin fade: tracing the effects of polyploidy on different evolutionary time scales. Semin Cell Dev Biol 2013; 24:320-31. [PMID: 23466286 DOI: 10.1016/j.semcdb.2013.02.002] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2012] [Revised: 02/05/2013] [Accepted: 02/07/2013] [Indexed: 12/13/2022]
Abstract
Polyploidy, or whole-genome duplication (WGD), is a recurrent mutation both in cell lineages and over evolutionary time. By globally changing the relationship between gene copy number and other cellular entities, it can induce dramatic changes at the cellular and phenotypic level. Perhaps surprisingly, then, the insights that these events can bring to understanding other cellular features are not as well appreciated as they could be. In this review, we draw on examples of polyploidy from animals, plants and yeast to explore how investigations of polyploid cells have improved our understanding of the cell cycle, biological network complexity, metabolic phenotypes and tumor biology. We argue that the study of polyploidy across organisms, cell types, and time scales serves not only as a window into basic cell biology, but also as a basis for a predictive biology with applications ranging from crop improvement to treating cancer.
Collapse
|
31
|
Polyploidy and the evolution of complex traits. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2012; 2012:292068. [PMID: 22900230 PMCID: PMC3413983 DOI: 10.1155/2012/292068] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/15/2012] [Revised: 05/29/2012] [Accepted: 06/05/2012] [Indexed: 12/14/2022]
Abstract
We explore how whole-genome duplications (WGDs) may have given rise to complex innovations in cellular networks, innovations that could not have evolved through sequential single-gene duplications. We focus on two classical WGD events, one in bakers' yeast and the other at the base of vertebrates (i.e., two rounds of whole-genome duplication: 2R-WGD). Two complex adaptations are discussed in detail: aerobic ethanol fermentation in yeast and the rewiring of the vertebrate developmental regulatory network through the 2R-WGD. These two examples, derived from diverged branches on the eukaryotic tree, boldly underline the evolutionary potential of WGD in facilitating major evolutionary transitions. We close by arguing that the evolutionary importance of WGD may require updating certain aspects of modern evolutionary theory, perhaps helping to synthesize a new evolutionary systems biology.
Collapse
|
32
|
Casola C, Conant GC, Hahn MW. Very low rate of gene conversion in the yeast genome. Mol Biol Evol 2012; 29:3817-26. [PMID: 22844073 DOI: 10.1093/molbev/mss192] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Gene duplication is a major driver of organismal adaptation and evolution and plays an important role in multiple human diseases. Whole-genome analyses have shown similar and high rates of gene duplication across a variety of eukaryotic species. Most of these studies, however, did not address the possible impact of interlocus gene conversion (IGC) on the evolution of gene duplicates. Because IGC homogenizes pairs of duplicates, widespread conversion would cause gene duplication events that happened long ago to appear more recent, resulting in artificially high estimates of duplication rates. Although the majority of genome-wide studies (including in the budding yeast Saccharomyces cerevisiae [Scer]) point to levels of IGC between paralogs ranging from 2% to 18%, Gao and Innan (Gao LZ, Innan H. 2004. Very low gene duplication rate in the yeast genome. Science 306:1367-1370.) found that gene conversion in yeast affected >80% of paralog pairs. If conversion rates really are this high, it would imply that the rate of gene duplication in eukaryotes is much lower than previously reported. In this work, we apply four different methodologies-including one approach that closely mirrors Gao and Innan's method-to estimate the level of IGC in Scer. Our analyses point to a maximum conversion level of 13% between paralogs in this species, in close agreement with most estimates of IGC in eukaryotes. We also show that the exceedingly high levels of conversion found previously derive from application of an accurate method to an inappropriate data set. In conclusion, our work provides the most striking evidence to date supporting the reduced incidence of IGC among Scer paralogs and sets up a framework for future analyses in other eukaryotes.
Collapse
|
33
|
Woodhouse MR, Tang H, Freeling M. Different gene families in Arabidopsis thaliana transposed in different epochs and at different frequencies throughout the rosids. THE PLANT CELL 2011; 23:4241-53. [PMID: 22180627 PMCID: PMC3269863 DOI: 10.1105/tpc.111.093567] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Certain types of gene families, such as those encoding most families of transcription factors, maintain their chromosomal syntenic positions throughout angiosperm evolutionary time. Other nonsyntenic gene families are prone to deletion, tandem duplication, and transposition. Here, we describe the chromosomal positional history of all genes in Arabidopsis thaliana throughout the rosid superorder. We introduce a public database where researchers can look up the positional history of their favorite A. thaliana gene or gene family. Finally, we show that specific gene families transposed at specific points in evolutionary time, particularly after whole-genome duplication events in the Brassicales, and suggest that genes in mobile gene families are under different selection pressure than syntenic genes.
Collapse
Affiliation(s)
- Margaret R Woodhouse
- Department of Plant and Microbial Biology, University of California, Berkeley, California 94720, USA.
| | | | | |
Collapse
|
34
|
Sugihara C, Hisatomi T, Kodama T, Tsuboi M. The GAL10 gene is located 40 kbp away from the GAL7-GAL1 region in the yeast Kazachstania naganishii. Curr Microbiol 2011; 63:366-71. [PMID: 21822965 DOI: 10.1007/s00284-011-9988-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2011] [Accepted: 07/17/2011] [Indexed: 10/17/2022]
Abstract
Of the genes involved in galactose metabolism, GAL7, GAL10, and GAL1 are tightly linked in this order on chromosome II in Saccharomyces cerevisiae. While several species of the order Saccharomycetales have similar gene organization, Kazachstania naganishii is unique, in which GAL7 and GAL1 are close to each other whereas GAL10 is substantially apart from them on chromosome XI. In this study, we inserted the recognition sequence of I-SceI homing-endonuclease into GAL10 and also into the intervening segment of GAL7-GAL1. By cleaving chromosome DNA of the gene-manipulated strain with I-SceI, we obtained evidence that chromosome XI (610 kbp) was replaced with three fragments (305, 265, and 40 kbp). Using appropriate probes, we further found that GAL10 was about 40 kbp apart from the GAL7-GAL1 cluster and that orientation of GAL10 was reversed comparing to the S. cerevisiae counter part. We, therefore, contend that comparison of the organization of the GAL cluster among Saccharomycetales is of importance to elucidate evolution of chromosomes and that the experimental scheme developed in this study is useful for this line of investigation.
Collapse
Affiliation(s)
- Chisa Sugihara
- Department of Biotechnology, Faculty of Life Sciences and Biotechnology, Fukuyama University, Gakuen-cho, Fukuyama, Hiroshima, Japan
| | | | | | | |
Collapse
|
35
|
Evangelisti AM, Conant GC. Nonrandom survival of gene conversions among yeast ribosomal proteins duplicated through genome doubling. Genome Biol Evol 2010; 2:826-34. [PMID: 20966100 PMCID: PMC2988524 DOI: 10.1093/gbe/evq067] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
By comparing the patterns of evolution in the coding and upstream noncoding regions of yeast ribosomal protein (RP) genes duplicated in a genome duplication, we find that although nonsynonymous sites in the coding sequences show strong evidence for the fixation of recent gene conversion events, similar patterns are less evident among the synonymous positions and noncoding regulatory elements. This result suggests a potential explanation for the somewhat puzzling fact that duplicated RP genes are not functionally redundant despite their very high protein sequence identity. An analysis of the patterns of regulatory network evolution after genome duplication also indicates that the duplicated proteins have diverged considerably in expression despite their similar protein sequences.
Collapse
|
36
|
Souciet JL, Dujon B, Gaillardin C, Johnston M, Baret PV, Cliften P, Sherman DJ, Weissenbach J, Westhof E, Wincker P, Jubin C, Poulain J, Barbe V, Ségurens B, Artiguenave F, Anthouard V, Vacherie B, Val ME, Fulton RS, Minx P, Wilson R, Durrens P, Jean G, Marck C, Martin T, Nikolski M, Rolland T, Seret ML, Casarégola S, Despons L, Fairhead C, Fischer G, Lafontaine I, Leh V, Lemaire M, de Montigny J, Neuvéglise C, Thierry A, Blanc-Lenfle I, Bleykasten C, Diffels J, Fritsch E, Frangeul L, Goëffon A, Jauniaux N, Kachouri-Lafond R, Payen C, Potier S, Pribylova L, Ozanne C, Richard GF, Sacerdot C, Straub ML, Talla E. Comparative genomics of protoploid Saccharomycetaceae. Genome Res 2009; 19:1696-709. [PMID: 19525356 DOI: 10.1101/gr.091546.109] [Citation(s) in RCA: 180] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Our knowledge of yeast genomes remains largely dominated by the extensive studies on Saccharomyces cerevisiae and the consequences of its ancestral duplication, leaving the evolution of the entire class of hemiascomycetes only partly explored. We concentrate here on five species of Saccharomycetaceae, a large subdivision of hemiascomycetes, that we call "protoploid" because they diverged from the S. cerevisiae lineage prior to its genome duplication. We determined the complete genome sequences of three of these species: Kluyveromyces (Lachancea) thermotolerans and Saccharomyces (Lachancea) kluyveri (two members of the newly described Lachancea clade), and Zygosaccharomyces rouxii. We included in our comparisons the previously available sequences of Kluyveromyces lactis and Ashbya (Eremothecium) gossypii. Despite their broad evolutionary range and significant individual variations in each lineage, the five protoploid Saccharomycetaceae share a core repertoire of approximately 3300 protein families and a high degree of conserved synteny. Synteny blocks were used to define gene orthology and to infer ancestors. Far from representing minimal genomes without redundancy, the five protoploid yeasts contain numerous copies of paralogous genes, either dispersed or in tandem arrays, that, altogether, constitute a third of each genome. Ancient, conserved paralogs as well as novel, lineage-specific paralogs were identified.
Collapse
Affiliation(s)
-
- Université de Strasbourg, CNRS UMR, France.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
37
|
Conant GC, Stadler PF. Solvent exposure imparts similar selective pressures across a range of yeast proteins. Mol Biol Evol 2009; 26:1155-61. [PMID: 19233963 DOI: 10.1093/molbev/msp031] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We study how an amino acid residue's solvent exposure influences its propensity for substitution by analyzing multiple alignments of 61 yeast genes for which the crystal structure is known. We find that the selective constraint on the interior residues is on average 10 times that of residues on the surface. Surprisingly, there is no correlation between the overall selective constraint observed for a protein alignment and the ratio of constraints on interior and surface residues. By modeling the selective constraint on several amino acid properties, we show that although residue volume and hydropathy are strongly conserved across most alignments, there is little variation in interior versus surface conservation for these two properties. By contrast, residue charge (isoelectric point) is less generally conserved when considering the protein as a whole but shows a strong constraint against the introduction of charged residues into the protein interior.
Collapse
Affiliation(s)
- Gavin C Conant
- Division of Animal Sciences and Informatics Institute, University of Missouri-Columbia, USA.
| | | |
Collapse
|
38
|
Current awareness on yeast. Yeast 2006. [DOI: 10.1002/yea.1617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|