1
|
VanKuren NW, Chen J, Long M. Sexual conflict drive in the rapid evolution of new gametogenesis genes. Semin Cell Dev Biol 2024; 159-160:27-37. [PMID: 38309142 DOI: 10.1016/j.semcdb.2024.01.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 01/19/2024] [Accepted: 01/19/2024] [Indexed: 02/05/2024]
Abstract
The evolutionary forces underlying the rapid evolution in sequences and functions of new genes remain a mystery. Adaptation by natural selection explains the evolution of some new genes. However, many new genes perform sex-biased functions that have rapidly evolved over short evolutionary time scales, suggesting that new gene evolution may often be driven by conflicting selective pressures on males and females. It is well established that such sexual conflict (SC) plays a central role in maintaining phenotypic and genetic variation within populations, but the role of SC in driving new gene evolution remains essentially unknown. This review explores the connections between SC and new gene evolution through discussions of the concept of SC, the phenotypic and genetic signatures of SC in evolving populations, and the molecular mechanisms by which SC could drive the evolution of new genes. We synthesize recent work in this area with a discussion of the case of Apollo and Artemis, two extremely young genes (<200,000 years) in Drosophila melanogaster, which offered the first empirical insights into the evolutionary process by which SC could drive the evolution of new genes. These new duplicate genes exhibit the hallmarks of sexually antagonistic selection: rapid DNA and protein sequence evolution, essential sex-specific functions in gametogenesis, and complementary sex-biased expression patterns. Importantly, Apollo is essential for male fitness but detrimental to female fitness, while Artemis is essential for female fitness but detrimental to male fitness. These sexually antagonistic fitness effects and complementary changes to expression, sequence, and function suggest that these duplicates were selected for mitigating SC, but that SC has not been fully resolved. Finally, we propose Sexual Conflict Drive as a self-driven model to interpret the rapid evolution of new genes, explain the potential for SC and sexually antagonistic selection to contribute to long-term evolution, and suggest its utility for understanding the rapid evolution of new genes in gametogenesis.
Collapse
Affiliation(s)
- Nicholas W VanKuren
- Department of Ecology and Evolution, The University of Chicago, United States.
| | - Jianhai Chen
- Department of Ecology and Evolution, The University of Chicago, United States
| | - Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, United States.
| |
Collapse
|
2
|
Coronado-Zamora M, González J. Transposons contribute to the functional diversification of the head, gut, and ovary transcriptomes across Drosophila natural strains. Genome Res 2023; 33:1541-1553. [PMID: 37793782 PMCID: PMC10620055 DOI: 10.1101/gr.277565.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 08/08/2023] [Indexed: 10/06/2023]
Abstract
Transcriptomes are dynamic, with cells, tissues, and body parts expressing particular sets of transcripts. Transposable elements (TEs) are a known source of transcriptome diversity; however, studies often focus on a particular type of chimeric transcript, analyze single body parts or cell types, or are based on incomplete TE annotations from a single reference genome. In this work, we have implemented a method based on de novo transcriptome assembly that minimizes the potential sources of errors while identifying a comprehensive set of gene-TE chimeras. We applied this method to the head, gut, and ovary dissected from five Drosophila melanogaster natural strains, with individual reference genomes available. We found that ∼19% of body part-specific transcripts are gene-TE chimeras. Overall, chimeric transcripts contribute a mean of 43% to the total gene expression, and they provide protein domains for DNA binding, catalytic activity, and DNA polymerase activity. Our comprehensive data set is a rich resource for follow-up analysis. Moreover, because TEs are present in virtually all species sequenced to date, their role in spatially restricted transcript expression is likely not exclusive to the species analyzed in this work.
Collapse
Affiliation(s)
| | - Josefa González
- Institute of Evolutionary Biology, CSIC, UPF, Barcelona 08003, Spain
| |
Collapse
|
3
|
Koch TL, Torres JP, Baskin RP, Salcedo PF, Chase K, Olivera BM, Safavi-Hemami H. A toxin-based approach to neuropeptide and peptide hormone discovery. Front Mol Neurosci 2023; 16:1176662. [PMID: 37720554 PMCID: PMC10501145 DOI: 10.3389/fnmol.2023.1176662] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 08/15/2023] [Indexed: 09/19/2023] Open
Abstract
Peptide hormones and neuropeptides form a diverse class of bioactive secreted molecules that control essential processes in animals. Despite breakthroughs in peptide discovery, many signaling peptides remain undiscovered. Recently, we demonstrated the use of somatostatin-mimicking toxins from cone snails to identify the invertebrate ortholog of somatostatin. Here, we show that this toxin-based approach can be systematically applied to discover other unknown secretory peptides that are likely to have signaling function. Using large sequencing datasets, we searched for homologies between cone snail toxins and secreted proteins from the snails' prey. We identified and confirmed expression of five toxin families that share strong similarities with unknown secretory peptides from mollusks and annelids and in one case also from ecdysozoans. Based on several lines of evidence we propose that these peptides likely act as signaling peptides that serve important physiological functions. Indeed, we confirmed that one of the identified peptides belongs to the family of crustacean hyperglycemic hormone, a peptide not previously observed in Spiralia. We propose that this discovery pipeline can be broadly applied to other systems in which one organism has evolved molecules to manipulate the physiology of another.
Collapse
Affiliation(s)
- Thomas Lund Koch
- Department of Biomedical Sciences, University of Copenhagen, Copenhagen, Denmark
- Department of Biochemistry, University of Utah, Salt Lake City, UT, United States
| | - Joshua P. Torres
- Department of Biomedical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Robert P. Baskin
- School of Biological Sciences, University of Utah, Salt Lake City, UT, United States
- The Ohio State University College of Medicine, Columbus, OH, United States
| | - Paula Flórez Salcedo
- Department of Neurobiology, University of Utah, Salt Lake City, UT, United States
| | - Kevin Chase
- School of Biological Sciences, University of Utah, Salt Lake City, UT, United States
| | - Baldomero M. Olivera
- School of Biological Sciences, University of Utah, Salt Lake City, UT, United States
| | - Helena Safavi-Hemami
- Department of Biomedical Sciences, University of Copenhagen, Copenhagen, Denmark
- Department of Biochemistry, University of Utah, Salt Lake City, UT, United States
- School of Biological Sciences, University of Utah, Salt Lake City, UT, United States
| |
Collapse
|
4
|
Titus-McQuillan JE, Nanni AV, McIntyre LM, Rogers RL. Estimating transcriptome complexities across eukaryotes. BMC Genomics 2023; 24:254. [PMID: 37170194 PMCID: PMC10173493 DOI: 10.1186/s12864-023-09326-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 04/20/2023] [Indexed: 05/13/2023] Open
Abstract
BACKGROUND Genomic complexity is a growing field of evolution, with case studies for comparative evolutionary analyses in model and emerging non-model systems. Understanding complexity and the functional components of the genome is an untapped wealth of knowledge ripe for exploration. With the "remarkable lack of correspondence" between genome size and complexity, there needs to be a way to quantify complexity across organisms. In this study, we use a set of complexity metrics that allow for evaluating changes in complexity using TranD. RESULTS We ascertain if complexity is increasing or decreasing across transcriptomes and at what structural level, as complexity varies. In this study, we define three metrics - TpG, EpT, and EpG- to quantify the transcriptome's complexity that encapsulates the dynamics of alternative splicing. Here we compare complexity metrics across 1) whole genome annotations, 2) a filtered subset of orthologs, and 3) novel genes to elucidate the impacts of orthologs and novel genes in transcript model analysis. Effective Exon Number (EEN) issued to compare the distribution of exon sizes within transcripts against random expectations of uniform exon placement. EEN accounts for differences in exon size, which is important because novel gene differences in complexity for orthologs and whole-transcriptome analyses are biased towards low-complexity genes with few exons and few alternative transcripts. CONCLUSIONS With our metric analyses, we are able to quantify changes in complexity across diverse lineages with greater precision and accuracy than previous cross-species comparisons under ortholog conditioning. These analyses represent a step toward whole-transcriptome analysis in the emerging field of non-model evolutionary genomics, with key insights for evolutionary inference of complexity changes on deep timescales across the tree of life. We suggest a means to quantify biases generated in ortholog calling and correct complexity analysis for lineage-specific effects. With these metrics, we directly assay the quantitative properties of newly formed lineage-specific genes as they lower complexity.
Collapse
Affiliation(s)
- James E Titus-McQuillan
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, 28223, USA.
| | - Adalena V Nanni
- Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, 32611, USA
- University of Florida Genetics Institute, University of Florida, Gainesville, FL, 32611, USA
| | - Lauren M McIntyre
- Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, 32611, USA
- University of Florida Genetics Institute, University of Florida, Gainesville, FL, 32611, USA
| | - Rebekah L Rogers
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
| |
Collapse
|
5
|
Yang Y, Shi L, Xu X, Wen J, Xie T, Li H, Li X, Chen M, Dou X, Yuan C, Song H, Xie B, Tao Y. Spermidine Synthase and Saccharopine Reductase Have Co-Expression Patterns Both in Basidiomycetes with Fusion Form and Ascomycetes with Separate Form. J Fungi (Basel) 2023; 9:jof9030352. [PMID: 36983520 PMCID: PMC10051792 DOI: 10.3390/jof9030352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Revised: 03/07/2023] [Accepted: 03/08/2023] [Indexed: 03/15/2023] Open
Abstract
Gene fusion is a process through which two or more distinct genes are fused into a single chimeric gene. Unlike most harmful fusion genes in cancer cells, in this study, we first found that spermidine synthetase- (SPDS, catalyst of spermidine biosynthesis) and saccharopine reductase- (SR, catalyst of the penultimate step of lysine biosynthesis) encoding genes form a natural chimeric gene, FfSpdsSr, in Flammulina filiformis. Through the cloning of full-length ORFs in different strains and the analysis of alternative splicing in developmental stages, FfSpdsSr has only one copy and unique transcript encoding chimeric SPDS-SR in F. filiformis. By an orthologous gene search of SpdsSr in more than 80 fungi, we found that the chimeric SpdsSr exists in basidiomycetes, while the two separate Spds and Sr independently exist in ascomycetes, chytridiomycetes, and oomycetes. Further, the transcript level of FfSpdsSr was investigated in different developmental stages and under some common environmental factors and stresses by RT-qPCR. The results showed that FfSpdsSr mainly up-regulated in the elongation stage and pileus development of F. filiformis, as well as under blue light, high temperature, H2O2, and MeJA treatments. Moreover, a total of 15 sets of RNA-Seq data, including 218 samples of Neurospora crassa, were downloaded from the GEO database and used to analyze the expression correlation of NcSpds and NcSr. The results showed that the separate NcSpds and NcSr shared highly similar co-expression patterns in the samples with different strains and different nutritional and environmental condition treatments. The chimeric SpdsSr in basidiomycetes and the co-expression pattern of the Spds and Sr in N. crassa indicate the special link of spermidine and lysine in fungi, which may play an important role in the growth and development of fruiting body and in response to the multiple environmental factors and abiotic stresses.
Collapse
Affiliation(s)
- Yayong Yang
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Mycological Research Center, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Lei Shi
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Mycological Research Center, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Xinyu Xu
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Mycological Research Center, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Jin Wen
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Mycological Research Center, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Tianyue Xie
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Mycological Research Center, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Hui Li
- Institute of Cash Crops, Hebei Academy of Agriculture and Forestry Sciences, Shijiazhuang 050051, China
| | - Xiaoyu Li
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Mycological Research Center, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Mengyu Chen
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Mycological Research Center, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Xinyi Dou
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Mycological Research Center, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Chengjin Yuan
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Mycological Research Center, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Hanbing Song
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Mycological Research Center, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Baogui Xie
- Mycological Research Center, Fujian Agriculture and Forestry University, Fuzhou 350002, China
| | - Yongxin Tao
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Mycological Research Center, Fujian Agriculture and Forestry University, Fuzhou 350002, China
- Correspondence: ; Tel.: +86-0591-83789281
| |
Collapse
|
6
|
Rogers RL, Grizzard SL, Garner JT. Strong, Recent Selective Sweeps Reshape Genetic Diversity in Freshwater Bivalve Megalonaias nervosa. Mol Biol Evol 2023; 40:7026026. [PMID: 36738170 PMCID: PMC9976758 DOI: 10.1093/molbev/msad024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Revised: 01/16/2023] [Accepted: 01/24/2023] [Indexed: 02/05/2023] Open
Abstract
Freshwater Unionid bivalves have recently faced ecological upheaval through pollution, barriers to dispersal, harvesting, and changes in fish-host prevalence. Currently, over 70% of species in North America are threatened, endangered or extinct. To characterize the genetic response to recent selective pressures, we collected population genetic data for one successful bivalve species, Megalonaias nervosa. We identify megabase-sized regions that are nearly monomorphic across the population, signals of strong, recent selection reshaping diversity across 73 Mb total. These signatures of selection are greater than is commonly seen in population genetic models. We observe 102 duplicate genes with high dN/dS on terminal branches among regions with sweeps, suggesting that gene duplication is a causative mechanism of recent adaptation in M. nervosa. Genes in sweeps reflect functional classes important for Unionid survival, including anticoagulation genes important for fish host parasitization, detox genes, mitochondria management, and shell formation. We identify sweeps in regions with no known functional impacts, suggesting mechanisms of adaptation that deserve greater attention in future work on species survival. In contrast, polymorphic transposable elements (TEs) appear to be detrimental and underrepresented among regions with sweeps. TE site frequency spectra are skewed toward singleton variants, and TEs among regions with sweeps are present at low frequency. Our work suggests that duplicate genes are an essential source of genetic novelty that has helped this species succeed in environments where others have struggled. These results suggest that gene duplications deserve greater attention in non-model population genomics, especially in species that have recently faced sudden environmental challenges.
Collapse
Affiliation(s)
- Rebekah L Rogers
- Department of Bioinformatics and Genomics, University of North Carolina, Charlotte, NC 28223, USA
| | | | - Jeffrey T Garner
- Division of Wildlife and Freshwater Fisheries, Alabama Department of Conservation and Natural Resources, Florence, AL, USA
| |
Collapse
|
7
|
Li J, Shen J, Wang R, Chen Y, Zhang T, Wang H, Guo C, Qi J. The nearly complete assembly of the Cercis chinensis genome and Fabaceae phylogenomic studies provide insights into new gene evolution. PLANT COMMUNICATIONS 2023; 4:100422. [PMID: 35957520 PMCID: PMC9860166 DOI: 10.1016/j.xplc.2022.100422] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 08/02/2022] [Accepted: 08/05/2022] [Indexed: 05/27/2023]
Abstract
Fabaceae is a large family of angiosperms with high biodiversity that contains a variety of economically important crops and model plants for the study of biological nitrogen fixation. Polyploidization events have been extensively studied in some Fabaceae plants, but the occurrence of new genes is still concealed, owing to a lack of genomic information on certain species of the basal clade of Fabaceae. Cercis chinensis (Cercidoideae) is one such species; it diverged earliest from Fabaceae and is essential for phylogenomic studies and new gene predictions in Fabaceae. To facilitate genomic studies on Fabaceae, we performed genome sequencing of C. chinensis and obtained a 352.84 Mb genome, which was further assembled into seven pseudochromosomes with 30 612 predicted protein-coding genes. Compared with other legume genomes, that of C. chinensis exhibits no lineage-specific polyploidization event. Further phylogenomic analyses of 22 legumes and 11 other angiosperms revealed that many gene families are lineage specific before and after the diversification of Fabaceae. Among them, dozens of genes are candidates for new genes that have evolved from intergenic regions and are thus regarded as de novo-originated genes. They differ significantly from established genes in coding sequence length, exon number, guanine-cytosine content, and expression patterns among tissues. Functional analysis revealed that many new genes are related to asparagine metabolism. This study represents an important advance in understanding the evolutionary pattern of new genes in legumes and provides a valuable resource for plant phylogenomic studies.
Collapse
Affiliation(s)
- Jinglong Li
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Jingting Shen
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Rui Wang
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Yamao Chen
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Taikui Zhang
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Haifeng Wang
- College of Agriculture, Guangxi University, Nanning 530004, China
| | - Chunce Guo
- Jiangxi Provincial Key Laboratory for Bamboo Germplasm Resources and Utilization, Forestry College, Jiangxi Agricultural University, Nanchang 330045, China
| | - Ji Qi
- State Key Laboratory of Genetic Engineering, Institute of Plant Biology, School of Life Sciences, Fudan University, Shanghai 200433, China.
| |
Collapse
|
8
|
Zhou Y, Zhang C, Zhang L, Ye Q, Liu N, Wang M, Long G, Fan W, Long M, Wing RA. Gene fusion as an important mechanism to generate new genes in the genus Oryza. Genome Biol 2022; 23:130. [PMID: 35706016 PMCID: PMC9199173 DOI: 10.1186/s13059-022-02696-w] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 05/30/2022] [Indexed: 11/16/2022] Open
Abstract
Background Events of gene fusion have been reported in several organisms. However, the general role of gene fusion as part of new gene origination remains unknown. Results We conduct genome-wide interrogations of four Oryza genomes by designing and implementing novel pipelines to detect fusion genes. Based on the phylogeny of ten plant species, we detect 310 fusion genes across four Oryza species. The estimated rate of origination of fusion genes in the Oryza genus is as high as 63 fusion genes per species per million years, which is fixed at 16 fusion genes per species per million years and much higher than that in flies. By RNA sequencing analysis, we find more than 44% of the fusion genes are expressed and 90% of gene pairs show strong signals of purifying selection. Further analysis of CRISPR/Cas9 knockout lines indicates that newly formed fusion genes regulate phenotype traits including seed germination, shoot length and root length, suggesting the functional significance of these genes. Conclusions We detect new fusion genes that may drive phenotype evolution in Oryza. This study provides novel insights into the genome evolution of Oryza. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02696-w.
Collapse
Affiliation(s)
- Yanli Zhou
- Germplasm Bank of Wild species, Kunming Institute of Botany, Chinese Academy of Science, Kunming, Yunnan, 650201, China
| | - Chengjun Zhang
- Germplasm Bank of Wild species, Kunming Institute of Botany, Chinese Academy of Science, Kunming, Yunnan, 650201, China. .,Department of Ecology and Evolution, The University of Chicago, 1101 E. 57th Street, Chicago, IL, 60637, USA.
| | - Li Zhang
- Department of Ecology and Evolution, The University of Chicago, 1101 E. 57th Street, Chicago, IL, 60637, USA.,Chinese Institute for Brain Research, (CIBR), Beijing, 102206, China
| | - Qiannan Ye
- Germplasm Bank of Wild species, Kunming Institute of Botany, Chinese Academy of Science, Kunming, Yunnan, 650201, China
| | - Ningyawen Liu
- Germplasm Bank of Wild species, Kunming Institute of Botany, Chinese Academy of Science, Kunming, Yunnan, 650201, China
| | - Muhua Wang
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA.,State Key Laboratory for Biocontrol, School of Marine Sciences, Sun Yat-sen University, Zhuhai, 519000, China
| | - Guangqiang Long
- Key Laboratory of Medicinal Plant Biology of Yunnan Province, Yunnan Agricultural University, Kunming, Yunnan, 650201, China
| | - Wei Fan
- Key Laboratory of Medicinal Plant Biology of Yunnan Province, Yunnan Agricultural University, Kunming, Yunnan, 650201, China
| | - Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, 1101 E. 57th Street, Chicago, IL, 60637, USA.
| | - Rod A Wing
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA. .,Center for Desert Agriculture, King Abdullah University of Science & Technology, Thuwal, 23955-6900, Kingdom of Saudi Arabia.
| |
Collapse
|
9
|
Dosage sensitivity and exon shuffling shape the landscape of polymorphic duplicates in Drosophila and humans. Nat Ecol Evol 2021; 6:273-287. [PMID: 34969986 DOI: 10.1038/s41559-021-01614-w] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 11/10/2021] [Indexed: 11/08/2022]
Abstract
Despite polymorphic duplicate genes' importance for the early stages of duplicate gene evolution, they are less studied than old gene duplicates. Two essential questions thus remain poorly addressed: how does dosage sensitivity, imposed by stoichiometry in protein complexes or by X chromosome dosage compensation, affect the emergence of complete duplicate genes? Do introns facilitate intergenic and intragenic chimaerism as predicted by the theory of exon shuffling? Here, we analysed new data for Drosophila and public data for humans, to characterize polymorphic duplicate genes with respect to dosage, exon-intron structures and allele frequencies. We found that complete duplicate genes are under dosage constraint induced by protein stoichiometry but potentially tolerated by X chromosome dosage compensation. We also found that in the intron-rich human genome, gene fusions and intragenic duplications extensively use intronic breakpoints generating in-frame proteins, in accordance with the theory of exon shuffling. Finally, we found that only a small proportion of complete or partial duplicates are at high frequencies, indicating the deleterious nature of dosage or gene structural changes. Altogether, we demonstrate how mechanistic factors including dosage sensitivity and exon-intron structure shape the short-term functional consequences of gene duplication.
Collapse
|
10
|
Genomic analyses of new genes and their phenotypic effects reveal rapid evolution of essential functions in Drosophila development. PLoS Genet 2021; 17:e1009654. [PMID: 34242211 PMCID: PMC8270118 DOI: 10.1371/journal.pgen.1009654] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Accepted: 06/09/2021] [Indexed: 12/27/2022] Open
Abstract
It is a conventionally held dogma that the genetic basis underlying development is conserved in a long evolutionary time scale. Ample experiments based on mutational, biochemical, functional, and complementary knockdown/knockout approaches have revealed the unexpectedly important role of recently evolved new genes in the development of Drosophila. The recent progress in the genome-wide experimental testing of gene effects and improvements in the computational identification of new genes (< 40 million years ago, Mya) open the door to investigate the evolution of gene essentiality with a phylogenetically high resolution. These advancements also raised interesting issues in techniques and concepts related to phenotypic effect analyses of genes, particularly of those that recently originated. Here we reported our analyses of these issues, including reproducibility and efficiency of knockdown experiment and difference between RNAi libraries in the knockdown efficiency and testing of phenotypic effects. We further analyzed a large data from knockdowns of 11,354 genes (~75% of the Drosophila melanogaster total genes), including 702 new genes (~66% of the species total new genes that aged < 40 Mya), revealing a similarly high proportion (~32.2%) of essential genes that originated in various Sophophora subgenus lineages and distant ancestors beyond the Drosophila genus. The transcriptional compensation effect from CRISPR knockout were detected for highly similar duplicate copies. Knockout of a few young genes detected analogous essentiality in various functions in development. Taken together, our experimental and computational analyses provide valuable data for detection of phenotypic effects of genes in general and further strong evidence for the concept that new genes in Drosophila quickly evolved essential functions in viability during development.
Collapse
|
11
|
Le VTB, Tsimbalyuk S, Lim EQ, Solis A, Gawat D, Boeck P, Lim EQ, Renolo R, Forwood JK, Kuhn ML. The Vibrio cholerae SpeG Spermidine/Spermine N-Acetyltransferase Allosteric Loop and β6-β7 Structural Elements Are Critical for Kinetic Activity. Front Mol Biosci 2021; 8:645768. [PMID: 33928120 PMCID: PMC8076852 DOI: 10.3389/fmolb.2021.645768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 02/16/2021] [Indexed: 11/27/2022] Open
Abstract
Polyamines regulate many important biological processes including gene expression, intracellular signaling, and biofilm formation. Their intracellular concentrations are tightly regulated by polyamine transport systems and biosynthetic and catabolic pathways. Spermidine/spermine N-acetyltransferases (SSATs) are catabolic enzymes that acetylate polyamines and are critical for maintaining intracellular polyamine homeostasis. These enzymes belong to the Gcn5-related N-acetyltransferase (GNAT) superfamily and adopt a highly conserved fold found across all kingdoms of life. SpeG is an SSAT protein found in a variety of bacteria, including the human pathogen Vibrio cholerae. This protein adopts a dodecameric structure and contains an allosteric site, making it unique compared to other SSATs. Currently, we have a limited understanding of the critical structural components of this protein that are required for its allosteric behavior. Therefore, we explored the importance of two key regions of the SpeG protein on its kinetic activity. To achieve this, we created various constructs of the V. cholerae SpeG protein, including point mutations, a deletion, and chimeras with residues from the structurally distinct and non-allosteric human SSAT protein. We measured enzyme kinetic activity toward spermine for ten constructs and crystallized six of them. Ultimately, we identified specific portions of the allosteric loop and the β6-β7 structural elements that were critical for enzyme kinetic activity. These results provide a framework for further study of the structure/function relationship of SpeG enzymes from other organisms and clues toward the structural evolution of members of the GNAT family across domains of life.
Collapse
Affiliation(s)
- Van Thi Bich Le
- Department of Chemistry & Biochemistry, San Francisco State University, San Francisco, CA, United States
| | - Sofiya Tsimbalyuk
- School of Biomedical Science, Charles Sturt University, Wagga Wagga, NSW, Australia
| | - Ee Qi Lim
- Department of Chemistry & Biochemistry, San Francisco State University, San Francisco, CA, United States
| | - Allan Solis
- Department of Chemistry & Biochemistry, San Francisco State University, San Francisco, CA, United States
| | - Darwin Gawat
- Department of Chemistry & Biochemistry, San Francisco State University, San Francisco, CA, United States
| | - Paloma Boeck
- Department of Chemistry & Biochemistry, San Francisco State University, San Francisco, CA, United States
| | - Ee Qing Lim
- Department of Chemistry & Biochemistry, San Francisco State University, San Francisco, CA, United States
| | - Rosselini Renolo
- Department of Chemistry & Biochemistry, San Francisco State University, San Francisco, CA, United States
| | - Jade K. Forwood
- School of Biomedical Science, Charles Sturt University, Wagga Wagga, NSW, Australia
| | - Misty L. Kuhn
- Department of Chemistry & Biochemistry, San Francisco State University, San Francisco, CA, United States
| |
Collapse
|
12
|
Bartošová-Sojková P, Kyslík J, Alama-Bermejo G, Hartigan A, Atkinson SD, Bartholomew JL, Picard-Sánchez A, Palenzuela O, Faber MN, Holland JW, Holzer AS. Evolutionary Analysis of Cystatins of Early-Emerging Metazoans Reveals a Novel Subtype in Parasitic Cnidarians. BIOLOGY 2021; 10:110. [PMID: 33546310 PMCID: PMC7913475 DOI: 10.3390/biology10020110] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 01/26/2021] [Accepted: 01/31/2021] [Indexed: 01/04/2023]
Abstract
The evolutionary aspects of cystatins are greatly underexplored in early-emerging metazoans. Thus, we surveyed the gene organization, protein architecture, and phylogeny of cystatin homologues mined from 110 genomes and the transcriptomes of 58 basal metazoan species, encompassing free-living and parasite taxa of Porifera, Placozoa, Cnidaria (including Myxozoa), and Ctenophora. We found that the cystatin gene repertoire significantly differs among phyla, with stefins present in most of the investigated lineages but with type 2 cystatins missing in several basal metazoan groups. Similar to liver and intestinal flukes, myxozoan parasites possess atypical stefins with chimeric structure that combine motifs of classical stefins and type 2 cystatins. Other early metazoan taxa regardless of lifestyle have only the classical representation of cystatins and lack multi-domain ones. Our comprehensive phylogenetic analyses revealed that stefins and type 2 cystatins clustered into taxonomically defined clades with multiple independent paralogous groups, which probably arose due to gene duplications. The stefin clade split between the subclades of classical stefins and the atypical stefins of myxozoans and flukes. Atypical stefins represent key evolutionary innovations of the two parasite groups for which their origin might have been linked with ancestral gene chimerization, obligate parasitism, life cycle complexity, genome reduction, and host immunity.
Collapse
Affiliation(s)
- Pavla Bartošová-Sojková
- Biology Centre, Institute of Parasitology, Czech Academy of Sciences, 37005 České Budějovice, Czech Republic; (J.K.); (G.A.-B.); (A.P.-S.); (A.S.H.)
| | - Jiří Kyslík
- Biology Centre, Institute of Parasitology, Czech Academy of Sciences, 37005 České Budějovice, Czech Republic; (J.K.); (G.A.-B.); (A.P.-S.); (A.S.H.)
- Faculty of Science, University of South Bohemia, 37005 České Budějovice, Czech Republic
| | - Gema Alama-Bermejo
- Biology Centre, Institute of Parasitology, Czech Academy of Sciences, 37005 České Budějovice, Czech Republic; (J.K.); (G.A.-B.); (A.P.-S.); (A.S.H.)
| | - Ashlie Hartigan
- Department of Life Sciences, Natural History Museum, London SW7 5BD, UK;
| | - Stephen D. Atkinson
- Department of Microbiology, Oregon State University, Corvallis, OR 97331, USA; (S.D.A.); (J.L.B.)
| | - Jerri L. Bartholomew
- Department of Microbiology, Oregon State University, Corvallis, OR 97331, USA; (S.D.A.); (J.L.B.)
| | - Amparo Picard-Sánchez
- Biology Centre, Institute of Parasitology, Czech Academy of Sciences, 37005 České Budějovice, Czech Republic; (J.K.); (G.A.-B.); (A.P.-S.); (A.S.H.)
- Fish Pathology Group, Instituto de Acuicultura Torre de la Sal (IATS-CSIC), 12595 Castellón, Spain;
| | - Oswaldo Palenzuela
- Fish Pathology Group, Instituto de Acuicultura Torre de la Sal (IATS-CSIC), 12595 Castellón, Spain;
| | - Marc Nicolas Faber
- Scottish Fish Immunology Research Centre, Institute of Biological and Environmental Sciences, University of Aberdeen, Aberdeen AB24 3UU, UK; (M.N.F.); (J.W.H.)
| | - Jason W. Holland
- Scottish Fish Immunology Research Centre, Institute of Biological and Environmental Sciences, University of Aberdeen, Aberdeen AB24 3UU, UK; (M.N.F.); (J.W.H.)
| | - Astrid S. Holzer
- Biology Centre, Institute of Parasitology, Czech Academy of Sciences, 37005 České Budějovice, Czech Republic; (J.K.); (G.A.-B.); (A.P.-S.); (A.S.H.)
| |
Collapse
|
13
|
The new chimeric chiron genes evolved essential roles in zebrafish embryonic development by regulating NAD + levels. SCIENCE CHINA-LIFE SCIENCES 2021; 64:1929-1948. [PMID: 33521859 DOI: 10.1007/s11427-020-1851-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Accepted: 11/16/2020] [Indexed: 10/22/2022]
Abstract
The origination of new genes is important for generating genetic novelties for adaptive evolution and biological diversity. However, their potential roles in embryonic development, evolutionary processes into ancient networks, and contributions to adaptive evolution remain poorly investigated. Here, we identified a novel chimeric gene family, the chiron family, and explored its genetic basis and functional evolution underlying the adaptive evolution of Danioninae fishes. The ancestral chiron gene originated through retroposition of nampt in Danioninae 48-54 million years ago (Mya) and expanded into five duplicates (chiron1-5) in zebrafish 1-4 Mya. The chiron genes (chirons) likely originated in embryonic development and gradually extended their expression in the testis. Functional experiments showed that chirons were essential for zebrafish embryo development. By integrating into the NAD+ synthesis pathway, chirons could directly catalyze the NAD+ rate-limiting reaction and probably impact two energy metabolism genes (nmnat1 and naprt) to be under positive selection in Danioninae fishes. Together, these results mainly demonstrated that the origin of new chimeric chiron genes may be involved in adaptive evolution by integrating and impacting the NAD+ biosynthetic pathway. This coevolution may contribute to the physiological adaptation of Danioninae fishes to widespread and varied biomes in Southeast Asian.
Collapse
|
14
|
Ricci F, Luporini P, Alimenti C, Vallesi A. Functional chimeric genes in ciliates: An instructive case from Euplotes raikovi. Gene 2020; 767:145186. [PMID: 32998045 DOI: 10.1016/j.gene.2020.145186] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 09/09/2020] [Accepted: 09/23/2020] [Indexed: 11/29/2022]
Abstract
In ciliates, with every sexual event the transcriptionally active genes of the sub-chromosomic somatic genome that resides in the cell macronucleus are lost. They are de novo assembled starting from 'Macronuclear Destined Sequences' that arise from the fragmentation of transcriptionally silent DNA sequences of the germline chromosomic genome enclosed in the cell micronucleus. The RNA-mediated epigenetic mechanism that drives the assembly of these sequences is subject to errors which result in the formation of chimeric genes. Studying a gene family that in Euplotes raikovi controls the synthesis of protein signal pheromones responsible for a self/not-self recognition mechanism, we identified the chimeric structure of an 851-bp macronuclear gene previously known to specify soluble and membrane-bound pheromone molecules through an intron-splicing mechanism. This chimeric gene, designated mac-er-1*, conserved the native pheromone-gene structure throughout its coding and 3' regions. Instead, its 5' region is completely unrelated to the pheromone gene structure at the level of a 360-bp sequence, which derives from the assembly with a MDS destined to compound a 2417-bp gene encoding a 696-amino acid protein with unknown function. This mac-er-1* gene characterization provides further evidence that ciliates rely on functional chimeric genes that originate in non-programmed phenomena of somatic MDS recombination to increase the species genetic variability independently of gene reshuffling phenomena of the germline genome.
Collapse
Affiliation(s)
- Francesca Ricci
- Laboratory of Eukaryotic Microbiology and Animal Biology, School of Biosciences and Veterinary Medicine, University of Camerino, Camerino 62032, Italy
| | - Pierangelo Luporini
- Laboratory of Eukaryotic Microbiology and Animal Biology, School of Biosciences and Veterinary Medicine, University of Camerino, Camerino 62032, Italy
| | - Claudio Alimenti
- Laboratory of Eukaryotic Microbiology and Animal Biology, School of Biosciences and Veterinary Medicine, University of Camerino, Camerino 62032, Italy
| | - Adriana Vallesi
- Laboratory of Eukaryotic Microbiology and Animal Biology, School of Biosciences and Veterinary Medicine, University of Camerino, Camerino 62032, Italy.
| |
Collapse
|
15
|
Kapun M, Barrón MG, Staubach F, Obbard DJ, Wiberg RAW, Vieira J, Goubert C, Rota-Stabelli O, Kankare M, Bogaerts-Márquez M, Haudry A, Waidele L, Kozeretska I, Pasyukova EG, Loeschcke V, Pascual M, Vieira CP, Serga S, Montchamp-Moreau C, Abbott J, Gibert P, Porcelli D, Posnien N, Sánchez-Gracia A, Grath S, Sucena É, Bergland AO, Guerreiro MPG, Onder BS, Argyridou E, Guio L, Schou MF, Deplancke B, Vieira C, Ritchie MG, Zwaan BJ, Tauber E, Orengo DJ, Puerma E, Aguadé M, Schmidt P, Parsch J, Betancourt AJ, Flatt T, González J. Genomic Analysis of European Drosophila melanogaster Populations Reveals Longitudinal Structure, Continent-Wide Selection, and Previously Unknown DNA Viruses. Mol Biol Evol 2020; 37:2661-2678. [PMID: 32413142 PMCID: PMC7475034 DOI: 10.1093/molbev/msaa120] [Citation(s) in RCA: 63] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Genetic variation is the fuel of evolution, with standing genetic variation especially important for short-term evolution and local adaptation. To date, studies of spatiotemporal patterns of genetic variation in natural populations have been challenging, as comprehensive sampling is logistically difficult, and sequencing of entire populations costly. Here, we address these issues using a collaborative approach, sequencing 48 pooled population samples from 32 locations, and perform the first continent-wide genomic analysis of genetic variation in European Drosophila melanogaster. Our analyses uncover longitudinal population structure, provide evidence for continent-wide selective sweeps, identify candidate genes for local climate adaptation, and document clines in chromosomal inversion and transposable element frequencies. We also characterize variation among populations in the composition of the fly microbiome, and identify five new DNA viruses in our samples.
Collapse
Affiliation(s)
- Martin Kapun
- The European Drosophila Population Genomics Consortium (DrosEU)
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Department of Biology, University of Fribourg, Fribourg, Switzerland
- Department of Evolutionary Biology and Environmental Sciences, University of Zürich, Zürich, Switzerland
- Division of Cell and Developmental Biology, Medical University of Vienna, Vienna, Austria
| | - Maite G Barrón
- The European Drosophila Population Genomics Consortium (DrosEU)
- Institute of Evolutionary Biology, CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | - Fabian Staubach
- The European Drosophila Population Genomics Consortium (DrosEU)
- Department of Evolutionary Biology and Ecology, University of Freiburg, Freiburg, Germany
| | - Darren J Obbard
- The European Drosophila Population Genomics Consortium (DrosEU)
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom
| | - R Axel W Wiberg
- The European Drosophila Population Genomics Consortium (DrosEU)
- Centre for Biological Diversity, School of Biology, University of St. Andrews, St Andrews, Scotland
- Department of Environmental Sciences, Zoological Institute, University of Basel, Basel, Switzerland
| | - Jorge Vieira
- The European Drosophila Population Genomics Consortium (DrosEU)
- Instituto de Biologia Molecular e Celular (IBMC), University of Porto, Porto, Portugal
- Instituto de Investigação e Inovação em Saúde (I3S), University of Porto, Porto, Portugal
| | - Clément Goubert
- The European Drosophila Population Genomics Consortium (DrosEU)
- Laboratoire de Biométrie et Biologie Evolutive UMR 5558, CNRS, Université Lyon 1, Université de Lyon, Villeurbanne, France
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY
| | - Omar Rota-Stabelli
- The European Drosophila Population Genomics Consortium (DrosEU)
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all’ Adige, Italy
| | - Maaria Kankare
- The European Drosophila Population Genomics Consortium (DrosEU)
- Department of Biological and Environmental Science, University of Jyväskylä, Jyväskylä, Finland
| | - María Bogaerts-Márquez
- The European Drosophila Population Genomics Consortium (DrosEU)
- Institute of Evolutionary Biology, CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | - Annabelle Haudry
- The European Drosophila Population Genomics Consortium (DrosEU)
- Laboratoire de Biométrie et Biologie Evolutive UMR 5558, CNRS, Université Lyon 1, Université de Lyon, Villeurbanne, France
| | - Lena Waidele
- The European Drosophila Population Genomics Consortium (DrosEU)
- Department of Evolutionary Biology and Ecology, University of Freiburg, Freiburg, Germany
| | - Iryna Kozeretska
- The European Drosophila Population Genomics Consortium (DrosEU)
- General and Medical Genetics Department, Taras Shevchenko National University of Kyiv, Kyiv, Ukraine
- State Institution National Antarctic Scientific Center of Ministry of Education and Science of Ukraine, Kyiv, Ukraine
| | - Elena G Pasyukova
- The European Drosophila Population Genomics Consortium (DrosEU)
- Laboratory of Genome Variation, Institute of Molecular Genetics of RAS, Moscow, Russia
| | - Volker Loeschcke
- The European Drosophila Population Genomics Consortium (DrosEU)
- Department of Bioscience—Genetics, Ecology and Evolution, Aarhus University, Aarhus C, Denmark
| | - Marta Pascual
- The European Drosophila Population Genomics Consortium (DrosEU)
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| | - Cristina P Vieira
- The European Drosophila Population Genomics Consortium (DrosEU)
- Instituto de Biologia Molecular e Celular (IBMC), University of Porto, Porto, Portugal
- Instituto de Investigação e Inovação em Saúde (I3S), University of Porto, Porto, Portugal
| | - Svitlana Serga
- The European Drosophila Population Genomics Consortium (DrosEU)
- General and Medical Genetics Department, Taras Shevchenko National University of Kyiv, Kyiv, Ukraine
| | - Catherine Montchamp-Moreau
- The European Drosophila Population Genomics Consortium (DrosEU)
- Université Paris-Saclay, CNRS, IRD, UMR Évolution, Génomes, Comportement et Écologie, 91198, Gif-sur-Yvette, France
| | - Jessica Abbott
- The European Drosophila Population Genomics Consortium (DrosEU)
- Section for Evolutionary Ecology, Department of Biology, Lund University, Lund, Sweden
| | - Patricia Gibert
- The European Drosophila Population Genomics Consortium (DrosEU)
- Laboratoire de Biométrie et Biologie Evolutive UMR 5558, CNRS, Université Lyon 1, Université de Lyon, Villeurbanne, France
| | - Damiano Porcelli
- The European Drosophila Population Genomics Consortium (DrosEU)
- Department of Animal and Plant Sciences, Sheffield, United Kingdom
| | - Nico Posnien
- The European Drosophila Population Genomics Consortium (DrosEU)
- Johann-Friedrich-Blumenbach-Institut für Zoologie und Anthropologie, Universität Göttingen, Göttingen, Germany
| | - Alejandro Sánchez-Gracia
- The European Drosophila Population Genomics Consortium (DrosEU)
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| | - Sonja Grath
- The European Drosophila Population Genomics Consortium (DrosEU)
- Division of Evolutionary Biology, Faculty of Biology, Ludwig-Maximilians-Universität München, Planegg, Germany
| | - Élio Sucena
- The European Drosophila Population Genomics Consortium (DrosEU)
- Instituto Gulbenkian de Ciência, Oeiras, Portugal
- Departamento de Biologia Animal, Faculdade de Ciências da Universidade de Lisboa, Lisboa, Portugal
| | - Alan O Bergland
- The European Drosophila Population Genomics Consortium (DrosEU)
- Department of Biology, University of Virginia, Charlottesville, VA
| | - Maria Pilar Garcia Guerreiro
- The European Drosophila Population Genomics Consortium (DrosEU)
- Departament de Genètica i Microbiologia, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Banu Sebnem Onder
- The European Drosophila Population Genomics Consortium (DrosEU)
- Department of Biology, Faculty of Science, Hacettepe University, Ankara, Turkey
| | - Eliza Argyridou
- The European Drosophila Population Genomics Consortium (DrosEU)
- Division of Evolutionary Biology, Faculty of Biology, Ludwig-Maximilians-Universität München, Planegg, Germany
| | - Lain Guio
- The European Drosophila Population Genomics Consortium (DrosEU)
- Institute of Evolutionary Biology, CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| | - Mads Fristrup Schou
- The European Drosophila Population Genomics Consortium (DrosEU)
- Department of Bioscience—Genetics, Ecology and Evolution, Aarhus University, Aarhus C, Denmark
- Section for Evolutionary Ecology, Department of Biology, Lund University, Lund, Sweden
| | - Bart Deplancke
- The European Drosophila Population Genomics Consortium (DrosEU)
- Institute of Bio-engineering, School of Life Sciences, EPFL, Lausanne, Switzerland
| | - Cristina Vieira
- The European Drosophila Population Genomics Consortium (DrosEU)
- Laboratoire de Biométrie et Biologie Evolutive UMR 5558, CNRS, Université Lyon 1, Université de Lyon, Villeurbanne, France
| | - Michael G Ritchie
- The European Drosophila Population Genomics Consortium (DrosEU)
- Centre for Biological Diversity, School of Biology, University of St. Andrews, St Andrews, Scotland
| | - Bas J Zwaan
- The European Drosophila Population Genomics Consortium (DrosEU)
- Laboratory of Genetics, Department of Plant Sciences, Wageningen University, Wageningen, Netherlands
| | - Eran Tauber
- The European Drosophila Population Genomics Consortium (DrosEU)
- Department of Evolutionary and Environmental Biology, University of Haifa, Haifa, Israel
- Institute of Evolution, University of Haifa, Haifa, Israel
| | - Dorcas J Orengo
- The European Drosophila Population Genomics Consortium (DrosEU)
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| | - Eva Puerma
- The European Drosophila Population Genomics Consortium (DrosEU)
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| | - Montserrat Aguadé
- The European Drosophila Population Genomics Consortium (DrosEU)
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| | - Paul Schmidt
- The European Drosophila Population Genomics Consortium (DrosEU)
- Department of Biology, University of Pennsylvania, Philadelphia, PA
| | - John Parsch
- The European Drosophila Population Genomics Consortium (DrosEU)
- Division of Evolutionary Biology, Faculty of Biology, Ludwig-Maximilians-Universität München, Planegg, Germany
| | - Andrea J Betancourt
- The European Drosophila Population Genomics Consortium (DrosEU)
- Department of Evolution, Ecology, and Behaviour, University of Liverpool, Liverpool, United Kingdom
| | - Thomas Flatt
- The European Drosophila Population Genomics Consortium (DrosEU)
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Department of Biology, University of Fribourg, Fribourg, Switzerland
| | - Josefa González
- The European Drosophila Population Genomics Consortium (DrosEU)
- Institute of Evolutionary Biology, CSIC-Universitat Pompeu Fabra, Barcelona, Spain
| |
Collapse
|
16
|
Stewart NB, Rogers RL. Chromosomal rearrangements as a source of new gene formation in Drosophila yakuba. PLoS Genet 2019; 15:e1008314. [PMID: 31545792 PMCID: PMC6776367 DOI: 10.1371/journal.pgen.1008314] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Revised: 10/03/2019] [Accepted: 07/17/2019] [Indexed: 11/19/2022] Open
Abstract
The origins of new genes are among the most fundamental questions in evolutionary biology. Our understanding of the ways that new genetic material appears and how that genetic material shapes population variation remains incomplete. De novo genes and duplicate genes are a key source of new genetic material on which selection acts. To better understand the origins of these new gene sequences, we explored the ways that structural variation might alter expression patterns and form novel transcripts. We provide evidence that chromosomal rearrangements are a source of novel genetic variation that facilitates the formation of de novo exons in Drosophila. We identify 51 cases of de novo exon formation created by chromosomal rearrangements in 14 strains of D. yakuba. These new genes inherit transcription start signals and open reading frames when the 5' end of existing genes are combined with previously untranscribed regions. Such new genes would appear with novel peptide sequences, without the necessity for secondary transitions from non-coding RNA to protein. This mechanism of new peptide formations contrasts with canonical theory of de novo gene progression requiring non-coding intermediaries that must acquire new mutations prior to loss via pseudogenization. Hence, these mutations offer a means to de novo gene creation and protein sequence formation in a single mutational step, answering a long standing open question concerning new gene formation. We further identify gene expression changes to 134 existing genes, indicating that these mutations can alter gene regulation. Population variability for chromosomal rearrangements is considerable, with 2368 rearrangements observed across 14 inbred lines. More rearrangements were identified on the X chromosome than any of the autosomes, suggesting the X is more susceptible to chromosome alterations. Together, these results suggest that chromosomal rearrangements are a source of variation in populations that is likely to be important to explain genetic and therefore phenotypic diversity.
Collapse
Affiliation(s)
- Nicholas B. Stewart
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, North Carolina, United States of America
- Department of Biological Sciences, Ft Hays State University, Ft Hays, Kansas, United States of America
| | - Rebekah L. Rogers
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
17
|
Baker EP, Hittinger CT. Evolution of a novel chimeric maltotriose transporter in Saccharomyces eubayanus from parent proteins unable to perform this function. PLoS Genet 2019; 15:e1007786. [PMID: 30946740 PMCID: PMC6448821 DOI: 10.1371/journal.pgen.1007786] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Accepted: 10/25/2018] [Indexed: 11/23/2022] Open
Abstract
At the molecular level, the evolution of new traits can be broadly divided between changes in gene expression and changes in protein-coding sequence. For proteins, the evolution of novel functions is generally thought to proceed through sequential point mutations or recombination of whole functional units. In Saccharomyces, the uptake of the sugar maltotriose into the cell is the primary limiting factor in its utilization, but maltotriose transporters are relatively rare, except in brewing strains. No known wild strains of Saccharomyces eubayanus, the cold-tolerant parent of hybrid lager-brewing yeasts (Saccharomyces cerevisiae x S. eubayanus), are able to consume maltotriose, which limits their ability to fully ferment malt extract. In one strain of S. eubayanus, we found a gene closely related to a known maltotriose transporter and were able to confer maltotriose consumption by overexpressing this gene or by passaging the strain on maltose. Even so, most wild strains of S. eubayanus lack native maltotriose transporters. To determine how this rare trait could evolve in naive genetic backgrounds, we performed an adaptive evolution experiment for maltotriose consumption, which yielded a single strain of S. eubayanus able to grow on maltotriose. We mapped the causative locus to a gene encoding a novel chimeric transporter that was formed by an ectopic recombination event between two genes encoding transporters that are unable to import maltotriose. In contrast to classic models of the evolution of novel protein functions, the recombination breakpoints occurred within a single functional domain. Thus, the ability of the new protein to carry maltotriose was likely acquired through epistatic interactions between independently evolved substitutions. By acquiring multiple mutations at once, the transporter rapidly gained a novel function, while bypassing potentially deleterious intermediate steps. This study provides an illuminating example of how recombination between paralogs can establish novel interactions among substitutions to create adaptive functions. Hybrids of the yeasts Saccharomyces cerevisiae and Saccharomyces eubayanus (lager-brewing yeasts) dominate the modern brewing industry. S. cerevisiae, also known as baker’s yeast, is well-known for its role in industry and scientific research. Less well recognized is S. eubayanus, which was only discovered as a pure species in 2011. While most lager-brewing yeasts rapidly and completely utilize the important brewing sugar maltotriose, no strain of S. eubayanus isolated to date is known to do so. Despite being unable to consume maltotriose, we identified one strain of S. eubayanus carrying a gene for a functional maltotriose transporter, although most strains lack this gene. During an adaptive evolution experiment, a strain of S. eubayanus without native maltotriose transporters evolved the ability to grow on maltotriose. Maltotriose consumption in the evolved strain resulted from a chimeric transporter that arose by shuffling genes encoding parent proteins that were unable to transport maltotriose. Traditionally, functional chimeric proteins are thought to evolve by shuffling discrete functional domains or modules, but the breakpoints in the chimera studied here occurred within the single functional module of the protein. These results support the less well-recognized role of shuffling duplicate gene sequences to generate novel proteins with adaptive functions.
Collapse
Affiliation(s)
- EmilyClare P. Baker
- Laboratory of Genetics, Microbiology Doctoral Training Program, Genome Center of Wisconsin, Wisconsin Energy Institute, J. F. Crow Institute for the Study of Evolution, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Chris Todd Hittinger
- Laboratory of Genetics, Microbiology Doctoral Training Program, Genome Center of Wisconsin, Wisconsin Energy Institute, J. F. Crow Institute for the Study of Evolution, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- DOE Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- * E-mail:
| |
Collapse
|
18
|
Laumer CE. Inferring Ancient Relationships with Genomic Data: A Commentary on Current Practices. Integr Comp Biol 2019; 58:623-639. [PMID: 29982611 DOI: 10.1093/icb/icy075] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Contemporary phylogeneticists enjoy an embarrassment of riches, not only in the volumes of data now available, but also in the diversity of bioinformatic tools for handling these data. Here, I discuss a subset of these tools I consider well-suited to the task of inferring ancient relationships with coding sequence data in particular, encompassing data generation, orthology assignment, alignment and gene tree inference, supermatrix construction, and analysis under the best-fitting models applicable to large-scale datasets. Throughout, I compare and critique methods, considering both their theoretical principles and the details of their implementation, and offering practical tips on usage where appropriate. I also entertain different motivations for analyzing what are almost always originally DNA sequence data as codons, amino acids, and higher-order recodings. Although presented in a linear order, I see value in using the diversity of tools available to us to assess the sensitivity of clades of biological interest to different gene and taxon sets and analytical modes, which can be an indication of the presence of systematic error, of which a few forms remain poorly controlled by even the best available inference methods.
Collapse
Affiliation(s)
- Christopher E Laumer
- EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, EBML-EBI South Building, Hinxton CB10 1SD, UK
| |
Collapse
|
19
|
Abstract
Drosophila melanogaster is a human commensal and dietary generalist. A new study in its ancestral range in Africa finds that wild Drosophila melanogaster are specialists on marula fruit - fruits cached in caves by Pleistocene humans.
Collapse
Affiliation(s)
- Marianthi Karageorgi
- Department of Integrative Biology, University of California, Berkeley, 3040 Valley Life Sciences Building, Berkeley, CA 94720, USA
| | - Teruyuki Matsunaga
- Department of Integrative Biology, University of California, Berkeley, 3040 Valley Life Sciences Building, Berkeley, CA 94720, USA
| | - Noah K Whiteman
- Department of Integrative Biology, University of California, Berkeley, 3040 Valley Life Sciences Building, Berkeley, CA 94720, USA.
| |
Collapse
|
20
|
Abstract
The classic Darwinian theory and the Synthetic evolutionary theory and their linear models, while invaluable to study the origins and evolution of species, are not primarily designed to model the evolution of organisations, typically that of ecosystems, nor that of processes. How could evolutionary theory better explain the evolution of biological complexity and diversity? Inclusive network-based analyses of dynamic systems could retrace interactions between (related or unrelated) components. This theoretical shift from a Tree of Life to a Dynamic Interaction Network of Life, which is supported by diverse molecular, cellular, microbiological, organismal, ecological and evolutionary studies, would further unify evolutionary biology.
Collapse
Affiliation(s)
- Eric Bapteste
- Sorbonne Universités, UPMC Université Paris 06, Institut de Biologie Paris-Seine (IBPS), F-75005 Paris, France
- CNRS, UMR7138, Institut de Biologie Paris-Seine, F-75005 Paris, France
| | - Philippe Huneman
- Institut d’Histoire et de Philosophie des Sciences et des Techniques (CNRS / Paris I Sorbonne), F-75006 Paris, France
| |
Collapse
|
21
|
VanKuren NW, Long M. Gene duplicates resolving sexual conflict rapidly evolved essential gametogenesis functions. Nat Ecol Evol 2018; 2:705-712. [PMID: 29459709 PMCID: PMC5866764 DOI: 10.1038/s41559-018-0471-0] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2017] [Accepted: 01/05/2018] [Indexed: 02/04/2023]
Abstract
Males and females have different fitness optima but share the vast majority of their genomes, causing an inherent genetic conflict between the two sexes that must be resolved to achieve maximal population fitness. We show that two tandem duplicate genes found specifically in Drosophila melanogaster are sexually antagonistic, but rapidly evolved sex-specific functions and expression patterns that mitigate their antagonistic effects. We use copy-specific knockouts and rescue experiments to show that Apollo (Apl) is essential for male fertility but detrimental to female fertility, in addition to its important role in development, while Artemis (Arts) is essential for female fertility but detrimental to male fertility. Further analyses show that Apl and Arts have essential roles in spermatogenesis and oogenesis. These duplicates formed ~200,000 years ago, underwent a strong selective sweep and lost most expression in the antagonized sex. These data provide direct evidence that gene duplication allowed rapid mitigation of sexual conflict by allowing Apl and Arts to evolve essential sex-specific reproductive functions and complementary expression in male and female gonads.
Collapse
Affiliation(s)
- Nicholas W VanKuren
- Department of Ecology and Evolution, The University of Chicago, Chicago, IL, USA.
- Committee on Genetics, Genomics and Systems Biology, The University of Chicago, Chicago, IL, USA.
| | - Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, Chicago, IL, USA.
- Committee on Genetics, Genomics and Systems Biology, The University of Chicago, Chicago, IL, USA.
| |
Collapse
|
22
|
Shapiro JA. Living Organisms Author Their Read-Write Genomes in Evolution. BIOLOGY 2017; 6:E42. [PMID: 29211049 PMCID: PMC5745447 DOI: 10.3390/biology6040042] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/23/2017] [Revised: 11/17/2017] [Accepted: 11/28/2017] [Indexed: 12/18/2022]
Abstract
Evolutionary variations generating phenotypic adaptations and novel taxa resulted from complex cellular activities altering genome content and expression: (i) Symbiogenetic cell mergers producing the mitochondrion-bearing ancestor of eukaryotes and chloroplast-bearing ancestors of photosynthetic eukaryotes; (ii) interspecific hybridizations and genome doublings generating new species and adaptive radiations of higher plants and animals; and, (iii) interspecific horizontal DNA transfer encoding virtually all of the cellular functions between organisms and their viruses in all domains of life. Consequently, assuming that evolutionary processes occur in isolated genomes of individual species has become an unrealistic abstraction. Adaptive variations also involved natural genetic engineering of mobile DNA elements to rewire regulatory networks. In the most highly evolved organisms, biological complexity scales with "non-coding" DNA content more closely than with protein-coding capacity. Coincidentally, we have learned how so-called "non-coding" RNAs that are rich in repetitive mobile DNA sequences are key regulators of complex phenotypes. Both biotic and abiotic ecological challenges serve as triggers for episodes of elevated genome change. The intersections of cell activities, biosphere interactions, horizontal DNA transfers, and non-random Read-Write genome modifications by natural genetic engineering provide a rich molecular and biological foundation for understanding how ecological disruptions can stimulate productive, often abrupt, evolutionary transformations.
Collapse
Affiliation(s)
- James A Shapiro
- Department of Biochemistry and Molecular Biology, University of Chicago GCIS W123B, 979 E. 57th Street, Chicago, IL 60637, USA.
| |
Collapse
|
23
|
Adaptive evolution by spontaneous domain fusion and protein relocalization. Nat Ecol Evol 2017; 1:1562-1568. [PMID: 29185504 DOI: 10.1038/s41559-017-0283-7] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Accepted: 07/18/2017] [Indexed: 11/08/2022]
Abstract
Knowledge of adaptive processes encompasses understanding the emergence of new genes. Computational analyses of genomes suggest that new genes can arise by domain swapping; however, empirical evidence has been lacking. Here we describe a set of nine independent deletion mutations that arose during selection experiments with the bacterium Pseudomonas fluorescens in which the membrane-spanning domain of a fatty acid desaturase became translationally fused to a cytosolic di-guanylate cyclase, generating an adaptive 'wrinkly spreader' phenotype. Detailed genetic analysis of one gene fusion shows that the mutant phenotype is caused by relocalization of the di-guanylate cyclase domain to the cell membrane. The relative ease by which this new gene arose, along with its functional and regulatory effects, provides a glimpse of mutational events and their consequences that are likely to have a role in the evolution of new genes.
Collapse
|
24
|
Abstract
The classic model for the evolution of novel gene function is through gene duplication followed by evolution of a new function by one of the copies (neofunctionalization) [1, 2]. However, other modes have also been found, such as novel genes arising from non-coding DNA, chimeric fusions, and lateral gene transfers from other organisms [3-7]. Here we use the rapid turnover of venom genes in parasitoid wasps to study how new gene functions evolve. In contrast to the classic gene duplication model, we find that a common mode of acquisition of new venom genes in parasitoid wasps is co-option of single-copy genes from non-venom progenitors. Transcriptome and proteome sequencing reveal that recruitment and loss of venom genes occur primarily by rapid cis-regulatory expression evolution in the venom gland. Loss of venom genes is primarily due to downregulation of expression in the gland rather than gene death through coding sequence degradation. While the majority of venom genes have specialized expression in the venom gland, recent losses of venom function occur primarily among genes that show broader expression in development, suggesting that they can more readily switch functional roles. We propose that co-option of single-copy genes may be a common but relatively understudied mechanism of evolution for new gene functions, particularly under conditions of rapid evolutionary change.
Collapse
Affiliation(s)
- Ellen O Martinson
- Biology Department, University of Rochester, Rochester, NY 14627, USA
| | | | - Ching-Ho Chang
- Biology Department, University of Rochester, Rochester, NY 14627, USA
| | - John H Werren
- Biology Department, University of Rochester, Rochester, NY 14627, USA.
| |
Collapse
|
25
|
Tandem duplications lead to novel expression patterns through exon shuffling in Drosophila yakuba. PLoS Genet 2017; 13:e1006795. [PMID: 28531189 PMCID: PMC5460883 DOI: 10.1371/journal.pgen.1006795] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2015] [Revised: 06/06/2017] [Accepted: 05/03/2017] [Indexed: 01/06/2023] Open
Abstract
One common hypothesis to explain the impacts of tandem duplications is that whole gene duplications commonly produce additive changes in gene expression due to copy number changes. Here, we use genome wide RNA-seq data from a population sample of Drosophila yakuba to test this ‘gene dosage’ hypothesis. We observe little evidence of expression changes in response to whole transcript duplication capturing 5′ and 3′ UTRs. Among whole gene duplications, we observe evidence that dosage sharing across copies is likely to be common. The lack of expression changes after whole gene duplication suggests that the majority of genes are subject to tight regulatory control and therefore not sensitive to changes in gene copy number. Rather, we observe changes in expression level due to both shuffling of regulatory elements and the creation of chimeric structures via tandem duplication. Additionally, we observe 30 de novo gene structures arising from tandem duplications, 23 of which form with expression in the testes. Thus, the value of tandem duplications is likely to be more intricate than simple changes in gene dosage. The common regulatory effects from chimeric gene formation after tandem duplication may explain their contribution to genome evolution. The enclosed work shows that whole gene duplications rarely affect gene expression, in contrast to widely held views that the adaptive value of duplicate genes is related to additive changes in gene expression due to gene copy number. We further explain how tandem duplications that create shuffled gene structures can force upregulation of gene sequences, de novo gene creation, and multifold changes in transcript levels. These results show that tandem duplications can produce new genes that are a source of immediate novelty associated with more extreme expression changes than previously suggested by theory. Further, these gene expression changes are a potential source of both beneficial and pathogenic mutations, immediately relevant to clinical and medical genetics in humans and other metazoans.
Collapse
|
26
|
Structural and functional innovations in the real-time evolution of new (βα) 8 barrel enzymes. Proc Natl Acad Sci U S A 2017; 114:4727-4732. [PMID: 28416687 DOI: 10.1073/pnas.1618552114] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
New genes can arise by duplication and divergence, but there is a fundamental gap in our understanding of the relationship between these genes, the evolving proteins they encode, and the fitness of the organism. Here we used crystallography, NMR dynamics, kinetics, and mass spectrometry to explain the molecular innovations that arose during a previous real-time evolution experiment. In that experiment, the (βα)8 barrel enzyme HisA was under selection for two functions (HisA and TrpF), resulting in duplication and divergence of the hisA gene to encode TrpF specialists, HisA specialists, and bifunctional generalists. We found that selection affects enzyme structure and dynamics, and thus substrate preference, simultaneously and sequentially. Bifunctionality is associated with two distinct sets of loop conformations, each essential for one function. We observed two mechanisms for functional specialization: structural stabilization of each loop conformation and substrate-specific adaptation of the active site. Intracellular enzyme performance, calculated as the product of catalytic efficiency and relative expression level, was not linearly related to fitness. Instead, we observed thresholds for each activity above which further improvements in catalytic efficiency had little if any effect on growth rate. Overall, we have shown how beneficial substitutions selected during real-time evolution can lead to manifold changes in enzyme function and bacterial fitness. This work emphasizes the speed at which adaptive evolution can yield enzymes with sufficiently high activities such that they no longer limit the growth of their host organism, and confirms the (βα)8 barrel as an inherently evolvable protein scaffold.
Collapse
|
27
|
The Role of Microsatellites in Streptophyta Gene Evolution. J Mol Evol 2017; 84:144-148. [PMID: 28116472 DOI: 10.1007/s00239-016-9778-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2016] [Accepted: 12/19/2016] [Indexed: 10/20/2022]
Abstract
Microsatellites form hotspot regions for recombination. In this research, we investigated whether genic microsatellites can be responsible for generating new genes by enhancing crossover between gene containing microsatellites and other genomic regions. We tested our hypothesis on 33,531 UniGene entries containing microsatellites. Each sequence was divided into microsatellites upstream and downstream fragments, and each pair of sequences was compared to study the microsatellites effect. The candidate pairs of genes are supposed to share a high similar fragment in one side of the microsatellites, while the other fragments should be completely different. This in silico approach detected 448 valid pairs of sequences in which both of them showed semi-resemblance nature. The synteny analysis for the detected sequences against 55 plant genomes indicated low representation of them across plant kingdom. Our results will add a body of knowledge toward understanding the role of microsatellites in gene evolution.
Collapse
|
28
|
Using natural sequences and modularity to design common and novel protein topologies. Curr Opin Struct Biol 2016; 38:26-36. [PMID: 27270240 DOI: 10.1016/j.sbi.2016.05.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2016] [Revised: 05/13/2016] [Accepted: 05/18/2016] [Indexed: 02/07/2023]
Abstract
Protein design is still a challenging undertaking, often requiring multiple attempts or iterations for success. Typically, the source of failure is unclear, and scoring metrics appear similar between successful and failed cases. Nevertheless, the use of sequence statistics, modularity and symmetry from natural proteins, combined with computational design both at the coarse-grained and atomistic levels is propelling a new wave of design efforts to success. Here we highlight recent examples of design, showing how the wealth of natural protein sequence and topology data may be leveraged to reduce the search space and increase the likelihood of achieving desired outcomes.
Collapse
|
29
|
Shu C, Zhou J, Crickmore N, Li X, Song F, Liang G, He K, Huang D, Zhang J. In vitro template-change PCR to create single crossover libraries: a case study with B. thuringiensis Cry2A toxins. Sci Rep 2016; 6:23536. [PMID: 27097519 PMCID: PMC4838838 DOI: 10.1038/srep23536] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Accepted: 03/09/2016] [Indexed: 11/09/2022] Open
Abstract
During evolution the creation of single crossover chimeras between duplicated paralogous genes is a known process for increasing diversity. Comparing the properties of homologously recombined chimeras with one or two crossovers is also an efficient strategy for analyzing relationships between sequence variation and function. However, no well-developed in vitro method has been established to create single-crossover libraries. Here we present an in vitro template-change polymerase change reaction that has been developed to enable the production of such libraries. We applied the method to two closely related toxin genes from B. thuringiensis and created chimeras with differing properties that can help us understand how these toxins are able to differentiate between insect species.
Collapse
Affiliation(s)
- Changlong Shu
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, 100193, P. R. China
| | - Jianqiao Zhou
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, 100193, P. R. China
| | - Neil Crickmore
- School of Life Sciences, University of Sussex, Falmer, Brighton, UK
| | - Xianchun Li
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, 100193, P. R. China
| | - Fuping Song
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, 100193, P. R. China
| | - Gemei Liang
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, 100193, P. R. China
| | - Kanglai He
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, 100193, P. R. China
| | - Dafang Huang
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing, 100081, P. R. China
| | - Jie Zhang
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, 100193, P. R. China
| |
Collapse
|
30
|
Rogers RL, Cridland JM, Shao L, Hu TT, Andolfatto P, Thornton KR. Tandem Duplications and the Limits of Natural Selection in Drosophila yakuba and Drosophila simulans. PLoS One 2015; 10:e0132184. [PMID: 26176952 PMCID: PMC4503668 DOI: 10.1371/journal.pone.0132184] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2015] [Accepted: 06/10/2015] [Indexed: 11/30/2022] Open
Abstract
Tandem duplications are an essential source of genetic novelty, and their variation in natural populations is expected to influence adaptive walks. Here, we describe evolutionary impacts of recently-derived, segregating tandem duplications in Drosophila yakuba and Drosophila simulans. We observe an excess of duplicated genes involved in defense against pathogens, insecticide resistance, chorion development, cuticular peptides, and lipases or endopeptidases associated with the accessory glands across both species. The observed agreement is greater than expectations on chance alone, suggesting large amounts of convergence across functional categories. We document evidence of widespread selection on the D. simulans X, suggesting adaptation through duplication is common on the X. Despite the evidence for positive selection, duplicates display an excess of low frequency variants consistent with largely detrimental impacts, limiting the variation that can effectively facilitate adaptation. Standing variation for tandem duplications spans less than 25% of the genome in D. yakuba and D. simulans, indicating that evolution will be strictly limited by mutation, even in organisms with large population sizes. Effective whole gene duplication rates are low at 1.17 × 10-9 per gene per generation in D. yakuba and 6.03 × 10-10 per gene per generation in D. simulans, suggesting long wait times for new mutations on the order of thousands of years for the establishment of sweeps. Hence, in cases where adaptation depends on individual tandem duplications, evolution will be severely limited by mutation. We observe low levels of parallel recruitment of the same duplicated gene in different species, suggesting that the span of standing variation will define evolutionary outcomes in spite of convergence across gene ontologies consistent with rapidly evolving phenotypes.
Collapse
Affiliation(s)
- Rebekah L. Rogers
- Ecology and Evolutionary Biology, University of California, Berkeley, California, United States of America
| | - Julie M. Cridland
- Ecology and Evolutionary Biology, University of California, Davis, Davis, California, United States of America
| | - Ling Shao
- Ecology and Evolutionary Biology, University of California, Irvine, Irvine, California, United States of America
| | - Tina T. Hu
- Ecology and Evolutionary Biology and the Lewis Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Peter Andolfatto
- Ecology and Evolutionary Biology and the Lewis Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Kevin R. Thornton
- Ecology and Evolutionary Biology, University of California, Irvine, Irvine, California, United States of America
| |
Collapse
|
31
|
Tanaka K, Diekmann Y, Hazbun A, Hijazi A, Vreede B, Roch F, Sucena É. Multispecies Analysis of Expression Pattern Diversification in the Recently Expanded Insect Ly6 Gene Family. Mol Biol Evol 2015; 32:1730-47. [PMID: 25743545 PMCID: PMC4476152 DOI: 10.1093/molbev/msv052] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Gene families often consist of members with diverse expression domains reflecting their functions in a wide variety of tissues. However, how the expression of individual members, and thus their tissue-specific functions, diversified during the course of gene family expansion is not well understood. In this study, we approached this question through the analysis of the duplication history and transcriptional evolution of a rapidly expanding subfamily of insect Ly6 genes. We analyzed different insect genomes and identified seven Ly6 genes that have originated from a single ancestor through sequential duplication within the higher Diptera. We then determined how the original embryonic expression pattern of the founding gene diversified by characterizing its tissue-specific expression in the beetle Tribolium castaneum, the butterfly Bicyclus anynana, and the mosquito Anopheles stephensi and those of its duplicates in three higher dipteran species, representing various stages of the duplication history (Megaselia abdita, Ceratitis capitata, and Drosophila melanogaster). Our results revealed that frequent neofunctionalization episodes contributed to the increased expression breadth of this subfamily and that these events occurred after duplication and speciation events at comparable frequencies. In addition, at each duplication node, we consistently found asymmetric expression divergence. One paralog inherited most of the tissue-specificities of the founder gene, whereas the other paralog evolved drastically reduced expression domains. Our approach attests to the power of combining a well-established duplication history with a comprehensive coverage of representative species in acquiring unequivocal information about the dynamics of gene expression evolution in gene families.
Collapse
Affiliation(s)
| | | | | | - Assia Hijazi
- Centre de Biologie du Développement, CNRS UMR 5547, Université de Toulouse UPS, Toulouse, France
| | | | - Fernando Roch
- Centre de Biologie du Développement, CNRS UMR 5547, Université de Toulouse UPS, Toulouse, France
| | - Élio Sucena
- Instituto Gulbenkian de Ciência, Oeiras, Portugal Departamento de Biologia Animal, Faculdade de Ciências, Edifício C2, Universidade de Lisboa, Lisboa, Portugal
| |
Collapse
|
32
|
Rogers RL, Cridland JM, Shao L, Hu TT, Andolfatto P, Thornton KR. Landscape of standing variation for tandem duplications in Drosophila yakuba and Drosophila simulans. Mol Biol Evol 2014; 31:1750-66. [PMID: 24710518 PMCID: PMC4069613 DOI: 10.1093/molbev/msu124] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
We have used whole genome paired-end Illumina sequence data to identify tandem duplications in 20 isofemale lines of Drosophila yakuba and 20 isofemale lines of D. simulans and performed genome wide validation with PacBio long molecule sequencing. We identify 1,415 tandem duplications that are segregating in D. yakuba as well as 975 duplications in D. simulans, indicating greater variation in D. yakuba. Additionally, we observe high rates of secondary deletions at duplicated sites, with 8% of duplicated sites in D. simulans and 17% of sites in D. yakuba modified with deletions. These secondary deletions are consistent with the action of the large loop mismatch repair system acting to remove polymorphic tandem duplication, resulting in rapid dynamics of gain and loss in duplicated alleles and a richer substrate of genetic novelty than has been previously reported. Most duplications are present in only single strains, suggesting that deleterious impacts are common. Drosophila simulans shows larger numbers of whole gene duplications in comparison to larger proportions of gene fragments in D. yakuba. Drosophila simulans displays an excess of high-frequency variants on the X chromosome, consistent with adaptive evolution through duplications on the D. simulans X or demographic forces driving duplicates to high frequency. We identify 78 chimeric genes in D. yakuba and 38 chimeric genes in D. simulans, as well as 143 cases of recruited noncoding sequence in D. yakuba and 96 in D. simulans, in agreement with rates of chimeric gene origination in D. melanogaster. Together, these results suggest that tandem duplications often result in complex variation beyond whole gene duplications that offers a rich substrate of standing variation that is likely to contribute both to detrimental phenotypes and disease, as well as to adaptive evolutionary change.
Collapse
Affiliation(s)
- Rebekah L Rogers
- Department of Ecology and Evolutionary Biology, University of California, Irvine
| | - Julie M Cridland
- Department of Ecology and Evolutionary Biology, University of California, IrvineDepartment of Ecology and Evolutionary Biology, University of California, Davis
| | - Ling Shao
- Department of Ecology and Evolutionary Biology, University of California, Irvine
| | - Tina T Hu
- Department of Ecology and Evolutionary Biology and the Lewis Sigler Institute for Integrative Genomics, Princeton University
| | - Peter Andolfatto
- Department of Ecology and Evolutionary Biology and the Lewis Sigler Institute for Integrative Genomics, Princeton University
| | - Kevin R Thornton
- Department of Ecology and Evolutionary Biology, University of California, Irvine
| |
Collapse
|
33
|
Rippey C, Walsh T, Gulsuner S, Brodsky M, Nord AS, Gasperini M, Pierce S, Spurrell C, Coe BP, Krumm N, Lee MK, Sebat J, McClellan JM, King MC. Formation of chimeric genes by copy-number variation as a mutational mechanism in schizophrenia. Am J Hum Genet 2013; 93:697-710. [PMID: 24094746 PMCID: PMC3791253 DOI: 10.1016/j.ajhg.2013.09.004] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2013] [Revised: 08/15/2013] [Accepted: 09/10/2013] [Indexed: 12/28/2022] Open
Abstract
Chimeric genes can be caused by structural genomic rearrangements that fuse together portions of two different genes to create a novel gene. We hypothesize that brain-expressed chimeras may contribute to schizophrenia. Individuals with schizophrenia and control individuals were screened genome wide for copy-number variants (CNVs) that disrupted two genes on the same DNA strand. Candidate events were filtered for predicted brain expression and for frequency < 0.001 in an independent series of 20,000 controls. Four of 124 affected individuals and zero of 290 control individuals harbored such events (p = 0.002); a 47 kb duplication disrupted MATK and ZFR2, a 58 kb duplication disrupted PLEKHD1 and SLC39A9, a 121 kb duplication disrupted DNAJA2 and NETO2, and a 150 kb deletion disrupted MAP3K3 and DDX42. Each fusion produced a stable protein when exogenously expressed in cultured cells. We examined whether these chimeras differed from their parent genes in localization, regulation, or function. Subcellular localizations of DNAJA2-NETO2 and MAP3K3-DDX42 differed from their parent genes. On the basis of the expression profile of the MATK promoter, MATK-ZFR2 is likely to be far more highly expressed in the brain during development than the ZFR2 parent gene. MATK-ZFR2 includes a ZFR2-derived isoform that we demonstrate localizes preferentially to neuronal dendritic branch sites. These results suggest that the formation of chimeric genes is a mechanism by which CNVs contribute to schizophrenia and that, by interfering with parent gene function, chimeras may disrupt critical brain processes, including neurogenesis, neuronal differentiation, and dendritic arborization.
Collapse
Affiliation(s)
- Caitlin Rippey
- Departments of Medicine and of Genome Sciences, University of Washington, Seattle, WA 98195, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Abstract
Genes are perpetually added to and deleted from genomes during evolution. Thus, it is important to understand how new genes are formed and how they evolve to be critical components of the genetic systems that determine the biological diversity of life. Two decades of effort have shed light on the process of new gene origination and have contributed to an emerging comprehensive picture of how new genes are added to genomes, ranging from the mechanisms that generate new gene structures to the presence of new genes in different organisms to the rates and patterns of new gene origination and the roles of new genes in phenotypic evolution. We review each of these aspects of new gene evolution, summarizing the main evidence for the origination and importance of new genes in evolution. We highlight findings showing that new genes rapidly change existing genetic systems that govern various molecular, cellular, and phenotypic functions.
Collapse
Affiliation(s)
- Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois 60637;
| | | | | | | |
Collapse
|
35
|
Abstract
Monocot chimeric jacalins are a small group of lectins (currently with nine members), each typically consisting of a dirigent domain and a jacalin-related lectin domain. This unique module structure, along with their limited taxonomic distribution and short time window in molecular evolution, makes them a novel family of lectins. Recent studies have shown that these proteins play important roles in plant stress responses and development. Our knowledge of these proteins in functional domain and evolution has also made significant progress.
Collapse
Affiliation(s)
- Qing-Hu Ma
- Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences , Beijing , China
| |
Collapse
|
36
|
Bornberg-Bauer E, Albà MM. Dynamics and adaptive benefits of modular protein evolution. Curr Opin Struct Biol 2013; 23:459-66. [PMID: 23562500 DOI: 10.1016/j.sbi.2013.02.012] [Citation(s) in RCA: 80] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2013] [Revised: 02/15/2013] [Accepted: 02/15/2013] [Indexed: 11/29/2022]
Abstract
During protein evolution, novel domain arrangements are continuously formed. Rearrangements are important for the creation of molecular biodiversity and for functional molecular changes which underlie developmental shifts in the bauplan of organisms. Here we review the mechanisms by which new arrangements arise and the potential benefits of rearrangements. We concentrate on how new domains emerge and why they rapidly spread across genomes, gaining higher copy numbers than older, more established domains. This spread is most likely a consequence of their high adaptive potential but is unlikely to make up on its own for the drastic loss of domains, which is observed across different taxa. We show that a significant portion of the recently emerged domains, especially those in multidomain families, are highly disordered and speculate about the significance of these findings for the evolvability of novel genetic material.
Collapse
Affiliation(s)
- Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, School of Biological Sciences, University of Münster, Hüfferstrasse 1, D48149 Münster, Germany.
| | | |
Collapse
|
37
|
Moore AD, Grath S, Schüler A, Huylmans AK, Bornberg-Bauer E. Quantification and functional analysis of modular protein evolution in a dense phylogenetic tree. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2013; 1834:898-907. [PMID: 23376183 DOI: 10.1016/j.bbapap.2013.01.007] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2012] [Revised: 01/06/2013] [Accepted: 01/09/2013] [Indexed: 12/24/2022]
Abstract
Modularity is a hallmark of molecular evolution. Whether considering gene regulation, the components of metabolic pathways or signaling cascades, the ability to reuse autonomous modules in different molecular contexts can expedite evolutionary innovation. Similarly, protein domains are the modules of proteins, and modular domain rearrangements can create diversity with seemingly few operations in turn allowing for swift changes to an organism's functional repertoire. Here, we assess the patterns and functional effects of modular rearrangements at high resolution. Using a well resolved and diverse group of pancrustaceans, we illustrate arrangement diversity within closely related organisms, estimate arrangement turnover frequency and establish, for the first time, branch-specific rate estimates for fusion, fission, domain addition and terminal loss. Our results show that roughly 16 new arrangements arise per million years and that between 64% and 81% of these can be explained by simple, single-step modular rearrangement events. We find evidence that the frequencies of fission and terminal deletion events increase over time, and that modular rearrangements impact all levels of the cellular signaling apparatus and thus may have strong adaptive potential. Novel arrangements that cannot be explained by simple modular rearrangements contain a significant amount of repeat domains that occur in complex patterns which we term "supra-repeats". Furthermore, these arrangements are significantly longer than those with a single-step rearrangement solution, suggesting that such arrangements may result from multi-step events. In summary, our analysis provides an integrated view and initial quantification of the patterns and functional impact of modular protein evolution in a well resolved phylogenetic tree. This article is part of a Special Issue entitled: The emerging dynamic view of proteins: Protein plasticity in allostery, evolution and self-assembly.
Collapse
Affiliation(s)
- Andrew D Moore
- Institute for Evolution and Biodiversity, Münster, Germany
| | | | | | | | | |
Collapse
|
38
|
Jachiet PA, Pogorelcnik R, Berry A, Lopez P, Bapteste E. MosaicFinder: identification of fused gene families in sequence similarity networks. ACTA ACUST UNITED AC 2013; 29:837-44. [PMID: 23365410 DOI: 10.1093/bioinformatics/btt049] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
MOTIVATION Gene fusion is an important evolutionary process. It can yield valuable information to infer the interactions and functions of proteins. Fused genes have been identified as non-transitive patterns of similarity in triplets of genes. To be computationally tractable, this approach usually imposes an a priori distinction between a dataset in which fused genes are searched for, and a dataset that may have provided genetic material for fusion. This reduces the 'genetic space' in which fusion can be discovered, as only a subset of triplets of genes is investigated. Moreover, this approach may have a high-false-positive rate, and it does not identify gene families descending from a common fusion event. RESULTS We represent similarities between sequences as a network. This leads to an efficient formulation of previous methods of fused gene identification, which we implemented in the Python program FusedTriplets. Furthermore, we propose a new characterization of families of fused genes, as clique minimal separators of the sequence similarity network. This well-studied graph topology provides a robust and fast method of detection, well suited for automatic analyses of big datasets. We implemented this method in the C++ program MosaicFinder, which additionally uses local alignments to discard false-positive candidates and indicates potential fusion points. The grouping into families will help distinguish sequencing or prediction errors from real biological fusions, and it will yield additional insight into the function and history of fused genes. AVAILABILITY FusedTriplets and MosaicFinder are published under the GPL license and are freely available with their source code at this address: http://sourceforge.net/projects/mosaicfinder. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Pierre-Alain Jachiet
- UMR CNRS 7138 Systématique, Adaptation, Evolution, Université Pierre et Marie Curie, 75005 Paris, France
| | | | | | | | | |
Collapse
|
39
|
Schrider DR, Navarro FCP, Galante PAF, Parmigiani RB, Camargo AA, Hahn MW, de Souza SJ. Gene copy-number polymorphism caused by retrotransposition in humans. PLoS Genet 2013; 9:e1003242. [PMID: 23359205 PMCID: PMC3554589 DOI: 10.1371/journal.pgen.1003242] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2012] [Accepted: 11/28/2012] [Indexed: 01/05/2023] Open
Abstract
The era of whole-genome sequencing has revealed that gene copy-number changes caused by duplication and deletion events have important evolutionary, functional, and phenotypic consequences. Recent studies have therefore focused on revealing the extent of variation in copy-number within natural populations of humans and other species. These studies have found a large number of copy-number variants (CNVs) in humans, many of which have been shown to have clinical or evolutionary importance. For the most part, these studies have failed to detect an important class of gene copy-number polymorphism: gene duplications caused by retrotransposition, which result in a new intron-less copy of the parental gene being inserted into a random location in the genome. Here we describe a computational approach leveraging next-generation sequence data to detect gene copy-number variants caused by retrotransposition (retroCNVs), and we report the first genome-wide analysis of these variants in humans. We find that retroCNVs account for a substantial fraction of gene copy-number differences between any two individuals. Moreover, we show that these variants may often result in expressed chimeric transcripts, underscoring their potential for the evolution of novel gene functions. By locating the insertion sites of these duplicates, we are able to show that retroCNVs have had an important role in recent human adaptation, and we also uncover evidence that positive selection may currently be driving multiple retroCNVs toward fixation. Together these findings imply that retroCNVs are an especially important class of polymorphism, and that future studies of copy-number variation should search for these variants in order to illuminate their potential evolutionary and functional relevance. Recent studies of human genetic variation have revealed that, in addition to differing at single nucleotide polymorphisms, individuals differ in copy-number at many regions of the genome. These copy-number variants (CNVs) are caused by duplication or deletion events and often affect functional sequences such as genes. Efforts to reveal the functional impact of CNVs have identified many variants increasing the risk of various disorders, and some that are adaptive. However, these studies mostly fail to detect gene duplications caused by retrotransposition, in which an mRNA transcript is reverse-transcribed and reinserted into the genome, yielding a new intron-less gene copy. Here we describe a method leveraging next-generation sequence data to accurately detect gene copy-number variants caused by retrotransposition, or retroCNVs, and apply this method to hundreds of whole-genome sequences from three different human subpopulations. We find that these variants account for a substantial number of gene copy-number differences between individuals, and that gene retrotransposition may often result in both deleterious and beneficial mutations. Indeed, we present evidence that two of these new gene duplications may be adaptive. These results imply that retroCNVs are an especially important class of CNV and should be included in future studies of human copy-number variation.
Collapse
Affiliation(s)
- Daniel R. Schrider
- Department of Biology and School of Informatics and Computing, Indiana University, Bloomington, Indiana, United States of America
- * E-mail: (DRS); (FCPN)
| | - Fabio C. P. Navarro
- São Paulo Branch, Ludwig Institute for Cancer Research, São Paulo, Brazil
- Departamento de Bioquímica, Universidade de São Paulo, São Paulo, Brazil
- Centro de Oncologia Molecular–Hospital Sírio-Libanês, São Paulo, Brazil
- * E-mail: (DRS); (FCPN)
| | - Pedro A. F. Galante
- São Paulo Branch, Ludwig Institute for Cancer Research, São Paulo, Brazil
- Centro de Oncologia Molecular–Hospital Sírio-Libanês, São Paulo, Brazil
| | - Raphael B. Parmigiani
- São Paulo Branch, Ludwig Institute for Cancer Research, São Paulo, Brazil
- Centro de Oncologia Molecular–Hospital Sírio-Libanês, São Paulo, Brazil
| | - Anamaria A. Camargo
- São Paulo Branch, Ludwig Institute for Cancer Research, São Paulo, Brazil
- Centro de Oncologia Molecular–Hospital Sírio-Libanês, São Paulo, Brazil
| | - Matthew W. Hahn
- Department of Biology and School of Informatics and Computing, Indiana University, Bloomington, Indiana, United States of America
| | - Sandro J. de Souza
- São Paulo Branch, Ludwig Institute for Cancer Research, São Paulo, Brazil
- Brain Institute, Federal University of Rio Grande do Norte, Natal, Brazil
| |
Collapse
|
40
|
Abrusán G, Szilágyi A, Zhang Y, Papp B. Turning gold into 'junk': transposable elements utilize central proteins of cellular networks. Nucleic Acids Res 2013; 41:3190-200. [PMID: 23341038 PMCID: PMC3597677 DOI: 10.1093/nar/gkt011] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
The numerous discovered cases of domesticated transposable element (TE) proteins led to the recognition that TEs are a significant source of evolutionary innovation. However, much less is known about the reverse process, whether and to what degree the evolution of TEs is influenced by the genome of their hosts. We addressed this issue by searching for cases of incorporation of host genes into the sequence of TEs and examined the systems-level properties of these genes using the Saccharomyces cerevisiae and Drosophila melanogaster genomes. We identified 51 cases where the evolutionary scenario was the incorporation of a host gene fragment into a TE consensus sequence, and we show that both the yeast and fly homologues of the incorporated protein sequences have central positions in the cellular networks. An analysis of selective pressure (Ka/Ks ratio) detected significant selection in 37% of the cases. Recent research on retrovirus-host interactions shows that virus proteins preferentially target hubs of the host interaction networks enabling them to take over the host cell using only a few proteins. We propose that TEs face a similar evolutionary pressure to evolve proteins with high interacting capacities and take some of the necessary protein domains directly from their hosts.
Collapse
Affiliation(s)
- György Abrusán
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Center of the Hungarian Academy of Sciences, Temesváry krt. 62. Szeged H-6701, Hungary.
| | | | | | | |
Collapse
|
41
|
Ma QH, Zhen WB, Liu YC. Jacalin domain in wheat jasmonate-regulated protein Ta-JA1 confers agglutinating activity and pathogen resistance. Biochimie 2012; 95:359-65. [PMID: 23116711 DOI: 10.1016/j.biochi.2012.10.014] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2012] [Accepted: 10/08/2012] [Indexed: 01/14/2023]
Abstract
Ta-JA1 is a jacalin-like lectin from wheat (Triticum aestivum) plants. To date, its homologs are only observed in the Gramineae family. Our previous experiments have demonstrated that Ta-JA1 contains a modular structure consisting of an N-terminal dirigent domain and a C-terminal jacalin-related lectin domain (JRL) and this protein exhibits a mannose-specific lectin activity. The over-expression of Ta-JA1 gene provides transgenic plants a broad-spectrum resistance to diseases. Here, we report the differential activities of the dirigent and JRL domains of Ta-JA1. In vitro assay showed that the recombinant JRL domain could effectively agglutinate rabbit erythrocytes and pathogen bacteria Pseudomonas syringe pv tabaci. These hemagglutination activities could be inhibited by mannose but not by galactose. In contrast, the recombinant dirigent domain did not show agglutination activity. Corresponding to these differentiations of activities, similar to full-length of Ta-JA1, the over-expression of JRL domain in transgenic plants also increased resistance to the infection of P. syringe. Unlike JRL, the over-expression of dirigent domain in transgenic plants led to alteration of the seedling sensitivity to salts. In addition, a d(N)/d(S) ratio analysis of Ta-JA1 and its related proteins showed that this protein family functionally limited to a few crop plants, such as maize, rice and wheat.
Collapse
Affiliation(s)
- Qing-Hu Ma
- Key Laboratory of Plant Resources, Institute of Botany, Chinese Academy of Sciences, 20 Nanxincun, Xiangshan, Beijing 100093, China.
| | | | | |
Collapse
|
42
|
Prasad KVSK, Song BH, Olson-Manning C, Anderson JT, Lee CR, Schranz ME, Windsor AJ, Clauss MJ, Manzaneda AJ, Naqvi I, Reichelt M, Gershenzon J, Rupasinghe SG, Schuler MA, Mitchell-Olds T. A gain-of-function polymorphism controlling complex traits and fitness in nature. Science 2012; 337:1081-4. [PMID: 22936775 DOI: 10.1126/science.1221636] [Citation(s) in RCA: 118] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
Identification of the causal genes that control complex trait variation remains challenging, limiting our appreciation of the evolutionary processes that influence polymorphisms in nature. We cloned a quantitative trait locus that controls plant defensive chemistry, damage by insect herbivores, survival, and reproduction in the natural environments where this polymorphism evolved. These ecological effects are driven by duplications in the BCMA (branched-chain methionine allocation) loci controlling this variation and by two selectively favored amino acid changes in the glucosinolate-biosynthetic cytochrome P450 proteins that they encode. These changes cause a gain of novel enzyme function, modulated by allelic differences in catalytic rate and gene copy number. Ecological interactions in diverse environments likely contribute to the widespread polymorphism of this biochemical function.
Collapse
Affiliation(s)
- Kasavajhala V S K Prasad
- Department of Biology, Institute for Genome Sciences and Policy, Duke University, Durham, NC 27708, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Novel genes from formation to function. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2012; 2012:821645. [PMID: 22811949 PMCID: PMC3395120 DOI: 10.1155/2012/821645] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/07/2012] [Accepted: 04/26/2012] [Indexed: 11/29/2022]
Abstract
The study of the evolution of novel genes generally focuses on the formation of new coding sequences. However, equally important in the evolution of novel functional genes are the formation of regulatory regions that allow the expression of the genes and the effects of the new genes in the organism as well. Herein, we discuss the current knowledge on the evolution of novel functional genes, and we examine in more detail the youngest genes discovered. We examine the existing data on a very recent and rapidly evolving cluster of duplicated genes, the Sdic gene cluster. This cluster of genes is an excellent model for the evolution of novel genes, as it is very recent and may still be in the process of evolving.
Collapse
|
44
|
Ranz JM, Parsch J. Newly evolved genes: moving from comparative genomics to functional studies in model systems. How important is genetic novelty for species adaptation and diversification? Bioessays 2012; 34:477-83. [PMID: 22461005 DOI: 10.1002/bies.201100177] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Genes are gained and lost over the course of evolution. A recent study found that over 1,800 new genes have appeared during primate evolution and that an unexpectedly high proportion of these genes are expressed in the human brain. But what are the molecular functions of newly evolved genes and what is their impact on an organism's fitness? The acquisition of new genes may provide a rich source of genetic diversity that fuels evolutionary innovation. Although gene manipulation experiments are not feasible in humans, studies in model organisms, such as Drosophila melanogaster, have shown that new genes can quickly become integrated into genetic networks and become essential for survival or fertility. Future studies of new genes, especially chimeric genes, and their functions will help determine the role of genetic novelty in the adaptation and diversification of species.
Collapse
Affiliation(s)
- José M Ranz
- Department of Ecology and Evolutionary Biology, University of California-Irvine, CA, USA.
| | | |
Collapse
|
45
|
Functional evidence that a recently evolved Drosophila sperm-specific gene boosts sperm competition. Proc Natl Acad Sci U S A 2012; 109:2043-8. [PMID: 22308475 DOI: 10.1073/pnas.1121327109] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In many species, both morphological and molecular traits related to sex and reproduction evolve faster in males than in females. Ultimately, rapid male evolution relies on the acquisition of genetic variation associated with differential reproductive success. Many newly evolved genes are associated with novel functions that might enhance male fitness. However, functional evidence of the adaptive role of recently originated genes in males is still lacking. The Sperm dynein intermediate chain multigene family, which encodes a Sperm dynein intermediate chain presumably involved in sperm motility, originated from complex genetic rearrangements in the lineage that leads to Drosophila melanogaster within the last 5.4 million years since its split from Drosophila simulans. We deleted all the members of this multigene family resident on the X chromosome of D. melanogaster by chromosome engineering and found that, although the deletion does not result in a reduction of progeny number, it impairs the competence of the sperm in the presence of sperm from wild-type males. Therefore, the Sperm dynein intermediate chain multigene family contributes to the differential reproductive success among males and illustrates precisely how quickly a new gene function can be incorporated into the genetic network of a species.
Collapse
|
46
|
Kersting AR, Bornberg-Bauer E, Moore AD, Grath S. Dynamics and adaptive benefits of protein domain emergence and arrangements during plant genome evolution. Genome Biol Evol 2012; 4:316-29. [PMID: 22250127 PMCID: PMC3318442 DOI: 10.1093/gbe/evs004] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Plant genomes are generally very large, mostly paleopolyploid, and have numerous gene duplicates and complex genomic features such as repeats and transposable elements. Many of these features have been hypothesized to enable plants, which cannot easily escape environmental challenges, to rapidly adapt. Another mechanism, which has recently been well described as a major facilitator of rapid adaptation in bacteria, animals, and fungi but not yet for plants, is modular rearrangement of protein-coding genes. Due to the high precision of profile-based methods, rearrangements can be well captured at the protein level by characterizing the emergence, loss, and rearrangements of protein domains, their structural, functional, and evolutionary building blocks. Here, we study the dynamics of domain rearrangements and explore their adaptive benefit in 27 plant and 3 algal genomes. We use a phylogenomic approach by which we can explain the formation of 88% of all arrangements by single-step events, such as fusion, fission, and terminal loss of domains. We find many domains are lost along every lineage, but at least 500 domains are novel, that is, they are unique to green plants and emerged more or less recently. These novel domains duplicate and rearrange more readily within their genomes than ancient domains and are overproportionally involved in stress response and developmental innovations. Novel domains more often affect regulatory proteins and show a higher degree of structural disorder than ancient domains. Whereas a relatively large and well-conserved core set of single-domain proteins exists, long multi-domain arrangements tend to be species-specific. We find that duplicated genes are more often involved in rearrangements. Although fission events typically impact metabolic proteins, fusion events often create new signaling proteins essential for environmental sensing. Taken together, the high volatility of single domains and complex arrangements in plant genomes demonstrate the importance of modularity for environmental adaptability of plants.
Collapse
Affiliation(s)
- Anna R Kersting
- Evolutionary Bioinformatics Group, Institute for Evolution and Biodiversity, University of Muenster (WWU), Germany
| | | | | | | |
Collapse
|