101
|
Yin H, Li M, Xia L, He C, Zhang Z. Computational determination of gene age and characterization of evolutionary dynamics in human. Brief Bioinform 2019; 20:2141-2149. [PMID: 30184145 DOI: 10.1093/bib/bby074] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2018] [Revised: 08/01/2018] [Accepted: 08/02/2018] [Indexed: 12/23/2022] Open
Abstract
Genes originate at different evolutionary time scales and possess different ages, accordingly presenting diverse functional characteristics and reflecting distinct adaptive evolutionary innovations. In the past decades, progresses have been made in gene age identification by a variety of methods that are principally based on comparative genomics. Here we summarize methods for computational determination of gene age and evaluate the effectiveness of different computational methods for age identification. Our results show that improved age determination can be achieved by combining homolog clustering with phylogeny inference, which enables more accurate age identification in human genes. Accordingly, we characterize evolutionary dynamics of human genes based on an extremely long evolutionary time scale spanning ~4,000 million years from archaea/bacteria to human, revealing that young genes are clustered on certain chromosomes and that Mendelian disease genes (including monogenic disease and polygenic disease genes) and cancer genes exhibit divergent evolutionary origins. Taken together, deciphering genes' ages as well as their evolutionary dynamics is of fundamental significance in unveiling the underlying mechanisms during evolution and better understanding how young or new genes become indispensable integrants coupled with novel phenotypes and biological diversity.
Collapse
Affiliation(s)
- Hongyan Yin
- Hainan Key Laboratory for Sustainable Utilization of Tropical Bioresources, Institute of Tropical Agriculture and Forestry, Hainan University, China
| | - Mengwei Li
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Lin Xia
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Chaozu He
- Hainan Key Laboratory for Sustainable Utilization of Tropical Bioresources, Institute of Tropical Agriculture and Forestry, Hainan University, China
| | - Zhang Zhang
- BIG Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
102
|
Zhou Y, Zhang C. Evolutionary patterns of chimeric retrogenes in Oryza species. Sci Rep 2019; 9:17733. [PMID: 31776387 PMCID: PMC6881317 DOI: 10.1038/s41598-019-54085-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Accepted: 10/30/2019] [Indexed: 11/23/2022] Open
Abstract
Chimeric retroposition is a process by which RNA is reverse transcribed and the resulting cDNA is integrated into the genome along with flanking sequences. This process plays essential roles and drives genome evolution. Although the origination rates of chimeric retrogenes are high in plant genomes, the evolutionary patterns of the retrogenes and their parental genes are relatively uncharacterised in the rice genome. In this study, we evaluated the substitution ratio of 24 retrogenes and their parental genes to clarify their evolutionary patterns. The results indicated that seven gene pairs were under positive selection. Additionally, soon after new chimeric retrogenes were formed, they rapidly evolved. However, an unexpected pattern was also revealed. Specifically, after an undefined period following the formation of new chimeric retrogenes, the parental genes, rather than the new chimeric retrogenes, rapidly evolved under positive selection. We also observed that one retro chimeric gene (RCG3) was highly expressed in infected calli, whereas its parental gene was not. Finally, a comparison of our Ka/Ks analysis with that of other species indicated that the proportion of genes under positive selection is greater for chimeric retrogenes than for non-chimeric retrogenes in the rice genome.
Collapse
Affiliation(s)
- Yanli Zhou
- The Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, No. 132 Lanhei Road, Kunming, 650201, Yunnan, China
| | - Chengjun Zhang
- The Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, No. 132 Lanhei Road, Kunming, 650201, Yunnan, China. .,Haiyan Engineering & Technology Center, Kunming Institute of Botany, Chinese Academy of Science, Jiaxing, 314300, Zhejiang, China.
| |
Collapse
|
103
|
Sessegolo C, Cruaud C, Da Silva C, Cologne A, Dubarry M, Derrien T, Lacroix V, Aury JM. Transcriptome profiling of mouse samples using nanopore sequencing of cDNA and RNA molecules. Sci Rep 2019; 9:14908. [PMID: 31624302 PMCID: PMC6797730 DOI: 10.1038/s41598-019-51470-9] [Citation(s) in RCA: 56] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Accepted: 09/28/2019] [Indexed: 01/27/2023] Open
Abstract
Our vision of DNA transcription and splicing has changed dramatically with the introduction of short-read sequencing. These high-throughput sequencing technologies promised to unravel the complexity of any transcriptome. Generally gene expression levels are well-captured using these technologies, but there are still remaining caveats due to the limited read length and the fact that RNA molecules had to be reverse transcribed before sequencing. Oxford Nanopore Technologies has recently launched a portable sequencer which offers the possibility of sequencing long reads and most importantly RNA molecules. Here we generated a full mouse transcriptome from brain and liver using the Oxford Nanopore device. As a comparison, we sequenced RNA (RNA-Seq) and cDNA (cDNA-Seq) molecules using both long and short reads technologies and tested the TeloPrime preparation kit, dedicated to the enrichment of full-length transcripts. Using spike-in data, we confirmed that expression levels are efficiently captured by cDNA-Seq using short reads. More importantly, Oxford Nanopore RNA-Seq tends to be more efficient, while cDNA-Seq appears to be more biased. We further show that the cDNA library preparation of the Nanopore protocol induces read truncation for transcripts containing internal runs of T's. This bias is marked for runs of at least 15 T's, but is already detectable for runs of at least 9 T's and therefore concerns more than 20% of expressed transcripts in mouse brain and liver. Finally, we outline that bioinformatics challenges remain ahead for quantifying at the transcript level, especially when reads are not full-length. Accurate quantification of repeat-associated genes such as processed pseudogenes also remains difficult, and we show that current mapping protocols which map reads to the genome largely over-estimate their expression, at the expense of their parent gene.
Collapse
Affiliation(s)
- Camille Sessegolo
- Univ Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Évolutive UMR5558, F-69622, Villeurbanne, France
- EPI ERABLE - Inria Grenoble, Rhône-Alpes, France
| | - Corinne Cruaud
- Genoscope, Institut de biologie François-Jacob, Commissariat a l'Energie Atomique (CEA), Université Paris-Saclay, F-91057, Evry, France
| | - Corinne Da Silva
- Genoscope, Institut de biologie François-Jacob, Commissariat a l'Energie Atomique (CEA), Université Paris-Saclay, F-91057, Evry, France
| | - Audric Cologne
- Univ Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Évolutive UMR5558, F-69622, Villeurbanne, France
- EPI ERABLE - Inria Grenoble, Rhône-Alpes, France
| | - Marion Dubarry
- Genoscope, Institut de biologie François-Jacob, Commissariat a l'Energie Atomique (CEA), Université Paris-Saclay, F-91057, Evry, France
| | - Thomas Derrien
- Univ Rennes, CNRS, IGDR (Institut de génétique et développement de Rennes) - UMR 6290, F-35000, Rennes, France
| | - Vincent Lacroix
- Univ Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Évolutive UMR5558, F-69622, Villeurbanne, France
- EPI ERABLE - Inria Grenoble, Rhône-Alpes, France
| | - Jean-Marc Aury
- Genoscope, Institut de biologie François-Jacob, Commissariat a l'Energie Atomique (CEA), Université Paris-Saclay, F-91057, Evry, France.
| |
Collapse
|
104
|
Gerdol M, Greco S, Pallavicini A. Extensive Tandem Duplication Events Drive the Expansion of the C1q-Domain-Containing Gene Family in Bivalves. Mar Drugs 2019; 17:md17100583. [PMID: 31615007 PMCID: PMC6835236 DOI: 10.3390/md17100583] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Revised: 10/11/2019] [Accepted: 10/12/2019] [Indexed: 01/04/2023] Open
Abstract
C1q-domain-containing (C1qDC) proteins are rapidly emerging as key players in the innate immune response of bivalve mollusks. Growing experimental evidence suggests that these highly abundant secretory proteins are involved in the recognition of microbe-associated molecular patterns, serving as lectin-like molecules in the bivalve proto-complement system. While a large amount of functional data concerning the binding specificity of the globular head C1q domain and on the regulation of these molecules in response to infection are quickly accumulating, the genetic mechanisms that have led to the extraordinary lineage-specific expansion of the C1qDC gene family in bivalves are still largely unknown. The analysis of the chromosome-scale genome assembly of the Eastern oyster Crassostrea virginica revealed that the 476 oyster C1qDC genes, far from being uniformly distributed along the genome, are located in large clusters of tandemly duplicated paralogs, mostly found on chromosomes 7 and 8. Our observations point out that the evolutionary process behind the development of a large arsenal of C1qDC lectin-like molecules in marine bivalves is still ongoing and likely based on an unequal crossing over.
Collapse
Affiliation(s)
- Marco Gerdol
- Department of Life Sciences, University of Trieste, 34127 Trieste, Italy.
| | - Samuele Greco
- Department of Life Sciences, University of Trieste, 34127 Trieste, Italy.
| | - Alberto Pallavicini
- Department of Life Sciences, University of Trieste, 34127 Trieste, Italy.
- National Institute of Oceanography and Applied Geophysics, 34151 Trieste, Italy.
| |
Collapse
|
105
|
Whole-genome sequencing reveals recent and frequent genetic recombination between clonal lineages of Cryphonectria parasitica in western Europe. Fungal Genet Biol 2019; 130:122-133. [DOI: 10.1016/j.fgb.2019.06.002] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2018] [Revised: 04/30/2019] [Accepted: 06/04/2019] [Indexed: 02/07/2023]
|
106
|
Plebanek A, Larnerd C, Popović M, Wei C, Pohorille A, Ditzler MA. Big on Change, Small on Innovation: Evolutionary Consequences of RNA Sequence Duplication. J Mol Evol 2019; 87:240-253. [PMID: 31435687 PMCID: PMC6711949 DOI: 10.1007/s00239-019-09906-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Accepted: 08/06/2019] [Indexed: 01/11/2023]
Abstract
The potential for biopolymers to evolve new structures has important consequences for their ability to optimize function and our attempts to reconstruct their evolutionary histories. Prior work with in vitro systems suggests that structural remodeling of RNA is difficult to achieve through the accumulation of point mutations or through recombination events. Sequence duplication may represent an alternative mechanism that can more readily lead to the evolution of new structures. Structural and sequence elements in many RNAs and proteins appear to be the products of duplication events, indicating that this mechanism plays a major role in molecular evolution. Despite the potential significance of this mechanism, little experimental data is available concerning the structural and evolutionary consequences of duplicating biopolymer sequences. To assess the structural consequences of sequence duplication on the evolution of RNA, we mutagenized an RNA sequence containing two copies of an ATP aptamer and subjected the resulting population to multiple in vitro evolution experiments. We identified multiple routes by which duplication, followed by the accumulation of functional point mutations, allowed our populations to sample two entirely different secondary structures. The two structures have no base pairs in common, but both structures contain two copies of the same ATP-binding motif. We do not observe the emergence of any other functional secondary structures beyond these two. Although this result suggests a limited capacity for duplication to support short-term functional innovation, major changes in secondary structure, like the one observed here, should be given careful consideration as they are likely to frustrate attempts to infer deep evolutionary histories of functional RNAs.
Collapse
Affiliation(s)
- Andrew Plebanek
- Exobiology Branch, Space Science and Astrobiology Division, NASA Ames Research Center, Bldg N239 Mail Stop 239-4, Moffett Field, CA, 94035, USA.,Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94143, USA
| | - Caleb Larnerd
- NASA Internship Program, NASA Ames Research Center, Moffett Field, CA, 94035, USA
| | - Milena Popović
- Exobiology Branch, Space Science and Astrobiology Division, NASA Ames Research Center, Bldg N239 Mail Stop 239-4, Moffett Field, CA, 94035, USA.,Center for the Emergence of Life, NASA Ames Research Center, Moffett Field, CA, 94035, USA.,Blue Marble Space Institute of Science, Seattle, WA, 98145, USA
| | - Chenyu Wei
- Exobiology Branch, Space Science and Astrobiology Division, NASA Ames Research Center, Bldg N239 Mail Stop 239-4, Moffett Field, CA, 94035, USA.,Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94143, USA.,Center for the Emergence of Life, NASA Ames Research Center, Moffett Field, CA, 94035, USA
| | - Andrew Pohorille
- Exobiology Branch, Space Science and Astrobiology Division, NASA Ames Research Center, Bldg N239 Mail Stop 239-4, Moffett Field, CA, 94035, USA.,Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, 94143, USA.,Center for the Emergence of Life, NASA Ames Research Center, Moffett Field, CA, 94035, USA
| | - Mark A Ditzler
- Exobiology Branch, Space Science and Astrobiology Division, NASA Ames Research Center, Bldg N239 Mail Stop 239-4, Moffett Field, CA, 94035, USA. .,Center for the Emergence of Life, NASA Ames Research Center, Moffett Field, CA, 94035, USA.
| |
Collapse
|
107
|
Chatron N, Cassinari K, Quenez O, Baert-Desurmont S, Bardel C, Buisine MP, Calpena E, Capri Y, Corominas Galbany J, Diguet F, Edery P, Isidor B, Labalme A, Le Caignec C, Lévy J, Lecoquierre F, Lindenbaum P, Pichon O, Rollat-Farnier PA, Simonet T, Saugier-Veber P, Tabet AC, Toutain A, Wilkie AOM, Lesca G, Sanlaville D, Nicolas G, Schluth-Bolard C. Identification of mobile retrocopies during genetic testing: Consequences for routine diagnosis. Hum Mutat 2019; 40:1993-2000. [PMID: 31230393 DOI: 10.1002/humu.23845] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Revised: 05/29/2019] [Accepted: 06/17/2019] [Indexed: 12/24/2022]
Abstract
Human retrocopies, that is messenger RNA transcripts benefitting from the long interspersed element 1 machinery for retrotransposition, may have specific consequences for genomic testing. Next genetration sequencing (NGS) techniques allow the detection of such mobile elements but they may be misinterpreted as genomic duplications or be totally overlooked. We report eight observations of retrocopies detected during diagnostic NGS analyses of targeted gene panels, exome, or genome sequencing. For seven cases, while an exons-only copy number gain was called, read alignment inspection revealed a depth of coverage shift at every exon-intron junction where indels were also systematically called. Moreover, aberrant chimeric read pairs spanned entire introns or were paired with another locus for terminal exons. The 8th retrocopy was present in the reference genome and thus showed a normal NGS profile. We emphasize the existence of retrocopies and strategies to accurately detect them at a glance during genetic testing and discuss pitfalls for genetic testing.
Collapse
Affiliation(s)
- Nicolas Chatron
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,GENDEV Team, CRNL, INSERM U1028, CNRS UMR5292, UCBL1, Lyon, France
| | - Kevin Cassinari
- Department of Genetics and CNR-MAJ, Normandie Univ, UNIROUEN, Inserm U1245 and Rouen University Hospital, F 76000, Normandy Center for Genomic and Personalized Medicine, Rouen, France
| | - Olivier Quenez
- Department of Genetics and CNR-MAJ, Normandie Univ, UNIROUEN, Inserm U1245 and Rouen University Hospital, F 76000, Normandy Center for Genomic and Personalized Medicine, Rouen, France
| | - Stéphanie Baert-Desurmont
- Department of Genetics, Normandie Univ, UNIROUEN, Inserm U1245 and Rouen University Hospital, F 76000, Normandy Center for Genomic and Personalized Medicine, Rouen, France
| | - Claire Bardel
- Bioinformatics group of the Lyon University Hospital NGS facility, Groupement Hospitalier Est, Lyon, France.,Biostatistics and Bioinformatics Department, HCL, Lyon, France
| | - Marie-Pierre Buisine
- Department of Biochemistry and Molecular Biology, JPA Research Center, Inserm UMR-S 1172, Lille University, Lille University Hospital, Lille, France
| | - Eduardo Calpena
- Clinical Genetics Group, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Yline Capri
- Genetics Department, Clinical Genetics Unit, Hôpital Universitaire Robert Debré, Paris, France
| | | | - Flavie Diguet
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,GENDEV Team, CRNL, INSERM U1028, CNRS UMR5292, UCBL1, Lyon, France
| | - Patrick Edery
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,GENDEV Team, CRNL, INSERM U1028, CNRS UMR5292, UCBL1, Lyon, France
| | | | - Audrey Labalme
- Genetics Department, Hospices Civils de Lyon, Lyon, France
| | - Cedric Le Caignec
- Genetics Department, CHU Nantes, Nantes, France.,INSERM UMR_S915, Institut du thorax, Nantes University, Nantes, France
| | - Jonathan Lévy
- Genetics Department, Cytogenetics Unit, Hôpital Universitaire Robert Debré, Paris, France
| | - François Lecoquierre
- Department of Genetics, Normandie Univ, UNIROUEN, Inserm U1245 and Rouen University Hospital, F 76000, Normandy Center for Genomic and Personalized Medicine, Rouen, France
| | - Pierre Lindenbaum
- INSERM, UMR_S1087, Institut du thorax, Nantes, France.,CNRS, UMR 6291, Nantes, France
| | | | - Pierre-Antoine Rollat-Farnier
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,Bioinformatics group of the Lyon University Hospital NGS facility, Groupement Hospitalier Est, Lyon, France
| | - Thomas Simonet
- Cellular Biotechnology Center, Hospices Civils de Lyon, Lyon, France.,Nerve-Muscle Interactions Team, Institut NeuroMyoGène CNRS UMR 5310-INSERM U1217-Université Claude Bernard Lyon 1, Lyon, France
| | - Pascale Saugier-Veber
- Department of Genetics, Normandie Univ, UNIROUEN, Inserm U1245 and Rouen University Hospital, F 76000, Normandy Center for Genomic and Personalized Medicine, Rouen, France
| | - Anne-Claude Tabet
- Genetics Department, Cytogenetics Unit, Hôpital Universitaire Robert Debré, Paris, France.,Neuroscience Department, Human Genetics and Cognitive Function Unit, Institut Pasteur, Paris, France
| | - Annick Toutain
- Genetics Department, Hôpital Bretonneau, CHU, Tours, France.,UMR 1253, iBrain, Tours University, Inserm, Tours, France
| | - Andrew O M Wilkie
- Clinical Genetics Group, MRC Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Gaetan Lesca
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,GENDEV Team, CRNL, INSERM U1028, CNRS UMR5292, UCBL1, Lyon, France
| | - Damien Sanlaville
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,GENDEV Team, CRNL, INSERM U1028, CNRS UMR5292, UCBL1, Lyon, France
| | - Gaël Nicolas
- Department of Genetics and CNR-MAJ, Normandie Univ, UNIROUEN, Inserm U1245 and Rouen University Hospital, F 76000, Normandy Center for Genomic and Personalized Medicine, Rouen, France
| | - Caroline Schluth-Bolard
- Genetics Department, Hospices Civils de Lyon, Lyon, France.,GENDEV Team, CRNL, INSERM U1028, CNRS UMR5292, UCBL1, Lyon, France
| |
Collapse
|
108
|
Zhang JY, Zhou Q. On the Regulatory Evolution of New Genes Throughout Their Life History. Mol Biol Evol 2019; 36:15-27. [PMID: 30395322 DOI: 10.1093/molbev/msy206] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Every gene has a birthplace and an age, that is, a cis-regulatory environment and an evolution lifespan since its origination, yet how the two shape the evolution trajectories of genes remains unclear. Here, we address this basic question by comparing phylogenetically dated new genes in the context of both their ages and origination mechanisms. In both Drosophila and vertebrates, we confirm a clear "out of the testis" transition from the specifically expressed young genes to the broadly expressed old housekeeping genes, observed only in testis but not in other tissues. Many new genes have gained important functions during embryogenesis, manifested as either specific activation at maternal-zygotic transition, or different spatiotemporal expressions from their parental genes. These expression patterns are largely driven by an age-dependent evolution of cis-regulatory environment. We discover that retrogenes are more frequently born in a pre-existing repressive regulatory domain, and are more diverged in their enhancer repertoire than the DNA-based gene duplications. During evolution, new gene duplications gradually gain active histone modifications and undergo more enhancer turnovers when becoming older, but exhibit complex trends of gaining or losing repressive histone modifications in Drosophila or vertebrates, respectively. Interestingly, vertebrate new genes exhibit an "into the testis" epigenetic transition that older genes become more likely to be co-occupied by both active and repressive ("bivalent") histone modifications specifically in testis. Our results uncover the regulatory mechanisms underpinning the stepwise acquisition of novel and complex functions by new genes, and illuminate the general evolution trajectory of genes throughout their life history.
Collapse
Affiliation(s)
- Jia-Yu Zhang
- MOE Key Laboratory of Biosystems Homeostasis & Protection, Life Sciences Institute, Zhejiang University, Hangzhou, China
| | - Qi Zhou
- MOE Key Laboratory of Biosystems Homeostasis & Protection, Life Sciences Institute, Zhejiang University, Hangzhou, China.,Department of Molecular Evolution and Development, University of Vienna, Vienna, Austria
| |
Collapse
|
109
|
Phenotypic Effects of FGF4 Retrogenes on Intervertebral Disc Disease in Dogs. Genes (Basel) 2019; 10:genes10060435. [PMID: 31181696 PMCID: PMC6627552 DOI: 10.3390/genes10060435] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Revised: 06/04/2019] [Accepted: 06/05/2019] [Indexed: 01/31/2023] Open
Abstract
Two FGF4 retrogenes on chromosomes 12 (12-FGF4RG) and 18 (18-FGF4RG) contribute to short-limbed phenotypes in dogs. 12-FGF4RG has also been associated with intervertebral disc disease (IVDD). Both of these retrogenes were found to be widespread among dog breeds with allele frequencies ranging from 0.02 to 1; however, their additive contribution to disease is unknown. Surgical cases of IVDD (n = 569) were evaluated for age of onset, disc calcification, and genotypes for the FGF4 retrogenes. Multivariable linear regression analysis identified the presence of one or two copies of 12-FGF4RG associated with significantly younger age at first surgery in a dominant manner. 18-FGF4RG had only a minor effect in dogs with one copy. Multivariable logistic regression showed that 12-FGF4RG had an additive effect on radiographic disc calcification, while 18-FGF4RG had no effect. Multivariable logistic regression using mixed breed cases and controls identified only 12-FGF4RG as highly associated with disc herniation in a dominant manner (Odds Ratio, OR, 18.42, 95% Confidence Interval (CI) 7.44 to 50.26; p < 0.001). The relative risk for disc surgery associated with 12-FGF4RG varied from 5.5 to 15.1 within segregating breeds and mixed breeds. The FGF4 retrogene on CFA12 acts in a dominant manner to decrease the age of onset and increase the overall risk of disc disease in dogs. Other modifiers of risk may be present within certain breeds, including the FGF4 retrogene on CFA18.
Collapse
|
110
|
Jiang X, Assis R. Rapid functional divergence after small-scale gene duplication in grasses. BMC Evol Biol 2019; 19:97. [PMID: 31046675 PMCID: PMC6498639 DOI: 10.1186/s12862-019-1415-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Accepted: 03/31/2019] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND Gene duplication has played an important role in the evolution and domestication of flowering plants. Yet little is known about how plant duplicate genes evolve and are retained over long timescales, particularly those arising from small-scale duplication (SSD) rather than whole-genome duplication (WGD) events. RESULTS We address this question in the Poaceae (grass) family by analyzing gene expression data from nine tissues of Brachypodium distachyon, Oryza sativa japonica (rice), and Sorghum bicolor (sorghum). Consistent with theoretical predictions, expression profiles of most grass genes are conserved after SSD, suggesting that functional conservation is the primary outcome of SSD in grasses. However, we also uncover support for widespread functional divergence, much of which occurs asymmetrically via the process of neofunctionalization. Moreover, neofunctionalization preferentially targets younger (child) duplicate gene copies, is associated with RNA-mediated duplication, and occurs quickly after duplication. Further analysis reveals that functional divergence of SSD-derived genes is positively correlated with both sequence divergence and tissue specificity in all three grass species, and particularly with anther expression in B. distachyon. CONCLUSIONS Our results suggest that SSD-derived grass genes often undergo rapid functional divergence that may be driven by natural selection on male-specific phenotypes. These observations are consistent with those in several animal species, suggesting that duplicate genes take similar evolutionary trajectories in plants and animals.
Collapse
Affiliation(s)
- Xueyuan Jiang
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Raquel Assis
- Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, USA.
- Department of Biology, Pennsylvania State University, University Park, PA, USA.
| |
Collapse
|
111
|
Nagasawa T, Kawaguchi M, Yano T, Isoyama S, Yasumasu S, Okabe M. Translocation of promoter-conserved hatching enzyme genes with intron-loss provides a new insight in the role of retrocopy during teleostean evolution. Sci Rep 2019; 9:2448. [PMID: 30792427 PMCID: PMC6385490 DOI: 10.1038/s41598-019-38693-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2018] [Accepted: 12/17/2018] [Indexed: 11/12/2022] Open
Abstract
The hatcing enzyme gene (HE) encodes a protease that is indispensable for the hatching process and is conserved during vertebrate evolution. During teleostean evolution, it is known that HE experienced a drastic transfiguration of gene structure, namely, losing all of its introns. However, these facts are contradiction with each other, since intron-less genes typically lose their original promoter because of duplication via mature mRNA, called retrocopy. Here, using a comparative genomic assay, we showed that HEs have changed their genomic location several times, with the evolutionary timings of these translocations being identical to those of intron-loss. We further showed that HEs maintain the promoter sequence upstream of them after translocation. Therefore, teleostean HEs are unique genes which have changed intra- (exon-intron) and extra-genomic structure (genomic loci) several times, although their indispensability for the reproductive process of hatching implies that HE genes are translocated by retrocopy with their promoter sequence.
Collapse
Affiliation(s)
- Tatsuki Nagasawa
- Department of Anatomy, The Jikei University School of Medicine, 3-25-8 Nishishimbashi, Minato-ku, Tokyo, 105-8461, Japan.,Research Fellow of the Japan Society for the Promotion of Science (JSPS), Tokyo, 102-0083, Japan.,Department of Materials and Life Sciences, Faculty of Science and Technology, Sophia University, 7-1 Kioi-cho, Chiyoda-ku, Tokyo, 102-8554, Japan
| | - Mari Kawaguchi
- Department of Materials and Life Sciences, Faculty of Science and Technology, Sophia University, 7-1 Kioi-cho, Chiyoda-ku, Tokyo, 102-8554, Japan
| | - Tohru Yano
- Department of Anatomy, The Jikei University School of Medicine, 3-25-8 Nishishimbashi, Minato-ku, Tokyo, 105-8461, Japan
| | - Sho Isoyama
- Department of Materials and Life Sciences, Faculty of Science and Technology, Sophia University, 7-1 Kioi-cho, Chiyoda-ku, Tokyo, 102-8554, Japan
| | - Shigeki Yasumasu
- Department of Materials and Life Sciences, Faculty of Science and Technology, Sophia University, 7-1 Kioi-cho, Chiyoda-ku, Tokyo, 102-8554, Japan.
| | - Masataka Okabe
- Department of Anatomy, The Jikei University School of Medicine, 3-25-8 Nishishimbashi, Minato-ku, Tokyo, 105-8461, Japan
| |
Collapse
|
112
|
Hou Y, Ji N, Zhang H, Shi X, Han H, Lin S. Genome size-dependent pcna gene copy number in dinoflagellates and molecular evidence of retroposition as a major evolutionary mechanism. JOURNAL OF PHYCOLOGY 2019; 55:37-46. [PMID: 30468510 DOI: 10.1111/jpy.12815] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2018] [Accepted: 08/22/2018] [Indexed: 06/09/2023]
Abstract
Proliferating cell nuclear antigen (PCNA) plays critical roles in eukaryotic DNA replication and replication-associated processes. It is typically encoded by one or two gene copies (pcna) in eukaryotic genomes. Recently reported higher copy numbers of pcna in some dinoflagellates raised a question of how this gene has uniquely evolved in this phylum. Through real-time PCR quantification, we found a wide range of pcna copy number (2-287 copies) in 11 dinoflagellate species (n = 38), and a strong positive correlation between pcna copy number and genome size (log10 -log10 transformed). Intraspecific pcna diverged up to 21% and are dominated by nonsynonymous substitutions, indicating strong purifying selection pressure on and hence functional necessity of this gene. By surveying pcna copy numbers in eukaryotes, we observed a genome size threshold at 4 pg DNA, above which more than two pcna copies are found. To examine whether retrotransposition is a mechanism of pcna duplication, we measured the copy number of retroposed pcna, taking advantage of the 22-nt dinoflagellate-specific spliced leader (DinoSL) capping the 5' end of dinoflagellate nuclear-encoded mRNAs, which would exist in the upstream region of a retroposed gene copy. We found that retroposed pcna copy number increased with total pcna copy number and genome size. These results indicate co-evolution of dinoflagellate pcna copy number with genome size, and retroposition as a major mechanism of pcna duplication in dinoflagellates. Furthermore, we posit that the demand of faithful replication and maintenance of the large dinoflagellate genomes might have favored the preservation of the retroposed pcna as functional genes.
Collapse
Affiliation(s)
- Yubo Hou
- Department of Marine Sciences, University of Connecticut, Groton, Connecticut, 06340, USA
- Department of Biology, Skidmore College, Saratoga Springs, New York, 12866, USA
| | - Nanjing Ji
- State Key Laboratory of Marine Environmental Science and Marine Biodiversity and Global Change Research Center, Xiamen University, Xiamen, Fujian, 361102, China
| | - Huan Zhang
- Department of Marine Sciences, University of Connecticut, Groton, Connecticut, 06340, USA
| | - Xinguo Shi
- State Key Laboratory of Marine Environmental Science and Marine Biodiversity and Global Change Research Center, Xiamen University, Xiamen, Fujian, 361102, China
| | - Hansen Han
- Department of Biology, Skidmore College, Saratoga Springs, New York, 12866, USA
| | - Senjie Lin
- Department of Marine Sciences, University of Connecticut, Groton, Connecticut, 06340, USA
| |
Collapse
|
113
|
Kasahara M, Flajnik MF. Origin and evolution of the specialized forms of proteasomes involved in antigen presentation. Immunogenetics 2019; 71:251-261. [PMID: 30675634 DOI: 10.1007/s00251-019-01105-0] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Accepted: 01/09/2019] [Indexed: 01/10/2023]
Abstract
Proteasomes are a multi-subunit protease complex that produces peptides bound by major histocompatibility complex (MHC) class I molecules. Phylogenetic studies indicate that two specialized forms of proteasomes, immunoproteasomes and thymoproteasomes, and the proteasome activator PA28αβ emerged in a common ancestor of jawed vertebrates which acquired adaptive immunity based on the MHC, T cell receptors, and B cell receptors ~ 500 million years ago. Comparative genomics studies now provide strong evidence that the genes coding for the immunoproteasome subunits emerged by genome-wide duplication. On the other hand, the gene encoding the thymoproteasome subunit β5t emerged by tandem duplication from the gene coding for the β5 subunit. Strikingly, birds lack immunoproteasomes, thymoproteasomes, and the proteasome activator PA28αβ, raising an interesting question of whether they have evolved any compensatory mechanisms.
Collapse
Affiliation(s)
- Masanori Kasahara
- Department of Pathology, Faculty of Medicine and Graduate School of Medicine, Hokkaido University, Sapporo, 060-8638, Japan.
| | - Martin F Flajnik
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD, 21201, USA.
| |
Collapse
|
114
|
Wang GD, Shao XJ, Bai B, Wang J, Wang X, Cao X, Liu YH, Wang X, Yin TT, Zhang SJ, Lu Y, Wang Z, Wang L, Zhao W, Zhang B, Ruan J, Zhang YP. Structural variation during dog domestication: insights from gray wolf and dhole genomes. Natl Sci Rev 2019; 6:110-122. [PMID: 34694297 PMCID: PMC8291444 DOI: 10.1093/nsr/nwy076] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2018] [Revised: 06/27/2018] [Accepted: 07/17/2018] [Indexed: 12/11/2022] Open
Abstract
Several processes like phenotypic evolution, disease susceptibility and environmental adaptations, which fashion the domestication of animals, are largely attributable to structural variations (SVs) in the genome. Here, we present high-quality draft genomes of the gray wolf (Canis lupus) and dhole (Cuon alpinus) with scaffold N50 of 6.04 Mb and 3.96 Mb, respectively. Sequence alignment comprising genomes of three canid species reveals SVs specific to the dog, particularly 16 315 insertions, 2565 deletions, 443 repeats, 16 inversions and 15 translocations. Functional annotation of the dog SVs associated with genes indicates their enrichments in energy metabolisms, neurological processes and immune systems. Interestingly, we identify and verify at population level an insertion fully covering a copy of the AKR1B1 (Aldo-Keto Reductase Family 1 Member B) transcript. Transcriptome analysis reveals a high level of expression of the new AKR1B1 copy in the small intestine and liver, implying an increase in de novo fatty acid synthesis and antioxidant ability in dog compared to gray wolf, likely in response to dietary shifts during the agricultural revolution. For the first time, we report a comprehensive analysis of the evolutionary dynamics of SVs during the domestication step of dogs. Our findings demonstrate that retroposition can birth new genes to facilitate domestication, and affirm the importance of large-scale genomic variants in domestication studies.
Collapse
Affiliation(s)
- Guo-Dong Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China
| | - Xiu-Juan Shao
- Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Bing Bai
- Medical Faculty, Kunming University of Science and Technology, Kunming 650504, China
- Department of Pediatrics, the First People's Hospital of Yunnan Province, Kunming 650032, China
| | - Junlong Wang
- College of Pharmacology, Soochow University, Suzhou 215123, China
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences and Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
| | - Xiaobo Wang
- Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Xue Cao
- Department of Laboratory Animal Science, Kunming Medical University, Kunming 650500, China
| | - Yan-Hu Liu
- Laboratory for Conservation and Utilization of Bio-Resources and Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming 650091, China
| | - Xuan Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
- Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming 650204, China
| | - Ting-Ting Yin
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
- Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming 650204, China
| | - Shao-Jie Zhang
- Laboratory for Conservation and Utilization of Bio-Resources and Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming 650091, China
| | - Yan Lu
- Beijing Zoo, Beijing 100044, China
| | | | - Lu Wang
- Laboratory for Conservation and Utilization of Bio-Resources and Key Laboratory for Microbial Resources of the Ministry of Education, Yunnan University, Kunming 650091, China
| | - Wenming Zhao
- Core Genomic Facility, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Bing Zhang
- Core Genomic Facility, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Jue Ruan
- Agricultural Genomics Institute, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Ya-Ping Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China
| |
Collapse
|
115
|
Chen J, Mortola E, Du X, Zhao S, Liu X. Excess of retrogene traffic in pig X chromosome. Genetica 2018; 147:23-32. [PMID: 30535819 DOI: 10.1007/s10709-018-0048-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Accepted: 11/21/2018] [Indexed: 11/26/2022]
Abstract
Despite the insights into retrogene evolution in multiple species provided by trans-traffic retrogenes (trans-retrogenes), cis-traffic retrogenes (cis-retrogenes) are hitherto understudied and thus limit our understanding of genetic novelty. Here we used the pig genome as a model to compare cis- and trans-retrogenes. We found that cis- and trans-retrogenes have similar patterns in terms of excessive X-chromosome representation, testis expression preference, and metabolic functional enrichment. Despite these similarities, lower levels of selective pressure were found in cis- than in trans-retrogenes. The evolutionary dating evidence revealed that more cis-retrogenes have originated after pig-cattle divergence than trans-retrogenes, suggesting a more recent origination of cis-retrogenes. The gene family clustering confirmed that excesses of X-derived cis-retrogenes may be driven by their rapid expansion in the recent times. Thus, the different selective constraints between cis- and trans-retrogenes may be due to different evolutionary time-scales rather than any differential functional importance. Thus, this study highlights the ongoing functional contributions from cis- and trans-traffic retrogenes during the reshaping of pig genome.
Collapse
Affiliation(s)
- Jianhai Chen
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan, 430070, People's Republic of China
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Emily Mortola
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Xiaoyong Du
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
- College of Informatics, Huazhong Agricultural University, Wuhan, 430070, People's Republic of China
| | - Shuhong Zhao
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan, 430070, People's Republic of China.
| | - Xiangdong Liu
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction, Ministry of Education, College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan, 430070, People's Republic of China.
| |
Collapse
|
116
|
Kovalenko TF, Patrushev LI. Pseudogenes as Functionally Significant Elements of the Genome. BIOCHEMISTRY (MOSCOW) 2018; 83:1332-1349. [PMID: 30482145 DOI: 10.1134/s0006297918110044] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Pseudogene is a gene copy that has lost its original function. For a long time, pseudogenes have been considered as "junk DNA" that inevitably arises as a result of ongoing evolutionary process. However, experimental data obtained during recent years indicate this understanding of the nature of pseudogenes is not entirely correct, and many pseudogenes perform important genetic functions. In the review, we have addressed classification of pseudogenes, methods of their detection in the genome, and the problem of their evolutionary conservatism and prevalence among species belonging to different taxonomic groups in the light of modern data. The mechanisms of gene expression regulation by pseudogenes and the role of pseudogenes in pathogenesis of various human diseases are discussed.
Collapse
Affiliation(s)
- T F Kovalenko
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, 117997, Russia.
| | - L I Patrushev
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, 117997, Russia
| |
Collapse
|
117
|
Armisén D, Rajakumar R, Friedrich M, Benoit JB, Robertson HM, Panfilio KA, Ahn SJ, Poelchau MF, Chao H, Dinh H, Doddapaneni HV, Dugan S, Gibbs RA, Hughes DST, Han Y, Lee SL, Murali SC, Muzny DM, Qu J, Worley KC, Munoz-Torres M, Abouheif E, Bonneton F, Chen T, Chiang LM, Childers CP, Cridge AG, Crumière AJJ, Decaras A, Didion EM, Duncan EJ, Elpidina EN, Favé MJ, Finet C, Jacobs CGC, Cheatle Jarvela AM, Jennings EC, Jones JW, Lesoway MP, Lovegrove MR, Martynov A, Oppert B, Lillico-Ouachour A, Rajakumar A, Refki PN, Rosendale AJ, Santos ME, Toubiana W, van der Zee M, Vargas Jentzsch IM, Lowman AV, Viala S, Richards S, Khila A. The genome of the water strider Gerris buenoi reveals expansions of gene repertoires associated with adaptations to life on the water. BMC Genomics 2018; 19:832. [PMID: 30463532 PMCID: PMC6249893 DOI: 10.1186/s12864-018-5163-2] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2018] [Accepted: 10/14/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Having conquered water surfaces worldwide, the semi-aquatic bugs occupy ponds, streams, lakes, mangroves, and even open oceans. The diversity of this group has inspired a range of scientific studies from ecology and evolution to developmental genetics and hydrodynamics of fluid locomotion. However, the lack of a representative water strider genome hinders our ability to more thoroughly investigate the molecular mechanisms underlying the processes of adaptation and diversification within this group. RESULTS Here we report the sequencing and manual annotation of the Gerris buenoi (G. buenoi) genome; the first water strider genome to be sequenced thus far. The size of the G. buenoi genome is approximately 1,000 Mb, and this sequencing effort has recovered 20,949 predicted protein-coding genes. Manual annotation uncovered a number of local (tandem and proximal) gene duplications and expansions of gene families known for their importance in a variety of processes associated with morphological and physiological adaptations to a water surface lifestyle. These expansions may affect key processes associated with growth, vision, desiccation resistance, detoxification, olfaction and epigenetic regulation. Strikingly, the G. buenoi genome contains three insulin receptors, suggesting key changes in the rewiring and function of the insulin pathway. Other genomic changes affecting with opsin genes may be associated with wavelength sensitivity shifts in opsins, which is likely to be key in facilitating specific adaptations in vision for diverse water habitats. CONCLUSIONS Our findings suggest that local gene duplications might have played an important role during the evolution of water striders. Along with these findings, the sequencing of the G. buenoi genome now provides us the opportunity to pursue exciting research opportunities to further understand the genomic underpinnings of traits associated with the extreme body plan and life history of water striders.
Collapse
Affiliation(s)
- David Armisén
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5242, Ecole Normale Supérieure de Lyon 46, allée d’Italie, 69364 Lyon Cedex 07, France
| | - Rajendhran Rajakumar
- Department of Molecular Genetics & Microbiology and UF Genetics Institute, University of Florida, 2033 Mowry Road, Gainesville, FL 32610-3610 USA
| | - Markus Friedrich
- Department of Biological Sciences, Wayne State University, Detroit, MI 48202 USA
| | - Joshua B. Benoit
- Department of Biological Sciences, McMicken College of Arts and Sciences, University of Cincinnati, 318 College Drive, Cincinnati, OH 45221-0006 USA
| | - Hugh M. Robertson
- Department of Entomology, University of Illinois at Urbana-Champaign, Urbana, IL 61801 USA
| | - Kristen A. Panfilio
- Institute for Zoology: Developmental Biology, University of Cologne, Zülpicher Str. 47b, 50674 Cologne, Germany
- School of Life Sciences, University of Warwick, Gibbet Hill Campus, Coventry, CV4 7AL UK
| | - Seung-Joon Ahn
- USDA-ARS Horticultural Crops Research Unit, 3420 NW Orchard Avenue, Corvallis, OR 97330 USA
- Department of Crop and Soil Science, Oregon State University, 3050 SW Campus Way, Corvallis, OR 97331 USA
| | - Monica F. Poelchau
- USDA Agricultural Research Service, National Agricultural Library, Beltsville, MD 20705 USA
| | - Hsu Chao
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Huyen Dinh
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Harsha Vardhan Doddapaneni
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Shannon Dugan
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Richard A. Gibbs
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Daniel S. T. Hughes
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Yi Han
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Sandra L. Lee
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Shwetha C. Murali
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195 USA
| | - Donna M. Muzny
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Jiaxin Qu
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Kim C. Worley
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | | | - Ehab Abouheif
- Department of Biology, McGill University, 1205 Avenue Docteur Penfield Avenue, Montréal, Québec H3A 1B1 Canada
| | - François Bonneton
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5242, Ecole Normale Supérieure de Lyon 46, allée d’Italie, 69364 Lyon Cedex 07, France
| | - Travis Chen
- Department of Biology, McGill University, 1205 Avenue Docteur Penfield Avenue, Montréal, Québec H3A 1B1 Canada
| | - Li-Mei Chiang
- USDA Agricultural Research Service, National Agricultural Library, Beltsville, MD 20705 USA
| | | | - Andrew G. Cridge
- Laboratory for Evolution and Development, Department of Biochemistry, University of Otago, P.O. Box 56, Dunedin, New Zealand
| | - Antonin J. J. Crumière
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5242, Ecole Normale Supérieure de Lyon 46, allée d’Italie, 69364 Lyon Cedex 07, France
| | - Amelie Decaras
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5242, Ecole Normale Supérieure de Lyon 46, allée d’Italie, 69364 Lyon Cedex 07, France
| | - Elise M. Didion
- Department of Biological Sciences, McMicken College of Arts and Sciences, University of Cincinnati, 318 College Drive, Cincinnati, OH 45221-0006 USA
| | - Elizabeth J. Duncan
- Laboratory for Evolution and Development, Department of Biochemistry, University of Otago, P.O. Box 56, Dunedin, New Zealand
- School of Biology, Faculty of Biological Sciences, University of Leeds, Leeds, LS2 9JT UK
| | - Elena N. Elpidina
- A.N. Belozersky Institute of Physico-Chemical Biology, Moscow State University, Moscow, 119991 Russia
| | - Marie-Julie Favé
- Department of Biology, McGill University, 1205 Avenue Docteur Penfield Avenue, Montréal, Québec H3A 1B1 Canada
| | - Cédric Finet
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5242, Ecole Normale Supérieure de Lyon 46, allée d’Italie, 69364 Lyon Cedex 07, France
| | - Chris G. C. Jacobs
- Institute of Biology, Leiden University, Sylviusweg 72, 2333 BE Leiden, Netherlands
- Max Planck Institute for Chemical Ecology, Hans-Knöll Strasse 8, 07745 Jena, Germany
| | | | - Emily C. Jennings
- Department of Biological Sciences, McMicken College of Arts and Sciences, University of Cincinnati, 318 College Drive, Cincinnati, OH 45221-0006 USA
| | - Jeffery W. Jones
- Department of Biological Sciences, Wayne State University, Detroit, MI 48202 USA
| | - Maryna P. Lesoway
- Department of Biology, McGill University, 1205 Avenue Docteur Penfield Avenue, Montréal, Québec H3A 1B1 Canada
- Smithsonian Tropical Research Institute, Apartado Postal 0843-03092, Balboa Ancon, Panama City, Panama
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, 143025 Russia
| | - Mackenzie R. Lovegrove
- Laboratory for Evolution and Development, Department of Biochemistry, University of Otago, P.O. Box 56, Dunedin, New Zealand
| | - Alexander Martynov
- Center of Life Sciences, Skolkovo Institute of Science and Technology, Skolkovo, 143025 Russia
| | - Brenda Oppert
- USDA ARS Center for Grain and Animal Health Research, 1515 College Ave., Manhattan, KS-66502 USA
| | - Angelica Lillico-Ouachour
- Department of Biology, McGill University, 1205 Avenue Docteur Penfield Avenue, Montréal, Québec H3A 1B1 Canada
| | - Arjuna Rajakumar
- Department of Biology, McGill University, 1205 Avenue Docteur Penfield Avenue, Montréal, Québec H3A 1B1 Canada
| | - Peter Nagui Refki
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5242, Ecole Normale Supérieure de Lyon 46, allée d’Italie, 69364 Lyon Cedex 07, France
- Department of Evolutionary Genetics, Max-Planck-Institut für Evolutionsbiologie, August-Thienemann-Straße 2, 24306 Plön, Germany
| | - Andrew J. Rosendale
- Department of Biological Sciences, McMicken College of Arts and Sciences, University of Cincinnati, 318 College Drive, Cincinnati, OH 45221-0006 USA
| | - Maria Emilia Santos
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5242, Ecole Normale Supérieure de Lyon 46, allée d’Italie, 69364 Lyon Cedex 07, France
| | - William Toubiana
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5242, Ecole Normale Supérieure de Lyon 46, allée d’Italie, 69364 Lyon Cedex 07, France
| | - Maurijn van der Zee
- Institute of Biology, Leiden University, Sylviusweg 72, 2333 BE Leiden, Netherlands
| | - Iris M. Vargas Jentzsch
- Institute for Zoology: Developmental Biology, University of Cologne, Zülpicher Str. 47b, 50674 Cologne, Germany
| | - Aidamalia Vargas Lowman
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5242, Ecole Normale Supérieure de Lyon 46, allée d’Italie, 69364 Lyon Cedex 07, France
| | - Severine Viala
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5242, Ecole Normale Supérieure de Lyon 46, allée d’Italie, 69364 Lyon Cedex 07, France
| | - Stephen Richards
- Human Genome Sequencing Center, Department of Human and Molecular Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030 USA
| | - Abderrahman Khila
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5242, Ecole Normale Supérieure de Lyon 46, allée d’Italie, 69364 Lyon Cedex 07, France
| |
Collapse
|
118
|
Werner MS, Sieriebriennikov B, Prabh N, Loschko T, Lanz C, Sommer RJ. Young genes have distinct gene structure, epigenetic profiles, and transcriptional regulation. Genome Res 2018; 28:1675-1687. [PMID: 30232198 PMCID: PMC6211652 DOI: 10.1101/gr.234872.118] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2018] [Accepted: 09/05/2018] [Indexed: 12/22/2022]
Abstract
Species-specific, new, or "orphan" genes account for 10%-30% of eukaryotic genomes. Although initially considered to have limited function, an increasing number of orphan genes have been shown to provide important phenotypic innovation. How new genes acquire regulatory sequences for proper temporal and spatial expression is unknown. Orphan gene regulation may rely in part on origination in open chromatin adjacent to preexisting promoters, although this has not yet been assessed by genome-wide analysis of chromatin states. Here, we combine taxon-rich nematode phylogenies with Iso-Seq, RNA-seq, ChIP-seq, and ATAC-seq to identify the gene structure and epigenetic signature of orphan genes in the satellite model nematode Pristionchus pacificus Consistent with previous findings, we find young genes are shorter, contain fewer exons, and are on average less strongly expressed than older genes. However, the subset of orphan genes that are expressed exhibit distinct chromatin states from similarly expressed conserved genes. Orphan gene transcription is determined by a lack of repressive histone modifications, confirming long-held hypotheses that open chromatin is important for new gene formation. Yet orphan gene start sites more closely resemble enhancers defined by H3K4me1, H3K27ac, and ATAC-seq peaks, in contrast to conserved genes that exhibit traditional promoters defined by H3K4me3 and H3K27ac. Although the majority of orphan genes are located on chromosome arms that contain high recombination rates and repressive histone marks, strongly expressed orphan genes are more randomly distributed. Our results support a model of new gene origination by rare integration into open chromatin near enhancers.
Collapse
Affiliation(s)
- Michael S Werner
- Department of Evolutionary Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Bogdan Sieriebriennikov
- Department of Evolutionary Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Neel Prabh
- Department of Evolutionary Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Tobias Loschko
- Department of Evolutionary Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Christa Lanz
- Department of Evolutionary Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Ralf J Sommer
- Department of Evolutionary Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| |
Collapse
|
119
|
Liu J, Robinson-Rechavi M. Developmental Constraints on Genome Evolution in Four Bilaterian Model Species. Genome Biol Evol 2018; 10:2266-2277. [PMID: 30137380 PMCID: PMC6130771 DOI: 10.1093/gbe/evy177] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/17/2018] [Indexed: 12/12/2022] Open
Abstract
Developmental constraints on genome evolution have been suggested to follow either an early conservation model or an "hourglass" model. Both models agree that late development strongly diverges between species, but debate on which developmental period is the most conserved. Here, based on a modified "Transcriptome Age Index" approach, that is, weighting trait measures by expression level, we analyzed the constraints acting on three evolutionary traits of protein coding genes (strength of purifying selection on protein sequences, phyletic age, and duplicability) in four species: Nematode worm Caenorhabditis elegans, fly Drosophila melanogaster, zebrafish Danio rerio, and mouse Mus musculus. In general, we found that both models can be supported by different genomic properties. Sequence evolution follows an hourglass model, but the evolution of phyletic age and of duplicability follow an early conservation model. Further analyses indicate that stronger purifying selection on sequences in the middle development are driven by temporal pleiotropy of these genes. In addition, we report evidence that expression in late development is enriched with retrogenes, which usually lack efficient regulatory elements. This implies that expression in late development could facilitate transcription of new genes, and provide opportunities for acquisition of function. Finally, in C. elegans, we suggest that dosage imbalance could be one of the main factors that cause depleted expression of high duplicability genes in early development.
Collapse
Affiliation(s)
- Jialin Liu
- Department of Ecology and Evolution, University of Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Marc Robinson-Rechavi
- Department of Ecology and Evolution, University of Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
120
|
Coutinho-Abreu IV, Valenzuela JG. Comparative Evolution of Sand Fly Salivary Protein Families and Implications for Biomarkers of Vector Exposure and Salivary Vaccine Candidates. Front Cell Infect Microbiol 2018; 8:290. [PMID: 30211125 PMCID: PMC6123390 DOI: 10.3389/fcimb.2018.00290] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2018] [Accepted: 07/30/2018] [Indexed: 11/25/2022] Open
Abstract
Sand fly salivary proteins that produce a specific antibody response in humans and animal reservoirs have been shown to be promising biomarkers of sand fly exposure. Furthermore, immunity to sand fly salivary proteins were shown to protect rodents and non-human primates against Leishmania infection. We are missing critical information regarding the divergence amongst sand fly salivary proteins from different sand fly vectors, a knowledge that will support the search of broad or specific salivary biomarkers of vector exposure and those for vaccines components against leishmaniasis. Here, we compare the molecular evolution of the salivary protein families in New World and Old World sand flies from 14 different sand fly vectors. We found that the protein families unique to OW sand flies are more conserved than those unique to NW sand flies regarding both sequence polymorphisms and copy number variation. In addition, the protein families unique to OW sand flies do not display as many conserved cysteine residues as the one unique to the NW group (28.5% in OW vs. 62.5% in NW). Moreover, the expression of specific protein families is restricted to the salivary glands of unique sand fly taxon. For instance, the ParSP15 family is unique to the Larroussius subgenus whereas phospholipase A2 is only expressed in member of Larroussius and Adlerius subgenera. The SP2.5-like family is only expressed in members of the Phlebotomus and Paraphlebotomus subgenera. The sequences shared between OW and NW sand flies have diverged at similar rates (38.7 and 45.3% amino acid divergence, respectively), yet differences in gene copy number were evident across protein families and sand fly species. Overall, this comparative analysis sheds light on the different modes of sand fly salivary protein family divergence. Also, it informs which protein families are unique and conserved within taxon for the choice of taxon-specific biomarkers of vector exposure, as well as those families more conserved across taxa to be used as pan-specific vaccines for leishmaniasis.
Collapse
Affiliation(s)
- Iliano V Coutinho-Abreu
- Vector Molecular Biology Section, Laboratory of Malaria and Vector Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, MD, United States
| | - Jesus G Valenzuela
- Vector Molecular Biology Section, Laboratory of Malaria and Vector Research, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Rockville, MD, United States
| |
Collapse
|
121
|
Schrader L, Schmitz J. The impact of transposable elements in adaptive evolution. Mol Ecol 2018; 28:1537-1549. [PMID: 30003608 DOI: 10.1111/mec.14794] [Citation(s) in RCA: 134] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2018] [Accepted: 07/06/2018] [Indexed: 12/16/2022]
Abstract
The growing knowledge about the influence of transposable elements (TEs) on (a) long-term genome and transcriptome evolution; (b) genomic, transcriptomic and epigenetic variation within populations; and (c) patterns of somatic genetic differences in individuals continues to spur the interest of evolutionary biologists in the role of TEs in adaptive evolution. As TEs can trigger a broad range of molecular variation in a population with potentially severe fitness and phenotypic consequences for individuals, different mechanisms evolved to keep TE activity in check, allowing for a dynamic interplay between the host, its TEs and the environment in evolution. Here, we review evidence for adaptive phenotypic changes associated with TEs and the basic molecular mechanisms by which the underlying genetic changes arise: (a) domestication, (b) exaptation, (c) host gene regulation, (d) TE-mediated formation of intronless gene copies-so-called retrogenes and (e) overall increased genome plasticity. Furthermore, we review and discuss how the stress-dependent incapacitation of defence mechanisms against the activity of TEs might facilitate adaptive responses to environmental challenges and how such mechanisms might be particularly relevant in species frequently facing novel environments, such as invasive, pathogenic or parasitic species.
Collapse
Affiliation(s)
- Lukas Schrader
- Institute for Evolution and Biodiversity (IEB), University of Münster, Münster, Germany
| | - Jürgen Schmitz
- Institute of Experimental Pathology, University of Münster, Münster, Germany
| |
Collapse
|
122
|
Song B, Chen S, Chen W. Dinoflagellates, a Unique Lineage for Retrogene Research. Front Microbiol 2018; 9:1556. [PMID: 30050525 PMCID: PMC6050394 DOI: 10.3389/fmicb.2018.01556] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2018] [Accepted: 06/22/2018] [Indexed: 12/14/2022] Open
Abstract
The birth and evolution of retrogenes have played crucial roles in genome evolution. Dinoflagellates represent a unique lineage for retrogene research because the retrogenes can be reliably identified by the presence of a 22 nucleotide splice leader called DinoSL, which is post-transcriptionally added to the 5' terminus of all mRNAs. Compared to studies of retrogenes conducted in other model genomes, dinoflagellate retrogenes can potentially be more comprehensively characterized because intron-containing retrogenes have already been detected. Unfortunately, dinoflagellate retrogene research has long been neglected. Here, we review the work on dinoflagellate retrogenes and show their distinct character. Like the dinoflagellate genome itself, dinoflagellate retrogenes are also characterized by many unusual features, including a high survival rate and large numbers in the genome. These data are critical complements to what we know about retrogenes, and will further frame our understanding of retroposition and its roles in genome evolution, as well as providing new insights into retrogene studies in other genomes.
Collapse
Affiliation(s)
- Bo Song
- Guangdong Provincial Key Laboratory for Plant Epigenetics, College of Life Sciences and Oceanography, Shenzhen University, Shenzhen, China.,Key Laboratory of Optoelectronic Devices and Systems of Ministry of Education and Guangdong Province, College of Optoelectronic Engineering, Shenzhen University, Shenzhen, China
| | - Sijie Chen
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Wenbin Chen
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| |
Collapse
|
123
|
Hua Z, Doroodian P, Vu W. Contrasting duplication patterns reflect functional diversities of ubiquitin and ubiquitin-like protein modifiers in plants. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2018; 95:296-311. [PMID: 29738099 DOI: 10.1111/tpj.13951] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Revised: 03/29/2018] [Accepted: 04/30/2018] [Indexed: 06/08/2023]
Abstract
Ubiquitin (Ub) and Ub-like proteins, collectively forming the ubiquiton family, regulate nearly all aspects of cellular processes via post-translational modifications. Studies devoted to specific members suggested a large expansion of this family in plants; however, a lack of systematic analysis hinders the comparison of individual members at both evolutionary history and functional divergence levels, which may provide new insight into biological functions. In this work, we first retrieved a total of 5856 members of 17 known ubiquiton subfamilies in 50 plant genomes by searching both prior annotations and missing loci in each genome. We then applied this list to analyze the duplication history of major ubiquiton subfamilies in plants. We show that autophagy-related protein 8 (ATG8), membrane-anchored Ub-fold (MUB), small Ub-like modifier (SUMO) and Ub loci encode 88% of the plant ubiquiton family. Although whole genome duplications (WGDs) significantly expanded the family, we discovered contrasting duplication patterns both in species and in subfamilies. Within the family, the ATG8 and MUB members were primarily duplicated through WGDs, whereas a significant number of Ub and SUMO loci were generated through retroposition and tandem duplications, respectively. Although Ub coding regions are highly conserved in plants, promoter activity analysis demonstrated lineage-specific expression patterns of polyUb genes in Oryza sativa (rice) and Arabidopsis, confirming their retroposition origin. Based on the theory of dosage balance constraints, our study suggests that ubiquiton members duplicated through WGDs play crucial roles in plants, and that the regulatory pathways involving ATG8 and MUB are more conserved than those controlled by Ub and SUMO.
Collapse
Affiliation(s)
- Zhihua Hua
- Department of Environmental and Plant Biology, Ohio University, Athens, OH, 45701, USA
- Interdisciplinary Program in Molecular and Cellular Biology, Ohio University, Athens, OH, 45701, USA
| | - Paymon Doroodian
- Department of Environmental and Plant Biology, Ohio University, Athens, OH, 45701, USA
| | - William Vu
- Department of Environmental and Plant Biology, Ohio University, Athens, OH, 45701, USA
- Department of Chemistry and Biochemistry, Ohio University, Athens, OH, 45701, USA
| |
Collapse
|
124
|
Genes Relocated Between Drosophila Chromosome Arms Evolve Under Relaxed Selective Constraints Relative to Non-Relocated Genes. J Mol Evol 2018; 86:340-352. [PMID: 29926120 DOI: 10.1007/s00239-018-9849-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2018] [Accepted: 06/11/2018] [Indexed: 10/28/2022]
Abstract
Gene duplication creates a second copy of a gene either in tandem to the ancestral locus or dispersed to another chromosomal location. When the ancestral copy of a dispersed duplicate is lost from the genome, it creates the appearance that the gene was "relocated" from the ancestral locus to the derived location. Gene relocations may be as common as canonical dispersed duplications in which both the ancestral and derived copies are retained. Relocated genes appear to be under more selective constraints than the derived copies of canonical duplications, and they are possibly as conserved as single-copy non-relocated genes. To test this hypothesis, we combined comparative genomics, population genetics, gene expression, and functional analyses to assess the selection pressures acting on relocated, duplicated, and non-relocated single-copy genes in Drosophila genomes. We find that relocated genes evolve faster than single-copy non-relocated genes, and there is no evidence that this faster evolution is driven by positive selection. In addition, relocated genes are less essential for viability and male fertility than single-copy non-relocated genes, suggesting that relocated genes evolve fast because of relaxed selective constraints. However, relocated genes evolve slower than the derived copies of canonical dispersed duplicated genes. We therefore conclude that relocated genes are under more selective constraints than canonical duplicates, but are not as conserved as single-copy non-relocated genes.
Collapse
|
125
|
Luo J, Wang Y, Yuan J, Zhao Z, Lu J. MicroRNA duplication accelerates the recruitment of new targets during vertebrate evolution. RNA (NEW YORK, N.Y.) 2018; 24:787-802. [PMID: 29511046 PMCID: PMC5959248 DOI: 10.1261/rna.062752.117] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2017] [Accepted: 03/02/2018] [Indexed: 05/27/2023]
Abstract
The repertoire of miRNAs has considerably expanded during metazoan evolution, and duplication is an important mechanism for generating new functional miRNAs. However, relatively little is known about the functional divergence between paralogous miRNAs and the possible coevolution between duplicated miRNAs and the genomic contexts. By systematically examining small RNA expression profiles across various human tissues and interrogating the publicly available miRNA:mRNA pairing chimeras, we found that changes in expression patterns and targeting preferences are widespread for duplicated miRNAs in vertebrates. Both the empirical interactions and target predictions suggest that evolutionarily conserved homo-seed duplicated miRNAs pair with significantly higher numbers of target sites compared to the single-copy miRNAs. Our birth-and-death evolutionary analysis revealed that the new target sites of miRNAs experienced frequent gains and losses during function development. Our results suggest that a newly emerged target site has a higher probability to be functional and maintained by natural selection if it is paired to a seed shared by multiple paralogous miRNAs rather than being paired to a single-copy miRNA. We experimentally verified the divergence in target repression between two paralogous miRNAs by transfecting let-7a and let-7b mimics into kidney-derived cell lines of four mammalian species and measuring the resulting transcriptome alterations by extensive high-throughput sequencing. Our results also suggest that the gains and losses of let-7 target sites might be associated with the evolution of repressiveness of let-7 across mammalian species.
Collapse
Affiliation(s)
- Junjie Luo
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| | - Yirong Wang
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Jian Yuan
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Zhilei Zhao
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| | - Jian Lu
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| |
Collapse
|
126
|
Song B, Morse D, Song Y, Fu Y, Lin X, Wang W, Cheng S, Chen W, Liu X, Lin S. Comparative Genomics Reveals Two Major Bouts of Gene Retroposition Coinciding with Crucial Periods of Symbiodinium Evolution. Genome Biol Evol 2018; 9:2037-2047. [PMID: 28903461 PMCID: PMC5585692 DOI: 10.1093/gbe/evx144] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/04/2017] [Indexed: 12/21/2022] Open
Abstract
Gene retroposition is an important mechanism of genome evolution but the role it plays in dinoflagellates, a critical player in marine ecosystems, is not known. Until recently, when the genomes of two coral-symbiotic dinoflagellate genomes, Symbiodinium kawagutii and S. minutum, were released, it has not been possible to systematically study these retrogenes. Here we examine the abundant retrogenes (∼23% of the total genes) in these species. The hallmark of retrogenes in the genome is the presence of DCCGTAGCCATTTTGGCTCAAG, a spliced leader (DinoSL) constitutively trans-spliced to the 5′-end of all nucleus-encoded mRNAs. Although the retrogenes have often lost part of the 22-nt DinoSL, the putative promoter motif from the DinoSL, TTT(T/G), is consistently retained in the upstream region of these genes, providing an explanation for the high survival rate of retrogenes in dinoflagellates. Our analysis of DinoSL sequence divergence revealed two major bursts of retroposition in the evolutionary history of Symbiodinium, occurring at ∼60 and ∼6 Ma. Reconstruction of the evolutionary trajectory of the Symbiodinium genomes mapped these 2 times to the origin and rapid radiation of this dinoflagellate lineage, respectively. GO analysis revealed differential functional enrichment of the retrogenes between the two episodes, with a broad impact on transport in the first bout and more localized influence on symbiosis-related processes such as cell adhesion in the second bout. This study provides the first evidence of large-scale retroposition as a major mechanism of genome evolution for any organism and sheds light on evolution of coral symbiosis.
Collapse
Affiliation(s)
- Bo Song
- Guangdong Provincial Key Laboratory for Plant Epigenetics, College of Life Sciences and Oceanography, Shenzhen University, Shenzhen, China.,Key Laboratory of Optoelectronic Devices and Systems of Ministry of Education and Guangdong Province, College of Optoelectronic Engineering, Shenzhen University, Shenzhen, China.,BGI-Shenzhen, Shenzhen, China
| | - David Morse
- Département de Sciences Biologiques, Institut de Recherche en Biologie Végétale, Université de Montréal, Canada
| | - Yue Song
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Yuan Fu
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Xin Lin
- State Key Laboratory of Marine Environmental Science and Marine Biodiversity and Global Change Research Center, Xiamen University, Xiamen, China
| | - Wenliang Wang
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Shifeng Cheng
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Wenbin Chen
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Xin Liu
- BGI-Shenzhen, Shenzhen, China.,China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Senjie Lin
- State Key Laboratory of Marine Environmental Science and Marine Biodiversity and Global Change Research Center, Xiamen University, Xiamen, China.,Department of Marine Sciences, University of Connecticut, Groton
| |
Collapse
|
127
|
Sahebi M, Hanafi MM, van Wijnen AJ, Rice D, Rafii MY, Azizi P, Osman M, Taheri S, Bakar MFA, Isa MNM, Noor YM. Contribution of transposable elements in the plant's genome. Gene 2018; 665:155-166. [PMID: 29684486 DOI: 10.1016/j.gene.2018.04.050] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2017] [Revised: 04/04/2018] [Accepted: 04/18/2018] [Indexed: 12/26/2022]
Abstract
Plants maintain extensive growth flexibility under different environmental conditions, allowing them to continuously and rapidly adapt to alterations in their environment. A large portion of many plant genomes consists of transposable elements (TEs) that create new genetic variations within plant species. Different types of mutations may be created by TEs in plants. Many TEs can avoid the host's defense mechanisms and survive alterations in transposition activity, internal sequence and target site. Thus, plant genomes are expected to utilize a variety of mechanisms to tolerate TEs that are near or within genes. TEs affect the expression of not only nearby genes but also unlinked inserted genes. TEs can create new promoters, leading to novel expression patterns or alternative coding regions to generate alternate transcripts in plant species. TEs can also provide novel cis-acting regulatory elements that act as enhancers or inserts within original enhancers that are required for transcription. Thus, the regulation of plant gene expression is strongly managed by the insertion of TEs into nearby genes. TEs can also lead to chromatin modifications and thereby affect gene expression in plants. TEs are able to generate new genes and modify existing gene structures by duplicating, mobilizing and recombining gene fragments. They can also facilitate cellular functions by sharing their transposase-coding regions. Hence, TE insertions can not only act as simple mutagens but can also alter the elementary functions of the plant genome. Here, we review recent discoveries concerning the contribution of TEs to gene expression in plant genomes and discuss the different mechanisms by which TEs can affect plant gene expression and reduce host defense mechanisms.
Collapse
Affiliation(s)
- Mahbod Sahebi
- Laboratory of Climate-Smart Food Crop Production, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia.
| | - Mohamed M Hanafi
- Laboratory of Climate-Smart Food Crop Production, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia; Laboratory of Plantation Science and Technology, Institute of Plantation Studies, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia; Department of Land Management, Faculty of Agriculture, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia.
| | | | - David Rice
- Department of Molecular Biology & Biotecnology, University of Sheffield, United Kingdom
| | - M Y Rafii
- Laboratory of Climate-Smart Food Crop Production, Institute of Tropical Agriculture and Food Security, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia
| | - Parisa Azizi
- Department of Crop Science, Faculty of Agriculture, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia
| | - Mohamad Osman
- Department of Crop Science, Faculty of Agriculture, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia
| | - Sima Taheri
- Department of Crop Science, Faculty of Agriculture, Universiti Putra Malaysia, 43400 Serdang, Selangor, Malaysia
| | | | | | | |
Collapse
|
128
|
Lokits AD, Indrischek H, Meiler J, Hamm HE, Stadler PF. Tracing the evolution of the heterotrimeric G protein α subunit in Metazoa. BMC Evol Biol 2018; 18:51. [PMID: 29642851 PMCID: PMC5896119 DOI: 10.1186/s12862-018-1147-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Accepted: 03/06/2018] [Indexed: 01/07/2023] Open
Abstract
BACKGROUND Heterotrimeric G proteins are fundamental signaling proteins composed of three subunits, Gα and a Gβγ dimer. The role of Gα as a molecular switch is critical for transmitting and amplifying intracellular signaling cascades initiated by an activated G protein Coupled Receptor (GPCR). Despite their biochemical and therapeutic importance, the study of G protein evolution has been limited to the scope of a few model organisms. Furthermore, of the five primary Gα subfamilies, the underlying gene structure of only two families has been thoroughly investigated outside of Mammalia evolution. Therefore our understanding of Gα emergence and evolution across phylogeny remains incomplete. RESULTS We have computationally identified the presence and absence of every Gα gene (GNA-) across all major branches of Deuterostomia and evaluated the conservation of the underlying exon-intron structures across these phylogenetic groups. We provide evidence of mutually exclusive exon inclusion through alternative splicing in specific lineages. Variations of splice site conservation and isoforms were found for several paralogs which coincide with conserved, putative motifs of DNA-/RNA-binding proteins. In addition to our curated gene annotations, within Primates, we identified 15 retrotranspositions, many of which have undergone pseudogenization. Most importantly, we find numerous deviations from previous findings regarding the presence and absence of individual GNA- genes, nuanced differences in phyla-specific gene copy numbers, novel paralog duplications and subsequent intron gain and loss events. CONCLUSIONS Our curated annotations allow us to draw more accurate inferences regarding the emergence of all Gα family members across Metazoa and to present a new, updated theory of Gα evolution. Leveraging this, our results are critical for gaining new insights into the co-evolution of the Gα subunit and its many protein binding partners, especially therapeutically relevant G protein - GPCR signaling pathways which radiated in Vertebrata evolution.
Collapse
Affiliation(s)
- A. D. Lokits
- 0000 0001 2264 7217grid.152326.1Neuroscience Program, Vanderbilt University, Nashville, TN USA ,0000 0001 2264 7217grid.152326.1Center for Structural Biology, Vanderbilt University, Nashville, TN USA
| | - H. Indrischek
- 0000 0001 2230 9752grid.9647.cBioinformatics Group, Department of Computer Science, Leipzig University, Leipzig, Germany ,0000 0001 2230 9752grid.9647.cComputational EvoDevo Group, Bioinformatics Department, Leipzig University, Leipzig, Germany
| | - J. Meiler
- 0000 0001 2264 7217grid.152326.1Center for Structural Biology, Vanderbilt University, Nashville, TN USA ,0000 0001 2264 7217grid.152326.1Chemistry Department, Vanderbilt University, Nashville, TN USA
| | - H. E. Hamm
- 0000 0004 1936 9916grid.412807.8Pharmacology Department, Vanderbilt University Medical Center, Nashville, TN USA
| | - P. F. Stadler
- 0000 0001 2230 9752grid.9647.cBioinformatics Group, Department of Computer Science, Leipzig University, Leipzig, Germany ,0000 0001 0674 042Xgrid.5254.6Center for non-coding RNA in Technology and Health, University of Copenhagen, Frederiksberg C, Denmark ,0000 0001 2286 1424grid.10420.37Institute for Theoretical Chemistry, University of Vienna, Wien, Austria ,0000 0001 2230 9752grid.9647.cIZBI-Interdisciplinary Center for Bioinformatics and LIFE-Leipzig Research Center for Civilization Diseases and Competence Center for Scalable Data Services and Solutions, University Leipzig, Leipzig, Germany ,grid.419532.8Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany ,0000 0001 1941 1940grid.209665.eSanta Fe Institute, Santa Fe, NM USA
| |
Collapse
|
129
|
Bianconi ME, Dunning LT, Moreno-Villena JJ, Osborne CP, Christin PA. Gene duplication and dosage effects during the early emergence of C4 photosynthesis in the grass genus Alloteropsis. JOURNAL OF EXPERIMENTAL BOTANY 2018; 69:1967-1980. [PMID: 29394370 PMCID: PMC6018922 DOI: 10.1093/jxb/ery029] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2017] [Accepted: 01/17/2018] [Indexed: 05/04/2023]
Abstract
The importance of gene duplication for evolutionary diversification has been mainly discussed in terms of genetic redundancy allowing neofunctionalization. In the case of C4 photosynthesis, which evolved via the co-option of multiple enzymes to boost carbon fixation in tropical conditions, the importance of genetic redundancy has not been consistently supported by genomic studies. Here, we test for a different role for gene duplication in the early evolution of C4 photosynthesis, via dosage effects creating rapid step changes in expression levels. Using genome-wide data for accessions of the grass genus Alloteropsis that recently diversified into different photosynthetic types, we estimate gene copy numbers and demonstrate that recurrent duplications in two important families of C4 genes coincided with increases in transcript abundance along the phylogeny, in some cases via a pure dosage effect. While increased gene copy number during the initial emergence of C4 photosynthesis probably offered a rapid route to enhanced expression, we also find losses of duplicates following the acquisition of genes encoding better-suited isoforms. The dosage effect of gene duplication might therefore act as a transient process during the evolution of a C4 biochemistry, rendered obsolete by the fixation of regulatory mutations increasing expression levels.
Collapse
Affiliation(s)
- Matheus E Bianconi
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, UK
| | - Luke T Dunning
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, UK
| | | | - Colin P Osborne
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, UK
| | | |
Collapse
|
130
|
Jiang F, Liu Q, Wang Y, Zhang J, Wang H, Song T, Yang M, Wang X, Kang L. Comparative genomic analysis of SET domain family reveals the origin, expansion, and putative function of the arthropod-specific SmydA genes as histone modifiers in insects. Gigascience 2018; 6:1-16. [PMID: 28444351 PMCID: PMC5459927 DOI: 10.1093/gigascience/gix031] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2016] [Accepted: 04/19/2017] [Indexed: 02/07/2023] Open
Abstract
The SET domain is an evolutionarily conserved motif present in histone lysine methyltransferases, which are important in the regulation of chromatin and gene expression in animals. In this study, we searched for SET domain–containing genes (SET genes) in all of the 147 arthropod genomes sequenced at the time of carrying out this experiment to understand the evolutionary history by which SET domains have evolved in insects. Phylogenetic and ancestral state reconstruction analysis revealed an arthropod-specific SET gene family, named SmydA, that is ancestral to arthropod animals and specifically diversified during insect evolution. Considering that pseudogenization is the most probable fate of the new emerging gene copies, we provided experimental and evolutionary evidence to demonstrate their essential functions. Fluorescence in situ hybridization analysis and in vitro methyltransferase activity assays showed that the SmydA-2 gene was transcriptionally active and retained the original histone methylation activity. Expression knockdown by RNA interference significantly increased mortality, implying that the SmydA genes may be essential for insect survival. We further showed predominantly strong purifying selection on the SmydA gene family and a potential association between the regulation of gene expression and insect phenotypic plasticity by transcriptome analysis. Overall, these data suggest that the SmydA gene family retains essential functions that may possibly define novel regulatory pathways in insects. This work provides insights into the roles of lineage-specific domain duplication in insect evolution.
Collapse
Affiliation(s)
- Feng Jiang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China
| | - Qing Liu
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China.,State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Yanli Wang
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.,Institute of Applied Biology, Shanxi University, Taiyuan, Shanxi, China
| | - Jie Zhang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China
| | - Huimin Wang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China
| | - Tianqi Song
- Institute of Applied Biology, Shanxi University, Taiyuan, Shanxi, China
| | - Meiling Yang
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Xianhui Wang
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Le Kang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China.,State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
131
|
Prade VM, Gundlach H, Twardziok S, Chapman B, Tan C, Langridge P, Schulman AH, Stein N, Waugh R, Zhang G, Platzer M, Li C, Spannagl M, Mayer KFX. The pseudogenes of barley. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2018; 93:502-514. [PMID: 29205595 DOI: 10.1111/tpj.13794] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2017] [Revised: 11/16/2017] [Accepted: 11/24/2017] [Indexed: 06/07/2023]
Abstract
Pseudogenes have a reputation of being 'evolutionary relics' or 'junk DNA'. While they are well characterized in mammals, studies in more complex plant genomes have so far been hampered by the absence of reference genome sequences. Barley is one of the economically most important cereals and has a genome size of 5.1 Gb. With the first high-quality genome reference assembly available for a Triticeae crop, we conducted a whole-genome assessment of pseudogenes on the barley genome. We identified, characterized and classified 89 440 gene fragments and pseudogenes scattered along the chromosomes, with occasional hotspots and higher densities at the chromosome ends. Full-length pseudogenes (11 015) have preferentially retained their exon-intron structure. Retrotransposition of processed mRNAs only plays a marginal role in their creation. However, the distribution of retroposed pseudogenes reflects the Rabl configuration of barley chromosomes and thus hints at founding mechanisms. While parent genes related to the defense-response were found to be under-represented in cultivated barley, we detected several defense-related pseudogenes in wild barley accessions. The percentage of transcriptionally active pseudogenes is 7.2%, and these may potentially adopt new regulatory roles.The barley genome is rich in pseudogenes and small gene fragments mainly located towards chromosome tips or as tandemly repeated units. Our results indicate non-random duplication and pseudogenization preferences and improve our understanding of the dynamics of gene birth and death in large plant genomes and the mechanisms that lead to evolutionary innovations.
Collapse
Affiliation(s)
- Verena M Prade
- Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
| | - Heidrun Gundlach
- Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
| | - Sven Twardziok
- Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
| | - Brett Chapman
- Centre for Comparative Genomics, Murdoch University, 90 South Street, WA6150, Murdoch, Australia
| | - Cong Tan
- School of Veterinary and Life Sciences, Murdoch University, 90 South Street, WA6150, Murdoch, Australia
| | - Peter Langridge
- School of Agriculture, University of Adelaide, Waite Campus, SA5064, Urrbrae, Australia
| | - Alan H Schulman
- Green Technology, Natural Resources Institute (Luke), Viikki Plant Science Centre, Institute of Biotechnology, University of Helsinki, 00014, Helsinki, Finland
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466, Seeland, Germany
- School of Plant Biology, University of Western Australia, Crawley, WA6009, Australia
| | - Robbie Waugh
- The James Hutton Institute, Dundee, DD2 5DA, UK
- School of Life Sciences, University of Dundee, Dundee, DD2 5DA, UK
| | - Guoping Zhang
- College of Agriculture and Biotechnology, Zhejiang University, Hangzhou, 310058, China
| | - Matthias Platzer
- Leibniz Institute on Aging - Fritz Lipmann Institute (FLI), 07745, Jena, Germany
| | - Chengdao Li
- School of Veterinary and Life Sciences, Murdoch University, 90 South Street, WA6150, Murdoch, Australia
- Department of Agriculture and Food, Government of Western Australia, South Perth, WA, 6151, Australia
| | - Manuel Spannagl
- Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
| | - Klaus F X Mayer
- Plant Genome and Systems Biology, Helmholtz Center Munich - German Research Center for Environmental Health, Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Alte Akademie 8, 85354, Freising, Germany
| |
Collapse
|
132
|
Wang TT, Si FL, He ZB, Chen B. Genome-wide identification, characterization and classification of ionotropic glutamate receptor genes (iGluRs) in the malaria vector Anopheles sinensis (Diptera: Culicidae). Parasit Vectors 2018; 11:34. [PMID: 29334982 PMCID: PMC5769321 DOI: 10.1186/s13071-017-2610-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Accepted: 12/28/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Ionotropic glutamate receptors (iGluRs) are conserved ligand-gated ion channel receptors, and ionotropic receptors (IRs) were revealed as a new family of iGluRs. Their subdivision was unsettled, and their characteristics are little known. Anopheles sinensis is a major malaria vector in eastern Asia, and its genome was recently well sequenced and annotated. METHODS We identified iGluR genes in the An. sinensis genome, analyzed their characteristics including gene structure, genome distribution, domains and specific sites by bioinformatic methods, and deduced phylogenetic relationships of all iGluRs in An. sinensis, Anopheles gambiae and Drosophila melanogaster. Based on the characteristics and phylogenetics, we generated the classification of iGluRs, and comparatively analyzed the intron number and selective pressure of three iGluRs subdivisions, iGluR group, Antenna IR and Divergent IR subfamily. RESULTS A total of 56 iGluR genes were identified and named in the whole-genome of An. sinensis. These genes were located on 18 scaffolds, and 31 of them (29 being IRs) are distributed into 10 clusters that are suggested to form mainly from recent gene duplication. These iGluRs can be divided into four groups: NMDA, non-NMDA, Antenna IR and Divergent IR based on feature comparison and phylogenetic analysis. IR8a and IR25a were suggested to be monophyletic, named as Putative in the study, and moved from the Antenna subfamily in the IR family to the non-NMDA group as a sister of traditional non-NMDA. The generated iGluRs of genes (including NMDA and regenerated non-NMDA) are relatively conserved, and have a more complicated gene structure, smaller ω values and some specific functional sites. The iGluR genes in An. sinensis, An. gambiae and D. melanogaster have amino-terminal domain (ATD), ligand binding domain (LBD) and Lig_Chan domains, except for IR8a that only has the LBD and Lig_Chan domains. However, the new concept IR family of genes (including regenerated Antenna IR, and Divergent IR), especially for Divergent IR are more variable, have a simpler gene structure (intron loss phenomenon) and larger ω values, and lack specific functional sites. These IR genes have no other domains except for Antenna IRs that only have the Lig_Chan domain. CONCLUSIONS This study provides a comprehensive information framework for iGluR genes in An. sinensis, and generated the classification of iGluRs by feature and bioinformatics analyses. The work lays the foundation for further functional study of these genes.
Collapse
Affiliation(s)
- Ting-Ting Wang
- Chongqing Key Laboratory of Vector Insects; Institute of Entomology and Molecular Biology, Chongqing Normal University, Chongqing, People’s Republic of China
| | - Feng-Ling Si
- Chongqing Key Laboratory of Vector Insects; Institute of Entomology and Molecular Biology, Chongqing Normal University, Chongqing, People’s Republic of China
| | - Zheng-Bo He
- Chongqing Key Laboratory of Vector Insects; Institute of Entomology and Molecular Biology, Chongqing Normal University, Chongqing, People’s Republic of China
| | - Bin Chen
- Chongqing Key Laboratory of Vector Insects; Institute of Entomology and Molecular Biology, Chongqing Normal University, Chongqing, People’s Republic of China
| |
Collapse
|
133
|
Kim S, Park J, Yeom SI, Kim YM, Seo E, Kim KT, Kim MS, Lee JM, Cheong K, Shin HS, Kim SB, Han K, Lee J, Park M, Lee HA, Lee HY, Lee Y, Oh S, Lee JH, Choi E, Choi E, Lee SE, Jeon J, Kim H, Choi G, Song H, Lee J, Lee SC, Kwon JK, Lee HY, Koo N, Hong Y, Kim RW, Kang WH, Huh JH, Kang BC, Yang TJ, Lee YH, Bennetzen JL, Choi D. New reference genome sequences of hot pepper reveal the massive evolution of plant disease-resistance genes by retroduplication. Genome Biol 2017; 18:210. [PMID: 29089032 PMCID: PMC5664825 DOI: 10.1186/s13059-017-1341-9] [Citation(s) in RCA: 163] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2017] [Accepted: 10/06/2017] [Indexed: 01/08/2023] Open
Abstract
Background Transposable elements are major evolutionary forces which can cause new genome structure and species diversification. The role of transposable elements in the expansion of nucleotide-binding and leucine-rich-repeat proteins (NLRs), the major disease-resistance gene families, has been unexplored in plants. Results We report two high-quality de novo genomes (Capsicum baccatum and C. chinense) and an improved reference genome (C. annuum) for peppers. Dynamic genome rearrangements involving translocations among chromosomes 3, 5, and 9 were detected in comparison between C. baccatum and the two other peppers. The amplification of athila LTR-retrotransposons, members of the gypsy superfamily, led to genome expansion in C. baccatum. In-depth genome-wide comparison of genes and repeats unveiled that the copy numbers of NLRs were greatly increased by LTR-retrotransposon-mediated retroduplication. Moreover, retroduplicated NLRs are abundant across the angiosperms and, in most cases, are lineage-specific. Conclusions Our study reveals that retroduplication has played key roles for the massive emergence of NLR genes including functional disease-resistance genes in pepper plants. Electronic supplementary material The online version of this article (doi:10.1186/s13059-017-1341-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Seungill Kim
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Jieun Park
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.,Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Seon-In Yeom
- Department of Agricultural Plant Science, Institute of Agriculture & Life Science, Gyeongsang National University, Jinju, 52828, South Korea
| | - Yong-Min Kim
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Eunyoung Seo
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Ki-Tae Kim
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 08826, South Korea
| | - Myung-Shin Kim
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Je Min Lee
- Department of Horticultural Science, Kyungpook National University, Daegu, 41566, South Korea
| | - Kyeongchae Cheong
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea.,Department of Agricultural Biotechnology, Seoul National University, Seoul, 08826, South Korea
| | - Ho-Sub Shin
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Saet-Byul Kim
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Koeun Han
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.,Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Jundae Lee
- Department of Horticulture, Chonbuk National University, Jeonju, 54896, South Korea
| | - Minkyu Park
- Department of Genetics, University of Georgia, Athens, GA, 30602-7223, USA
| | - Hyun-Ah Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Hye-Young Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Youngsill Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Soohyun Oh
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Joo Hyun Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Eunhye Choi
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Eunbi Choi
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - So Eui Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Jongbum Jeon
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Hyunbin Kim
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Gobong Choi
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Hyeunjeong Song
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - JunKi Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Sang-Choon Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Jin-Kyung Kwon
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.,Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Hea-Young Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.,Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Namjin Koo
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Yunji Hong
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Ryan W Kim
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Won-Hee Kang
- Department of Agricultural Plant Science, Institute of Agriculture & Life Science, Gyeongsang National University, Jinju, 52828, South Korea
| | - Jin Hoe Huh
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Byoung-Cheorl Kang
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.,Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Tae-Jin Yang
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Yong-Hwan Lee
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea.,Department of Agricultural Biotechnology, Seoul National University, Seoul, 08826, South Korea
| | | | - Doil Choi
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.
| |
Collapse
|
134
|
Kim S, Park J, Yeom SI, Kim YM, Seo E, Kim KT, Kim MS, Lee JM, Cheong K, Shin HS, Kim SB, Han K, Lee J, Park M, Lee HA, Lee HY, Lee Y, Oh S, Lee JH, Choi E, Choi E, Lee SE, Jeon J, Kim H, Choi G, Song H, Lee J, Lee SC, Kwon JK, Lee HY, Koo N, Hong Y, Kim RW, Kang WH, Huh JH, Kang BC, Yang TJ, Lee YH, Bennetzen JL, Choi D. New reference genome sequences of hot pepper reveal the massive evolution of plant disease-resistance genes by retroduplication. Genome Biol 2017; 18:210. [PMID: 29089032 DOI: 10.1007/s13580-019-00157-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2017] [Revised: 06/19/2019] [Accepted: 10/06/2017] [Indexed: 05/24/2023] Open
Abstract
BACKGROUND Transposable elements are major evolutionary forces which can cause new genome structure and species diversification. The role of transposable elements in the expansion of nucleotide-binding and leucine-rich-repeat proteins (NLRs), the major disease-resistance gene families, has been unexplored in plants. RESULTS We report two high-quality de novo genomes (Capsicum baccatum and C. chinense) and an improved reference genome (C. annuum) for peppers. Dynamic genome rearrangements involving translocations among chromosomes 3, 5, and 9 were detected in comparison between C. baccatum and the two other peppers. The amplification of athila LTR-retrotransposons, members of the gypsy superfamily, led to genome expansion in C. baccatum. In-depth genome-wide comparison of genes and repeats unveiled that the copy numbers of NLRs were greatly increased by LTR-retrotransposon-mediated retroduplication. Moreover, retroduplicated NLRs are abundant across the angiosperms and, in most cases, are lineage-specific. CONCLUSIONS Our study reveals that retroduplication has played key roles for the massive emergence of NLR genes including functional disease-resistance genes in pepper plants.
Collapse
Affiliation(s)
- Seungill Kim
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Jieun Park
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Seon-In Yeom
- Department of Agricultural Plant Science, Institute of Agriculture & Life Science, Gyeongsang National University, Jinju, 52828, South Korea
| | - Yong-Min Kim
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Eunyoung Seo
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Ki-Tae Kim
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 08826, South Korea
| | - Myung-Shin Kim
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Je Min Lee
- Department of Horticultural Science, Kyungpook National University, Daegu, 41566, South Korea
| | - Kyeongchae Cheong
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 08826, South Korea
| | - Ho-Sub Shin
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Saet-Byul Kim
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Koeun Han
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
- Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Jundae Lee
- Department of Horticulture, Chonbuk National University, Jeonju, 54896, South Korea
| | - Minkyu Park
- Department of Genetics, University of Georgia, Athens, GA, 30602-7223, USA
| | - Hyun-Ah Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Hye-Young Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Youngsill Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Soohyun Oh
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Joo Hyun Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Eunhye Choi
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Eunbi Choi
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - So Eui Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Jongbum Jeon
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Hyunbin Kim
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Gobong Choi
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - Hyeunjeong Song
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
| | - JunKi Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Sang-Choon Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Jin-Kyung Kwon
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
- Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Hea-Young Lee
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
- Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Namjin Koo
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Yunji Hong
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Ryan W Kim
- Korean Bioinformation Center, Korea Research Institute of Bioscience and Biotechnology, Daejon, 34141, South Korea
| | - Won-Hee Kang
- Department of Agricultural Plant Science, Institute of Agriculture & Life Science, Gyeongsang National University, Jinju, 52828, South Korea
| | - Jin Hoe Huh
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Byoung-Cheorl Kang
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
- Vegetable Breeding Research Center, Seoul National University, Seoul, 08826, South Korea
| | - Tae-Jin Yang
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea
| | - Yong-Hwan Lee
- Interdisciplinary Program in Agricultural Genomics, Seoul National University, Seoul, 08826, South Korea
- Department of Agricultural Biotechnology, Seoul National University, Seoul, 08826, South Korea
| | | | - Doil Choi
- Department of Plant Science, Plant Genomics and Breeding Institute, Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 08826, South Korea.
| |
Collapse
|
135
|
Tsai ZTY, Lloyd JP, Shiu SH. Defining Functional Genic Regions in the Human Genome through Integration of Biochemical, Evolutionary, and Genetic Evidence. Mol Biol Evol 2017; 34:1788-1798. [PMID: 28398576 DOI: 10.1093/molbev/msx101] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
The human genome is dominated by large tracts of DNA with extensive biochemical activity but no known function. In particular, it is well established that transcriptional activities are not restricted to known genes. However, whether this intergenic transcription represents activity with functional significance or noise is under debate, highlighting the need for an effective method of defining functional genomic regions. Moreover, these discoveries raise the question whether genomic regions can be defined as functional based solely on the presence of biochemical activities, without considering evolutionary (conservation) and genetic (effects of mutations) evidence. Here, computational models integrating genetic, evolutionary, and biochemical evidence are established that provide reliable predictions of human protein-coding and RNA genes. Importantly, in addition to sequence conservation, biochemical features allow accurate predictions of genic sequences with phenotypic evidence under strong purifying selection, suggesting that they can be used as an alternative measure of selection. Moreover, 18.5% of annotated noncoding RNAs exhibit higher degrees of similarity to phenotype genes and, thus, are likely functional. However, 64.5% of noncoding RNAs appear to belong to a sequence class of their own, and the remaining 17% are more similar to pseudogenes and random intergenic sequences that may represent noisy transcription.
Collapse
Affiliation(s)
- Zing Tsung-Yeh Tsai
- Department of Plant Biology, Michigan State University, East Lansing, MI.,Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | - John P Lloyd
- Department of Plant Biology, Michigan State University, East Lansing, MI
| | - Shin-Han Shiu
- Department of Plant Biology, Michigan State University, East Lansing, MI
| |
Collapse
|
136
|
Demars J, Cano M, Drouilhet L, Plisson-Petit F, Bardou P, Fabre S, Servin B, Sarry J, Woloszyn F, Mulsant P, Foulquier D, Carrière F, Aletru M, Rodde N, Cauet S, Bouchez O, Pirson M, Tosser-Klopp G, Allain D. Genome-Wide Identification of the Mutation Underlying Fleece Variation and Discriminating Ancestral Hairy Species from Modern Woolly Sheep. Mol Biol Evol 2017; 34:1722-1729. [PMID: 28379502 PMCID: PMC5455980 DOI: 10.1093/molbev/msx114] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
The composition and structure of fleece variation observed in mammals is a consequence of a strong selective pressure for fiber production after domestication. In sheep, fleece variation discriminates ancestral species carrying a long and hairy fleece from modern domestic sheep (Ovis aries) owning a short and woolly fleece. Here, we report that the “woolly” allele results from the insertion of an antisense EIF2S2 retrogene (called asEIF2S2) into the 3′ UTR of the IRF2BP2 gene leading to an abnormal IRF2BP2 transcript. We provide evidence that this chimeric IRF2BP2/asEIF2S2 messenger 1) targets the genuine sense EIF2S2 RNA and 2) creates a long endogenous double-stranded RNA which alters the expression of both EIF2S2 and IRF2BP2 mRNA. This represents a unique example of a phenotype arising via a RNA-RNA hybrid, itself generated through a retroposition mechanism. Our results bring new insights on the sheep population history thanks to the identification of the molecular origin of an evolutionary phenotypic variation.
Collapse
Affiliation(s)
- Julie Demars
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet-Tolosan, France
| | | | - Laurence Drouilhet
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet-Tolosan, France
| | | | - Philippe Bardou
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet-Tolosan, France.,SIGENAE, INRA, Castanet-Tolosan, France
| | - Stéphane Fabre
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet-Tolosan, France
| | - Bertrand Servin
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet-Tolosan, France
| | - Julien Sarry
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet-Tolosan, France
| | - Florent Woloszyn
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet-Tolosan, France
| | - Philippe Mulsant
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet-Tolosan, France
| | - Didier Foulquier
- Domaine Expérimental de La Fage INRA, Roquefort sur Soulzon, France
| | - Fabien Carrière
- Domaine Expérimental de La Fage INRA, Roquefort sur Soulzon, France
| | - Mathias Aletru
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet-Tolosan, France.,Domaine Expérimental de Langlade INRA, Pompertuzat, France
| | | | | | - Olivier Bouchez
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet-Tolosan, France.,Get-Plage INRA, Castanet-Tolosan, France
| | - Maarten Pirson
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet-Tolosan, France.,Get-Plage INRA, Castanet-Tolosan, France
| | | | - Daniel Allain
- GenPhySE, Université de Toulouse, INRA, INPT, ENVT, Castanet-Tolosan, France
| |
Collapse
|
137
|
Silencing of Transposable Elements by piRNAs in Drosophila: An Evolutionary Perspective. GENOMICS PROTEOMICS & BIOINFORMATICS 2017; 15:164-176. [PMID: 28602845 PMCID: PMC5487533 DOI: 10.1016/j.gpb.2017.01.006] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/11/2016] [Revised: 01/02/2017] [Accepted: 01/12/2017] [Indexed: 11/28/2022]
Abstract
Transposable elements (TEs) are DNA sequences that can move within the genome. TEs have greatly shaped the genomes, transcriptomes, and proteomes of the host organisms through a variety of mechanisms. However, TEs generally disrupt genes and destabilize the host genomes, which substantially reduce fitness of the host organisms. Understanding the genomic distribution and evolutionary dynamics of TEs will greatly deepen our understanding of the TE-mediated biological processes. Most TE insertions are highly polymorphic in Drosophila melanogaster, providing us a good system to investigate the evolution of TEs at the population level. Decades of theoretical and experimental studies have well established “transposition-selection” population genetics model, which assumes that the equilibrium between TE replication and purifying selection determines the copy number of TEs in the genome. In the last decade, P-element-induced wimpy testis (PIWI)-interacting RNAs (piRNAs) were demonstrated to be master repressors of TE activities in Drosophila. The discovery of piRNAs revolutionized our understanding of TE repression, because it reveals that the host organisms have evolved an adaptive mechanism to defend against TE invasion. Tremendous progress has been made to understand the molecular mechanisms by which piRNAs repress active TEs, although many details in this process remain to be further explored. The interaction between piRNAs and TEs well explains the molecular mechanisms underlying hybrid dysgenesis for the I-R and P-M systems in Drosophila, which have puzzled evolutionary biologists for decades. The piRNA repression pathway provides us an unparalleled system to study the co-evolutionary process between parasites and host organisms.
Collapse
|
138
|
Casola C, Betrán E. The Genomic Impact of Gene Retrocopies: What Have We Learned from Comparative Genomics, Population Genomics, and Transcriptomic Analyses? Genome Biol Evol 2017; 9:1351-1373. [PMID: 28605529 PMCID: PMC5470649 DOI: 10.1093/gbe/evx081] [Citation(s) in RCA: 56] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/18/2017] [Indexed: 02/07/2023] Open
Abstract
Gene duplication is a major driver of organismal evolution. Gene retroposition is a mechanism of gene duplication whereby a gene's transcript is used as a template to generate retroposed gene copies, or retrocopies. Intriguingly, the formation of retrocopies depends upon the enzymatic machinery encoded by retrotransposable elements, genomic parasites occurring in the majority of eukaryotes. Most retrocopies are depleted of the regulatory regions found upstream of their parental genes; therefore, they were initially considered transcriptionally incompetent gene copies, or retropseudogenes. However, examples of functional retrocopies, or retrogenes, have accumulated since the 1980s. Here, we review what we have learned about retrocopies in animals, plants and other eukaryotic organisms, with a particular emphasis on comparative and population genomic analyses complemented with transcriptomic datasets. In addition, these data have provided information about the dynamics of the different "life cycle" stages of retrocopies (i.e., polymorphic retrocopy number variants, fixed retropseudogenes and retrogenes) and have provided key insights into the retroduplication mechanisms, the patterns and evolutionary forces at work during the fixation process and the biological function of retrogenes. Functional genomic and transcriptomic data have also revealed that many retropseudogenes are transcriptionally active and a biological role has been experimentally determined for many. Finally, we have learned that not only non-long terminal repeat retroelements but also long terminal repeat retroelements play a role in the emergence of retrocopies across eukaryotes. This body of work has shown that mRNA-mediated duplication represents a widespread phenomenon that produces an array of new genes that contribute to organismal diversity and adaptation.
Collapse
Affiliation(s)
- Claudio Casola
- Department of Ecosystem Science and Management, Texas A&M University, TX
| | - Esther Betrán
- Department of Biology, University of Texas at Arlington, Arlington, TX
| |
Collapse
|
139
|
Steenwyk J, Rokas A. Extensive Copy Number Variation in Fermentation-Related Genes Among Saccharomyces cerevisiae Wine Strains. G3 (BETHESDA, MD.) 2017; 7:1475-1485. [PMID: 28292787 PMCID: PMC5427499 DOI: 10.1534/g3.117.040105] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/02/2017] [Accepted: 03/08/2017] [Indexed: 01/30/2023]
Abstract
Due to the importance of Saccharomyces cerevisiae in wine-making, the genomic variation of wine yeast strains has been extensively studied. One of the major insights stemming from these studies is that wine yeast strains harbor low levels of genetic diversity in the form of single nucleotide polymorphisms (SNPs). Genomic structural variants, such as copy number (CN) variants, are another major type of variation segregating in natural populations. To test whether genetic diversity in CN variation is also low across wine yeast strains, we examined genome-wide levels of CN variation in 132 whole-genome sequences of S. cerevisiae wine strains. We found an average of 97.8 CN variable regions (CNVRs) affecting ∼4% of the genome per strain. Using two different measures of CN diversity, we found that gene families involved in fermentation-related processes such as copper resistance (CUP), flocculation (FLO), and glucose metabolism (HXT), as well as the SNO gene family whose members are expressed before or during the diauxic shift, showed substantial CN diversity across the 132 strains examined. Importantly, these same gene families have been shown, through comparative transcriptomic and functional assays, to be associated with adaptation to the wine fermentation environment. Our results suggest that CN variation is a substantial contributor to the genomic diversity of wine yeast strains, and identify several candidate loci whose levels of CN variation may affect the adaptation and performance of wine yeast strains during fermentation.
Collapse
Affiliation(s)
- Jacob Steenwyk
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee 37235
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee 37235
| |
Collapse
|
140
|
Protein-Coding Genes' Retrocopies and Their Functions. Viruses 2017; 9:v9040080. [PMID: 28406439 PMCID: PMC5408686 DOI: 10.3390/v9040080] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2017] [Revised: 04/07/2017] [Accepted: 04/11/2017] [Indexed: 12/11/2022] Open
Abstract
Transposable elements, often considered to be not important for survival, significantly contribute to the evolution of transcriptomes, promoters, and proteomes. Reverse transcriptase, encoded by some transposable elements, can be used in trans to produce a DNA copy of any RNA molecule in the cell. The retrotransposition of protein-coding genes requires the presence of reverse transcriptase, which could be delivered by either non-long terminal repeat (non-LTR) or LTR transposons. The majority of these copies are in a state of “relaxed” selection and remain “dormant” because they are lacking regulatory regions; however, many become functional. In the course of evolution, they may undergo subfunctionalization, neofunctionalization, or replace their progenitors. Functional retrocopies (retrogenes) can encode proteins, novel or similar to those encoded by their progenitors, can be used as alternative exons or create chimeric transcripts, and can also be involved in transcriptional interference and participate in the epigenetic regulation of parental gene expression. They can also act in trans as natural antisense transcripts, microRNA (miRNA) sponges, or a source of various small RNAs. Moreover, many retrocopies of protein-coding genes are linked to human diseases, especially various types of cancer.
Collapse
|
141
|
Simplification, Innateness, and the Absorption of Meaning from Context: How Novelty Arises from Gradual Network Evolution. Evol Biol 2017; 44:145-189. [PMID: 28572690 PMCID: PMC5429377 DOI: 10.1007/s11692-017-9407-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2016] [Accepted: 01/06/2017] [Indexed: 02/07/2023]
Abstract
How does new genetic information arise? Traditional thinking holds that mutation happens by accident and then spreads in the population by either natural selection or random genetic drift. There have been at least two fundamental conceptual problems with imagining an alternative. First, it seemed that the only alternative is a mutation that responds "smartly" to the immediate environment; but in complex multicellulars, it is hard to imagine how this could be implemented. Second, if there were mechanisms of mutation that "knew" what genetic changes would be favored in a given environment, this would have only begged the question of how they acquired that particular knowledge to begin with. This paper offers an alternative that avoids these problems. It holds that mutational mechanisms act on information that is in the genome, based on considerations of simplicity, parsimony, elegance, etc. (which are different than fitness considerations). This simplification process, under the performance pressure exerted by selection, not only leads to the improvement of adaptations but also creates elements that have the capacity to serve in new contexts they were not originally selected for. Novelty, then, arises at the system level from emergent interactions between such elements. Thus, mechanistically driven mutation neither requires Lamarckian transmission nor closes the door on novelty, because the changes it implements interact with one another globally in surprising and beneficial ways. Finally, I argue, for example, that genes used together are fused together; that simplification leads to complexity; and that evolution and learning are conceptually linked.
Collapse
|
142
|
Venomics: integrative venom proteomics and beyond*. Biochem J 2017; 474:611-634. [DOI: 10.1042/bcj20160577] [Citation(s) in RCA: 111] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2016] [Revised: 12/31/2016] [Accepted: 01/03/2017] [Indexed: 01/15/2023]
Abstract
Venoms are integrated phenotypes that evolved independently in, and are used for predatory and defensive purposes by, a wide phylogenetic range of organisms. The same principles that contribute to the evolutionary success of venoms, contribute to making the study of venoms of great interest in such diverse fields as evolutionary ecology and biotechnology. Evolution is profoundly contingent, and nature also reinvents itself continuosly. Changes in a complex phenotypic trait, such as venom, reflect the influences of prior evolutionary history, chance events, and selection. Reconstructing the natural history of venoms, particularly those of snakes, which will be dealt with in more detail in this review, requires the integration of different levels of knowledge into a meaningful and comprehensive evolutionary framework for separating stochastic changes from adaptive evolution. The application of omics technologies and other disciplines have contributed to a qualitative and quantitative advance in the road map towards this goal. In this review we will make a foray into the world of animal venoms, discuss synergies and complementarities of the different approaches used in their study, and identify current bottlenecks that prevent inferring the evolutionary mechanisms and ecological constraints that molded snake venoms to their present-day variability landscape.
Collapse
|
143
|
Sangrithi MN, Royo H, Mahadevaiah SK, Ojarikre O, Bhaw L, Sesay A, Peters AHFM, Stadler M, Turner JMA. Non-Canonical and Sexually Dimorphic X Dosage Compensation States in the Mouse and Human Germline. Dev Cell 2017; 40:289-301.e3. [PMID: 28132849 PMCID: PMC5300051 DOI: 10.1016/j.devcel.2016.12.023] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2016] [Revised: 11/01/2016] [Accepted: 12/27/2016] [Indexed: 12/02/2022]
Abstract
Somatic X dosage compensation requires two mechanisms: X inactivation balances X gene output between males (XY) and females (XX), while X upregulation, hypothesized by Ohno and documented in vivo, balances X gene with autosomal gene output. Whether X dosage compensation occurs in germ cells is unclear. We show that mouse and human germ cells exhibit non-canonical X dosage states that differ from the soma and between the sexes. Prior to genome-wide reprogramming, X upregulation is present, consistent with Ohno's hypothesis. Subsequently, however, it is erased. In females, erasure follows loss of X inactivation, causing X dosage excess. Conversely, in males, erasure leads to permanent X dosage decompensation. Sex chromosomally abnormal models exhibit a “sex-reversed” X dosage state: XX males, like XX females, develop X dosage excess, while XO females, like XY males, develop X dosage decompensation. Thus, germline X dosage compensation states are determined by X chromosome number, not phenotypic sex. These unexpected differences in X dosage compensation states between germline and soma offer unique perspectives on sex chromosome infertility. X dosage compensation in germ cells is reset during GWR PGCs exhibit X upregulation before GWR, in keeping with Ohno's hypothesis X upregulation is lost during GWR Mouse and human germ cells exhibit X dosage states that are sexually dimorphic
Collapse
Affiliation(s)
- Mahesh N Sangrithi
- Mill Hill Laboratory, The Francis Crick Institute, The Ridgeway, Mill Hill, London NW7 1AA, UK; UCL EGA Institute for Women's Health UCL, Medical School Building, 74 Huntley Street, London WC1E 6AU, UK
| | - Helene Royo
- Friedrich Miescher Institute for Biomedical Research (FMI), 4058 Basel, Switzerland; Swiss Institute of Bioinformatics, 4058 Basel, Switzerland
| | - Shantha K Mahadevaiah
- Mill Hill Laboratory, The Francis Crick Institute, The Ridgeway, Mill Hill, London NW7 1AA, UK
| | - Obah Ojarikre
- Mill Hill Laboratory, The Francis Crick Institute, The Ridgeway, Mill Hill, London NW7 1AA, UK
| | - Leena Bhaw
- Mill Hill Laboratory, The Francis Crick Institute, The Ridgeway, Mill Hill, London NW7 1AA, UK
| | - Abdul Sesay
- Mill Hill Laboratory, The Francis Crick Institute, The Ridgeway, Mill Hill, London NW7 1AA, UK
| | - Antoine H F M Peters
- Friedrich Miescher Institute for Biomedical Research (FMI), 4058 Basel, Switzerland
| | - Michael Stadler
- Friedrich Miescher Institute for Biomedical Research (FMI), 4058 Basel, Switzerland; Swiss Institute of Bioinformatics, 4058 Basel, Switzerland
| | - James M A Turner
- Mill Hill Laboratory, The Francis Crick Institute, The Ridgeway, Mill Hill, London NW7 1AA, UK.
| |
Collapse
|
144
|
Abdelkarim BTM, Maranda V, Drouin G. The fate of retrotransposed processed genes in Arabidopsis thaliana. Gene 2017; 609:1-8. [PMID: 28131821 DOI: 10.1016/j.gene.2017.01.029] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Revised: 01/20/2017] [Accepted: 01/24/2017] [Indexed: 11/15/2022]
Abstract
Processed genes are functional genes that have arisen as a result of the retrotransposition of mRNA molecules. We found 6 genes that generated processed genes in the common ancestor of five Brassicaceae species (Arabidopsis thaliana, Arabidopsis lyrata, Capsella rubella, Brassica rapa and Thellungiella parvula). These processed genes have therefore been kept for at least 30millionyears. Analyses of the Ka/Ks ratio of these genes, and of those having given rise to them, show that they evolve relatively slowly and suggest that the processed genes maintained the same function as that of their parental gene. There is a significant negative correlation between the number of ESTs and transcripts produced and the Ka/Ks ratios of the parental genes but not of the processed genes. This suggests that selection has not yet adapted the selective pressure the processed genes experience to their expression level. However, the A. thaliana processed genes tend to be expressed in the same tissues as that of their parental genes. Furthermore, most have a CAATT-box, a TATA-box and are located about 1kb from another protein-coding gene. Altogether, our results suggest that the processed genes found in the A. thaliana genome have been kept to produce more of the same product, and in the same tissues, as that encoded by their parental gene.
Collapse
Affiliation(s)
- Basma T M Abdelkarim
- Département de biologie et Centre de recherche avancée en génomique environnementale, Université d'Ottawa, Ottawa, Ontario K1N 6N5, Canada
| | - Vincent Maranda
- Département de biologie et Centre de recherche avancée en génomique environnementale, Université d'Ottawa, Ottawa, Ontario K1N 6N5, Canada
| | - Guy Drouin
- Département de biologie et Centre de recherche avancée en génomique environnementale, Université d'Ottawa, Ottawa, Ontario K1N 6N5, Canada.
| |
Collapse
|
145
|
Wang Y. PlantRGDB: A Database of Plant Retrocopied Genes. PLANT & CELL PHYSIOLOGY 2017; 58:e2. [PMID: 28111365 DOI: 10.1093/pcp/pcw210] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 11/17/2016] [Indexed: 05/26/2023]
Abstract
RNA-based gene duplication, known as retrocopy, plays important roles in gene origination and genome evolution. The genomes of many plants have been sequenced, offering an opportunity to annotate and mine the retrocopies in plant genomes. However, comprehensive and unified annotation of retrocopies in these plants is still lacking. In this study I constructed the PlantRGDB (Plant Retrocopied Gene DataBase), the first database of plant retrocopies, to provide a putatively complete centralized list of retrocopies in plant genomes. The database is freely accessible at http://probes.pw.usda.gov/plantrgdb or http://aegilops.wheat.ucdavis.edu/plantrgdb. It currently integrates 49 plant species and 38,997 retrocopies along with characterization information. PlantRGDB provides a user-friendly web interface for searching, browsing and downloading the retrocopies in the database. PlantRGDB also offers graphical viewer-integrated sequence information for displaying the structure of each retrocopy. The attributes of the retrocopies of each species are reported using a browse function. In addition, useful tools, such as an advanced search and BLAST, are available to search the database more conveniently. In conclusion, the database will provide a web platform for obtaining valuable insight into the generation of retrocopies and will supplement research on gene duplication and genome evolution in plants.
Collapse
Affiliation(s)
- Yi Wang
- USDA-ARS, Western Regional Research Center, Crop Improvement and Genetics Research Unit, Albany, CA, USA
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
- USDA-ARS, Plant Gene Expression Center, Albany, CA, USA
| |
Collapse
|
146
|
Farré D, Engel P, Angulo A. Novel Role of 3'UTR-Embedded Alu Elements as Facilitators of Processed Pseudogene Genesis and Host Gene Capture by Viral Genomes. PLoS One 2016; 11:e0169196. [PMID: 28033411 PMCID: PMC5199112 DOI: 10.1371/journal.pone.0169196] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2016] [Accepted: 12/13/2016] [Indexed: 11/19/2022] Open
Abstract
Since the discovery of the high abundance of Alu elements in the human genome, the interest for the functional significance of these retrotransposons has been increasing. Primate Alu and rodent Alu-like elements are retrotransposed by a mechanism driven by the LINE1 (L1) encoded proteins, the same machinery that generates the L1 repeats, the processed pseudogenes (PPs), and other retroelements. Apart from free Alu RNAs, Alus are also transcribed and retrotranscribed as part of cellular gene transcripts, generally embedded inside 3' untranslated regions (UTRs). Despite different proposed hypotheses, the functional implication of the presence of Alus inside 3'UTRs remains elusive. In this study we hypothesized that Alu elements in 3'UTRs could be involved in the genesis of PPs. By analyzing human genome data we discovered that the existence of 3'UTR-embedded Alu elements is overrepresented in genes source of PPs. In contrast, the presence of other retrotransposable elements in 3'UTRs does not show this PP linked overrepresentation. This research was extended to mouse and rat genomes and the results accordingly reveal overrepresentation of 3'UTR-embedded B1 (Alu-like) elements in PP parent genes. Interestingly, we also demonstrated that the overrepresentation of 3'UTR-embedded Alus is particularly significant in PP parent genes with low germline gene expression level. Finally, we provide data that support the hypothesis that the L1 machinery is also the system that herpesviruses, and possibly other large DNA viruses, use to capture host genes expressed in germline or somatic cells. Altogether our results suggest a novel role for Alu or Alu-like elements inside 3'UTRs as facilitators of the genesis of PPs, particularly in lowly expressed genes. Moreover, we propose that this L1-driven mechanism, aided by the presence of 3'UTR-embedded Alus, may also be exploited by DNA viruses to incorporate host genes to their viral genomes.
Collapse
Affiliation(s)
- Domènec Farré
- Immunology Unit, Department of Biomedical Sciences, Medical School, University of Barcelona, Barcelona, Spain
- Institut d’Investigacions Biomèdiques August Pi i Sunyer, Barcelona, Spain
- * E-mail:
| | - Pablo Engel
- Immunology Unit, Department of Biomedical Sciences, Medical School, University of Barcelona, Barcelona, Spain
- Institut d’Investigacions Biomèdiques August Pi i Sunyer, Barcelona, Spain
| | - Ana Angulo
- Immunology Unit, Department of Biomedical Sciences, Medical School, University of Barcelona, Barcelona, Spain
- Institut d’Investigacions Biomèdiques August Pi i Sunyer, Barcelona, Spain
| |
Collapse
|
147
|
Tan S, Cardoso-Moreira M, Shi W, Zhang D, Huang J, Mao Y, Jia H, Zhang Y, Chen C, Shao Y, Leng L, Liu Z, Huang X, Long M, Zhang YE. LTR-mediated retroposition as a mechanism of RNA-based duplication in metazoans. Genome Res 2016; 26:1663-1675. [PMID: 27934698 PMCID: PMC5131818 DOI: 10.1101/gr.204925.116] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2016] [Accepted: 10/18/2016] [Indexed: 01/09/2023]
Abstract
In a broad range of taxa, genes can duplicate through an RNA intermediate in a process mediated by retrotransposons (retroposition). In mammals, L1 retrotransposons drive retroposition, but the elements responsible for retroposition in other animals have yet to be identified. Here, we examined young retrocopies from various animals that still retain the sequence features indicative of the underlying retroposition mechanism. In Drosophila melanogaster, we identified and de novo assembled 15 polymorphic retrocopies and found that all retroposed loci are chimeras of internal retrocopies flanked by discontinuous LTR retrotransposons. At the fusion points between the mRNAs and the LTR retrotransposons, we identified shared short similar sequences that suggest the involvement of microsimilarity-dependent template switches. By expanding our approach to mosquito, zebrafish, chicken, and mammals, we identified in all these species recently originated retrocopies with a similar chimeric structure and shared microsimilarities at the fusion points. We also identified several retrocopies that combine the sequences of two or more parental genes, demonstrating LTR-retroposition as a novel mechanism of exon shuffling. Finally, we found that LTR-mediated retrocopies are immediately cotranscribed with their flanking LTR retrotransposons. Transcriptional profiling coupled with sequence analyses revealed that the sense-strand transcription of the retrocopies often lead to the origination of in-frame proteins relative to the parental genes. Overall, our data show that LTR-mediated retroposition is highly conserved across a wide range of animal taxa; combined with previous work from plants and yeast, it represents an ancient and ongoing mechanism continuously shaping gene content evolution in eukaryotes.
Collapse
Affiliation(s)
- Shengjun Tan
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | | | - Wenwen Shi
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Dan Zhang
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jiawei Huang
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yanan Mao
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Hangxing Jia
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yaqiong Zhang
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Chunyan Chen
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yi Shao
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Liang Leng
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhonghua Liu
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Xun Huang
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois 60637, USA
| | - Yong E Zhang
- Key Laboratory of Zoological Systematics and Evolution and State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
148
|
Xu P, Feuda R, Lu B, Xiao H, Graham RI, Wu K. Functional opsin retrogene in nocturnal moth. Mob DNA 2016; 7:18. [PMID: 27777631 PMCID: PMC5070202 DOI: 10.1186/s13100-016-0074-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2016] [Accepted: 09/21/2016] [Indexed: 11/17/2022] Open
Abstract
Background Retrotransposed genes are different to other types of genes as they originate from a processed mRNA and are then inserted back into the genome. For a long time, the contribution of this mechanism to the origin of new genes, and hence to the evolutionary process, has been questioned as retrogenes usually lose their regulatory sequences upon insertion and generally decay into pseudogenes. In recent years, there is growing evidence, notably in mammals, that retrotransposition is an important process driving the origin of new genes, but the evidence in insects remains largely restricted to a few model species. Findings By sequencing the messenger RNA of three developmental stages (first and fifth instar larvae and adults) of the pest Helicoverpa armigera, we identified a second, intronless, long-wavelength sensitive opsin (that we called LWS2). We then amplified the partial CDS of LWS2 retrogenes from another six noctuid moths, and investigate the phylogenetic distribution of LWS2 in 15 complete Lepidoptera and 1 Trichoptera genomes. Our results suggests that LWS2 evolved within the noctuid. Furthermore, we found that all the LWS2 opsins have an intact ORF, and have an ω-value (ω = 0.08202) relatively higher compared to their paralog LWS1 (ω = 0.02536), suggesting that LWS2 opsins were under relaxed purifying selection. Finally, the LWS2 shows temporal compartmentalization of expression. LWS2 in H. armigera in adult is expressed at a significantly lower level compared to all other opsins in adults; while in the in 1st instar stage larvae, it is expressed at a significantly higher level compared to other opsins. Conclusions Together the results of our evolutionary sequence analyses and gene expression data suggest that LWS2 is a functional gene, however, the relatively low level of expression in adults suggests that LWS2 is most likely not involved in mediating the visual process. Electronic supplementary material The online version of this article (doi:10.1186/s13100-016-0074-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Pengjun Xu
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, No. 2 West Yuan Ming Yuan Road, Beijing, 100193 People's Republic of China.,Tobacco Research Institute, Chinese Academy of Agricultural Sciences, No. 11 Ke Yuan Jing Si Road, Qingdao, 266101 People's Republic of China
| | - Roberto Feuda
- Division of Biology and Biological Engineering, Californian Institute of Technology, Pasadena, CA 91125 USA
| | - Bin Lu
- Department of Herpetology, Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, 610041 People's Republic of China
| | - Haijun Xiao
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, No. 2 West Yuan Ming Yuan Road, Beijing, 100193 People's Republic of China
| | - Robert I Graham
- Crop and Environment Sciences, Harper Adams University, Edgmond, Shropshire TF10 8NB UK
| | - Kongming Wu
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, No. 2 West Yuan Ming Yuan Road, Beijing, 100193 People's Republic of China
| |
Collapse
|
149
|
An Intronless β-amyrin Synthase Gene is More Efficient in Oleanolic Acid Accumulation than its Paralog in Gentiana straminea. Sci Rep 2016; 6:33364. [PMID: 27624821 PMCID: PMC5022052 DOI: 10.1038/srep33364] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Accepted: 08/24/2016] [Indexed: 11/25/2022] Open
Abstract
Paralogous members of the oxidosqualene cyclase (OSC) family encode a diversity of enzymes that are important in triterpenoid biosynthesis. This report describes the isolation of the Gentiana straminea gene GsAS2 that encodes a β-amyrin synthase (βAS) enzyme. Unlike its previously isolated paralog GsAS1, GsAS2 lacks introns. Its predicted protein product was is a 759 residue polypeptide that shares high homology with other known β-amyrin synthases (βASs). Heterologously expressed GsAS2 generates more β-amyrin in yeast than does GsAS1. Constitutive over-expression of GsAS2 resulted in a 5.7 fold increase in oleanolic acid accumulation, while over-expression of GsAS1 led to a 3 fold increase. Additionally, RNAi-directed suppression of GsAS2 and GsAS1 in G. straminea decreased oleonolic acid levels by 65.9% and 21% respectively, indicating that GsAS2 plays a more important role than GsAS1 in oleanolic acid biosynthesis in G. straminea. We uses a docking model to explore the catalytic mechanism of GsAS1/2 and predicted that GsAS2, with its Y560, have higher efficiency than GsAS1 and mutated versions of GsAS2 in β-amyrin produce. When the key residue in GsAS2 was mutagenized, it produced about 41.29% and 71.15% less β-amyrin than native, while the key residue in GsAS1 was mutagenized to that in GsAS2, the mutant produced 38.02% more β-amyrin than native GsAS1.
Collapse
|
150
|
Emergence and evolution of inter-specific segregating retrocopies in cynomolgus monkey (Macaca fascicularis) and rhesus macaque (Macaca mulatta). Sci Rep 2016; 6:32598. [PMID: 27600022 PMCID: PMC5013489 DOI: 10.1038/srep32598] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Accepted: 08/10/2016] [Indexed: 11/13/2022] Open
Abstract
Retroposition is an RNA-mediated mechanism to generate gene duplication, and is believed to play an important role in genome evolution and phenotypic adaptation in various species including primates. Previous studies suggested an elevated rate of recent retroposition in the rhesus macaque genome. To better understand the impact of retroposition on macaque species which have undergone an adaptive radiation approximately 3–6 million years ago, we developed a bioinformatics pipeline to identify recently derived retrocopies in cynomolgus monkeys. As a result, we identified seven experimentally validated young retrocopies, all of which are polymorphic in cynomolgus monkeys. Unexpectedly, five of them are also present in rhesus monkeys and are still segregating. Molecular evolutionary analysis indicates that the observed inter-specific polymorphism is attribute to ancestral polymorphism. Further population genetics analysis provided strong evidence of balancing selection on at least one case (Crab-eating monkey retrocopy 6, or CER6) in both species. CER6 is in adjacent with an immunoglobulin related gene and may be involved in host-pathogen interaction, a well-known target of balancing selection. Altogether, our data support that retroposition is an important force to shape genome evolution and species adaptation.
Collapse
|