1
Sun W, Li M, Wang J. Characteristics of duplicated gene expression and DNA methylation regulation in different tissues of allopolyploid Brassica napus. BMC Plant Biology 2024; 24:518. [PMID: 38851683 PMCID: PMC11162574 DOI: 10.1186/s12870-024-05245-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/04/2024] [Accepted: 06/04/2024] [Indexed: 06/10/2024]
Abstract
Plant polyploidization increases the complexity of epigenomes and transcriptional regulation, resulting in genome evolution and enhanced adaptability. However, few studies have examined the relationship between gene expression and epigenetic modification in different plant tissues after allopolyploidization. In this study, we examined gene expression and DNA methylation patterns in four tissues (stems, leaves, flowers and siliques) of Brassica napus and its diploid progenitors. On this basis, the alternative splicing patterns and cis-trans regulation patterns of the four tissues in B. napus and its diploid progenitors were also analyzed. The number of alternative splicing events in B. napus is higher than that in the diploid progenitors, and the intron retention (IR) type increases the most during allopolyploidization. In addition, we studied the fate changes of duplicated genes after allopolyploidization in B. napus. We found that the fate of most duplicated genes is conserved, but the number of neofunctionalized and specialized genes is also large. The fates of duplicated genes in B. napus were classified according to five duplication types (WGD, PD, DSD, TD, TRD). This study also analyzed the generational transmission of expression and DNA methylation patterns. Our study provides a reference for the fate differentiation of duplicated genes during allopolyploidization.
Affiliation(s)
- Weiqi Sun
  - State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
- Mengdi Li
  - State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
  - Key Laboratory of Resource Biology and Biotechnology in Western China, Ministry of Education, College of Life Sciences, Northwest University, Xi'an, 710069, China
- Jianbo Wang
  - State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
2
Assis R, Conant G, Holland B, Liberles DA, O'Reilly MM, Wilson AE. Models for the retention of duplicate genes and their biological underpinnings. F1000Res 2024; 12:1400. [PMID: 38173826 PMCID: PMC10762295 DOI: 10.12688/f1000research.141786.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Accepted: 02/08/2024] [Indexed: 01/05/2024]
Abstract
Gene content in genomes changes through several different processes, with gene duplication being an important contributor to such changes. Gene duplication occurs over a range of scales from individual genes to whole genomes, and the dynamics of this process can be context dependent. Still, there are rules by which genes are retained or lost from genomes after duplication, and probabilistic modeling has enabled characterization of these rules, including their context-dependence. Here, we describe the biology and corresponding mathematical models that are used to understand duplicate gene retention and its contribution to the set of biochemical functions encoded in a genome.
Affiliation(s)
- Raquel Assis
  - Florida Atlantic University, Boca Raton, Florida, USA
- Gavin Conant
  - North Carolina State University, Raleigh, North Carolina, USA
3
Campelo dos Santos AL, DeGiorgio M, Assis R. Predicting evolutionary targets and parameters of gene deletion from expression data. Bioinformatics Advances 2024; 4:vbae002. [PMID: 38282974 PMCID: PMC10812876 DOI: 10.1093/bioadv/vbae002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/05/2023] [Revised: 12/08/2023] [Accepted: 01/04/2024] [Indexed: 01/30/2024]
Abstract
MOTIVATION Gene deletion is traditionally thought of as a nonadaptive process that removes functional redundancy from genomes, such that it generally receives less attention than duplication in evolutionary turnover studies. Yet, mounting evidence suggests that deletion may promote adaptation via the "less-is-more" evolutionary hypothesis, as it often targets genes harboring unique sequences, expression profiles, and molecular functions. Hence, predicting the relative prevalence of redundant and unique functions among genes targeted by deletion, as well as the parameters underlying their evolution, can shed light on the role of gene deletion in adaptation. RESULTS Here, we present CLOUDe, a suite of machine learning methods for predicting evolutionary targets of gene deletion events from expression data. Specifically, CLOUDe models expression evolution as an Ornstein-Uhlenbeck process, and uses multi-layer neural network, extreme gradient boosting, random forest, and support vector machine architectures to predict whether deleted genes are "redundant" or "unique", as well as several parameters underlying their evolution. We show that CLOUDe boasts high power and accuracy in differentiating between classes, and high accuracy and precision in estimating evolutionary parameters, with optimal performance achieved by its neural network architecture. Application of CLOUDe to empirical data from Drosophila suggests that deletion primarily targets genes with unique functions, with further analysis showing these functions to be enriched for protein deubiquitination. Thus, CLOUDe represents a key advance in learning about the role of gene deletion in functional evolution and adaptation. AVAILABILITY AND IMPLEMENTATION CLOUDe is freely available on GitHub (https://github.com/anddssan/CLOUDe).
Affiliation(s)
- Andre Luiz Campelo dos Santos
  - Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431, United States
- Michael DeGiorgio
  - Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431, United States
- Raquel Assis
  - Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431, United States
  - Institute for Human Health and Disease Intervention, Florida Atlantic University, Boca Raton, FL 33431, United States
4
Piya AA, DeGiorgio M, Assis R. Predicting gene expression divergence between single-copy orthologs in two species. Genome Biol Evol 2023; 15:evad078. [PMID: 37170892 PMCID: PMC10220509 DOI: 10.1093/gbe/evad078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/12/2022] [Revised: 04/21/2023] [Accepted: 05/02/2023] [Indexed: 05/13/2023]
Abstract
Predicting gene expression divergence is integral to understanding the emergence of new biological functions and associated traits. Whereas several sophisticated methods have been developed for this task, their applications are either limited to duplicate genes or require expression data from more than two species. Thus, here we present PiXi, the first machine learning framework for predicting gene expression divergence between single-copy orthologs in two species. PiXi models gene expression evolution as an Ornstein-Uhlenbeck process, and overlays this model with multi-layer neural network, random forest, and support vector machine architectures for making predictions. It outputs the predicted class "conserved" or "diverged" for each pair of orthologs, as well as their predicted expression optima in the two species. We show that PiXi has high power and accuracy in predicting gene expression divergence between single-copy orthologs, as well as high accuracy and precision in estimating their expression optima in the two species, across a wide range of evolutionary scenarios, with the globally best performance achieved by a multi-layer neural network. Moreover, application of our best performing PiXi predictor to empirical gene expression data from single-copy orthologs residing at different loci in two species of Drosophila reveals that approximately 23% underwent expression divergence after positional relocation. Further analysis shows that several of these "diverged" genes are involved in the electron transport chain of the mitochondrial membrane, suggesting that new chromatin environments may impact energy production in Drosophila. Thus, by providing a toolkit for predicting gene expression divergence between single-copy orthologs in two species, PiXi can shed light on the origins of novel phenotypes across diverse biological processes and study systems.
Affiliation(s)
- Antara Anika Piya
  - Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, Florida, USA
- Michael DeGiorgio
  - Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, Florida, USA
- Raquel Assis
  - Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, Florida, USA
  - Institute for Human Health and Disease Intervention, Florida Atlantic University, Boca Raton, Florida, USA
5
Kenchanmane Raju SK, Ledford M, Niederhuth CE. DNA methylation signatures of duplicate gene evolution in angiosperms. Plant Physiology 2023:kiad220. [PMID: 37061825 PMCID: PMC10400039 DOI: 10.1093/plphys/kiad220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/24/2023] [Revised: 03/03/2023] [Accepted: 04/12/2023] [Indexed: 06/19/2023]
Abstract
Gene duplication is a source of evolutionary novelty. DNA methylation may play a role in the evolution of duplicate genes (paralogs) through its association with gene expression. While this relationship has been examined to varying extents in a few individual species, the generalizability of these results at either a broad phylogenetic scale with species of differing duplication histories or across a population remains unknown. We applied a comparative epigenomics approach to 43 angiosperm species across the phylogeny and a population of 928 Arabidopsis (Arabidopsis thaliana) accessions, examining the association of DNA methylation with paralog evolution. Genic DNA methylation was differentially associated with duplication type, the age of duplication, sequence evolution, and gene expression. Whole genome duplicates were typically enriched for CG-only gene-body methylated or unmethylated genes, while single-gene duplications were typically enriched for non-CG methylated or unmethylated genes. Non-CG methylation, in particular, was characteristic of more recent single-gene duplicates. Core angiosperm gene families differentiated into those which preferentially retain paralogs and 'duplication-resistant' families, which convergently reverted to singletons following duplication. Duplication-resistant families that still have paralogous copies were, uncharacteristically for core angiosperm genes, enriched for non-CG methylation. Non-CG methylated paralogs had higher rates of sequence evolution, higher frequency of presence-absence variation, and more limited expression. This suggests that silencing by non-CG methylation may be important to maintaining dosage following duplication and be a precursor to fractionation. Our results indicate that genic methylation marks differing evolutionary trajectories and fates between paralogous genes and has a role in maintaining dosage following duplication.
Affiliation(s)
- Chad E Niederhuth
  - Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
  - AgBioResearch, Michigan State University, East Lansing, MI 48824, USA
6
Jia Y, Xu M, Hu H, Chapman B, Watt C, Buerte B, Han N, Zhu M, Bian H, Li C, Zeng Z. Comparative gene retention analysis in barley, wild emmer, and bread wheat pangenome lines reveals factors affecting gene retention following gene duplication. BMC Biol 2023; 21:25. [PMID: 36747211 PMCID: PMC9903521 DOI: 10.1186/s12915-022-01503-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Received: 03/27/2022] [Accepted: 12/16/2022] [Indexed: 02/08/2023]
Abstract
BACKGROUND Gene duplication is a prevalent phenomenon and a major driving force underlying genome evolution. The process leading to the fixation of gene duplicates following duplication is critical to understanding how genomes evolve but remains only fragmentarily understood. Most previous studies on gene retention are based on gene duplicate analyses in a single reference genome. No population-based comparative gene retention analysis has been performed to date. RESULTS Taking advantage of recently published genomic data in Triticeae, we dissected a divergent homogentisate phytyltransferase (HPT2) lineage caught in the middle stage of gene fixation following duplication. The presence/absence of HPT2 in barley (diploid), wild emmer (tetraploid), and bread wheat (hexaploid) pangenome lines appears to be associated with gene dosage constraint and environmental adaptation. Based on these observations, we adopted a phylogeny-based orthology inference approach and performed comparative gene retention analyses across barley, wild emmer, and bread wheat. This led to the identification of 326 HPT2-pattern-like genes at whole genome scale, representing a pool of gene duplicates in the middle stage of gene fixation. The majority of these HPT2-pattern-like genes were identified as small-scale duplicates, such as dispersed, tandem, and proximal duplications. Natural selection analyses showed that HPT2-pattern-like genes have experienced relaxed selection pressure, which is generally accompanied by partial positive selection and transcriptional divergence. Functional enrichment analyses showed that HPT2-pattern-like genes are over-represented in molecular-binding and defense response functions, supporting the potential role of environmental adaptation during gene retention. We also observed that gene duplicates from larger gene families are more likely to be lost, implying a gene dosage constraint effect.
Further comparative gene retention analysis in barley and bread wheat pangenome lines revealed combined effects of species-specific selection and gene dosage constraint. CONCLUSIONS Comparative gene retention analyses at the population level support gene dosage constraint, environmental adaptation, and species-specific selection as three factors that may affect gene retention following gene duplication. Our findings shed light on the evolutionary process leading to the retention of newly formed gene duplicates and will greatly improve our understanding of genome evolution via duplication.
Affiliation(s)
- Yong Jia
  - Western Crop Genetic Alliance, College of Science, Health, Engineering and Education, Murdoch University, 90 South Street, Murdoch, WA 6150, Australia
  - Western Australian State Agricultural Biotechnology Centre, Murdoch University, 90 South Street, Murdoch, WA 6150, Australia
- Mingrui Xu
  - College of Life and Environmental Sciences, Hangzhou Normal University, Hangzhou, 311121, China
- Haifei Hu
  - Western Crop Genetic Alliance, College of Science, Health, Engineering and Education, Murdoch University, 90 South Street, Murdoch, WA 6150, Australia
  - Western Australian State Agricultural Biotechnology Centre, Murdoch University, 90 South Street, Murdoch, WA 6150, Australia
- Brett Chapman
  - Western Crop Genetic Alliance, College of Science, Health, Engineering and Education, Murdoch University, 90 South Street, Murdoch, WA 6150, Australia
  - Western Australian State Agricultural Biotechnology Centre, Murdoch University, 90 South Street, Murdoch, WA 6150, Australia
- Calum Watt
  - Western Australian State Agricultural Biotechnology Centre, Murdoch University, 90 South Street, Murdoch, WA 6150, Australia
  - Intergrain Pty Ltd, Bibra Lake, WA 6163, Australia
- B. Buerte
  - Institute of Genetic and Regenerative Biology, Key Laboratory for Cell and Gene Engineering of Zhejiang Province, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
- Ning Han
  - Institute of Genetic and Regenerative Biology, Key Laboratory for Cell and Gene Engineering of Zhejiang Province, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
- Muyuan Zhu
  - Institute of Genetic and Regenerative Biology, Key Laboratory for Cell and Gene Engineering of Zhejiang Province, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
- Hongwu Bian
  - Institute of Genetic and Regenerative Biology, Key Laboratory for Cell and Gene Engineering of Zhejiang Province, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
- Chengdao Li
  - Western Crop Genetic Alliance, College of Science, Health, Engineering and Education, Murdoch University, 90 South Street, Murdoch, WA 6150, Australia
  - Western Australian State Agricultural Biotechnology Centre, Murdoch University, 90 South Street, Murdoch, WA 6150, Australia
  - Department of Primary Industries and Regional Development, 3-Baron-Hay Court, South Perth, WA 6151, Australia
- Zhanghui Zeng
  - College of Life and Environmental Sciences, Hangzhou Normal University, Hangzhou, 311121, China
  - Institute of Genetic and Regenerative Biology, Key Laboratory for Cell and Gene Engineering of Zhejiang Province, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
  - Zhejiang Provincial Key Laboratory for Genetic Improvement and Quality Control of Medicinal Plants, Hangzhou, 311121, China
7
Vershinin AV, Elisafenko EA, Evtushenko EV. Genetic Redundancy in Rye Shows in a Variety of Ways. Plants (Basel, Switzerland) 2023; 12:282. [PMID: 36678994 PMCID: PMC9862056 DOI: 10.3390/plants12020282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/29/2022] [Revised: 12/28/2022] [Accepted: 01/04/2023] [Indexed: 06/17/2023]
Abstract
Fifty years ago Susumu Ohno formulated the famous C-value paradox, which states that there is no correlation between the physical sizes of the genome, i.e., the amount of DNA, and the complexity of the organism, and highlighted the problem of genome redundancy. DNA that does not have a positive effect on the fitness of organisms has been characterized as "junk or selfish DNA". The controversial concept of junk DNA remains viable. Rye is a convenient subject for yet another test of the correctness and scientific significance of this concept. The genome of cultivated rye, Secale cereale L., is considered one of the largest among species of the tribe Triticeae and thus it tops the average angiosperm genome and the genomes of its closest evolutionary neighbors, such as species of barley, Hordeum (by approximately 30-35%), and diploid wheat species, Triticum (approximately 25%). The review provides an analysis of the structural organization of various regions of rye chromosomes with a description of the molecular mechanisms contributing to their size increase during evolution and the classes of DNA sequences involved in these processes. The history of the development of the concept of eukaryotic genome redundancy is traced and the current state of this problem is discussed.
Affiliation(s)
- Alexander V. Vershinin
  - Institute of Molecular and Cellular Biology, SB RAS, Acad. Lavrentiev Ave. 8/2, 630090 Novosibirsk, Russia
- Evgeny A. Elisafenko
  - Institute of Molecular and Cellular Biology, SB RAS, Acad. Lavrentiev Ave. 8/2, 630090 Novosibirsk, Russia
  - Institute of Cytology and Genetics, SB RAS, Acad. Lavrentiev Ave. 10, 630090 Novosibirsk, Russia
- Elena V. Evtushenko
  - Institute of Molecular and Cellular Biology, SB RAS, Acad. Lavrentiev Ave. 8/2, 630090 Novosibirsk, Russia
8
Baez LA, Tichá T, Hamann T. Cell wall integrity regulation across plant species. Plant Molecular Biology 2022; 109:483-504. [PMID: 35674976 PMCID: PMC9213367 DOI: 10.1007/s11103-022-01284-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Received: 09/22/2021] [Accepted: 05/05/2022] [Indexed: 05/05/2023]
Abstract
Plant cell walls are highly dynamic and chemically complex structures surrounding all plant cells. They provide structural support, protection from both abiotic and biotic stress as well as ensure containment of turgor. Recently evidence has accumulated that a dedicated mechanism exists in plants, which is monitoring the functional integrity of cell walls and initiates adaptive responses to maintain integrity in case it is impaired during growth, development or exposure to biotic and abiotic stress. The available evidence indicates that detection of impairment involves mechano-perception, while reactive oxygen species and phytohormone-based signaling processes play key roles in translating signals generated and regulating adaptive responses. More recently it has also become obvious that the mechanisms mediating cell wall integrity maintenance and pattern triggered immunity are interacting with each other to modulate the adaptive responses to biotic stress and cell wall integrity impairment. Here we will review initially our current knowledge regarding the mode of action of the maintenance mechanism, discuss mechanisms mediating responses to biotic stresses and highlight how both mechanisms may modulate adaptive responses. This first part will be focused on Arabidopsis thaliana since most of the relevant knowledge derives from this model organism. We will then proceed to provide perspective to what extent the relevant molecular mechanisms are conserved in other plant species and close by discussing current knowledge of the transcriptional machinery responsible for controlling the adaptive responses using selected examples.
Affiliation(s)
- Luis Alonso Baez
  - Institute for Biology, Faculty of Natural Sciences, Norwegian University of Science and Technology, 5 Høgskoleringen, 7491, Trondheim, Norway
- Tereza Tichá
  - Institute for Biology, Faculty of Natural Sciences, Norwegian University of Science and Technology, 5 Høgskoleringen, 7491, Trondheim, Norway
- Thorsten Hamann
  - Institute for Biology, Faculty of Natural Sciences, Norwegian University of Science and Technology, 5 Høgskoleringen, 7491, Trondheim, Norway
9
Dimos B, Emery M, Beavers K, MacKnight N, Brandt M, Demuth J, Mydlarz L. Adaptive Variation in Homolog Number Within Transcript Families Promotes Expression Divergence in Reef-Building Coral. Mol Ecol 2022; 31:2594-2610. [PMID: 35229964 DOI: 10.1111/mec.16414] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 05/05/2021] [Revised: 02/10/2022] [Accepted: 02/22/2022] [Indexed: 11/30/2022]
Abstract
Gene expression, especially in multi-species experiments, is used to gain insight into the genetic basis of how organisms adapt and respond to changing environments. However, evolutionary processes that can influence gene expression patterns between species, such as the presence of paralogs arising from gene duplication events, are rarely accounted for. Paralogous transcripts can alter the transcriptional output of a gene, and thus exclusion of these transcripts can obscure important biological differences between species. To address this issue, we investigated how differences in transcript family size are associated with divergent gene expression patterns in five species of Caribbean reef-building corals. We demonstrate that transcript families that are rapidly evolving in terms of size have increased levels of expression divergence. Additionally, these rapidly evolving transcript families are enriched for multiple biological processes, with genes involved in the coral innate immune system demonstrating pronounced variation in homolog number between species. Overall, this investigation demonstrates the importance of incorporating paralogous transcripts when comparing gene expression across species, as they influence both transcriptional output and the number of transcripts within biological processes. As this investigation was based on transcriptome assemblies, additional insights into the relationship between gene duplications and expression patterns will likely emerge once more genome assemblies are available for study.
Affiliation(s)
- Bradford Dimos
  - Department of Biology, University of Texas at Arlington, Arlington, TX, 76019, USA
- Madison Emery
  - Department of Biology, University of Texas at Arlington, Arlington, TX, 76019, USA
- Kelsey Beavers
  - Department of Biology, University of Texas at Arlington, Arlington, TX, 76019, USA
- Nicholas MacKnight
  - Department of Biology, University of Texas at Arlington, Arlington, TX, 76019, USA
- Marilyn Brandt
  - Center for Marine and Environmental Studies, University of the Virgin Islands, St. Thomas, US Virgin Islands, 00802, USA
- Jeffery Demuth
  - Department of Biology, University of Texas at Arlington, Arlington, TX, 76019, USA
- Laura Mydlarz
  - Department of Biology, University of Texas at Arlington, Arlington, TX, 76019, USA
10
Elisafenko EA, Evtushenko EV, Vershinin AV. The origin and evolution of a two-component system of paralogous genes encoding the centromeric histone CENH3 in cereals. BMC Plant Biology 2021; 21:541. [PMID: 34794377 PMCID: PMC8603533 DOI: 10.1186/s12870-021-03264-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 03/21/2021] [Accepted: 10/12/2021] [Indexed: 06/07/2023]
Abstract
BACKGROUND The cereal family Poaceae is one of the largest and most diverse angiosperm families. The central component of centromere specification and function is the centromere-specific histone H3 (CENH3). Some cereal species (maize, rice) have one copy of the gene encoding this protein, while some (wheat, barley, rye) have two. We applied a homology-based approach to sequenced cereal genomes, in order to finally trace the mutual evolution of the structure of the CENH3 genes and the nearby regions in various tribes. RESULTS We have established that the syntenic group or the CENH3 locus with the CENH3 gene and the boundaries defined by the CDPK2 and bZIP genes first appeared around 50 Mya in a common ancestor of the subfamilies Bambusoideae, Oryzoideae and Pooideae. This locus came to Pooideae with one copy of CENH3 in the most ancient tribes Nardeae and Meliceae. The βCENH3 gene as a part of the locus appeared in the tribes Stipeae and Brachypodieae around 35-40 Mya. The duplication was accompanied by changes in the exon-intron structure. Purifying selection acts mostly on αCENH3s, while βCENH3s form more heterogeneous structures, in which clade-specific amino acid motifs are present. In barley species, the βCENH3 gene assumed an inverted orientation relative to αCENH3 and the CDPK2 gene was substituted with LHCB-l. As the evolution and domestication of plant species went on, the locus was growing in size due to an increasing distance between αCENH3 and βCENH3 because of a massive insertion of the main LTR-containing retrotransposon superfamilies, gypsy and copia, without any evolutionary preference on either of them. A comparison of the molecular structure of the locus in the A, B and D subgenomes of the hexaploid wheat T. aestivum showed that invasion by mobile elements and concomitant rearrangements took place in an independent way even in evolutionarily close species. 
CONCLUSIONS The CENH3 duplication in cereals was accompanied by changes in the exon-intron structure of the βCENH3 paralog. The observed general tendency towards the expansion of the CENH3 locus reveals an amazing diversity of ways in which different species implement the scenario described in this paper.
Affiliation(s)
- Evgeny A Elisafenko
  - Institute of Cytology and Genetics, SB RAS, Novosibirsk, 630090, Russia
  - Institute of Molecular and Cellular Biology, SB RAS, Novosibirsk, 630090, Russia
- Elena V Evtushenko
  - Institute of Molecular and Cellular Biology, SB RAS, Novosibirsk, 630090, Russia
- Alexander V Vershinin
  - Institute of Molecular and Cellular Biology, SB RAS, Novosibirsk, 630090, Russia
  - Novosibirsk State University, Novosibirsk, 630090, Russia
11
Lineage-Specific Genes and Family Expansions in Dictyostelid Genomes Display Expression Bias and Evolutionary Diversification during Development. Genes (Basel) 2021; 12:genes12101628. [PMID: 34681022 PMCID: PMC8535579 DOI: 10.3390/genes12101628] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 09/11/2021] [Revised: 10/12/2021] [Accepted: 10/13/2021] [Indexed: 12/23/2022]
Abstract
Gene duplications generate new genes that can contribute to expression changes and the evolution of new functions. Genomes often consist of gene families that undergo expansions, some of which occur in specific lineages that reflect recent adaptive diversification. In this study, lineage-specific genes and gene family expansions were studied across five dictyostelid species to determine when and how they are expressed during multicellular development. Lineage-specific genes were found to be enriched among genes with biased expression (predominant expression in one developmental stage) in each species and at most developmental time points, suggesting independent functional innovations of new genes throughout the phylogeny. Biased duplicate genes had greater expression divergence than their orthologs and paralogs, consistent with subfunctionalization or neofunctionalization. Lineage-specific expansions in particular had biased genes with both molecular signals of positive selection and high expression, suggesting adaptive genetic and transcriptional diversification following duplication. Our results present insights into the potential contributions of lineage-specific genes and families in generating species-specific phenotypes during multicellular development in dictyostelids.
12
Evtushenko EV, Elisafenko EA, Gatzkaya SS, Schubert V, Houben A, Vershinin AV. Expression of Two Rye CENH3 Variants and Their Loading into Centromeres. Plants (Basel, Switzerland) 2021; 10:2043. [PMID: 34685852 PMCID: PMC8538535 DOI: 10.3390/plants10102043] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 08/31/2021] [Revised: 09/23/2021] [Accepted: 09/24/2021] [Indexed: 11/22/2022]
Abstract
Gene duplication and the preservation of both copies during evolution is an intriguing evolutionary phenomenon. Their preservation is related to the function they perform. The central component of centromere specification and function is the centromere-specific histone H3 (CENH3). Some cereal species (maize, rice) have one copy of the gene encoding this protein, while some (wheat, barley, rye) have two. Therefore, they represent a good model for a comparative study of the functional activity of the duplicated CENH3 genes and their protein products. We determined the organization of the CENH3 locus in rye (Secale cereale L.) and identified the functional motifs in the vicinity of the CENH3 genes. We compared the expression of these genes at different stages of plant development and the loading of their products, the CENH3 proteins, into nucleosomes during mitosis and meiosis. Using extended chromatin fibers, we revealed patterns of loading CENH3 proteins into polynucleosomal domains in centromeric chromatin. Our results indicate no sign of neofunctionalization, subfunctionalization or specialization in the gene copies. The influence of negative selection on the coding part of the genes led them to preserve their conserved function. The advantage of having two functional genes appears as the gene-dosage effect.
Affiliation(s)
- Elena V. Evtushenko
- Institute of Molecular and Cellular Biology, SB RAS, Acad. Lavrentiev Ave. 8/2, 630090 Novosibirsk, Russia; (E.V.E.); (E.A.E.); (S.S.G.)
- Evgeny A. Elisafenko
- Institute of Molecular and Cellular Biology, SB RAS, Acad. Lavrentiev Ave. 8/2, 630090 Novosibirsk, Russia; (E.V.E.); (E.A.E.); (S.S.G.)
- Institute of Cytology and Genetics, SB RAS, Acad. Lavrentiev Ave. 10, 630090 Novosibirsk, Russia
- Sima S. Gatzkaya
- Institute of Molecular and Cellular Biology, SB RAS, Acad. Lavrentiev Ave. 8/2, 630090 Novosibirsk, Russia; (E.V.E.); (E.A.E.); (S.S.G.)
- Veit Schubert
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstr. 3, 06466 Seeland, Germany; (V.S.); (A.H.)
- Andreas Houben
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstr. 3, 06466 Seeland, Germany; (V.S.); (A.H.)
- Alexander V. Vershinin
- Institute of Molecular and Cellular Biology, SB RAS, Acad. Lavrentiev Ave. 8/2, 630090 Novosibirsk, Russia; (E.V.E.); (E.A.E.); (S.S.G.)
|
13
|
Assis R. No Expression Divergence despite Transcriptional Interference between Nested Protein-Coding Genes in Mammals. Genes (Basel) 2021; 12:genes12091381. [PMID: 34573363 PMCID: PMC8467205 DOI: 10.3390/genes12091381] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 07/16/2021] [Revised: 08/23/2021] [Accepted: 08/24/2021] [Indexed: 01/05/2023] Open
Abstract
Nested protein-coding genes accumulated throughout metazoan evolution, with early analyses of human and Drosophila microarray data indicating that this phenomenon was simply due to the presence of large introns. However, a recent study employing RNA-seq data uncovered evidence of transcriptional interference driving rapid expression divergence between Drosophila nested genes, illustrating that accurate expression estimation of overlapping genes can enhance detection of their relationships. Hence, here I apply an analogous approach to strand-specific RNA-seq data from human and mouse to revisit the role of transcriptional interference in the evolution of mammalian nested genes. A genomic survey reveals that whereas mammalian nested genes indeed accrued over evolutionary time, they are retained at lower frequencies than in Drosophila. Though several properties of mammalian nested genes align with observations in Drosophila and with expectations under transcriptional interference, contrary to both, their expression divergence is not statistically different from that between unnested genes, and also does not increase after nesting. Together, these results support the hypothesis that lower selection efficiencies limit rates of gene expression evolution in mammals, leading to their reliance on immediate eradication of deleterious nested genes to avoid transcriptional interference.
Affiliation(s)
- Raquel Assis
- Department of Electrical Engineering and Computer Science, Institute for Human Health and Disease Intervention, Florida Atlantic University, Boca Raton, FL 33431, USA
|
14
|
Chain FJJ, Assis R. BLAST from the Past: Impacts of Evolving Approaches on Studies of Evolution by Gene Duplication. Genome Biol Evol 2021; 13:evab149. [PMID: 34164667 PMCID: PMC8325566 DOI: 10.1093/gbe/evab149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 06/21/2021] [Indexed: 11/14/2022] Open
Abstract
In 1970, Susumu Ohno hypothesized that gene duplication was a major reservoir of adaptive innovation. However, it was not until over two decades later that DNA sequencing studies uncovered the ubiquity of gene duplication across all domains of life, highlighting its global importance in the evolution of phenotypic complexity and species diversification. Today, it seems that there are no limits to the study of evolution by gene duplication, as it has rapidly coevolved with numerous experimental and computational advances in genomics. In this perspective, we examine word stem usage in PubMed abstracts to infer how evolving discoveries and technologies have shaped the landscape of studying evolution by gene duplication, leading to a more refined understanding of its role in the emergence of novel phenotypes.
Affiliation(s)
- Frédéric J J Chain
- Department of Biological Sciences, University of Massachusetts Lowell, Massachusetts, USA
- Raquel Assis
- Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, Florida, USA
- Institute for Human Health and Disease Intervention, Florida Atlantic University, Boca Raton, Florida, USA
|
15
|
DeGiorgio M, Assis R. Learning Retention Mechanisms and Evolutionary Parameters of Duplicate Genes from Their Expression Data. Mol Biol Evol 2021; 38:1209-1224. [PMID: 33045078 PMCID: PMC7947822 DOI: 10.1093/molbev/msaa267] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Indexed: 12/12/2022] Open
Abstract
Learning about the roles that duplicate genes play in the origins of novel phenotypes requires an understanding of how their functions evolve. A previous method for achieving this goal, CDROM, employs gene expression distances as proxies for functional divergence and then classifies the evolutionary mechanisms retaining duplicate genes from comparisons of these distances in a decision tree framework. However, CDROM does not account for stochastic shifts in gene expression or leverage advances in contemporary statistical learning for performing classification, nor is it capable of predicting the parameters driving duplicate gene evolution. Thus, here we develop CLOUD, a multi-layer neural network built on a model of gene expression evolution that can both classify duplicate gene retention mechanisms and predict their underlying evolutionary parameters. We show that not only is the CLOUD classifier substantially more powerful and accurate than CDROM, but that it also yields accurate parameter predictions, enabling a better understanding of the specific forces driving the evolution and long-term retention of duplicate genes. Further, application of the CLOUD classifier and predictor to empirical data from Drosophila recapitulates many previous findings about gene duplication in this lineage, showing that new functions often emerge rapidly and asymmetrically in younger duplicate gene copies, and that functional divergence is driven by strong natural selection. Hence, CLOUD represents a major advancement in classifying retention mechanisms and predicting evolutionary parameters of duplicate genes, thereby highlighting the utility of incorporating sophisticated statistical learning techniques to address long-standing questions about evolution after gene duplication.
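The decision-tree idea that the abstract attributes to CDROM can be illustrated with a minimal sketch. This is not the CDROM or CLOUD implementation. The rules below are an assumption about how such a distance-based tree is commonly set up: each duplicate copy (parent, child) and their combined expression is compared with the ancestral single-copy profile, using Euclidean distances between relative expression profiles. The function names and the `cutoff` value are hypothetical.

```python
import math


def relative(profile):
    """Scale an expression profile so its values sum to 1 (all zeros if unexpressed)."""
    total = sum(profile)
    if total <= 0:
        return [0.0] * len(profile)
    return [x / total for x in profile]


def distance(a, b):
    """Euclidean distance between two relative expression profiles."""
    ra, rb = relative(a), relative(b)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(ra, rb)))


def classify(parent, child, ancestral, cutoff):
    """Assign a retention class from three distances (assumed rule set, not the published code).

    parent, child, ancestral: expression levels across the same tissues.
    cutoff: hypothetical threshold above which a profile counts as diverged.
    """
    combined = [p + c for p, c in zip(parent, child)]
    parent_diverged = distance(parent, ancestral) > cutoff
    child_diverged = distance(child, ancestral) > cutoff
    combined_diverged = distance(combined, ancestral) > cutoff

    if not parent_diverged and not child_diverged:
        return "conservation"
    if parent_diverged and not child_diverged:
        return "neofunctionalization (parent)"
    if child_diverged and not parent_diverged:
        return "neofunctionalization (child)"
    # Both copies diverged: check whether together they still match the ancestor.
    if not combined_diverged:
        return "subfunctionalization"
    return "specialization"


if __name__ == "__main__":
    # Two tissues; each copy keeps one half of the ancestral expression pattern.
    print(classify([1, 0], [0, 1], [1, 1], cutoff=0.1))
```

A hard cutoff like this treats every distance as exact, which is the limitation the abstract raises when it notes that CDROM does not account for stochastic shifts in gene expression.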
Affiliation(s)
- Michael DeGiorgio
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431; Institute for Human Health and Disease Intervention, Florida Atlantic University, Boca Raton, FL 33431
- Raquel Assis
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431; Institute for Human Health and Disease Intervention, Florida Atlantic University, Boca Raton, FL 33431
|
16
|
A co-opted steroid synthesis gene, maintained in sorghum but not maize, is associated with a divergence in leaf wax chemistry. Proc Natl Acad Sci U S A 2021; 118:2022982118. [PMID: 33723068 PMCID: PMC8000359 DOI: 10.1073/pnas.2022982118] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Indexed: 11/25/2022] Open
Abstract
Virtually all above-ground plant surfaces, such as leaf and stem exteriors, are covered in a cuticle: a wax-infused polyester. This waxy biocomposite is the largest interface between Earth’s biosphere and atmosphere. Its chemical composition is not only highly tuned to mediate nonstomatal water loss, but it also self-assembles to produce superhydrophobic surfaces, protects against UV radiation, and contains bioactive compounds that help resist microbial attack. Developing fundamental knowledge of waxy biocomposites, particularly those on crop species, is a prerequisite for an understanding of their structure–function relationships. Here, we uncover a likely genetic basis for the presence and absence, respectively, of triterpenoids in the leaf waxes of sorghum and maize—compounds previously associated with creating heat-tolerant cuticular water barriers. Virtually all land plants are coated in a cuticle, a waxy polyester that prevents nonstomatal water loss and is important for heat and drought tolerance. Here, we describe a likely genetic basis for a divergence in cuticular wax chemistry between Sorghum bicolor, a drought tolerant crop widely cultivated in hot climates, and its close relative Zea mays (maize). Combining chemical analyses, heterologous expression, and comparative genomics, we reveal that: 1) sorghum and maize leaf waxes are similar at the juvenile stage but, after the juvenile-to-adult transition, sorghum leaf waxes are rich in triterpenoids that are absent from maize; 2) biosynthesis of the majority of sorghum leaf triterpenoids is mediated by a gene that maize and sorghum both inherited from a common ancestor but that is only functionally maintained in sorghum; and 3) sorghum leaf triterpenoids accumulate in a spatial pattern that was previously shown to strengthen the cuticle and decrease water loss at high temperatures. 
These findings uncover the possibility for resurrection of a cuticular triterpenoid-synthesizing gene in maize that could create a more heat-tolerant water barrier on the plant’s leaf surfaces. They also provide a fundamental understanding of sorghum leaf waxes that will inform efforts to divert surface carbon to intracellular storage for bioenergy and bioproduct innovations.
|
17
|
Huang KM, Chain FJJ. Copy number variations and young duplicate genes have high methylation levels in sticklebacks. Evolution 2021; 75:706-718. [PMID: 33527399 DOI: 10.1111/evo.14184] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 02/24/2020] [Revised: 01/19/2021] [Accepted: 01/25/2021] [Indexed: 12/11/2022]
Abstract
Gene duplication is an important driver of genomic diversity that can promote adaptive evolution. However, like most mutations, a newly duplicated gene is often deleterious and removed from the genome by drift or natural selection. The early molecular changes that occur soon after duplication therefore may influence the long-term survival of gene duplicates, but relatively little empirical data exist on the events near the onset of duplication before mutations have time to accumulate. In this study, we contrast gene expression and DNA methylation levels of duplicate genes in the threespine stickleback, Gasterosteus aculeatus, including recently emerged duplications that segregate as copy number variations (CNVs). We find that younger duplicate genes have higher levels of promoter methylation than older genes, and that gene CNVs have higher promoter methylation than non-CNVs. These results suggest preferential duplication of highly methylated genes or rapid methylation changes soon after duplication. We also find a negative association between methylation and expression, providing a putative role for methylation in suppressing transcription that compensates for increases in gene copy numbers and promoting paralog retention. We propose that methylation contributes to the longevity of young duplicate genes, extending the window of opportunity for functional divergence via mutation.
Affiliation(s)
- Katherine M Huang
- Department of Biological Sciences, University of Massachusetts Lowell, Lowell, Massachusetts, 01854; Comparative Media Studies/Writing, Massachusetts Institute of Technology, Cambridge, Massachusetts, 02139
- Frédéric J J Chain
- Department of Biological Sciences, University of Massachusetts Lowell, Lowell, Massachusetts, 01854
|
18
|
Chen L, Wu F, Zhang J. NAC and MYB Families and Lignin Biosynthesis-Related Members Identification and Expression Analysis in Melilotus albus. PLANTS 2021; 10:plants10020303. [PMID: 33562564 PMCID: PMC7914948 DOI: 10.3390/plants10020303] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 09/21/2020] [Revised: 11/28/2020] [Accepted: 12/11/2020] [Indexed: 11/26/2022]
Abstract
Melilotus albus is an annual or biennial legume species that adapts to extreme environments via its high stress tolerance. NAC and MYB transcription factors (TFs) are involved in the regulation of lignin biosynthesis, which has not been studied in M. albus. A total of 101 MaNAC and 299 MaMYB members were identified based on the M. albus genome. Chromosome distribution and synteny analysis indicated that some genes underwent tandem duplication. Ka/Ks analysis suggested that MaNACs and MaMYBs underwent strong purifying selection. Stress-, hormone- and development-related cis-elements and MYB-binding sites were identified in the promoter regions of MaNACs and MaMYBs. Five MaNACs, two MaMYBs and ten lignin biosynthesis genes were identified as presenting coexpression relationships according to weighted gene coexpression network analysis (WGCNA). Eleven candidate MaNAC genes and thirteen candidate MaMYB genes related to lignin biosynthesis were identified, and a network comprising these genes was constructed, which further confirmed the MaNAC and MaMYB relationship. These candidate genes had conserved gene structures and motifs and were highly expressed in the stems and roots, and qRT-PCR further verified the expression patterns. Overall, our results provide a reference for determining the precise role of NAC and MYB genes in M. albus and may facilitate efforts to breed low-lignin-content forage cultivars in the future.
|
19
|
Assis R. Out of the testis, into the ovary: biased outcomes of gene duplication and deletion in Drosophila. Evolution 2020; 73:1850-1862. [PMID: 31418820 DOI: 10.1111/evo.13820] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 02/25/2019] [Revised: 06/28/2019] [Accepted: 07/03/2019] [Indexed: 12/30/2022]
Abstract
Gene turnover is a key source of adaptive variation. Yet most evolutionary studies have focused on gene duplication, dismissing gene deletion as a mechanism that simply eradicates redundancy. Here, I use genome-scale sequence and multi-tissue expression data from Drosophila melanogaster and Drosophila pseudoobscura to simultaneously assess the evolutionary outcomes of gene duplication and deletion in Drosophila. I find that gene duplication is more frequent than gene deletion in both species, indicating that it may play a more important role in Drosophila evolution. However, examination of several genic properties reveals that genes likely possess distinct functions after duplication that diverge further before deletion, suggesting that loss of redundancy cannot explain a majority of gene deletion events in Drosophila. Moreover, in addition to providing support for the well-known "out of the testis" origin of young duplicate genes, analyses of gene expression profiles uncover a preferential bias against deletion of old ovary-expressed genes. Therefore, I propose a novel "into the ovary" hypothesis for gene deletion in Drosophila, in which gene deletion may promote adaptation by salvaging genes that contribute to the evolution of female reproductive phenotypes. Under this combined "out of the testis, into the ovary" evolutionary model, gene duplication and deletion work in concert to generate and maintain a balanced repertoire of genes that promote sex-specific adaptation in Drosophila.
Affiliation(s)
- Raquel Assis
- Department of Biology, Pennsylvania State University, University Park, Pennsylvania, 16801
|