1
|
Song H, Guo Z, Zhang X, Sui J. De novo genes in Arachis hypogaea cv. Tifrunner: systematic identification, molecular evolution, and potential contributions to cultivated peanut. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 111:1081-1095. [PMID: 35748398 DOI: 10.1111/tpj.15875] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 06/15/2022] [Accepted: 06/21/2022] [Indexed: 06/15/2023]
Abstract
De novo genes are derived from non-coding sequences, and they can play essential roles in organisms. Cultivated peanut (Arachis hypogaea) is a major oil and protein crop derived from a cross between Arachis duranensis and Arachis ipaensis. However, few de novo genes have been documented in Arachis. Here, we identified 381 de novo genes in A. hypogaea cv. Tifrunner based on comparison with five closely related Arachis species. There are distinct differences in gene expression patterns and gene structures between conserved and de novo genes. The identified de novo genes originated from ancestral sequence regions associated with metabolic and biosynthetic processes, and they were subsequently integrated into existing regulatory networks. De novo paralogs and homoeologs were identified in A. hypogaea cv. Tifrunner. De novo paralogs and homoeologs with conserved expression have mismatching cis-acting elements under normal growth conditions. De novo genes potentially have pluripotent functions in responses to biotic stresses as well as in growth and development based on quantitative trait locus data. This work provides a foundation for future research examining gene birth processes and gene function in Arachis and related taxa.
Collapse
Affiliation(s)
- Hui Song
- Grassland Agri-husbandry Research Center, College of Grassland Science, Qingdao Agricultural University, Qingdao, China
| | - Zhonglong Guo
- State Key Laboratory of Protein and Plant Gene Research, Peking-Tsinghua Center for Life Sciences, School of Life Sciences and School of Advanced Agricultural Sciences, Peking University, Beijing, China
| | - Xiaojun Zhang
- College of Agronomy, Qingdao Agricultural University, Qingdao, China
| | - Jiongming Sui
- College of Agronomy, Qingdao Agricultural University, Qingdao, China
| |
Collapse
|
2
|
Jiang L, Fan T, Li X, Xu J. Functional Heterogeneity of the Young and Old Duplicate Genes in Tung Tree ( Vernicia fordii). FRONTIERS IN PLANT SCIENCE 2022; 13:902649. [PMID: 35800614 PMCID: PMC9253867 DOI: 10.3389/fpls.2022.902649] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Accepted: 05/12/2022] [Indexed: 06/15/2023]
Abstract
Genes are subject to birth and death during the long evolutionary period. Here, young and old duplicate genes were identified in Vernicia fordii. We performed integrative analyses, including expression pattern, gene complexity, evolution, and functional divergence between young and old duplicate genes. Compared with young genes, old genes have higher values of Ka and Ks, lower Ka/Ks values, and lower average intrinsic structural disorder (ISD) values. Gene ontology and RNA-seq suggested that most young and old duplicate genes contained asymmetric functions. Only old duplicate genes are likely to participate in response to Fusarium wilt infection and exhibit divergent expression patterns. Our data suggest that young genes differ from older genes not only by evolutionary properties but also by their function and structure. These results highlighted the characteristics and diversification of the young and old genes in V. fordii and provided a systematic analysis of these genes in the V. fordii genome.
Collapse
Affiliation(s)
- Lan Jiang
- Key Laboratory of Non-coding RNA Transformation Research of Anhui Higher Education Institution, Yijishan Hospital of Wannan Medical College, Wuhu, China
- Central Laboratory, Yijishan Hospital of Wannan Medical College, Wuhu, China
- Clinical Research Center for Critical Respiratory Medicine of Anhui Province, Wuhu, China
| | - Tingting Fan
- The Laboratory of Forestry Genetics, Central South University of Forestry and Technology, Changsha, China
| | - Xiaoxu Li
- Technology Center, China Tobacco Hunan Industrial Co., Ltd., Changsha, China
| | - Jun Xu
- Hunan Institute of Microbiology, Changsha, China
| |
Collapse
|
3
|
Kawachi T, Masuda A, Yamashita Y, Takeda JI, Ohkawara B, Ito M, Ohno K. Regulated splicing of large exons is linked to phase-separation of vertebrate transcription factors. EMBO J 2021; 40:e107485. [PMID: 34605568 DOI: 10.15252/embj.2020107485] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 09/06/2021] [Accepted: 09/14/2021] [Indexed: 12/30/2022] Open
Abstract
Although large exons cannot be readily recognized by the spliceosome, many are evolutionarily conserved and constitutively spliced for inclusion in the processed transcript. Furthermore, whether large exons may be enriched in a certain subset of proteins, or mediate specific functions, has remained unclear. Here, we identify a set of nearly 3,000 SRSF3-dependent large constitutive exons (S3-LCEs) in human and mouse cells. These exons are enriched for cytidine-rich sequence motifs, which bind and recruit the splicing factors hnRNP K and SRSF3. We find that hnRNP K suppresses S3-LCE splicing, an effect that is mitigated by SRSF3 to thus achieve constitutive splicing of S3-LCEs. S3-LCEs are enriched in genes for components of transcription machineries, including mediator and BAF complexes, and frequently contain intrinsically disordered regions (IDRs). In a subset of analyzed S3-LCE-containing transcription factors, SRSF3 depletion leads to deletion of the IDRs due to S3-LCE exon skipping, thereby disrupting phase-separated assemblies of these factors. Cytidine enrichment in large exons introduces proline/serine codon bias in intrinsically disordered regions and appears to have been evolutionarily acquired in vertebrates. We propose that layered splicing regulation by hnRNP K and SRSF3 ensures proper phase-separation of these S3-LCE-containing transcription factors in vertebrates.
Collapse
Affiliation(s)
- Toshihiko Kawachi
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Akio Masuda
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Yoshihiro Yamashita
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Jun-Ichi Takeda
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Bisei Ohkawara
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Mikako Ito
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Kinji Ohno
- Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan
| |
Collapse
|
4
|
Song H, Guo Z, Hu X, Qian L, Miao F, Zhang X, Chen J. Evolutionary balance between LRR domain loss and young NBS-LRR genes production governs disease resistance in Arachis hypogaea cv. Tifrunner. BMC Genomics 2019; 20:844. [PMID: 31722670 PMCID: PMC6852974 DOI: 10.1186/s12864-019-6212-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2019] [Accepted: 10/22/2019] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Cultivated peanut (Arachis hypogaea L.) is an important oil and protein crop, but it has low disease resistance; therefore, it is important to reveal the number, sequence features, function, and evolution of genes that confer resistance. Nucleotide-binding site-leucine-rich repeats (NBS-LRRs) are resistance genes that are involved in response to various pathogens. RESULTS We identified 713 full-length NBS-LRRs in A. hypogaea cv. Tifrunner. Genetic exchange events occurred on NBS-LRRs in A. hypogaea cv. Tifrunner, which were detected in the same subgenomes and also found in different subgenomes. Relaxed selection acted on NBS-LRR proteins and LRR domains in A. hypogaea cv. Tifrunner. Using quantitative trait loci (QTL), we found that NBS-LRRs were involved in response to late leaf spot, tomato spotted wilt virus, and bacterial wilt in A. duranensis (2 NBS-LRRs), A. ipaensis (39 NBS-LRRs), and A. hypogaea cv. Tifrunner (113 NBS-LRRs). In A. hypogaea cv. Tifrunner, 113 NBS-LRRs were classified as 75 young and 38 old NBS-LRRs, indicating that young NBS-LRRs were involved in response to disease after tetraploidization. However, compared to A. duranensis and A. ipaensis, fewer LRR domains were found in A. hypogaea cv. Tifrunner NBS-LRR proteins, partly explaining the lower disease resistance of the cultivated peanut. CONCLUSIONS Although relaxed selection acted on NBS-LRR proteins and LRR domains, LRR domains were preferentially lost in A. hypogaea cv. Tifrunner compared to A. duranensis and A. ipaensis. The QTL results suggested that young NBS-LRRs were important for resistance against diseases in A. hypogaea cv. Tifrunner. Our results provid insight into the greater susceptibility of A. hypogaea cv. Tifrunner to disease compared to A. duranensis and A. ipaensis.
Collapse
Affiliation(s)
- Hui Song
- Grassland Agri-husbandry Research Center, College of Grassland Science, Qingdao Agricultural University, Qingdao, China.
| | - Zhonglong Guo
- State Key Laboratory of Protein and Plant Gene Research, Peking-Tsinghua Center for Life Sciences, School of Life Sciences and School of Advanced Agricultural Sciences, Peking University, Beijing, China
| | - Xiaohui Hu
- Shandong Peanut Research Institute, Qingdao, China
| | - Lang Qian
- Dalian Academy of Agricultural Sciences, Dalian, China
| | - Fuhong Miao
- Grassland Agri-husbandry Research Center, College of Grassland Science, Qingdao Agricultural University, Qingdao, China
| | - Xiaojun Zhang
- College of Agronomy, Qingdao Agricultural University, Qingdao, China
| | - Jing Chen
- Shandong Peanut Research Institute, Qingdao, China.
| |
Collapse
|
5
|
Song H, Sun J, Yang G. The characteristic of Arachis duranensis-specific genes and their potential function. Gene 2019; 705:60-66. [PMID: 31009681 DOI: 10.1016/j.gene.2019.04.052] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Revised: 03/12/2019] [Accepted: 04/18/2019] [Indexed: 11/17/2022]
Abstract
Arachis species produce flowers aerially, and then grow into the ground, where they develop into fruits; a feature that is unique to Arachis species. We hypothesized that Arachis species evolved genes specifically involved in the control of aerial flowers and the formation of underground fruits. Arachis duranensis is more resistant to biotic and abiotic stressors. Here, we compared different legume species and identified Arachis duranensis-specific genes. We analyzed gene expression patterns, base substitution patterns and sequence features between genes that are conserved across legume plants and A. duranensis-specific genes. Furthermore, we tested the role of A. duranensis-specific genes during seed development, response to nematode Meloidogyne arenaria infection and drought stress. We found that A. duranensis-specific genes had characteristics of young genes. The gene expression level and breadth were lower in the A. duranensis-specific genes compared to conserved genes. The A. duranensis-specific genes had higher codon usage bias than conserved genes, and the polypeptide length and GC content at the three codon sites were lower compared to conserved genes. Of the A. duranensis-specific genes, single-copy and duplicated genes had different features. The RNA-seq result showed A. duranensis-specific genes were involved in seed development, as well as response to nematode infection and drought stress. In addition, we detected asymmetric functions in A. duranensis-specific duplicated genes in response to nematode infection and drought stress.
Collapse
Affiliation(s)
- Hui Song
- Grassland Agri-husbandry Research Center, Qingdao Agricultural University, Qingdao, China.
| | - Juan Sun
- Grassland Agri-husbandry Research Center, Qingdao Agricultural University, Qingdao, China
| | - Guofeng Yang
- Grassland Agri-husbandry Research Center, Qingdao Agricultural University, Qingdao, China.
| |
Collapse
|
6
|
Song H, Sun J, Yang G. Old and young duplicate genes reveal different responses to environmental changes in Arachis duranensis. Mol Genet Genomics 2019; 294:1199-1209. [PMID: 31076861 DOI: 10.1007/s00438-019-01574-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2019] [Accepted: 05/03/2019] [Indexed: 11/24/2022]
Abstract
Old and young duplicate genes have been reported in some organisms. However, little is known about the properties of old and young duplicate genes in Arachis. Here, we have identified old and young duplicate genes in Arachis duranensis, and analyzed the evolution, gene complexity, gene expression pattern, and functional divergence between old and young duplicate genes. Our results showed different evolutionary, gene complexity and gene expression patterns, as well as differing correlations between old and young duplicate genes. Gene ontology results showed that old duplicate genes play a crucial role in lipid and amino acid biosynthesis and the oxidation-reduction process and that young duplicate genes are preferentially involved in photosynthesis and response to biotic stimulus. Transcriptome data sets revealed that most old and young duplicate genes had asymmetric function, and only a few duplicate genes exhibited symmetric function under drought and nematode stress. We found that old duplicate genes are preferentially involved in lipid and amino acid metabolism and response to abiotic stress, while young duplicate genes are likely to participate in photosynthesis and response to biotic stress. This work provides a better understanding of the evolution and functional divergence of old and young duplicate genes in A. duranensis.
Collapse
Affiliation(s)
- Hui Song
- Grassland Agri-husbandry Research Center, Qingdao Agricultural University, Qingdao, China.
| | - Juan Sun
- Grassland Agri-husbandry Research Center, Qingdao Agricultural University, Qingdao, China
| | - Guofeng Yang
- Grassland Agri-husbandry Research Center, Qingdao Agricultural University, Qingdao, China.
| |
Collapse
|
7
|
Foy SG, Wilson BA, Bertram J, Cordes MHJ, Masel J. A Shift in Aggregation Avoidance Strategy Marks a Long-Term Direction to Protein Evolution. Genetics 2019; 211:1345-1355. [PMID: 30692195 PMCID: PMC6456324 DOI: 10.1534/genetics.118.301719] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2018] [Accepted: 01/25/2019] [Indexed: 01/06/2023] Open
Abstract
To detect a direction to evolution, without the pitfalls of reconstructing ancestral states, we need to compare "more evolved" to "less evolved" entities. But because all extant species have the same common ancestor, none are chronologically more evolved than any other. However, different gene families were born at different times, allowing us to compare young protein-coding genes to those that are older and hence have been evolving for longer. To be retained during evolution, a protein must not only have a function, but must also avoid toxic dysfunction such as protein aggregation. There is conflict between the two requirements: hydrophobic amino acids form the cores of protein folds, but also promote aggregation. Young genes avoid strongly hydrophobic amino acids, which is presumably the simplest solution to the aggregation problem. Here we show that young genes' few hydrophobic residues are clustered near one another along the primary sequence, presumably to assist folding. The higher aggregation risk created by the higher hydrophobicity of older genes is counteracted by more subtle effects in the ordering of the amino acids, including a reduction in the clustering of hydrophobic residues until they eventually become more interspersed than if distributed randomly. This interspersion has previously been reported to be a general property of proteins, but here we find that it is restricted to old genes. Quantitatively, the index of dispersion delineates a gradual trend, i.e., a decrease in the clustering of hydrophobic amino acids over billions of years.
Collapse
Affiliation(s)
- Scott G Foy
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| | - Benjamin A Wilson
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| | - Jason Bertram
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| | - Matthew H J Cordes
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona 85721
| | - Joanna Masel
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721
| |
Collapse
|