1
|
Wang Z, Xia A, Wang Q, Cui Z, Lu M, Ye Y, Wang Y, He Y. Natural polymorphisms in ZMET2 encoding a DNA methyltransferase modulate the number of husk layers in maize. PLANT PHYSIOLOGY 2024; 195:2129-2142. [PMID: 38431291 PMCID: PMC11213254 DOI: 10.1093/plphys/kiae113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 01/30/2024] [Accepted: 02/08/2024] [Indexed: 03/05/2024]
Abstract
DNA methylation affects agronomic traits and the environmental adaptability of crops, but the natural polymorphisms in DNA methylation-related genes and their contributions to phenotypic variation in maize (Zea mays) remain elusive. Here, we show that a polymorphic 10-bp insertion/deletion variant in the 3'UTR of Zea methyltransferase2 (ZMET2) alters its transcript level and accounts for variation in the number of maize husk layers. ZMET2 encodes a chromomethylase and is required for maintaining genome-wide DNA methylation in the CHG sequence context. Disruption of ZMET2 increased the number of husk layers and resulted in thousands of differentially methylated regions, a proportion of which were also distinguishable in natural ZMET2 alleles. Population genetic analyses indicated that ZMET2 was a target of selection and might play a role in the spread of maize from tropical to temperate regions. Our results provide important insights into the natural variation of ZMET2 that confers both global and locus-specific effects on DNA methylation, which contribute to phenotypic diversity in maize.
Collapse
Affiliation(s)
- Zi Wang
- State Key Laboratory of Maize Bio-breeding, National Maize Improvement Center of China, China Agricultural University, Beijing 100094, China
| | - Aiai Xia
- State Key Laboratory of Maize Bio-breeding, National Maize Improvement Center of China, China Agricultural University, Beijing 100094, China
| | - Qi Wang
- State Key Laboratory of Maize Bio-breeding, National Maize Improvement Center of China, China Agricultural University, Beijing 100094, China
| | - Zhenhai Cui
- Shenyang Key Laboratory of Maize Genomic Selection Breeding, Shenyang Agricultural University, Shenyang 110866, China
| | - Ming Lu
- Maize Research Institute, Jilin Academy of Agricultural Sciences, Gongzhuling 136100, China
| | - Yusheng Ye
- Maize Research Institute, Liaoning Academy of Agricultural Sciences, Shenyang 110065, China
| | - Yanbo Wang
- Maize Research Institute, Liaoning Academy of Agricultural Sciences, Shenyang 110065, China
| | - Yan He
- State Key Laboratory of Maize Bio-breeding, National Maize Improvement Center of China, China Agricultural University, Beijing 100094, China
| |
Collapse
|
2
|
Singh K, Huff M, Liu J, Park JW, Rickman T, Keremane M, Krueger RR, Kunta M, Roose ML, Dardick C, Staton M, Ramadugu C. Chromosome-Scale, De Novo, Phased Genome Assemblies of Three Australian Limes: Citrus australasica, C. inodora, and C. glauca. PLANTS (BASEL, SWITZERLAND) 2024; 13:1460. [PMID: 38891269 PMCID: PMC11174732 DOI: 10.3390/plants13111460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Revised: 05/14/2024] [Accepted: 05/20/2024] [Indexed: 06/21/2024]
Abstract
Huanglongbing (HLB) is a severe citrus disease worldwide. Wild Australian limes like Citrus australasica, C. inodora, and C. glauca possess beneficial HLB resistance traits. Individual trees of the three taxa were extensively used in a breeding program for over a decade to introgress resistance traits into commercial-quality citrus germplasm. We generated high-quality, phased, de novo genome assemblies of the three Australian limes using PacBio long-read sequencing. The genome assembly sizes of the primary and alternate haplotypes were determined for C. australasica (337 Mb/335 Mb), C. inodora (304 Mb/299 Mb), and C. glauca (376 Mb/379 Mb). The nine chromosome-scale scaffolds included 86-91% of the genome sequences generated. The integrity and completeness of the assembled genomes were estimated to be at 97.2-98.8%. Gene annotation studies identified 25,461 genes in C. australasica, 27,665 in C. inodora, and 30,067 in C. glauca. Genes belonging to 118 orthogroups were specific to Australian lime genomes compared to other citrus genomes analyzed. Significantly fewer canonical resistance (R) genes were found in C. inodora and C. glauca (319 and 449, respectively) compared to C. australasica (576), C. clementina (579), and C. sinensis (651). Similar patterns were observed for other gene families associated with potential HLB resistance, including Phloem protein 2 (PP2) and Callose synthase (CalS) genes predicted in the Australian lime genomes. The genomic information on Australian limes developed in the present study will help understand the genetic basis of HLB resistance.
Collapse
Affiliation(s)
- Khushwant Singh
- Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA; (K.S.); (M.L.R.)
| | - Matthew Huff
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN 37996, USA; (M.H.); (T.R.); (M.S.)
| | - Jianyang Liu
- Innovative Fruit Production, Improvement, and Protection, Appalachian Fruit Research Station, USDA-ARS, Kearneysville, WV 25430, USA; (J.L.); (C.D.)
| | - Jong-Won Park
- Citrus Center, Texas A&M University-Kingsville, Weslaco, TX 78599, USA; (J.-W.P.); (M.K.)
| | - Tara Rickman
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN 37996, USA; (M.H.); (T.R.); (M.S.)
| | - Manjunath Keremane
- National Clonal Germplasm Repository for Citrus and Dates, USDA-ARS, Riverside, CA 92507, USA; (M.K.); (R.R.K.)
| | - Robert R. Krueger
- National Clonal Germplasm Repository for Citrus and Dates, USDA-ARS, Riverside, CA 92507, USA; (M.K.); (R.R.K.)
| | - Madhurababu Kunta
- Citrus Center, Texas A&M University-Kingsville, Weslaco, TX 78599, USA; (J.-W.P.); (M.K.)
| | - Mikeal L. Roose
- Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA; (K.S.); (M.L.R.)
| | - Chris Dardick
- Innovative Fruit Production, Improvement, and Protection, Appalachian Fruit Research Station, USDA-ARS, Kearneysville, WV 25430, USA; (J.L.); (C.D.)
| | - Margaret Staton
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN 37996, USA; (M.H.); (T.R.); (M.S.)
| | - Chandrika Ramadugu
- Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA; (K.S.); (M.L.R.)
| |
Collapse
|
3
|
Chen G, Yu D, Yang Y, Li X, Wang X, Sun D, Lu Y, Ke R, Zhang G, Cui J, Feng S. Adaptive expansion of ERVK solo-LTRs is associated with Passeriformes speciation events. Nat Commun 2024; 15:3151. [PMID: 38605055 PMCID: PMC11009239 DOI: 10.1038/s41467-024-47501-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 04/02/2024] [Indexed: 04/13/2024] Open
Abstract
Endogenous retroviruses (ERVs) are ancient retroviral remnants integrated in host genomes, and commonly deleted through unequal homologous recombination, leaving solitary long terminal repeats (solo-LTRs). This study, analysing the genomes of 362 bird species and their reptilian and mammalian outgroups, reveals an unusually higher level of solo-LTRs formation in birds, indicating evolutionary forces might have purged ERVs during evolution. Strikingly in the order Passeriformes, and especially the parvorder Passerida, endogenous retrovirus K (ERVK) solo-LTRs showed bursts of formation and recurrent accumulations coinciding with speciation events over past 22 million years. Moreover, our results indicate that the ongoing expansion of ERVK solo-LTRs in these bird species, marked by high transcriptional activity of ERVK retroviral genes in reproductive organs, caused variation of solo-LTRs between individual zebra finches. We experimentally demonstrated that cis-regulatory activity of recently evolved ERVK solo-LTRs may significantly increase the expression level of ITGA2 in the brain of zebra finches compared to chickens. These findings suggest that ERVK solo-LTRs expansion may introduce novel genomic sequences acting as cis-regulatory elements and contribute to adaptive evolution. Overall, our results underscore that the residual sequences of ancient retroviruses could influence the adaptive diversification of species by regulating host gene expression.
Collapse
Affiliation(s)
- Guangji Chen
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- Center for Evolutionary & Organismal Biology, Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, China
- BGI Research, Wuhan, China
| | - Dan Yu
- Center for Evolutionary & Organismal Biology, Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, China
- Center for Genomic Research, International Institutes of Medicine, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, Zhejiang, China
| | - Yu Yang
- School of Medicine, Huaqiao University, Xiamen, Fujian, 361021, China
| | - Xiang Li
- CAS Key Laboratory of Molecular Virology & Immunology, Shanghai Institute of Immunity and Infection, Chinese Academy of Sciences, Shanghai, China
| | - Xiaojing Wang
- CAS Key Laboratory of Molecular Virology & Immunology, Shanghai Institute of Immunity and Infection, Chinese Academy of Sciences, Shanghai, China
| | - Danyang Sun
- Center for Evolutionary & Organismal Biology, Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, China
- Center for Genomic Research, International Institutes of Medicine, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, Zhejiang, China
| | - Yanlin Lu
- Center for Evolutionary & Organismal Biology, Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, China
- Center for Genomic Research, International Institutes of Medicine, The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, Zhejiang, China
| | - Rongqin Ke
- School of Medicine, Huaqiao University, Xiamen, Fujian, 361021, China
| | - Guojie Zhang
- Center for Evolutionary & Organismal Biology, Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, China
- Innovation Center of Yangtze River Delta, Zhejiang University, Jiashan, China
| | - Jie Cui
- Department of Infectious Diseases, National Medical Center for Infectious Diseases, Huashan Hospital, Institute of Infection and Health Research, Fudan University, Shanghai, China.
- Laboratory for Marine Biology and Biotechnology, Qingdao Marine Science and Technology Center, Qingdao, China.
- Shanghai Sci-Tech Inno Center for Infection & Immunity, Shanghai, 200052, China.
- Shanghai Key Laboratory of Infectious Diseases and Biosafety Emergency Response, Huashan Hospital, Fudan University, Shanghai, China.
| | - Shaohong Feng
- Center for Evolutionary & Organismal Biology, Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, China.
- Innovation Center of Yangtze River Delta, Zhejiang University, Jiashan, China.
- Department of General Surgery of Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, China.
| |
Collapse
|
4
|
Cao RB, Chen R, Liao KX, Li H, Xu GB, Jiang XL. Karyotype and LTR-RTs analysis provide insights into oak genomic evolution. BMC Genomics 2024; 25:328. [PMID: 38566015 PMCID: PMC10988972 DOI: 10.1186/s12864-024-10177-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 03/01/2024] [Indexed: 04/04/2024] Open
Abstract
BACKGROUND Whole-genome duplication and long terminal repeat retrotransposons (LTR-RTs) amplification in organisms are essential factors that affect speciation, local adaptation, and diversification of organisms. Understanding the karyotype projection and LTR-RTs amplification could contribute to untangling evolutionary history. This study compared the karyotype and LTR-RTs evolution in the genomes of eight oaks, a dominant lineage in Northern Hemisphere forests. RESULTS Karyotype projections showed that chromosomal evolution was relatively conservative in oaks, especially on chromosomes 1 and 7. Modern oak chromosomes formed through multiple fusions, fissions, and rearrangements after an ancestral triplication event. Species-specific chromosomal rearrangements revealed fragments preserved through natural selection and adaptive evolution. A total of 441,449 full-length LTR-RTs were identified from eight oak genomes, and the number of LTR-RTs for oaks from section Cyclobalanopsis was larger than in other sections. Recent amplification of the species-specific LTR-RTs lineages resulted in significant variation in the abundance and composition of LTR-RTs among oaks. The LTR-RTs insertion suppresses gene expression, and the suppressed intensity in gene regions was larger than in promoter regions. Some centromere and rearrangement regions indicated high-density peaks of LTR/Copia and LTR/Gypsy. Different centromeric regional repeat units (32, 78, 79 bp) were detected on different Q. glauca chromosomes. CONCLUSION Chromosome fusions and arm exchanges contribute to the formation of oak karyotypes. The composition and abundance of LTR-RTs are affected by its recent amplification. LTR-RTs random retrotransposition suppresses gene expression and is enriched in centromere and chromosomal rearrangement regions. This study provides novel insights into the evolutionary history of oak karyotypes and the organization, amplification, and function of LTR-RTs.
Collapse
Affiliation(s)
- Rui-Bin Cao
- The Laboratory of Forestry Genetics, Central South University of Forestry and Technology, 410004, Changsha, Hunan, China
| | - Ran Chen
- The Laboratory of Forestry Genetics, Central South University of Forestry and Technology, 410004, Changsha, Hunan, China
| | - Ke-Xin Liao
- The Laboratory of Forestry Genetics, Central South University of Forestry and Technology, 410004, Changsha, Hunan, China
| | - He Li
- The Laboratory of Forestry Genetics, Central South University of Forestry and Technology, 410004, Changsha, Hunan, China
| | - Gang-Biao Xu
- The Laboratory of Forestry Genetics, Central South University of Forestry and Technology, 410004, Changsha, Hunan, China
| | - Xiao-Long Jiang
- The Laboratory of Forestry Genetics, Central South University of Forestry and Technology, 410004, Changsha, Hunan, China.
| |
Collapse
|
5
|
Cang FA, Welles SR, Wong J, Ziaee M, Dlugosch KM. Genome size variation and evolution during invasive range expansion in an introduced plant. Evol Appl 2024; 17:e13624. [PMID: 38283607 PMCID: PMC10810172 DOI: 10.1111/eva.13624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 11/03/2023] [Accepted: 11/13/2023] [Indexed: 01/30/2024] Open
Abstract
Plants demonstrate exceptional variation in genome size across species, and their genome sizes can also vary dramatically across individuals and populations within species. This aspect of genetic variation can have consequences for traits and fitness, but few studies attributed genome size differentiation to ecological and evolutionary processes. Biological invasions present particularly useful natural laboratories to infer selective agents that might drive genome size shifts across environments and population histories. Here, we test hypotheses for the evolutionary causes of genome size variation across 14 invading populations of yellow starthistle, Centaurea solstitialis, in California, United States. We use a survey of genome sizes and trait variation to ask: (1) Is variation in genome size associated with developmental trait variation? (2) Are genome sizes smaller toward the leading edge of the expansion, consistent with selection for "colonizer" traits? Or alternatively, does genome size increase toward the leading edge of the expansion, consistent with predicted consequences of founder effects and drift? (3) Finally, are genome sizes smaller at higher elevations, consistent with selection for shorter development times? We found that 2C DNA content varied 1.21-fold among all samples, and was associated with flowering time variation, such that plants with larger genomes reproduced later, with lower lifetime capitula production. Genome sizes increased toward the leading edge of the invasion, but tended to decrease at higher elevations, consistent with genetic drift during range expansion but potentially strong selection for smaller genomes and faster development time at higher elevations. These results demonstrate how genome size variation can contribute to traits directly tied to reproductive success, and how selection and drift can shape that variation. We highlight the influence of genome size on dynamics underlying a rapid range expansion in a highly problematic invasive plant.
Collapse
Affiliation(s)
- F. Alice Cang
- Department of Ecology and Evolutionary BiologyUniversity of ArizonaTucsonArizonaUSA
| | - Shana R. Welles
- Department of Ecology and Evolutionary BiologyUniversity of ArizonaTucsonArizonaUSA
- Utah Valley UniversityOremUtahUSA
| | - Jenny Wong
- Department of Ecology and Evolutionary BiologyUniversity of ArizonaTucsonArizonaUSA
| | - Maia Ziaee
- Department of Ecology and Evolutionary BiologyUniversity of ArizonaTucsonArizonaUSA
- Mills CollegeOaklandCaliforniaUSA
| | - Katrina M. Dlugosch
- Department of Ecology and Evolutionary BiologyUniversity of ArizonaTucsonArizonaUSA
| |
Collapse
|
6
|
Lyu K, Xiao J, Lyu S, Liu R. Comparative Analysis of Transposable Elements in Strawberry Genomes of Different Ploidy Levels. Int J Mol Sci 2023; 24:16935. [PMID: 38069258 PMCID: PMC10706760 DOI: 10.3390/ijms242316935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 11/25/2023] [Accepted: 11/27/2023] [Indexed: 12/18/2023] Open
Abstract
Transposable elements (TEs) make up a large portion of plant genomes and play a vital role in genome structure, function, and evolution. Cultivated strawberry (Fragaria x ananassa) is one of the most important fruit crops, and its octoploid genome was formed through several rounds of genome duplications from diploid ancestors. Here, we built a pan-genome TE library for the Fragaria genus using ten published strawberry genomes at different ploidy levels, including seven diploids, one tetraploid, and two octoploids, and performed comparative analysis of TE content in these genomes. The TEs comprise 51.83% (F. viridis) to 60.07% (F. nilgerrensis) of the genomes. Long terminal repeat retrotransposons (LTR-RTs) are the predominant TE type in the Fragaria genomes (20.16% to 34.94%), particularly in F. iinumae (34.94%). Estimating TE content and LTR-RT insertion times revealed that species-specific TEs have shaped each strawberry genome. Additionally, the copy number of different LTR-RT families inserted in the last one million years reflects the genetic distance between Fragaria species. Comparing cultivated strawberry subgenomes to extant diploid ancestors showed that F. vesca and F. iinumae are likely the diploid ancestors of the cultivated strawberry, but not F. viridis. These findings provide new insights into the TE variations in the strawberry genomes and their roles in strawberry genome evolution.
Collapse
Affiliation(s)
- Keliang Lyu
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (K.L.); (S.L.)
- Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou 350002, China;
| | - Jiajing Xiao
- Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou 350002, China;
| | - Shiheng Lyu
- College of Horticulture, Fujian Agriculture and Forestry University, Fuzhou 350002, China; (K.L.); (S.L.)
| | - Renyi Liu
- Haixia Institute of Science and Technology, Fujian Agriculture and Forestry University, Fuzhou 350002, China;
| |
Collapse
|
7
|
Yang LL, Zhang XY, Wang LY, Li YG, Li XT, Yang Y, Su Q, Chen N, Zhang YL, Li N, Deng CL, Li SF, Gao WJ. Lineage-specific amplification and epigenetic regulation of LTR-retrotransposons contribute to the structure, evolution, and function of Fabaceae species. BMC Genomics 2023; 24:423. [PMID: 37501164 PMCID: PMC10373317 DOI: 10.1186/s12864-023-09530-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Accepted: 07/22/2023] [Indexed: 07/29/2023] Open
Abstract
BACKGROUND Long terminal repeat (LTR)-retrotransposons (LTR-RTs) are ubiquitous and make up the majority of nearly all sequenced plant genomes, whereas their pivotal roles in genome evolution, gene expression regulation as well as their epigenetic regulation are still not well understood, especially in a large number of closely related species. RESULTS Here, we analyzed the abundance and dynamic evolution of LTR-RTs in 54 species from an economically and agronomically important family, Fabaceae, and also selected two representative species for further analysis in expression of associated genes, transcriptional activity and DNA methylation patterns of LTR-RTs. Annotation results revealed highly varied proportions of LTR-RTs in these genomes (5.1%~68.4%) and their correlation with genome size was highly positive, and they were significantly contributed to the variance in genome size through species-specific unique amplifications. Almost all of the intact LTR-RTs were inserted into the genomes 4 Mya (million years ago), and more than 50% of them were inserted in the last 0.5 million years, suggesting that recent amplifications of LTR-RTs were an important force driving genome evolution. In addition, expression levels of genes with intronic, promoter, and downstream LTR-RT insertions of Glycine max and Vigna radiata, two agronomically important crops in Fabaceae, showed that the LTR-RTs located in promoter or downstream regions suppressed associated gene expression. However, the LTR-RTs within introns promoted gene expression or had no contribution to gene expression. Additionally, shorter and younger LTR-RTs maintained higher mobility and transpositional potential. Compared with the transcriptionally silent LTR-RTs, the active elements showed significantly lower DNA methylation levels in all three contexts. The distributions of transcriptionally active and silent LTR-RT methylation varied across different lineages due to the position of LTR-RTs located or potentially epigenetic regulation. CONCLUSION Lineage-specific amplification patterns were observed and higher methylation level may repress the activity of LTR-RTs, further influence evolution in Fabaceae species. This study offers valuable clues into the evolution, function, transcriptional activity and epigenetic regulation of LTR-RTs in Fabaceae genomes.
Collapse
Affiliation(s)
- Long-Long Yang
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Xin-Yu Zhang
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Li-Ying Wang
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Yan-Ge Li
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Xiao-Ting Li
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Yi Yang
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Qing Su
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Ning Chen
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Yu-Lan Zhang
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Ning Li
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Chuan-Liang Deng
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China
| | - Shu-Fen Li
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China.
| | - Wu-Jun Gao
- College of Life Sciences, Henan Normal University, Xinxiang, 453007, P. R. China.
| |
Collapse
|
8
|
Hartig N, Seibt KM, Heitkam T. How to start a LINE: 5' switching rejuvenates LINE retrotransposons in tobacco and related Nicotiana species. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2023. [PMID: 36965091 DOI: 10.1111/tpj.16208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 02/10/2023] [Accepted: 02/19/2023] [Indexed: 06/18/2023]
Abstract
By contrast to their conserved mammalian counterparts, plant long interspersed nuclear elements (LINEs) are highly variable, splitting into many low-copy families. Curiously, LINE families from the retrotransposable element (RTE) clade retain a stronger sequence conservation and hence reach higher copy numbers. The cause of this RTE-typical property is not yet understood, but would help clarify why some transposable elements are removed quickly, whereas others persist in plant genomes. Here, we bring forward a detailed study of RTE LINE structure, diversity and evolution in plants. For this, we argue that the nightshade family is the ideal taxon to follow the evolutionary trajectories of RTE LINEs, given their high abundance, recent activity and partnership to non-autonomous elements. Using bioinformatic, cytogenetic and molecular approaches, we detect 4029 full-length RTE LINEs across the Solanaceae. We finely characterize and manually curate a core group of 458 full-length LINEs in allotetraploid tobacco, show an integration event after polyploidization and trace hybridization by RTE LINE composition of parental genomes. Finally, we reveal the role of the untranslated regions (UTRs) as causes for the unique RTE LINE amplification and evolution pattern in plants. On the one hand, we detected a highly conserved motif at the 3' UTR, suggesting strong selective constraints acting on the RTE terminus. On the other hand, we observed successive rounds of 5' UTR cycling, constantly rejuvenating the promoter sequences. This interplay between exchangeable promoters and conserved LINE bodies and 3' UTR likely allows RTE LINEs to persist and thrive in plant genomes.
Collapse
Affiliation(s)
- Nora Hartig
- Faculty of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - Kathrin M Seibt
- Faculty of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - Tony Heitkam
- Faculty of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| |
Collapse
|
9
|
Sun Y, Liu Y, Shi J, Wang L, Liang C, Yang J, Chen J, Chen M. Biased mutations and gene losses underlying diploidization of the tetraploid broomcorn millet genome. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2023; 113:787-801. [PMID: 36575912 DOI: 10.1111/tpj.16085] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 12/07/2022] [Accepted: 12/22/2022] [Indexed: 06/17/2023]
Abstract
Broomcorn millet (Panicum miliaceum L.) is one of the earliest domesticated crops, and is a valuable resource to secure food diversity and combat drought stresses under the global warming scenario. However, due to the absence of extant diploid progenitors, the polyploidy genome of broomcorn millet remains poorly understood. Here, we report the chromosome-scale genome assembly of broomcorn millet. We divided the broomcorn millet genome into two subgenomes using the genome sequence of Panicum hallii, a diploid relative of broomcorn millet. Our analyses revealed that the two subgenomes diverged at ~4.8 million years ago (Mya), while the allotetraploidization of broomcorn millet may have occurred about ~0.48 Mya, suggesting that broomcorn millet is a relatively recent allotetraploid. Comparative analyses showed that subgenome B was larger than subgenome A in size, which was caused by the biased accumulation of long terminal repeat retrotransposons in the progenitor of subgenome B before polyploidization. Notably, the accumulation of biased mutations in the transposable element-rich subgenome B led to more gene losses. Although no significant dominance of either subgenome was observed in the expression profiles of broomcorn millet, we found the minimally expressed genes in P. hallii tended to be lost during diploidization of broomcorn millet. These results suggest that broomcorn millet is at the early stage of diploidization and that mutations likely occurred more on genes that were marked with lower expression levels.
Collapse
Affiliation(s)
- Yanling Sun
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 100101, Beijing, China
- University of Chinese Academy of Sciences, 100039, Beijing, China
| | - Yang Liu
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 100101, Beijing, China
- University of Chinese Academy of Sciences, 100039, Beijing, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, 100101, Beijing, China
| | - Jinfeng Shi
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 100101, Beijing, China
| | - Lun Wang
- Institute of Crop Germplasm Resources, Shanxi Academy of Agricultural Sciences, 030031, Taiyuan, China
| | - Chengzhi Liang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 100101, Beijing, China
- University of Chinese Academy of Sciences, 100039, Beijing, China
| | - Jun Yang
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, 201602, Shanghai, China
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, 200032, Shanghai, China
| | - Jinfeng Chen
- University of Chinese Academy of Sciences, 100039, Beijing, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, 100101, Beijing, China
| | - Mingsheng Chen
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 100101, Beijing, China
- University of Chinese Academy of Sciences, 100039, Beijing, China
| |
Collapse
|
10
|
Monshi FI, Katsube-Tanaka T. 2S albumin g13 polypeptide, less related to Fag e 2, can be eliminated in common buckwheat (Fagopyrum esculentum Moench) seeds. FOOD CHEMISTRY: MOLECULAR SCIENCES 2022; 5:100138. [PMID: 36187231 PMCID: PMC9523277 DOI: 10.1016/j.fochms.2022.100138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Revised: 09/22/2022] [Accepted: 09/24/2022] [Indexed: 11/06/2022]
Abstract
2S albumin (g11, g13, g14, and g28) is an important allergen in common buckwheat. g13 is hydrophobic, scarce, and less related to g14 than g11/g28 is related to g14. g13_null allele homozygote produced no g13 protein in seeds. Insert-like sequence of g13_null allele resided frequently in buckwheat genome. g13_null homozygote lowered allergenicity in common buckwheat.
2S albumin (g11, g13, g14, and g28) is an important allergen in common buckwheat (Fagopyrum esculentum). g13 is hydrophobic, rare in seeds, and may show distinct allergenicity from the others; therefore, we tried to eliminate this protein. Phylogenetic and property distance analyses indicated g13 is less related to g14 (Fag e 2) than g11/g28 is related to g14, particularly in the second domain containing the II and III α-helices. A null allele with a 531 bp insertion in the coding region was found for g13 at an allele frequency of 2 % in natural populations of common buckwheat. The g13_null allele homozygote accumulated no g13 protein. A BLAST search for the 531 bp insertion suggested the insert-like sequence resided frequently in the buckwheat genome, including the self-incompatibility responsible gene ELF3 in Fagopyrum tataricum. The g13_null insert-like sequence could, therefore, help in producing hypoallergenic cultivars, and expand the genetic diversity of buckwheat.
Collapse
|
11
|
Cerbin S, Ou S, Li Y, Sun Y, Jiang N. Distinct composition and amplification dynamics of transposable elements in sacred lotus (Nelumbo nucifera Gaertn.). THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 112:172-192. [PMID: 35959634 PMCID: PMC9804982 DOI: 10.1111/tpj.15938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Revised: 07/19/2022] [Accepted: 08/08/2022] [Indexed: 06/15/2023]
Abstract
Sacred lotus (Nelumbo nucifera Gaertn.) is a basal eudicot plant with a unique lifestyle, physiological features, and evolutionary characteristics. Here we report the unique profile of transposable elements (TEs) in the genome, using a manually curated repeat library. TEs account for 59% of the genome, and hAT (Ac/Ds) elements alone represent 8%, more than in any other known plant genome. About 18% of the lotus genome is comprised of Copia LTR retrotransposons, and over 25% of them are associated with non-canonical termini (non-TGCA). Such high abundance of non-canonical LTR retrotransposons has not been reported for any other organism. TEs are very abundant in genic regions, with retrotransposons enriched in introns and DNA transposons primarily in flanking regions of genes. The recent insertion of TEs in introns has led to significant intron size expansion, with a total of 200 Mb in the 28 455 genes. This is accompanied by declining TE activity in intergenic regions, suggesting distinct control efficacy of TE amplification in different genomic compartments. Despite the prevalence of TEs in genic regions, some genes are associated with fewer TEs, such as those involved in fruit ripening and stress responses. Other genes are enriched with TEs, and genes in epigenetic pathways are the most associated with TEs in introns, indicating a dynamic interaction between TEs and the host surveillance machinery. The dramatic differential abundance of TEs with genes involved in different biological processes as well as the variation of target preference of different TEs suggests the composition and activity of TEs influence the path of evolution.
Collapse
Affiliation(s)
- Stefan Cerbin
- Department of HorticultureMichigan State University1066 Bogue StreetEast LansingMI48824USA
- Present address:
Department of Ecology & Evolutionary BiologyUniversity of Kansas1200 Sunnyside AvenueLawrenceKS66045USA
| | - Shujun Ou
- Department of HorticultureMichigan State University1066 Bogue StreetEast LansingMI48824USA
- Present address:
Department of Computer ScienceJohns Hopkins UniversityBaltimoreMD21218USA
| | - Yang Li
- Department of Electrical EngineeringCity University of Hong KongKowloonHong Kong SARChina
| | - Yanni Sun
- Department of Electrical EngineeringCity University of Hong KongKowloonHong Kong SARChina
| | - Ning Jiang
- Department of HorticultureMichigan State University1066 Bogue StreetEast LansingMI48824USA
| |
Collapse
|
12
|
Planta J, Liang YY, Xin H, Chansler MT, Prather LA, Jiang N, Jiang J, Childs KL. Chromosome-scale genome assemblies and annotations for Poales species Carex cristatella, Carex scoparia, Juncus effusus, and Juncus inflexus. G3 GENES|GENOMES|GENETICS 2022; 12:6670624. [PMID: 35976112 PMCID: PMC9526063 DOI: 10.1093/g3journal/jkac211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Accepted: 07/18/2022] [Indexed: 12/03/2022]
Abstract
The majority of sequenced genomes in the monocots are from species belonging to Poaceae, which include many commercially important crops. Here, we expand the number of sequenced genomes from the monocots to include the genomes of 4 related cyperids: Carex cristatella and Carex scoparia from Cyperaceae and Juncus effusus and Juncus inflexus from Juncaceae. The high-quality, chromosome-scale genome sequences from these 4 cyperids were assembled by combining whole-genome shotgun sequencing of Nanopore long reads, Illumina short reads, and Hi-C sequencing data. Some members of the Cyperaceae and Juncaceae are known to possess holocentric chromosomes. We examined the repeat landscapes in our sequenced genomes to search for potential repeats associated with centromeres. Several large satellite repeat families, comprising 3.2–9.5% of our sequenced genomes, showed dispersed distribution of large satellite repeat clusters across all Carex chromosomes, with few instances of these repeats clustering in the same chromosomal regions. In contrast, most large Juncus satellite repeats were clustered in a single location on each chromosome, with sporadic instances of large satellite repeats throughout the Juncus genomes. Recognizable transposable elements account for about 20% of each of the 4 genome assemblies, with the Carex genomes containing more DNA transposons than retrotransposons while the converse is true for the Juncus genomes. These genome sequences and annotations will facilitate better comparative analysis within monocots.
Collapse
Affiliation(s)
- Jose Planta
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
- National Institute of Molecular Biology and Biotechnology, University of the Philippines , Diliman, Quezon City 1101, Philippines
| | - Yu-Ya Liang
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
| | - Haoyang Xin
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
| | - Matthew T Chansler
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
| | - L Alan Prather
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
| | - Ning Jiang
- Department of Horticulture, MSU AgBioResearch, Michigan State University , East Lansing, MI 48824, USA
| | - Jiming Jiang
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
- Department of Horticulture, MSU AgBioResearch, Michigan State University , East Lansing, MI 48824, USA
| | - Kevin L Childs
- Department of Plant Biology, Michigan State University , East Lansing, MI 48824, USA
| |
Collapse
|
13
|
Yin L, Xu G, Yang J, Zhao M. The Heterogeneity in the Landscape of Gene Dominance in Maize is Accompanied by Unique Chromatin Environments. Mol Biol Evol 2022; 39:6709529. [PMID: 36130304 PMCID: PMC9547528 DOI: 10.1093/molbev/msac198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Subgenome dominance after whole-genome duplication (WGD) has been observed in many plant species. However, the degree to which the chromatin environment affects this bias has not been explored. Here, we compared the dominant subgenome (maize1) and the recessive subgenome (maize2) with respect to patterns of sequence substitutions, genes expression, transposable element accumulation, small interfering RNAs, DNA methylation, histone modifications, and accessible chromatin regions (ACRs). Our data show that the degree of bias between subgenomes for all the measured variables does not vary significantly when both of the WGD genes are located in pericentromeric regions. Our data further indicate that the location of maize1 genes in chromosomal arms is pivotal for maize1 to maintain its dominance, but location has a less effect on maize2 homoeologs. In addition to homoeologous genes, we compared ACRs, which often harbor cis-regulatory elements, between the two subgenomes and demonstrate that maize1 ACRs have a higher level of chromatin accessibility, a lower level of sequence substitution, and are enriched in chromosomal arms. Furthermore, we find that a loss of maize1 ACRs near their nearby genes is associated with a reduction in purifying selection and expression of maize1 genes relative to their maize2 homoeologs. Taken together, our data suggest that chromatin environment and cis-regulatory elements are important determinants shaping the divergence and evolution of duplicated genes.
Collapse
Affiliation(s)
- Liangwei Yin
- Department of Biology, Miami University, Oxford, OH 45056
| | - Gen Xu
- Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, NE 68588,Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, NE 68583
| | - Jinliang Yang
- Department of Agronomy and Horticulture, University of Nebraska-Lincoln, Lincoln, NE 68588,Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, NE 68583
| | | |
Collapse
|
14
|
Li SF, She HB, Yang LL, Lan LN, Zhang XY, Wang LY, Zhang YL, Li N, Deng CL, Qian W, Gao WJ. Impact of LTR-Retrotransposons on Genome Structure, Evolution, and Function in Curcurbitaceae Species. Int J Mol Sci 2022; 23:ijms231710158. [PMID: 36077556 PMCID: PMC9456015 DOI: 10.3390/ijms231710158] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 09/02/2022] [Accepted: 09/02/2022] [Indexed: 11/17/2022] Open
Abstract
Long terminal repeat (LTR)-retrotransposons (LTR-RTs) comprise a major portion of many plant genomes and may exert a profound impact on genome structure, function, and evolution. Although many studies have focused on these elements in an individual species, their dynamics on a family level remains elusive. Here, we investigated the abundance, evolutionary dynamics, and impact on associated genes of LTR-RTs in 16 species in an economically important plant family, Cucurbitaceae. Results showed that full-length LTR-RT numbers and LTR-RT content varied greatly among different species, and they were highly correlated with genome size. Most of the full-length LTR-RTs were amplified after the speciation event, reflecting the ongoing rapid evolution of these genomes. LTR-RTs highly contributed to genome size variation via species-specific distinct proliferations. The Angela and Tekay lineages with a greater evolutionary age were amplified in Trichosanthes anguina, whereas a recent activity burst of Reina and another ancient round of Tekay activity burst were examined in Sechium edule. In addition, Tekay and Retand lineages belonging to the Gypsy superfamily underwent a recent burst in Gynostemma pentaphyllum. Detailed investigation of genes with intronic and promoter LTR-RT insertion showed diverse functions, but the term of metabolism was enriched in most species. Further gene expression analysis in G.pentaphyllum revealed that the LTR-RTs within introns suppress the corresponding gene expression, whereas the LTR-RTs within promoters exert a complex influence on the downstream gene expression, with the main function of promoting gene expression. This study provides novel insights into the organization, evolution, and function of LTR-RTs in Cucurbitaceae genomes.
Collapse
Affiliation(s)
- Shu-Fen Li
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
| | - Hong-Bing She
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Long-Long Yang
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
| | - Li-Na Lan
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
| | - Xin-Yu Zhang
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
| | - Li-Ying Wang
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
| | - Yu-Lan Zhang
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
| | - Ning Li
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
| | - Chuan-Liang Deng
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
| | - Wei Qian
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing 100081, China
- Correspondence: (W.Q.); (W.-J.G.)
| | - Wu-Jun Gao
- College of Life Sciences, Henan Normal University, Xinxiang 453007, China
- Correspondence: (W.Q.); (W.-J.G.)
| |
Collapse
|
15
|
Integration of Repeatomic and Cytogenetic Data on Satellite DNA for the Genome Analysis in the Genus Salvia (Lamiaceae). PLANTS 2022; 11:plants11172244. [PMID: 36079625 PMCID: PMC9460151 DOI: 10.3390/plants11172244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 08/25/2022] [Accepted: 08/25/2022] [Indexed: 11/17/2022]
Abstract
Within the complicated and controversial taxonomy of cosmopolitan genus Salvia L. (Lamiaceae) are valuable species Salvia officinalis L. and Salvia sclarea L., which are important for the pharmaceutical, ornamental horticulture, food, and perfume industries. Genome organization and chromosome structure of these essential oil species remain insufficiently studied. For the first time, the comparative repeatome analysis of S. officinalis and S. sclarea was performed using the obtained NGS data, RepeatExplorer/TAREAN pipelines and FISH-based chromosome mapping of the revealed satellite DNA families (satDNAs). In repeatomes of these species, LTR retrotransposons made up the majority of their repetitive DNA. Interspecific variations in genome abundance of Class I and Class II transposable elements, ribosomal DNA, and satellite DNA were revealed. Four (S. sclarea) and twelve (S. officinalis) putative satDNAs were identified. Based on patterns of chromosomal distribution of 45S rDNA; 5S rDNA and the revealed satDNAs, karyograms of S. officinalis and S. sclarea were constructed. Promising satDNAs which can be further used as chromosome markers to assess inter- and intraspecific chromosome variability in Salvia karyotypes were determined. The specific localization of homologous satDNA and 45S rDNA on chromosomes of the studied Salvia species confirmed their common origin, which is consistent with previously reported molecular phylogenetic data.
Collapse
|
16
|
Martínez-Fortún J, Phillips DW, Jones HD. Natural and artificial sources of genetic variation used in crop breeding: A baseline comparator for genome editing. Front Genome Ed 2022; 4:937853. [PMID: 36072906 PMCID: PMC9441798 DOI: 10.3389/fgeed.2022.937853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 07/13/2022] [Indexed: 11/13/2022] Open
Abstract
Traditional breeding has successfully selected beneficial traits for food, feed, and fibre crops over the last several thousand years. The last century has seen significant technological advancements particularly in marker assisted selection and the generation of induced genetic variation, including over the last few decades, through mutation breeding, genetic modification, and genome editing. While regulatory frameworks for traditional varietal development and for genetic modification with transgenes are broadly established, those for genome editing are lacking or are still evolving in many regions. In particular, the lack of “foreign” recombinant DNA in genome edited plants and that the resulting SNPs or INDELs are indistinguishable from those seen in traditional breeding has challenged development of new legislation. Where products of genome editing and other novel breeding technologies possess no transgenes and could have been generated via traditional methods, we argue that it is logical and proportionate to apply equivalent legislative oversight that already exists for traditional breeding and novel foods. This review analyses the types and the scale of spontaneous and induced genetic variation that can be selected during traditional plant breeding activities. It provides a base line from which to judge whether genetic changes brought about by techniques of genome editing or other reverse genetic methods are indeed comparable to those routinely found using traditional methods of plant breeding.
Collapse
|
17
|
Orozco-Arias S, Candamil-Cortes MS, Jaimes PA, Valencia-Castrillon E, Tabares-Soto R, Isaza G, Guyot R. Automatic curation of LTR retrotransposon libraries from plant genomes through machine learning. J Integr Bioinform 2022; 19:jib-2021-0036. [PMID: 35822734 PMCID: PMC9521825 DOI: 10.1515/jib-2021-0036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 06/10/2022] [Indexed: 11/19/2022] Open
Abstract
Transposable elements are mobile sequences that can move and insert themselves into chromosomes, activating under internal or external stimuli, giving the organism the ability to adapt to the environment. Annotating transposable elements in genomic data is currently considered a crucial task to understand key aspects of organisms such as phenotype variability, species evolution, and genome size, among others. Because of the way they replicate, LTR retrotransposons are the most common transposable elements in plants, accounting in some cases for up to 80% of all DNA information. To annotate these elements, a reference library is usually created, a curation process is performed, eliminating TE fragments and false positives and then annotated in the genome using the homology method. However, the curation process can take weeks, requires extensive manual work and the execution of multiple time-consuming bioinformatics software. Here, we propose a machine learning-based approach to perform this process automatically on plant genomes, obtaining up to 91.18% F1-score. This approach was tested with four plant species, obtaining up to 93.6% F1-score (Oryza granulata) in only 22.61 s, where bioinformatics methods took approximately 6 h. This acceleration demonstrates that the ML-based approach is efficient and could be used in massive sequencing projects.
Collapse
Affiliation(s)
- Simon Orozco-Arias
- Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Colombia.,Department of Systems and Informatics, Universidad de Caldas, Manizales, Colombia
| | | | - Paula A Jaimes
- Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Colombia
| | | | - Reinel Tabares-Soto
- Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Colombia
| | - Gustavo Isaza
- Department of Systems and Informatics, Universidad de Caldas, Manizales, Colombia
| | - Romain Guyot
- Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Colombia.,Institut de Recherche pour le Développement, CIRAD, Univ. Montpellier, Montpellier, France
| |
Collapse
|
18
|
Bradbury PJ, Casstevens T, Jensen SE, Johnson LC, Miller ZR, Monier B, Romay MC, Song B, Buckler ES. The Practical Haplotype Graph, a platform for storing and using pangenomes for imputation. Bioinformatics 2022; 38:3698-3702. [PMID: 35748708 PMCID: PMC9344836 DOI: 10.1093/bioinformatics/btac410] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Revised: 02/28/2022] [Accepted: 06/22/2022] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Pangenomes provide novel insights for population and quantitative genetics, genomics, and breeding not available from studying a single reference genome. Instead, a species is better represented by a pangenome or collection of genomes. Unfortunately, managing and using pangenomes for genomically diverse species is computationally and practically challenging. We developed a trellis graph representation anchored to the reference genome that represents most pangenomes well and can be used to impute complete genomes from low density sequence or variant data. RESULTS The Practical Haplotype Graph (PHG) is a pangenome pipeline, database (PostGRES & SQLite), data model (Java, Kotlin, or R), and Breeding API (BrAPI) web service. The PHG has already been able to accurately represent diversity in four major crops including maize, one of the most genomically diverse species, with up to 1000-fold data compression. Using simulated data, we show that, at even 0.1X coverage, with appropriate reads and sequence alignment, imputation results in extremely accurate haplotype reconstruction. The PHG is a platform and environment for the understanding and application of genomic diversity. AVAILABILITY All resources listed here are freely available. The PHG Docker used to generate the simulation results is https://hub.docker.com/ as maizegenetics/phg:0.0.27. PHG source code is at https://bitbucket.org/bucklerlab/practicalhaplotypegraph/src/master/. The code used for the analysis of simulated data is at https://bitbucket.org/bucklerlab/phg-manuscript/src/master/. The PHG database of NAM parent haplotypes is in the CyVerse data store (https://de.cyverse.org/de/) and named /iplant/home/shared/panzea/panGenome/PHG_db_maize/phg_v5Assemblies_20200608.db. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- P J Bradbury
- United States Department of Agriculture-Agricultural Research Service, Robert W. Holley Center, Ithaca, NY 14853 USA
| | - T Casstevens
- Institute for Genomic Diversity,Cornell University, Ithaca, NY 14853 USA
| | - S E Jensen
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - L C Johnson
- Institute for Genomic Diversity,Cornell University, Ithaca, NY 14853 USA
| | - Z R Miller
- Institute for Genomic Diversity,Cornell University, Ithaca, NY 14853 USA
| | - B Monier
- Institute for Genomic Diversity,Cornell University, Ithaca, NY 14853 USA
| | - M C Romay
- Institute for Genomic Diversity,Cornell University, Ithaca, NY 14853 USA
| | - B Song
- Institute for Genomic Diversity,Cornell University, Ithaca, NY 14853 USA
| | - E S Buckler
- United States Department of Agriculture-Agricultural Research Service, Robert W. Holley Center, Ithaca, NY 14853 USA.,Institute for Genomic Diversity,Cornell University, Ithaca, NY 14853 USA.,Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
19
|
Gu X, Su Y, Wang T. 转座元件对植物基因组进化、表观遗传和适应性的作用. CHINESE SCIENCE BULLETIN-CHINESE 2022. [DOI: 10.1360/tb-2022-0296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
20
|
Amosova AV, Yurkevich OY, Bolsheva NL, Samatadze TE, Zoshchuk SA, Muravenko OV. Repeatome Analyses and Satellite DNA Chromosome Patterns in Deschampsia sukatschewii, D. cespitosa, and D. antarctica (Poaceae). Genes (Basel) 2022; 13:genes13050762. [PMID: 35627148 PMCID: PMC9141916 DOI: 10.3390/genes13050762] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2022] [Revised: 04/20/2022] [Accepted: 04/22/2022] [Indexed: 02/05/2023] Open
Abstract
Subpolar and polar ecotypes of Deschampsia sukatschewii (Popl.) Roshev, D. cespitosa (L.) P. Beauv, and D. antarctica E. Desv. are well adapted to stressful environmental conditions, which make them useful model plants for genetic research and breeding. For the first time, the comparative repeatome analyses of subpolar and polar D. sukatschewii, D. cespitosa, and D. antarctica was performed using RepeatExplorer/TAREAN pipelines and FISH-based chromosomal mapping of the identified satellite DNA families (satDNAs). In the studied species, mobile genetic elements of class 1 made up the majority of their repetitive DNA; interspecific variations in the total amount of Ty3/Gypsy and Ty1/Copia retroelements, DNA transposons, ribosomal, and satellite DNA were revealed; 12–18 high confident and 7–9 low confident putative satDNAs were identified. According to BLAST, most D. sukatschewii satDNAs demonstrated sequence similarity with satDNAs of D. antarctica and D. cespitosa indicating their common origin. Chromosomal mapping of 45S rDNA, 5S rDNA, and satDNAs of D. sukatschewii allowed us to construct the species karyograms and detect new molecular chromosome markers important for Deschampsia species. Our findings confirmed that genomes of D. sukatschewii and D. cespitosa were more closely related compared to D. antarctica according to repeatome composition and patterns of satDNA chromosomal distribution.
Collapse
|
21
|
Teresi SJ, Teresi MB, Edger PP. TE Density: a tool to investigate the biology of transposable elements. Mob DNA 2022; 13:11. [PMID: 35413944 PMCID: PMC9004194 DOI: 10.1186/s13100-022-00264-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 02/16/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Transposable elements (TEs) are powerful creators of genotypic and phenotypic diversity due to their inherent mutagenic capabilities and in this way they serve as a deep reservoir of sequences for genomic variation. As agents of genetic disruption, a TE's potential to impact phenotype is partially a factor of its location in the genome. Previous research has shown TEs' ability to impact the expression of neighboring genes, however our understanding of this trend is hampered by the exceptional amount of diversity in the TE world, and a lack of publicly available computational methods that quantify the presence of TEs relative to genes. RESULTS Here, we have developed a tool to more easily quantify TE presence relative to genes through the use of only a gene and TE annotation, yielding a new metric we call TE Density. Briefly defined as the proportion of TE-occupied base-pairs relative to a window-size of the genome. This new pipeline reports TE density for each gene in the genome, for each type descriptor of TE (order and superfamily), and for multiple positions and distances relative to the gene (upstream, intragenic, and downstream) over sliding, user-defined windows. In this way, we overcome previous limitations to the study of TE-gene relationships by focusing on all TE types present in the genome, utilizing flexible genomic distances for measurement, and reporting a TE presence metric for every gene in the genome. CONCLUSIONS Together, this new tool opens up new avenues for studying TE-gene relationships, genome architecture, comparative genomics, and the tremendous diversity present of the TE world. TE Density is open-source and freely available at: https://github.com/sjteresi/TE_Density .
Collapse
Affiliation(s)
- Scott J Teresi
- Department of Horticulture, Michigan State University, East Lansing, Michigan, USA
- Genetics and Genome Sciences Program, Michigan State University, East Lansing, Michigan, USA
| | | | - Patrick P Edger
- Department of Horticulture, Michigan State University, East Lansing, Michigan, USA.
- Genetics and Genome Sciences Program, Michigan State University, East Lansing, Michigan, USA.
| |
Collapse
|
22
|
Whole-genome resequencing reveals genomic footprints of Italian sweet and hot pepper heirlooms giving insight into genes underlying key agronomic and qualitative traits. BMC Genom Data 2022; 23:21. [PMID: 35337259 PMCID: PMC8957157 DOI: 10.1186/s12863-022-01039-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2021] [Accepted: 03/11/2022] [Indexed: 11/13/2022] Open
Abstract
Background Pepper is a major crop species of the Solanaceae family, largely appreciated for its high nutritional and healthy contribution to human diets. In the Mediterranean basin, the favorable pedoclimatic conditions enhanced the selection of several diversified landraces cultivated pepper (Capsicum annuum), for whom Italy can be considered a main pole of diversification. Hence, a survey of traditional C. annuum genetic resources is essential for deep understanding of such diversity and for applications in genomics assisted breeding. Here, we report whole-genome resequencing analyses of two sweet and two pungent genotypes highly diffused in South Italy and representative of the variability for shape, colour and nutritional properties. Results The four genomes were reconstructed at a chromosomal scale using a reference-guided approach, based on a dataset of 2.6 billion paired-end reads, corresponding to 20× genome coverage and a mapping rate above 99% for a final genomes size of approximately 3 Gb. After five iterations of variant calling, a total of 29,258,818 single nucleotide polymorphisms (SNPs) and 1,879,112 InDels, were identified. Substantial differences were observed among the four genomes based on geographical origin, with chromosomes 9 and 11 showing more polymorphisms in the accessions with higher fruit weight and absence of pungency. Among the identified variants, a small private indel (T - > TA) shared between sweet and big fruits accessions induces a frameshift with the generation of a new stop codon in a gene annotated as extensin, whereas two private SNPs within hot types were identified in 1-aminocyclopropane-1-carboxylate oxidase (ACO), a key gene involved in fruit ripening. The estimation of repetitive elements highlights a preponderant presence of Long Terminal Repeats (LTRs), the majority of which belonged to Gypsy superfamily. By comparing the four genomes with publicly available references including ‘CM334’ and Zunla-1 highlight the presence of 49,475 shared gene families. Conclusions The new genomic sequences aim to enrich the whole genome information of pepper local varieties, providing a valuable tool for precision gene mapping, marker discovery, comparative studies. Such knowledge widens the frontiers to understand the selection history of Italian pepper landraces toward the recognition of specificity local agri-food products marks. Supplementary Information The online version contains supplementary material available at 10.1186/s12863-022-01039-9.
Collapse
|
23
|
Li Y, Jiang N, Sun Y. AnnoSINE: a short interspersed nuclear elements annotation tool for plant genomes. PLANT PHYSIOLOGY 2022; 188:955-970. [PMID: 34792587 PMCID: PMC8825457 DOI: 10.1093/plphys/kiab524] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 10/01/2021] [Indexed: 06/13/2023]
Abstract
Short interspersed nuclear elements (SINEs) are a widespread type of small transposable element (TE). With increasing evidence for their impact on gene function and genome evolution in plants, accurate genome-scale SINE annotation becomes a fundamental step for studying the regulatory roles of SINEs and their relationship with other components in the genomes. Despite the overall promising progress made in TE annotation, SINE annotation remains a major challenge. Unlike some other TEs, SINEs are short and heterogeneous, and they usually lack well-conserved sequence or structural features. Thus, current SINE annotation tools have either low sensitivity or high false discovery rates. Given the demand and challenges, we aimed to provide a more accurate and efficient SINE annotation tool for plant genomes. The pipeline starts with maximizing the pool of SINE candidates via profile hidden Markov model-based homology search and de novo SINE search using structural features. Then, it excludes the false positives by integrating all known features of SINEs and the features of other types of TEs that can often be misannotated as SINEs. As a result, the pipeline substantially improves the tradeoff between sensitivity and accuracy, with both values close to or over 90%. We tested our tool in Arabidopsis thaliana and rice (Oryza sativa), and the results show that our tool competes favorably against existing SINE annotation tools. The simplicity and effectiveness of this tool would potentially be useful for generating more accurate SINE annotations for other plant species. The pipeline is freely available at https://github.com/yangli557/AnnoSINE.
Collapse
Affiliation(s)
- Yang Li
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR, China
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, Michigan 48824, USA
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR, China
| |
Collapse
|
24
|
Neale DB, Zimin AV, Zaman S, Scott AD, Shrestha B, Workman RE, Puiu D, Allen BJ, Moore ZJ, Sekhwal MK, De La Torre AR, McGuire PE, Burns E, Timp W, Wegrzyn JL, Salzberg SL. Assembled and annotated 26.5 Gbp coast redwood genome: a resource for estimating evolutionary adaptive potential and investigating hexaploid origin. G3 (BETHESDA, MD.) 2022; 12:6460957. [PMID: 35100403 PMCID: PMC8728005 DOI: 10.1093/g3journal/jkab380] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Accepted: 10/25/2021] [Indexed: 12/15/2022]
Abstract
Sequencing, assembly, and annotation of the 26.5 Gbp hexaploid genome of coast redwood (Sequoia sempervirens) was completed leading toward discovery of genes related to climate adaptation and investigation of the origin of the hexaploid genome. Deep-coverage short-read Illumina sequencing data from haploid tissue from a single seed were combined with long-read Oxford Nanopore Technologies sequencing data from diploid needle tissue to create an initial assembly, which was then scaffolded using proximity ligation data to produce a highly contiguous final assembly, SESE 2.1, with a scaffold N50 size of 44.9 Mbp. The assembly included several scaffolds that span entire chromosome arms, confirmed by the presence of telomere and centromere sequences on the ends of the scaffolds. The structural annotation produced 118,906 genes with 113 containing introns that exceed 500 Kbp in length and one reaching 2 Mb. Nearly 19 Gbp of the genome represented repetitive content with the vast majority characterized as long terminal repeats, with a 2.9:1 ratio of Copia to Gypsy elements that may aid in gene expression control. Comparison of coast redwood to other conifers revealed species-specific expansions for a plethora of abiotic and biotic stress response genes, including those involved in fungal disease resistance, detoxification, and physical injury/structural remodeling and others supporting flavonoid biosynthesis. Analysis of multiple genes that exist in triplicate in coast redwood but only once in its diploid relative, giant sequoia, supports a previous hypothesis that the hexaploidy is the result of autopolyploidy rather than any hybridizations with separate but closely related conifer species.
Collapse
Affiliation(s)
- David B Neale
- Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
| | - Aleksey V Zimin
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA.,Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA
| | - Sumaira Zaman
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269, USA.,Department of Computer Science & Engineering, University of Connecticut, Storrs, CT 06269, USA
| | - Alison D Scott
- Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
| | - Bikash Shrestha
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269, USA
| | - Rachael E Workman
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Daniela Puiu
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA.,Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA
| | - Brian J Allen
- Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
| | - Zane J Moore
- Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
| | - Manoj K Sekhwal
- School of Forestry, Northern Arizona University, Flagstaff, AZ 86011, USA
| | | | - Patrick E McGuire
- Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
| | - Emily Burns
- Save the Redwoods League, San Francisco, CA 94104, USA
| | - Winston Timp
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA.,Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA.,Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Jill L Wegrzyn
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT 06269, USA.,Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269, USA
| | - Steven L Salzberg
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA.,Center for Computational Biology, Johns Hopkins University, Baltimore, MD 21211, USA.,Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA.,Department of Biostatistics, Johns Hopkins University, Baltimore, MD 21205, USA
| |
Collapse
|
25
|
Huff M, Seaman J, Wu D, Zhebentyayeva T, Kelly LJ, Faridi N, Nelson CD, Cooper E, Best T, Steiner K, Koch J, Romero Severson J, Carlson JE, Buggs R, Staton M. A high quality reference genome for Fraxinus pennsylvanica for ash species restoration and research. Mol Ecol Resour 2021; 22:1284-1302. [PMID: 34748273 PMCID: PMC9299157 DOI: 10.1111/1755-0998.13545] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 10/27/2021] [Accepted: 10/28/2021] [Indexed: 11/30/2022]
Abstract
Green ash (Fraxinus pennsylvanica) is the most widely distributed ash tree in North America. Once common, it has experienced high mortality from the non‐native invasive emerald ash borer (EAB; Agrilus planipennis). A small percentage of native green ash trees that remain healthy in long‐infested areas, termed “lingering ash,” display partial resistance to the insect, indicating that breeding and propagating populations with higher resistance to EAB may be possible. To assist in ash breeding, ecology and evolution studies, we report the first chromosome‐level assembly from the genus Fraxinus for F. pennsylvanica with over 99% of bases anchored to 23 haploid chromosomes, spanning 757 Mb in total, composed of 49.43% repetitive DNA, and containing 35,470 high‐confidence gene models assigned to 22,976 Asterid orthogroups. We also present results of range‐wide genetic variation studies, the identification of candidate genes for important traits including potential EAB‐resistance genes, and an investigation of comparative genome organization among Asterids based on this reference genome platform. Residual duplicated regions within the genome probably resulting from a recent whole genome duplication event in Oleaceae were visualized in relation to wild olive (Olea europaea var. sylvestris). We used our F. pennsylvanica chromosome assembly to construct reference‐guided assemblies of 27 previously sequenced Fraxinus taxa, including F. excelsior. Thus, we present a significant step forward in genomic resources for research and protection of Fraxinus species.
Collapse
Affiliation(s)
- Matt Huff
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, Knoxville, TN, United States
| | - Josiah Seaman
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK.,Royal Botanic Gardens, Kew, Richmond, Surrey, UK
| | - Di Wu
- Department of Ecosystem Science and Management, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Tetyana Zhebentyayeva
- Department of Ecosystem Science and Management, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Laura J Kelly
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK.,Royal Botanic Gardens, Kew, Richmond, Surrey, UK
| | - Nurul Faridi
- USDA Forest Service, Southern Research Station, Saucier, Mississippi, USA.,Department of Ecosystem Science and Management, Texas A&M University, College Station, Texas, USA
| | - Charles Dana Nelson
- USDA Forest Service, Southern Research Station, Saucier, Mississippi, USA.,Forest Health Research and Education Center, University of Kentucky, Lexington, Kentucky, USA
| | - Endymion Cooper
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
| | - Teodora Best
- Department of Ecosystem Science and Management, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Kim Steiner
- Department of Ecosystem Science and Management, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Jennifer Koch
- United States Department of Agriculture, Forest Service, Northern Research Station, Delaware, Ohio, USA
| | - Jeanne Romero Severson
- Department of Biological Sciences, Notre Dame University, 46556, Notre Dame, Indiana, USA
| | - John E Carlson
- Department of Ecosystem Science and Management, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Richard Buggs
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK.,Royal Botanic Gardens, Kew, Richmond, Surrey, UK
| | - Margaret Staton
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, Knoxville, TN, United States
| |
Collapse
|
26
|
Ramachandran D, Huebner CD, Daly M, Haimovitz J, Swale T, Barrett CF. Chromosome Level Genome Assembly and Annotation of Highly Invasive Japanese Stiltgrass (Microstegium vimineum). Genome Biol Evol 2021; 13:6413638. [PMID: 34718556 PMCID: PMC8598173 DOI: 10.1093/gbe/evab238] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/05/2021] [Indexed: 02/06/2023] Open
Abstract
The invasive Japanese stiltgrass (Microstegium vimineum) affects a wide range of ecosystems and threatens biodiversity across the eastern USA. However, the mechanisms underlying rapid adaptation, plasticity, and epigenetics in the invasive range are largely unknown. We present a chromosome-level assembly for M. vimineum to investigate genome dynamics, evolution, adaptation, and the genomics of phenotypic plasticity. We generated a 1.12-Gb genome with scaffold N50 length of 53.44 Mb respectively, taking a de novo assembly approach that combined PacBio and Dovetail Genomics Omni-C sequencing. The assembly contains 23 pseudochromosomes, representing 99.96% of the genome. BUSCO assessment indicated that 80.3% of Poales gene groups are present in the assembly. The genome is predicted to contain 39,604 protein-coding genes, of which 26,288 are functionally annotated. Furthermore, 66.68% of the genome is repetitive, of which unclassified (35.63%) and long-terminal repeat (LTR) retrotransposons (26.90%) are predominant. Similar to other grasses, Gypsy (41.07%) and Copia (32%) are the most abundant LTR-retrotransposon families. The majority of LTR-retrotransposons are derived from a significant expansion in the past 1-2 Myr, suggesting the presence of relatively young LTR-retrotransposon lineages. We find corroborating evidence from Ks plots for a stiltgrass-specific duplication event, distinct from the more ancient grass-specific duplication event. The assembly and annotation of M. vimineum will serve as an essential genomic resource facilitating studies of the invasion process, the history and consequences of polyploidy in grasses, and provides a crucial tool for natural resource managers.
Collapse
Affiliation(s)
| | - Cynthia D Huebner
- Department of Biology, West Virginia University, USA.,USDA Forest Service, Northern Research Station, Morgantown, West Virginia, USA
| | - Mark Daly
- Dovetail Genomics, LLC, Scotts Valley, California, USA
| | | | - Thomas Swale
- Dovetail Genomics, LLC, Scotts Valley, California, USA
| | | |
Collapse
|
27
|
Minow MAA, Lukens L, Rossi V, Colasanti J. Patterns of stability and change in the maize genome: a case study of small RNA transcriptomes in two recombinant inbred lines and their progenitors. Genome 2021; 65:1-12. [PMID: 34597524 DOI: 10.1139/gen-2021-0040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Small RNAs (sRNAs) are epigenetic regulators of eukaryotic genes and transposable elements (TEs). Diverse sRNA expression patterns exist within a species, but how this diversity arises is not well understood. To provide a window into the dynamics of maize sRNA patterning, sRNA and mRNA transcriptomes were examined in two related Zea mays recombinant inbred lines (RILs) and their inbred parents. Analysis of these RILs revealed that most clusters of sRNA expression retained the parental sRNA expression level. However, expression states that differ from the parental allele were also observed, predominantly reflecting decreases in sRNA expression. When RIL sRNA expression differed from the parental allele, the new state was frequently similar between the two RILs, and similar to the expression state found at the allele in the other parent. Novel sRNA expression patterns, distinct from those of either parent, were rare. Additionally, examination of sRNA expression over TEs revealed one TE family, Gyma, which showed consistent enrichment for RIL sRNA expression differences compared to those found in parental alleles. These findings provide insights into how sRNA silencing might evolve over generations and suggest that further investigation into the molecular nature of sRNA trans regulators is warranted.
Collapse
Affiliation(s)
- Mark A A Minow
- Department of Molecular and Cellular Biology, University of Guelph, Guelph, Ontario, Canada
| | - Lewis Lukens
- Plant Agriculture Department, University of Guelph, Guelph, Ontario, Canada
| | - Vincenzo Rossi
- Council for Agricultural Research and Economics, Research Centre for Cereal and Industrial Crops, I-24126 Bergamo, Italy
| | - Joseph Colasanti
- Department of Molecular and Cellular Biology, University of Guelph, Guelph, Ontario, Canada
| |
Collapse
|
28
|
Stitzer MC, Anderson SN, Springer NM, Ross-Ibarra J. The genomic ecosystem of transposable elements in maize. PLoS Genet 2021; 17:e1009768. [PMID: 34648488 PMCID: PMC8547701 DOI: 10.1371/journal.pgen.1009768] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Revised: 10/26/2021] [Accepted: 08/10/2021] [Indexed: 12/16/2022] Open
Abstract
Transposable elements (TEs) constitute the majority of flowering plant DNA, reflecting their tremendous success in subverting, avoiding, and surviving the defenses of their host genomes to ensure their selfish replication. More than 85% of the sequence of the maize genome can be ascribed to past transposition, providing a major contribution to the structure of the genome. Evidence from individual loci has informed our understanding of how transposition has shaped the genome, and a number of individual TE insertions have been causally linked to dramatic phenotypic changes. Genome-wide analyses in maize and other taxa have frequently represented TEs as a relatively homogeneous class of fragmentary relics of past transposition, obscuring their evolutionary history and interaction with their host genome. Using an updated annotation of structurally intact TEs in the maize reference genome, we investigate the family-level dynamics of TEs in maize. Integrating a variety of data, from descriptors of individual TEs like coding capacity, expression, and methylation, as well as similar features of the sequence they inserted into, we model the relationship between attributes of the genomic environment and the survival of TE copies and families. In contrast to the wholesale relegation of all TEs to a single category of junk DNA, these differences reveal a diversity of survival strategies of TE families. Together these generate a rich ecology of the genome, with each TE family representing the evolution of a distinct ecological niche. We conclude that while the impact of transposition is highly family- and context-dependent, a family-level understanding of the ecology of TEs in the genome can refine our ability to predict the role of TEs in generating genetic and phenotypic diversity.
Collapse
Affiliation(s)
- Michelle C. Stitzer
- Center for Population Biology and Department of Evolution and Ecology, University of California, Davis, California, United States of America
| | - Sarah N. Anderson
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Nathan M. Springer
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Jeffrey Ross-Ibarra
- Center for Population Biology and Department of Evolution and Ecology, University of California, Davis, California, United States of America
- Genome Center, University of California, Davis, California, United States of America
| |
Collapse
|
29
|
Long J, Liu J, Xia A, Springer NM, He Y. Maize decrease in DNA methylation 1 targets RNA-directed DNA methylation on active chromatin. THE PLANT CELL 2021; 33:2183-2196. [PMID: 33779761 PMCID: PMC8364229 DOI: 10.1093/plcell/koab098] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 03/25/2021] [Indexed: 06/01/2023]
Abstract
DNA methylation plays vital roles in repressing transposable element activity and regulating gene expression. The chromatin-remodeling factor Decrease in DNA methylation 1 (DDM1) is crucial for maintaining DNA methylation across diverse plant species, and is required for RNA-directed DNA methylation (RdDM) to maintain mCHH islands in maize (Zea mays). However, the mechanisms by which DDM1 is involved in RdDM are not well understood. In this work, we used chromatin immunoprecipitation coupled with high-throughput sequencing to ascertain the genome-wide occupancy of ZmDDM1 in the maize genome. The results revealed that ZmDDM1 recognized an 8-bp-long GC-rich degenerate DNA sequence motif, which is enriched in transcription start sites and other euchromatic regions. Meanwhile, 24-nucleotide siRNAs and CHH methylation were delineated at the edge of ZmDDM1-occupied sites. ZmDDM1 co-purified with Argonaute 4 (ZmAGO4) proteins, providing further evidence that ZmDDM1 is a component of RdDM complexes in planta. Consistent with this, the vast majority of ZmDDM1-targeted regions co-localized with ZmAGO4-bound genomic sites. Overall, our results suggest a model that ZmDDM1 may be recruited to euchromatic regions via recognition of a GC-rich motif, thereby remodeling chromatin to provide access for RdDM activities in maize.
Collapse
Affiliation(s)
- Jincheng Long
- MOE Key Laboratory of Crop Heterosis and Utilization, National Maize Improvement Center of China, China Agricultural University, Beijing 100094, China
| | - Jinghan Liu
- MOE Key Laboratory of Crop Heterosis and Utilization, National Maize Improvement Center of China, China Agricultural University, Beijing 100094, China
| | - Aiai Xia
- MOE Key Laboratory of Crop Heterosis and Utilization, National Maize Improvement Center of China, China Agricultural University, Beijing 100094, China
| | - Nathan M. Springer
- Department of Plant and Microbial Biology, University of Minnesota, St. Paul, MN 55108, USA
| | - Yan He
- MOE Key Laboratory of Crop Heterosis and Utilization, National Maize Improvement Center of China, China Agricultural University, Beijing 100094, China
| |
Collapse
|
30
|
Noshay JM, Marand AP, Anderson SN, Zhou P, Mejia Guerra MK, Lu Z, O'Connor CH, Crisp PA, Hirsch CN, Schmitz RJ, Springer NM. Assessing the regulatory potential of transposable elements using chromatin accessibility profiles of maize transposons. Genetics 2021; 217:1-13. [PMID: 33683350 DOI: 10.1093/genetics/iyaa003] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Accepted: 11/02/2020] [Indexed: 11/14/2022] Open
Abstract
Transposable elements (TEs) have the potential to create regulatory variation both through the disruption of existing DNA regulatory elements and through the creation of novel DNA regulatory elements. In a species with a large genome, such as maize, many TEs interspersed with genes create opportunities for significant allelic variation due to TE presence/absence polymorphisms among individuals. We used information on putative regulatory elements in combination with knowledge about TE polymorphisms in maize to identify TE insertions that interrupt existing accessible chromatin regions (ACRs) in B73 as well as examples of polymorphic TEs that contain ACRs among four inbred lines of maize including B73, Mo17, W22, and PH207. The TE insertions in three other assembled maize genomes (Mo17, W22, or PH207) that interrupt ACRs that are present in the B73 genome can trigger changes to the chromatin, suggesting the potential for both genetic and epigenetic influences of these insertions. Nearly 20% of the ACRs located over 2 kb from the nearest gene are located within an annotated TE. These are regions of unmethylated DNA that show evidence for functional importance similar to ACRs that are not present within TEs. Using a large panel of maize genotypes, we tested if there is an association between the presence of TE insertions that interrupt, or carry, an ACR and the expression of nearby genes. While most TE polymorphisms are not associated with expression for nearby genes, the TEs that carry ACRs exhibit enrichment for being associated with higher expression of nearby genes, suggesting that these TEs may contribute novel regulatory elements. These analyses highlight the potential for a subset of TEs to rewire transcriptional responses in eukaryotic genomes.
Collapse
Affiliation(s)
- Jaclyn M Noshay
- Department of Plant and Microbial Biology, University of Minnesota, 140 Gortner Laboratory, 1479 Gortner Avenue, St. Paul, MN 55108, USA
| | - Alexandre P Marand
- Department of Genetics, University of Georgia, 120 W Green St, Athens, GA 30602, USA
| | - Sarah N Anderson
- Department of Genetics, Development, and Cell Biology, Iowa State University, 2437 Pammel Dr, Ames, IA 50011, USA
| | - Peng Zhou
- Department of Plant and Microbial Biology, University of Minnesota, 140 Gortner Laboratory, 1479 Gortner Avenue, St. Paul, MN 55108, USA
| | | | - Zefu Lu
- Department of Genetics, University of Georgia, 120 W Green St, Athens, GA 30602, USA
| | - Christine H O'Connor
- Department of Agronomy and Plant Genetics, University of Minnesota, 1994 Upper Buford Circle, 411 Borlaug Hall, St. Paul, MN 55108, USA
| | - Peter A Crisp
- School of Agriculture and Food Sciences, The University of Queensland, Harley Teakle Building, Keyhold Rd, St Lucia QLD 4067, Australia
| | - Candice N Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, 1994 Upper Buford Circle, 411 Borlaug Hall, St. Paul, MN 55108, USA
| | - Robert J Schmitz
- Department of Genetics, University of Georgia, 120 W Green St, Athens, GA 30602, USA
| | - Nathan M Springer
- Department of Plant and Microbial Biology, University of Minnesota, 140 Gortner Laboratory, 1479 Gortner Avenue, St. Paul, MN 55108, USA
| |
Collapse
|
31
|
Noshay JM, Liang Z, Zhou P, Crisp PA, Marand AP, Hirsch CN, Schmitz RJ, Springer NM. Stability of DNA methylation and chromatin accessibility in structurally diverse maize genomes. G3 (BETHESDA, MD.) 2021; 11:6288454. [PMID: 34849810 PMCID: PMC8496265 DOI: 10.1093/g3journal/jkab190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Accepted: 05/27/2021] [Indexed: 06/13/2023]
Abstract
Accessible chromatin and unmethylated DNA are associated with many genes and cis-regulatory elements. Attempts to understand natural variation for accessible chromatin regions (ACRs) and unmethylated regions (UMRs) often rely upon alignments to a single reference genome. This limits the ability to assess regions that are absent in the reference genome assembly and monitor how nearby structural variants influence variation in chromatin state. In this study, de novo genome assemblies for four maize inbreds (B73, Mo17, Oh43, and W22) are utilized to assess chromatin accessibility and DNA methylation patterns in a pan-genome context. A more complete set of UMRs and ACRs can be identified when chromatin data are aligned to the matched genome rather than a single reference genome. While there are UMRs and ACRs present within genomic regions that are not shared between genotypes, these features are 6- to 12-fold enriched within regions between genomes. Characterization of UMRs present within shared genomic regions reveals that most UMRs maintain the unmethylated state in other genotypes with only ∼5% being polymorphic between genotypes. However, the majority (71%) of UMRs that are shared between genotypes only exhibit partial overlaps suggesting that the boundaries between methylated and unmethylated DNA are dynamic. This instability is not solely due to sequence variation as these partially overlapping UMRs are frequently found within genomic regions that lack sequence variation. The ability to compare chromatin properties among individuals with structural variation enables pan-epigenome analyses to study the sources of variation for accessible chromatin and unmethylated DNA.
Collapse
Affiliation(s)
- Jaclyn M Noshay
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108, USA
| | - Zhikai Liang
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108, USA
| | - Peng Zhou
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108, USA
| | - Peter A Crisp
- School of Agriculture and Food Sciences, University of Queensland, St Lucia, QLD 4072, Australia
| | | | - Candice N Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, MN 55108, USA
| | - Robert J Schmitz
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | - Nathan M Springer
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN 55108, USA
| |
Collapse
|
32
|
Hufford MB, Seetharam AS, Woodhouse MR, Chougule KM, Ou S, Liu J, Ricci WA, Guo T, Olson A, Qiu Y, Della Coletta R, Tittes S, Hudson AI, Marand AP, Wei S, Lu Z, Wang B, Tello-Ruiz MK, Piri RD, Wang N, Kim DW, Zeng Y, O'Connor CH, Li X, Gilbert AM, Baggs E, Krasileva KV, Portwood JL, Cannon EKS, Andorf CM, Manchanda N, Snodgrass SJ, Hufnagel DE, Jiang Q, Pedersen S, Syring ML, Kudrna DA, Llaca V, Fengler K, Schmitz RJ, Ross-Ibarra J, Yu J, Gent JI, Hirsch CN, Ware D, Dawe RK. De novo assembly, annotation, and comparative analysis of 26 diverse maize genomes. Science 2021; 373:655-662. [PMID: 34353948 PMCID: PMC8733867 DOI: 10.1126/science.abg5289] [Citation(s) in RCA: 226] [Impact Index Per Article: 75.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Accepted: 06/24/2021] [Indexed: 12/24/2022]
Abstract
We report de novo genome assemblies, transcriptomes, annotations, and methylomes for the 26 inbreds that serve as the founders for the maize nested association mapping population. The number of pan-genes in these diverse genomes exceeds 103,000, with approximately a third found across all genotypes. The results demonstrate that the ancient tetraploid character of maize continues to degrade by fractionation to the present day. Excellent contiguity over repeat arrays and complete annotation of centromeres revealed additional variation in major cytological landmarks. We show that combining structural variation with single-nucleotide polymorphisms can improve the power of quantitative mapping studies. We also document variation at the level of DNA methylation and demonstrate that unmethylated regions are enriched for cis-regulatory elements that contribute to phenotypic variation.
Collapse
Affiliation(s)
- Matthew B Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
| | - Arun S Seetharam
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
- Genome Informatics Facility, Iowa State University, Ames, IA 50011, USA
| | - Margaret R Woodhouse
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA 50011, USA
| | | | - Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
| | - Jianing Liu
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | - William A Ricci
- Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
| | - Tingting Guo
- Department of Agronomy, Iowa State University, Ames, IA 50011, USA
| | - Andrew Olson
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Yinjie Qiu
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
| | - Rafael Della Coletta
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
| | - Silas Tittes
- Center for Population Biology, University of California, Davis, CA 95616, USA
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
| | - Asher I Hudson
- Center for Population Biology, University of California, Davis, CA 95616, USA
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
| | | | - Sharon Wei
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Zhenyuan Lu
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Bo Wang
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | | | - Rebecca D Piri
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Na Wang
- Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
| | - Dong Won Kim
- Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
| | - Yibing Zeng
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | - Christine H O'Connor
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
- Department of Ecology, Evolution, and Behavior, University of Minnesota, St. Paul, MN 55108, USA
| | - Xianran Li
- Department of Agronomy, Iowa State University, Ames, IA 50011, USA
| | - Amanda M Gilbert
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
| | - Erin Baggs
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA
| | - Ksenia V Krasileva
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA
| | - John L Portwood
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA 50011, USA
| | - Ethalinda K S Cannon
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA 50011, USA
| | - Carson M Andorf
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA 50011, USA
| | - Nancy Manchanda
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
| | - Samantha J Snodgrass
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
| | - David E Hufnagel
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
- Virus and Prion Research Unit, National Animal Disease Center, USDA-ARS, Ames, IA, 50010, USA
| | - Qiuhan Jiang
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
| | - Sarah Pedersen
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
| | - Michael L Syring
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
| | - David A Kudrna
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
| | | | | | - Robert J Schmitz
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | - Jeffrey Ross-Ibarra
- Center for Population Biology, University of California, Davis, CA 95616, USA
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
- Genome Center, University of California, Davis, CA 95616, USA
| | - Jianming Yu
- Department of Agronomy, Iowa State University, Ames, IA 50011, USA
| | - Jonathan I Gent
- Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
| | - Candice N Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
| | - Doreen Ware
- USDA-ARS NAA Robert W. Holley Center for Agriculture and Health, Agricultural Research Service, Ithaca, NY 14853, USA
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - R Kelly Dawe
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA.
| |
Collapse
|
33
|
Stritt C, Thieme M, Roulin AC. Rare transposable elements challenge the prevailing view of transposition dynamics in plants. AMERICAN JOURNAL OF BOTANY 2021; 108:1310-1314. [PMID: 34415576 PMCID: PMC9290919 DOI: 10.1002/ajb2.1709] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 05/10/2021] [Accepted: 05/12/2021] [Indexed: 06/01/2023]
Affiliation(s)
- Christoph Stritt
- Institute for Plant and Microbial BiologyUniversity of ZurichZollikerstrasse, 107, 8008ZurichSwitzerland
| | - Michael Thieme
- Institute for Plant and Microbial BiologyUniversity of ZurichZollikerstrasse, 107, 8008ZurichSwitzerland
| | - Anne C. Roulin
- Institute for Plant and Microbial BiologyUniversity of ZurichZollikerstrasse, 107, 8008ZurichSwitzerland
| |
Collapse
|
34
|
Tan S, Ma H, Wang J, Wang M, Wang M, Yin H, Zhang Y, Zhang X, Shen J, Wang D, Banes GL, Zhang Z, Wu J, Huang X, Chen H, Ge S, Chen CL, Zhang YE. DNA transposons mediate duplications via transposition-independent and -dependent mechanisms in metazoans. Nat Commun 2021; 12:4280. [PMID: 34257290 PMCID: PMC8277862 DOI: 10.1038/s41467-021-24585-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2020] [Accepted: 06/23/2021] [Indexed: 01/06/2023] Open
Abstract
Despite long being considered as "junk", transposable elements (TEs) are now accepted as catalysts of evolution. One example is Mutator-like elements (MULEs, one type of terminal inverted repeat DNA TEs, or TIR TEs) capturing sequences as Pack-MULEs in plants. However, their origination mechanism remains perplexing, and whether TIR TEs mediate duplication in animals is almost unexplored. Here we identify 370 Pack-TIRs in 100 animal reference genomes and one Pack-TIR (Ssk-FB4) family in fly populations. We find that single-copy Pack-TIRs are mostly generated via transposition-independent gap filling, and multicopy Pack-TIRs are likely generated by transposition after replication fork switching. We show that a proportion of Pack-TIRs are transcribed and often form chimeras with hosts. We also find that Ssk-FB4s represent a young protein family, as supported by proteomics and signatures of positive selection. Thus, TIR TEs catalyze new gene structures and new genes in animals via both transposition-independent and -dependent mechanisms.
Collapse
Affiliation(s)
- Shengjun Tan
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Huijing Ma
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jinbo Wang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Man Wang
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Center for Cancer Bioinformatics, Peking University Cancer Hospital & Institute, Beijing, China
| | - Mengxia Wang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Haodong Yin
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Yaqiong Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Xinying Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jieyu Shen
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Danyang Wang
- University of Chinese Academy of Sciences, Beijing, China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, and China National Center for Bioinformation, Chinese Academy of Sciences, Beijing, China
| | - Graham L Banes
- Wisconsin National Primate Research Center, University of Wisconsin-Madison, Madison, WI, USA
- CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, Shanghai, China
| | - Zhihua Zhang
- University of Chinese Academy of Sciences, Beijing, China
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, and China National Center for Bioinformation, Chinese Academy of Sciences, Beijing, China
| | - Jianmin Wu
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Center for Cancer Bioinformatics, Peking University Cancer Hospital & Institute, Beijing, China
| | - Xun Huang
- University of Chinese Academy of Sciences, Beijing, China
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Hua Chen
- University of Chinese Academy of Sciences, Beijing, China
- CAS Key Laboratory of Genomics and Precision Medicine, Beijing Institute of Genomics, and China National Center for Bioinformation, Chinese Academy of Sciences, Beijing, China
- CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| | - Siqin Ge
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Chun-Long Chen
- Curie Institute, PSL Research University, CNRS UMR 3244, Paris, France.
- Sorbonne University, Paris, France.
| | - Yong E Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
- CAS Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China.
- Chinese Institute for Brain Research, Beijing, China.
| |
Collapse
|
35
|
Li Z, McKibben MTW, Finch GS, Blischak PD, Sutherland BL, Barker MS. Patterns and Processes of Diploidization in Land Plants. ANNUAL REVIEW OF PLANT BIOLOGY 2021; 72:387-410. [PMID: 33684297 DOI: 10.1146/annurev-arplant-050718-100344] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Most land plants are now known to be ancient polyploids that have rediploidized. Diploidization involves many changes in genome organization that ultimately restore bivalent chromosome pairing and disomic inheritance, and resolve dosage and other issues caused by genome duplication. In this review, we discuss the nature of polyploidy and its impact on chromosome pairing behavior. We also provide an overview of two major and largely independent processes of diploidization: cytological diploidization and genic diploidization/fractionation. Finally, we compare variation in gene fractionation across land plants and highlight the differences in diploidization between plants and animals. Altogether, we demonstrate recent advancements in our understanding of variation in the patterns and processes of diploidization in land plants and provide a road map for future research to unlock the mysteries of diploidization and eukaryotic genome evolution.
Collapse
Affiliation(s)
- Zheng Li
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA; , , , , ,
| | - Michael T W McKibben
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA; , , , , ,
| | - Geoffrey S Finch
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA; , , , , ,
| | - Paul D Blischak
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA; , , , , ,
| | - Brittany L Sutherland
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA; , , , , ,
| | - Michael S Barker
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA; , , , , ,
| |
Collapse
|
36
|
Yang Y, Huang L, Xu C, Qi L, Wu Z, Li J, Chen H, Wu Y, Fu T, Zhu H, Saand MA, Li J, Liu L, Fan H, Zhou H, Qin W. Chromosome-scale genome assembly of areca palm (Areca catechu). Mol Ecol Resour 2021; 21:2504-2519. [PMID: 34133844 DOI: 10.1111/1755-0998.13446] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 06/08/2021] [Accepted: 06/11/2021] [Indexed: 11/28/2022]
Abstract
Areca palm (Areca catechu L.; family Arecaceae) is an important tropical medicinal crop and is also used for masticatory and religious purposes in Asia. Improvements to areca properties made by traditional breeding tools have been very slow, and further advances in its cultivation and practical use require genomic information, which is still unavailable. Here, we present a chromosome-scale reference genome assembly for areca by combining Illumina and PacBio data with Hi-C mapping technologies, covering the predicted A. catechu genome length (2.59 Gb, variety "Reyan#1") to an estimated 240× read depth. The assembly was 2.51 Gb in length with a scaffold N50 of 1.7Mb. The scaffolds were then further assembled into 16 pseudochromosomes, with an N50 of 172 Mb. Transposable elements comprised 80.37% of the areca genome, and 68.68% of them were long-terminal repeat retrotransposon elements. The areca palm genome was predicted to harbour 31,571 protein-coding genes and overall, 92.92% of genes were functionally annotated, including enriched and expanded families of genes responsible for biosynthesis of flavonoid, anthocyanin, monoterpenoid and their derivatives. Comparative analyses indicated that A. catechu probably diverged from its close relatives Elaeis guineensis and Cocos nucifera approximately 50.3 million years ago (Ma). Two whole genome duplication events in areca palm were found to be shared by palms and monocots, respectively. This genome assembly and associated resources represents an important addition to the palm genomics community and will be a valuable resource that will facilitate areca palm breeding and improve our understanding of areca palm biology and evolution.
Collapse
Affiliation(s)
- Yaodong Yang
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | - Liyun Huang
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | - Chunyan Xu
- BGI Genomics, BGI-Shenzhen, Shenzhen, China
| | - Lan Qi
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | | | - Jia Li
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | | | - Yi Wu
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | - Tao Fu
- BGI Genomics, BGI-Shenzhen, Shenzhen, China
| | - Hui Zhu
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | - Mumtaz Ali Saand
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | - Jing Li
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | - Liyun Liu
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | - Haikou Fan
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | - Huanqi Zhou
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | - Weiquan Qin
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| |
Collapse
|
37
|
Yañez-Santos AM, Paz RC, Paz-Sepúlveda PB, Urdampilleta JD. Full-length LTR retroelements in Capsicum annuum revealed a few species-specific family bursts with insertional preferences. Chromosome Res 2021; 29:261-284. [PMID: 34086192 DOI: 10.1007/s10577-021-09663-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Revised: 05/04/2021] [Accepted: 05/06/2021] [Indexed: 01/01/2023]
Abstract
Capsicum annuum is a species that has undergone an expansion of the size of its genome caused mainly by the amplification of repetitive DNA sequences, including mobile genetic elements. Based on information obtained from sequencing the genome of pepper, the estimated fraction of retroelements is approximately 81%, and previous results revealed an important contribution of lineages derived from Gypsy superfamily. However, the dynamics of the retroelements in the C. annuum genome is poorly understood. In this way, the present work seeks to investigate the phylogenetic diversity and genomic abundance of the families of autonomous (complete and intact) LTR retroelements from C. annuum and inspect their distribution along its chromosomes. In total, we identified 1151 structurally full-length retroelements (340 Copia; 811 Gypsy) grouped in 124 phylogenetic families in the base of their retrotranscriptase. All the evolutive lineages of LTR retroelements identified in plants were present in pepper; however, three of them comprise 83% of the entire LTR retroelements population, the lineages Athila, Del/Tekay, and Ale/Retrofit. From them, only three families represent 70.8% of the total number of the identified retroelements. A massive family-specific wave of amplification of two of them occurred in the last 0.5 Mya (GypsyCa_16; CopiaCa_01), whereas the third is more ancient and occurred 3.0 Mya (GypsyCa_13). Fluorescent in situ hybridization performed with family and lineage-specific probes revealed contrasting patterns of chromosomal affinity. Our results provide a database of the populations LTR retroelements specific to C. annuum genome. The most abundant families were analyzed according to chromosome insertional preferences, suppling useful tools to the design of retroelement-based markers specific to the species.
Collapse
Affiliation(s)
- Anahí Mara Yañez-Santos
- CIGEOBIO (FCEFyN, UNSJ/CONICET), Av. Ignacio de la Roza 590 (Oeste), J5402DCS, Rivadavia, San Juan, Argentina.,Instituto Multidisciplinario de Biología Vegetal (IMBIV), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)-Universidad Nacional de Córdoba (UNC), Córdoba, Argentina
| | - Rosalía Cristina Paz
- CIGEOBIO (FCEFyN, UNSJ/CONICET), Av. Ignacio de la Roza 590 (Oeste), J5402DCS, Rivadavia, San Juan, Argentina.
| | - Paula Beatriz Paz-Sepúlveda
- Instituto Multidisciplinario de Biología Celular (IMBICE), Consejo Nacional de Investigaciones Científicas y Técnicas de la República Argentina (CONICET) - Comisión de Investigaciones Científicas (CIC) - Universidad Nacional de La Plata (UNLP), La Plata, Argentina
| | - Juan Domingo Urdampilleta
- Instituto Multidisciplinario de Biología Vegetal (IMBIV), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)-Universidad Nacional de Córdoba (UNC), Córdoba, Argentina
| |
Collapse
|
38
|
Genome assembly and population genomic analysis provide insights into the evolution of modern sweet corn. Nat Commun 2021; 12:1227. [PMID: 33623026 PMCID: PMC7902669 DOI: 10.1038/s41467-021-21380-4] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Accepted: 01/26/2021] [Indexed: 01/31/2023] Open
Abstract
Sweet corn is one of the most important vegetables in the United States and Canada. Here, we present a de novo assembly of a sweet corn inbred line Ia453 with the mutated shrunken2-reference allele (Ia453-sh2). This mutation accumulates more sugar and is present in most commercial hybrids developed for the processing and fresh markets. The ten pseudochromosomes cover 92% of the total assembly and 99% of the estimated genome size, with a scaffold N50 of 222.2 Mb. This reference genome completely assembles the large structural variation that created the mutant sh2-R allele. Furthermore, comparative genomics analysis with six field corn genomes highlights differences in single-nucleotide polymorphisms, structural variations, and transposon composition. Phylogenetic analysis of 5,381 diverse maize and teosinte accessions reveals genetic relationships between sweet corn and other types of maize. Our results show evidence for a common origin in northern Mexico for modern sweet corn in the U.S. Finally, population genomic analysis identifies regions of the genome under selection and candidate genes associated with sweet corn traits, such as early flowering, endosperm composition, plant and tassel architecture, and kernel row number. Our study provides a high-quality reference-genome sequence to facilitate comparative genomics, functional studies, and genomic-assisted breeding for sweet corn.
Collapse
|
39
|
Yang Y, Bocs S, Fan H, Armero A, Baudouin L, Xu P, Xu J, This D, Hamelin C, Iqbal A, Qadri R, Zhou L, Li J, Wu Y, Ma Z, Issali AE, Rivallan R, Liu N, Xia W, Peng M, Xiao Y. Coconut genome assembly enables evolutionary analysis of palms and highlights signaling pathways involved in salt tolerance. Commun Biol 2021; 4:105. [PMID: 33483627 PMCID: PMC7822834 DOI: 10.1038/s42003-020-01593-x] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Accepted: 12/09/2020] [Indexed: 01/30/2023] Open
Abstract
Coconut (Cocos nucifera) is the emblematic palm of tropical coastal areas all around the globe. It provides vital resources to millions of farmers. In an effort to better understand its evolutionary history and to develop genomic tools for its improvement, a sequence draft was recently released. Here, we present a dense linkage map (8402 SNPs) aiming to assemble the large genome of coconut (2.42 Gbp, 2n = 32) into 16 pseudomolecules. As a result, 47% of the sequences (representing 77% of the genes) were assigned to 16 linkage groups and ordered. We observed segregation distortion in chromosome Cn15, which is a signature of strong selection among pollen grains, favouring the maternal allele. Comparing our results with the genome of the oil palm Elaeis guineensis allowed us to identify major events in the evolutionary history of palms. We find that coconut underwent a massive transposable element invasion in the last million years, which could be related to the fluctuations of sea level during the glaciations at Pleistocene that would have triggered a population bottleneck. Finally, to better understand the facultative halophyte trait of coconut, we conducted an RNA-seq experiment on leaves to identify key players of signaling pathways involved in salt stress response. Altogether, our findings represent a valuable resource for the coconut breeding community.
Collapse
Affiliation(s)
- Yaodong Yang
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, 571339, Wenchang, Hainan, P. R. China
| | - Stéphanie Bocs
- CIRAD, UMR AGAP, F-34398, Montpellier, France
- AGAP, Univ. Montpellier, CIRAD, INRAE, Institut Agro, F-34398, Montpellier, France
- South Green Bioinformatics Platform, Bioversity, CIRAD, INRAE, IRD, F-34398, Montpellier, France
| | - Haikuo Fan
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, 571339, Wenchang, Hainan, P. R. China
| | - Alix Armero
- AGAP, Univ. Montpellier, CIRAD, INRAE, Institut Agro, F-34398, Montpellier, France
| | - Luc Baudouin
- CIRAD, UMR AGAP, F-34398, Montpellier, France.
- AGAP, Univ. Montpellier, CIRAD, INRAE, Institut Agro, F-34398, Montpellier, France.
| | - Pengwei Xu
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, P. R. China
| | - Junyang Xu
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, P. R. China
| | - Dominique This
- AGAP, Univ. Montpellier, CIRAD, INRAE, Institut Agro, F-34398, Montpellier, France
| | - Chantal Hamelin
- CIRAD, UMR AGAP, F-34398, Montpellier, France
- AGAP, Univ. Montpellier, CIRAD, INRAE, Institut Agro, F-34398, Montpellier, France
- South Green Bioinformatics Platform, Bioversity, CIRAD, INRAE, IRD, F-34398, Montpellier, France
| | - Amjad Iqbal
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, 571339, Wenchang, Hainan, P. R. China
| | - Rashad Qadri
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, 571339, Wenchang, Hainan, P. R. China
| | - Lixia Zhou
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, 571339, Wenchang, Hainan, P. R. China
| | - Jing Li
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, 571339, Wenchang, Hainan, P. R. China
| | - Yi Wu
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, 571339, Wenchang, Hainan, P. R. China
| | - Zilong Ma
- Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Science, 571101, Haikou, Hainan, P. R. China
| | - Auguste Emmanuel Issali
- Station Cocotier Marc Delorme, Centre National De Recherche Agronomique (CNRA)07 B.P. 13, Port Bouet, Côte d'Ivoire
| | - Ronan Rivallan
- CIRAD, UMR AGAP, F-34398, Montpellier, France
- AGAP, Univ. Montpellier, CIRAD, INRAE, Institut Agro, F-34398, Montpellier, France
| | - Na Liu
- BGI Genomics, BGI-Shenzhen, Shenzhen, 518083, P. R. China
| | - Wei Xia
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, 571339, Wenchang, Hainan, P. R. China.
| | - Ming Peng
- Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Science, 571101, Haikou, Hainan, P. R. China.
| | - Yong Xiao
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, 571339, Wenchang, Hainan, P. R. China.
| |
Collapse
|
40
|
Hu Y, Wu X, Jin G, Peng J, Leng R, Li L, Gui D, Fan C, Zhang C. Rapid Genome Evolution and Adaptation of Thlaspi arvense Mediated by Recurrent RNA-Based and Tandem Gene Duplications. FRONTIERS IN PLANT SCIENCE 2021; 12:772655. [PMID: 35058947 PMCID: PMC8764390 DOI: 10.3389/fpls.2021.772655] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Accepted: 11/09/2021] [Indexed: 05/21/2023]
Abstract
Retrotransposons are the most abundant group of transposable elements (TEs) in plants, providing an extraordinarily versatile source of genetic variation. Thlaspi arvense, a close relative of the model plant Arabidopsis thaliana with worldwide distribution, thrives from sea level to above 4,000 m elevation in the Qinghai-Tibet Plateau (QTP), China. Its strong adaptability renders it an ideal model system for studying plant adaptation in extreme environments. However, how the retrotransposons affect the T. arvense genome evolution and adaptation is largely unknown. We report a high-quality chromosome-scale genome assembly of T. arvense with a scaffold N50 of 59.10 Mb. Long terminal repeat retrotransposons (LTR-RTs) account for 56.94% of the genome assembly, and the Gypsy superfamily is the most abundant TEs. The amplification of LTR-RTs in the last six million years primarily contributed to the genome size expansion in T. arvense. We identified 351 retrogenes and 303 genes flanked by LTRs, respectively. A comparative analysis showed that orthogroups containing those retrogenes and genes flanked by LTRs have a higher percentage of significantly expanded orthogroups (SEOs), and these SEOs possess more recent tandem duplicated genes. All present results indicate that RNA-based gene duplication (retroduplication) accelerated the subsequent tandem duplication of homologous genes resulting in family expansions, and these expanded gene families were implicated in plant growth, development, and stress responses, which were one of the pivotal factors for T. arvense's adaptation to the harsh environment in the QTP regions. In conclusion, the high-quality assembly of the T. arvense genome provides insights into the retroduplication mediated mechanism of plant adaptation to extreme environments.
Collapse
Affiliation(s)
- Yanting Hu
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Xiaopei Wu
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Guihua Jin
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Junchu Peng
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
- Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - Rong Leng
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Ling Li
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Daping Gui
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - Chuanzhu Fan
- Department of Biological Sciences, Wayne State University, Detroit, MI, United States
- Chuanzhu Fan,
| | - Chengjun Zhang
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
- Haiyan Engineering & Technology Center, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
- *Correspondence: Chengjun Zhang,
| |
Collapse
|
41
|
Fambrini M, Usai G, Vangelisti A, Mascagni F, Pugliesi C. The plastic genome: The impact of transposable elements on gene functionality and genomic structural variations. Genesis 2020; 58:e23399. [DOI: 10.1002/dvg.23399] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Revised: 11/07/2020] [Accepted: 11/10/2020] [Indexed: 12/15/2022]
Affiliation(s)
- Marco Fambrini
- Department of Agriculture, Food and Environment (DAFE) University of Pisa Pisa Italy
| | - Gabriele Usai
- Department of Agriculture, Food and Environment (DAFE) University of Pisa Pisa Italy
| | - Alberto Vangelisti
- Department of Agriculture, Food and Environment (DAFE) University of Pisa Pisa Italy
| | - Flavia Mascagni
- Department of Agriculture, Food and Environment (DAFE) University of Pisa Pisa Italy
| | - Claudio Pugliesi
- Department of Agriculture, Food and Environment (DAFE) University of Pisa Pisa Italy
| |
Collapse
|
42
|
Cohen ZP, Brevik K, Chen YH, Hawthorne DJ, Weibel BD, Schoville SD. Elevated rates of positive selection drive the evolution of pestiferousness in the Colorado potato beetle (Leptinotarsa decemlineata, Say). Mol Ecol 2020; 30:237-254. [PMID: 33095936 DOI: 10.1111/mec.15703] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2019] [Revised: 09/28/2020] [Accepted: 10/15/2020] [Indexed: 12/16/2022]
Abstract
Contextualizing evolutionary history and identifying genomic features of an insect that might contribute to its pest status is important in developing early detection and control tactics. In order to understand the evolution of pestiferousness, which we define as the accumulation of traits that contribute to an insect population's success in an agroecosystem, we tested the importance of known genomic properties associated with rapid adaptation in the Colorado potato beetle (CPB), Leptinotarsa decemlineata Say. Within the leaf beetle genus Leptinotarsa, only CPB, and a few populations therein, has risen to pest status on cultivated nightshades, Solanum. Using whole genomes from ten closely related Leptinotarsa species native to the United States, we reconstructed a high-quality species tree and used this phylogenetic framework to assess evolutionary patterns in four genomic features of rapid adaptation: standing genetic variation, gene family expansion and contraction, transposable element abundance and location, and positive selection at protein-coding genes. Throughout approximately 20 million years of history, Leptinotarsa species show little evidence of gene family turnover and transposable element variation. However, there is a clear pattern of CPB experiencing higher rates of positive selection on protein-coding genes. We determine that these rates are associated with greater standing genetic variation due to larger effective population size, which supports the theory that the demographic history contributes to rates of protein evolution. Furthermore, we identify a suite of coding genes under positive selection that are putatively associated with pestiferousness in the Colorado potato beetle lineage. They are involved in the biological processes of xenobiotic detoxification, chemosensation and hormone function.
Collapse
Affiliation(s)
- Zachary P Cohen
- Department of Entomology, University of Wisconsin-Madison, Madison, WI, USA
| | - Kristian Brevik
- Department of Plant and Soil Sciences, University of Vermont, Burlington, VT, USA
| | - Yolanda H Chen
- Department of Plant and Soil Sciences, University of Vermont, Burlington, VT, USA
| | - David J Hawthorne
- Department of Entomology, University of Maryland, College Park, MD, USA
| | - Benjamin D Weibel
- Department of Entomology, University of Wisconsin-Madison, Madison, WI, USA
| | - Sean D Schoville
- Department of Entomology, University of Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
43
|
Wear EE, Song J, Zynda GJ, Mickelson-Young L, LeBlanc C, Lee TJ, Deppong DO, Allen GC, Martienssen RA, Vaughn MW, Hanley-Bowdoin L, Thompson WF. Comparing DNA replication programs reveals large timing shifts at centromeres of endocycling cells in maize roots. PLoS Genet 2020; 16:e1008623. [PMID: 33052904 PMCID: PMC7588055 DOI: 10.1371/journal.pgen.1008623] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2020] [Revised: 10/26/2020] [Accepted: 08/28/2020] [Indexed: 12/20/2022] Open
Abstract
Plant cells undergo two types of cell cycles–the mitotic cycle in which DNA replication is coupled to mitosis, and the endocycle in which DNA replication occurs in the absence of cell division. To investigate DNA replication programs in these two types of cell cycles, we pulse labeled intact root tips of maize (Zea mays) with 5-ethynyl-2’-deoxyuridine (EdU) and used flow sorting of nuclei to examine DNA replication timing (RT) during the transition from a mitotic cycle to an endocycle. Comparison of the sequence-based RT profiles showed that most regions of the maize genome replicate at the same time during S phase in mitotic and endocycling cells, despite the need to replicate twice as much DNA in the endocycle and the fact that endocycling is typically associated with cell differentiation. However, regions collectively corresponding to 2% of the genome displayed significant changes in timing between the two types of cell cycles. The majority of these regions are small with a median size of 135 kb, shift to a later RT in the endocycle, and are enriched for genes expressed in the root tip. We found larger regions that shifted RT in centromeres of seven of the ten maize chromosomes. These regions covered the majority of the previously defined functional centromere, which ranged between 1 and 2 Mb in size in the reference genome. They replicate mainly during mid S phase in mitotic cells but primarily in late S phase of the endocycle. In contrast, the immediately adjacent pericentromere sequences are primarily late replicating in both cell cycles. Analysis of CENH3 enrichment levels in 8C vs 2C nuclei suggested that there is only a partial replacement of CENH3 nucleosomes after endocycle replication is complete. The shift to later replication of centromeres and possible reduction in CENH3 enrichment after endocycle replication is consistent with a hypothesis that centromeres are inactivated when their function is no longer needed. In traditional cell division, or mitosis, a cell’s genetic material is duplicated and then split between two daughter cells. In contrast, in some specialized cell types, the DNA is duplicated a second time without an intervening division step, resulting in cells that carry twice as much DNA. This phenomenon, which is called the endocycle, is common during plant development. At each step, DNA replication follows an ordered program in which highly compacted DNA is unraveled and replicated in sections at different times during the synthesis (S) phase. In plants, it is unclear whether traditional and endocycle programs are the same, especially since endocycling cells are typically in the process of differentiation. Using root tips of maize, we found that in comparison to replication in the mitotic cell cycle, there is a small portion of the genome whose replication in the endocycle is shifted in time, usually to later in S phase. Some of these regions are scattered around the genome and mostly coincide with active genes. However, the most prominent shifts occur in centromeres. The shift to later replication in centromeres is noteworthy because they orchestrate the process of separating duplicated chromosomes into daughter cells, a function that is not needed in the endocycle.
Collapse
Affiliation(s)
- Emily E. Wear
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, North Carolina, United States of America
- * E-mail:
| | - Jawon Song
- Texas Advanced Computing Center, University of Texas, Austin, Texas, United States of America
| | - Gregory J. Zynda
- Texas Advanced Computing Center, University of Texas, Austin, Texas, United States of America
| | - Leigh Mickelson-Young
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Chantal LeBlanc
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
| | - Tae-Jin Lee
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, North Carolina, United States of America
| | - David O. Deppong
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, North Carolina, United States of America
| | - George C. Allen
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina, United States of America
| | - Robert A. Martienssen
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
| | - Matthew W. Vaughn
- Texas Advanced Computing Center, University of Texas, Austin, Texas, United States of America
| | - Linda Hanley-Bowdoin
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, North Carolina, United States of America
| | - William F. Thompson
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, North Carolina, United States of America
| |
Collapse
|
44
|
Genome wide annotation and characterization of young, intact long terminal repeat retrotransposons (In-LTR-RTs) of seven legume species. Genetica 2020; 148:253-268. [PMID: 32949338 DOI: 10.1007/s10709-020-00103-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Accepted: 09/02/2020] [Indexed: 10/23/2022]
Abstract
Availability of genome sequence of different legume species has provided an opportunity to characterize the abundance, distribution, and divergence of canonical intact long terminal retrotransposons (In-LTR-RT) superfamilies. Among seven legume species, Arachis ipaensis (Aip) showed the highest number of full-length canonical In-LTR-RTs (3325), followed by Glycine max (Gma, 2328), Vigna angularis (Van, 1625), Arachis durensis (Adu, 1348), Lotus japonicus (Lja, 1294), Medicago truncatula (Mtr, 788), and Circer arietinum (Car, 124). Divergence time analysis demonstrated that the amplification timeframe of LTR-RTs dramatically varied in different families. The average insertion time of Copia element varied from 0.51 (Van) to 1.37 million years ago (Mya) (Adu, and Aip), whereas that of Gypsy was between 0.22 (Mtr) and 1.82 Mya (Adu). Bayesian phylogenetic tree analysis suggested that the 1397 and 1917 reverse transcriptase (RT) domains of Copia and Gypsy families of the seven legume species were clustered into 7 and 14 major groups, respectively. The highest proportion (approximately 94.79-100%) of transposable element (TE)-associated genes assigned to pathways was mapped to metabolism-related pathways in all species. The results enabled the structural understanding of full-length In-LTR-RTs and will be valuable resource for the further study of the impact of TEs on gene structure and expression in legume species.
Collapse
|
45
|
Mata-Sucre Y, Sader M, Van-Lume B, Gagnon E, Pedrosa-Harand A, Leitch IJ, Lewis GP, Souza G. How diverse is heterochromatin in the Caesalpinia group? Cytogenomic characterization of Erythrostemon hughesii Gagnon & G.P. Lewis (Leguminosae: Caesalpinioideae). PLANTA 2020; 252:49. [PMID: 32918627 DOI: 10.1007/s00425-020-03453-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 08/27/2020] [Indexed: 05/25/2023]
Abstract
Cytogenomic characterization of Erythrostemon hughesii reveals a heterogeneity of repeats in its subtelomeric heterochromatin. Comparative analyses with other Caesalpinia group species reveal a significant reduction in the abundance of Ty3-gypsy/Chromovirus Tekay retrotransposons during its evolution. In numerically stable karyotypes, repetitive DNA variability is one of the main causes of genome and chromosome variation and evolution. Species from the Caesalpinia group (Leguminosae) are karyotypically characterized by 2n = 24, with small chromosomes and highly variable CMA+ heterochromatin banding patterns that correlate with environmental variables. Erythrostemon hughesii differs from other species of the group examined to date for having subtelomeric CMA+ bands; this contrasts with most species in the group which have proximal bands. Here we analyse the repeatome of E. hughesii using genome skimming and chromosomal mapping approaches to characterize the identity of the most abundant repetitive elements and their physical location. The repetitive fraction of E. hughesii comprises 28.73% of the genome. The most abundant elements were retrotransposons (RT) with long terminal repeats (LTR-RT; 9.76%) and satellite DNAs (7.83%). Within the LTR-RTs, the most abundant lineages were: Ty1/copia-Ale (1%), Ty3/gypsy CRM (0.88%) and Ty3/gypsy Athila (0.75%). Using fluorescent in situ hybridization four satellite DNAs and several LTR-RT elements were shown to be present in most subtelomeric CMA+ bands. These results highlight how the repeatome in E. hughesii, a species from Oaxaca state in Mexico, is clearly distinct from Northeast Brazilian species of the Caesalpinia group, mainly due to its high diversity of repeats in its subtelomeric heterochromatic bands and low amount of LTR-RT Ty3/gypsy-Tekay elements. Comparative sequence analysis of Tekay elements from different species is congruent with a clade-specific origin of this LTR-RT after the divergence of the Caesalpinia group. We hypothesize that repeat-rich heterochromatin may play a role in leading to faster genomic divergence between individuals, increasing speciation and diversification.
Collapse
Affiliation(s)
- Yennifer Mata-Sucre
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Rua Nelson Chaves S/N, Cidade Universitaria, Recife, PE, 50670-420, Brazil
| | - Mariela Sader
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Rua Nelson Chaves S/N, Cidade Universitaria, Recife, PE, 50670-420, Brazil
| | - Brena Van-Lume
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Rua Nelson Chaves S/N, Cidade Universitaria, Recife, PE, 50670-420, Brazil
| | - Edeline Gagnon
- Royal Botanic Garden Edinburgh, 20A Inverleith Row, Edinburgh, EH3 5NZ, UK
| | - Andrea Pedrosa-Harand
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Rua Nelson Chaves S/N, Cidade Universitaria, Recife, PE, 50670-420, Brazil
| | - Ilia J Leitch
- Comparative Plant and Fungal Biology Department, Royal Botanic Gardens, Kew, Richmond, TW9 3AB, Surrey, UK
| | - Gwilym P Lewis
- Comparative Plant and Fungal Biology Department, Royal Botanic Gardens, Kew, Richmond, TW9 3AB, Surrey, UK
| | - Gustavo Souza
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Rua Nelson Chaves S/N, Cidade Universitaria, Recife, PE, 50670-420, Brazil.
| |
Collapse
|
46
|
Woodruff GC, Teterina AA. Degradation of the Repetitive Genomic Landscape in a Close Relative of Caenorhabditis elegans. Mol Biol Evol 2020; 37:2549-2567. [PMID: 32359146 PMCID: PMC7475029 DOI: 10.1093/molbev/msaa107] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
The abundance, diversity, and genomic distribution of repetitive elements is highly variable among species. These patterns are thought to be driven in part by reproductive mode and the interaction of selection and recombination, and recombination rates typically vary by chromosomal position. In the nematode Caenorhabditis elegans, repetitive elements are enriched at chromosome arms and depleted on centers, and this mirrors the chromosomal distributions of other genomic features such as recombination rate. How conserved is this genomic landscape of repeats, and what evolutionary forces maintain it? To address this, we compared the genomic organization of repetitive elements across five Caenorhabditis species with chromosome-level assemblies. As previously reported, repeat content is enriched on chromosome arms in most Caenorhabditis species, and no obvious patterns of repeat content associated with reproductive mode were observed. However, the fig-associated C. inopinata has experienced repetitive element expansion and reveals no association of global repeat density with chromosome position. Patterns of repeat superfamily specific distributions reveal this global pattern is driven largely by a few repeat superfamilies that in C. inopinata have expanded in number and have weak associations with chromosome position. Additionally, 15% of predicted protein-coding genes in C. inopinata align to transposon-related proteins. When these are excluded, C. inopinata has no enrichment of genes in chromosome centers, in contrast to its close relatives who all have such clusters. Forward evolutionary simulations reveal that chromosomal heterogeneity in recombination rate alone can generate structured repetitive genomic landscapes when insertions are weakly deleterious, whereas chromosomal heterogeneity in the fitness effects of transposon insertion can promote such landscapes across a variety of evolutionary scenarios. Thus, patterns of gene density along chromosomes likely contribute to global repetitive landscapes in this group, although other historical or genomic factors are needed to explain the idiosyncrasy of genomic organization of various transposable element taxa within C. inopinata. Taken together, these results highlight the power of comparative genomics and evolutionary simulations in testing hypotheses regarding the causes of genome organization.
Collapse
Affiliation(s)
- Gavin C Woodruff
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR
| | - Anastasia A Teterina
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR
- Center of Parasitology, Severtsov Institute of Ecology and Evolution RAS, Moscow, Russia
| |
Collapse
|
47
|
Zavallo D, Crescente JM, Gantuz M, Leone M, Vanzetti LS, Masuelli RW, Asurmendi S. Genomic re-assessment of the transposable element landscape of the potato genome. PLANT CELL REPORTS 2020; 39:1161-1174. [PMID: 32435866 DOI: 10.1007/s00299-020-02554-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Accepted: 05/07/2020] [Indexed: 05/14/2023]
Abstract
We provide a comprehensive and reliable potato TE landscape, based on a wide variety of identification tools and integrative approaches, producing clear and ready-to-use outputs for the scientific community. Transposable elements (TEs) are DNA sequences with the ability to autoreplicate and move throughout the host genome. TEs are major drivers in stress response and genome evolution. Given their significance, the development of clear and efficient TE annotation pipelines has become essential for many species. The latest de novo TE discovery tools, along with available TEs from Repbase and sRNA-seq data, allowed us to perform a reliable potato TEs detection, classification and annotation through an open-source and freely available pipeline ( https://github.com/DiegoZavallo/TE_Discovery ). Using a variety of tools, approaches and rules, we were able to provide a clearly annotated of characterized TEs landscape. Additionally, we described the distribution of the different types of TEs across the genome, where LTRs and MITEs present a clear clustering pattern in pericentromeric and subtelomeric/telomeric regions respectively. Finally, we analyzed the insertion age and distribution of LTR retrotransposon families which display a distinct pattern between the two major superfamilies. While older Gypsy elements concentrated around heterochromatic regions, younger Copia elements located predominantly on euchromatic regions. Overall, we delivered not only a reliable, ready-to-use potato TE annotation files, but also all the necessary steps to perform de novo detection for other species.
Collapse
Affiliation(s)
- Diego Zavallo
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET), Los Reseros y Nicolas Repeto, Hurlingham, Argentina.
| | - Juan Manuel Crescente
- Grupo Biotecnologia y Recursos Genéticos, EEA INTA Marcos Juárez, Ruta 12 Km 3, 2580, Marcos Juárez, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Magdalena Gantuz
- Instituto de Biología Agrícola de Mendoza (IBAM), Facultad de Ciencias Agrarias (FCA), CONICET-UNCuyo, Almirante Brown 500, M5528AHB, Chacras de Coria, Mendoza, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Melisa Leone
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET), Los Reseros y Nicolas Repeto, Hurlingham, Argentina
- Agencia Nacional de Promocion Científica y Tecnológica (ANPCyT), Buenos Aires, Argentina
| | - Leonardo Sebastian Vanzetti
- Grupo Biotecnologia y Recursos Genéticos, EEA INTA Marcos Juárez, Ruta 12 Km 3, 2580, Marcos Juárez, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Ricardo Williams Masuelli
- Instituto de Biología Agrícola de Mendoza (IBAM), Facultad de Ciencias Agrarias (FCA), CONICET-UNCuyo, Almirante Brown 500, M5528AHB, Chacras de Coria, Mendoza, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Sebastian Asurmendi
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), Instituto Nacional de Tecnología Agropecuaria (INTA), Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET), Los Reseros y Nicolas Repeto, Hurlingham, Argentina.
| |
Collapse
|
48
|
The unusual dRemp retrotransposon is abundant, highly mutagenic, and mobilized only in the second pollen mitosis of some maize lines. Proc Natl Acad Sci U S A 2020; 117:18091-18098. [PMID: 32661148 DOI: 10.1073/pnas.2010234117] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The frequent mutations recovered recently from the pollen of select maize lines resulted from the meiotic mobilization of specific low-copy number long-terminal repeat (LTR) retrotransposons, which differ among lines. Mutations that arise at male meiosis produce kernels with concordant mutant phenotypes in both endosperm and embryo because the two sperms that participate in double fertilization are genetically identical. Those are in a majority. However, a small minority of kernels with a mutant endosperm carry a nonconcordant normal embryo, pointing to a postmeiotic or microgametophytic origin. In this study, we have identified the basis for those nonconcordant mutations. We find that all are produced by transposition of a defective LTR retrotransposon that we have termed dRemp (defective retroelement mobile in pollen). This element has several unique properties. Unlike the mutagenic LTR retrotransposons identified previously, dRemp is present in hundreds of copies in all sequenced lines. It seems to transpose only at the second pollen mitosis because all dRemp insertion mutants are nonconcordant yet recoverable in either the endosperm or the embryo. Although it does not move in most lines, dRemp is highly mobile in the Corn Belt inbred M14, identified earlier by breeders as being highly unstable. Lastly, it can be recovered in an array of structures, ranging from solo LTRs to tandem dRemp repeats containing several internal LTRs, suggestive of extensive recombination during retrotransposition. These results shed further light on the spontaneous mutation process and on the possible basis for inbred instability in maize.
Collapse
|
49
|
Parvathaneni RK, Bertolini E, Shamimuzzaman M, Vera DL, Lung PY, Rice BR, Zhang J, Brown PJ, Lipka AE, Bass HW, Eveland AL. The regulatory landscape of early maize inflorescence development. Genome Biol 2020; 21:165. [PMID: 32631399 PMCID: PMC7336428 DOI: 10.1186/s13059-020-02070-8] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2019] [Accepted: 06/11/2020] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND The functional genome of agronomically important plant species remains largely unexplored, yet presents a virtually untapped resource for targeted crop improvement. Functional elements of regulatory DNA revealed through profiles of chromatin accessibility can be harnessed for fine-tuning gene expression to optimal phenotypes in specific environments. RESULT Here, we investigate the non-coding regulatory space in the maize (Zea mays) genome during early reproductive development of pollen- and grain-bearing inflorescences. Using an assay for differential sensitivity of chromatin to micrococcal nuclease (MNase) digestion, we profile accessible chromatin and nucleosome occupancy in these largely undifferentiated tissues and classify at least 1.6% of the genome as accessible, with the majority of MNase hypersensitive sites marking proximal promoters, but also 3' ends of maize genes. This approach maps regulatory elements to footprint-level resolution. Integration of complementary transcriptome profiles and transcription factor occupancy data are used to annotate regulatory factors, such as combinatorial transcription factor binding motifs and long non-coding RNAs, that potentially contribute to organogenesis, including tissue-specific regulation between male and female inflorescence structures. Finally, genome-wide association studies for inflorescence architecture traits based solely on functional regions delineated by MNase hypersensitivity reveals new SNP-trait associations in known regulators of inflorescence development as well as new candidates. CONCLUSIONS These analyses provide a comprehensive look into the cis-regulatory landscape during inflorescence differentiation in a major cereal crop, which ultimately shapes architecture and influences yield potential.
Collapse
Affiliation(s)
| | | | - Md Shamimuzzaman
- Donald Danforth Plant Science Center, St. Louis, MO 63132 USA
- Current address: USDA-ARS Edward T. Schafer Agricultural Research Center, Fargo, ND 58102 USA
| | - Daniel L. Vera
- The Center for Genomics and Personalized Medicine, Florida State University, Tallahassee, FL 32306 USA
- Current address: Department of Genetics, Harvard Medical School, Boston, MA 02115 USA
| | - Pei-Yau Lung
- Department of Statistics, Florida State University, Tallahassee, FL 32306 USA
| | - Brian R. Rice
- Department of Crop Sciences, University of Illinois Urbana-Champaign, Urbana, IL 61801 USA
| | - Jinfeng Zhang
- Department of Statistics, Florida State University, Tallahassee, FL 32306 USA
| | - Patrick J. Brown
- Department of Plant Sciences, University of California, Davis, CA 95616 USA
| | - Alexander E. Lipka
- Department of Crop Sciences, University of Illinois Urbana-Champaign, Urbana, IL 61801 USA
| | - Hank W. Bass
- Department of Biological Science, Florida State University, Tallahassee, FL 32306 USA
| | | |
Collapse
|
50
|
Kögler A, Seibt KM, Heitkam T, Morgenstern K, Reiche B, Brückner M, Wolf H, Krabel D, Schmidt T. Divergence of 3' ends as a driver of short interspersed nuclear element (SINE) evolution in the Salicaceae. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 103:443-458. [PMID: 32056333 DOI: 10.1111/tpj.14721] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Revised: 01/13/2020] [Accepted: 01/29/2020] [Indexed: 06/10/2023]
Abstract
Short interspersed nuclear elements (SINEs) are small, non-autonomous and heterogeneous retrotransposons that are widespread in plants. To explore the amplification dynamics and evolutionary history of SINE populations in representative deciduous tree species, we analyzed the genomes of the six following Salicaceae species: Populus deltoides, Populus euphratica, Populus tremula, Populus tremuloides, Populus trichocarpa, and Salix purpurea. We identified 11 Salicaceae SINE families (SaliS-I to SaliS-XI), comprising 27 077 full-length copies. Most of these families harbor segmental similarities, providing evidence for SINE emergence by reshuffling or heterodimerization. We observed two SINE groups, differing in phylogenetic distribution pattern, similarity and 3' end structure. These groups probably emerged during the 'salicoid duplication' (~65 million years ago) in the Salix-Populus progenitor and during the separation of the genus Salix (45-65 million years ago), respectively. In contrast to conserved 5' start motifs across species and SINE families, the 3' ends are highly variable in sequence and length. This extraordinary 3'-end variability results from mutations in the poly(A) tail, which were fixed by subsequent amplificational bursts. We show that the dissemination of newly evolved 3' ends is accomplished by a displacement of older motifs, leading to various 3'-end subpopulations within the SaliS families.
Collapse
Affiliation(s)
- Anja Kögler
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kathrin M Seibt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Tony Heitkam
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| | - Kristin Morgenstern
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Birgit Reiche
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | | | - Heino Wolf
- Staatsbetrieb Sachsenforst, 01796, Pirna, Germany
| | - Doris Krabel
- Department of Forest Sciences, Institute of Forest Botany and Forest Zoology, Technische Universität Dresden, 01735, Tharandt, Germany
| | - Thomas Schmidt
- Faculty of Biology, Institute of Botany, Technische Universität Dresden, 01062, Dresden, Germany
| |
Collapse
|