751
|
Mikkelsen EK, Weir JT. The genome of the Xingu scale-backed antbird (Willisornis vidua nigrigula) reveals lineage-specific adaptations. Genomics 2020; 112:4552-4560. [PMID: 32771623 DOI: 10.1016/j.ygeno.2020.07.047] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 07/08/2020] [Accepted: 07/30/2020] [Indexed: 12/23/2022]
Abstract
Antbirds (Thamnophilidae) are a large neotropical family of passerine bird renowned for the ant-following foraging strategies of several members of this clade. The high diversity of antbirds provides ample opportunity for speciation studies, however these studies can be hindered by the lack of an annotated antbird reference genome. In this study, we produced a high-quality annotated reference genome for the Xingu Scale-backed Antbird (Willisornis vidua nigrigula) using 10X Genomics Chromium linked-reads technology. The assembly is 1.09 Gb, with a scaffold N50 of 12.1 Mb and 17,475 annotated protein coding genes. We compare the proteome of W. v. nigrigula to several other passerines, and produce annotations for two additional antbird genomes in order to identify genes under lineage-specific positive selection and gene families with evidence for significant expansions in antbirds. Several of these genes have functions potentially related to the lineage-specific traits of antbirds, including adaptations for thermoregulation in a humid tropical environment.
Collapse
Affiliation(s)
- Else K Mikkelsen
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto M5S 3B2, ON, Canada.
| | - Jason T Weir
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto M5S 3B2, ON, Canada; Department of Biological Sciences, University of Toronto Scarborough, Toronto M1C 1A4, ON, Canada; Department of Ornithology, Royal Ontario Museum, Toronto, Canada
| |
Collapse
|
752
|
Bellinger MR, Paudel R, Starnes S, Kambic L, Kantar MB, Wolfgruber T, Lamour K, Geib S, Sim S, Miyasaka SC, Helmkampf M, Shintaku M. Taro Genome Assembly and Linkage Map Reveal QTLs for Resistance to Taro Leaf Blight. G3 (BETHESDA, MD.) 2020; 10:2763-2775. [PMID: 32546503 PMCID: PMC7407455 DOI: 10.1534/g3.120.401367] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/09/2020] [Accepted: 06/08/2020] [Indexed: 02/06/2023]
Abstract
Taro (Colocasia esculenta) is a food staple widely cultivated in the humid tropics of Asia, Africa, Pacific and the Caribbean. One of the greatest threats to taro production is Taro Leaf Blight caused by the oomycete pathogen Phytophthora colocasiae Here we describe a de novo taro genome assembly and use it to analyze sequence data from a Taro Leaf Blight resistant mapping population. The genome was assembled from linked-read sequences (10x Genomics; ∼60x coverage) and gap-filled and scaffolded with contigs assembled from Oxford Nanopore Technology long-reads and linkage map results. The haploid assembly was 2.45 Gb total, with a maximum contig length of 38 Mb and scaffold N50 of 317,420 bp. A comparison of family-level (Araceae) genome features reveals the repeat content of taro to be 82%, >3.5x greater than in great duckweed (Spirodela polyrhiza), 23%. Both genomes recovered a similar percent of Benchmarking Universal Single-copy Orthologs, 80% and 84%, based on a 3,236 gene database for monocot plants. A greater number of nucleotide-binding leucine-rich repeat disease resistance genes were present in genomes of taro than the duckweed, ∼391 vs. ∼70 (∼182 and ∼46 complete). The mapping population data revealed 16 major linkage groups with 520 markers, and 10 quantitative trait loci (QTL) significantly associated with Taro Leaf Blight disease resistance. The genome sequence of taro enhances our understanding of resistance to TLB, and provides markers that may accelerate breeding programs. This genome project may provide a template for developing genomic resources in other understudied plant species.
Collapse
Affiliation(s)
| | - Roshan Paudel
- University of Hawaii at Manoa, Department of Tropical Plant and Soil Sciences, Honolulu, Hawaii
| | - Steven Starnes
- University of Hawaii at Hilo, College of Agriculture, Forestry and Natural Resource Management, Hilo, Hawaii
| | - Lukas Kambic
- University of Hawaii at Hilo, College of Agriculture, Forestry and Natural Resource Management, Hilo, Hawaii
| | - Michael B Kantar
- University of Hawaii at Manoa, Department of Tropical Plant and Soil Sciences, Honolulu, Hawaii
| | - Thomas Wolfgruber
- University of Hawaii at Manoa, Department of Tropical Plant and Soil Sciences, Honolulu, Hawaii
| | - Kurt Lamour
- University of Tennessee at Knoxville, Department of Entomology and Plant Pathology, Knoxville, Tennessee
| | - Scott Geib
- United States Department of Agriculture-Agricultural Research Service, Hilo, Hawaii
| | - Sheina Sim
- United States Department of Agriculture-Agricultural Research Service, Hilo, Hawaii
| | - Susan C Miyasaka
- University of Hawaii at Manoa, Department of Tropical Plant and Soil Sciences, Honolulu, Hawaii
| | - Martin Helmkampf
- University of Hawaii at Hilo, Department of Biology, Hilo, Hawaii
| | - Michael Shintaku
- University of Hawaii at Hilo, College of Agriculture, Forestry and Natural Resource Management, Hilo, Hawaii,
| |
Collapse
|
753
|
Perumal S, Koh CS, Jin L, Buchwaldt M, Higgins EE, Zheng C, Sankoff D, Robinson SJ, Kagale S, Navabi ZK, Tang L, Horner KN, He Z, Bancroft I, Chalhoub B, Sharpe AG, Parkin IAP. A high-contiguity Brassica nigra genome localizes active centromeres and defines the ancestral Brassica genome. NATURE PLANTS 2020; 6:929-941. [PMID: 32782408 PMCID: PMC7419231 DOI: 10.1038/s41477-020-0735-y] [Citation(s) in RCA: 78] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Accepted: 06/28/2020] [Indexed: 05/19/2023]
Abstract
It is only recently, with the advent of long-read sequencing technologies, that we are beginning to uncover previously uncharted regions of complex and inherently recursive plant genomes. To comprehensively study and exploit the genome of the neglected oilseed Brassica nigra, we generated two high-quality nanopore de novo genome assemblies. The N50 contig lengths for the two assemblies were 17.1 Mb (12 contigs), one of the best among 324 sequenced plant genomes, and 0.29 Mb (424 contigs), respectively, reflecting recent improvements in the technology. Comparison with a de novo short-read assembly corroborated genome integrity and quantified sequence-related error rates (0.2%). The contiguity and coverage allowed unprecedented access to low-complexity regions of the genome. Pericentromeric regions and coincidence of hypomethylation enabled localization of active centromeres and identified centromere-associated ALE family retro-elements that appear to have proliferated through relatively recent nested transposition events (<1 Ma). Genomic distances calculated based on synteny relationships were used to define a post-triplication Brassica-specific ancestral genome, and to calculate the extensive rearrangements that define the evolutionary distance separating B. nigra from its diploid relatives.
Collapse
Affiliation(s)
- Sampath Perumal
- Agriculture and Agri-Food Canada, Saskatoon, Saskatchewan, Canada
| | - Chu Shin Koh
- Global Institute for Food Security, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Lingling Jin
- Department of Computing Science, Thompson Rivers University, Kamloops, British Columbia, Canada
| | - Miles Buchwaldt
- Agriculture and Agri-Food Canada, Saskatoon, Saskatchewan, Canada
| | - Erin E Higgins
- Agriculture and Agri-Food Canada, Saskatoon, Saskatchewan, Canada
| | - Chunfang Zheng
- Department of Mathematics and Statistics, University of Ottawa, Ottawa, Ontario, Canada
| | - David Sankoff
- Department of Mathematics and Statistics, University of Ottawa, Ottawa, Ontario, Canada
| | | | - Sateesh Kagale
- National Research Council Canada, Saskatoon, Saskatchewan, Canada
| | - Zahra-Katy Navabi
- Agriculture and Agri-Food Canada, Saskatoon, Saskatchewan, Canada
- Global Institute for Food Security, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Lily Tang
- Agriculture and Agri-Food Canada, Saskatoon, Saskatchewan, Canada
| | - Kyla N Horner
- Agriculture and Agri-Food Canada, Saskatoon, Saskatchewan, Canada
| | - Zhesi He
- Department of Biology, University of York, York, UK
| | - Ian Bancroft
- Department of Biology, University of York, York, UK
| | - Boulos Chalhoub
- Institute of Crop Science, Zhejiang University, Hangzhou, China
| | - Andrew G Sharpe
- Global Institute for Food Security, University of Saskatchewan, Saskatoon, Saskatchewan, Canada.
| | | |
Collapse
|
754
|
European maize genomes highlight intraspecies variation in repeat and gene content. Nat Genet 2020; 52:950-957. [PMID: 32719517 PMCID: PMC7467862 DOI: 10.1038/s41588-020-0671-9] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Accepted: 06/25/2020] [Indexed: 12/22/2022]
Abstract
The diversity of maize (Zea mays) is the backbone of modern heterotic patterns and hybrid breeding. Historically, US farmers exploited this variability to establish today’s highly productive Corn Belt inbred lines from blends of dent and flint germplasm pools. Here, we report de novo genome sequences of four European flint lines assembled to pseudomolecules with scaffold N50 ranging from 6.1 to 10.4 Mb. Comparative analyses with two US Corn Belt lines explains the pronounced differences between both germplasms. While overall syntenic order and consolidated gene annotations reveal only moderate pangenomic differences, whole-genome alignments delineating the core and dispensable genome, and the analysis of heterochromatic knobs and orthologous long terminal repeat retrotransposons unveil the dynamics of the maize genome. The high-quality genome sequences of the flint pool complement the maize pangenome and provide an important tool to study maize improvement at a genome scale and to enhance modern hybrid breeding. De novo genome assemblies of four European flint maize lines and comparison with two US Corn Belt genomes provide insights into the dynamics of intraspecies variation in repeat and gene content in maize genomes.
Collapse
|
755
|
Xia E, Tong W, Hou Y, An Y, Chen L, Wu Q, Liu Y, Yu J, Li F, Li R, Li P, Zhao H, Ge R, Huang J, Mallano AI, Zhang Y, Liu S, Deng W, Song C, Zhang Z, Zhao J, Wei S, Zhang Z, Xia T, Wei C, Wan X. The Reference Genome of Tea Plant and Resequencing of 81 Diverse Accessions Provide Insights into Its Genome Evolution and Adaptation. MOLECULAR PLANT 2020; 13:1013-1026. [PMID: 32353625 DOI: 10.1016/j.molp.2020.04.010] [Citation(s) in RCA: 240] [Impact Index Per Article: 48.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2019] [Revised: 02/29/2020] [Accepted: 04/24/2020] [Indexed: 05/19/2023]
Abstract
Tea plant is an important economic crop, which is used to produce the world's oldest and most widely consumed tea beverages. Here, we present a high-quality reference genome assembly of the tea plant (Camellia sinensis var. sinensis) consisting of 15 pseudo-chromosomes. LTR retrotransposons (LTR-RTs) account for 70.38% of the genome, and we present evidence that LTR-RTs play critical roles in genome size expansion and the transcriptional diversification of tea plant genes through preferential insertion in promoter regions and introns. Genes, particularly those coding for terpene biosynthesis proteins, associated with tea aroma and stress resistance were significantly amplified through recent tandem duplications and exist as gene clusters in tea plant genome. Phylogenetic analysis of the sequences of 81 tea plant accessions with diverse origins revealed three well-differentiated tea plant populations, supporting the proposition for the southwest origin of the Chinese cultivated tea plant and its later spread to western Asia through introduction. Domestication and modern breeding left significant signatures on hundreds of genes in the tea plant genome, particularly those associated with tea quality and stress resistance. The genomic sequences of the reported reference and resequenced tea plant accessions provide valuable resources for future functional genomics study and molecular breeding of improved cultivars of tea plants.
Collapse
Affiliation(s)
- Enhua Xia
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Wei Tong
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Yan Hou
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Yanlin An
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Linbo Chen
- Tea Research Institute, Yunnan Academy of Agricultural Sciences, Menghai 666201, China
| | - Qiong Wu
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Yunlong Liu
- Germplasm Bank of Wild Species in Southwestern China, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650204, China
| | - Jie Yu
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Fangdong Li
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Ruopei Li
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Penghui Li
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Huijuan Zhao
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Ruoheng Ge
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Jin Huang
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Ali Inayat Mallano
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Yanrui Zhang
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Shengrui Liu
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Weiwei Deng
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Chuankui Song
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Zhaoliang Zhang
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Jian Zhao
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Shu Wei
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Zhengzhu Zhang
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Tao Xia
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China
| | - Chaoling Wei
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China.
| | - Xiaochun Wan
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei 230036, China.
| |
Collapse
|
756
|
Xu Z, Pu X, Gao R, Demurtas OC, Fleck SJ, Richter M, He C, Ji A, Sun W, Kong J, Hu K, Ren F, Song J, Wang Z, Gao T, Xiong C, Yu H, Xin T, Albert VA, Giuliano G, Chen S, Song J. Tandem gene duplications drive divergent evolution of caffeine and crocin biosynthetic pathways in plants. BMC Biol 2020; 18:63. [PMID: 32552824 PMCID: PMC7302004 DOI: 10.1186/s12915-020-00795-3] [Citation(s) in RCA: 94] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Accepted: 05/18/2020] [Indexed: 12/11/2022] Open
Abstract
Background Plants have evolved a panoply of specialized metabolites that increase their environmental fitness. Two examples are caffeine, a purine psychotropic alkaloid, and crocins, a group of glycosylated apocarotenoid pigments. Both classes of compounds are found in a handful of distantly related plant genera (Coffea, Camellia, Paullinia, and Ilex for caffeine; Crocus, Buddleja, and Gardenia for crocins) wherein they presumably evolved through convergent evolution. The closely related Coffea and Gardenia genera belong to the Rubiaceae family and synthesize, respectively, caffeine and crocins in their fruits. Results Here, we report a chromosomal-level genome assembly of Gardenia jasminoides, a crocin-producing species, obtained using Oxford Nanopore sequencing and Hi-C technology. Through genomic and functional assays, we completely deciphered for the first time in any plant the dedicated pathway of crocin biosynthesis. Through comparative analyses with Coffea canephora and other eudicot genomes, we show that Coffea caffeine synthases and the first dedicated gene in the Gardenia crocin pathway, GjCCD4a, evolved through recent tandem gene duplications in the two different genera, respectively. In contrast, genes encoding later steps of the Gardenia crocin pathway, ALDH and UGT, evolved through more ancient gene duplications and were presumably recruited into the crocin biosynthetic pathway only after the evolution of the GjCCD4a gene. Conclusions This study shows duplication-based divergent evolution within the coffee family (Rubiaceae) of two characteristic secondary metabolic pathways, caffeine and crocin biosynthesis, from a common ancestor that possessed neither complete pathway. These findings provide significant insights on the role of tandem duplications in the evolution of plant specialized metabolism.
Collapse
Affiliation(s)
- Zhichao Xu
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China.,Engineering Research Center of Chinese Medicine Resource, Ministry of Education, Beijing, 100193, China
| | - Xiangdong Pu
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China
| | - Ranran Gao
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China
| | - Olivia Costantina Demurtas
- Italian National Agency for New Technologies, Energy and Sustainable Economic Development (ENEA), Casaccia Res. Ctr, 00123, Rome, Italy
| | - Steven J Fleck
- Department of Biological Sciences, University at Buffalo, Buffalo, NY, 14260, USA
| | - Michaela Richter
- Department of Biological Sciences, University at Buffalo, Buffalo, NY, 14260, USA
| | - Chunnian He
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China.,Engineering Research Center of Chinese Medicine Resource, Ministry of Education, Beijing, 100193, China
| | - Aijia Ji
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China
| | - Wei Sun
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China
| | - Jianqiang Kong
- Institute of Materia Medica, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100050, China
| | - Kaizhi Hu
- Chongqing Institute of Medicinal Plant Cultivation, Chongqing, 408435, China
| | - Fengming Ren
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China.,Chongqing Institute of Medicinal Plant Cultivation, Chongqing, 408435, China
| | - Jiejie Song
- College of Life Sciences, Qingdao Agricultural University, Qingdao, 266109, China
| | - Zhe Wang
- Institute of Materia Medica, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100050, China
| | - Ting Gao
- College of Life Sciences, Qingdao Agricultural University, Qingdao, 266109, China
| | - Chao Xiong
- Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China
| | - Haoying Yu
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China
| | - Tianyi Xin
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China
| | - Victor A Albert
- Department of Biological Sciences, University at Buffalo, Buffalo, NY, 14260, USA.,School of Biological Sciences, Nanyang Technological University, Singapore, 637551, Singapore
| | - Giovanni Giuliano
- Italian National Agency for New Technologies, Energy and Sustainable Economic Development (ENEA), Casaccia Res. Ctr, 00123, Rome, Italy.
| | - Shilin Chen
- Engineering Research Center of Chinese Medicine Resource, Ministry of Education, Beijing, 100193, China. .,Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700, China.
| | - Jingyuan Song
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China. .,Engineering Research Center of Chinese Medicine Resource, Ministry of Education, Beijing, 100193, China. .,Yunnan Branch, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Jinghong, 666100, China.
| |
Collapse
|
757
|
Li W, Zhang Q, Zhu T, Tong Y, Li K, Shi C, Zhang Y, Liu Y, Jiang J, Liu Y, Xia E, Huang H, Zhang L, Zhang D, Shi C, Jiang W, Zhao Y, Mao S, Jiao J, Xu P, Yang L, Gao L. Draft genomes of two outcrossing wild rice, Oryza rufipogon and O. longistaminata, reveal genomic features associated with mating-system evolution. PLANT DIRECT 2020; 4:e00232. [PMID: 32537559 PMCID: PMC7287411 DOI: 10.1002/pld3.232] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2019] [Revised: 05/07/2020] [Accepted: 05/15/2020] [Indexed: 05/04/2023]
Abstract
Oryza rufipogon and O. longistaminata are important wild relatives of cultivated rice, harboring a promising source of novel genes for rice breeding programs. Here, we present de novo assembled draft genomes and annotation of O. rufipogon and O. longistaminata. Our analysis reveals a considerable number of lineage-specific gene families associated with the self-incompatibility (SI) and formation of reproductive separation. We show how lineage-specific expansion or contraction of gene families with functional enrichment of the recognition of pollen, thus enlightening their reproductive diversification. We documented a large number of lineage-specific gene families enriched in salt stress, antifungal response, and disease resistance. Our comparative analysis further shows a genome-wide expansion of genes encoding NBS-LRR proteins in these two outcrossing wild species in contrast to six other selfing rice species. Conserved noncoding sequences (CNSs) in the two wild rice genomes rapidly evolve relative to selfing rice species, resulting in the reduction of genomic variation owing to shifts of mating systems. We find that numerous genes related to these rapidly evolving CNSs are enriched in reproductive structure development, flower development, and postembryonic development, which may associate with SI in O. rufipogon and O. longistaminata.
Collapse
Affiliation(s)
- Wei Li
- Institution of Genomics and BioinformaticsSouth China Agricultural UniversityGuangzhouChina
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Qun‐Jie Zhang
- Institution of Genomics and BioinformaticsSouth China Agricultural UniversityGuangzhouChina
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Ting Zhu
- Institution of Genomics and BioinformaticsSouth China Agricultural UniversityGuangzhouChina
- College of Life ScienceLiaoning Normal UniversityDalianChina
| | - Yan Tong
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Kui Li
- Institution of Genomics and BioinformaticsSouth China Agricultural UniversityGuangzhouChina
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Cong Shi
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
- University of the Chinese Academy of SciencesBeijingChina
| | - Yun Zhang
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Yun‐Long Liu
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Jian‐Jun Jiang
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Yuan Liu
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - En‐Hua Xia
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Hui Huang
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Li‐Ping Zhang
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Dan Zhang
- Institution of Genomics and BioinformaticsSouth China Agricultural UniversityGuangzhouChina
| | - Chao Shi
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Wen‐Kai Jiang
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - You‐Jie Zhao
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Shu‐Yan Mao
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Jun‐ying Jiao
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Ping‐Zhen Xu
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Li‐Li Yang
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| | - Li‐Zhi Gao
- Institution of Genomics and BioinformaticsSouth China Agricultural UniversityGuangzhouChina
- Plant Germplasm and Genomics CenterGermplasm Bank of Wild Species in Southwestern China Kunming Institute of Botany Chinese Academy of SciencesKunmingChina
| |
Collapse
|
758
|
Hunt SP, Jarvis DE, Larsen DJ, Mosyakin SL, Kolano BA, Jackson EW, Martin SL, Jellen EN, Maughan PJ. A Chromosome-Scale Assembly of the Garden Orach ( Atriplex hortensis L.) Genome Using Oxford Nanopore Sequencing. FRONTIERS IN PLANT SCIENCE 2020; 11:624. [PMID: 32523593 PMCID: PMC7261831 DOI: 10.3389/fpls.2020.00624] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Accepted: 04/22/2020] [Indexed: 05/16/2023]
Abstract
Atriplex hortensis (2n = 2x = 18, 1C genome size ∼1.1 gigabases), also known as garden orach and mountain-spinach, is a highly nutritious, broadleaf annual of the Amaranthaceae-Chenopodiaceae alliance (Chenopodiaceae sensu stricto, subfam. Chenopodioideae) that has spread in cultivation from its native primary domestication area in Eurasia to other temperate and subtropical regions worldwide. Atriplex L. is a highly complex but, as understood now, a monophyletic group of mainly halophytic and/or xerophytic plants, of which A. hortensis has been a vegetable of minor importance in some areas of Eurasia (from Central Asia to the Mediterranean) at least since antiquity. Nonetheless, it is a crop with tremendous nutritional potential due primarily to its exceptional leaf and seed protein quantities (approaching 30%) and quality (high levels of lysine). Although there is some literature describing the taxonomy and production of A. hortensis, there is a general lack of genetic and genomic data that would otherwise help elucidate the genetic variation, phylogenetic positioning, and future potential of the species. Here, we report the assembly of the first high-quality, chromosome-scale reference genome for A. hortensis cv. "Golden." Long-read data from Oxford Nanopore's MinION DNA sequencer was assembled with the program Canu and polished with Illumina short reads. Contigs were scaffolded to chromosome scale using chromatin-proximity maps (Hi-C) yielding a final assembly containing 1,325 scaffolds with a N50 of 98.9 Mb - with 94.7% of the assembly represented in the nine largest, chromosome-scale scaffolds. Sixty-six percent of the genome was classified as highly repetitive DNA, with the most common repetitive elements being Gypsy-(32%) and Copia-like (11%) long-terminal repeats. The annotation was completed using MAKER which identified 37,083 gene models and 2,555 tRNA genes. Completeness of the genome, assessed using the Benchmarking Universal Single Copy Orthologs (BUSCO) metric, identified 97.5% of the conserved orthologs as complete, with only 2.2% being duplicated, reflecting the diploid nature of A. hortensis. A resequencing panel of 21 wild, unimproved and cultivated A. hortensis accessions revealed three distinct populations with little variation within subpopulations. These resources provide vital information to better understand A. hortensis and facilitate future study.
Collapse
Affiliation(s)
- Spencer P. Hunt
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT, United States
| | - David E. Jarvis
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT, United States
| | - Dallas J. Larsen
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT, United States
| | - Sergei L. Mosyakin
- M.G. Kholodny Institute of Botany, National Academy of Sciences of Ukraine, Kyiv, Ukraine
| | - Bozena A. Kolano
- Institute of Biology, Biotechnology and Environmental Protection, Faculty of Natural Sciences, University of Silesia in Katowice, Katowice, Poland
| | | | - Sara L. Martin
- Agriculture and Agri-Food Canada, Ottawa Research and Development Centre, Ottawa, ON, Canada
| | - Eric N. Jellen
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT, United States
| | - Peter J. Maughan
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT, United States
| |
Collapse
|
759
|
Liu J, Seetharam AS, Chougule K, Ou S, Swentowsky KW, Gent JI, Llaca V, Woodhouse MR, Manchanda N, Presting GG, Kudrna DA, Alabady M, Hirsch CN, Fengler KA, Ware D, Michael TP, Hufford MB, Dawe RK. Gapless assembly of maize chromosomes using long-read technologies. Genome Biol 2020; 21:121. [PMID: 32434565 PMCID: PMC7238635 DOI: 10.1186/s13059-020-02029-9] [Citation(s) in RCA: 78] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2020] [Accepted: 04/23/2020] [Indexed: 12/16/2022] Open
Abstract
Creating gapless telomere-to-telomere assemblies of complex genomes is one of the ultimate challenges in genomics. We use two independent assemblies and an optical map-based merging pipeline to produce a maize genome (B73-Ab10) composed of 63 contigs and a contig N50 of 162 Mb. This genome includes gapless assemblies of chromosome 3 (236 Mb) and chromosome 9 (162 Mb), and 53 Mb of the Ab10 meiotic drive haplotype. The data also reveal the internal structure of seven centromeres and five heterochromatic knobs, showing that the major tandem repeat arrays (CentC, knob180, and TR-1) are discontinuous and frequently interspersed with retroelements.
Collapse
Affiliation(s)
- Jianing Liu
- Department of Genetics, University of Georgia, Athens, GA, 30602, USA
| | - Arun S Seetharam
- Genome Informatics Facility, Iowa State University, Ames, IA, 50011, USA
| | - Kapeel Chougule
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| | - Kyle W Swentowsky
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA
| | - Jonathan I Gent
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA
| | - Victor Llaca
- Corteva Agriscience™, 8325 NW 62nd Ave, Johnston, IA, 50131, USA
| | | | - Nancy Manchanda
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| | - Gernot G Presting
- Molecular Biosciences and Bioengineering, University of Hawaii, Honolulu, HI, 96822, USA
| | - David A Kudrna
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Magdy Alabady
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA
- Georgia Genomics and Bioinformatics Core Laboratory, University of Georgia, Athens, GA, 30602, USA
| | - Candice N Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
| | - Kevin A Fengler
- Corteva Agriscience™, 8325 NW 62nd Ave, Johnston, IA, 50131, USA
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
- USDA ARS NAA Robert W. Holley Center for Agriculture and Health, Agricultural Research Service, Ithaca, NY, 14853, USA
| | - Todd P Michael
- Informatics Department, J. Craig Venter Institute, La Jolla, CA, USA
| | - Matthew B Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| | - R Kelly Dawe
- Department of Genetics, University of Georgia, Athens, GA, 30602, USA.
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA.
| |
Collapse
|
760
|
Liu J, Seetharam AS, Chougule K, Ou S, Swentowsky KW, Gent JI, Llaca V, Woodhouse MR, Manchanda N, Presting GG, Kudrna DA, Alabady M, Hirsch CN, Fengler KA, Ware D, Michael TP, Hufford MB, Dawe RK. Gapless assembly of maize chromosomes using long-read technologies. Genome Biol 2020. [PMID: 32434565 DOI: 10.1101/2020.01.14.906230v1.full] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/26/2023] Open
Abstract
Creating gapless telomere-to-telomere assemblies of complex genomes is one of the ultimate challenges in genomics. We use two independent assemblies and an optical map-based merging pipeline to produce a maize genome (B73-Ab10) composed of 63 contigs and a contig N50 of 162 Mb. This genome includes gapless assemblies of chromosome 3 (236 Mb) and chromosome 9 (162 Mb), and 53 Mb of the Ab10 meiotic drive haplotype. The data also reveal the internal structure of seven centromeres and five heterochromatic knobs, showing that the major tandem repeat arrays (CentC, knob180, and TR-1) are discontinuous and frequently interspersed with retroelements.
Collapse
Affiliation(s)
- Jianing Liu
- Department of Genetics, University of Georgia, Athens, GA, 30602, USA
| | - Arun S Seetharam
- Genome Informatics Facility, Iowa State University, Ames, IA, 50011, USA
| | - Kapeel Chougule
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| | - Kyle W Swentowsky
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA
| | - Jonathan I Gent
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA
| | - Victor Llaca
- Corteva Agriscience™, 8325 NW 62nd Ave, Johnston, IA, 50131, USA
| | | | - Nancy Manchanda
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| | - Gernot G Presting
- Molecular Biosciences and Bioengineering, University of Hawaii, Honolulu, HI, 96822, USA
| | - David A Kudrna
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Magdy Alabady
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA
- Georgia Genomics and Bioinformatics Core Laboratory, University of Georgia, Athens, GA, 30602, USA
| | - Candice N Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
| | - Kevin A Fengler
- Corteva Agriscience™, 8325 NW 62nd Ave, Johnston, IA, 50131, USA
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
- USDA ARS NAA Robert W. Holley Center for Agriculture and Health, Agricultural Research Service, Ithaca, NY, 14853, USA
| | - Todd P Michael
- Informatics Department, J. Craig Venter Institute, La Jolla, CA, USA
| | - Matthew B Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| | - R Kelly Dawe
- Department of Genetics, University of Georgia, Athens, GA, 30602, USA.
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA.
| |
Collapse
|
761
|
Yan H, Bombarely A, Li S. DeepTE: a computational method for de novo classification of transposons with convolutional neural network. Bioinformatics 2020; 36:4269-4275. [DOI: 10.1093/bioinformatics/btaa519] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 04/12/2020] [Accepted: 05/12/2020] [Indexed: 01/23/2023] Open
Abstract
Abstract
Motivation
Transposable elements (TEs) classification is an essential step to decode their roles in genome evolution. With a large number of genomes from non-model species becoming available, accurate and efficient TE classification has emerged as a new challenge in genomic sequence analysis.
Results
We developed a novel tool, DeepTE, which classifies unknown TEs using convolutional neural networks (CNNs). DeepTE transferred sequences into input vectors based on k-mer counts. A tree structured classification process was used where eight models were trained to classify TEs into super families and orders. DeepTE also detected domains inside TEs to correct false classification. An additional model was trained to distinguish between non-TEs and TEs in plants. Given unclassified TEs of different species, DeepTE can classify TEs into seven orders, which include 15, 24 and 16 super families in plants, metazoans and fungi, respectively. In several benchmarking tests, DeepTE outperformed other existing tools for TE classification. In conclusion, DeepTE successfully leverages CNN for TE classification, and can be used to precisely classify TEs in newly sequenced eukaryotic genomes.
Availability and implementation
DeepTE is accessible at https://github.com/LiLabAtVT/DeepTE.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Haidong Yan
- School of Plant and Environmental Sciences (SPES), Virginia Tech, Blacksburg, VA 24061, USA
| | - Aureliano Bombarely
- School of Plant and Environmental Sciences (SPES), Virginia Tech, Blacksburg, VA 24061, USA
- Department of Life Sciences, University of Milan, Milan 20122, Italy
| | - Song Li
- School of Plant and Environmental Sciences (SPES), Virginia Tech, Blacksburg, VA 24061, USA
- Graduate Program in Genetics, Bioinformatics and Computational Biology (GBCB), Virginia Tech, Blacksburg, VA 24061, USA
| |
Collapse
|
762
|
Ou S, Liu J, Chougule KM, Fungtammasan A, Seetharam AS, Stein JC, Llaca V, Manchanda N, Gilbert AM, Wei S, Chin CS, Hufnagel DE, Pedersen S, Snodgrass SJ, Fengler K, Woodhouse M, Walenz BP, Koren S, Phillippy AM, Hannigan BT, Dawe RK, Hirsch CN, Hufford MB, Ware D. Effect of sequence depth and length in long-read assembly of the maize inbred NC358. Nat Commun 2020; 11:2288. [PMID: 32385271 PMCID: PMC7211024 DOI: 10.1038/s41467-020-16037-7] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2019] [Accepted: 04/09/2020] [Indexed: 01/23/2023] Open
Abstract
Improvements in long-read data and scaffolding technologies have enabled rapid generation of reference-quality assemblies for complex genomes. Still, an assessment of critical sequence depth and read length is important for allocating limited resources. To this end, we have generated eight assemblies for the complex genome of the maize inbred line NC358 using PacBio datasets ranging from 20 to 75 × genomic depth and with N50 subread lengths of 11-21 kb. Assemblies with ≤30 × depth and N50 subread length of 11 kb are highly fragmented, with even low-copy genic regions showing degradation at 20 × depth. Distinct sequence-quality thresholds are observed for complete assembly of genes, transposable elements, and highly repetitive genomic features such as telomeres, heterochromatic knobs, and centromeres. In addition, we show high-quality optical maps can dramatically improve contiguity in even our most fragmented base assembly. This study provides a useful resource allocation reference to the community as long-read technologies continue to mature.
Collapse
Affiliation(s)
- Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa, 50011, USA
| | - Jianing Liu
- Department of Genetics, University of Georgia, Athens, Georgia, 30602, USA
| | - Kapeel M Chougule
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 11724, USA
| | | | - Arun S Seetharam
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa, 50011, USA
- Genome Informatics Facility, Iowa State University, Ames, Iowa, 50011, USA
| | - Joshua C Stein
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 11724, USA
| | - Victor Llaca
- Genomics Technologies, Applied Science and Technology, Corteva Agriscience TM, Johnston, Iowa, 50131, USA
| | - Nancy Manchanda
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa, 50011, USA
| | - Amanda M Gilbert
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota, 55108, USA
| | - Sharon Wei
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 11724, USA
| | - Chen-Shan Chin
- DNAnexus, Inc., Mountain View, San Francisco, California, 94040, USA
| | - David E Hufnagel
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa, 50011, USA
| | - Sarah Pedersen
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa, 50011, USA
| | - Samantha J Snodgrass
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa, 50011, USA
| | - Kevin Fengler
- Genomics Technologies, Applied Science and Technology, Corteva Agriscience TM, Johnston, Iowa, 50131, USA
| | - Margaret Woodhouse
- USDA ARS Corn Insects and Crop Genetics Research Unit, Ames, Iowa, 50011, USA
| | - Brian P Walenz
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, 20892, USA
| | - Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, 20892, USA
| | - Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, 20892, USA
| | - Brett T Hannigan
- DNAnexus, Inc., Mountain View, San Francisco, California, 94040, USA
| | - R Kelly Dawe
- Department of Genetics, University of Georgia, Athens, Georgia, 30602, USA.
| | - Candice N Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota, 55108, USA.
| | - Matthew B Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa, 50011, USA.
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 11724, USA.
- USDA ARS Robert W. Holley Center for Agriculture and Health, Agricultural Research Service, Ithaca, New York, 14853, USA.
| |
Collapse
|
763
|
Gao S, Wang B, Xie S, Xu X, Zhang J, Pei L, Yu Y, Yang W, Zhang Y. A high-quality reference genome of wild Cannabis sativa. HORTICULTURE RESEARCH 2020; 7:73. [PMID: 32377363 PMCID: PMC7195422 DOI: 10.1038/s41438-020-0295-3] [Citation(s) in RCA: 59] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Revised: 03/19/2020] [Accepted: 03/19/2020] [Indexed: 05/02/2023]
Abstract
Cannabis sativa is a well-known plant species that has great economic and ecological significance. An incomplete genome of cloned C. sativa was obtained by using SOAPdenovo software in 2011. To further explore the utilization of this plant resource, we generated an updated draft genome sequence for wild-type varieties of C. sativa in China using PacBio single-molecule sequencing and Hi-C technology. Our assembled genome is approximately 808 Mb, with scaffold and contig N50 sizes of 83.00 Mb and 513.57 kb, respectively. Repetitive elements account for 74.75% of the genome. A total of 38,828 protein-coding genes were annotated, 98.20% of which were functionally annotated. We provide the first comprehensive de novo genome of wild-type varieties of C. sativa distributed in Tibet, China. Due to long-term growth in the wild environment, these varieties exhibit higher heterozygosity and contain more genetic information. This genetic resource is of great value for future investigations of cannabinoid metabolic pathways and will aid in promoting the commercial production of C. sativa and the effective utilization of cannabinoids. The assembled genome is also a valuable resource for intensively and effectively investigating the C. sativa genome further in the future.
Collapse
Affiliation(s)
- Shan Gao
- Institute of Forensic Science, Ministry of Public Security, No. 17 South Muxidi Lane, Xicheng District, Beijing, 100038 China
| | - Baishi Wang
- Institute of Forensic Science, Ministry of Public Security, No. 17 South Muxidi Lane, Xicheng District, Beijing, 100038 China
| | - Shanshan Xie
- Beijing Century Legend Bioscience Co., Ltd., Beijing, 102300 China
| | - Xiaoyu Xu
- Institute of Forensic Science, Ministry of Public Security, No. 17 South Muxidi Lane, Xicheng District, Beijing, 100038 China
| | - Jin Zhang
- Institute of Forensic Science, Ministry of Public Security, No. 17 South Muxidi Lane, Xicheng District, Beijing, 100038 China
| | - Li Pei
- Institute of Forensic Science, Ministry of Public Security, No. 17 South Muxidi Lane, Xicheng District, Beijing, 100038 China
| | - Yongyi Yu
- Beijing Century Legend Bioscience Co., Ltd., Beijing, 102300 China
| | - Weifei Yang
- Beijing Century Legend Bioscience Co., Ltd., Beijing, 102300 China
| | - Ying Zhang
- Institute of Forensic Science, Ministry of Public Security, No. 17 South Muxidi Lane, Xicheng District, Beijing, 100038 China
| |
Collapse
|
764
|
Flynn JM, Hubley R, Goubert C, Rosen J, Clark AG, Feschotte C, Smit AF. RepeatModeler2 for automated genomic discovery of transposable element families. Proc Natl Acad Sci U S A 2020; 117:9451-9457. [PMID: 32300014 PMCID: PMC7196820 DOI: 10.1073/pnas.1921046117] [Citation(s) in RCA: 1807] [Impact Index Per Article: 361.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
The accelerating pace of genome sequencing throughout the tree of life is driving the need for improved unsupervised annotation of genome components such as transposable elements (TEs). Because the types and sequences of TEs are highly variable across species, automated TE discovery and annotation are challenging and time-consuming tasks. A critical first step is the de novo identification and accurate compilation of sequence models representing all of the unique TE families dispersed in the genome. Here we introduce RepeatModeler2, a pipeline that greatly facilitates this process. This program brings substantial improvements over the original version of RepeatModeler, one of the most widely used tools for TE discovery. In particular, this version incorporates a module for structural discovery of complete long terminal repeat (LTR) retroelements, which are widespread in eukaryotic genomes but recalcitrant to automated identification because of their size and sequence complexity. We benchmarked RepeatModeler2 on three model species with diverse TE landscapes and high-quality, manually curated TE libraries: Drosophila melanogaster (fruit fly), Danio rerio (zebrafish), and Oryza sativa (rice). In these three species, RepeatModeler2 identified approximately 3 times more consensus sequences matching with >95% sequence identity and sequence coverage to the manually curated sequences than the original RepeatModeler. As expected, the greatest improvement is for LTR retroelements. Thus, RepeatModeler2 represents a valuable addition to the genome annotation toolkit that will enhance the identification and study of TEs in eukaryotic genome sequences. RepeatModeler2 is available as source code or a containerized package under an open license (https://github.com/Dfam-consortium/RepeatModeler, http://www.repeatmasker.org/RepeatModeler/).
Collapse
Affiliation(s)
- Jullien M Flynn
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853
| | | | - Clément Goubert
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853
| | - Jeb Rosen
- Institute for Systems Biology, Seattle, WA 98109
| | - Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853;
| | - Cédric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853;
| | - Arian F Smit
- Institute for Systems Biology, Seattle, WA 98109
| |
Collapse
|
765
|
Chen ZJ, Sreedasyam A, Ando A, Song Q, De Santiago LM, Hulse-Kemp AM, Ding M, Ye W, Kirkbride RC, Jenkins J, Plott C, Lovell J, Lin YM, Vaughn R, Liu B, Simpson S, Scheffler BE, Wen L, Saski CA, Grover CE, Hu G, Conover JL, Carlson JW, Shu S, Boston LB, Williams M, Peterson DG, McGee K, Jones DC, Wendel JF, Stelly DM, Grimwood J, Schmutz J. Genomic diversifications of five Gossypium allopolyploid species and their impact on cotton improvement. Nat Genet 2020; 52:525-533. [PMID: 32313247 PMCID: PMC7203012 DOI: 10.1038/s41588-020-0614-5] [Citation(s) in RCA: 249] [Impact Index Per Article: 49.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2020] [Accepted: 03/16/2020] [Indexed: 01/08/2023]
Abstract
Polyploidy is an evolutionary innovation for many animals and all flowering plants, but its impact on selection and domestication remains elusive. Here we analyze genome evolution and diversification for all five allopolyploid cotton species, including economically important Upland and Pima cottons. Although these polyploid genomes are conserved in gene content and synteny, they have diversified by subgenomic transposon exchanges that equilibrate genome size, evolutionary rate heterogeneities and positive selection between homoeologs within and among lineages. These differential evolutionary trajectories are accompanied by gene-family diversification and homoeolog expression divergence among polyploid lineages. Selection and domestication drive parallel gene expression similarities in fibers of two cultivated cottons, involving coexpression networks and N6-methyladenosine RNA modifications. Furthermore, polyploidy induces recombination suppression, which correlates with altered epigenetic landscapes and can be overcome by wild introgression. These genomic insights will empower efforts to manipulate genetic recombination and modify epigenetic landscapes and target genes for crop improvement. Sequencing and genomic diversification of five allopolyploid cotton species provide insights into polyploid genome evolution and epigenetic landscapes for cotton improvement.
Collapse
Affiliation(s)
- Z Jeffrey Chen
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, USA. .,State Key Laboratory for Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China.
| | | | - Atsumi Ando
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, USA
| | - Qingxin Song
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, USA.,State Key Laboratory for Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Luis M De Santiago
- Department of Soil and Crop Sciences, Texas A&M University System, College Station, TX, USA
| | - Amanda M Hulse-Kemp
- US Department of Agriculture-Agricultural Research Service, Genomics and Bioinformatics Research Unit, Raleigh, NC, USA
| | - Mingquan Ding
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, USA.,College of Agriculture and Food Science, Zhejiang A&F University, Lin'an, China
| | - Wenxue Ye
- State Key Laboratory for Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Ryan C Kirkbride
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, USA
| | - Jerry Jenkins
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | | | - John Lovell
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | - Yu-Ming Lin
- Department of Soil and Crop Sciences, Texas A&M University System, College Station, TX, USA
| | - Robert Vaughn
- Department of Soil and Crop Sciences, Texas A&M University System, College Station, TX, USA
| | - Bo Liu
- Department of Soil and Crop Sciences, Texas A&M University System, College Station, TX, USA
| | - Sheron Simpson
- US Department of Agriculture-Agricultural Research Service, Genomics and Bioinformatics Research Unit, Stoneville, MS, USA
| | - Brian E Scheffler
- US Department of Agriculture-Agricultural Research Service, Genomics and Bioinformatics Research Unit, Stoneville, MS, USA
| | - Li Wen
- Department of Plant and Environmental Sciences, Clemson University, Clemson, SC, USA
| | - Christopher A Saski
- Department of Plant and Environmental Sciences, Clemson University, Clemson, SC, USA
| | - Corrinne E Grover
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, USA
| | - Guanjing Hu
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, USA
| | - Justin L Conover
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, USA
| | - Joseph W Carlson
- The US Department of Energy Joint Genome Institute, Walnut Creek, CA, USA
| | - Shengqiang Shu
- The US Department of Energy Joint Genome Institute, Walnut Creek, CA, USA
| | - Lori B Boston
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA
| | | | - Daniel G Peterson
- Institute for Genomics, Biocomputing and Biotechnology and Department of Plant and Soil Sciences, Mississippi State University, Mississippi State, MS, USA
| | - Keith McGee
- School of Agriculture and Applied Sciences, Alcorn State University, Lorman, MS, USA
| | - Don C Jones
- Agriculture and Environmental Research, Cotton Incorporated, Cary, NC, USA
| | - Jonathan F Wendel
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, USA
| | - David M Stelly
- Department of Soil and Crop Sciences, Texas A&M University System, College Station, TX, USA
| | - Jane Grimwood
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA.
| | - Jeremy Schmutz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA.,The US Department of Energy Joint Genome Institute, Walnut Creek, CA, USA
| |
Collapse
|
766
|
Wang H, Sun S, Ge W, Zhao L, Hou B, Wang K, Lyu Z, Chen L, Xu S, Guo J, Li M, Su P, Li X, Wang G, Bo C, Fang X, Zhuang W, Cheng X, Wu J, Dong L, Chen W, Li W, Xiao G, Zhao J, Hao Y, Xu Y, Gao Y, Liu W, Liu Y, Yin H, Li J, Li X, Zhao Y, Wang X, Ni F, Ma X, Li A, Xu SS, Bai G, Nevo E, Gao C, Ohm H, Kong L. Horizontal gene transfer of Fhb7 from fungus underlies Fusarium head blight resistance in wheat. Science 2020; 368:science.aba5435. [PMID: 32273397 DOI: 10.1126/science.aba5435] [Citation(s) in RCA: 350] [Impact Index Per Article: 70.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Accepted: 03/26/2020] [Indexed: 12/22/2022]
Abstract
Fusarium head blight (FHB), a fungal disease caused by Fusarium species that produce food toxins, currently devastates wheat production worldwide, yet few resistance resources have been discovered in wheat germplasm. Here, we cloned the FHB resistance gene Fhb7 by assembling the genome of Thinopyrum elongatum, a species used in wheat distant hybridization breeding. Fhb7 encodes a glutathione S-transferase (GST) and confers broad resistance to Fusarium species by detoxifying trichothecenes through de-epoxidation. Fhb7 GST homologs are absent in plants, and our evidence supports that Th. elongatum has gained Fhb7 through horizontal gene transfer (HGT) from an endophytic Epichloë species. Fhb7 introgressions in wheat confers resistance to both FHB and crown rot in diverse wheat backgrounds without yield penalty, providing a solution for Fusarium resistance breeding.
Collapse
Affiliation(s)
- Hongwei Wang
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China.
| | - Silong Sun
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Wenyang Ge
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Lanfei Zhao
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Bingqian Hou
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Kai Wang
- Novogene Bioinformatics Institute, Beijing 100083, PR China
| | - Zhongfan Lyu
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Liyang Chen
- Novogene Bioinformatics Institute, Beijing 100083, PR China
| | - Shoushen Xu
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Jun Guo
- Crop Research Institute, Shandong Academy of Agricultural Sciences, Jinan, Shandong 250100, PR China
| | - Min Li
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Peisen Su
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Xuefeng Li
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Guiping Wang
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Cunyao Bo
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Xiaojian Fang
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Wenwen Zhuang
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Xinxin Cheng
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Jianwen Wu
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Luhao Dong
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Wuying Chen
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Wen Li
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Guilian Xiao
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Jinxiao Zhao
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Yongchao Hao
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Ying Xu
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Yu Gao
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Wenjing Liu
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Yanhe Liu
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Huayan Yin
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Jiazhu Li
- College of Chemistry and Chemical Engineering, Yantai University, Yantai, Shandong 264005, PR China
| | - Xiang Li
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Yan Zhao
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Xiaoqian Wang
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Fei Ni
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Xin Ma
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Anfei Li
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Steven S Xu
- USDA-ARS, Cereal Crops Research Unit, Edward T. Schafer Agricultural Research Center, Fargo, ND 58102, USA
| | - Guihua Bai
- USDA-ARS, Hard Winter Wheat Genetics Research Unit, Manhattan, KS 66506, USA
| | - Eviatar Nevo
- Institute of Evolution, University of Haifa, Mount Carmel, Haifa 3498838, Israel
| | - Caixia Gao
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, PR China
| | - Herbert Ohm
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA
| | - Lingrang Kong
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai'an, Shandong 271018, PR China.
| |
Collapse
|
767
|
Züst T, Strickler SR, Powell AF, Mabry ME, An H, Mirzaei M, York T, Holland CK, Kumar P, Erb M, Petschenka G, Gómez JM, Perfectti F, Müller C, Pires JC, Mueller LA, Jander G. Independent evolution of ancestral and novel defenses in a genus of toxic plants ( Erysimum, Brassicaceae). eLife 2020; 9:e51712. [PMID: 32252891 PMCID: PMC7180059 DOI: 10.7554/elife.51712] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2019] [Accepted: 03/24/2020] [Indexed: 11/13/2022] Open
Abstract
Phytochemical diversity is thought to result from coevolutionary cycles as specialization in herbivores imposes diversifying selection on plant chemical defenses. Plants in the speciose genus Erysimum (Brassicaceae) produce both ancestral glucosinolates and evolutionarily novel cardenolides as defenses. Here we test macroevolutionary hypotheses on co-expression, co-regulation, and diversification of these potentially redundant defenses across this genus. We sequenced and assembled the genome of E. cheiranthoides and foliar transcriptomes of 47 additional Erysimum species to construct a phylogeny from 9868 orthologous genes, revealing several geographic clades but also high levels of gene discordance. Concentrations, inducibility, and diversity of the two defenses varied independently among species, with no evidence for trade-offs. Closely related, geographically co-occurring species shared similar cardenolide traits, but not glucosinolate traits, likely as a result of specific selective pressures acting on each defense. Ancestral and novel chemical defenses in Erysimum thus appear to provide complementary rather than redundant functions.
Collapse
Affiliation(s)
- Tobias Züst
- Institute of Plant Sciences, University of BernBernSwitzerland
| | | | | | - Makenzie E Mabry
- Division of Biological Sciences, University of MissouriColumbiaUnited States
| | - Hong An
- Division of Biological Sciences, University of MissouriColumbiaUnited States
| | | | | | | | | | - Matthias Erb
- Institute of Plant Sciences, University of BernBernSwitzerland
| | - Georg Petschenka
- Institut für Insektenbiotechnologie, Justus-Liebig-Universität GiessenGiessenGermany
| | - José-María Gómez
- Department of Functional and Evolutionary Ecology, Estación Experimental de Zonas Áridas (EEZA-CSIC)AlmeríaSpain
| | - Francisco Perfectti
- Research Unit Modeling Nature, Department of Genetics, University of GranadaGranadaSpain
| | - Caroline Müller
- Department of Chemical Ecology, Bielefeld UniversityBielefeldGermany
| | - J Chris Pires
- Division of Biological Sciences, University of MissouriColumbiaUnited States
| | | | | |
Collapse
|
768
|
The sterlet sturgeon genome sequence and the mechanisms of segmental rediploidization. Nat Ecol Evol 2020; 4:841-852. [PMID: 32231327 PMCID: PMC7269910 DOI: 10.1038/s41559-020-1166-x] [Citation(s) in RCA: 145] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Accepted: 02/27/2020] [Indexed: 12/20/2022]
Abstract
Sturgeons seem to be frozen in time. The archaic characteristics of this ancient fish lineage place it in a key phylogenetic position at the base of the ~30,000 modern teleost fish species. Moreover, sturgeons are notoriously polyploid, providing unique opportunities to investigate the evolution of polyploid genomes. We assembled a high-quality chromosome-level reference genome for the sterlet, Acipenser ruthenus. Our analysis revealed a very low protein evolution rate that is at least as slow as in other deep branches of the vertebrate tree, such as that of the coelacanth. We uncovered a whole-genome duplication that occurred in the Jurassic, early in the evolution of the entire sturgeon lineage. Following this polyploidization, the rediploidization of the genome included the loss of whole chromosomes in a segmental deduplication process. While known adaptive processes helped conserve a high degree of structural and functional tetraploidy over more than 180 million years, the reduction of redundancy of the polyploid genome seems to have been remarkably random. A genome assembly of the sterlet, Acipenser ruthenus, reveals a whole-genome duplication early in the evolution of the entire sturgeon lineage and provides details about the rediploidization of the genome.
Collapse
|
769
|
Cerbin S, Wai CM, VanBuren R, Jiang N. GingerRoot: A Novel DNA Transposon Encoding Integrase-Related Transposase in Plants and Animals. Genome Biol Evol 2020; 11:3181-3193. [PMID: 31633753 PMCID: PMC6839031 DOI: 10.1093/gbe/evz230] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/20/2019] [Indexed: 02/06/2023] Open
Abstract
Transposable elements represent the largest components of many eukaryotic genomes and different genomes harbor different combinations of elements. Here, we discovered a novel DNA transposon in the genome of the clubmoss Selaginella lepidophylla. Further searching for related sequences to the conserved DDE region uncovered the presence of this superfamily of elements in fish, coral, sea anemone, and other animal species. However, this element appears restricted to Bryophytes and Lycophytes in plants. This transposon, named GingerRoot, is associated with a 6 bp (base pair) target site duplication, and 100-150 bp terminal inverted repeats. Analysis of transposase sequences identified the DDE motif, a catalytic domain, which shows similarity to the integrase of Gypsy-like long terminal repeat retrotransposons, the most abundant component in plant genomes. A total of 77 intact and several hundred truncated copies of GingerRoot elements were identified in S. lepidophylla. Like Gypsy retrotransposons, GingerRoots show a lack of insertion preference near genes, which contrasts to the compact genome size of about 100 Mb. Nevertheless, a considerable portion of GingerRoot elements was found to carry gene fragments, suggesting the capacity of duplicating gene sequences is unlikely attributed to the proximity to genes. Elements carrying gene fragments appear to be less methylated, more diverged, and more distal to genes than those without gene fragments, indicating they are preferentially retained in gene-poor regions. This study has identified a broadly dispersed, novel DNA transposon, and the first plant DNA transposon with an integrase-related transposase, suggesting the possibility of de novo formation of Gypsy-like elements in plants.
Collapse
Affiliation(s)
- Stefan Cerbin
- Department of Horticulture, Michigan State University, East Lansing, MI 48824
| | - Ching Man Wai
- Department of Horticulture, Michigan State University, East Lansing, MI 48824
| | - Robert VanBuren
- Department of Horticulture, Michigan State University, East Lansing, MI 48824
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI 48824
| |
Collapse
|
770
|
Zhang W, Liu J, Zhang Y, Qiu J, Li Y, Zheng B, Hu F, Dai S, Huang X. A high-quality genome sequence of alkaligrass provides insights into halophyte stress tolerance. SCIENCE CHINA-LIFE SCIENCES 2020; 63:1269-1282. [DOI: 10.1007/s11427-020-1662-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Accepted: 03/01/2020] [Indexed: 02/07/2023]
|
771
|
Zhang Z, Chen Y, Zhang J, Ma X, Li Y, Li M, Wang D, Kang M, Wu H, Yang Y, Olson MS, DiFazio SP, Wan D, Liu J, Ma T. Improved genome assembly provides new insights into genome evolution in a desert poplar (Populus euphratica). Mol Ecol Resour 2020; 20. [PMID: 32034885 DOI: 10.1111/1755-0998.13142] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2019] [Revised: 01/21/2020] [Accepted: 02/03/2020] [Indexed: 12/30/2022]
Abstract
Populus euphratica is well adapted to extreme desert environments and is an important model species for elucidating the mechanisms of abiotic stress resistance in trees. The current assembly of P. euphratica genome is highly fragmented with many gaps and errors, thereby impeding downstream applications. Here, we report an improved chromosome-level reference genome of P. euphratica (v2.0) using single-molecule sequencing and chromosome conformation capture (Hi-C) technologies. Relative to the previous reference genome, our assembly represents a nearly 60-fold improvement in contiguity, with a scaffold N50 size of 28.59 Mb. Using this genome, we have found that extensive expansion of Gypsy elements in P. euphratica led to its rapid increase in genome size compared to any other Salicaceae species studied to date, and potentially contributed to adaptive divergence driven by insertions near genes involved in stress tolerance. We also detected a wide range of unique structural rearrangements in P. euphratica, including 2,549 translocations, 454 inversions, 121 tandem and 14 segmental duplications. Several key genes likely to be involved in tolerance to abiotic stress were identified within these regions. This high-quality genome represents a valuable resource for poplar breeding and genetic improvement in the future, as well as comparative genomic analysis with other Salicaceae species.
Collapse
Affiliation(s)
- Zhiyang Zhang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu, China
| | - Yang Chen
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu, China
| | - Junlin Zhang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu, China
| | - Xinzhi Ma
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu, China
| | - Yiling Li
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu, China
| | - Mengmeng Li
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu, China
| | - Deyan Wang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu, China
| | - Minghui Kang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu, China
| | - Haolin Wu
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu, China
| | - Yongzhi Yang
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology & College of Life Sciences, Lanzhou University, Lanzhou, China
| | - Matthew S Olson
- Department of Biological Sciences, Texas Tech University, Lubbock, TX, USA
| | - Stephen P DiFazio
- Department of Biology, West Virginia University, Morgantown, WV, USA
| | - Dongshi Wan
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology & College of Life Sciences, Lanzhou University, Lanzhou, China
| | - Jianquan Liu
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu, China.,State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology & College of Life Sciences, Lanzhou University, Lanzhou, China
| | - Tao Ma
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu, China
| |
Collapse
|
772
|
Manchanda N, Portwood JL, Woodhouse MR, Seetharam AS, Lawrence-Dill CJ, Andorf CM, Hufford MB. GenomeQC: a quality assessment tool for genome assemblies and gene structure annotations. BMC Genomics 2020; 21:193. [PMID: 32122303 PMCID: PMC7053122 DOI: 10.1186/s12864-020-6568-2] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Accepted: 02/07/2020] [Indexed: 11/28/2022] Open
Abstract
Background Genome assemblies are foundational for understanding the biology of a species. They provide a physical framework for mapping additional sequences, thereby enabling characterization of, for example, genomic diversity and differences in gene expression across individuals and tissue types. Quality metrics for genome assemblies gauge both the completeness and contiguity of an assembly and help provide confidence in downstream biological insights. To compare quality across multiple assemblies, a set of common metrics are typically calculated and then compared to one or more gold standard reference genomes. While several tools exist for calculating individual metrics, applications providing comprehensive evaluations of multiple assembly features are, perhaps surprisingly, lacking. Here, we describe a new toolkit that integrates multiple metrics to characterize both assembly and gene annotation quality in a way that enables comparison across multiple assemblies and assembly types. Results Our application, named GenomeQC, is an easy-to-use and interactive web framework that integrates various quantitative measures to characterize genome assemblies and annotations. GenomeQC provides researchers with a comprehensive summary of these statistics and allows for benchmarking against gold standard reference assemblies. Conclusions The GenomeQC web application is implemented in R/Shiny version 1.5.9 and Python 3.6 and is freely available at https://genomeqc.maizegdb.org/ under the GPL license. All source code and a containerized version of the GenomeQC pipeline is available in the GitHub repository https://github.com/HuffordLab/GenomeQC.
Collapse
Affiliation(s)
- Nancy Manchanda
- Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| | - John L Portwood
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Ames, IA, 50011, USA
| | | | - Arun S Seetharam
- Genome Informatics Facility, Iowa State University, Ames, IA, 50011, USA
| | - Carolyn J Lawrence-Dill
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, 50011, USA.,Department of Agronomy, Iowa State University, Ames, IA, 50011, USA
| | - Carson M Andorf
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Ames, IA, 50011, USA
| | - Matthew B Hufford
- Department of Ecology, Evolution and Organismal Biology, Iowa State University, Ames, IA, 50011, USA.
| |
Collapse
|
773
|
Li FW, Nishiyama T, Waller M, Frangedakis E, Keller J, Li Z, Fernandez-Pozo N, Barker MS, Bennett T, Blázquez MA, Cheng S, Cuming AC, de Vries J, de Vries S, Delaux PM, Diop IS, Harrison CJ, Hauser D, Hernández-García J, Kirbis A, Meeks JC, Monte I, Mutte SK, Neubauer A, Quandt D, Robison T, Shimamura M, Rensing SA, Villarreal JC, Weijers D, Wicke S, Wong GKS, Sakakibara K, Szövényi P. Anthoceros genomes illuminate the origin of land plants and the unique biology of hornworts. NATURE PLANTS 2020; 6:259-272. [PMID: 32170292 PMCID: PMC8075897 DOI: 10.1038/s41477-020-0618-2] [Citation(s) in RCA: 197] [Impact Index Per Article: 39.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Accepted: 02/11/2020] [Indexed: 05/12/2023]
Abstract
Hornworts comprise a bryophyte lineage that diverged from other extant land plants >400 million years ago and bears unique biological features, including a distinct sporophyte architecture, cyanobacterial symbiosis and a pyrenoid-based carbon-concentrating mechanism (CCM). Here, we provide three high-quality genomes of Anthoceros hornworts. Phylogenomic analyses place hornworts as a sister clade to liverworts plus mosses with high support. The Anthoceros genomes lack repeat-dense centromeres as well as whole-genome duplication, and contain a limited transcription factor repertoire. Several genes involved in angiosperm meristem and stomatal function are conserved in Anthoceros and upregulated during sporophyte development, suggesting possible homologies at the genetic level. We identified candidate genes involved in cyanobacterial symbiosis and found that LCIB, a Chlamydomonas CCM gene, is present in hornworts but absent in other plant lineages, implying a possible conserved role in CCM function. We anticipate that these hornwort genomes will serve as essential references for future hornwort research and comparative studies across land plants.
Collapse
Affiliation(s)
- Fay-Wei Li
- Boyce Thompson Institute, Ithaca, NY, USA.
- Plant Biology Section, Cornell University, Ithaca, NY, USA.
| | - Tomoaki Nishiyama
- Advanced Science Research Center, Kanazawa University, Ishikawa, Japan
| | - Manuel Waller
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | | | - Jean Keller
- LRSV, Université de Toulouse, CNRS, UPS Castanet-Tolosan, Toulouse, France
| | - Zheng Li
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA
| | | | - Michael S Barker
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, USA
| | - Tom Bennett
- Centre for Plant Sciences, Faculty of Biological Sciences, University of Leeds, Leeds, UK
| | - Miguel A Blázquez
- Instituto de Biología Molecular y Celular de Plantas, CSIC-Universidad Politécnica de Valencia, Valencia, Spain
| | - Shifeng Cheng
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Andrew C Cuming
- Centre for Plant Sciences, Faculty of Biological Sciences, University of Leeds, Leeds, UK
| | - Jan de Vries
- Institute for Microbiology and Genetics, Department of Applied Bioinformatics, Georg-August University Göttingen, Göttingen, Germany
| | - Sophie de Vries
- Institute of Population Genetics, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Pierre-Marc Delaux
- LRSV, Université de Toulouse, CNRS, UPS Castanet-Tolosan, Toulouse, France
| | - Issa S Diop
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - C Jill Harrison
- School of Biological Sciences, University of Bristol, Bristol, UK
| | | | - Jorge Hernández-García
- Instituto de Biología Molecular y Celular de Plantas, CSIC-Universidad Politécnica de Valencia, Valencia, Spain
| | - Alexander Kirbis
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - John C Meeks
- Department of Microbiology and Molecular Genetics, University of California, Davis, CA, USA
| | - Isabel Monte
- Department of Plant and Microbial Biology, University of Zurich, Zurich, Switzerland
| | - Sumanth K Mutte
- Laboratory of Biochemistry, Wageningen University & Research, Wageningen, the Netherlands
| | - Anna Neubauer
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - Dietmar Quandt
- Nees Institute for Biodiversity of Plants, University of Bonn, Bonn, Germany
| | - Tanner Robison
- Boyce Thompson Institute, Ithaca, NY, USA
- Plant Biology Section, Cornell University, Ithaca, NY, USA
| | - Masaki Shimamura
- Graduate School of Integrated Sciences for Life, Hiroshima University, Hiroshima, Japan
| | - Stefan A Rensing
- Faculty of Biology, Philipps University of Marburg, Marburg, Germany
- BIOSS Centre for Biological Signalling Studies, University of Freiburg, Freiburg, Germany
- LOEWE Center for Synthetic Microbiology (SYNMIKRO), University of Marburg, Marburg, Germany
| | - Juan Carlos Villarreal
- Department of Biology, Laval University, Quebec City, Quebec, Canada
- Smithsonian Tropical Research Institute, Balboa, Panamá
| | - Dolf Weijers
- Laboratory of Biochemistry, Wageningen University & Research, Wageningen, the Netherlands
| | - Susann Wicke
- Institute for Evolution and Biodiversity, University of Muenster, Münster, Germany
| | - Gane K-S Wong
- Department of Biological Sciences, Department of Medicine, University of Alberta, Edmonton, Alberta, Canada
- BGI-Shenzhen, Shenzhen, China
| | | | - Péter Szövényi
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland.
- Zurich-Basel Plant Science Center, Zurich, Switzerland.
| |
Collapse
|
774
|
Sturtevant D, Lu S, Zhou ZW, Shen Y, Wang S, Song JM, Zhong J, Burks DJ, Yang ZQ, Yang QY, Cannon AE, Herrfurth C, Feussner I, Borisjuk L, Munz E, Verbeck GF, Wang X, Azad RK, Singleton B, Dyer JM, Chen LL, Chapman KD, Guo L. The genome of jojoba ( Simmondsia chinensis): A taxonomically isolated species that directs wax ester accumulation in its seeds. SCIENCE ADVANCES 2020; 6:eaay3240. [PMID: 32195345 PMCID: PMC7065883 DOI: 10.1126/sciadv.aay3240] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2019] [Accepted: 12/16/2019] [Indexed: 05/10/2023]
Abstract
Seeds of the desert shrub, jojoba (Simmondsia chinensis), are an abundant, renewable source of liquid wax esters, which are valued additives in cosmetic products and industrial lubricants. Jojoba is relegated to its own taxonomic family, and there is little genetic information available to elucidate its phylogeny. Here, we report the high-quality, 887-Mb genome of jojoba assembled into 26 chromosomes with 23,490 protein-coding genes. The jojoba genome has only the whole-genome triplication (γ) shared among eudicots and no recent duplications. These genomic resources coupled with extensive transcriptome, proteome, and lipidome data helped to define heterogeneous pathways and machinery for lipid synthesis and storage, provided missing evolutionary history information for this taxonomically segregated dioecious plant species, and will support efforts to improve the agronomic properties of jojoba.
Collapse
Affiliation(s)
- Drew Sturtevant
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, TX, USA
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
| | - Shaoping Lu
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
| | - Zhi-Wei Zhou
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Yin Shen
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Shuo Wang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Jia-Ming Song
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Jinshun Zhong
- Institute for Plant Genetics, Heinrich Heine University, Dusseldorf, NRW, Germany
| | - David J. Burks
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, TX, USA
| | - Zhi-Quan Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Qing-Yong Yang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
| | - Ashley E. Cannon
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, TX, USA
| | - Cornelia Herrfurth
- Department of Plant Biochemistry and Service Unit for Metabolomics and Lipidomics, Albrecht-von-Haller-Institute and Goettingen Center for Molecular Biosciences (GZMB), University of Goettingen, Goettingen, Germany
| | - Ivo Feussner
- Department of Plant Biochemistry and Service Unit for Metabolomics and Lipidomics, Albrecht-von-Haller-Institute and Goettingen Center for Molecular Biosciences (GZMB), University of Goettingen, Goettingen, Germany
| | - Ljudmilla Borisjuk
- Leibniz-Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany
| | - Eberhard Munz
- Leibniz-Institute of Plant Genetics and Crop Plant Research (IPK), Gatersleben, Germany
| | - Guido F. Verbeck
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, TX, USA
- Department of Chemistry, University of North Texas, Denton, TX, USA
| | - Xuexia Wang
- Department of Mathematics, University of North Texas, Denton, TX, USA
| | - Rajeev K. Azad
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, TX, USA
- Department of Mathematics, University of North Texas, Denton, TX, USA
| | - Brenda Singleton
- USDA-ARS, US Arid-Land Agricultural Research Center, Maricopa, AZ, USA
| | - John M. Dyer
- USDA-ARS, US Arid-Land Agricultural Research Center, Maricopa, AZ, USA
| | - Ling-Ling Chen
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, China
- Corresponding author. (L.-L.C.); (K.D.C.); (L.G.)
| | - Kent D. Chapman
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, TX, USA
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
- Corresponding author. (L.-L.C.); (K.D.C.); (L.G.)
| | - Liang Guo
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, China
- Corresponding author. (L.-L.C.); (K.D.C.); (L.G.)
| |
Collapse
|
775
|
VanBuren R, Man Wai C, Wang X, Pardo J, Yocca AE, Wang H, Chaluvadi SR, Han G, Bryant D, Edger PP, Messing J, Sorrells ME, Mockler TC, Bennetzen JL, Michael TP. Exceptional subgenome stability and functional divergence in the allotetraploid Ethiopian cereal teff. Nat Commun 2020; 11:884. [PMID: 32060277 PMCID: PMC7021729 DOI: 10.1038/s41467-020-14724-z] [Citation(s) in RCA: 99] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2019] [Accepted: 01/30/2020] [Indexed: 12/22/2022] Open
Abstract
Teff (Eragrostis tef) is a cornerstone of food security in the Horn of Africa, where it is prized for stress resilience, grain nutrition, and market value. Here, we report a chromosome-scale assembly of allotetraploid teff (variety Dabbi) and patterns of subgenome dynamics. The teff genome contains two complete sets of homoeologous chromosomes, with most genes maintaining as syntenic gene pairs. TE analysis allows us to estimate that the teff polyploidy event occurred ~1.1 million years ago (mya) and that the two subgenomes diverged ~5.0 mya. Despite this divergence, we detect no large-scale structural rearrangements, homoeologous exchanges, or biased gene loss, in contrast to many other allopolyploids. The two teff subgenomes have partitioned their ancestral functions based on divergent expression across a diverse expression atlas. Together, these genomic resources will be useful for accelerating breeding of this underutilized grain crop and for fundamental insights into polyploid genome evolution.
Collapse
Affiliation(s)
- Robert VanBuren
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA.
- Plant Resilience Institute, Michigan State University, East Lansing, MI, 48824, USA.
| | - Ching Man Wai
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
- Plant Resilience Institute, Michigan State University, East Lansing, MI, 48824, USA
| | - Xuewen Wang
- Department of Genetics, University of Georgia, Athens, GA, 30602, USA
| | - Jeremy Pardo
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
- Plant Resilience Institute, Michigan State University, East Lansing, MI, 48824, USA
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Alan E Yocca
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Hao Wang
- Department of Genetics, University of Georgia, Athens, GA, 30602, USA
| | | | - Guomin Han
- Department of Genetics, University of Georgia, Athens, GA, 30602, USA
| | - Douglas Bryant
- Donald Danforth Plant Science Center, St. Louis, MO, 63132, USA
| | - Patrick P Edger
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
| | - Joachim Messing
- Waksman Institute of Microbiology, Rutgers University, Springfield, USA
| | - Mark E Sorrells
- Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY, USA
| | - Todd C Mockler
- Donald Danforth Plant Science Center, St. Louis, MO, 63132, USA
| | | | | |
Collapse
|
776
|
Chen Y, Ma T, Zhang L, Kang M, Zhang Z, Zheng Z, Sun P, Shrestha N, Liu J, Yang Y. Genomic analyses of a “living fossil”: The endangered dove‐tree. Mol Ecol Resour 2020; 20. [DOI: 10.1111/1755-0998.13138] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Revised: 01/09/2020] [Accepted: 01/13/2020] [Indexed: 12/18/2022]
Affiliation(s)
- Yang Chen
- Key Laboratory of Bio‐Resource and Eco‐Environment of Ministry of Education & State Key Lab of Hydraulics & Mountain River Engineering College of Life Sciences Sichuan University Chengdu China
| | - Tao Ma
- Key Laboratory of Bio‐Resource and Eco‐Environment of Ministry of Education & State Key Lab of Hydraulics & Mountain River Engineering College of Life Sciences Sichuan University Chengdu China
| | - Lushui Zhang
- Key Laboratory of Bio‐Resource and Eco‐Environment of Ministry of Education & State Key Lab of Hydraulics & Mountain River Engineering College of Life Sciences Sichuan University Chengdu China
| | - Minghui Kang
- Key Laboratory of Bio‐Resource and Eco‐Environment of Ministry of Education & State Key Lab of Hydraulics & Mountain River Engineering College of Life Sciences Sichuan University Chengdu China
| | - Zhiyang Zhang
- Key Laboratory of Bio‐Resource and Eco‐Environment of Ministry of Education & State Key Lab of Hydraulics & Mountain River Engineering College of Life Sciences Sichuan University Chengdu China
| | - Zeyu Zheng
- State Key Laboratory of Grassland Agro‐Ecosystem Institute of Innovation Ecology Lanzhou University Lanzhou China
| | - Pengchuan Sun
- School of Life Sciences North China University of Science and Technology Caofeidian, Tangshan China
| | - Nawal Shrestha
- State Key Laboratory of Grassland Agro‐Ecosystem Institute of Innovation Ecology Lanzhou University Lanzhou China
| | - Jianquan Liu
- Key Laboratory of Bio‐Resource and Eco‐Environment of Ministry of Education & State Key Lab of Hydraulics & Mountain River Engineering College of Life Sciences Sichuan University Chengdu China
- State Key Laboratory of Grassland Agro‐Ecosystem Institute of Innovation Ecology Lanzhou University Lanzhou China
| | - Yongzhi Yang
- Key Laboratory of Bio‐Resource and Eco‐Environment of Ministry of Education & State Key Lab of Hydraulics & Mountain River Engineering College of Life Sciences Sichuan University Chengdu China
- State Key Laboratory of Grassland Agro‐Ecosystem Institute of Innovation Ecology Lanzhou University Lanzhou China
| |
Collapse
|
777
|
Liu B, Yan J, Li W, Yin L, Li P, Yu H, Xing L, Cai M, Wang H, Zhao M, Zheng J, Sun F, Wang Z, Jiang Z, Ou Q, Li S, Qu L, Zhang Q, Zheng Y, Qiao X, Xi Y, Zhang Y, Jiang F, Huang C, Liu C, Ren Y, Wang S, Liu H, Guo J, Wang H, Dong H, Peng C, Qian W, Fan W, Wan F. Mikania micrantha genome provides insights into the molecular mechanism of rapid growth. Nat Commun 2020; 11:340. [PMID: 31953413 PMCID: PMC6969026 DOI: 10.1038/s41467-019-13926-4] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Accepted: 12/06/2019] [Indexed: 11/08/2022] Open
Abstract
Mikania micrantha is one of the top 100 worst invasive species that can cause serious damage to natural ecosystems and substantial economic losses. Here, we present its 1.79 Gb chromosome-scale reference genome. Half of the genome is composed of long terminal repeat retrotransposons, 80% of which have been derived from a significant expansion in the past one million years. We identify a whole genome duplication event and recent segmental duplications, which may be responsible for its rapid environmental adaptation. Additionally, we show that M. micrantha achieves higher photosynthetic capacity by CO2 absorption at night to supplement the carbon fixation during the day, as well as enhanced stem photosynthesis efficiency. Furthermore, the metabolites of M. micrantha can increase the availability of nitrogen by enriching the microbes that participate in nitrogen cycling pathways. These findings collectively provide insights into the rapid growth and invasive adaptation.
Collapse
Affiliation(s)
- Bo Liu
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Jian Yan
- Key Laboratory of Agro-Environment in the Tropics, Ministry of Agriculture and Rural Affairs; Guangdong Provincial Key Laboratory of Eco-Circular Agriculture; College of Natural Resources and Environment, South China Agricultural University, Guangzhou, 510642, China
| | - Weihua Li
- Institute of Ecological Science, Guangdong Provincial Key Laboratory of Biotechnology for Plant Development; Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring; School of Life Science, South China Normal University, Guangzhou, 510631, China
| | - Lijuan Yin
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
- Key Laboratory of Protein Function and Regulation in Agricultural Organisms of Guangdong province, College of Life Science, South China Agricultural University, Guangzhou, 510642, China
| | - Ping Li
- Key Laboratory of Agro-Environment in the Tropics, Ministry of Agriculture and Rural Affairs; Guangdong Provincial Key Laboratory of Eco-Circular Agriculture; College of Natural Resources and Environment, South China Agricultural University, Guangzhou, 510642, China
| | - Hanxia Yu
- Institute of Ecological Science, Guangdong Provincial Key Laboratory of Biotechnology for Plant Development; Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring; School of Life Science, South China Normal University, Guangzhou, 510631, China
| | - Longsheng Xing
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Minling Cai
- Institute of Ecological Science, Guangdong Provincial Key Laboratory of Biotechnology for Plant Development; Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring; School of Life Science, South China Normal University, Guangzhou, 510631, China
| | - Hengchao Wang
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Mengxin Zhao
- The Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Jin Zheng
- Institute of Ecological Science, Guangdong Provincial Key Laboratory of Biotechnology for Plant Development; Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring; School of Life Science, South China Normal University, Guangzhou, 510631, China
| | - Feng Sun
- Institute of Ecological Science, Guangdong Provincial Key Laboratory of Biotechnology for Plant Development; Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring; School of Life Science, South China Normal University, Guangzhou, 510631, China
| | - Zhenzhen Wang
- Key Laboratory of Agro-Environment in the Tropics, Ministry of Agriculture and Rural Affairs; Guangdong Provincial Key Laboratory of Eco-Circular Agriculture; College of Natural Resources and Environment, South China Agricultural University, Guangzhou, 510642, China
| | - Zhaoyang Jiang
- Institute of Ecological Science, Guangdong Provincial Key Laboratory of Biotechnology for Plant Development; Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring; School of Life Science, South China Normal University, Guangzhou, 510631, China
| | - Qiaojing Ou
- Institute of Ecological Science, Guangdong Provincial Key Laboratory of Biotechnology for Plant Development; Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring; School of Life Science, South China Normal University, Guangzhou, 510631, China
| | - Shubin Li
- Institute of Ecological Science, Guangdong Provincial Key Laboratory of Biotechnology for Plant Development; Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring; School of Life Science, South China Normal University, Guangzhou, 510631, China
| | - Lu Qu
- Institute of Ecological Science, Guangdong Provincial Key Laboratory of Biotechnology for Plant Development; Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring; School of Life Science, South China Normal University, Guangzhou, 510631, China
| | - Qilei Zhang
- Institute of Ecological Science, Guangdong Provincial Key Laboratory of Biotechnology for Plant Development; Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring; School of Life Science, South China Normal University, Guangzhou, 510631, China
| | - Yaping Zheng
- Institute of Ecological Science, Guangdong Provincial Key Laboratory of Biotechnology for Plant Development; Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring; School of Life Science, South China Normal University, Guangzhou, 510631, China
| | - Xi Qiao
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Yu Xi
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Yan Zhang
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Fan Jiang
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Cong Huang
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Conghui Liu
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Yuwei Ren
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Sen Wang
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Hangwei Liu
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China
| | - Jianyang Guo
- The Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Haihong Wang
- Key Laboratory of Protein Function and Regulation in Agricultural Organisms of Guangdong province, College of Life Science, South China Agricultural University, Guangzhou, 510642, China
| | - Hui Dong
- Fairy Lake Botanical Garden, Shenzhen and Chinese Academy of Sciences, Shenzhen, 518004, China
| | - Changlian Peng
- Institute of Ecological Science, Guangdong Provincial Key Laboratory of Biotechnology for Plant Development; Guangzhou Key Laboratory of Subtropical Biodiversity and Biomonitoring; School of Life Science, South China Normal University, Guangzhou, 510631, China.
| | - Wanqiang Qian
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China.
| | - Wei Fan
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China.
| | - Fanghao Wan
- Guangdong Laboratory of Lingnan Modern Agriculture, Shenzhen; Genome Analysis Laboratory of the Ministry of Agriculture; Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518120, China.
- The Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, 100193, China.
| |
Collapse
|
778
|
Jedlicka P, Lexa M, Kejnovsky E. What Can Long Terminal Repeats Tell Us About the Age of LTR Retrotransposons, Gene Conversion and Ectopic Recombination? FRONTIERS IN PLANT SCIENCE 2020; 11:644. [PMID: 32508870 PMCID: PMC7251063 DOI: 10.3389/fpls.2020.00644] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2019] [Accepted: 04/27/2020] [Indexed: 05/10/2023]
Abstract
LTR retrotransposons constitute a significant part of plant genomes and their evolutionary dynamics play an important role in genome size changes. Current methods of LTR retrotransposon age estimation are based only on LTR (long terminal repeat) divergence. This has prompted us to analyze sequence similarity of LTRs in 25,144 LTR retrotransposons from fifteen plant species as well as formation of solo LTRs. We found that approximately one fourth of nested retrotransposons showed a higher LTR divergence than the pre-existing retrotransposons into which they had been inserted. Moreover, LTR similarity was correlated with LTR length. We propose that gene conversion can contribute to this phenomenon. Gene conversion prediction in LTRs showed potential converted regions in 25% of LTR pairs. Gene conversion was higher in species with smaller genomes while the proportion of solo LTRs did not change with genome size in analyzed species. The negative correlation between the extent of gene conversion and the abundance of solo LTRs suggests interference between gene conversion and ectopic recombination. Since such phenomena limit the traditional methods of LTR retrotransposon age estimation, we recommend an improved approach based on the exclusion of regions affected by gene conversion.
Collapse
Affiliation(s)
- Pavel Jedlicka
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Brno, Czechia
| | - Matej Lexa
- Faculty of Informatics, Masaryk University, Brno, Czechia
| | - Eduard Kejnovsky
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Brno, Czechia
- *Correspondence: Eduard Kejnovsky,
| |
Collapse
|
779
|
Wang W, Das A, Kainer D, Schalamun M, Morales-Suarez A, Schwessinger B, Lanfear R. The draft nuclear genome assembly of Eucalyptus pauciflora: a pipeline for comparing de novo assemblies. Gigascience 2020; 9:giz160. [PMID: 31895413 PMCID: PMC6939829 DOI: 10.1093/gigascience/giz160] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Revised: 11/19/2019] [Accepted: 12/02/2019] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Eucalyptus pauciflora (the snow gum) is a long-lived tree with high economic and ecological importance. Currently, little genomic information for E. pauciflora is available. Here, we sequentially assemble the genome of Eucalyptus pauciflora with different methods, and combine multiple existing and novel approaches to help to select the best genome assembly. FINDINGS We generated high coverage of long- (Nanopore, 174×) and short- (Illumina, 228×) read data from a single E. pauciflora individual and compared assemblies from 5 assemblers (Canu, SMARTdenovo, Flye, Marvel, and MaSuRCA) with different read lengths (1 and 35 kb minimum read length). A key component of our approach is to keep a randomly selected collection of ∼10% of both long and short reads separated from the assemblies to use as a validation set for assessing assemblies. Using this validation set along with a range of existing tools, we compared the assemblies in 8 ways: contig N50, BUSCO scores, LAI (long terminal repeat assembly index) scores, assembly ploidy, base-level error rate, CGAL (computing genome assembly likelihoods) scores, structural variation, and genome sequence similarity. Our result showed that MaSuRCA generated the best assembly, which is 594.87 Mb in size, with a contig N50 of 3.23 Mb, and an estimated error rate of ∼0.006 errors per base. CONCLUSIONS We report a draft genome of E. pauciflora, which will be a valuable resource for further genomic studies of eucalypts. The approaches for assessing and comparing genomes should help in assessing and choosing among many potential genome assemblies from a single dataset.
Collapse
Affiliation(s)
- Weiwen Wang
- Research School of Biology, the Australian National University. 134 Linnaeus Way, Acton, Canberra, ACT, 2601, Australia
| | - Ashutosh Das
- Research School of Biology, the Australian National University. 134 Linnaeus Way, Acton, Canberra, ACT, 2601, Australia
- Department of Genetics and Animal Breeding, Faculty of Veterinary Medicine, Chittagong Veterinary and Animal Sciences University. Khulshi, Chattogram, 4225, Bangladesh
| | - David Kainer
- Research School of Biology, the Australian National University. 134 Linnaeus Way, Acton, Canberra, ACT, 2601, Australia
| | - Miriam Schalamun
- Research School of Biology, the Australian National University. 134 Linnaeus Way, Acton, Canberra, ACT, 2601, Australia
- Institute of Applied Genetics and Cell Biology, University of Natural Resources and Life Sciences. Muthgasse 18, Vienna, 1190 Wien, Austria
| | - Alejandro Morales-Suarez
- Department of Biological Sciences, Macquarie University.Building 6SR (E8B), 6 Science Rd, Sydney, NSW, 2109, Australia
| | - Benjamin Schwessinger
- Research School of Biology, the Australian National University. 134 Linnaeus Way, Acton, Canberra, ACT, 2601, Australia
| | - Robert Lanfear
- Research School of Biology, the Australian National University. 134 Linnaeus Way, Acton, Canberra, ACT, 2601, Australia
| |
Collapse
|
780
|
Ou S, Su W, Liao Y, Chougule K, Agda JRA, Hellinga AJ, Lugo CSB, Elliott TA, Ware D, Peterson T, Jiang N, Hirsch CN, Hufford MB. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol 2019. [PMID: 31843001 DOI: 10.1101/657890v1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023] Open
Abstract
BACKGROUND Sequencing technology and assembly algorithms have matured to the point that high-quality de novo assembly is possible for large, repetitive genomes. Current assemblies traverse transposable elements (TEs) and provide an opportunity for comprehensive annotation of TEs. Numerous methods exist for annotation of each class of TEs, but their relative performances have not been systematically compared. Moreover, a comprehensive pipeline is needed to produce a non-redundant library of TEs for species lacking this resource to generate whole-genome TE annotations. RESULTS We benchmark existing programs based on a carefully curated library of rice TEs. We evaluate the performance of methods annotating long terminal repeat (LTR) retrotransposons, terminal inverted repeat (TIR) transposons, short TIR transposons known as miniature inverted transposable elements (MITEs), and Helitrons. Performance metrics include sensitivity, specificity, accuracy, precision, FDR, and F1. Using the most robust programs, we create a comprehensive pipeline called Extensive de-novo TE Annotator (EDTA) that produces a filtered non-redundant TE library for annotation of structurally intact and fragmented elements. EDTA also deconvolutes nested TE insertions frequently found in highly repetitive genomic regions. Using other model species with curated TE libraries (maize and Drosophila), EDTA is shown to be robust across both plant and animal species. CONCLUSIONS The benchmarking results and pipeline developed here will greatly facilitate TE annotation in eukaryotic genomes. These annotations will promote a much more in-depth understanding of the diversity and evolution of TEs at both intra- and inter-species levels. EDTA is open-source and freely available: https://github.com/oushujun/EDTA.
Collapse
Affiliation(s)
- Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| | - Weija Su
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, 50011, USA
| | - Yi Liao
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, 92697, USA
| | - Kapeel Chougule
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Jireh R A Agda
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario, N1G 2W1, Canada
| | - Adam J Hellinga
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario, N1G 2W1, Canada
| | | | - Tyler A Elliott
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario, N1G 2W1, Canada
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
- USDA-ARS NEA Robert W. Holley Center for Agriculture and Health, Cornell University, Ithaca, NY, 14853, USA
| | - Thomas Peterson
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, 50011, USA
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA.
| | - Candice N Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, MN, 55108, USA.
| | - Matthew B Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA.
| |
Collapse
|
781
|
Ou S, Su W, Liao Y, Chougule K, Agda JRA, Hellinga AJ, Lugo CSB, Elliott TA, Ware D, Peterson T, Jiang N, Hirsch CN, Hufford MB. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol 2019; 20:275. [PMID: 31843001 PMCID: PMC6913007 DOI: 10.1186/s13059-019-1905-y] [Citation(s) in RCA: 699] [Impact Index Per Article: 116.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2019] [Accepted: 11/28/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Sequencing technology and assembly algorithms have matured to the point that high-quality de novo assembly is possible for large, repetitive genomes. Current assemblies traverse transposable elements (TEs) and provide an opportunity for comprehensive annotation of TEs. Numerous methods exist for annotation of each class of TEs, but their relative performances have not been systematically compared. Moreover, a comprehensive pipeline is needed to produce a non-redundant library of TEs for species lacking this resource to generate whole-genome TE annotations. RESULTS We benchmark existing programs based on a carefully curated library of rice TEs. We evaluate the performance of methods annotating long terminal repeat (LTR) retrotransposons, terminal inverted repeat (TIR) transposons, short TIR transposons known as miniature inverted transposable elements (MITEs), and Helitrons. Performance metrics include sensitivity, specificity, accuracy, precision, FDR, and F1. Using the most robust programs, we create a comprehensive pipeline called Extensive de-novo TE Annotator (EDTA) that produces a filtered non-redundant TE library for annotation of structurally intact and fragmented elements. EDTA also deconvolutes nested TE insertions frequently found in highly repetitive genomic regions. Using other model species with curated TE libraries (maize and Drosophila), EDTA is shown to be robust across both plant and animal species. CONCLUSIONS The benchmarking results and pipeline developed here will greatly facilitate TE annotation in eukaryotic genomes. These annotations will promote a much more in-depth understanding of the diversity and evolution of TEs at both intra- and inter-species levels. EDTA is open-source and freely available: https://github.com/oushujun/EDTA.
Collapse
Affiliation(s)
- Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011 USA
| | - Weija Su
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA 50011 USA
| | - Yi Liao
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697 USA
| | - Kapeel Chougule
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA
| | - Jireh R. A. Agda
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario N1G 2W1 Canada
| | - Adam J. Hellinga
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario N1G 2W1 Canada
| | | | - Tyler A. Elliott
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario N1G 2W1 Canada
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA
- USDA-ARS NEA Robert W. Holley Center for Agriculture and Health, Cornell University, Ithaca, NY 14853 USA
| | - Thomas Peterson
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA 50011 USA
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI 48824 USA
| | - Candice N. Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, MN 55108 USA
| | - Matthew B. Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011 USA
| |
Collapse
|
782
|
Ou S, Jiang N. LTR_FINDER_parallel: parallelization of LTR_FINDER enabling rapid identification of long terminal repeat retrotransposons. Mob DNA 2019; 10:48. [PMID: 31857828 PMCID: PMC6909508 DOI: 10.1186/s13100-019-0193-0] [Citation(s) in RCA: 126] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Accepted: 12/05/2019] [Indexed: 11/10/2022] Open
Abstract
Annotation of plant genomes is still a challenging task due to the abundance of repetitive sequences, especially long terminal repeat (LTR) retrotransposons. LTR_FINDER is a widely used program for the identification of LTR retrotransposons but its application on large genomes is hindered by its single-threaded processes. Here we report an accessory program that allows parallel operation of LTR_FINDER, resulting in up to 8500X faster identification of LTR elements. It takes only 72 min to process the 14.5 Gb bread wheat (Triticum aestivum) genome in comparison to 1.16 years required by the original sequential version. LTR_FINDER_parallel is freely available at https://github.com/oushujun/LTR_FINDER_parallel.
Collapse
Affiliation(s)
- Shujun Ou
- Department of Horticulture, Michigan State University, East Lansing, MI 48824 USA
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI 48824 USA
| |
Collapse
|
783
|
Marchant DB, Sessa EB, Wolf PG, Heo K, Barbazuk WB, Soltis PS, Soltis DE. The C-Fern (Ceratopteris richardii) genome: insights into plant genome evolution with the first partial homosporous fern genome assembly. Sci Rep 2019; 9:18181. [PMID: 31796775 PMCID: PMC6890710 DOI: 10.1038/s41598-019-53968-8] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2019] [Accepted: 11/04/2019] [Indexed: 01/04/2023] Open
Abstract
Ferns are notorious for possessing large genomes and numerous chromosomes. Despite decades of speculation, the processes underlying the expansive genomes of ferns are unclear, largely due to the absence of a sequenced homosporous fern genome. The lack of this crucial resource has not only hindered investigations of evolutionary processes responsible for the unusual genome characteristics of homosporous ferns, but also impeded synthesis of genome evolution across land plants. Here, we used the model fern species Ceratopteris richardii to address the processes (e.g., polyploidy, spread of repeat elements) by which the large genomes and high chromosome numbers typical of homosporous ferns may have evolved and have been maintained. We directly compared repeat compositions in species spanning the green plant tree of life and a diversity of genome sizes, as well as both short- and long-read-based assemblies of Ceratopteris. We found evidence consistent with a single ancient polyploidy event in the evolutionary history of Ceratopteris based on both genomic and cytogenetic data, and on repeat proportions similar to those found in large flowering plant genomes. This study provides a major stepping-stone in the understanding of land plant evolutionary genomics by providing the first homosporous fern reference genome, as well as insights into the processes underlying the formation of these massive genomes.
Collapse
Affiliation(s)
- D Blaine Marchant
- Department of Biology, Stanford University, Stanford, CA, 94305, USA.
- Department of Biology, University of Florida, Gainesville, FL, 32611, USA.
- Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA.
| | - Emily B Sessa
- Department of Biology, University of Florida, Gainesville, FL, 32611, USA
- The Genetics Institute, University of Florida, Gainesville, FL, 32611, USA
| | - Paul G Wolf
- Department of Biology, Utah State University, Logan, UT, 84322, USA
- Department of Biological Sciences, University of Alabama in Huntsville, Huntsville, AL, 35899, USA
| | - Kweon Heo
- Department of Applied Plant Sciences, Kangwon National University, Chuncheon, 24341, Korea
| | - W Brad Barbazuk
- Department of Biology, University of Florida, Gainesville, FL, 32611, USA
- The Genetics Institute, University of Florida, Gainesville, FL, 32611, USA
| | - Pamela S Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
- The Genetics Institute, University of Florida, Gainesville, FL, 32611, USA
- The Biodiversity Institute, University of Florida, Gainesville, FL, 32611, USA
| | - Douglas E Soltis
- Department of Biology, University of Florida, Gainesville, FL, 32611, USA
- Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
- The Genetics Institute, University of Florida, Gainesville, FL, 32611, USA
- The Biodiversity Institute, University of Florida, Gainesville, FL, 32611, USA
| |
Collapse
|
784
|
Yang X, Kang M, Yang Y, Xiong H, Wang M, Zhang Z, Wang Z, Wu H, Ma T, Liu J, Xi Z. A chromosome-level genome assembly of the Chinese tupelo Nyssa sinensis. Sci Data 2019; 6:282. [PMID: 31767848 PMCID: PMC6877568 DOI: 10.1038/s41597-019-0296-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Accepted: 10/21/2019] [Indexed: 01/15/2023] Open
Abstract
The deciduous Chinese tupelo (Nyssa sinensis Oliv.) is a popular ornamental tree for the spectacular autumn leaf color. Here, using single-molecule sequencing and chromosome conformation capture data, we report a high-quality, chromosome-level genome assembly of N. sinensis. PacBio long reads were de novo assembled into 647 polished contigs with a total length of 1,001.42 megabases (Mb) and an N50 size of 3.62 Mb, which is in line with genome sizes estimated using flow cytometry and the k-mer analysis. These contigs were further clustered and ordered into 22 pseudo-chromosomes based on Hi-C data, matching the chromosome counts in Nyssa obtained from previous cytological studies. In addition, a total of 664.91 Mb of repetitive elements were identified and a total of 37,884 protein-coding genes were predicted in the genome of N. sinensis. All data were deposited in publicly available repositories, and should be a valuable resource for genomics, evolution, and conservation biology.
Collapse
Affiliation(s)
- Xuchen Yang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Minghui Kang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Yanting Yang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Haifeng Xiong
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Mingcheng Wang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Zhiyang Zhang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Zefu Wang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Haolin Wu
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Tao Ma
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Jianquan Liu
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
- State Key Laboratory of Grassland Agro-Ecosystems, College of Life Sciences, Lanzhou University, Lanzhou, 730000, China
| | - Zhenxiang Xi
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China.
| |
Collapse
|
785
|
Varadharajan S, Rastas P, Löytynoja A, Matschiner M, Calboli FCF, Guo B, Nederbragt AJ, Jakobsen KS, Merilä J. A High-Quality Assembly of the Nine-Spined Stickleback (Pungitius pungitius) Genome. Genome Biol Evol 2019; 11:3291-3308. [PMID: 31687752 PMCID: PMC7145574 DOI: 10.1093/gbe/evz240] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/30/2019] [Indexed: 12/22/2022] Open
Abstract
The Gasterosteidae fish family hosts several species that are important models for eco-evolutionary, genetic, and genomic research. In particular, a wealth of genetic and genomic data has been generated for the three-spined stickleback (Gasterosteus aculeatus), the "ecology's supermodel," whereas the genomic resources for the nine-spined stickleback (Pungitius pungitius) have remained relatively scarce. Here, we report a high-quality chromosome-level genome assembly of P. pungitius consisting of 5,303 contigs (N50 = 1.2 Mbp) with a total size of 521 Mbp. These contigs were mapped to 21 linkage groups using a high-density linkage map, yielding a final assembly with 98.5% BUSCO completeness. A total of 25,062 protein-coding genes were annotated, and about 23% of the assembly was found to consist of repetitive elements. A comprehensive analysis of repetitive elements uncovered centromere-specific tandem repeats and provided insights into the evolution of retrotransposons. A multigene phylogenetic analysis inferred a divergence time of about 26 million years ago (Ma) between nine- and three-spined sticklebacks, which is far older than the commonly assumed estimate of 13 Ma. Compared with the three-spined stickleback, we identified an additional duplication of several genes in the hemoglobin cluster. Sequencing data from populations adapted to different environments indicated potential copy number variations in hemoglobin genes. Furthermore, genome-wide synteny comparisons between three- and nine-spined sticklebacks identified chromosomal rearrangements underlying the karyotypic differences between the two species. The high-quality chromosome-scale assembly of the nine-spined stickleback genome obtained with long-read sequencing technology provides a crucial resource for comparative and population genomic investigations of stickleback fishes and teleosts.
Collapse
Affiliation(s)
- Srinidhi Varadharajan
- Department of Biology, Centre for Ecological and Evolutionary Synthesis, University of Oslo, Norway
| | - Pasi Rastas
- Ecological Genetics Research Unit, Research Programme in Organismal and Evolutionary Biology, Faculty of Biological and Environmental Sciences, University of Helsinki, Finland
| | - Ari Löytynoja
- Institute of Biotechnology, University of Helsinki, Finland
| | - Michael Matschiner
- Department of Biology, Centre for Ecological and Evolutionary Synthesis, University of Oslo, Norway
- Department of Paleontology and Museum, University of Zurich, Switzerland
| | - Federico C F Calboli
- Ecological Genetics Research Unit, Research Programme in Organismal and Evolutionary Biology, Faculty of Biological and Environmental Sciences, University of Helsinki, Finland
- Laboratory of Biodiversity and Evolutionary Genomics, KU Leuven, Leuven, Belgium
| | - Baocheng Guo
- Ecological Genetics Research Unit, Research Programme in Organismal and Evolutionary Biology, Faculty of Biological and Environmental Sciences, University of Helsinki, Finland
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology Chinese Academy of Sciences, Beijing, China
| | - Alexander J Nederbragt
- Department of Biology, Centre for Ecological and Evolutionary Synthesis, University of Oslo, Norway
- Biomedical Informatics Research Group, Department of Informatics, University of Oslo, Norway
| | - Kjetill S Jakobsen
- Department of Biology, Centre for Ecological and Evolutionary Synthesis, University of Oslo, Norway
| | - Juha Merilä
- Ecological Genetics Research Unit, Research Programme in Organismal and Evolutionary Biology, Faculty of Biological and Environmental Sciences, University of Helsinki, Finland
| |
Collapse
|
786
|
Varadharajan S, Rastas P, Löytynoja A, Matschiner M, Calboli FCF, Guo B, Nederbragt AJ, Jakobsen KS, Merilä J. A High-Quality Assembly of the Nine-Spined Stickleback (Pungitius pungitius) Genome. Genome Biol Evol 2019. [PMID: 31687752 DOI: 10.1101/741751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023] Open
Abstract
The Gasterosteidae fish family hosts several species that are important models for eco-evolutionary, genetic, and genomic research. In particular, a wealth of genetic and genomic data has been generated for the three-spined stickleback (Gasterosteus aculeatus), the "ecology's supermodel," whereas the genomic resources for the nine-spined stickleback (Pungitius pungitius) have remained relatively scarce. Here, we report a high-quality chromosome-level genome assembly of P. pungitius consisting of 5,303 contigs (N50 = 1.2 Mbp) with a total size of 521 Mbp. These contigs were mapped to 21 linkage groups using a high-density linkage map, yielding a final assembly with 98.5% BUSCO completeness. A total of 25,062 protein-coding genes were annotated, and about 23% of the assembly was found to consist of repetitive elements. A comprehensive analysis of repetitive elements uncovered centromere-specific tandem repeats and provided insights into the evolution of retrotransposons. A multigene phylogenetic analysis inferred a divergence time of about 26 million years ago (Ma) between nine- and three-spined sticklebacks, which is far older than the commonly assumed estimate of 13 Ma. Compared with the three-spined stickleback, we identified an additional duplication of several genes in the hemoglobin cluster. Sequencing data from populations adapted to different environments indicated potential copy number variations in hemoglobin genes. Furthermore, genome-wide synteny comparisons between three- and nine-spined sticklebacks identified chromosomal rearrangements underlying the karyotypic differences between the two species. The high-quality chromosome-scale assembly of the nine-spined stickleback genome obtained with long-read sequencing technology provides a crucial resource for comparative and population genomic investigations of stickleback fishes and teleosts.
Collapse
Affiliation(s)
- Srinidhi Varadharajan
- Department of Biology, Centre for Ecological and Evolutionary Synthesis, University of Oslo, Norway
| | - Pasi Rastas
- Ecological Genetics Research Unit, Research Programme in Organismal and Evolutionary Biology, Faculty of Biological and Environmental Sciences, University of Helsinki, Finland
| | - Ari Löytynoja
- Institute of Biotechnology, University of Helsinki, Finland
| | - Michael Matschiner
- Department of Biology, Centre for Ecological and Evolutionary Synthesis, University of Oslo, Norway
- Department of Paleontology and Museum, University of Zurich, Switzerland
| | - Federico C F Calboli
- Ecological Genetics Research Unit, Research Programme in Organismal and Evolutionary Biology, Faculty of Biological and Environmental Sciences, University of Helsinki, Finland
- Laboratory of Biodiversity and Evolutionary Genomics, KU Leuven, Leuven, Belgium
| | - Baocheng Guo
- Ecological Genetics Research Unit, Research Programme in Organismal and Evolutionary Biology, Faculty of Biological and Environmental Sciences, University of Helsinki, Finland
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology Chinese Academy of Sciences, Beijing, China
| | - Alexander J Nederbragt
- Department of Biology, Centre for Ecological and Evolutionary Synthesis, University of Oslo, Norway
- Biomedical Informatics Research Group, Department of Informatics, University of Oslo, Norway
| | - Kjetill S Jakobsen
- Department of Biology, Centre for Ecological and Evolutionary Synthesis, University of Oslo, Norway
| | - Juha Merilä
- Ecological Genetics Research Unit, Research Programme in Organismal and Evolutionary Biology, Faculty of Biological and Environmental Sciences, University of Helsinki, Finland
| |
Collapse
|
787
|
Wu H, Ma T, Kang M, Ai F, Zhang J, Dong G, Liu J. A high-quality Actinidia chinensis (kiwifruit) genome. HORTICULTURE RESEARCH 2019; 6:117. [PMID: 31645971 PMCID: PMC6804796 DOI: 10.1038/s41438-019-0202-y] [Citation(s) in RCA: 100] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2019] [Revised: 07/04/2019] [Accepted: 09/02/2019] [Indexed: 05/04/2023]
Abstract
Actinidia chinensis (kiwifruit) is a perennial horticultural crop species of the Actinidiaceae family with high nutritional and economic value. Two versions of the A. chinensis genomes have been previously assembled, based mainly on relatively short reads. Here, we report an improved chromosome-level reference genome of A. chinensis (v3.0), based mainly on PacBio long reads and Hi-C data. The high-quality assembled genome is 653 Mb long, with 0.76% heterozygosity. At least 43% of the genome consists of repetitive sequences, and the most abundant long terminal repeats were further identified and account for 23.38% of our novel genome. It has clear improvements in contiguity, accuracy, and gene annotation over the two previous versions and contains 40,464 annotated protein-coding genes, of which 94.41% are functionally annotated. Moreover, further analyses of genetic collinearity revealed that the kiwifruit genome has undergone two whole-genome duplications: one affecting all Ericales families near the K-T extinction event and a recent genus-specific duplication. The reference genome presented here will be highly useful for further molecular elucidation of diverse traits and for the breeding of this horticultural crop, as well as evolutionary studies with related taxa.
Collapse
Affiliation(s)
- Haolin Wu
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education and State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065 China
| | - Tao Ma
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education and State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065 China
| | - Minghui Kang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education and State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065 China
| | - Fandi Ai
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education and State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065 China
| | - Junlin Zhang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education and State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065 China
| | - Guanyong Dong
- The Limited Agriculture Company of Xinyuan Sacred Fruit, Shifang, Deyang, 618409 Sichuan China
| | - Jianquan Liu
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education and State Key Laboratory of Hydraulics and Mountain River Engineering, College of Life Sciences, Sichuan University, Chengdu, 610065 China
- State Key Laboratory of Grassland Agro-Ecosystem, Institute of Innovation Ecology, Lanzhou University, Lanzhou, 730000 China
| |
Collapse
|
788
|
Peng Y, Zhang Y, Gui Y, An D, Liu J, Xu X, Li Q, Wang J, Wang W, Shi C, Fan L, Lu B, Deng Y, Teng S, He Z. Elimination of a Retrotransposon for Quenching Genome Instability in Modern Rice. MOLECULAR PLANT 2019; 12:1395-1407. [PMID: 31228579 DOI: 10.1016/j.molp.2019.06.004] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Revised: 06/05/2019] [Accepted: 06/09/2019] [Indexed: 05/26/2023]
Abstract
Transposable elements (TEs) constitute the most abundant portions of plant genomes and can dramatically shape host genomes during plant evolution. They also play important roles in crop domestication. However, whether TEs themselves are also selected during crop domestication has remained unknown. Here, we identify an active long terminal repeat (LTR) retrotransposon, HUO, as a potential target of selection during rice domestication and breeding. HUO is a low-copy-number LTR retrotransposon, and is active under natural growth conditions and transmitted through male gametogenesis, preferentially inserting into genomic regions capable of transcription. HUO exists in all wild rice accessions and about half of the archaeological rice grains (1200-7000 years ago) and landraces surveyed, but is absent in almost all modern varieties, indicating its gradual elimination during rice domestication and breeding. Further analyses showed that HUO is subjected to strict gene silencing through the RNA-directed DNA methylation pathway. Our results also suggest that multiple HUO copies may trigger genomic instability through altering genome-wide DNA methylation and small RNA biogenesis and changing global gene expression, resulting in decreased disease resistance and yield, coinciding with its elimination during rice breeding. Together, our study suggests that negative selection of an active retrotransposon might be important for genome stability during crop domestication and breeding.
Collapse
Affiliation(s)
- Yu Peng
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology & Ecology, Chinese Academy of Sciences, Shanghai 200032, China
| | - Yingying Zhang
- The Protected Horticulture Institute, Shanghai Academy of Agricultural Sciences, Shanghai 201403, China
| | - Yijie Gui
- School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Dong An
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology & Ecology, Chinese Academy of Sciences, Shanghai 200032, China
| | - Junzhong Liu
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology & Ecology, Chinese Academy of Sciences, Shanghai 200032, China
| | - Xun Xu
- Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
| | - Qun Li
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology & Ecology, Chinese Academy of Sciences, Shanghai 200032, China
| | - Junmin Wang
- Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China
| | - Wen Wang
- Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
| | - Chunhai Shi
- College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
| | - Longjiang Fan
- College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
| | - Baorong Lu
- School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Yiwen Deng
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology & Ecology, Chinese Academy of Sciences, Shanghai 200032, China.
| | - Sheng Teng
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology & Ecology, Chinese Academy of Sciences, Shanghai 200032, China.
| | - Zuhua He
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology & Ecology, Chinese Academy of Sciences, Shanghai 200032, China.
| |
Collapse
|
789
|
Surm JM, Stewart ZK, Papanicolaou A, Pavasovic A, Prentis PJ. The draft genome of Actinia tenebrosa reveals insights into toxin evolution. Ecol Evol 2019; 9:11314-11328. [PMID: 31641475 PMCID: PMC6802032 DOI: 10.1002/ece3.5633] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2019] [Revised: 08/06/2019] [Accepted: 08/12/2019] [Indexed: 12/17/2022] Open
Abstract
Sea anemones have a wide array of toxic compounds (peptide toxins found in their venom) which have potential uses as therapeutics. To date, the majority of studies characterizing toxins in sea anemones have been restricted to species from the superfamily, Actinioidea. No highly complete draft genomes are currently available for this superfamily, however, highlighting our limited understanding of the genes encoding toxins in this important group. Here we have sequenced, assembled, and annotated a draft genome for Actinia tenebrosa. The genome is estimated to be approximately 255 megabases, with 31,556 protein-coding genes. Quality metrics revealed that this draft genome matches the quality and completeness of other model cnidarian genomes, including Nematostella, Hydra, and Acropora. Phylogenomic analyses revealed strong conservation of the Cnidaria and Hexacorallia core-gene set. However, we found that lineage-specific gene families have undergone significant expansion events compared with shared gene families. Enrichment analysis performed for both gene ontologies, and protein domains revealed that genes encoding toxins contribute to a significant proportion of the lineage-specific genes and gene families. The results make clear that the draft genome of A. tenebrosa will provide insight into the evolution of toxins and lineage-specific genes, and provide an important resource for the discovery of novel biological compounds.
Collapse
Affiliation(s)
- Joachim M. Surm
- Faculty of HealthSchool of Biomedical SciencesQueensland University of TechnologyKelvin GroveQldAustralia
- Institute of Health and Biomedical InnovationQueensland University of TechnologyKelvin GroveQldAustralia
| | - Zachary K. Stewart
- Science and Engineering FacultySchool of Earth, Environmental and Biological SciencesQueensland University of TechnologyBrisbaneQldAustralia
- Institute for Future EnvironmentsQueensland University of TechnologyBrisbaneQldAustralia
| | | | - Ana Pavasovic
- Faculty of HealthSchool of Biomedical SciencesQueensland University of TechnologyKelvin GroveQldAustralia
| | - Peter J. Prentis
- Science and Engineering FacultySchool of Earth, Environmental and Biological SciencesQueensland University of TechnologyBrisbaneQldAustralia
- Institute for Future EnvironmentsQueensland University of TechnologyBrisbaneQldAustralia
| |
Collapse
|
790
|
Lantican DV, Strickler SR, Canama AO, Gardoce RR, Mueller LA, Galvez HF. De Novo Genome Sequence Assembly of Dwarf Coconut ( Cocos nucifera L. 'Catigan Green Dwarf') Provides Insights into Genomic Variation Between Coconut Types and Related Palm Species. G3 (BETHESDA, MD.) 2019; 9:2377-2393. [PMID: 31167834 PMCID: PMC6686914 DOI: 10.1534/g3.119.400215] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/31/2019] [Accepted: 05/31/2019] [Indexed: 11/23/2022]
Abstract
We report the first whole genome sequence (WGS) assembly and annotation of a dwarf coconut variety, 'Catigan Green Dwarf' (CATD). The genome sequence was generated using the PacBio SMRT sequencing platform at 15X coverage of the expected genome size of 2.15 Gbp, which was corrected with assembled 50X Illumina paired-end MiSeq reads of the same genome. The draft genome was improved through Chicago sequencing to generate a scaffold assembly that results in a total genome size of 2.1 Gbp consisting of 7,998 scaffolds with N50 of 570,487 bp. The final assembly covers around 97.6% of the estimated genome size of coconut 'CATD' based on homozygous k-mer peak analysis. A total of 34,958 high-confidence gene models were predicted and functionally associated to various economically important traits, such as pest/disease resistance, drought tolerance, coconut oil biosynthesis, and putative transcription factors. The assembled genome was used to infer the evolutionary relationship within the palm family based on genomic variations and synteny of coding gene sequences. Data show that at least three (3) rounds of whole genome duplication occurred and are commonly shared by these members of the Arecaceae family. A total of 7,139 unique SSR markers were designed to be used as a resource in marker-based breeding. In addition, we discovered 58,503 variants in coconut by aligning the Hainan Tall (HAT) WGS reads to the non-repetitive regions of the assembled CATD genome. The gene markers and genome-wide SSR markers established here will facilitate the development of varieties with resilience to climate change, resistance to pests and diseases, and improved oil yield and quality.
Collapse
Affiliation(s)
- Darlon V Lantican
- Genetics Laboratory, Institute of Plant Breeding, College of Agriculture and Food Science, University of the Philippines Los Baños, College, Laguna, Philippines 4031
- Philippine Genome Center, University of the Philippines System, Diliman, Quezon City, Philippines
| | | | - Alma O Canama
- Genetics Laboratory, Institute of Plant Breeding, College of Agriculture and Food Science, University of the Philippines Los Baños, College, Laguna, Philippines 4031
| | - Roanne R Gardoce
- Genetics Laboratory, Institute of Plant Breeding, College of Agriculture and Food Science, University of the Philippines Los Baños, College, Laguna, Philippines 4031
| | | | - Hayde F Galvez
- Genetics Laboratory, Institute of Plant Breeding, College of Agriculture and Food Science, University of the Philippines Los Baños, College, Laguna, Philippines 4031
- Institute of Crop Science, College of Agriculture and Food Science, University of the Philippines Los Baños, College, Laguna, Philippines 4031
| |
Collapse
|
791
|
Orozco-Arias S, Isaza G, Guyot R. Retrotransposons in Plant Genomes: Structure, Identification, and Classification through Bioinformatics and Machine Learning. Int J Mol Sci 2019; 20:E3837. [PMID: 31390781 PMCID: PMC6696364 DOI: 10.3390/ijms20153837] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Revised: 07/31/2019] [Accepted: 08/02/2019] [Indexed: 01/26/2023] Open
Abstract
Transposable elements (TEs) are genomic units able to move within the genome of virtually all organisms. Due to their natural repetitive numbers and their high structural diversity, the identification and classification of TEs remain a challenge in sequenced genomes. Although TEs were initially regarded as "junk DNA", it has been demonstrated that they play key roles in chromosome structures, gene expression, and regulation, as well as adaptation and evolution. A highly reliable annotation of these elements is, therefore, crucial to better understand genome functions and their evolution. To date, much bioinformatics software has been developed to address TE detection and classification processes, but many problematic aspects remain, such as the reliability, precision, and speed of the analyses. Machine learning and deep learning are algorithms that can make automatic predictions and decisions in a wide variety of scientific applications. They have been tested in bioinformatics and, more specifically for TEs, classification with encouraging results. In this review, we will discuss important aspects of TEs, such as their structure, importance in the evolution and architecture of the host, and their current classifications and nomenclatures. We will also address current methods and their limitations in identifying and classifying TEs.
Collapse
Affiliation(s)
- Simon Orozco-Arias
- Department of Computer Science, Universidad Autónoma de Manizales, Manizales 170001, Colombia
- Department of Systems and Informatics, Universidad de Caldas, Manizales 170001, Colombia
| | - Gustavo Isaza
- Department of Systems and Informatics, Universidad de Caldas, Manizales 170001, Colombia
| | - Romain Guyot
- Department of Electronics and Automatization, Universidad Autónoma de Manizales, Manizales 170001, Colombia.
- Institut de Recherche pour le Développement, CIRAD, University Montpellier, 34000 Montpellier, France.
| |
Collapse
|
792
|
Ou S, Chen J, Jiang N. Assessing genome assembly quality using the LTR Assembly Index (LAI). Nucleic Acids Res 2019; 46:e126. [PMID: 30107434 PMCID: PMC6265445 DOI: 10.1093/nar/gky730] [Citation(s) in RCA: 335] [Impact Index Per Article: 55.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2018] [Accepted: 07/31/2018] [Indexed: 12/15/2022] Open
Abstract
Assembling a plant genome is challenging due to the abundance of repetitive sequences, yet no standard is available to evaluate the assembly of repeat space. LTR retrotransposons (LTR-RTs) are the predominant interspersed repeat that is poorly assembled in draft genomes. Here, we propose a reference-free genome metric called LTR Assembly Index (LAI) that evaluates assembly continuity using LTR-RTs. After correcting for LTR-RT amplification dynamics, we show that LAI is independent of genome size, genomic LTR-RT content, and gene space evaluation metrics (i.e., BUSCO and CEGMA). By comparing genomic sequences produced by various sequencing techniques, we reveal the significant gain of assembly continuity by using long-read-based techniques over short-read-based methods. Moreover, LAI can facilitate iterative assembly improvement with assembler selection and identify low-quality genomic regions. To apply LAI, intact LTR-RTs and total LTR-RTs should contribute at least 0.1% and 5% to the genome size, respectively. The LAI program is freely available on GitHub: https://github.com/oushujun/LTR_retriever.
Collapse
Affiliation(s)
- Shujun Ou
- Department of Horticulture, Michigan State University, East Lansing, MI 48824, USA.,Program in Ecology, Evolutionary Biology and Behavior, Michigan State University, East Lansing, MI 48824, USA
| | - Jinfeng Chen
- Department of Plant Pathology and Microbiology, University of California, Riverside, CA 92507, USA
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI 48824, USA.,Program in Ecology, Evolutionary Biology and Behavior, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
793
|
Zhao D, Hamilton JP, Vaillancourt B, Zhang W, Eizenga GC, Cui Y, Jiang J, Buell CR, Jiang N. The unique epigenetic features of Pack-MULEs and their impact on chromosomal base composition and expression spectrum. Nucleic Acids Res 2019; 46:2380-2397. [PMID: 29365184 PMCID: PMC5861414 DOI: 10.1093/nar/gky025] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2017] [Accepted: 01/18/2018] [Indexed: 12/11/2022] Open
Abstract
Acquisition and rearrangement of host genes by transposable elements (TEs) is an important mechanism to increase gene diversity as exemplified by the ∼3000 Pack-Mutator-like TEs in the rice genome which have acquired gene sequences (Pack-MULEs), yet remain enigmatic. To identify signatures of functioning Pack-MULEs and Pack-MULE evolution, we generated transcriptome, translatome, and epigenome datasets and compared Pack-MULEs to genes and other TE families. Approximately 40% of Pack-MULEs were transcribed with 9% having translation evidence, clearly distinguishing them from other TEs. Pack-MULEs exhibited a unique expression profile associated with specificity in reproductive tissues that may be associated with seed traits. Expressed Pack-MULEs resemble regular protein-coding genes as exhibited by a low level of DNA methylation, association with active histone marks and DNase I hypersensitive sites, and an absence of repressive histone marks, suggesting that a substantial fraction of Pack-MULEs are potentially functional in vivo. Interestingly, the expression capacity of Pack-MULEs is independent of the local genomic environment, and the insertion and expression of Pack-MULEs may have altered the local chromosomal expression pattern as well as counteracted the impact of recombination on chromosomal base composition, which has profound consequences on the evolution of chromosome structure.
Collapse
Affiliation(s)
- Dongyan Zhao
- Department of Horticulture, Michigan State University, East Lansing, MI 48824, USA.,Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
| | - John P Hamilton
- Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
| | | | - Wenli Zhang
- Department of Horticulture, University of Wisconsin, Madison, WI 53705, USA.,State Key Laboratory for Crop Genetics and Germplasm Enhancement, Nanjing Agriculture University, Nanjing, Jiangsu 210095, China
| | - Georgia C Eizenga
- USDA-ARS Dale Bumpers National Rice Research Center, 2890 Highway 130 East, Stuttgart, AR 72160, USA
| | - Yuehua Cui
- Department of Statistics and Probability, Michigan State University, East Lansing, MI 48824, USA
| | - Jiming Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI 48824, USA.,Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
| | - C Robin Buell
- Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI 48824, USA.,Program in Ecology, Evolutionary Biology and Behavior, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
794
|
Valencia JD, Girgis HZ. LtrDetector: A tool-suite for detecting long terminal repeat retrotransposons de-novo. BMC Genomics 2019; 20:450. [PMID: 31159720 PMCID: PMC6547461 DOI: 10.1186/s12864-019-5796-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Accepted: 05/14/2019] [Indexed: 12/19/2022] Open
Abstract
BACKGROUND Long terminal repeat retrotransposons are the most abundant transposons in plants. They play important roles in alternative splicing, recombination, gene regulation, and defense mechanisms. Large-scale sequencing projects for plant genomes are currently underway. Software tools are important for annotating long terminal repeat retrotransposons in these newly available genomes. However, the available tools are not very sensitive to known elements and perform inconsistently on different genomes. Some are hard to install or obsolete. They may struggle to process large plant genomes. None can be executed in parallel out of the box and very few have features to support visual review of new elements. To overcome these limitations, we developed LtrDetector, which uses techniques inspired by signal-processing. RESULTS We compared LtrDetector to LTR_Finder and LTRharvest, the two most successful predecessor tools, on six plant genomes. For each organism, we constructed a ground truth data set based on queries from a consensus sequence database. According to this evaluation, LtrDetector was the most sensitive tool, achieving 16-23% improvement in sensitivity over LTRharvest and 21% improvement over LTR_Finder. All three tools had low false positive rates, with LtrDetector achieving 98.2% precision, in between its two competitors. Overall, LtrDetector provides the best compromise between high sensitivity and low false positive rate while requiring moderate time and utilizing memory available on personal computers. CONCLUSIONS LtrDetector uses a novel methodology revolving around k-mer distributions, which allows it to produce high-quality results using relatively lightweight procedures. It is easy to install and use. It is not species specific, performing well using its default parameters on genomes of varying size and repeat content. It is automatically configured for parallel execution and runs efficiently on an ordinary personal computer. It includes a k-mer scores visualization tool to facilitate manual review of the identified elements. These features make LtrDetector an attractive tool for future annotation projects involving long terminal repeat retrotransposons.
Collapse
Affiliation(s)
- Joseph D Valencia
- The Bioinformatics Toolsmith Laboratory, Tandy School of Computer Science, University of Tulsa, 800 South Tucker Drive, Tulsa, 74104, OK, USA
| | - Hani Z Girgis
- The Bioinformatics Toolsmith Laboratory, Tandy School of Computer Science, University of Tulsa, 800 South Tucker Drive, Tulsa, 74104, OK, USA.
| |
Collapse
|
795
|
Wai CM, Weise SE, Ozersky P, Mockler TC, Michael TP, VanBuren R. Time of day and network reprogramming during drought induced CAM photosynthesis in Sedum album. PLoS Genet 2019; 15:e1008209. [PMID: 31199791 PMCID: PMC6594660 DOI: 10.1371/journal.pgen.1008209] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2019] [Revised: 06/26/2019] [Accepted: 05/24/2019] [Indexed: 12/22/2022] Open
Abstract
Plants with facultative crassulacean acid metabolism (CAM) maximize performance through utilizing C3 or C4 photosynthesis under ideal conditions while temporally switching to CAM under water stress (drought). While genome-scale analyses of constitutive CAM plants suggest that time of day networks are shifted, or phased to the evening compared to C3, little is known for how the shift from C3 to CAM networks is modulated in drought induced CAM. Here we generate a draft genome for the drought-induced CAM-cycling species Sedum album. Through parallel sampling in well-watered (C3) and drought (CAM) conditions, we uncover a massive rewiring of time of day expression and a CAM and stress-specific network. The core circadian genes are expanded in S. album and under CAM induction, core clock genes either change phase or amplitude. While the core clock cis-elements are conserved in S. album, we uncover a set of novel CAM and stress specific cis-elements consistent with our finding of rewired co-expression networks. We identified shared elements between constitutive CAM and CAM-cycling species and expression patterns unique to CAM-cycling S. album. Together these results demonstrate that drought induced CAM-cycling photosynthesis evolved through the mobilization of a stress-specific, time of day network, and not solely the phasing of existing C3 networks. These results will inform efforts to engineer water use efficiency into crop plants for growth on marginal land.
Collapse
Affiliation(s)
- Ching Man Wai
- Department of Horticulture, Michigan State University, East Lansing, Michigan, United States of America
| | - Sean E. Weise
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan, United States of America
| | - Philip Ozersky
- Donald Danforth Plant Science Center, St. Louis MO, United States of America
| | - Todd C. Mockler
- Donald Danforth Plant Science Center, St. Louis MO, United States of America
| | - Todd P. Michael
- J. Craig Venter Institute, La Jolla, CA, United States of America
| | - Robert VanBuren
- Department of Horticulture, Michigan State University, East Lansing, Michigan, United States of America
- Plant Resilience Institute, Michigan State University, East Lansing, MI, United States of America
| |
Collapse
|
796
|
Zhang L, Hu J, Han X, Li J, Gao Y, Richards CM, Zhang C, Tian Y, Liu G, Gul H, Wang D, Tian Y, Yang C, Meng M, Yuan G, Kang G, Wu Y, Wang K, Zhang H, Wang D, Cong P. A high-quality apple genome assembly reveals the association of a retrotransposon and red fruit colour. Nat Commun 2019; 10:1494. [PMID: 30940818 PMCID: PMC6445120 DOI: 10.1038/s41467-019-09518-x] [Citation(s) in RCA: 206] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Accepted: 03/13/2019] [Indexed: 01/14/2023] Open
Abstract
A complete and accurate genome sequence provides a fundamental tool for functional genomics and DNA-informed breeding. Here, we assemble a high-quality genome (contig N50 of 6.99 Mb) of the apple anther-derived homozygous line HFTH1, including 22 telomere sequences, using a combination of PacBio single-molecule real-time (SMRT) sequencing, chromosome conformation capture (Hi-C) sequencing, and optical mapping. In comparison to the Golden Delicious reference genome, we identify 18,047 deletions, 12,101 insertions and 14 large inversions. We reveal that these extensive genomic variations are largely attributable to activity of transposable elements. Interestingly, we find that a long terminal repeat (LTR) retrotransposon insertion upstream of MdMYB1, a core transcriptional activator of anthocyanin biosynthesis, is associated with red-skinned phenotype. This finding provides insights into the molecular mechanisms underlying red fruit coloration, and highlights the utility of this high-quality genome assembly in deciphering agriculturally important trait in apple.
Collapse
Affiliation(s)
- Liyi Zhang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Jiang Hu
- Nextomics Biosciences Institute, 430000, Wuhan, Hubei, China
| | - Xiaolei Han
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Jingjing Li
- Nextomics Biosciences Institute, 430000, Wuhan, Hubei, China
| | - Yuan Gao
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Christopher M Richards
- USDA-ARS National Center for Genetic Resources Preservation, Fort Collins, CO, 80521, USA
| | - Caixia Zhang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Yi Tian
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Guiming Liu
- Beijing Agro-Biotechnology Research Center, Beijing Academy of Agriculture and Forestry Sciences, 100097, Beijing, China
| | - Hera Gul
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Dajiang Wang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Yu Tian
- Nextomics Biosciences Institute, 430000, Wuhan, Hubei, China
| | - Chuanxin Yang
- Nextomics Biosciences Institute, 430000, Wuhan, Hubei, China
| | - Minghui Meng
- Nextomics Biosciences Institute, 430000, Wuhan, Hubei, China
| | - Gaopeng Yuan
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Guodong Kang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Yonglong Wu
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Kun Wang
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China
| | - Hengtao Zhang
- Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Science, 450009, Zhengzhou, Henan, China
| | - Depeng Wang
- Nextomics Biosciences Institute, 430000, Wuhan, Hubei, China.
| | - Peihua Cong
- Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Research Institute of Pomology, Chinese Academy of Agricultural Science, 125100, Xingcheng, Liaoning, China.
| |
Collapse
|
797
|
Edger PP, Poorten TJ, VanBuren R, Hardigan MA, Colle M, McKain MR, Smith RD, Teresi SJ, Nelson ADL, Wai CM, Alger EI, Bird KA, Yocca AE, Pumplin N, Ou S, Ben-Zvi G, Brodt A, Baruch K, Swale T, Shiue L, Acharya CB, Cole GS, Mower JP, Childs KL, Jiang N, Lyons E, Freeling M, Puzey JR, Knapp SJ. Origin and evolution of the octoploid strawberry genome. Nat Genet 2019; 51:541-547. [PMID: 30804557 PMCID: PMC6882729 DOI: 10.1038/s41588-019-0356-4] [Citation(s) in RCA: 363] [Impact Index Per Article: 60.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Accepted: 01/15/2019] [Indexed: 01/19/2023]
Abstract
Cultivated strawberry emerged from the hybridization of two wild octoploid species, both descendants from the merger of four diploid progenitor species into a single nucleus more than 1 million years ago. Here we report a near-complete chromosome-scale assembly for cultivated octoploid strawberry (Fragaria × ananassa) and uncovered the origin and evolutionary processes that shaped this complex allopolyploid. We identified the extant relatives of each diploid progenitor species and provide support for the North American origin of octoploid strawberry. We examined the dynamics among the four subgenomes in octoploid strawberry and uncovered the presence of a single dominant subgenome with significantly greater gene content, gene expression abundance, and biased exchanges between homoeologous chromosomes, as compared with the other subgenomes. Pathway analysis showed that certain metabolomic and disease-resistance traits are largely controlled by the dominant subgenome. These findings and the reference genome should serve as a powerful platform for future evolutionary studies and enable molecular breeding in strawberry.
Collapse
Affiliation(s)
- Patrick P Edger
- Department of Horticulture, Michigan State University, East Lansing, MI, USA.
- Ecology, Evolutionary Biology and Behavior, Michigan State University, East Lansing, MI, USA.
| | - Thomas J Poorten
- Department of Plant Sciences, University of California-Davis, Davis, California, USA
| | - Robert VanBuren
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
- Plant Resilience Institute, Michigan State University, East Lansing, MI, USA
| | - Michael A Hardigan
- Department of Plant Sciences, University of California-Davis, Davis, California, USA
| | - Marivi Colle
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
| | - Michael R McKain
- Department of Biological Sciences, University of Alabama, Tuscaloosa, AL, USA
| | - Ronald D Smith
- Department of Biology, College of William and Mary, Williamsburg, VA, USA
| | - Scott J Teresi
- Department of Biology, College of William and Mary, Williamsburg, VA, USA
| | | | - Ching Man Wai
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
| | - Elizabeth I Alger
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
| | - Kevin A Bird
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
- Ecology, Evolutionary Biology and Behavior, Michigan State University, East Lansing, MI, USA
| | - Alan E Yocca
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
| | - Nathan Pumplin
- Department of Plant Sciences, University of California-Davis, Davis, California, USA
| | - Shujun Ou
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
- Ecology, Evolutionary Biology and Behavior, Michigan State University, East Lansing, MI, USA
| | | | | | | | | | | | - Charlotte B Acharya
- Department of Plant Sciences, University of California-Davis, Davis, California, USA
| | - Glenn S Cole
- Department of Plant Sciences, University of California-Davis, Davis, California, USA
| | - Jeffrey P Mower
- Center for Plant Science Innovation, University of Nebraska, Lincoln, NE, USA
| | - Kevin L Childs
- Department of Plant Biology, Michigan State University, East Lansing, MI, USA
- Center for Genomics Enabled Plant Science, Michigan State University, East Lansing, MI, USA
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
- Ecology, Evolutionary Biology and Behavior, Michigan State University, East Lansing, MI, USA
| | - Eric Lyons
- School of Plant Sciences, University of Arizona, Tucson, AZ, USA
| | - Michael Freeling
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Joshua R Puzey
- Department of Biology, College of William and Mary, Williamsburg, VA, USA
| | - Steven J Knapp
- Department of Plant Sciences, University of California-Davis, Davis, California, USA.
| |
Collapse
|
798
|
Colle M, Leisner CP, Wai CM, Ou S, Bird KA, Wang J, Wisecaver JH, Yocca AE, Alger EI, Tang H, Xiong Z, Callow P, Ben-Zvi G, Brodt A, Baruch K, Swale T, Shiue L, Song GQ, Childs KL, Schilmiller A, Vorsa N, Buell CR, VanBuren R, Jiang N, Edger PP. Haplotype-phased genome and evolution of phytonutrient pathways of tetraploid blueberry. Gigascience 2019; 8:giz012. [PMID: 30715294 PMCID: PMC6423372 DOI: 10.1093/gigascience/giz012] [Citation(s) in RCA: 139] [Impact Index Per Article: 23.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2018] [Revised: 12/18/2018] [Accepted: 01/18/2019] [Indexed: 11/15/2022] Open
Abstract
BACKGROUND Highbush blueberry (Vaccinium corymbosum) has long been consumed for its unique flavor and composition of health-promoting phytonutrients. However, breeding efforts to improve fruit quality in blueberry have been greatly hampered by the lack of adequate genomic resources and a limited understanding of the underlying genetics encoding key traits. The genome of highbush blueberry has been particularly challenging to assemble due, in large part, to its polyploid nature and genome size. FINDINGS Here, we present a chromosome-scale and haplotype-phased genome assembly of the cultivar "Draper," which has the highest antioxidant levels among a diversity panel of 71 cultivars and 13 wild Vaccinium species. We leveraged this genome, combined with gene expression and metabolite data measured across fruit development, to identify candidate genes involved in the biosynthesis of important phytonutrients among other metabolites associated with superior fruit quality. Genome-wide analyses revealed that both polyploidy and tandem gene duplications modified various pathways involved in the biosynthesis of key phytonutrients. Furthermore, gene expression analyses hint at the presence of a spatial-temporal specific dominantly expressed subgenome including during fruit development. CONCLUSIONS These findings and the reference genome will serve as a valuable resource to guide future genome-enabled breeding of important agronomic traits in highbush blueberry.
Collapse
Affiliation(s)
- Marivi Colle
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI, 48824, USA
- MSU AgBioResearch, Michigan State University, 446 West Circle Drive, East Lansing, MI, 48824, USA
| | - Courtney P Leisner
- Department of Plant Biology, Michigan State University, 612 Wilson Road, East Lansing, MI, 48824 USA
| | - Ching Man Wai
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI, 48824, USA
| | - Shujun Ou
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI, 48824, USA
- Ecology, Evolutionary Biology and Behavior, Michigan State University, 293 Farm Lane, East Lansing, MI, 48824, USA
| | - Kevin A Bird
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI, 48824, USA
- Ecology, Evolutionary Biology and Behavior, Michigan State University, 293 Farm Lane, East Lansing, MI, 48824, USA
| | - Jie Wang
- Department of Plant Biology, Michigan State University, 612 Wilson Road, East Lansing, MI, 48824 USA
- Center for Genomics Enabled Plant Science, Michigan State University, 612 Wilson Road, East Lansing, MI, 48824, USA
| | - Jennifer H Wisecaver
- Department of Biochemistry, Purdue University, 175 South University Street, West Lafayette, IN, 47907, USA
- Purdue Center for Plant Biology, Purdue University, 610 Purdue Mall, West Lafayette, IN, 47907, USA
| | - Alan E Yocca
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI, 48824, USA
- Department of Plant Biology, Michigan State University, 612 Wilson Road, East Lansing, MI, 48824 USA
| | - Elizabeth I Alger
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI, 48824, USA
| | - Haibao Tang
- Human Longevity Inc., 4570 Executive Drive, San Diego, CA 92121, USA
| | - Zhiyong Xiong
- Key Laboratory of Herbage and Endemic Crop Biotechnology, School of Life Sciences, Inner Mongolia University, 221 Aimin Road, Hohhot, 010070, China
| | - Pete Callow
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI, 48824, USA
| | - Gil Ben-Zvi
- NRGene, 5 Golda Meir Street, Ness Ziona, 7403648, Israel
| | - Avital Brodt
- NRGene, 5 Golda Meir Street, Ness Ziona, 7403648, Israel
| | - Kobi Baruch
- NRGene, 5 Golda Meir Street, Ness Ziona, 7403648, Israel
| | - Thomas Swale
- Dovetail Genomics, 100 Enterprise Way, Scotts Valley, CA, 95066, USA
| | - Lily Shiue
- Dovetail Genomics, 100 Enterprise Way, Scotts Valley, CA, 95066, USA
| | - Guo-qing Song
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI, 48824, USA
| | - Kevin L Childs
- Department of Plant Biology, Michigan State University, 612 Wilson Road, East Lansing, MI, 48824 USA
- Center for Genomics Enabled Plant Science, Michigan State University, 612 Wilson Road, East Lansing, MI, 48824, USA
| | - Anthony Schilmiller
- Mass Spectrometry & Metabolomics Core Facility, Michigan State University, 603 Wilson Road, East Lansing, MI, 48824, USA
| | - Nicholi Vorsa
- Department of Plant Biology, Rutgers University, 59 Dudley Road, New Brunswick, NJ, 08901, USA
- Philip E. Marucci Center for Blueberry and Cranberry Research and Extension, Rutgers University, 125A Lake Oswego Road, Chatsworth, NJ, 08019, USA
| | - C Robin Buell
- MSU AgBioResearch, Michigan State University, 446 West Circle Drive, East Lansing, MI, 48824, USA
- Department of Plant Biology, Michigan State University, 612 Wilson Road, East Lansing, MI, 48824 USA
- Plant Resilience Institute, Michigan State University, 612 Wilson Road, East Lansing, MI, 48824 USA
| | - Robert VanBuren
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI, 48824, USA
- Plant Resilience Institute, Michigan State University, 612 Wilson Road, East Lansing, MI, 48824 USA
| | - Ning Jiang
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI, 48824, USA
- Ecology, Evolutionary Biology and Behavior, Michigan State University, 293 Farm Lane, East Lansing, MI, 48824, USA
| | - Patrick P Edger
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI, 48824, USA
- MSU AgBioResearch, Michigan State University, 446 West Circle Drive, East Lansing, MI, 48824, USA
- Ecology, Evolutionary Biology and Behavior, Michigan State University, 293 Farm Lane, East Lansing, MI, 48824, USA
| |
Collapse
|
799
|
Wu M, Kostyun JL, Moyle LC. Genome Sequence of Jaltomata Addresses Rapid Reproductive Trait Evolution and Enhances Comparative Genomics in the Hyper-Diverse Solanaceae. Genome Biol Evol 2019; 11:335-349. [PMID: 30608583 PMCID: PMC6368146 DOI: 10.1093/gbe/evy274] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/16/2018] [Indexed: 12/11/2022] Open
Abstract
Within the economically important plant family Solanaceae, Jaltomata is a rapidly evolving genus that has extensive diversity in flower size and shape, as well as fruit and nectar color, among its ∼80 species. Here, we report the whole-genome sequencing, assembly, and annotation, of one representative species (Jaltomata sinuosa) from this genus. Combining PacBio long reads (25×) and Illumina short reads (148×) achieved an assembly of ∼1.45 Gb, spanning ∼96% of the estimated genome. Ninety-six percent of curated single-copy orthologs in plants were detected in the assembly, supporting a high level of completeness of the genome. Similar to other Solanaceous species, repetitive elements made up a large fraction (∼80%) of the genome, with the most recently active element, Gypsy, expanding across the genome in the last 1–2 Myr. Computational gene prediction, in conjunction with a merged transcriptome data set from 11 tissues, identified 34,725 protein-coding genes. Comparative phylogenetic analyses with six other sequenced Solanaceae species determined that Jaltomata is most likely sister to Solanum, although a large fraction of gene trees supported a conflicting bipartition consistent with substantial introgression between Jaltomata and Capsicum after these species split. We also identified gene family dynamics specific to Jaltomata, including expansion of gene families potentially involved in novel reproductive trait development, and loss of gene families that accompanied the loss of self-incompatibility. This high-quality genome will facilitate studies of phenotypic diversification in this rapidly radiating group and provide a new point of comparison for broader analyses of genomic evolution across the Solanaceae.
Collapse
Affiliation(s)
- Meng Wu
- Department of Biology, Indiana University Bloomington
| | - Jamie L Kostyun
- Department of Biology, Indiana University Bloomington.,Department of Plant Biology, University of Vermont
| | | |
Collapse
|
800
|
Thomas J, Perron H, Feschotte C. Variation in proviral content among human genomes mediated by LTR recombination. Mob DNA 2018; 9:36. [PMID: 30568734 PMCID: PMC6298018 DOI: 10.1186/s13100-018-0142-3] [Citation(s) in RCA: 66] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Accepted: 11/29/2018] [Indexed: 01/23/2023] Open
Abstract
Background Human endogenous retroviruses (HERVs) occupy a substantial fraction of the genome and impact cellular function with both beneficial and deleterious consequences. The vast majority of HERV sequences descend from ancient retroviral families no longer capable of infection or genomic propagation. In fact, most are no longer represented by full-length proviruses but by solitary long terminal repeats (solo LTRs) that arose via non-allelic recombination events between the two LTRs of a proviral insertion. Because LTR-LTR recombination events may occur long after proviral insertion but are challenging to detect in resequencing data, we hypothesize that this mechanism is a source of genomic variation in the human population that remains vastly underestimated. Results We developed a computational pipeline specifically designed to capture dimorphic proviral/solo HERV allelic variants from short-read genome sequencing data. When applied to 279 individuals sequenced as part of the Simons Genome Diversity Project, the pipeline retrieves most of the dimorphic loci previously reported for the HERV-K(HML2) subfamily as well as dozens of additional candidates, including members of the HERV-H and HERV-W families previously involved in human development and disease. We experimentally validate several of these newly discovered dimorphisms, including the first reported instance of an unfixed HERV-W provirus and an HERV-H locus driving a transcript (ESRG) implicated in the maintenance of embryonic stem cell pluripotency. Conclusions Our findings indicate that human proviral content exhibit more extensive interindividual variation than previously recognized, which has important bearings for deciphering the contribution of HERVs to human physiology and disease. Because LTR retroelements and LTR recombination are ubiquitous in eukaryotes, our computational pipeline should facilitate the mapping of this type of genomic variation for a wide range of organisms. Electronic supplementary material The online version of this article (10.1186/s13100-018-0142-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jainy Thomas
- 1Department of Human Genetics, University of Utah School of Medicine, 15 North 2030 East, Rm 5100, Salt Lake City, UT 84112 USA
| | - Hervé Perron
- GeNeuro, Plan-les-Ouates, Geneva, Switzerland.,3Université Claude Bernard, Lyon, France
| | - Cédric Feschotte
- 4Department of Molecular Biology and Genetics, Cornell University, 107 Biotechnology Building, Ithaca, NY 14853 USA
| |
Collapse
|