1
|
Dhakal U, Kim HS, Toomajian C. The landscape and predicted roles of structural variants in Fusarium graminearum genomes. G3 (BETHESDA, MD.) 2024; 14:jkae065. [PMID: 38546739 DOI: 10.1093/g3journal/jkae065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Accepted: 02/22/2024] [Indexed: 06/06/2024]
Abstract
Structural rearrangements, such as inversions, translocations, duplications, and large insertions and deletions, are large-scale genomic variants that can play an important role in shaping phenotypic variation and in genome adaptation and evolution. We used chromosomal-level assemblies from eight Fusarium graminearum isolates to study structural variants and their role in fungal evolution. We generated the assemblies of four of these genomes after Oxford Nanopore sequencing. A total of 87 inversions, 159 translocations, 245 duplications, 58,489 insertions, and 34,102 deletions were detected. Regions of high recombination rate are associated with structural rearrangements, and a significant proportion of inversions, translocations, and duplications overlap with the repeat content of the genome, suggesting recombination and repeat elements are major factors in the origin of structural rearrangements in F. graminearum. Large insertions and deletions introduce presence-absence polymorphisms for many genes, including secondary metabolite biosynthesis cluster genes and predicted effectors genes. Translocation events were found to be shuffling predicted effector-rich regions of the genomes and are likely contributing to the gain and loss of effectors facilitated by recombination. Breakpoints of some structural rearrangements fall within coding sequences and are likely altering the protein products. Structural rearrangements in F. graminearum thus have an important role to play in shaping pathogen-host interactions and broader evolution through genome reorganization, the introduction of presence-absence polymorphisms, and changing protein products and gene regulation.
Collapse
Affiliation(s)
- Upasana Dhakal
- Department of Plant Pathology, Kansas State University, Manhattan, KS 66506, USA
| | - Hye-Seon Kim
- USDA, Agricultural Research Service, National Center for Agricultural Utilization Research, Mycotoxin Prevention and Applied Microbiology Research Unit, 1815 N University St., Peoria, IL 61604, USA
| | | |
Collapse
|
2
|
Yu Z, Li J, Wang H, Ping B, Li X, Liu Z, Guo B, Yu Q, Zou Y, Sun Y, Ma F, Zhao T. Transposable elements in Rosaceae: insights into genome evolution, expression dynamics, and syntenic gene regulation. HORTICULTURE RESEARCH 2024; 11:uhae118. [PMID: 38919560 PMCID: PMC11197308 DOI: 10.1093/hr/uhae118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 04/17/2024] [Indexed: 06/27/2024]
Abstract
Transposable elements (TEs) exert significant influence on plant genomic structure and gene expression. Here, we explored TE-related aspects across 14 Rosaceae genomes, investigating genomic distribution, transposition activity, expression patterns, and nearby differentially expressed genes (DEGs). Analyses unveiled distinct long terminal repeat retrotransposon (LTR-RT) evolutionary patterns, reflecting varied genome size changes among nine species over the past million years. In the past 2.5 million years, Rubus idaeus showed a transposition rate twice as fast as Fragaria vesca, while Pyrus bretschneideri displayed significantly faster transposition compared with Crataegus pinnatifida. Genes adjacent to recent TE insertions were linked to adversity resistance, while those near previous insertions were functionally enriched in morphogenesis, enzyme activity, and metabolic processes. Expression analysis revealed diverse responses of LTR-RTs to internal or external conditions. Furthermore, we identified 3695 pairs of syntenic DEGs proximal to TEs in Malus domestica cv. 'Gala' and M. domestica (GDDH13), suggesting TE insertions may contribute to varietal trait differences in these apple varieties. Our study across representative Rosaceae species underscores the pivotal role of TEs in plant genome evolution within this diverse family. It elucidates how these elements regulate syntenic DEGs on a genome-wide scale, offering insights into Rosaceae-specific genomic evolution.
Collapse
Affiliation(s)
- Ze Yu
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Jiale Li
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Hanyu Wang
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Boya Ping
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Xinchu Li
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Zhiguang Liu
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Bocheng Guo
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Qiaoming Yu
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yangjun Zou
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Yaqiang Sun
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Fengwang Ma
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Tao Zhao
- State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, Shaanxi 712100, China
| |
Collapse
|
3
|
Liu B, Shen CC, Xia SW, Song SS, Su LH, Li Y, Hao Q, Liu YJ, Guan DL, Wang N, Wang WJ, Zhao X, Li HX, Li XX, Lai YS. A nanopore-based cucumber genome assembly reveals structural variations at two QTLs controlling hypocotyl elongation. PLANT PHYSIOLOGY 2024; 195:970-985. [PMID: 38478469 DOI: 10.1093/plphys/kiae153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Accepted: 02/06/2024] [Indexed: 06/02/2024]
Abstract
The Xishuangbanna (XIS) cucumber (Cucumis sativus var. xishuangbannanesis) is a semiwild variety that has many distinct agronomic traits. Here, long reads generated by Nanopore sequencing technology helped assembling a high-quality genome (contig N50 = 8.7 Mb) of landrace XIS49. A total of 10,036 structural/sequence variations (SVs) were identified when comparing with Chinese Long (CL), and known SVs controlling spines, tubercles, and carpel number were confirmed in XIS49 genome. Two QTLs of hypocotyl elongation under low light, SH3.1 and SH6.1, were fine-mapped using introgression lines (donor parent, XIS49; recurrent parent, CL). SH3.1 encodes a red-light receptor Phytochrome B (PhyB, CsaV3_3G015190). A ∼4 kb region with large deletion and highly divergent regions (HDRs) were identified in the promoter of the PhyB gene in XIS49. Loss of function of this PhyB caused a super-long hypocotyl phenotype. SH6.1 encodes a CCCH-type zinc finger protein FRIGIDA-ESSENTIAL LIKE (FEL, CsaV3_6G050300). FEL negatively regulated hypocotyl elongation but it was transcriptionally suppressed by long terminal repeats retrotransposon insertion in CL cucumber. Mechanistically, FEL physically binds to the promoter of CONSTITUTIVE PHOTOMORPHOGENIC 1a (COP1a), regulating the expression of COP1a and the downstream hypocotyl elongation. These above results demonstrate the genetic mechanism of cucumber hypocotyl elongation under low light.
Collapse
Affiliation(s)
- Bin Liu
- College of Horticulture, Sichuan Agricultural University, 611130 Chengdu, China
- Hami-melon Research Center, Xinjiang Academy of Agricultural Sciences, 830091 Urumqi, China
- Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, Campus Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain
| | - Cheng-Cheng Shen
- College of Horticulture, Shanxi Agricultural University, 030801 Jinzhong, China
| | - Shi-Wei Xia
- College of Horticulture, Sichuan Agricultural University, 611130 Chengdu, China
| | - Shan-Shan Song
- College of Horticulture, Sichuan Agricultural University, 611130 Chengdu, China
| | - Li-Hong Su
- College of Horticulture, Sichuan Agricultural University, 611130 Chengdu, China
| | - Yu Li
- College of Horticulture, Sichuan Agricultural University, 611130 Chengdu, China
| | - Qian Hao
- College of Horticulture, Sichuan Agricultural University, 611130 Chengdu, China
| | - Yan-Jun Liu
- College of Horticulture, Sichuan Agricultural University, 611130 Chengdu, China
| | - Dai-Lu Guan
- Centre for Research in Agricultural Genomics (CRAG), CSIC-IRTA-UAB-UB, Campus Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain
| | - Ning Wang
- College of Horticulture, Shanxi Agricultural University, 030801 Jinzhong, China
| | - Wen-Jiao Wang
- College of Horticulture, Shanxi Agricultural University, 030801 Jinzhong, China
| | - Xiang Zhao
- College of Horticulture, Sichuan Agricultural University, 611130 Chengdu, China
| | - Huan-Xiu Li
- College of Horticulture, Sichuan Agricultural University, 611130 Chengdu, China
| | - Xi-Xiang Li
- Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, 100080 Beijing, China
| | - Yun-Song Lai
- College of Horticulture, Sichuan Agricultural University, 611130 Chengdu, China
| |
Collapse
|
4
|
Huang J, Zhang Y, Li Y, Xing M, Lei C, Wang S, Nie Y, Wang Y, Zhao M, Han Z, Sun X, Zhou H, Wang Y, Zheng X, Xiao X, Fan W, Liu Z, Guo W, Zhang L, Cheng Y, Qian Q, He H, Yang Q, Qiao W. Haplotype-resolved gapless genome and chromosome segment substitution lines facilitate gene identification in wild rice. Nat Commun 2024; 15:4573. [PMID: 38811581 PMCID: PMC11137157 DOI: 10.1038/s41467-024-48845-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 05/15/2024] [Indexed: 05/31/2024] Open
Abstract
The abundant genetic variation harbored by wild rice (Oryza rufipogon) has provided a reservoir of useful genes for rice breeding. However, the genome of wild rice has not yet been comprehensively assessed. Here, we report the haplotype-resolved gapless genome assembly and annotation of wild rice Y476. In addition, we develop two sets of chromosome segment substitution lines (CSSLs) using Y476 as the donor parent and cultivated rice as the recurrent parents. By analyzing the gapless reference genome and CSSL population, we identify 254 QTLs associated with agronomic traits, biotic and abiotic stresses. We clone a receptor-like kinase gene associated with rice blast resistance and confirm its wild rice allele improves rice blast resistance. Collectively, our study provides a haplotype-resolved gapless reference genome and demonstrates a highly efficient platform for gene identification from wild rice.
Collapse
Affiliation(s)
- Jingfen Huang
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yilin Zhang
- School of Advanced Agriculture Sciences and School of Life Sciences, State Key Laboratory of Protein and Plant Gene Research, Peking University, Beijing, China
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences at Weifang, Weifang, Shandong, China
| | - Yapeng Li
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
- Hainan Academy of Agricultural Sciences, Haikou, Hainan, China
| | - Meng Xing
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
| | - Cailin Lei
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
| | - Shizhuang Wang
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
| | - Yamin Nie
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
| | - Yanyan Wang
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
| | - Mingchao Zhao
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
- Hainan Academy of Agricultural Sciences, Haikou, Hainan, China
| | - Zhenyun Han
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Xianjun Sun
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Han Zhou
- School of Advanced Agriculture Sciences and School of Life Sciences, State Key Laboratory of Protein and Plant Gene Research, Peking University, Beijing, China
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences at Weifang, Weifang, Shandong, China
| | - Yan Wang
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences at Weifang, Weifang, Shandong, China
| | - Xiaoming Zheng
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
| | - Xiaorong Xiao
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
- Hainan Academy of Agricultural Sciences, Haikou, Hainan, China
| | - Weiya Fan
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Ziran Liu
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Wenlong Guo
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Lifang Zhang
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yunlian Cheng
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Qian Qian
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China
| | - Hang He
- School of Advanced Agriculture Sciences and School of Life Sciences, State Key Laboratory of Protein and Plant Gene Research, Peking University, Beijing, China.
- Peking University Institute of Advanced Agricultural Sciences, Shandong Laboratory of Advanced Agricultural Sciences at Weifang, Weifang, Shandong, China.
| | - Qingwen Yang
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China.
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China.
| | - Weihua Qiao
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China.
- National Nanfan Research Institute (Sanya), Chinese Academy of Agricultural Sciences, Sanya, Hainan, China.
| |
Collapse
|
5
|
Puginier C, Libourel C, Otte J, Skaloud P, Haon M, Grisel S, Petersen M, Berrin JG, Delaux PM, Dal Grande F, Keller J. Phylogenomics reveals the evolutionary origins of lichenization in chlorophyte algae. Nat Commun 2024; 15:4452. [PMID: 38789482 PMCID: PMC11126685 DOI: 10.1038/s41467-024-48787-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Accepted: 05/10/2024] [Indexed: 05/26/2024] Open
Abstract
Mutualistic symbioses have contributed to major transitions in the evolution of life. Here, we investigate the evolutionary history and the molecular innovations at the origin of lichens, which are a symbiosis established between fungi and green algae or cyanobacteria. We de novo sequence the genomes or transcriptomes of 12 lichen algal symbiont (LAS) and closely related non-symbiotic algae (NSA) to improve the genomic coverage of Chlorophyte algae. We then perform ancestral state reconstruction and comparative phylogenomics. We identify at least three independent gains of the ability to engage in the lichen symbiosis, one in Trebouxiophyceae and two in Ulvophyceae, confirming the convergent evolution of the lichen symbioses. A carbohydrate-active enzyme from the glycoside hydrolase 8 (GH8) family was identified as a top candidate for the molecular-mechanism underlying lichen symbiosis in Trebouxiophyceae. This GH8 was acquired in lichenizing Trebouxiophyceae by horizontal gene transfer, concomitantly with the ability to associate with lichens fungal symbionts (LFS) and is able to degrade polysaccharides found in the cell wall of LFS. These findings indicate that a combination of gene family expansion and horizontal gene transfer provided the basis for lichenization to evolve in chlorophyte algae.
Collapse
Affiliation(s)
- Camille Puginier
- Laboratoire de Recherche en Sciences Végétales (LRSV), Université de Toulouse, CNRS, UPS, INP, Toulouse, 31320, Castanet-Tolosan, France
| | - Cyril Libourel
- Laboratoire de Recherche en Sciences Végétales (LRSV), Université de Toulouse, CNRS, UPS, INP, Toulouse, 31320, Castanet-Tolosan, France
| | - Juergen Otte
- Senckenberg Biodiversity and Climate Research Centre (SBiK-F), Senckenberganlage 25, 60325, Frankfurt am Main, Germany
| | - Pavel Skaloud
- Department of Botany, Faculty of Science, Charles University, Benátská 2, CZ-12800, Praha 2, Czech Republic
| | - Mireille Haon
- INRAE, Aix Marseille Université, UMR1163 Biodiversité et Biotechnologie Fongiques (BBF), 13009, Marseille, France
- INRAE, Aix Marseille Université, 3PE Platform, 13009, Marseille, France
| | - Sacha Grisel
- INRAE, Aix Marseille Université, UMR1163 Biodiversité et Biotechnologie Fongiques (BBF), 13009, Marseille, France
- INRAE, Aix Marseille Université, 3PE Platform, 13009, Marseille, France
| | - Malte Petersen
- High Performance Computing & Analytics Lab, University of Bonn, Friedrich-Hirzebruch-Allee 8, 53115, Bonn, Germany
| | - Jean-Guy Berrin
- INRAE, Aix Marseille Université, UMR1163 Biodiversité et Biotechnologie Fongiques (BBF), 13009, Marseille, France
- INRAE, Aix Marseille Université, 3PE Platform, 13009, Marseille, France
| | - Pierre-Marc Delaux
- Laboratoire de Recherche en Sciences Végétales (LRSV), Université de Toulouse, CNRS, UPS, INP, Toulouse, 31320, Castanet-Tolosan, France.
| | - Francesco Dal Grande
- Senckenberg Biodiversity and Climate Research Centre (SBiK-F), Senckenberganlage 25, 60325, Frankfurt am Main, Germany.
- LOEWE Centre for Translational Biodiversity Genomics (TBG), Senckenberganlage 25, 60325, Frankfurt am Main, Germany.
- Department of Biology, University of Padova, Padua, Italy.
| | - Jean Keller
- Laboratoire de Recherche en Sciences Végétales (LRSV), Université de Toulouse, CNRS, UPS, INP, Toulouse, 31320, Castanet-Tolosan, France.
- Department of Insect Symbiosis, Max Planck Institute for Chemical Ecology, 07745, Jena, Germany.
| |
Collapse
|
6
|
Liu Q, Xiong G, Wang Z, Wu Y, Tu T, Schwarzacher T, Heslop-Harrison JS. Chromosome-level genome assembly of the diploid oat species Avena longiglumis. Sci Data 2024; 11:412. [PMID: 38649380 PMCID: PMC11035610 DOI: 10.1038/s41597-024-03248-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Accepted: 04/10/2024] [Indexed: 04/25/2024] Open
Abstract
Diploid wild oat Avena longiglumis has nutritional and adaptive traits which are valuable for common oat (A. sativa) breeding. The combination of Illumina, Nanopore and Hi-C data allowed us to assemble a high-quality chromosome-level genome of A. longiglumis (ALO), evidenced by contig N50 of 12.68 Mb with 99% BUSCO completeness for the assembly size of 3,960.97 Mb. A total of 40,845 protein-coding genes were annotated. The assembled genome was composed of 87.04% repetitive DNA sequences. Dotplots of the genome assembly (PI657387) with two published ALO genomes were compared to indicate the conservation of gene order and equal expansion of all syntenic blocks among three genome assemblies. Two recent whole-genome duplication events were characterized in genomes of diploid Avena species. These findings provide new knowledge for the genomic features of A. longiglumis, give information about the species diversity, and will accelerate the functional genomics and breeding studies in oat and related cereal crops.
Collapse
Affiliation(s)
- Qing Liu
- State Key Laboratory of Plant Diversity and Specialty Crops / Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China.
- South China National Botanical Garden, Guangzhou, China.
| | - Gui Xiong
- State Key Laboratory of Plant Diversity and Specialty Crops / Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Ziwei Wang
- School of Biology and Agriculture, Shaoguan University, Shaoguan, China
| | - Yongxing Wu
- College of Agriculture, South China Agricultural University, Guangzhou, China
| | - Tieyao Tu
- State Key Laboratory of Plant Diversity and Specialty Crops / Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- South China National Botanical Garden, Guangzhou, China
| | - Trude Schwarzacher
- State Key Laboratory of Plant Diversity and Specialty Crops / Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- South China National Botanical Garden, Guangzhou, China
- University of Leicester, Department of Genetics and Genome Biology, Institute for Environmental Futures, Leicester, UK
| | - John Seymour Heslop-Harrison
- State Key Laboratory of Plant Diversity and Specialty Crops / Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China.
- South China National Botanical Garden, Guangzhou, China.
- University of Leicester, Department of Genetics and Genome Biology, Institute for Environmental Futures, Leicester, UK.
| |
Collapse
|
7
|
Sacchi B, Humphries Z, Kružlicová J, Bodláková M, Pyne C, Choudhury BI, Gong Y, Bačovský V, Hobza R, Barrett SCH, Wright SI. Phased Assembly of Neo-Sex Chromosomes Reveals Extensive Y Degeneration and Rapid Genome Evolution in Rumex hastatulus. Mol Biol Evol 2024; 41:msae074. [PMID: 38606901 PMCID: PMC11057207 DOI: 10.1093/molbev/msae074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 01/31/2024] [Accepted: 04/05/2024] [Indexed: 04/13/2024] Open
Abstract
Y chromosomes are thought to undergo progressive degeneration due to stepwise loss of recombination and subsequent reduction in selection efficiency. However, the timescales and evolutionary forces driving degeneration remain unclear. To investigate the evolution of sex chromosomes on multiple timescales, we generated a high-quality phased genome assembly of the massive older (<10 MYA) and neo (<200,000 yr) sex chromosomes in the XYY cytotype of the dioecious plant Rumex hastatulus and a hermaphroditic outgroup Rumex salicifolius. Our assemblies, supported by fluorescence in situ hybridization, confirmed that the neo-sex chromosomes were formed by two key events: an X-autosome fusion and a reciprocal translocation between the homologous autosome and the Y chromosome. The enormous sex-linked regions of the X (296 Mb) and two Y chromosomes (503 Mb) both evolved from large repeat-rich genomic regions with low recombination; however, the complete loss of recombination on the Y still led to over 30% gene loss and major rearrangements. In the older sex-linked region, there has been a significant increase in transposable element abundance, even into and near genes. In the neo-sex-linked regions, we observed evidence of extensive rearrangements without gene degeneration and loss. Overall, we inferred significant degeneration during the first 10 million years of Y chromosome evolution but not on very short timescales. Our results indicate that even when sex chromosomes emerge from repetitive regions of already-low recombination, the complete loss of recombination on the Y chromosome still leads to a substantial increase in repetitive element content and gene degeneration.
Collapse
Affiliation(s)
- Bianca Sacchi
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada
| | - Zoë Humphries
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada
| | - Jana Kružlicová
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic
- National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Markéta Bodláková
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic
| | - Cassandre Pyne
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada
| | - Baharul I Choudhury
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada
- Department of Biology, Queen’s University, Kingston, Canada
| | - Yunchen Gong
- Centre for Analysis of Genome Evolution and Function, University of Toronto, Toronto, Canada
| | - Václav Bačovský
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic
| | - Roman Hobza
- Department of Plant Developmental Genetics, Institute of Biophysics of the Czech Academy of Sciences, Brno, Czech Republic
| | - Spencer C H Barrett
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada
| | - Stephen I Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada
- Centre for Analysis of Genome Evolution and Function, University of Toronto, Toronto, Canada
| |
Collapse
|
8
|
Ma PF, Liu YL, Guo C, Jin G, Guo ZH, Mao L, Yang YZ, Niu LZ, Wang YJ, Clark LG, Kellogg EA, Xu ZC, Ye XY, Liu JX, Zhou MY, Luo Y, Yang Y, Soltis DE, Bennetzen JL, Soltis PS, Li DZ. Genome assemblies of 11 bamboo species highlight diversification induced by dynamic subgenome dominance. Nat Genet 2024; 56:710-720. [PMID: 38491323 PMCID: PMC11018529 DOI: 10.1038/s41588-024-01683-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 02/08/2024] [Indexed: 03/18/2024]
Abstract
Polyploidy (genome duplication) is a pivotal force in evolution. However, the interactions between parental genomes in a polyploid nucleus, frequently involving subgenome dominance, are poorly understood. Here we showcase analyses of a bamboo system (Poaceae: Bambusoideae) comprising a series of lineages from diploid (herbaceous) to tetraploid and hexaploid (woody), with 11 chromosome-level de novo genome assemblies and 476 transcriptome samples. We find that woody bamboo subgenomes exhibit stunning karyotype stability, with parallel subgenome dominance in the two tetraploid clades and a gradual shift of dominance in the hexaploid clade. Allopolyploidization and subgenome dominance have shaped the evolution of tree-like lignified culms, rapid growth and synchronous flowering characteristic of woody bamboos as large grasses. Our work provides insights into genome dominance in a remarkable polyploid system, including its dependence on genomic context and its ability to switch which subgenomes are dominant over evolutionary time.
Collapse
Affiliation(s)
- Peng-Fei Ma
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Yun-Long Liu
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Cen Guo
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
- Center for Integrative Conservation & Yunnan Key Laboratory for the Conservation of Tropical Rainforests and Asian Elephants, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Menglun, Mengla, Yunnan, China
| | - Guihua Jin
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Zhen-Hua Guo
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Ling Mao
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
- Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Yi-Zhou Yang
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
- Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Liang-Zhong Niu
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Yu-Jiao Wang
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Lynn G Clark
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, 345 Bessey, Ames, IA, USA
| | | | - Zu-Chang Xu
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
- Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Xia-Ying Ye
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Jing-Xia Liu
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Meng-Yuan Zhou
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Yan Luo
- Center for Integrative Conservation & Yunnan Key Laboratory for the Conservation of Tropical Rainforests and Asian Elephants, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Menglun, Mengla, Yunnan, China
| | - Yang Yang
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Douglas E Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
- Department of Biology, University of Florida, Gainesville, FL, USA
| | | | - Pamela S Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
| | - De-Zhu Li
- Germplasm Bank of Wild Species & Yunnan Key Laboratory of Crop Wild Relatives Omics, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China.
- Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, Yunnan, China.
- Key Laboratory for Plant Diversity and Biogeography in East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan, China.
| |
Collapse
|
9
|
Cuello C, Jansen HJ, Abdallah C, Zamar Mbadinga DL, Birer Williams C, Durand M, Oudin A, Papon N, Giglioli-Guivarc'h N, Dirks RP, Jensen MK, O'Connor SE, Besseau S, Courdavault V. The Madagascar palm genome provides new insights on the evolution of Apocynaceae specialized metabolism. Heliyon 2024; 10:e28078. [PMID: 38533072 PMCID: PMC10963385 DOI: 10.1016/j.heliyon.2024.e28078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 03/07/2024] [Accepted: 03/11/2024] [Indexed: 03/28/2024] Open
Abstract
Specialized metabolites possess diverse interesting biological activities and some cardenolides- and monoterpene indole alkaloids- (MIAs) derived pharmaceuticals are currently used to treat human diseases such as cancers or hypertension. While these two families of biocompounds are produced by specific subfamilies of Apocynaceae, one member of this medicinal plant family, the succulent tree Pachypodium lamerei Drake (also known as Madagascar palm), does not produce such specialized metabolites. To explore the evolutionary paths that have led to the emergence and loss of cardenolide and MIA biosynthesis in Apocynaceae, we sequenced and assembled the P. lamerei genome by combining Oxford Nanopore Technologies long-reads and Illumina short-reads. Phylogenomics revealed that, among the Apocynaceae whose genomes have been sequenced, the Madagascar palm is so far the species closest to the common ancestor between MIA producers/non-MIA producers. Transposable elements, constituting 72.48% of the genome, emerge as potential key players in shaping genomic architecture and influencing specialized metabolic pathways. The absence of crucial MIA biosynthetic genes such as strictosidine synthase in P. lamerei and non-Rauvolfioideae species hints at a transposon-mediated mechanism behind gene loss. Phylogenetic analysis not only showcases the evolutionary divergence of specialized metabolite biosynthesis within Apocynaceae but also underscores the role of transposable elements in this intricate process. Moreover, we shed light on the low conservation of enzymes involved in the final stages of MIA biosynthesis in the distinct MIA-producing plant families, inferring independent gains of these specialized enzymes along the evolution of these medicinal plant clades. Overall, this study marks a leap forward in understanding the genomic dynamics underpinning the evolution of specialized metabolites biosynthesis in the Apocynaceae family, with transposons emerging as potential architects of genomics restructuring and gene loss.
Collapse
Affiliation(s)
- Clément Cuello
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200, Tours, France
| | - Hans J. Jansen
- Future Genomics Technologies, 2333 BE, Leiden, the Netherlands
| | - Cécile Abdallah
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200, Tours, France
| | | | - Caroline Birer Williams
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200, Tours, France
| | - Mickael Durand
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200, Tours, France
| | - Audrey Oudin
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200, Tours, France
| | - Nicolas Papon
- Univ Angers, Univ Brest, IRF, SFR ICAT, F-49000, Angers, France
| | | | - Ron P. Dirks
- Future Genomics Technologies, 2333 BE, Leiden, the Netherlands
| | - Michael Krogh Jensen
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs, Lyngby, Denmark
| | - Sarah Ellen O'Connor
- Department of Natural Product Biosynthesis, Max Planck Institute for Chemical Ecology, Jena, 07745, Germany
| | - Sébastien Besseau
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200, Tours, France
| | - Vincent Courdavault
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200, Tours, France
| |
Collapse
|
10
|
Yocca A, Akinyuwa M, Bailey N, Cliver B, Estes H, Guillemette A, Hasannin O, Hutchison J, Jenkins W, Kaur I, Khanna RR, Loftin M, Lopes L, Moore-Pollard E, Olofintila O, Oyebode GO, Patel J, Thapa P, Waldinger M, Zhang J, Zhang Q, Goertzen L, Carey SB, Hargarten H, Mattheis J, Zhang H, Jones T, Boston L, Grimwood J, Ficklin S, Honaas L, Harkess A. A chromosome-scale assembly for 'd'Anjou' pear. G3 (BETHESDA, MD.) 2024; 14:jkae003. [PMID: 38190814 PMCID: PMC10917493 DOI: 10.1093/g3journal/jkae003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 07/27/2023] [Accepted: 12/01/2023] [Indexed: 01/10/2024]
Abstract
Cultivated pear consists of several Pyrus species with Pyrus communis (European pear) representing a large fraction of worldwide production. As a relatively recently domesticated crop and perennial tree, pear can benefit from genome-assisted breeding. Additionally, comparative genomics within Rosaceae promises greater understanding of evolution within this economically important family. Here, we generate a fully phased chromosome-scale genome assembly of P. communis 'd'Anjou.' Using PacBio HiFi and Dovetail Omni-C reads, the genome is resolved into the expected 17 chromosomes, with each haplotype totaling nearly 540 Megabases and a contig N50 of nearly 14 Mb. Both haplotypes are highly syntenic to each other and to the Malus domestica 'Honeycrisp' apple genome. Nearly 45,000 genes were annotated in each haplotype, over 90% of which have direct RNA-seq expression evidence. We detect signatures of the known whole-genome duplication shared between apple and pear, and we estimate 57% of d'Anjou genes are retained in duplicate derived from this event. This genome highlights the value of generating phased diploid assemblies for recovering the full allelic complement in highly heterozygous crop species.
Collapse
Affiliation(s)
- Alan Yocca
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Mary Akinyuwa
- Department of Entomology and Plant Pathology, Auburn University, Auburn, AL 36849, USA
| | - Nick Bailey
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Brannan Cliver
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Harrison Estes
- Department of Entomology and Plant Pathology, Auburn University, Auburn, AL 36849, USA
| | | | - Omar Hasannin
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Jennifer Hutchison
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Wren Jenkins
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Ishveen Kaur
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | | | - Madelene Loftin
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Lauren Lopes
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Erika Moore-Pollard
- Department of Biological Sciences, University of Memphis, Memphis, TN 38152-3530, USA
| | | | | | - Jinesh Patel
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Parbati Thapa
- Department of Entomology and Plant Pathology, Auburn University, Auburn, AL 36849, USA
| | - Martin Waldinger
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Jie Zhang
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Qiong Zhang
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Leslie Goertzen
- Department of Biological Sciences, Auburn University, Auburn, AL 36849, USA
| | - Sarah B Carey
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Heidi Hargarten
- Physiology and Pathology of Tree Fruits Research Laboratory, USDA ARS, Wenatchee, WA 98801, USA
| | - James Mattheis
- Physiology and Pathology of Tree Fruits Research Laboratory, USDA ARS, Wenatchee, WA 98801, USA
| | - Huiting Zhang
- Physiology and Pathology of Tree Fruits Research Laboratory, USDA ARS, Wenatchee, WA 98801, USA
- Department of Horticulture, Washington State University, Pullman, WA 99164-6414, USA
| | - Teresa Jones
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
- HudsonAlpha Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - LoriBeth Boston
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
- HudsonAlpha Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Jane Grimwood
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
- HudsonAlpha Genome Sequencing Center, HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| | - Stephen Ficklin
- Department of Horticulture, Washington State University, Pullman, WA 99164-6414, USA
| | - Loren Honaas
- Physiology and Pathology of Tree Fruits Research Laboratory, USDA ARS, Wenatchee, WA 98801, USA
| | - Alex Harkess
- HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, USA
| |
Collapse
|
11
|
Hirabayashi K, Debnath SC, Owens GL. Unveiling the evolutionary history of lingonberry (Vaccinium vitis-idaea L.) through genome sequencing and assembly of European and North American subspecies. G3 (BETHESDA, MD.) 2024; 14:jkad294. [PMID: 38142435 PMCID: PMC10917501 DOI: 10.1093/g3journal/jkad294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 10/23/2023] [Accepted: 12/18/2023] [Indexed: 12/26/2023]
Abstract
Lingonberry (Vaccinium vitis-idaea L.) produces tiny red berries that are tart and nutty in flavor. It grows widely in the circumpolar region, including Scandinavia, northern parts of Eurasia, Alaska, and Canada. Although cultivation is currently limited, the plant has a long history of cultural use among indigenous communities. Given its potential as a food source, genomic resources for lingonberry are significantly lacking. To advance genomic knowledge, the genomes for 2 subspecies of lingonberry (V. vitis-idaea ssp. minus and ssp. vitis-idaea var. 'Red Candy') were sequenced and de novo assembled into contig-level assemblies. The assemblies were scaffolded using the bilberry genome (Vaccinium myrtillus) to generate a chromosome-anchored reference genome consisting of 12 chromosomes each with a total length of 548.07 Mb [contig N50 = 1.17 Mb, BUSCO (C%) = 96.5%] for ssp. vitis-idaea and 518.70 Mb [contig N50 = 1.40 Mb, BUSCO (C%) = 96.9%] for ssp. minus. RNA-seq-based gene annotation identified 27,243 and 25,718 genes on the respective assembly, and transposable element detection methods found that 45.82 and 44.58% of the genome were repeats. Phylogenetic analysis confirmed that lingonberry was most closely related to bilberry and was more closely related to blueberries than cranberries. Estimates of past effective population size suggested a continuous decline over the past 1-3 MYA, possibly due to the impacts of repeated glacial cycles during the Pleistocene leading to frequent population fragmentation. The genomic resource created in this study can be used to identify industry-relevant genes (e.g. anthocyanin production), infer phylogeny, and call sequence-level variants (e.g. SNPs) in future research.
Collapse
Affiliation(s)
- Kaede Hirabayashi
- Department of Biology, University of Victoria, 3800 Finnerty Road, Victoria, BC V8W 2Y2, Canada
| | - Samir C Debnath
- Agriculture and Agri-Food Canada, St.John's Research and Development Centre, 204 Brookfield Road, St. John’s, Newfoundland and Labrador L A1E 0B2, Canada
| | - Gregory L Owens
- Department of Biology, University of Victoria, 3800 Finnerty Road, Victoria, BC V8W 2Y2, Canada
| |
Collapse
|
12
|
Guiglielmoni N, Villegas LI, Kirangwa J, Schiffer PH. Revisiting genomes of non-model species with long reads yields new insights into their biology and evolution. Front Genet 2024; 15:1308527. [PMID: 38384712 PMCID: PMC10879605 DOI: 10.3389/fgene.2024.1308527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 01/04/2024] [Indexed: 02/23/2024] Open
Abstract
High-quality genomes obtained using long-read data allow not only for a better understanding of heterozygosity levels, repeat content, and more accurate gene annotation and prediction when compared to those obtained with short-read technologies, but also allow to understand haplotype divergence. Advances in long-read sequencing technologies in the last years have made it possible to produce such high-quality assemblies for non-model organisms. This allows us to revisit genomes, which have been problematic to scaffold to chromosome-scale with previous generations of data and assembly software. Nematoda, one of the most diverse and speciose animal phyla within metazoans, remains poorly studied, and many previously assembled genomes are fragmented. Using long reads obtained with Nanopore R10.4.1 and PacBio HiFi, we generated highly contiguous assemblies of a diploid nematode of the Mermithidae family, for which no closely related genomes are available to date, as well as a collapsed assembly and a phased assembly for a triploid nematode from the Panagrolaimidae family. Both genomes had been analysed before, but the fragmented assemblies had scaffold sizes comparable to the length of long reads prior to assembly. Our new assemblies illustrate how long-read technologies allow for a much better representation of species genomes. We are now able to conduct more accurate downstream assays based on more complete gene and transposable element predictions.
Collapse
|
13
|
Rossi FPN, Flores VS, Uceda-Campos G, Amgarten DE, Setubal JC, da Silva AM. Comparative Analyses of Bacteriophage Genomes. Methods Mol Biol 2024; 2802:427-453. [PMID: 38819567 DOI: 10.1007/978-1-0716-3838-5_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]
Abstract
Bacterial viruses (bacteriophages or phages) are the most abundant and diverse biological entities on Earth. There is a renewed worldwide interest in phage-centered research motivated by their enormous potential as antimicrobials to cope with multidrug-resistant pathogens. An ever-growing number of complete phage genomes are becoming available, derived either from newly isolated phages (cultivated phages) or recovered from metagenomic sequencing data (uncultivated phages). Robust comparative analysis is crucial for a comprehensive understanding of genotypic variations of phages and their related evolutionary processes, and to investigate the interaction mechanisms between phages and their hosts. In this chapter, we present a protocol for phage comparative genomics employing tools selected out of the many currently available, focusing on complete genomes of phages classified in the class Caudoviricetes. This protocol provides accurate identification of similarities, differences, and patterns among new and previously known complete phage genomes as well as phage clustering and taxonomic classification.
Collapse
Affiliation(s)
| | - Vinicius Sousa Flores
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Sao Paulo, SP, Brazil
| | - Guillermo Uceda-Campos
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Sao Paulo, SP, Brazil
| | | | - João Carlos Setubal
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Sao Paulo, SP, Brazil
| | - Aline Maria da Silva
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Sao Paulo, SP, Brazil.
| |
Collapse
|
14
|
Liu Q, Ye L, Li M, Wang Z, Xiong G, Ye Y, Tu T, Schwarzacher T, Heslop-Harrison JSP. Genome-wide expansion and reorganization during grass evolution: from 30 Mb chromosomes in rice and Brachypodium to 550 Mb in Avena. BMC PLANT BIOLOGY 2023; 23:627. [PMID: 38062402 PMCID: PMC10704644 DOI: 10.1186/s12870-023-04644-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Accepted: 11/29/2023] [Indexed: 12/18/2023]
Abstract
BACKGROUND The BOP (Bambusoideae, Oryzoideae, and Pooideae) clade of the Poaceae has a common ancestor, with similarities to the genomes of rice, Oryza sativa (2n = 24; genome size 389 Mb) and Brachypodium, Brachypodium distachyon (2n = 10; 271 Mb). We exploit chromosome-scale genome assemblies to show the nature of genomic expansion, structural variation, and chromosomal rearrangements from rice and Brachypodium, to diploids in the tribe Aveneae (e.g., Avena longiglumis, 2n = 2x = 14; 3,961 Mb assembled to 3,850 Mb in chromosomes). RESULTS Most of the Avena chromosome arms show relatively uniform expansion over the 10-fold to 15-fold genome-size increase. Apart from non-coding sequence diversification and accumulation around the centromeres, blocks of genes are not interspersed with blocks of repeats, even in subterminal regions. As in the tribe Triticeae, blocks of conserved synteny are seen between the analyzed species with chromosome fusion, fission, and nesting (insertion) events showing deep evolutionary conservation of chromosome structure during genomic expansion. Unexpectedly, the terminal gene-rich chromosomal segments (representing about 50 Mb) show translocations between chromosomes during speciation, with homogenization of genome-specific repetitive elements within the tribe Aveneae. Newly-formed intergenomic translocations of similar extent are found in the hexaploid A. sativa. CONCLUSIONS The study provides insight into evolutionary mechanisms and speciation in the BOP clade, which is valuable for measurement of biodiversity, development of a clade-wide pangenome, and exploitation of genomic diversity through breeding programs in Poaceae.
Collapse
Affiliation(s)
- Qing Liu
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China.
- South China National Botanical Garden, Guangzhou, 510650, China.
- Center for Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou, 510650, China.
| | - Lyuhan Ye
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Mingzhi Li
- Bio&Data Biotechnologies Co. Ltd, Guangzhou, 510663, China
| | - Ziwei Wang
- Henry Fok School of Biology and Agriculture, Shaoguan University, Shaoguan, 512005, China
| | - Gui Xiong
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yushi Ye
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- South China National Botanical Garden, Guangzhou, 510650, China
| | - Tieyao Tu
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- South China National Botanical Garden, Guangzhou, 510650, China
- Center for Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou, 510650, China
| | - Trude Schwarzacher
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
- Department of Genetics and Genome Biology, Institute for Environmental Futures, University of Leicester, Leicester, LE1 7RH, UK
| | - John Seymour Pat Heslop-Harrison
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China.
- Department of Genetics and Genome Biology, Institute for Environmental Futures, University of Leicester, Leicester, LE1 7RH, UK.
| |
Collapse
|
15
|
Gao D. Introduction of Plant Transposon Annotation for Beginners. BIOLOGY 2023; 12:1468. [PMID: 38132293 PMCID: PMC10741241 DOI: 10.3390/biology12121468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 11/21/2023] [Accepted: 11/23/2023] [Indexed: 12/23/2023]
Abstract
Transposons are mobile DNA sequences that contribute large fractions of many plant genomes. They provide exclusive resources for tracking gene and genome evolution and for developing molecular tools for basic and applied research. Despite extensive efforts, it is still challenging to accurately annotate transposons, especially for beginners, as transposon prediction requires necessary expertise in both transposon biology and bioinformatics. Moreover, the complexity of plant genomes and the dynamic evolution of transposons also bring difficulties for genome-wide transposon discovery. This review summarizes the three major strategies for transposon detection including repeat-based, structure-based, and homology-based annotation, and introduces the transposon superfamilies identified in plants thus far, and some related bioinformatics resources for detecting plant transposons. Furthermore, it describes transposon classification and explains why the terms 'autonomous' and 'non-autonomous' cannot be used to classify the superfamilies of transposons. Lastly, this review also discusses how to identify misannotated transposons and improve the quality of the transposon database. This review provides helpful information about plant transposons and a beginner's guide on annotating these repetitive sequences.
Collapse
Affiliation(s)
- Dongying Gao
- Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID 83210, USA
| |
Collapse
|
16
|
Yocca AE, Platts A, Alger E, Teresi S, Mengist MF, Benevenuto J, Ferrão LFV, Jacobs M, Babinski M, Magallanes-Lundback M, Bayer P, Golicz A, Humann JL, Main D, Espley RV, Chagné D, Albert NW, Montanari S, Vorsa N, Polashock J, Díaz-Garcia L, Zalapa J, Bassil NV, Munoz PR, Iorizzo M, Edger PP. Blueberry and cranberry pangenomes as a resource for future genetic studies and breeding efforts. HORTICULTURE RESEARCH 2023; 10:uhad202. [PMID: 38023484 PMCID: PMC10673653 DOI: 10.1093/hr/uhad202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Accepted: 10/01/2023] [Indexed: 12/01/2023]
Abstract
Domestication of cranberry and blueberry began in the United States in the early 1800s and 1900s, respectively, and in part owing to their flavors and health-promoting benefits are now cultivated and consumed worldwide. The industry continues to face a wide variety of production challenges (e.g. disease pressures), as well as a demand for higher-yielding cultivars with improved fruit quality characteristics. Unfortunately, molecular tools to help guide breeding efforts for these species have been relatively limited compared with those for other high-value crops. Here, we describe the construction and analysis of the first pangenome for both blueberry and cranberry. Our analysis of these pangenomes revealed both crops exhibit great genetic diversity, including the presence-absence variation of 48.4% genes in highbush blueberry and 47.0% genes in cranberry. Auxiliary genes, those not shared by all cultivars, are significantly enriched with molecular functions associated with disease resistance and the biosynthesis of specialized metabolites, including compounds previously associated with improving fruit quality traits. The discovery of thousands of genes, not present in the previous reference genomes for blueberry and cranberry, will serve as the basis of future research and as potential targets for future breeding efforts. The pangenome, as a multiple-sequence alignment, as well as individual annotated genomes, are publicly available for analysis on the Genome Database for Vaccinium-a curated and integrated web-based relational database. Lastly, the core-gene predictions from the pangenomes will serve useful to develop a community genotyping platform to guide future molecular breeding efforts across the family.
Collapse
Affiliation(s)
- Alan E Yocca
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, United States
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, United States
| | - Adrian Platts
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, United States
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, United States
| | - Elizabeth Alger
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, United States
| | - Scott Teresi
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, United States
- Genetics and Genome Sciences, Michigan State University, East Lansing, MI, 48824, United States
| | - Molla F Mengist
- Plants for Human Health Institute, North Carolina State University, Kannapolis, NC United States
| | - Juliana Benevenuto
- Horticultural Sciences Department, University of Florida, Gainesville, FL 32611, United States
| | - Luis Felipe V Ferrão
- Horticultural Sciences Department, University of Florida, Gainesville, FL 32611, United States
| | - MacKenzie Jacobs
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, United States
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, 48824, United States
| | - Michal Babinski
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, United States
| | | | - Philipp Bayer
- University of Western Australia, Perth 6009Australia
| | | | - Jodi L Humann
- Department of Horticulture, Washington State University, Pullman, WA, 99163, United States
| | - Dorrie Main
- Department of Horticulture, Washington State University, Pullman, WA, 99163, United States
| | - Richard V Espley
- The New Zealand Institute for Plant and Food Research Limited (PFR), Auckland, New Zealand
| | - David Chagné
- The New Zealand Institute for Plant and Food Research Limited (PFR), Palmerston, New Zealand
| | - Nick W Albert
- The New Zealand Institute for Plant and Food Research Limited (PFR), Palmerston, New Zealand
| | - Sara Montanari
- The New Zealand Institute for Plant and Food Research Limited (PFR), Motueka, New Zealand
| | - Nicholi Vorsa
- SEBS, Plant Biology, Rutgers University, New Brunswick NJ 01019United States
| | - James Polashock
- SEBS, Plant Biology, Rutgers University, New Brunswick NJ 01019United States
| | - Luis Díaz-Garcia
- Department of Viticulture and Enology, University of California, Davis, Davis, CA 95616, United States
| | - Juan Zalapa
- Department of Viticulture and Enology, University of California, Davis, Davis, CA 95616, United States
| | - Nahla V Bassil
- National Clonal Germplasm Repository, USDA-ARS, Corvallis, OR 97333, United States
| | - Patricio R Munoz
- Horticultural Sciences Department, University of Florida, Gainesville, FL 32611, United States
| | - Massimo Iorizzo
- Plants for Human Health Institute, North Carolina State University, Kannapolis, NCUnited States
- Department of Horticulture, North Carolina State University, Kannapolis, NCUnited States
| | - Patrick P Edger
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, United States
- Genetics and Genome Sciences, Michigan State University, East Lansing, MI, 48824, United States
- MSU AgBioResearch, Michigan State University, East Lansing, MI, 48824, United States
| |
Collapse
|
17
|
Ouadi S, Sierro N, Kessler F, Ivanov NV. Chromosome-scale assemblies of S. malaccense, S. aqueum, S. jambos, and S. syzygioides provide insights into the evolution of Syzygium genomes. FRONTIERS IN PLANT SCIENCE 2023; 14:1248780. [PMID: 37868305 PMCID: PMC10587690 DOI: 10.3389/fpls.2023.1248780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 08/28/2023] [Indexed: 10/24/2023]
Abstract
Syzygium is a large and diverse tree genus in the Myrtaceae family. Genome assemblies for clove (Syzygium aromaticum, 370 Mb) and sea apple (Syzygium grande, 405 Mb) provided the first insights into the genomic features and evolution of the Syzygium genus. Here, we present additional de novo chromosome-scale genome assemblies for Syzygium malaccense, Syzygium aqueum, Syzygium jambos, and Syzygium syzygioides. Genome profiling analyses show that S. malaccense, like S. aromaticum and S. grande, is diploid (2n = 2x = 22), while the S. aqueum, S. jambos, and S. syzygioides specimens are autotetraploid (2n = 4x = 44). The genome assemblies of S. malaccense (430 Mb), S. aqueum (392 Mb), S. jambos (426 Mb), and S. syzygioides (431 Mb) are highly complete (BUSCO scores of 98%). Comparative genomics analyses showed conserved organization of the 11 chromosomes with S. aromaticum and S. grande, and revealed species-specific evolutionary dynamics of the long terminal repeat retrotransposon elements belonging to the Gypsy and Copia lineages. This set of Syzygium genomes is a valuable resource for future structural and functional comparative genomic studies on Myrtaceae species.
Collapse
Affiliation(s)
- Sonia Ouadi
- Faculty of Sciences, Laboratory of Plant Physiology, University of Neuchâtel, Neuchâtel, Switzerland
- Philip Morris International R&D, Philip Morris Products S.A., Neuchâtel, Switzerland
| | - Nicolas Sierro
- Philip Morris International R&D, Philip Morris Products S.A., Neuchâtel, Switzerland
| | - Felix Kessler
- Faculty of Sciences, Laboratory of Plant Physiology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Nikolai V Ivanov
- Faculty of Sciences, Laboratory of Plant Physiology, University of Neuchâtel, Neuchâtel, Switzerland
- Philip Morris International R&D, Philip Morris Products S.A., Neuchâtel, Switzerland
| |
Collapse
|
18
|
Mokhtar MM, El Allali A. MegaLTR: a web server and standalone pipeline for detecting and annotating LTR-retrotransposons in plant genomes. FRONTIERS IN PLANT SCIENCE 2023; 14:1237426. [PMID: 37810401 PMCID: PMC10552921 DOI: 10.3389/fpls.2023.1237426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 08/21/2023] [Indexed: 10/10/2023]
Abstract
LTR-retrotransposons (LTR-RTs) are a class of RNA-replicating transposon elements (TEs) that can alter genome structure and function by moving positions, repositioning genes, shifting exons, and causing chromosomal rearrangements. LTR-RTs are widespread in many plant genomes and constitute a significant portion of the genome. Their movement and activity in eukaryotic genomes can provide insight into genome evolution and gene function, especially when LTR-RTs are located near or within genes. Building the redundant and non-redundant LTR-RTs libraries and their annotations for species lacking this resource requires extensive bioinformatics pipelines and expensive computing power to analyze large amounts of genomic data. This increases the need for online services that provide computational resources with minimal overhead and maximum efficiency. Here, we present MegaLTR as a web server and standalone pipeline that detects intact LTR-RTs at the whole-genome level and integrates multiple tools for structure-based, homologybased, and de novo identification, classification, annotation, insertion time determination, and LTR-RT gene chimera analysis. MegaLTR also provides statistical analysis and visualization with multiple tools and can be used to accelerate plant species discovery and assist breeding programs in their efforts to improve genomic resources. We hope that the development of online services such as MegaLTR, which can analyze large amounts of genomic data, will become increasingly important for the automated detection and annotation of LTR-RT elements.
Collapse
Affiliation(s)
- Morad M. Mokhtar
- African Genome Center, Mohammed VI Polytechnic University, Benguerir, Morocco
| | - Achraf El Allali
- African Genome Center, Mohammed VI Polytechnic University, Benguerir, Morocco
| |
Collapse
|
19
|
Liao X, Zhu W, Zhou J, Li H, Xu X, Zhang B, Gao X. Repetitive DNA sequence detection and its role in the human genome. Commun Biol 2023; 6:954. [PMID: 37726397 PMCID: PMC10509279 DOI: 10.1038/s42003-023-05322-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 09/04/2023] [Indexed: 09/21/2023] Open
Abstract
Repetitive DNA sequences playing critical roles in driving evolution, inducing variation, and regulating gene expression. In this review, we summarized the definition, arrangement, and structural characteristics of repeats. Besides, we introduced diverse biological functions of repeats and reviewed existing methods for automatic repeat detection, classification, and masking. Finally, we analyzed the type, structure, and regulation of repeats in the human genome and their role in the induction of complex diseases. We believe that this review will facilitate a comprehensive understanding of repeats and provide guidance for repeat annotation and in-depth exploration of its association with human diseases.
Collapse
Affiliation(s)
- Xingyu Liao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Wufei Zhu
- Department of Endocrinology, Yichang Central People's Hospital, The First College of Clinical Medical Science, China Three Gorges University, 443000, Yichang, P.R. China
| | - Juexiao Zhou
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Haoyang Li
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Xiaopeng Xu
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Bin Zhang
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Xin Gao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia.
| |
Collapse
|
20
|
Wagner CI, Kopp MEL, Thorburn J, Jones CS, Hoarau G, Noble LR. Characteristics of the spiny dogfish (Squalus acanthias) nuclear genome. G3 (BETHESDA, MD.) 2023; 13:jkad146. [PMID: 37395764 PMCID: PMC10468316 DOI: 10.1093/g3journal/jkad146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 02/28/2023] [Accepted: 06/12/2023] [Indexed: 07/04/2023]
Abstract
Sequenced shark nuclear genomes are underrepresented, with reference genomes available for only four out of nine orders so far. Here, we present the nuclear genome, with annotations, of the spiny dogfish (Squalus acanthias), a shark of interest to biomedical and conservation efforts, and the first representative of the second largest order of sharks (Squaliformes) with nuclear genome annotations available. Using Pacific Biosciences Continuous Long Read data in combination with Illumina paired-end and Hi-C sequencing, we assembled the genome de novo, followed by RNA-Seq-supported annotation. The final chromosome-level assembly is 3.7 Gb in size, has a BUSCO completeness score of 91.6%, and an error rate of less than 0.02%. Annotation predicted 33,283 gene models in the spiny dogfish's genome, of which 31,979 are functionally annotated.
Collapse
Affiliation(s)
- C Isabel Wagner
- Faculty of Biosciences and Aquaculture, Nord University, 8026 Bodø, Norway
| | - Martina E L Kopp
- Faculty of Biosciences and Aquaculture, Nord University, 8026 Bodø, Norway
| | - James Thorburn
- School of Biology, University of St Andrews, St Andrews, KY16 9ST, UK
- School of Applied Sciences, Edinburgh Napier University, Edinburgh, EH11 4BN, UK
| | - Catherine S Jones
- School of Biological Sciences, University of Aberdeen, Aberdeen, AB24 3FX, UK
| | - Galice Hoarau
- Faculty of Biosciences and Aquaculture, Nord University, 8026 Bodø, Norway
| | - Leslie R Noble
- Faculty of Biosciences and Aquaculture, Nord University, 8026 Bodø, Norway
| |
Collapse
|
21
|
Yocca AE, Platts A, Alger E, Teresi S, Mengist MF, Benevenuto J, Ferrão LFV, Jacobs M, Babinski M, Magallanes-Lundback M, Bayer P, Golicz A, Humann JL, Main D, Espley RV, Chagné D, Albert NW, Montanari S, Vorsa N, Polashock J, Díaz-Garcia L, Zalapa J, Bassil NV, Munoz PR, Iorizzo M, Edger PP. Blueberry and cranberry pangenomes as a resource for future genetic studies and breeding efforts. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.31.551392. [PMID: 37577683 PMCID: PMC10418200 DOI: 10.1101/2023.07.31.551392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
Domestication of cranberry and blueberry began in the United States in the early 1800s and 1900s, respectively, and in part owing to their flavors and health-promoting benefits are now cultivated and consumed worldwide. The industry continues to face a wide variety of production challenges (e.g. disease pressures) as well as a demand for higher-yielding cultivars with improved fruit quality characteristics. Unfortunately, molecular tools to help guide breeding efforts for these species have been relatively limited compared with those for other high-value crops. Here, we describe the construction and analysis of the first pangenome for both blueberry and cranberry. Our analysis of these pangenomes revealed both crops exhibit great genetic diversity, including the presence-absence variation of 48.4% genes in highbush blueberry and 47.0% genes in cranberry. Auxiliary genes, those not shared by all cultivars, are significantly enriched with molecular functions associated with disease resistance and the biosynthesis of specialized metabolites, including compounds previously associated with improving fruit quality traits. The discovery of thousands of genes, not present in the previous reference genomes for blueberry and cranberry, will serve as the basis of future research and as potential targets for future breeding efforts. The pangenome, as a multiple-sequence alignment, as well as individual annotated genomes, are publicly available for analysis on the Genome Database for Vaccinium - a curated and integrated web-based relational database. Lastly, the core-gene predictions from the pangenomes will serve useful to develop a community genotyping platform to guide future molecular breeding efforts across the family.
Collapse
Affiliation(s)
- Alan E. Yocca
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Adrian Platts
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Elizabeth Alger
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
| | - Scott Teresi
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
- Genetics and Genome Sciences, Michigan State University, East Lansing, MI, 48824, USA
| | - Molla F. Mengist
- Plants for Human Health Institute, North Carolina State University, Kannapolis, NC USA
| | - Juliana Benevenuto
- Horticultural Sciences Department, University of Florida, Gainesville, FL 32611, USA
| | - Luis Felipe V. Ferrão
- Horticultural Sciences Department, University of Florida, Gainesville, FL 32611, USA
| | - MacKenzie Jacobs
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Michal Babinski
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
| | | | - Philipp Bayer
- University of Western Australia, Perth 6009 Australia
| | | | - Jodi L Humann
- Department of Horticulture, Washington State University, Pullman, WA, 99163, USA
| | - Dorrie Main
- Department of Horticulture, Washington State University, Pullman, WA, 99163, USA
| | - Richard V. Espley
- The New Zealand Institute for Plant and Food Research Limited (PFR), Auckland, New Zealand
| | - David Chagné
- The New Zealand Institute for Plant and Food Research Limited (PFR), Palmerston, New Zealand
| | - Nick W. Albert
- The New Zealand Institute for Plant and Food Research Limited (PFR), Palmerston, New Zealand
| | - Sara Montanari
- The New Zealand Institute for Plant and Food Research Limited (PFR), Motueka, New Zealand
| | - Nicholi Vorsa
- SEBS, Plant Biology, Rutgers University, New Brunswick NJ 01019 USA
| | - James Polashock
- SEBS, Plant Biology, Rutgers University, New Brunswick NJ 01019 USA
| | - Luis Díaz-Garcia
- USDA-ARS, VCRU, Department of Horticulture, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Juan Zalapa
- USDA-ARS, VCRU, Department of Horticulture, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Nahla V. Bassil
- USDA-ARS, National Clonal Germplasm Repository, Corvallis, OR 97333, USA
| | - Patricio R. Munoz
- Horticultural Sciences Department, University of Florida, Gainesville, FL 32611, USA
| | - Massimo Iorizzo
- Plants for Human Health Institute, North Carolina State University, Kannapolis, NC USA
- Department of Horticulture, North Carolina State University, Kannapolis, NC USA
| | - Patrick P. Edger
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
- Genetics and Genome Sciences, Michigan State University, East Lansing, MI, 48824, USA
- MSU AgBioResearch, Michigan State University, East Lansing, MI, 48824, USA
| |
Collapse
|
22
|
Catto MA, Labadie PE, Jacobson AL, Kennedy GG, Srinivasan R, Hunt BG. Pest status, molecular evolution, and epigenetic factors derived from the genome assembly of Frankliniella fusca, a thysanopteran phytovirus vector. BMC Genomics 2023; 24:343. [PMID: 37344773 DOI: 10.1186/s12864-023-09375-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 05/13/2023] [Indexed: 06/23/2023] Open
Abstract
BACKGROUND The tobacco thrips (Frankliniella fusca Hinds; family Thripidae; order Thysanoptera) is an important pest that can transmit viruses such as the tomato spotted wilt orthotospovirus to numerous economically important agricultural row crops and vegetables. The structural and functional genomics within the order Thysanoptera has only begun to be explored. Within the > 7000 known thysanopteran species, the melon thrips (Thrips palmi Karny) and the western flower thrips (Frankliniella occidentalis Pergrande) are the only two thysanopteran species with assembled genomes. RESULTS A genome of F. fusca was assembled by long-read sequencing of DNA from an inbred line. The final assembly size was 370 Mb with a single copy ortholog completeness of ~ 99% with respect to Insecta. The annotated genome of F. fusca was compared with the genome of its congener, F. occidentalis. Results revealed many instances of lineage-specific differences in gene content. Analyses of sequence divergence between the two Frankliniella species' genomes revealed substitution patterns consistent with positive selection in ~ 5% of the protein-coding genes with 1:1 orthologs. Further, gene content related to its pest status, such as xenobiotic detoxification and response to an ambisense-tripartite RNA virus (orthotospovirus) infection was compared with F. occidentalis. Several F. fusca genes related to virus infection possessed signatures of positive selection. Estimation of CpG depletion, a mutational consequence of DNA methylation, revealed that F. fusca genes that were downregulated and alternatively spliced in response to virus infection were preferentially targeted by DNA methylation. As in many other insects, DNA methylation was enriched in exons in Frankliniella, but gene copies with homology to DNA methyltransferase 3 were numerous and fragmented. This phenomenon seems to be relatively unique to thrips among other insect groups. CONCLUSIONS The F. fusca genome assembly provides an important resource for comparative genomic analyses of thysanopterans. This genomic foundation allows for insights into molecular evolution, gene regulation, and loci important to agricultural pest status.
Collapse
Affiliation(s)
- Michael A Catto
- Department of Entomology, University of Georgia, Athens, GA, 30602, USA
| | - Paul E Labadie
- Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC, 27695, USA
| | - Alana L Jacobson
- Department of Entomology and Plant Pathology, Auburn University College of Agriculture, Auburn, AL, 36849, USA
| | - George G Kennedy
- Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC, 27695, USA
| | | | - Brendan G Hunt
- Department of Entomology, University of Georgia, Griffin, GA, 30223, USA.
| |
Collapse
|
23
|
Akagi T, Varkonyi-Gasic E, Shirasawa K, Catanach A, Henry IM, Mertten D, Datson P, Masuda K, Fujita N, Kuwada E, Ushijima K, Beppu K, Allan AC, Charlesworth D, Kataoka I. Recurrent neo-sex chromosome evolution in kiwifruit. NATURE PLANTS 2023; 9:393-402. [PMID: 36879018 DOI: 10.1038/s41477-023-01361-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 01/31/2023] [Indexed: 05/18/2023]
Abstract
Sex chromosome evolution is thought to be tightly associated with the acquisition and maintenance of sexual dimorphisms. Plant sex chromosomes have evolved independently in many lineages1,2 and can provide a powerful comparative framework to study this. We assembled and annotated genome sequences of three kiwifruit species (genus Actinidia) and uncovered recurrent sex chromosome turnovers in multiple lineages. Specifically, we observed structural evolution of the neo-Y chromosomes, which was driven via rapid bursts of transposable element insertions. Surprisingly, sexual dimorphisms were conserved in the different species studied, despite the fact that the partially sex-linked genes differ between them. Using gene editing in kiwifruit, we demonstrated that one of the two Y-chromosome-encoded sex-determining genes, Shy Girl, shows pleiotropic effects that can explain the conserved sexual dimorphisms. These plant sex chromosomes therefore maintain sexual dimorphisms through the conservation of a single gene, without a process involving interactions between separate sex-determining genes and genes for sexually dimorphic traits.
Collapse
Affiliation(s)
- Takashi Akagi
- Graduate School of Environmental and Life Science, Okayama University, Okayama, Japan.
- Japan Science and Technology Agency (JST), PRESTO, Kawaguchi-shi, Japan.
| | - Erika Varkonyi-Gasic
- The New Zealand Institute for Plant and Food Research Limited (PFR), Auckland, New Zealand
| | - Kenta Shirasawa
- Kazusa DNA Research Institute, Kazusa-Kamatari, Kisarazu, Japan
| | - Andrew Catanach
- The New Zealand Institute for Plant and Food Research Limited (PFR), Christchurch, New Zealand
| | - Isabelle M Henry
- Department of Plant Biology and Genome Center, University of California Davis, Davis, CA, USA
| | - Daniel Mertten
- The New Zealand Institute for Plant and Food Research Limited (PFR), Auckland, New Zealand
| | - Paul Datson
- The New Zealand Institute for Plant and Food Research Limited (PFR), Auckland, New Zealand
- The Kiwifruit Breeding Centre, Auckland, New Zealand
| | - Kanae Masuda
- Graduate School of Environmental and Life Science, Okayama University, Okayama, Japan
| | - Naoko Fujita
- Graduate School of Environmental and Life Science, Okayama University, Okayama, Japan
| | - Eriko Kuwada
- Graduate School of Environmental and Life Science, Okayama University, Okayama, Japan
| | - Koichiro Ushijima
- Graduate School of Environmental and Life Science, Okayama University, Okayama, Japan
| | - Kenji Beppu
- Faculty of Agriculture, Kagawa University, Miki, Kagawa, Japan
| | - Andrew C Allan
- The New Zealand Institute for Plant and Food Research Limited (PFR), Auckland, New Zealand
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
| | | | - Ikuo Kataoka
- Faculty of Agriculture, Kagawa University, Miki, Kagawa, Japan
| |
Collapse
|
24
|
Rodriguez F, Arkhipova IR. An Overview of Best Practices for Transposable Element Identification, Classification, and Annotation in Eukaryotic Genomes. Methods Mol Biol 2023; 2607:1-23. [PMID: 36449155 PMCID: PMC10149145 DOI: 10.1007/978-1-0716-2883-6_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]
Abstract
Transposable elements (TEs) exert an increasingly diverse spectrum of influences on eukaryotic genome structure, function, and evolution. A deluge of genomic, transcriptomic, and proteomic data provides the foundation for turning essentially any non-model eukaryotic species into an emerging model to study any and all aspects of organismal biology, ultimately shaping future directions for biomedical, environmental, and biodiversity research. However, identification and annotation of the mobile genome component still lags behind the standards accepted for host gene annotation. To achieve the objective of providing every genome project with a comprehensive description of its mobilome component in addition to the standard genic and transcriptomic datasets, each step of TE identification, classification, and annotation should be focused on improving TE boundary designation, reducing identification error rates, and providing accurate information on the type and integrity of TE insertions. Here, we offer practical advice for generating TE models in de novo assemblies for non-model organisms, provide step-by-step instructions to guide inexperienced TE annotators through some of the commonly utilized TE analysis pipelines, and entertain suggestions for tool improvement which could be implemented by interested developers.
Collapse
Affiliation(s)
- Fernando Rodriguez
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA, USA.
| | - Irina R Arkhipova
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA, USA.
| |
Collapse
|
25
|
Calero-Layana M, López-Cruz C, Ocaña A, Tejera E, Armijos-Jaramillo V. Evolutionary analysis of endogenous intronic retroviruses in primates reveals an enrichment in transcription binding sites associated with key regulatory processes. PeerJ 2022; 10:e14431. [PMID: 36575684 PMCID: PMC9790151 DOI: 10.7717/peerj.14431] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Accepted: 10/31/2022] [Indexed: 12/24/2022] Open
Abstract
Background Endogenous retroviruses (ERVs) are the result of the integration of retroviruses into host DNA following germline infection. Endogenous retroviruses are made up of three main genes: gag, pol, and env, each of which encodes viral proteins that can be conserved or not. ERVs have been observed in a wide range of vertebrate genomes and their functions are associated with viral silencing and gene regulation. Results In this work, we studied the evolutionary history of endogenous retroviruses associated with five human genes (INPP5B, DET1, PSMA1, USH2A, and MACROD2), which are located within intron sections. To verify the retroviral origin of the candidates, several approaches were used to detect and locate ERV elements. Both orthologous and paralogous genes were identified by Ensembl and then analyzed for ERV presence using RetroTector. A phylogenetic tree was reconstructed to identify the minimum time point of ERV acquisition. From that search, we detected ERVs throughout the primate lineage and in some other groups. Also, we identified the minimum origin of the ERVs from the parvorder Catarrhini to the Homininae subfamily. Conclusions With the data collected, and by observing the transcription factors annotated inside ERVs, we propose that these elements play a relevant role in gene expression regulation and they probably possess important features for tumorigenesis control.
Collapse
Affiliation(s)
- Melissa Calero-Layana
- Ingeniería en Biotecnología. Facultad de Ingeniería y Ciencias Aplicadas, Universidad de las Americas, Quito, Ecuador
| | - Carmen López-Cruz
- Ingeniería en Biotecnología. Facultad de Ingeniería y Ciencias Aplicadas, Universidad de las Americas, Quito, Ecuador
| | - Agustín Ocaña
- Ingeniería en Biotecnología. Facultad de Ingeniería y Ciencias Aplicadas, Universidad de las Americas, Quito, Ecuador
| | - Eduardo Tejera
- Ingeniería en Biotecnología. Facultad de Ingeniería y Ciencias Aplicadas, Universidad de las Americas, Quito, Ecuador,Grupo de Bio-Quimioinformática, Universidad de las Americas, Quito, Ecuador
| | - Vinicio Armijos-Jaramillo
- Ingeniería en Biotecnología. Facultad de Ingeniería y Ciencias Aplicadas, Universidad de las Americas, Quito, Ecuador,Grupo de Bio-Quimioinformática, Universidad de las Americas, Quito, Ecuador
| |
Collapse
|
26
|
Cuello C, Stander EA, Jansen HJ, Dugé De Bernonville T, Oudin A, Birer Williams C, Lanoue A, Giglioli Guivarc'h N, Papon N, Dirks RP, Jensen MK, O'Connor SE, Besseau S, Courdavault V. An updated version of the Madagascar periwinkle genome. F1000Res 2022; 11:1541. [PMID: 36761838 PMCID: PMC9902796 DOI: 10.12688/f1000research.129212.1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 12/13/2022] [Indexed: 12/24/2022] Open
Abstract
The Madagascar periwinkle, Catharanthus roseus, belongs to the Apocynaceae family. This medicinal plant, endemic to Madagascar, produces many important drugs including the monoterpene indole alkaloids (MIA) vincristine and vinblastine used to treat cancer worldwide. Here, we provide a new version of the C. roseus genome sequence obtained through the combination of Oxford Nanopore Technologies long-reads and Illumina short-reads. This more contiguous assembly consists of 173 scaffolds with a total length of 581.128 Mb and an N50 of 12.241 Mb. Using publicly available RNAseq data, 21,061 protein coding genes were predicted and functionally annotated. A total of 42.87% of the genome was annotated as transposable elements, most of them being long-terminal repeats. Together with the increasing access to MIA-producing plant genomes, this updated version should ease evolutionary studies leading to a better understanding of MIA biosynthetic pathway evolution.
Collapse
Affiliation(s)
- Clément Cuello
- EA2106 Biomolécules et Biotechnologies Végétales, Université de Tours, Tours, 37200, France
| | - Emily Amor Stander
- EA2106 Biomolécules et Biotechnologies Végétales, Université de Tours, Tours, 37200, France
| | - Hans J. Jansen
- Future Genomics Technologies, Leiden, 2333BE, The Netherlands
| | - Thomas Dugé De Bernonville
- EA2106 Biomolécules et Biotechnologies Végétales, Université de Tours, Tours, 37200, France,Present address: Centre de Recherche, Limagrain, Chappes, 07745, France
| | - Audrey Oudin
- EA2106 Biomolécules et Biotechnologies Végétales, Université de Tours, Tours, 37200, France
| | | | - Arnaud Lanoue
- EA2106 Biomolécules et Biotechnologies Végétales, Université de Tours, Tours, 37200, France
| | | | - Nicolas Papon
- IRF, SFR ICAT, Univ Angers, Univ Brest, Angers, 49000, France
| | - Ron P. Dirks
- Future Genomics Technologies, Leiden, 2333BE, The Netherlands
| | - Michael Krogh Jensen
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, 2800, Denmark
| | - Sarah Ellen O'Connor
- Department of Natural Product Biosynthesis, Max Planck Institute for Chemical Ecology, Jena, 07745, Germany
| | - Sébastien Besseau
- EA2106 Biomolécules et Biotechnologies Végétales, Université de Tours, Tours, 37200, France
| | - Vincent Courdavault
- EA2106 Biomolécules et Biotechnologies Végétales, Université de Tours, Tours, 37200, France,
| |
Collapse
|
27
|
Rabanal FA, Gräff M, Lanz C, Fritschi K, Llaca V, Lang M, Carbonell-Bejerano P, Henderson I, Weigel D. Pushing the limits of HiFi assemblies reveals centromere diversity between two Arabidopsis thaliana genomes. Nucleic Acids Res 2022; 50:12309-12327. [PMID: 36453992 PMCID: PMC9757041 DOI: 10.1093/nar/gkac1115] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 09/13/2022] [Accepted: 11/10/2022] [Indexed: 12/05/2022] Open
Abstract
Although long-read sequencing can often enable chromosome-level reconstruction of genomes, it is still unclear how one can routinely obtain gapless assemblies. In the model plant Arabidopsis thaliana, other than the reference accession Col-0, all other accessions de novo assembled with long-reads until now have used PacBio continuous long reads (CLR). Although these assemblies sometimes achieved chromosome-arm level contigs, they inevitably broke near the centromeres, excluding megabases of DNA from analysis in pan-genome projects. Since PacBio high-fidelity (HiFi) reads circumvent the high error rate of CLR technologies, albeit at the expense of read length, we compared a CLR assembly of accession Eyach15-2 to HiFi assemblies of the same sample. The use of five different assemblers starting from subsampled data allowed us to evaluate the impact of coverage and read length. We found that centromeres and rDNA clusters are responsible for 71% of contig breaks in the CLR scaffolds, while relatively short stretches of GA/TC repeats are at the core of >85% of the unfilled gaps in our best HiFi assemblies. Since the HiFi technology consistently enabled us to reconstruct gapless centromeres and 5S rDNA clusters, we demonstrate the value of the approach by comparing these previously inaccessible regions of the genome between the Eyach15-2 accession and the reference accession Col-0.
Collapse
Affiliation(s)
| | | | - Christa Lanz
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Katrin Fritschi
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Victor Llaca
- Genomics Technologies, Corteva Agriscience, Johnston, IA 50131, USA
| | - Michelle Lang
- Genomics Technologies, Corteva Agriscience, Johnston, IA 50131, USA
| | - Pablo Carbonell-Bejerano
- Department of Molecular Biology, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Ian Henderson
- Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, UK
| | - Detlef Weigel
- Correspondence may also be addressed to Detlef Weigel. Tel: +49 7071 601 1410;
| |
Collapse
|
28
|
Stander EA, Cuello C, Birer-Williams C, Kulagina N, Jansen HJ, Carqueijeiro I, Méteignier LV, Vergès V, Oudin A, Papon N, Dirks RP, Jensen MK, O’Connor SE, Dugé de Bernonville T, Besseau S, Courdavault V. The Vinca minor genome highlights conserved evolutionary traits in monoterpene indole alkaloid synthesis. G3 (BETHESDA, MD.) 2022; 12:jkac268. [PMID: 36200869 PMCID: PMC9713385 DOI: 10.1093/g3journal/jkac268] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 09/28/2022] [Indexed: 11/26/2023]
Abstract
Vinca minor, also known as the lesser periwinkle, is a well-known species from the Apocynaceae, native to central and southern Europe. This plant synthesizes monoterpene indole alkaloids, which are a class of specialized metabolites displaying a wide range of bioactive- and pharmacologically important properties. Within the almost 50 monoterpene indole alkaloids it produces, V. minor mainly accumulates vincamine, which is commercially used as a nootropic. Using a combination of Oxford Nanopore Technologies long read- and Illumina short-read sequencing, a 679,098 Mb V. minor genome was assembled into 296 scaffolds with an N50 scaffold length of 6 Mb, and encoding 29,624 genes. These genes were functionally annotated and used in a comparative genomic analysis to establish gene families and to investigate gene family expansion and contraction across the phylogenetic tree. Furthermore, homology-based monoterpene indole alkaloid gene predictions together with a metabolic analysis across 4 different V. minor tissue types guided the identification of candidate monoterpene indole alkaloid genes. These candidates were finally used to identify monoterpene indole alkaloid gene clusters, which combined with synteny analysis allowed for the discovery of a functionally validated vincadifformine-16-hydroxylase, reinforcing the potential of this dataset for monoterpene indole alkaloids gene discovery. It is expected that access to these resources will facilitate the elucidation of unknown monoterpene indole alkaloid biosynthetic routes with the potential of transferring these pathways to heterologous expression systems for large-scale monoterpene indole alkaloid production.
Collapse
Affiliation(s)
- Emily Amor Stander
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200 Tours, France
| | - Clément Cuello
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200 Tours, France
| | | | - Natalja Kulagina
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200 Tours, France
| | - Hans J Jansen
- Future Genomics Technologies, 2333 BE Leiden, The Netherlands
| | - Ines Carqueijeiro
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200 Tours, France
| | | | - Valentin Vergès
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200 Tours, France
| | - Audrey Oudin
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200 Tours, France
| | - Nicolas Papon
- Univ Angers, Univ Brest, IRF, SFR ICAT, F-49000 Angers, France
| | - Ron P Dirks
- Future Genomics Technologies, 2333 BE Leiden, The Netherlands
| | - Michael Krogh Jensen
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800 Kongens Lyngby, Denmark
| | - Sarah Ellen O’Connor
- Department of Natural Product Biosynthesis, Max Planck Institute for Chemical Ecology, Jena 07745, Germany
| | | | - Sébastien Besseau
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200 Tours, France
| | - Vincent Courdavault
- Biomolécules et Biotechnologies Végétales, EA2106, Université de Tours, 37200 Tours, France
| |
Collapse
|
29
|
Gourlie R, McDonald M, Hafez M, Ortega-Polo R, Low KE, Abbott DW, Strelkov SE, Daayf F, Aboukhaddour R. The pangenome of the wheat pathogen Pyrenophora tritici-repentis reveals novel transposons associated with necrotrophic effectors ToxA and ToxB. BMC Biol 2022; 20:239. [PMID: 36280878 PMCID: PMC9594970 DOI: 10.1186/s12915-022-01433-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Accepted: 10/04/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In fungal plant pathogens, genome rearrangements followed by selection pressure for adaptive traits have facilitated the co-evolutionary arms race between hosts and their pathogens. Pyrenophora tritici-repentis (Ptr) has emerged recently as a foliar pathogen of wheat worldwide and its populations consist of isolates that vary in their ability to produce combinations of different necrotrophic effectors. These effectors play vital roles in disease development. Here, we sequenced the genomes of a global collection (40 isolates) of Ptr to gain insights into its gene content and genome rearrangements. RESULTS A comparative genome analysis revealed an open pangenome, with an abundance of accessory genes (~ 57%) reflecting Ptr's adaptability. A clear distinction between pathogenic and non-pathogenic genomes was observed in size, gene content, and phylogenetic relatedness. Chromosomal rearrangements and structural organization, specifically around effector coding genes, were detailed using long-read assemblies (PacBio RS II) generated in this work in addition to previously assembled genomes. We also discovered the involvement of large mobile elements associated with Ptr's effectors: ToxA, the gene encoding for the necrosis effector, was found as a single copy within a 143-kb 'Starship' transposon (dubbed 'Horizon') with a clearly defined target site and target site duplications. 'Horizon' was located on different chromosomes in different isolates, indicating mobility, and the previously described ToxhAT transposon (responsible for horizontal transfer of ToxA) was nested within this newly identified Starship. Additionally, ToxB, the gene encoding the chlorosis effector, was clustered as three copies on a 294-kb element, which is likely a different putative 'Starship' (dubbed 'Icarus') in a ToxB-producing isolate. ToxB and its putative transposon were missing from the ToxB non-coding reference isolate, but the homolog toxb and 'Icarus' were both present in a different non-coding isolate. This suggests that ToxB may have been mobile at some point during the evolution of the Ptr genome which is contradictory to the current assumption of ToxB vertical inheritance. Finally, the genome architecture of Ptr was defined as 'one-compartment' based on calculated gene distances and evolutionary rates. CONCLUSIONS These findings together reflect on the highly plastic nature of the Ptr genome which has likely helped to drive its worldwide adaptation and has illuminated the involvement of giant transposons in facilitating the evolution of virulence in Ptr.
Collapse
Affiliation(s)
- Ryan Gourlie
- grid.55614.330000 0001 1302 4958Agriculture and Agri-Food Canada, Lethbridge, AB Canada
| | - Megan McDonald
- grid.6572.60000 0004 1936 7486School of Biosciences, University of Birmingham, Institute of Microbiology and Infection, Edgbaston, Birmingham, UK
| | - Mohamed Hafez
- grid.55614.330000 0001 1302 4958Agriculture and Agri-Food Canada, Lethbridge, AB Canada
| | - Rodrigo Ortega-Polo
- grid.55614.330000 0001 1302 4958Agriculture and Agri-Food Canada, Lethbridge, AB Canada
| | - Kristin E. Low
- grid.55614.330000 0001 1302 4958Agriculture and Agri-Food Canada, Lethbridge, AB Canada
| | - D. Wade Abbott
- grid.55614.330000 0001 1302 4958Agriculture and Agri-Food Canada, Lethbridge, AB Canada
| | - Stephen E. Strelkov
- grid.17089.370000 0001 2190 316XFaculty of Agricultural, Life, and Environmental Sciences, University of Alberta, Edmonton, AB Canada
| | - Fouad Daayf
- grid.21613.370000 0004 1936 9609Faculty of Agricultural and Food Sciences, University of Manitoba, Winnipeg, MB Canada
| | - Reem Aboukhaddour
- grid.55614.330000 0001 1302 4958Agriculture and Agri-Food Canada, Lethbridge, AB Canada
| |
Collapse
|
30
|
Shang L, Li X, He H, Yuan Q, Song Y, Wei Z, Lin H, Hu M, Zhao F, Zhang C, Li Y, Gao H, Wang T, Liu X, Zhang H, Zhang Y, Cao S, Yu X, Zhang B, Zhang Y, Tan Y, Qin M, Ai C, Yang Y, Zhang B, Hu Z, Wang H, Lv Y, Wang Y, Ma J, Wang Q, Lu H, Wu Z, Liu S, Sun Z, Zhang H, Guo L, Li Z, Zhou Y, Li J, Zhu Z, Xiong G, Ruan J, Qian Q. A super pan-genomic landscape of rice. Cell Res 2022; 32:878-896. [PMID: 35821092 PMCID: PMC9525306 DOI: 10.1038/s41422-022-00685-z] [Citation(s) in RCA: 89] [Impact Index Per Article: 44.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Accepted: 06/10/2022] [Indexed: 01/09/2023] Open
Abstract
Pan-genomes from large natural populations can capture genetic diversity and reveal genomic complexity. Using de novo long-read assembly, we generated a graph-based super pan-genome of rice consisting of a 251-accession panel comprising both cultivated and wild species of Asian and African rice. Our pan-genome reveals extensive structural variations (SVs) and gene presence/absence variations. Additionally, our pan-genome enables the accurate identification of nucleotide-binding leucine-rich repeat genes and characterization of their inter- and intraspecific diversity. Moreover, we uncovered grain weight-associated SVs which specify traits by affecting the expression of their nearby genes. We characterized genetic variants associated with submergence tolerance, seed shattering and plant architecture and found independent selection for a common set of genes that drove adaptation and domestication in Asian and African rice. This super pan-genome facilitates pinpointing of lineage-specific haplotypes for trait-associated genes and provides insights into the evolutionary events that have shaped the genomic architecture of various rice species.
Collapse
Affiliation(s)
- Lianguang Shang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China.
| | - Xiaoxia Li
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Huiying He
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Qiaoling Yuan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Yanni Song
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Zhaoran Wei
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Hai Lin
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Min Hu
- State Key Laboratory for Agrobiotechnology, National Center for Evaluation of Agricultural Wild Plants (Rice), Department of Plant Genetics and Breeding, China Agricultural University, Beijing, China
| | - Fengli Zhao
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Chao Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Yuhua Li
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Hongsheng Gao
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Tianyi Wang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Xiangpei Liu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Hong Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Ya Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Shuaimin Cao
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Xiaoman Yu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Bintao Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Yong Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Yiqing Tan
- Academy for Advanced Interdisciplinary Studies, Plant Phenomics Research Center, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - Mao Qin
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Cheng Ai
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Yingxue Yang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Bin Zhang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Zhiqiang Hu
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Hongru Wang
- Department of Integrative Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Yang Lv
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China.,State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, Zhejiang, China
| | - Yuexing Wang
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, Zhejiang, China
| | - Jie Ma
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, Zhejiang, China
| | - Quan Wang
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Hongwei Lu
- Key Laboratory of Molecular Design for Plant Cell Factory of Guangdong Higher Education Institutes, Institute of Plant and Food Science, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, Guangdong, China
| | - Zhe Wu
- Key Laboratory of Molecular Design for Plant Cell Factory of Guangdong Higher Education Institutes, Institute of Plant and Food Science, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, Guangdong, China
| | - Shanlin Liu
- Department of Entomology, College of Plant Protection, China Agricultural University, Beijing, China
| | | | - Hongliang Zhang
- State Key Laboratory of Agrobiotechnology/Beijing Key Laboratory of Crop Genetic Improvement, College of Agronomy and Biotechnology, China Agricultural University, Beijing, China
| | - Longbiao Guo
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, Zhejiang, China
| | - Zichao Li
- State Key Laboratory of Agrobiotechnology/Beijing Key Laboratory of Crop Genetic Improvement, College of Agronomy and Biotechnology, China Agricultural University, Beijing, China
| | - Yongfeng Zhou
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China
| | - Jiayang Li
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Zuofeng Zhu
- State Key Laboratory for Agrobiotechnology, National Center for Evaluation of Agricultural Wild Plants (Rice), Department of Plant Genetics and Breeding, China Agricultural University, Beijing, China.
| | - Guosheng Xiong
- Academy for Advanced Interdisciplinary Studies, Plant Phenomics Research Center, Nanjing Agricultural University, Nanjing, Jiangsu, China.
| | - Jue Ruan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China.
| | - Qian Qian
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, China. .,State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, Zhejiang, China.
| |
Collapse
|
31
|
Ouadi S, Sierro N, Goepfert S, Bovet L, Glauser G, Vallat A, Peitsch MC, Kessler F, Ivanov NV. The clove (Syzygium aromaticum) genome provides insights into the eugenol biosynthesis pathway. Commun Biol 2022; 5:684. [PMID: 35810198 PMCID: PMC9271057 DOI: 10.1038/s42003-022-03618-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Accepted: 06/22/2022] [Indexed: 11/09/2022] Open
Abstract
The clove (Syzygium aromaticum) is an important tropical spice crop in global trade. Evolving environmental pressures necessitate modern characterization and selection techniques that are currently inaccessible to clove growers owing to the scarcity of genomic and genetic information. Here, we present a 370-Mb high-quality chromosome-scale genome assembly for clove. Comparative genomic analysis between S. aromaticum and Eucalyptus grandis—both species of the Myrtaceae family—reveals good genome structure conservation and intrachromosomal rearrangements on seven of the eleven chromosomes. We report genes that belong to families involved in the biosynthesis of eugenol, the major bioactive component of clove products. On the basis of our transcriptomic and metabolomic findings, we propose a hypothetical scenario in which eugenol acetate plays a key role in high eugenol accumulation in clove leaves and buds. The clove genome is a new contribution to omics resources for the Myrtaceae family and an important tool for clove research. A newly assembled clove genome is compared with E. grandis to investigate genome evolution between the two genera of the Myrtaceae family, and putative genes involved in the biosynthesis of eugenol are identified through transcriptomics and metabolomics.
Collapse
Affiliation(s)
- Sonia Ouadi
- Faculty of Sciences, Laboratory of Plant Physiology, University of Neuchâtel, 2000, Neuchâtel, Switzerland.,PMI R&D, Philip Morris Products S. A, Quai Jeanrenaud 5, CH-2000, Neuchâtel, Switzerland
| | - Nicolas Sierro
- PMI R&D, Philip Morris Products S. A, Quai Jeanrenaud 5, CH-2000, Neuchâtel, Switzerland
| | - Simon Goepfert
- PMI R&D, Philip Morris Products S. A, Quai Jeanrenaud 5, CH-2000, Neuchâtel, Switzerland
| | - Lucien Bovet
- PMI R&D, Philip Morris Products S. A, Quai Jeanrenaud 5, CH-2000, Neuchâtel, Switzerland
| | - Gaetan Glauser
- Faculty of Sciences, Neuchâtel Platform of Analytical Chemistry, University of Neuchâtel, 2000, Neuchâtel, Switzerland
| | - Armelle Vallat
- Faculty of Sciences, Neuchâtel Platform of Analytical Chemistry, University of Neuchâtel, 2000, Neuchâtel, Switzerland
| | - Manuel C Peitsch
- PMI R&D, Philip Morris Products S. A, Quai Jeanrenaud 5, CH-2000, Neuchâtel, Switzerland
| | - Felix Kessler
- Faculty of Sciences, Laboratory of Plant Physiology, University of Neuchâtel, 2000, Neuchâtel, Switzerland
| | - Nikolai V Ivanov
- PMI R&D, Philip Morris Products S. A, Quai Jeanrenaud 5, CH-2000, Neuchâtel, Switzerland.
| |
Collapse
|
32
|
Balcão VM, Moreli FC, Silva EC, Belline BG, Martins LF, Rossi FPN, Pereira C, Vila MMDC, da Silva AM. Isolation and Molecular Characterization of a Novel Lytic Bacteriophage That Inactivates MDR Klebsiella pneumoniae Strains. Pharmaceutics 2022; 14:pharmaceutics14071421. [PMID: 35890314 PMCID: PMC9324672 DOI: 10.3390/pharmaceutics14071421] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 07/02/2022] [Accepted: 07/05/2022] [Indexed: 02/07/2023] Open
Abstract
The worldwide increase in serious infections caused by multidrug-resistant (MDR) K. pneumoniae emphasizes the urgent need of new therapeutic strategies for the control of this pathogen. There is growing interest in the use of bacteriophages (or phages) to treat K. pneumoniae infections, and newly isolated phages are needed. Here, we report the isolation and physical/biological/molecular characterization of a novel lytic phage and its efficacy in the control of MDR K. pneumoniae. The phage vB_KpnS_Uniso31, referred to hereafter as phage Kpn31, was isolated from hospital wastewater using K. pneumoniae CCCD-K001 as the host. Phage Kpn31 presents a siphovirus-like morphotype and was classified as Demerecviridae; Sugarlandvirus based on its complete genome sequence. The 113,444 bp Kpn31 genome does not encode known toxins or antimicrobial resistance genes, nor does it encode depolymerases related sequences. Phage Kpn31 showed an eclipse time of 15 min and a burst size of 9.12 PFU/host cell, allowing us to conclude it replicates well in K. pneumoniae CCCD-K001 with a latency period of 30 min. Phage Kpn31 was shown to be effective against at least six MDR K. pneumoniae clinical isolates in in vitro antibacterial activity assays. Based on its features, phage Kpn31 has potential for controlling infections caused by MDR K. pneumoniae.
Collapse
Affiliation(s)
- Victor M Balcão
- PhageLab, Laboratory of Biofilms and Bacteriophages, University of Sorocaba, Sorocaba 18023-000, Brazil
- Department of Biology and CESAM, Campus Universitário de Santiago, University of Aveiro, 3810-193 Aveiro, Portugal
| | - Fernanda C Moreli
- PhageLab, Laboratory of Biofilms and Bacteriophages, University of Sorocaba, Sorocaba 18023-000, Brazil
| | - Erica C Silva
- PhageLab, Laboratory of Biofilms and Bacteriophages, University of Sorocaba, Sorocaba 18023-000, Brazil
| | - Bianca G Belline
- PhageLab, Laboratory of Biofilms and Bacteriophages, University of Sorocaba, Sorocaba 18023-000, Brazil
| | - Layla F Martins
- Department of Biochemistry, Institute of Chemistry, University of São Paulo, São Paulo 05508-000, Brazil
| | - Fernando P N Rossi
- Department of Biochemistry, Institute of Chemistry, University of São Paulo, São Paulo 05508-000, Brazil
| | - Carla Pereira
- Department of Biology and CESAM, Campus Universitário de Santiago, University of Aveiro, 3810-193 Aveiro, Portugal
| | - Marta M D C Vila
- PhageLab, Laboratory of Biofilms and Bacteriophages, University of Sorocaba, Sorocaba 18023-000, Brazil
| | - Aline M da Silva
- Department of Biochemistry, Institute of Chemistry, University of São Paulo, São Paulo 05508-000, Brazil
| |
Collapse
|
33
|
Wang Z, Rouard M, Biswas MK, Droc G, Cui D, Roux N, Baurens FC, Ge XJ, Schwarzacher T, Heslop-Harrison P(JS, Liu Q. A chromosome-level reference genome of Ensete glaucum gives insight into diversity and chromosomal and repetitive sequence evolution in the Musaceae. Gigascience 2022; 11:6576245. [PMID: 35488861 PMCID: PMC9055855 DOI: 10.1093/gigascience/giac027] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 01/26/2022] [Accepted: 02/22/2022] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Ensete glaucum (2n = 2x = 18) is a giant herbaceous monocotyledonous plant in the small Musaceae family along with banana (Musa). A high-quality reference genome sequence assembly of E. glaucum is a resource for functional and evolutionary studies of Ensete, Musaceae, and the Zingiberales. FINDINGS Using Oxford Nanopore Technologies, chromosome conformation capture (Hi-C), Illumina and RNA survey sequence, supported by molecular cytogenetics, we report a high-quality 481.5 Mb genome assembly with 9 pseudo-chromosomes and 36,836 genes. A total of 55% of the genome is composed of repetitive sequences with predominantly LTR-retroelements (37%) and DNA transposons (7%). The single 5S ribosomal DNA locus had an exceptionally long monomer length of 1,056 bp, more than twice that of the monomers at multiple loci in Musa. A tandemly repeated satellite (1.1% of the genome, with no similar sequence in Musa) was present around all centromeres, together with a few copies of a long interspersed nuclear element (LINE) retroelement. The assembly enabled us to characterize in detail the chromosomal rearrangements occurring between E. glaucum and the x = 11 species of Musa. One E. glaucum chromosome has the same gene content as Musa acuminata, while others show multiple, complex, but clearly defined evolutionary rearrangements in the change between x= 9 and 11. CONCLUSIONS The advance towards a Musaceae pangenome including E. glaucum, tolerant of extreme environments, makes a complete set of gene alleles, copy number variation, and a reference for structural variation available for crop breeding and understanding environmental responses. The chromosome-scale genome assembly shows the nature of chromosomal fusion and translocation events during speciation, and features of rapid repetitive DNA change in terms of copy number, sequence, and genomic location, critical to understanding its role in diversity and evolution.
Collapse
Affiliation(s)
- Ziwei Wang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China,Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou 510650, China,College of Life Sciences, University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Mathieu Rouard
- Bioversity International, Parc Scientifique Agropolis II, 34397 Montpellier Cedex 5, France,French Institute of Bioinformatics (IFB) - South Green Bioinformatics Platform, Alliance Bioversity and CIAT, CIRAD, INRAE, IRD, F-34398 Montpellier, France
| | - Manosh Kumar Biswas
- Department of Genetics and Genome Biology, University of Leicester, Leicester LE1 7RH, UK
| | - Gaetan Droc
- French Institute of Bioinformatics (IFB) - South Green Bioinformatics Platform, Alliance Bioversity and CIAT, CIRAD, INRAE, IRD, F-34398 Montpellier, France,CIRAD, UMR AGAP Institut, F-34398 Montpellier, France,UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| | - Dongli Cui
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China,Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou 510650, China,College of Life Sciences, University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Nicolas Roux
- Bioversity International, Parc Scientifique Agropolis II, 34397 Montpellier Cedex 5, France
| | - Franc-Christophe Baurens
- CIRAD, UMR AGAP Institut, F-34398 Montpellier, France,UMR AGAP Institut, Univ Montpellier, CIRAD, INRAE, Institut Agro, F-34398 Montpellier, France
| | - Xue-Jun Ge
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China,Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou 510650, China
| | - Trude Schwarzacher
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization/Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China,Department of Genetics and Genome Biology, University of Leicester, Leicester LE1 7RH, UK
| | - Pat (J S) Heslop-Harrison
- Correspondence address. Qing Liu. Key Laboratory of Plant Resources Conservation and Sustainable Utilization / Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China. Pat Heslop-Harrison. Department of Genetics and Genome Biology, University of Leicester, Leicester, LE 7RH, UK Qing Liu. Key Laboratory of Plant Resources Conservation and Sustainable Utilization / Guangdong Provincial Key Laboratory of Applied Botany, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China E-mail:
| | - Qing Liu
- Correspondence address. Pat Heslop-Harrison. Department of Genetics and Genome Biology, University of Leicester, Leicester, LE 7RH, UK. E-mail:
| |
Collapse
|
34
|
Storer JM, Hubley R, Rosen J, Smit AFA. Methodologies for the De novo Discovery of Transposable Element Families. Genes (Basel) 2022; 13:709. [PMID: 35456515 PMCID: PMC9025800 DOI: 10.3390/genes13040709] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 04/14/2022] [Accepted: 04/15/2022] [Indexed: 02/07/2023] Open
Abstract
The discovery and characterization of transposable element (TE) families are crucial tasks in the process of genome annotation. Careful curation of TE libraries for each organism is necessary as each has been exposed to a unique and often complex set of TE families. De novo methods have been developed; however, a fully automated and accurate approach to the development of complete libraries remains elusive. In this review, we cover established methods and recent developments in de novo TE analysis. We also present various methodologies used to assess these tools and discuss opportunities for further advancement of the field.
Collapse
Affiliation(s)
| | | | | | - Arian F. A. Smit
- Institute for Systems Biology, Seattle, WA 98109, USA; (J.M.S.); (R.H.); (J.R.)
| |
Collapse
|
35
|
Lemay MA, Sibbesen JA, Torkamaneh D, Hamel J, Levesque RC, Belzile F. Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology. BMC Biol 2022; 20:53. [PMID: 35197050 PMCID: PMC8867729 DOI: 10.1186/s12915-022-01255-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 02/16/2022] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND Structural variants (SVs), including deletions, insertions, duplications, and inversions, are relatively long genomic variations implicated in a diverse range of processes from human disease to ecology and evolution. Given their complex signatures, tendency to occur in repeated regions, and large size, discovering SVs based on short reads is challenging compared to single-nucleotide variants. The increasing availability of long-read technologies has greatly facilitated SV discovery; however, these technologies remain too costly to apply routinely to population-level studies. Here, we combined short-read and long-read sequencing technologies to provide a comprehensive population-scale assessment of structural variation in a panel of Canadian soybean cultivars. RESULTS We used Oxford Nanopore long-read sequencing data (~12× mean coverage) for 17 samples to both benchmark SV calls made from Illumina short-read data and predict SVs that were subsequently genotyped in a population of 102 samples using Illumina data. Benchmarking results show that variants discovered using Oxford Nanopore can be accurately genotyped from the Illumina data. We first use the genotyped deletions and insertions for population genetics analyses and show that results are comparable to those based on single-nucleotide variants. We observe that the population frequency and distribution within the genome of deletions and insertions are constrained by the location of genes. Gene Ontology and PFAM domain enrichment analyses also confirm previous reports that genes harboring high-frequency deletions and insertions are enriched for functions in defense response. Finally, we discover polymorphic transposable elements from the deletions and insertions and report evidence of the recent activity of a Stowaway MITE. CONCLUSIONS We show that structural variants discovered using Oxford Nanopore data can be genotyped with high accuracy from Illumina data. Our results demonstrate that long-read and short-read sequencing technologies can be efficiently combined to enhance SV analysis in large populations, providing a reusable framework for their study in a wider range of samples and non-model species.
Collapse
Affiliation(s)
- Marc-André Lemay
- Département de phytologie, Université Laval, Quebec, Canada
- Institut de biologie intégrative et des systèmes, Université Laval, Quebec, Canada
| | | | - Davoud Torkamaneh
- Département de phytologie, Université Laval, Quebec, Canada
- Institut de biologie intégrative et des systèmes, Université Laval, Quebec, Canada
| | - Jérémie Hamel
- Institut de biologie intégrative et des systèmes, Université Laval, Quebec, Canada
- Département de microbiologie-infectiologie et d’immunologie, Université Laval, Quebec, Canada
| | - Roger C. Levesque
- Institut de biologie intégrative et des systèmes, Université Laval, Quebec, Canada
- Département de microbiologie-infectiologie et d’immunologie, Université Laval, Quebec, Canada
| | - François Belzile
- Département de phytologie, Université Laval, Quebec, Canada
- Institut de biologie intégrative et des systèmes, Université Laval, Quebec, Canada
| |
Collapse
|
36
|
Finding and Characterizing Repeats in Plant Genomes. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2443:327-385. [PMID: 35037215 DOI: 10.1007/978-1-0716-2067-0_18] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Plant genomes contain a particularly high proportion of repeated structures of various types. This chapter proposes a guided tour of the available software that can help biologists to scan automatically for these repeats in sequence data or check hypothetical models intended to characterize their structures. Since transposable elements (TEs) are a major source of repeats in plants, many methods have been used or developed for this broad class of sequences. They are representative of the range of tools available for other classes of repeats and we have provided two sections on this topic (for the analysis of genomes or directly of sequenced reads), as well as a selection of the main existing software. It may be hard to keep up with the profusion of proposals in this dynamic field and the rest of the chapter is devoted to the foundations of an efficient search for repeats and more complex patterns. We first introduce the key concepts of the art of indexing and mapping or querying sequences. We end the chapter with the more prospective issue of building models of repeat families. We present the Machine Learning approach first, seeking to build predictors automatically for some families of ET, from a set of sequences known to belong to this family. A second approach, the linguistic (or syntactic) approach, allows biologists to describe themselves and check the validity of models of their favorite repeat family.
Collapse
|
37
|
Termignoni-Garcia F, Kirchman JJ, Clark J, Edwards SV. Comparative Population Genomics of Cryptic Speciation and Adaptive Divergence in Bicknell's and Gray-Cheeked Thrushes (Aves: Catharus bicknelli and Catharus minimus). Genome Biol Evol 2022; 14:evab255. [PMID: 34999784 PMCID: PMC8743040 DOI: 10.1093/gbe/evab255] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/10/2021] [Indexed: 02/07/2023] Open
Abstract
Cryptic speciation may occur when reproductive isolation is recent or the accumulation of morphological differences between sister lineages is slowed by stabilizing selection preventing phenotypic differentiation. In North America, Bicknell's Thrush (Catharus bicknelli) and its sister species, the Gray-cheeked Thrush (Catharus minimus), are parapatrically breeding migratory songbirds, distinguishable in nature only by subtle differences in song and coloration, and were recognized as distinct species only in the 1990s. Previous molecular studies have estimated that the species diverged approximately 120,000-420,000 YBP and found very low levels of introgression despite their similarity and sympatry in the spring (prebreeding) migration. To further clarify the history, genetic divergence, genomic structure, and adaptive processes in C. bicknelli and C. minimus, we sequenced and assembled high-coverage reference genomes of both species and resequenced genomes from population samples of C. bicknelli, C. minimus, and two individuals of the Swainson's Thrush (Catharus ustulatus). The genome of C. bicknelli exhibits markedly higher abundances of transposable elements compared with other Catharus and chicken. Demographic and admixture analyses confirm moderate genome-wide differentiation (Fst ≈ 0.10) and limited gene flow between C. bicknelli and C. minimus, but suggest a more recent divergence than estimates based on mtDNA. We find evidence of rapid evolution of the Z-chromosome and elevated divergence consistent with natural selection on genomic regions near genes involved with neuronal processes in C. bicknelli. These genomes are a useful resource for future investigations of speciation, migration, and adaptation in Catharus thrushes.
Collapse
Affiliation(s)
- Flavia Termignoni-Garcia
- Department of Organismic and Evolutionary Biology and Museum of Comparative Zoology, Harvard University, Cambridge, Massachusetts, USA
| | | | - Johnathan Clark
- Department of Organismic and Evolutionary Biology and Museum of Comparative Zoology, Harvard University, Cambridge, Massachusetts, USA
| | - Scott V Edwards
- Department of Organismic and Evolutionary Biology and Museum of Comparative Zoology, Harvard University, Cambridge, Massachusetts, USA
| |
Collapse
|
38
|
Yow AG, Bostan H, Castanera R, Ruggieri V, Mengist MF, Curaba J, Young R, Gillitt N, Iorizzo M. Improved High-Quality Genome Assembly and Annotation of Pineapple (Ananas comosus) Cultivar MD2 Revealed Extensive Haplotype Diversity and Diversified FRS/FRF Gene Family. Genes (Basel) 2021; 13:genes13010052. [PMID: 35052394 PMCID: PMC8774480 DOI: 10.3390/genes13010052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Revised: 12/09/2021] [Accepted: 12/21/2021] [Indexed: 11/18/2022] Open
Abstract
Pineapple (Ananas comosus (L.) Merr.) is the second most important tropical fruit crop globally, and ‘MD2’ is the most important cultivated variety. A high-quality genome is important for molecular-based breeding, but available pineapple genomes still have some quality limitations. Here, PacBio and Hi-C data were used to develop a new high-quality MD2 assembly and gene prediction. Compared to the previous MD2 assembly, major improvements included a 26.6-fold increase in contig N50 length, phased chromosomes, and >6000 new genes. The new MD2 assembly also included 161.6 Mb additional sequences and >3000 extra genes compared to the F153 genome. Over 48% of the predicted genes harbored potential deleterious mutations, indicating that the high level of heterozygosity in this species contributes to maintaining functional alleles. The genome was used to characterize the FAR1-RELATED SEQUENCE (FRS) genes that were expanded in pineapple and rice. Transposed and dispersed duplications contributed to expanding the numbers of these genes in the pineapple lineage. Several AcFRS genes were differentially expressed among tissue-types and stages of flower development, suggesting that their expansion contributed to evolving specialized functions in reproductive tissues. The new MD2 assembly will serve as a new reference for genetic and genomic studies in pineapple.
Collapse
Affiliation(s)
- Ashley G. Yow
- Department of Horticultural Science, North Carolina State University, Raleigh, NC 27695, USA;
- Plants for Human Health Institute, North Carolina State University, Kannapolis, NC 28081, USA; (H.B.); (M.F.M.); (J.C.)
| | - Hamed Bostan
- Plants for Human Health Institute, North Carolina State University, Kannapolis, NC 28081, USA; (H.B.); (M.F.M.); (J.C.)
| | - Raúl Castanera
- Centre for Research in Agricultural Genomics CSIC-IRTA-UAB-UB, Campus UAB, 08193 Barcelona, Spain;
| | | | - Molla F. Mengist
- Plants for Human Health Institute, North Carolina State University, Kannapolis, NC 28081, USA; (H.B.); (M.F.M.); (J.C.)
| | - Julien Curaba
- Plants for Human Health Institute, North Carolina State University, Kannapolis, NC 28081, USA; (H.B.); (M.F.M.); (J.C.)
| | - Roberto Young
- Research Department of Dole, Standard Fruit de Honduras, Zona Mazapan, La Ceiba 31101, Honduras;
| | - Nicholas Gillitt
- Core Genomics Lab, David H. Murdock Research Institute, Kannapolis, NC 28081, USA;
| | - Massimo Iorizzo
- Department of Horticultural Science, North Carolina State University, Raleigh, NC 27695, USA;
- Plants for Human Health Institute, North Carolina State University, Kannapolis, NC 28081, USA; (H.B.); (M.F.M.); (J.C.)
- Correspondence:
| |
Collapse
|
39
|
Mokhtar MM, Alsamman AM, Abd-Elhalim HM, El Allali A. CicerSpTEdb: A web-based database for high-resolution genome-wide identification of transposable elements in Cicer species. PLoS One 2021; 16:e0259540. [PMID: 34762703 PMCID: PMC8584679 DOI: 10.1371/journal.pone.0259540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Accepted: 10/20/2021] [Indexed: 11/19/2022] Open
Abstract
Recently, Cicer species have experienced increased research interest due to their economic importance, especially in genetics, genomics, and crop improvement. The Cicer arietinum, Cicer reticulatum, and Cicer echinospermum genomes have been sequenced and provide valuable resources for trait improvement. Since the publication of the chickpea draft genome, progress has been made in genome assembly, functional annotation, and identification of polymorphic markers. However, work is still needed to identify transposable elements (TEs) and make them available for researchers. In this paper, we present CicerSpTEdb, a comprehensive TE database for Cicer species that aims to improve our understanding of the organization and structural variations of the chickpea genome. Using structure and homology-based methods, 3942 C. echinospermum, 3579 C. reticulatum, and 2240 C. arietinum TEs were identified. Comparisons between Cicer species indicate that C. echinospermum has the highest number of LTR-RT and hAT TEs. C. reticulatum has more Mutator, PIF Harbinger, Tc1 Mariner, and CACTA TEs, while C. arietinum has the highest number of Helitron. CicerSpTEdb enables users to search and visualize TEs by location and download their results. The database will provide a powerful resource that can assist in developing TE target markers for molecular breeding and answer related biological questions. Database URL: http://cicersptedb.easyomics.org/index.php.
Collapse
Affiliation(s)
- Morad M. Mokhtar
- African Genome Center, Mohammed VI Polytechnic University, Ben Guerir, Morocco
- * E-mail: (AEA); (MMM)
| | | | - Haytham M. Abd-Elhalim
- Agricultural Genetic Engineering Research Institute, Agricultural Research Center, Giza, Egypt
| | - Achraf El Allali
- African Genome Center, Mohammed VI Polytechnic University, Ben Guerir, Morocco
- * E-mail: (AEA); (MMM)
| |
Collapse
|
40
|
Wittmeyer KT, Oppenheim SJ, Hopper KR. Assemblies of the genomes of parasitic wasps using meta-assembly and scaffolding with genetic linkage. G3 (BETHESDA, MD.) 2021; 12:6423991. [PMID: 34751385 PMCID: PMC8727961 DOI: 10.1093/g3journal/jkab386] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Accepted: 10/25/2021] [Indexed: 01/09/2023]
Abstract
Safe, effective biological-control introductions against invasive pests depend on narrowly host-specific natural enemies with the ability to adapt to a changing environment. As part of a project on the genetic architectures of these traits, we assembled and annotated the genomes of two aphid parasitoids, Aphelinus atriplicis and Aphelinus certus. We report here several assemblies of A. atriplicis made with Illumina and PacBio data, which we combined into a meta-assembly. We scaffolded the meta-assembly with markers from a genetic map of hybrids between A. atriplicis and A. certus. We used this genetic-linkage scaffolded (GLS) assembly of A. atriplicis to scaffold a de novo assembly of A. certus. The de novo assemblies of A. atriplicis differed in contiguity, and the meta-assembly of these assemblies was more contiguous than the best de novo assembly. Scaffolding with genetic-linkage data allowed chromosomal-level assembly of the A. atriplicis genome and scaffolding a de novo assembly of A. certus with this GLS assembly, greatly increased the contiguity of the A. certus assembly to the point where it was also at the chromosomal-level. However, completeness of the A. atriplicis assembly, as measured by percent complete, single-copy BUSCO hymenopteran genes, varied little among de novo assemblies and was not increased by meta-assembly or genetic scaffolding. Furthermore, the greater contiguity of the meta-assembly and GLS assembly had little or no effect on the numbers of genes identified, the proportions with homologs or functional annotations. Increased contiguity of the A. certus assembly provided modest improvement in assembly completeness, as measured by percent complete, single-copy BUSCO hymenopteran genes. The total genic sequence increased, and while the number of genes declined, gene length increased, which together suggest greater accuracy of gene models. More contiguous assemblies provide uses other than gene annotation, for example, identifying the genes associated with quantitative trait loci and understanding of chromosomal rearrangements associated with speciation.
Collapse
Affiliation(s)
- Kameron T Wittmeyer
- USDA-ARS, Beneficial Insect Introductions Research Unit, Newark, DE 19713, USA
| | | | - Keith R Hopper
- USDA-ARS, Beneficial Insect Introductions Research Unit, Newark, DE 19713, USA,Corresponding author: USDA-ARS, Beneficial Insect Introductions Research Unit, 501 South Chapel Street, Newark, DE 19713, USA.
| |
Collapse
|
41
|
Hill AM, Rybarski JR, Hu K, Finkelstein IJ, Wilke CO. Opfi: A Python package for identifying gene clusters in large genomics and metagenomics data sets. JOURNAL OF OPEN SOURCE SOFTWARE 2021; 6:3678. [PMID: 35445164 PMCID: PMC9017871 DOI: 10.21105/joss.03678] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Gene clusters are sets of co-localized, often contiguous genes that together perform specific functions, many of which are relevant to biotechnology. There is a need for software tools that can extract candidate gene clusters from vast amounts of available genomic data. Therefore, we developed Opfi: a modular pipeline for identification of arbitrary gene clusters in assembled genomic or metagenomic sequences. Opfi contains functions for annotation, de-deduplication, and visualization of putative gene clusters. It utilizes a customizable rule-based filtering approach for selection of candidate systems that adhere to user-defined criteria. Opfi is implemented in Python, and is available on the Python Package Index and on Bioconda (Grüning et al., 2018).
Collapse
Affiliation(s)
- Alexis M Hill
- Department of Integrative Biology, The University of Texas at Austin, Austin, Texas 78712, USA
| | - James R Rybarski
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, Texas 78712, USA
| | - Kuang Hu
- Department of Integrative Biology, The University of Texas at Austin, Austin, Texas 78712, USA
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, Texas 78712, USA
| | - Ilya J Finkelstein
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, Texas 78712, USA
- Center for Systems and Synthetic Biology, The University of Texas at Austin, Austin, Texas, 78712, USA
| | - Claus O Wilke
- Department of Integrative Biology, The University of Texas at Austin, Austin, Texas 78712, USA
| |
Collapse
|
42
|
Liao X, Li M, Hu K, Wu FX, Gao X, Wang J. A sensitive repeat identification framework based on short and long reads. Nucleic Acids Res 2021; 49:e100. [PMID: 34214175 PMCID: PMC8464074 DOI: 10.1093/nar/gkab563] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2020] [Revised: 06/08/2021] [Accepted: 06/18/2021] [Indexed: 12/11/2022] Open
Abstract
Numerous studies have shown that repetitive regions in genomes play indispensable roles in the evolution, inheritance and variation of living organisms. However, most existing methods cannot achieve satisfactory performance on identifying repeats in terms of both accuracy and size, since NGS reads are too short to identify long repeats whereas SMS (Single Molecule Sequencing) long reads are with high error rates. In this study, we present a novel identification framework, LongRepMarker, based on the global de novo assembly and k-mer based multiple sequence alignment for precisely marking long repeats in genomes. The major characteristics of LongRepMarker are as follows: (i) by introducing barcode linked reads and SMS long reads to assist the assembly of all short paired-end reads, it can identify the repeats to a greater extent; (ii) by finding the overlap sequences between assemblies or chomosomes, it locates the repeats faster and more accurately; (iii) by using the multi-alignment unique k-mers rather than the high frequency k-mers to identify repeats in overlap sequences, it can obtain the repeats more comprehensively and stably; (iv) by applying the parallel alignment model based on the multi-alignment unique k-mers, the efficiency of data processing can be greatly optimized and (v) by taking the corresponding identification strategies, structural variations that occur between repeats can be identified. Comprehensive experimental results show that LongRepMarker can achieve more satisfactory results than the existing de novo detection methods (https://github.com/BioinformaticsCSU/LongRepMarker).
Collapse
Affiliation(s)
- Xingyu Liao
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, P.R. China
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
| | - Min Li
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, P.R. China
| | - Kang Hu
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, P.R. China
| | - Fang-Xiang Wu
- Department of Mechanical Engineering and Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, SK S7N5A9, Canada
| | - Xin Gao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
| | - Jianxin Wang
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha 410083, P.R. China
| |
Collapse
|
43
|
Qin P, Lu H, Du H, Wang H, Chen W, Chen Z, He Q, Ou S, Zhang H, Li X, Li X, Li Y, Liao Y, Gao Q, Tu B, Yuan H, Ma B, Wang Y, Qian Y, Fan S, Li W, Wang J, He M, Yin J, Li T, Jiang N, Chen X, Liang C, Li S. Pan-genome analysis of 33 genetically diverse rice accessions reveals hidden genomic variations. Cell 2021; 184:3542-3558.e16. [PMID: 34051138 DOI: 10.1016/j.cell.2021.04.046] [Citation(s) in RCA: 199] [Impact Index Per Article: 66.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Revised: 01/31/2021] [Accepted: 04/24/2021] [Indexed: 12/30/2022]
Abstract
Structural variations (SVs) and gene copy number variations (gCNVs) have contributed to crop evolution, domestication, and improvement. Here, we assembled 31 high-quality genomes of genetically diverse rice accessions. Coupling with two existing assemblies, we developed pan-genome-scale genomic resources including a graph-based genome, providing access to rice genomic variations. Specifically, we discovered 171,072 SVs and 25,549 gCNVs and used an Oryza glaberrima assembly to infer the derived states of SVs in the Oryza sativa population. Our analyses of SV formation mechanisms, impacts on gene expression, and distributions among subpopulations illustrate the utility of these resources for understanding how SVs and gCNVs shaped rice environmental adaptation and domestication. Our graph-based genome enabled genome-wide association study (GWAS)-based identification of phenotype-associated genetic variations undetectable when using only SNPs and a single reference assembly. Our work provides rich population-scale resources paired with easy-to-access tools to facilitate rice breeding as well as plant functional genomics and evolutionary biology research.
Collapse
Affiliation(s)
- Peng Qin
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China.
| | - Hongwei Lu
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Huilong Du
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; School of Life Sciences, Institute of Life Sciences and Green Development, Hebei University, Baoding, China
| | - Hao Wang
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Weilan Chen
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Zhuo Chen
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Qiang He
- School of Life Sciences, Institute of Life Sciences and Green Development, Hebei University, Baoding, China
| | - Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, USA
| | - Hongyu Zhang
- School of Life Sciences, Institute of Life Sciences and Green Development, Hebei University, Baoding, China
| | - Xuanzhao Li
- School of Life Sciences, Institute of Life Sciences and Green Development, Hebei University, Baoding, China
| | - Xiuxiu Li
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Yan Li
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Yi Liao
- Department of Ecology and Evolutionary Biology, University of California, Irvine, Irvine, CA, USA
| | - Qiang Gao
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China
| | - Bin Tu
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Hua Yuan
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Bingtian Ma
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Yuping Wang
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Yangwen Qian
- Biogle Genome Editing Center, Changzhou, Jiangsu, China
| | - Shijun Fan
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Weitao Li
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Jing Wang
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Min He
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Junjie Yin
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Ting Li
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI, USA
| | - Xuewei Chen
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Chengzhi Liang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China.
| | - Shigui Li
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Rice Research Institute, Sichuan Agricultural University, Chengdu, Sichuan, China.
| |
Collapse
|
44
|
Tiedeman Z, Signor S. The transposable elements of the Drosophila serrata reference panel. Genome Biol Evol 2021; 13:6265467. [PMID: 33950180 PMCID: PMC8434751 DOI: 10.1093/gbe/evab100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/30/2021] [Indexed: 11/13/2022] Open
Abstract
Transposable elements (TEs) are an important component of the complex genomic ecosystem. Understanding the tempo and mode of TE proliferation, that is whether it is in maintained in transposition selection balance, or is induced periodically by environmental stress or other factors, is important for understanding the evolution of organismal genomes through time. Although TEs have been characterized in individuals or limited samples, a true understanding of the population genetics of TEs, and therefore the tempo and mode of transposition, is still lacking. Here, we characterize the TE landscape in an important model Drosophila, Drosophila serrata using the D. serrata reference panel, which is comprised of 102 sequenced inbred genotypes. We annotate the families of TEs in the D. serrata genome and investigate variation in TE copy number between genotypes. We find that many TEs have low copy number in the population, but this varies by family and includes a single TE making up to 50% of the genome content of TEs. We find that some TEs proliferate in particular genotypes compared with population levels. In addition, we characterize variation in each TE family allowing copy number to vary in each genotype and find that some TEs have diversified very little between individuals suggesting recent spread. TEs are important sources of spontaneous mutations in Drosophila, making up a large fraction of the total number of mutations in particular genotypes. Understanding the dynamics of TEs within populations will be an important step toward characterizing the origin of variation within and between species.
Collapse
Affiliation(s)
- Zachery Tiedeman
- Department of Biological Sciences, North Dakota State University, Fargo, North Dakota, U.S.A
| | - Sarah Signor
- Department of Biological Sciences, North Dakota State University, Fargo, North Dakota, U.S.A
| |
Collapse
|
45
|
Abstract
Transposable elements (TEs) are important contributors to genome structure and evolution. With the growth of sequencing technologies, various computational pipelines and software programs have been developed to facilitate TE identification and annotation. These computational tools can be categorized into three types based on their underlying approach: homology-based, structural-based, and de novo methods. Each of these tools has advantages and disadvantages. In this chapter, we introduce EDTA (Extensive de novo TE Annotator), a new comprehensive pipeline composed of high-quality tools to identify and annotate all types of TEs. The development of EDTA is based on the benchmarking results of a collection of TE annotation methods. The selected programs are evaluated by their ability to identify true TEs as well as to exclude false candidates. Here, we present an overview of the EDTA pipeline and a detailed manual for its use. The source code of EDTA is available at https://github.com/oushujun/EDTA .
Collapse
Affiliation(s)
- Weijia Su
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, USA
| | - Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, USA
| | - Matthew B Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, USA
| | - Thomas Peterson
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, USA.
- Department of Agronomy, Iowa State University, Ames, IA, USA.
| |
Collapse
|
46
|
Feng C, Dai M, Liu Y, Chen M. Sequence repetitiveness quantification and de novo repeat detection by weighted k-mer coverage. Brief Bioinform 2020; 22:5855256. [PMID: 32591772 DOI: 10.1093/bib/bbaa086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Revised: 04/10/2020] [Accepted: 04/22/2020] [Indexed: 11/12/2022] Open
Abstract
DNA repeats are abundant in eukaryotic genomes and have been proved to play a vital role in genome evolution and regulation. A large number of approaches have been proposed to identify various repeats in the genome. Some de novo repeat identification tools can efficiently generate sequence repetitive scores based on k-mer counting for repeat detection. However, we noticed that these tools can still be improved in terms of repetitive score calculation, sensitivity to segmental duplications and detection specificity. Therefore, here, we present a new computational approach named Repeat Locator (RepLoc), which is based on weighted k-mer coverage to quantify the genome sequence repetitiveness and locate the repetitive sequences. According to the repetitiveness map of the human genome generated by RepLoc, we found that there may be relationships between sequence repetitiveness and genome structures. A comprehensive benchmark shows that RepLoc is a more efficient k-mer counting based tool for de novo repeat detection. The RepLoc software is freely available at http://bis.zju.edu.cn/reploc.
Collapse
Affiliation(s)
- Cong Feng
- Ming Chen's laboratory in Zhejiang University
| | - Min Dai
- Key Laboratory of Genetic Network Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences
| | | | - Ming Chen
- Department of Bioinformatics, College of Life Sciences, Zhejiang University
| |
Collapse
|
47
|
Ou S, Su W, Liao Y, Chougule K, Agda JRA, Hellinga AJ, Lugo CSB, Elliott TA, Ware D, Peterson T, Jiang N, Hirsch CN, Hufford MB. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol 2019. [PMID: 31843001 DOI: 10.1101/657890v1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023] Open
Abstract
BACKGROUND Sequencing technology and assembly algorithms have matured to the point that high-quality de novo assembly is possible for large, repetitive genomes. Current assemblies traverse transposable elements (TEs) and provide an opportunity for comprehensive annotation of TEs. Numerous methods exist for annotation of each class of TEs, but their relative performances have not been systematically compared. Moreover, a comprehensive pipeline is needed to produce a non-redundant library of TEs for species lacking this resource to generate whole-genome TE annotations. RESULTS We benchmark existing programs based on a carefully curated library of rice TEs. We evaluate the performance of methods annotating long terminal repeat (LTR) retrotransposons, terminal inverted repeat (TIR) transposons, short TIR transposons known as miniature inverted transposable elements (MITEs), and Helitrons. Performance metrics include sensitivity, specificity, accuracy, precision, FDR, and F1. Using the most robust programs, we create a comprehensive pipeline called Extensive de-novo TE Annotator (EDTA) that produces a filtered non-redundant TE library for annotation of structurally intact and fragmented elements. EDTA also deconvolutes nested TE insertions frequently found in highly repetitive genomic regions. Using other model species with curated TE libraries (maize and Drosophila), EDTA is shown to be robust across both plant and animal species. CONCLUSIONS The benchmarking results and pipeline developed here will greatly facilitate TE annotation in eukaryotic genomes. These annotations will promote a much more in-depth understanding of the diversity and evolution of TEs at both intra- and inter-species levels. EDTA is open-source and freely available: https://github.com/oushujun/EDTA.
Collapse
Affiliation(s)
- Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA
| | - Weija Su
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, 50011, USA
| | - Yi Liao
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, 92697, USA
| | - Kapeel Chougule
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
| | - Jireh R A Agda
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario, N1G 2W1, Canada
| | - Adam J Hellinga
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario, N1G 2W1, Canada
| | | | - Tyler A Elliott
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario, N1G 2W1, Canada
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
- USDA-ARS NEA Robert W. Holley Center for Agriculture and Health, Cornell University, Ithaca, NY, 14853, USA
| | - Thomas Peterson
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA, 50011, USA
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA.
| | - Candice N Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, MN, 55108, USA.
| | - Matthew B Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, 50011, USA.
| |
Collapse
|
48
|
Ou S, Su W, Liao Y, Chougule K, Agda JRA, Hellinga AJ, Lugo CSB, Elliott TA, Ware D, Peterson T, Jiang N, Hirsch CN, Hufford MB. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol 2019; 20:275. [PMID: 31843001 PMCID: PMC6913007 DOI: 10.1186/s13059-019-1905-y] [Citation(s) in RCA: 452] [Impact Index Per Article: 90.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2019] [Accepted: 11/28/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Sequencing technology and assembly algorithms have matured to the point that high-quality de novo assembly is possible for large, repetitive genomes. Current assemblies traverse transposable elements (TEs) and provide an opportunity for comprehensive annotation of TEs. Numerous methods exist for annotation of each class of TEs, but their relative performances have not been systematically compared. Moreover, a comprehensive pipeline is needed to produce a non-redundant library of TEs for species lacking this resource to generate whole-genome TE annotations. RESULTS We benchmark existing programs based on a carefully curated library of rice TEs. We evaluate the performance of methods annotating long terminal repeat (LTR) retrotransposons, terminal inverted repeat (TIR) transposons, short TIR transposons known as miniature inverted transposable elements (MITEs), and Helitrons. Performance metrics include sensitivity, specificity, accuracy, precision, FDR, and F1. Using the most robust programs, we create a comprehensive pipeline called Extensive de-novo TE Annotator (EDTA) that produces a filtered non-redundant TE library for annotation of structurally intact and fragmented elements. EDTA also deconvolutes nested TE insertions frequently found in highly repetitive genomic regions. Using other model species with curated TE libraries (maize and Drosophila), EDTA is shown to be robust across both plant and animal species. CONCLUSIONS The benchmarking results and pipeline developed here will greatly facilitate TE annotation in eukaryotic genomes. These annotations will promote a much more in-depth understanding of the diversity and evolution of TEs at both intra- and inter-species levels. EDTA is open-source and freely available: https://github.com/oushujun/EDTA.
Collapse
Affiliation(s)
- Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011 USA
| | - Weija Su
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA 50011 USA
| | - Yi Liao
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697 USA
| | - Kapeel Chougule
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA
| | - Jireh R. A. Agda
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario N1G 2W1 Canada
| | - Adam J. Hellinga
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario N1G 2W1 Canada
| | | | - Tyler A. Elliott
- Centre for Biodiversity Genomics, University of Guelph, Guelph, Ontario N1G 2W1 Canada
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724 USA
- USDA-ARS NEA Robert W. Holley Center for Agriculture and Health, Cornell University, Ithaca, NY 14853 USA
| | - Thomas Peterson
- Department of Genetics, Development, and Cell Biology, Iowa State University, Ames, IA 50011 USA
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI 48824 USA
| | - Candice N. Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, MN 55108 USA
| | - Matthew B. Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011 USA
| |
Collapse
|
49
|
Martin SL, Parent JS, Laforest M, Page E, Kreiner JM, James T. Population Genomic Approaches for Weed Science. PLANTS (BASEL, SWITZERLAND) 2019; 8:E354. [PMID: 31546893 PMCID: PMC6783936 DOI: 10.3390/plants8090354] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 09/12/2019] [Accepted: 09/14/2019] [Indexed: 12/16/2022]
Abstract
Genomic approaches are opening avenues for understanding all aspects of biological life, especially as they begin to be applied to multiple individuals and populations. However, these approaches typically depend on the availability of a sequenced genome for the species of interest. While the number of genomes being sequenced is exploding, one group that has lagged behind are weeds. Although the power of genomic approaches for weed science has been recognized, what is needed to implement these approaches is unfamiliar to many weed scientists. In this review we attempt to address this problem by providing a primer on genome sequencing and provide examples of how genomics can help answer key questions in weed science such as: (1) Where do agricultural weeds come from; (2) what genes underlie herbicide resistance; and, more speculatively, (3) can we alter weed populations to make them easier to control? This review is intended as an introduction to orient weed scientists who are thinking about initiating genome sequencing projects to better understand weed populations, to highlight recent publications that illustrate the potential for these methods, and to provide direction to key tools and literature that will facilitate the development and execution of weed genomic projects.
Collapse
Affiliation(s)
- Sara L Martin
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada.
| | - Jean-Sebastien Parent
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada.
| | - Martin Laforest
- Saint-Jean-sur-Richelieu Research and Development Centre, Agriculture and Agri-Food Canada, Saint-Jean-sur-Richelieu, QC J3B 3E6, Canada.
| | - Eric Page
- Harrow Research and Development Centre, Agriculture and Agri-Food Canada, Harrow, ON N0R 1G0, Canada.
| | - Julia M Kreiner
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON M5S 3B2, Canada.
| | - Tracey James
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, Ottawa, ON K1A 0C6, Canada.
| |
Collapse
|