1
Sonsungsan P, Nganga ML, Lieberman MC, Amundson KR, Stewart V, Plaimas K, Comai L, Henry IM. A k-mer-based bulked segregant analysis approach to map seed traits in unphased heterozygous potato genomes. G3 (Bethesda, Md.) 2024; 14:jkae035. [PMID: 38366577 PMCID: PMC10989861 DOI: 10.1093/g3journal/jkae035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/06/2023] [Revised: 02/06/2024] [Accepted: 02/09/2024] [Indexed: 02/18/2024]
Abstract
High-throughput sequencing-based methods for bulked segregant analysis (BSA) allow for the rapid identification of genetic markers associated with traits of interest. BSA studies have successfully identified qualitative (binary) and quantitative trait loci (QTLs) using QTL mapping. However, most require population structures that fit the models available and a reference genome. Instead, high-throughput short-read sequencing can be combined with BSA of k-mers (BSA-k-mer) to map traits that appear refractory to standard approaches. This method can be applied to any organism and is particularly useful for species with genomes diverged from the closest sequenced genome. It is also instrumental when dealing with highly heterozygous and potentially polyploid genomes without phased haplotype assemblies and for which a single haplotype can control a trait. Finally, it is flexible in terms of population structure. Here, we apply the BSA-k-mer method for the rapid identification of candidate regions related to seed spot and seed size in diploid potato. Using a mixture of F1 and F2 individuals from a cross between 2 highly heterozygous parents, candidate sequences were identified for each trait using the BSA-k-mer approach. Using parental reads, we were able to determine the parental origin of the loci. Finally, we mapped the identified k-mers to a closely related potato genome to validate the method and determine the genomic loci underlying these sequences. The location identified for the seed spot matches with previously identified loci associated with pigmentation in potato. The loci associated with seed size are novel. Both loci are relevant in future breeding toward true seeds in potato.
Affiliation(s)
- Pajaree Sonsungsan
  - Program in Bioinformatics and Computational Biology, Graduate School, Chulalongkorn University, Bangkok 10330, Thailand
- Mwaura Livingstone Nganga
  - Department of Plant Biology and Genome Center, University of California, Davis, Davis, CA 95616, USA
- Meric C Lieberman
  - Department of Plant Biology and Genome Center, University of California, Davis, Davis, CA 95616, USA
- Kirk R Amundson
  - Department of Plant Biology and Genome Center, University of California, Davis, Davis, CA 95616, USA
- Victoria Stewart
  - Department of Plant Biology and Genome Center, University of California, Davis, Davis, CA 95616, USA
- Kitiporn Plaimas
  - Omics Science and Bioinformatics Center, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand
  - Advanced Virtual and Intelligent Computing (AVIC) Center, Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand
- Luca Comai
  - Department of Plant Biology and Genome Center, University of California, Davis, Davis, CA 95616, USA
- Isabelle M Henry
  - Department of Plant Biology and Genome Center, University of California, Davis, Davis, CA 95616, USA
2
Fletcher K, Michelmore R. Genome-Enabled Insights into Downy Mildew Biology and Evolution. Annual Review of Phytopathology 2023; 61:165-183. [PMID: 37268005 DOI: 10.1146/annurev-phyto-021622-103440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/04/2023]
Abstract
Oomycetes that cause downy mildew diseases are highly specialized, obligately biotrophic phytopathogens that can have major impacts on agriculture and natural ecosystems. Deciphering the genome sequence of these organisms provides foundational tools to study and deploy control strategies against downy mildew pathogens (DMPs). The recent telomere-to-telomere genome assembly of the DMP Peronospora effusa revealed high levels of synteny with distantly related DMPs, higher than expected repeat content, and previously undescribed architectures. This provides a road map for generating similar high-quality genome assemblies for other oomycetes. This review discusses biological insights made using this and other assemblies, including ancestral chromosome architecture, modes of sexual and asexual variation, the occurrence of heterokaryosis, candidate gene identification, functional validation, and population dynamics. We also discuss future avenues of research likely to be fruitful in studies of DMPs and highlight resources necessary for advancing our understanding and ability to forecast and control disease outbreaks.
Affiliation(s)
- Kyle Fletcher
  - The Genome Center, University of California, Davis, California, USA
- Richard Michelmore
  - The Genome Center, University of California, Davis, California, USA
  - Department of Plant Sciences; Department of Molecular and Cellular Biology; Department of Medical Microbiology and Immunology, University of California, Davis, California, USA
3
Fletcher K, Martin F, Isakeit T, Cavanaugh K, Magill C, Michelmore R. The genome of the oomycete Peronosclerospora sorghi, a cosmopolitan pathogen of maize and sorghum, is inflated with dispersed pseudogenes. G3 (Bethesda, Md.) 2023; 13:jkac340. [PMID: 36592124 PMCID: PMC9997571 DOI: 10.1093/g3journal/jkac340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/27/2022] [Revised: 11/14/2022] [Accepted: 12/05/2022] [Indexed: 01/03/2023]
Abstract
Several species in the oomycete genus Peronosclerospora cause downy mildew on maize and can result in significant yield losses in Asia. Bio-surveillance of these pathogens is a high priority to prevent epidemics on maize in the United States and consequent damage to the US economy. The unresolved taxonomy and dearth of molecular resources for Peronosclerospora spp. hinder these efforts. P. sorghi is a pathogen of sorghum and maize with a global distribution, for which limited diversity has been detected in the southern USA. We characterized the genome, transcriptome, and mitogenome of an isolate, representing the US pathotype 6. The highly homozygous genome was assembled using 10× Genomics linked reads and scaffolded using Hi-C into 13 chromosomes. The total assembled length was 303.2 Mb, larger than any other oomycete previously assembled. The mitogenome was 38 kb, similar in size to other oomycetes, although it had a unique gene order. Nearly 20,000 genes were annotated in the nuclear genome, more than described for other downy mildew causing oomycetes. The 13 chromosomes of P. sorghi were highly syntenic with the 17 chromosomes of Peronospora effusa with conserved centromeric regions and distinct chromosomal fusions. The increased assembly size and gene count of P. sorghi is due to extensive retrotransposition, resulting in putative pseudogenization. Ancestral genes had higher transcript abundance and were enriched for differential expression. This study provides foundational resources for analysis of Peronosclerospora and comparisons to other oomycete genera. Further genomic studies of global Peronosclerospora spp. will determine the suitability of the mitogenome, ancestral genes, and putative pseudogenes for marker development and taxonomic relationships.
Affiliation(s)
- Kyle Fletcher
  - The Genome Center, University of California, Davis, CA 95616, USA
- Frank Martin
  - U.S. Department of Agriculture–Agriculture Research Service, Salinas, CA, 93905, USA
- Thomas Isakeit
  - Department of Plant Pathology and Microbiology, Texas A&M University, College Station, TX 77843, USA
- Keri Cavanaugh
  - The Genome Center, University of California, Davis, CA 95616, USA
- Clint Magill
  - Department of Plant Pathology and Microbiology, Texas A&M University, College Station, TX 77843, USA
- Richard Michelmore
  - The Genome Center, University of California, Davis, CA 95616, USA
  - Departments of Plant Sciences, Molecular & Cellular Biology, Medical Microbiology & Immunology, University of California, Davis, CA 95616, USA
4
Fletcher K, Shin OH, Clark KJ, Feng C, Putman AI, Correll JC, Klosterman SJ, Van Deynze A, Michelmore RW. Ancestral Chromosomes for Family Peronosporaceae Inferred from a Telomere-to-Telomere Genome Assembly of Peronospora effusa. Molecular Plant-Microbe Interactions: MPMI 2022; 35:450-463. [PMID: 35226812 DOI: 10.1094/mpmi-09-21-0227-r] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Indexed: 06/14/2023]
Abstract
Downy mildew disease of spinach, caused by the oomycete Peronospora effusa, causes major losses to spinach production. In this study, the 17 chromosomes of P. effusa were assembled telomere-to-telomere, using Pacific Biosciences high-fidelity reads. Of these, 16 chromosomes are complete and gapless; chromosome 15 contains one gap bridging the nucleolus organizer region. This is the first telomere-to-telomere genome assembly for an oomycete. Putative centromeric regions were identified on all chromosomes. This new assembly enables a reevaluation of the genomic composition of Peronospora spp.; the assembly was almost double the size and contained more repeat sequences than previously reported for any Peronospora species. Genome fragments consistently underrepresented in six previously reported assemblies of P. effusa typically encoded repeats. Some genes annotated as encoding effectors were organized into multigene clusters on several chromosomes. Putative effectors were annotated on 16 of the 17 chromosomes. The intergenic distances between annotated genes were consistent with compartmentalization of the genome into gene-dense and gene-sparse regions. Genes encoding putative effectors were enriched in gene-sparse regions. The near-gapless assembly revealed apparent horizontal gene transfer from Ascomycete fungi. Gene order was highly conserved between P. effusa and the genetically oriented assembly of the oomycete Bremia lactucae; high levels of synteny were also detected with Phytophthora sojae. Extensive synteny between phylogenetically distant species suggests that many other oomycete species may have similar chromosome organization. Therefore, this assembly provides the foundation for genomic analyses of diverse oomycetes. Copyright © 2022 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.
Affiliation(s)
- Kyle Fletcher
  - The Genome Center, University of California, Davis, CA, U.S.A
- Oon-Ha Shin
  - Seed Biotechnology Center, Department of Plant Sciences, University of California, Davis, CA, U.S.A
- Kelley J Clark
  - United States Department of Agriculture-Agricultural Research Station, 1636 East Alisal Street, Salinas, CA, U.S.A
  - Department of Entomology & Plant Pathology, University of Arkansas, Fayetteville, AR, U.S.A
- Chunda Feng
  - Department of Entomology & Plant Pathology, University of Arkansas, Fayetteville, AR, U.S.A
- Alexander I Putman
  - Department of Microbiology and Plant Pathology, University of California, Riverside, CA, U.S.A
- James C Correll
  - Department of Entomology & Plant Pathology, University of Arkansas, Fayetteville, AR, U.S.A
- Steven J Klosterman
  - United States Department of Agriculture-Agricultural Research Station, 1636 East Alisal Street, Salinas, CA, U.S.A
- Allen Van Deynze
  - Seed Biotechnology Center, Department of Plant Sciences, University of California, Davis, CA, U.S.A
- Richard W Michelmore
  - The Genome Center, University of California, Davis, CA, U.S.A
  - Departments of Plant Sciences, Molecular & Cellular Biology, Medical Microbiology & Immunology, University of California, Davis, CA, U.S.A
5
Walve R, Salmela L. HGGA: hierarchical guided genome assembler. BMC Bioinformatics 2022; 23:167. [PMID: 35525918 PMCID: PMC9077837 DOI: 10.1186/s12859-022-04701-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/22/2021] [Accepted: 04/25/2022] [Indexed: 11/10/2022]
Abstract
BACKGROUND: De novo genome assembly typically produces a set of contigs instead of the complete genome. Thus, additional data such as genetic linkage maps, optical maps, or Hi-C data are needed to resolve the complete structure of the genome. Most previous work uses the additional data to order and orient contigs. RESULTS: Here we introduce a framework to guide genome assembly with additional data. Our approach is based on clustering the reads, such that each read in each cluster originates from nearby positions in the genome according to the additional data. These sets are then assembled independently and the resulting contigs are further assembled in a hierarchical manner. We implemented our approach for genetic linkage maps in a tool called HGGA. CONCLUSIONS: Our experiments on simulated and real Pacific Biosciences long reads and genetic linkage maps show that HGGA produces a more contiguous assembly with fewer contigs and from 1.2 to 9.8 times higher NGA50 or N50 than a plain assembly of the reads and 1.03 to 6.5 times higher NGA50 or N50 than a previous approach integrating genetic linkage maps with contig assembly. Furthermore, the correctness of the assembly remains similar or improves compared to an assembly using only the read data.
Affiliation(s)
- Riku Walve
  - Department of Computer Science, Helsinki Institute for Information Technology HIIT, University of Helsinki, Helsinki, Finland
- Leena Salmela
  - Department of Computer Science, Helsinki Institute for Information Technology HIIT, University of Helsinki, Helsinki, Finland
6
Macias-González M, Truco MJ, Han R, Jenni S, Michelmore RW. High-resolution genetic dissection of the major QTL for tipburn resistance in lettuce, Lactuca sativa. G3 (Bethesda, Md.) 2021; 11:jkab097. [PMID: 33772545 PMCID: PMC8495944 DOI: 10.1093/g3journal/jkab097] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 11/11/2020] [Accepted: 03/21/2021] [Indexed: 12/13/2022]
Abstract
Tipburn is an important physiological disorder of lettuce, Lactuca sativa L., related to calcium deficiency that can result in leaf necrosis and unmarketable crops. The major quantitative trait locus (QTL), qTPB5.2, can account for up to 70% of the phenotypic variance for tipburn incidence in the field. This QTL was genetically dissected to identify candidate genes for tipburn by creating lines with recombination events within the QTL and assessing their resistance to tipburn. By comparing lines with contrasting haplotypes, the genetic region was narrowed down to ∼877 Kb that was associated with a reduction of tipburn by ∼60%. Analysis of the lettuce reference genome sequence revealed 12 genes in this region, one of which is a calcium transporter with a single nucleotide polymorphism in an exon between haplotypes with contrasting phenotypes. RNA-seq analysis of recombinants revealed two genes that were differentially expressed between contrasting haplotypes consistent with the tipburn phenotype. One encodes a Teosinte branched1/Cycloidea/Proliferating Cell factor transcription factor; however, differential expression of the calcium transporter was detected. The phenotypic data indicated that there is a second region outside of the ∼877 Kb region but within the QTL, at which a haplotype from the susceptible parent decreased tipburn by 10-20%. A recombinant line was identified with beneficial haplotypes in each region from both parents that showed greater tipburn resistance than the resistant parent; this line could be used as the foundation for breeding cultivars with more resistance than is currently available.
Affiliation(s)
- Miguel Macias-González
  - The Genome Center, University of California, Davis, Davis, CA 95616, USA
  - Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA
- Maria Jose Truco
  - The Genome Center, University of California, Davis, Davis, CA 95616, USA
- Rongkui Han
  - The Genome Center, University of California, Davis, Davis, CA 95616, USA
- Sylvie Jenni
  - Science and Technology Branch, Agriculture and Agri-Food Canada, Saint-Jean-sur-Richelieu, QC J3B 3E6, Canada
- Richard W Michelmore
  - The Genome Center, University of California, Davis, Davis, CA 95616, USA
  - Department of Plant Sciences, University of California, Davis, Davis, CA 95616, USA