1
|
Zou C, Sapkota S, Figueroa-Balderas R, Glaubitz J, Cantu D, Kingham BF, Sun Q, Cadle-Davidson L. A multitiered haplotype strategy to enhance phased assembly and fine mapping of a disease resistance locus. PLANT PHYSIOLOGY 2023; 193:2321-2336. [PMID: 37706526 DOI: 10.1093/plphys/kiad494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 07/10/2023] [Accepted: 08/17/2023] [Indexed: 09/15/2023]
Abstract
Fine mapping of quantitative trait loci (QTL) to dissect the genetic basis of traits of interest is essential to modern breeding practice. Here, we employed a multitiered haplotypic marker system to increase fine mapping accuracy by constructing a chromosome-level, haplotype-resolved parental genome, accurate detection of recombination sites, and allele-specific characterization of the transcriptome. In the first tier of this system, we applied the preexisting panel of 2,000 rhAmpSeq core genome markers that is transferable across the entire Vitis genus and provides a genomic resolution of 200 kb to 1 Mb. The second tier consisted of high-density haplotypic markers generated from Illumina skim sequencing data for samples enriched for relevant recombinations, increasing the potential resolution to hundreds of base pairs. We used this approach to dissect a novel Resistance to Plasmopara viticola-33 (RPV33) locus conferring resistance to grapevine downy mildew, narrowing the candidate region to only 0.46 Mb. In the third tier, we used allele-specific RNA-seq analysis to identify a cluster of 3 putative disease resistance RPP13-like protein 2 genes located tandemly in a nonsyntenic insertion as candidates for the disease resistance trait. In addition, combining the rhAmpSeq core genome haplotype markers and skim sequencing-derived high-density haplotype markers enabled chromosomal-level scaffolding and phasing of the grape Vitis × doaniana 'PI 588149' assembly, initially built solely from Pacific Biosciences (PacBio) high-fidelity (HiFi) reads, leading to the correction of 16 large-scale phasing errors. Our mapping strategy integrates high-density, phased genetic information with individual reference genomes to pinpoint the genetic basis of QTLs and will likely be widely adopted in highly heterozygous species.
Collapse
Affiliation(s)
- Cheng Zou
- BRC Bioinformatics Facility, Institute of Biotechnology, Cornell University, Ithaca, NY, 14853, USA
| | - Surya Sapkota
- School of Integrative Plant Science, Cornell AgriTech, Cornell University, Geneva, NY 14456, USA
- Grape Genetics Research Unit, USDA-ARS, Geneva, NY 14456, USA
| | - Rosa Figueroa-Balderas
- Department of Viticulture and Enology, University of California Davis, Davis, CA 95616, USA
| | - Jeff Glaubitz
- BRC Bioinformatics Facility, Institute of Biotechnology, Cornell University, Ithaca, NY, 14853, USA
| | - Dario Cantu
- Department of Viticulture and Enology, University of California Davis, Davis, CA 95616, USA
| | - Brewster F Kingham
- DNA Sequencing & Genotyping Center, Delaware Biotechnology Institute, University of Delaware, Newark, DE 19711, USA
| | - Qi Sun
- BRC Bioinformatics Facility, Institute of Biotechnology, Cornell University, Ithaca, NY, 14853, USA
| | - Lance Cadle-Davidson
- School of Integrative Plant Science, Cornell AgriTech, Cornell University, Geneva, NY 14456, USA
- Grape Genetics Research Unit, USDA-ARS, Geneva, NY 14456, USA
| |
Collapse
|
2
|
Nazareno ES, Fiedler JD, Ardayfio NK, Miller ME, Figueroa M, Kianian SF. Genetic Analysis and Physical Mapping of Oat Adult Plant Resistance Loci Against Puccinia coronata f. sp. avenae. PHYTOPATHOLOGY 2023; 113:1307-1316. [PMID: 36721375 DOI: 10.1094/phyto-10-22-0395-r] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Six quantitative trait loci (QTLs) for adult plant resistance against oat crown rust (Puccinia coronata f. sp. avenae) were identified from mapping three recombinant inbred populations. Using genotyping-by-sequencing with markers called against the OT3098 v1 reference genome, the QTLs were mapped on six different chromosomes: Chr1D, Chr4D, Chr5A, Chr5D, Chr7A, and Chr7C. Composite interval mapping with marker cofactor selection showed that the phenotypic variance explained by all identified QTLs for coefficient of infection range from 12.2 to 46.9%, whereas heritability estimates ranged from 0.11 to 0.38. The significant regions were narrowed down to intervals of 3.9 to 25 cM, equivalent to physical distances of 11 to 133 Mb. At least two flanking single-nucleotide polymorphism markers were identified within 10 cM of each QTL that could be used in marker-assisted introgression, pyramiding, and selection. The additive effects of the QTLs in each population were determined using single-nucleotide polymorphism haplotype data, which showed a significantly lower coefficient of infection in lines homozygous for the resistant alleles. Analysis of pairwise linkage disequilibrium also revealed high correlation of markers and presence of linkage blocks in the significant regions. To further facilitate marker-assisted breeding, polymerase chain reaction allelic competitive extension (PACE) markers for the adult plant resistance loci were developed. Putative candidate genes were also identified in each of the significant regions, which include resistance gene analogs that encode for kinases, ligases, and predicted receptors of avirulence proteins from pathogens.
Collapse
Affiliation(s)
- Eric S Nazareno
- Department of Plant Pathology, University of Minnesota, St. Paul, MN, U.S.A
| | - Jason D Fiedler
- U.S. Department of Agriculture-Agricultural Research Service, Cereal Crops Research Unit, Fargo, ND, U.S.A
| | - Naa Korkoi Ardayfio
- U.S. Department of Agriculture-Agricultural Research Service, Cereal Crops Research Unit, Fargo, ND, U.S.A
| | - Marisa E Miller
- Department of Plant Pathology, University of Minnesota, St. Paul, MN, U.S.A
- Pairwise Plants, LLC, 807 East Main Street, Suite 4-100, Durham, NC, U.S.A
| | - Melania Figueroa
- Commonwealth Scientific and Industrial Research Organisation, Agriculture and Food, Canberra, ACT, Australia
| | - Shahryar F Kianian
- U.S. Department of Agriculture-Agricultural Research Service, Cereal Disease Laboratory, St. Paul, MN, U.S.A
| |
Collapse
|
3
|
Bhattarai G, Shi A, Mou B, Correll JC. Skim resequencing finely maps the downy mildew resistance loci RPF2 and RPF3 in spinach cultivars whale and Lazio. HORTICULTURE RESEARCH 2023; 10:uhad076. [PMID: 37323230 PMCID: PMC10261881 DOI: 10.1093/hr/uhad076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Accepted: 04/10/2023] [Indexed: 06/17/2023]
Abstract
Commercial production of spinach (Spinacia oleracea L.) is centered in California and Arizona in the US, where downy mildew caused by Peronospora effusa is the most destructive disease. Nineteen typical races of P. effusa have been reported to infect spinach, with 16 identified after 1990. The regular appearance of new pathogen races breaks the resistance gene introgressed in spinach. We attempted to map and delineate the RPF2 locus at a finer resolution, identify linked single nucleotide polymorphism (SNP) markers, and report candidate downy mildew resistance (R) genes. Progeny populations segregating for RPF2 locus derived from resistant differential cultivar Lazio were infected using race 5 of P. effusa and were used to study for genetic transmission and mapping analysis in this study. Association analysis performed with low coverage whole genome resequencing-generated SNP markers mapped the RPF2 locus between 0.47 to 1.46 Mb of chromosome 3 with peak SNP (Chr3_1, 221, 009) showing a LOD value of 61.6 in the GLM model in TASSEL, which was within 1.08 Kb from Spo12821, a gene that encodes CC-NBS-LRR plant disease resistance protein. In addition, a combined analysis of progeny panels of Lazio and Whale segregating for RPF2 and RPF3 loci delineated the resistance section in chromosome 3 between 1.18-1.23 and 1.75-1.76 Mb. This study provides valuable information on the RPF2 resistance region in the spinach cultivar Lazio compared to RPF3 loci in the cultivar Whale. The RPF2 and RPF3 specific SNP markers, plus the resistant genes reported here, could add value to breeding efforts to develop downy mildew resistant cultivars in the future.
Collapse
Affiliation(s)
| | | | - Beiquan Mou
- USDA-ARS Crop Improvement and Protection Research Unit, Salinas, CA 93905, USA
| | - James C Correll
- Department of Plant Pathology, University of Arkansas, Fayetteville, AR 72701, USA
| |
Collapse
|
4
|
Sapkota S, Zou C, Ledbetter C, Underhill A, Sun Q, Gadoury D, Cadle-Davidson L. Discovery and genome-guided mapping of REN12 from Vitis amurensis, conferring strong, rapid resistance to grapevine powdery mildew. HORTICULTURE RESEARCH 2023; 10:uhad052. [PMID: 37213681 PMCID: PMC10194894 DOI: 10.1093/hr/uhad052] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 03/12/2023] [Indexed: 05/23/2023]
Abstract
Powdery mildew resistance genes restrict infection attempts at different stages of pathogenesis. Here, a strong and rapid powdery mildew resistance phenotype was discovered from Vitis amurensis 'PI 588631' that rapidly stopped over 97% of Erysiphe necator conidia, before or immediately after emergence of a secondary hypha from appressoria. This resistance was effective across multiple years of vineyard evaluation on leaves, stems, rachises, and fruit and against a diverse array of E. necator laboratory isolates. Using core genome rhAmpSeq markers, resistance mapped to a single dominant locus (here named REN12) on chromosome 13 near 22.8-27.0 Mb, irrespective of tissue type, explaining up to 86.9% of the phenotypic variation observed on leaves. Shotgun sequencing of recombinant vines using skim-seq technology enabled the locus to be further resolved to a 780 kb region, from 25.15 to 25.93 Mb. RNASeq analysis indicated the allele-specific expression of four resistance genes (NLRs) from the resistant parent. REN12 is one of the strongest powdery mildew resistance loci in grapevine yet documented, and the rhAmpSeq sequences presented here can be directly used for marker-assisted selection or converted to other genotyping platforms. While no virulent isolates were identified among the genetically diverse isolates and wild populations of E. necator tested here, NLR loci like REN12 are often race-specific. Thus, stacking of multiple resistance genes and minimal use of fungicides should enhance the durability of resistance and could enable a 90% reduction in fungicides in low-rainfall climates where few other pathogens attack the foliage or fruit.
Collapse
Affiliation(s)
- Surya Sapkota
- School of Integrative Plant Science, Cornell AgriTech, Cornell University, Geneva, NY, 14456, USA
| | - Cheng Zou
- BRC Bioinformatics Facility, Institute of Biotechnology, Cornell University, Ithaca, NY, 14853, USA
| | - Craig Ledbetter
- United States Department of Agriculture (USDA)-Agricultural Research Service (ARS), Crop Diseases, Pests and Genetics Research Unit, San Joaquin Valley Agricultural Sciences Center, Parlier, CA, 93648, USA
| | - Anna Underhill
- USDA-ARS, Grape Genetics Research Unit, Geneva, NY, 14456, USA
| | - Qi Sun
- BRC Bioinformatics Facility, Institute of Biotechnology, Cornell University, Ithaca, NY, 14853, USA
| | - David Gadoury
- School of Integrative Plant Science, Cornell AgriTech, Cornell University, Geneva, NY, 14456, USA
| | | |
Collapse
|
5
|
Abstract
Over the past decade, advances in plant genotyping have been critical in enabling the identification of genetic diversity, in understanding evolution, and in dissecting important traits in both crops and native plants. The widespread popularity of single-nucleotide polymorphisms (SNPs) has prompted significant improvements to SNP-based genotyping, including SNP arrays, genotyping by sequencing, and whole-genome resequencing. More recent approaches, including genotyping structural variants, utilizing pangenomes to capture species-wide genetic diversity and exploiting machine learning to analyze genotypic data sets, are pushing the boundaries of what plant genotyping can offer. In this chapter, we highlight these innovations and discuss how they will accelerate and advance future genotyping efforts.
Collapse
|
6
|
de Greef E, Suh A, Thorstensen MJ, Delmore KE, Fraser KC. Genomic architecture of migration timing in a long-distance migratory songbird. Sci Rep 2023; 13:2437. [PMID: 36765096 PMCID: PMC9918537 DOI: 10.1038/s41598-023-29470-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Accepted: 02/06/2023] [Indexed: 02/12/2023] Open
Abstract
The impact of climate change on spring phenology poses risks to migratory birds, as migration timing is controlled predominantly by endogenous mechanisms. Despite recent advances in our understanding of the underlying genetic basis of migration timing, the ways that migration timing phenotypes in wild individuals may map to specific genomic regions requires further investigation. We examined the genetic architecture of migration timing in a long-distance migratory songbird (purple martin, Progne subis subis) by integrating genomic data with an extensive dataset of direct migratory tracks. A moderate to large amount of variance in spring migration arrival timing was explained by genomics (proportion of phenotypic variation explained by genomics = 0.74; polygenic score R2 = 0.24). On chromosome 1, a region that was differentiated between migration timing phenotypes contained genes that could facilitate nocturnal flights and act as epigenetic modifiers. Overall, these results advance our understanding of the genomic underpinnings of migration timing.
Collapse
Affiliation(s)
- Evelien de Greef
- Department of Biological Sciences, University of Manitoba, Winnipeg, R3T 2N2, Canada.
| | - Alexander Suh
- Department of Organismal Biology, Uppsala University, 752 36, Uppsala, Sweden
- School of Biological Sciences, University of East Anglia, Norwich, NR4 7TU, UK
| | - Matt J Thorstensen
- Department of Biological Sciences, University of Manitoba, Winnipeg, R3T 2N2, Canada
| | - Kira E Delmore
- Department of Biology, Texas A&M University, College Station, TX, 77843, USA
| | - Kevin C Fraser
- Department of Biological Sciences, University of Manitoba, Winnipeg, R3T 2N2, Canada
| |
Collapse
|
7
|
Lu Q, Yu X, Wang H, Yu Z, Zhang X, Zhao Y. Construction of ultra-high-density genetic linkage map of a sorghum-sudangrass hybrid using whole genome resequencing. PLoS One 2022; 17:e0278153. [PMID: 36445892 PMCID: PMC9707794 DOI: 10.1371/journal.pone.0278153] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Accepted: 11/10/2022] [Indexed: 12/02/2022] Open
Abstract
The sorghum-sudangrass hybrid is a vital annual gramineous herbage. Few reports exist on its ultra-high-density genetic map. In this study, we sought to create an ultra-high-density genetic linkage map for this hybrid to strengthen its functional genomics research and genetic breeding. We used 150 sorghum-sudangrass hybrid F2 individuals and their parents (scattered ear sorghum and red hull sudangrass) for high-throughput sequencing on the basis of whole genome resequencing. In total, 1,180.66 Gb of data were collected. After identification, filtration for integrity, and partial segregation, over 5,656 single nucleotide polymorphism markers of high quality were detected. An ultra-high-density genetic linkage map was constructed using these data. The markers covered approximately 2,192.84 cM of the map with average marker intervals of 0.39 cM. The length ranged from 115.39 cM to 264.04 cM for the 10 linkage groups. Currently, this represents the first genetic linkage map of this size, number of molecular markers, density, and coverage for sorghum-sudangrass hybrid. The findings of this study provide valuable genome-level information on species evolution and comparative genomics analysis and lay the foundation for further research on quantitative trait loci fine mapping and gene cloning and marker-assisted breeding of important traits in sorghum-sudangrass hybrids.
Collapse
Affiliation(s)
- Qianqian Lu
- Agricultural College, Inner Mongolia Agricultural University, Hohhot, Inner Mongolia, China
| | - Xiaoxia Yu
- Agricultural College, Inner Mongolia Agricultural University, Hohhot, Inner Mongolia, China
| | - Huiting Wang
- Agricultural College, Inner Mongolia Agricultural University, Hohhot, Inner Mongolia, China
| | - Zhuo Yu
- Agricultural College, Inner Mongolia Agricultural University, Hohhot, Inner Mongolia, China
- * E-mail:
| | - Xia Zhang
- Agricultural College, Inner Mongolia Agricultural University, Hohhot, Inner Mongolia, China
| | - Yaqi Zhao
- Agricultural College, Inner Mongolia Agricultural University, Hohhot, Inner Mongolia, China
| |
Collapse
|
8
|
Bhattarai G, Olaoye D, Mou B, Correll JC, Shi A. Mapping and selection of downy mildew resistance in spinach cv. whale by low coverage whole genome sequencing. FRONTIERS IN PLANT SCIENCE 2022; 13:1012923. [PMID: 36275584 PMCID: PMC9583407 DOI: 10.3389/fpls.2022.1012923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/06/2022] [Accepted: 09/16/2022] [Indexed: 06/16/2023]
Abstract
Spinach (Spinacia oleracea) is a popular leafy vegetable crop and commercial production is centered in California and Arizona in the US. The oomycete Peronospora effusa causes the most important disease in spinach, downy mildew. A total of nineteen races of P. effusa are known, with more than 15 documented in the last three decades, and the regular emergence of new races is continually overcoming the genetic resistance to the pathogen. This study aimed to finely map the downy mildew resistance locus RPF3 in spinach, identify single nucleotide polymorphism (SNP) markers associated with the resistance, refine the candidate genes responsible for the resistance, and evaluate the prediction performance using multiple machine learning genomic prediction (GP) methods. Segregating progeny population developed from a cross of resistant cultivar Whale and susceptible cultivar Viroflay to race 5 of P. effusa was inoculated under greenhouse conditions to determine downy mildew disease response across the panel. The progeny panel and the parents were resequenced at low coverage (1x) to identify genome wide SNP markers. Association analysis was performed using disease response phenotype data and SNP markers in TASSEL, GAPIT, and GENESIS programs and mapped the race 5 resistance loci (RPF3) to 1.25 and 2.73 Mb of Monoe-Viroflay chromosome 3 with the associated SNP in the 1.25 Mb region was 0.9 Kb from the NBS-LRR gene SOV3g001250. The RPF3 locus in the 1.22-1.23 Mb region of Sp75 chromosome 3 is 2.41-3.65 Kb from the gene Spo12821 annotated as NBS-LRR disease resistance protein. This study extended our understanding of the genetic basis of downy mildew resistance in spinach cultivar Whale and mapped the RPF3 resistance loci close to the NBS-LRR gene providing a target to pursue functional validation. Three SNP markers efficiently selected resistance based on multiple genomic selection (GS) models. The results from this study have added new genomic resources, generated an informed basis of the RPF3 locus resistant to spinach downy mildew pathogen, and developed markers and prediction methods to select resistant lines.
Collapse
Affiliation(s)
- Gehendra Bhattarai
- Department of Horticulture, University of Arkansas, Fayetteville, AR, United States
| | - Dotun Olaoye
- Department of Horticulture, University of Arkansas, Fayetteville, AR, United States
| | - Beiquan Mou
- Crop Improvement and Protection Research Unit, United States Department of Agriculture, Agricultural Research Service, Salinas, CA, United States
| | - James C. Correll
- Department of Plant Pathology, University of Arkansas, Fayetteville, AR, United States
| | - Ainong Shi
- Department of Horticulture, University of Arkansas, Fayetteville, AR, United States
| |
Collapse
|
9
|
Genomic regions associated with virulence in Setosphaeria turcica identified by linkage mapping in a biparental population. Fungal Genet Biol 2021; 159:103655. [PMID: 34954385 DOI: 10.1016/j.fgb.2021.103655] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Revised: 11/17/2021] [Accepted: 12/19/2021] [Indexed: 01/06/2023]
Abstract
Northern corn leaf blight (NCLB) and sorghum leaf blight (SLB) are significant diseases of maize and sorghum, respectively, caused by the filamentous fungus Setosphaeria turcica. Strains of S. turcica are typically host-specific and infect either maize or sorghum. Host specificity in this pathogen is attributed to a single locus for maize and a second distinct locus for sorghum. To identify the genetic basis of host specificity in S. turcica, we generated a biparental population of S. turcica by crossing strains specific to maize and sorghum, phenotyped the population for leaf blight on sorghum and maize, genotyped the population to create a linkage map of S. turcica, and located candidate virulence regions. A total of 190 ascospores from 35 pseudothecia were isolated from the cross of maize and sorghum-specific strains. Greenhouse phenotyping of the biparental population (n = 144) showed independent inheritance of virulence, as indicated by a 1:1:1:1 segregation for virulence to maize, sorghum, both maize and sorghum, and avirulence to both crops. The population and host-specific parent strains were genotyped using genome skim sequencing on an Illumina NovaSeq 6000 platform resulting in over 780 million reads. A total of 32,635 variants including single nucleotide polymorphisms and indels were scored. There was evidence for a large deletion in the sorghum-specific strain of S. turcica. A genetic map consisting of 17 linkage groups spanning 3,069 centimorgans was constructed. Virulence to sorghum and maize mapped on distinct linkage groups with a significant QTL detected for virulence to maize. Furthermore, a single locus each for the in vitro traits hyphal growth rate and conidiation were identified and mapped onto two other linkage groups. In vitro traits did not correlate with in planta virulence complexity, suggesting that virulence on both hosts does not incur a fitness cost. Hyphal growth rate and conidiation were negatively correlated, indicating differences in hyphal growth versus dispersal ability for this pathogen. Identification of genetic regions underlying virulence specificity and saprotrophic growth traits in S. turcica provides a better understanding of the S. turcica- Andropogoneae pathosystem.
Collapse
|
10
|
Jordan KW, Bradbury PJ, Miller ZR, Nyine M, He F, Fraser M, Anderson J, Mason E, Katz A, Pearce S, Carter AH, Prather S, Pumphrey M, Chen J, Cook J, Liu S, Rudd JC, Wang Z, Chu C, Ibrahim AMH, Turkus J, Olson E, Nagarajan R, Carver B, Yan L, Taagen E, Sorrells M, Ward B, Ren J, Akhunova A, Bai G, Bowden R, Fiedler J, Faris J, Dubcovsky J, Guttieri M, Brown-Guedira G, Buckler E, Jannink JL, Akhunov ED. Development of the Wheat Practical Haplotype Graph Database as a Resource for Genotyping Data Storage and Genotype Imputation. G3-GENES GENOMES GENETICS 2021; 12:6423995. [PMID: 34751373 PMCID: PMC9210282 DOI: 10.1093/g3journal/jkab390] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Accepted: 10/21/2021] [Indexed: 12/04/2022]
Abstract
To improve the efficiency of high-density genotype data storage and imputation in bread wheat (Triticum aestivum L.), we applied the Practical Haplotype Graph (PHG) tool. The Wheat PHG database was built using whole-exome capture sequencing data from a diverse set of 65 wheat accessions. Population haplotypes were inferred for the reference genome intervals defined by the boundaries of the high-quality gene models. Missing genotypes in the inference panels, composed of wheat cultivars or recombinant inbred lines genotyped by exome capture, genotyping-by-sequencing (GBS), or whole-genome skim-seq sequencing approaches, were imputed using the Wheat PHG database. Though imputation accuracy varied depending on the method of sequencing and coverage depth, we found 92% imputation accuracy with 0.01× sequence coverage, which was slightly lower than the accuracy obtained using the 0.5× sequence coverage (96.6%). Compared to Beagle, on average, PHG imputation was ∼3.5% (P-value < 2 × 10−14) more accurate, and showed 27% higher accuracy at imputing a rare haplotype introgressed from a wild relative into wheat. We found reduced accuracy of imputation with independent 2× GBS data (88.6%), which increases to 89.2% with the inclusion of parental haplotypes in the database. The accuracy reduction with GBS is likely associated with the small overlap between GBS markers and the exome capture dataset, which was used for constructing PHG. The highest imputation accuracy was obtained with exome capture for the wheat D genome, which also showed the highest levels of linkage disequilibrium and proportion of identity-by-descent regions among accessions in the PHG database. We demonstrate that genetic mapping based on genotypes imputed using PHG identifies SNPs with a broader range of effect sizes that together explain a higher proportion of genetic variance for heading date and meiotic crossover rate compared to previous studies.
Collapse
Affiliation(s)
- Katherine W Jordan
- Department of Plant Pathology, Kansas State University, Manhattan, KS, 66506, USA.,USDA-ARS, Hard Winter Wheat Genetics Research Unit, Manhattan, KS, 66502, USA
| | - Peter J Bradbury
- USDA-ARS, Plant Soil and Nutrition Research Unit, Ithaca, NY, 14853, USA
| | - Zachary R Miller
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA
| | - Moses Nyine
- Department of Plant Pathology, Kansas State University, Manhattan, KS, 66506, USA
| | - Fei He
- Department of Plant Pathology, Kansas State University, Manhattan, KS, 66506, USA
| | - Max Fraser
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
| | - Jim Anderson
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, 55108, USA
| | - Esten Mason
- Department of Soil and Crop Sciences, Colorado State University, Fort Collins, CO, 80521, USA
| | - Andrew Katz
- Department of Soil and Crop Sciences, Colorado State University, Fort Collins, CO, 80521, USA
| | - Stephen Pearce
- Department of Soil and Crop Sciences, Colorado State University, Fort Collins, CO, 80521, USA
| | - Arron H Carter
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA, 99164, USA
| | - Samuel Prather
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA, 99164, USA
| | - Michael Pumphrey
- Department of Crop and Soil Sciences, Washington State University, Pullman, WA, 99164, USA
| | - Jianli Chen
- Department of Plant Sciences, University of Idaho, Aberdeen, ID, 83210, USA
| | - Jason Cook
- Department of Plant Sciences and Plant Pathology, Montana State University, Bozeman, MT, 59717, USA
| | - Shuyu Liu
- Department of Soil and Crop Sciences, Texas A&M AgriLife Research, Amarillo, TX, 79106, USA
| | - Jackie C Rudd
- Department of Soil and Crop Sciences, Texas A&M AgriLife Research, Amarillo, TX, 79106, USA
| | - Zhen Wang
- Department of Soil and Crop Sciences, Texas A&M AgriLife Research, Amarillo, TX, 79106, USA
| | - Chenggen Chu
- Department of Soil and Crop Sciences, Texas A&M AgriLife Research, Amarillo, TX, 79106, USA
| | - Amir M H Ibrahim
- Department of Soil and Crop Sciences, Texas A&M AgriLife Research, Amarillo, TX, 79106, USA
| | - Jonathan Turkus
- Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI, 48824, USA
| | - Eric Olson
- Department of Plant, Soil and Microbial Sciences, Michigan State University, East Lansing, MI, 48824, USA
| | - Ragupathi Nagarajan
- Department of Plant and Soil Sciences, Oklahoma State University, Stillwater, OK, 74075, USA
| | - Brett Carver
- Department of Plant and Soil Sciences, Oklahoma State University, Stillwater, OK, 74075, USA
| | - Liuling Yan
- Department of Plant and Soil Sciences, Oklahoma State University, Stillwater, OK, 74075, USA
| | - Ellie Taagen
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA
| | - Mark Sorrells
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA
| | - Brian Ward
- USDA-ARS, Plant Science Research Unit, Raleigh, NC, 27695, USA
| | - Jie Ren
- Department of Plant Pathology, Kansas State University, Manhattan, KS, 66506, USA.,Integrative Genomics Facility, Kansas State University, Manhattan, KS, 66506 USA
| | - Alina Akhunova
- Department of Plant Pathology, Kansas State University, Manhattan, KS, 66506, USA.,Integrative Genomics Facility, Kansas State University, Manhattan, KS, 66506 USA
| | - Guihua Bai
- USDA-ARS, Hard Winter Wheat Genetics Research Unit, Manhattan, KS, 66502, USA
| | - Robert Bowden
- USDA-ARS, Hard Winter Wheat Genetics Research Unit, Manhattan, KS, 66502, USA
| | - Jason Fiedler
- USDA-ARS, Cereal Crops Research Unit, Fargo, ND, 58102, USA
| | - Justin Faris
- USDA-ARS, Cereal Crops Research Unit, Fargo, ND, 58102, USA
| | - Jorge Dubcovsky
- Department of Plant Sciences, University of California-Davis, Davis, CA, 95616, USA
| | - Mary Guttieri
- USDA-ARS, Hard Winter Wheat Genetics Research Unit, Manhattan, KS, 66502, USA
| | | | - Ed Buckler
- USDA-ARS, Plant Soil and Nutrition Research Unit, Ithaca, NY, 14853, USA
| | - Jean-Luc Jannink
- USDA-ARS, Plant Soil and Nutrition Research Unit, Ithaca, NY, 14853, USA
| | - Eduard D Akhunov
- Department of Plant Pathology, Kansas State University, Manhattan, KS, 66506, USA
| |
Collapse
|
11
|
Impact of genotypic errors with equal and unequal family contribution on accuracy of genomic prediction in aquaculture using simulation. Sci Rep 2021; 11:18318. [PMID: 34526591 PMCID: PMC8443606 DOI: 10.1038/s41598-021-97873-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Accepted: 08/31/2021] [Indexed: 11/08/2022] Open
Abstract
Genotypic errors, conflict between recorded genotype and the true genotype, can lead to false or biased population genetic parameters. Here, the effect of genotypic errors on accuracy of genomic predictions and genomic relationship matrix are investigated using a simulation study based on population and genomic structure comparable to black tiger prawn, Penaeus monodon. Fifty full-sib families across five generations with phenotypic and genotypic information on 53 K SNPs were simulated. Ten replicates of different scenarios with three heritability estimates, equal and unequal family contributions were generated. Within each scenario, four SNP densities and three genotypic error rates in each SNP density were implemented. Results showed that family contribution did not have a substantial impact on accuracy of predictions across different datasets. In the absence of genotypic errors, 3 K SNP density was found to be efficient in estimating the accuracy, whilst increasing the SNP density from 3 to 20 K resulted in a marginal increase in accuracy of genomic predictions using the current population and genomic parameters. In addition, results showed that the presence of even 10% errors in a 10 and 20 K SNP panel might not have a severe impact on accuracy of predictions. However, below 10 K marker density, even a 5% error can result in lower accuracy of predictions.
Collapse
|
12
|
Scheben A, Severn-Ellis AA, Patel D, Pradhan A, Rae SJ, Batley J, Edwards D. Linkage mapping and QTL analysis of flowering time using ddRAD sequencing with genotype error correction in Brassica napus. BMC PLANT BIOLOGY 2020; 20:546. [PMID: 33287721 PMCID: PMC7720618 DOI: 10.1186/s12870-020-02756-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Accepted: 11/25/2020] [Indexed: 05/11/2023]
Abstract
BACKGROUND Brassica napus is an important oilseed crop cultivated worldwide. During domestication and breeding of B. napus, flowering time has been a target of selection because of its substantial impact on yield. Here we use double digest restriction-site associated DNA sequencing (ddRAD) to investigate the genetic basis of flowering in B. napus. An F2 mapping population was derived from a cross between an early-flowering spring type and a late-flowering winter type. RESULTS Flowering time in the mapping population differed by up to 25 days between individuals. High genotype error rates persisted after initial quality controls, as suggested by a genotype discordance of ~ 12% between biological sequencing replicates. After genotype error correction, a linkage map spanning 3981.31 cM and compromising 14,630 single nucleotide polymorphisms (SNPs) was constructed. A quantitative trait locus (QTL) on chromosome C2 was detected, covering eight flowering time genes including FLC. CONCLUSIONS These findings demonstrate the effectiveness of the ddRAD approach to sample the B. napus genome. Our results also suggest that ddRAD genotype error rates can be higher than expected in F2 populations. Quality filtering and genotype correction and imputation can substantially reduce these error rates and allow effective linkage mapping and QTL analysis.
Collapse
Affiliation(s)
- Armin Scheben
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Perth, WA, Australia
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, NY, 11724, USA
| | - Anita A Severn-Ellis
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Perth, WA, Australia
| | - Dhwani Patel
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Perth, WA, Australia
| | - Aneeta Pradhan
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Perth, WA, Australia
| | - Stephen J Rae
- BASF Agricultural Solutions Belgium NV, BASF Innovation Center Gent, Technologiepark-Zwijnaarde 101, 9052, Ghent, Belgium
| | - Jacqueline Batley
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Perth, WA, Australia
| | - David Edwards
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Perth, WA, Australia.
| |
Collapse
|
13
|
Torkamaneh D, Laroche J, Belzile F. Fast-GBS v2.0: an analysis toolkit for genotyping-by-sequencing data. Genome 2020; 63:577-581. [PMID: 33006480 DOI: 10.1139/gen-2020-0077] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Genotyping-by-sequencing (GBS) is a rapid, flexible, low-cost, and robust genotyping method that simultaneously discovers variants and calls genotypes within a broad range of samples. These characteristics make GBS an excellent tool for many applications and research questions from conservation biology to functional genomics in both model and non-model species. Continued improvement of GBS relies on a more comprehensive understanding of data analysis, development of fast and efficient bioinformatics pipelines, accurate missing data imputation, and active post-release support. Here, we present the second generation of Fast-GBS (v2.0) that offers several new options (e.g., processing paired-end reads and imputation of missing data) and features (e.g., summary statistics of genotypes) to improve the GBS data analysis process. The performance assessment analysis showed that Fast-GBS v2.0 outperformed other available analytical pipelines, such as GBS-SNP-CROP and Gb-eaSy. Fast-GBS v2.0 provides an analysis platform that can be run with different types of sequencing data, modest computational resources, and allows for missing-data imputation for various species in different contexts.
Collapse
Affiliation(s)
- Davoud Torkamaneh
- Département de Phytologie, Université Laval, Québec City, QC, Canada.,Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, QC, Canada.,Department of Plant Agriculture, University of Guelph, Guelph, ON, Canada
| | - Jérôme Laroche
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, QC, Canada
| | - François Belzile
- Département de Phytologie, Université Laval, Québec City, QC, Canada.,Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, QC, Canada
| |
Collapse
|
14
|
Ton LB, Neik TX, Batley J. The Use of Genetic and Gene Technologies in Shaping Modern Rapeseed Cultivars ( Brassica napus L.). Genes (Basel) 2020; 11:E1161. [PMID: 33008008 PMCID: PMC7600269 DOI: 10.3390/genes11101161] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 09/27/2020] [Accepted: 09/27/2020] [Indexed: 12/20/2022] Open
Abstract
Since their domestication, Brassica oilseed species have undergone progressive transformation allied with the development of breeding and molecular technologies. The canola (Brassica napus) crop has rapidly expanded globally in the last 30 years with intensive innovations in canola varieties, providing for a wider range of markets apart from the food industry. The breeding efforts of B. napus, the main source of canola oil and canola meal, have been mainly focused on improving seed yield, oil quality, and meal quality along with disease resistance, abiotic stress tolerance, and herbicide resistance. The revolution in genetics and gene technologies, including genetic mapping, molecular markers, genomic tools, and gene technology, especially gene editing tools, has allowed an understanding of the complex genetic makeup and gene functions in the major bioprocesses of the Brassicales, especially Brassica oil crops. Here, we provide an overview on the contributions of these technologies in improving the major traits of B. napus and discuss their potential use to accomplish new improvement targets.
Collapse
Affiliation(s)
- Linh Bao Ton
- School of Biological Science, The University of Western Australia, Perth, WA 6009, Australia;
| | - Ting Xiang Neik
- Sunway College Kuala Lumpur, No. 2, Jalan Universiti, Bandar Sunway, Selangor 47500, Malaysia;
| | - Jacqueline Batley
- School of Biological Science, The University of Western Australia, Perth, WA 6009, Australia;
| |
Collapse
|
15
|
Fikere M, Barbulescu DM, Malmberg MM, Spangenberg GC, Cogan NOI, Daetwyler HD. Meta-analysis of GWAS in canola blackleg (Leptosphaeria maculans) disease traits demonstrates increased power from imputed whole-genome sequence. Sci Rep 2020; 10:14300. [PMID: 32868838 PMCID: PMC7459325 DOI: 10.1038/s41598-020-71274-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Accepted: 08/13/2020] [Indexed: 12/21/2022] Open
Abstract
Blackleg disease causes yield losses in canola (Brassica napus L.). To identify resistance genes and genomic regions, genome-wide association studies (GWAS) of 585 diverse winter and spring canola accessions were performed using imputed whole-genome sequence (WGS) and transcriptome genotype-by-sequencing (GBSt). Blackleg disease phenotypes were collected across three years in six trials. GWAS were performed in several ways and their respective power was judged by the number of significant single nucleotide polymorphisms (SNP), the false discovery rate (FDR), and the percentage of SNP that validated in additional field trials in two subsequent years. WGS GWAS with 1,234,708 million SNP detected a larger number of significant SNP, achieved a lower FDR and a higher validation rate than GBSt with 64,072 SNP. A meta-analysis combining survival and average internal infection resulted in lower FDR but also lower validation rates. The meta-analysis GWAS identified 79 genomic regions (674 SNP) conferring potential resistance to L. maculans. While several GWAS signals localised in regions of known Rlm genes, fifty-three new potential resistance regions were detected. Seventeen regions had underlying genes with putative functions related to disease defence or stress response in Arabidopsis thaliana. This study provides insight into the genetic architecture and potential molecular mechanisms underlying canola L. maculans resistance.
Collapse
Affiliation(s)
- M Fikere
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3086, Australia.,Centre for AgriBioscience, Agriculture Victoria, AgriBio, Bundoora, VIC, 3083, Australia.,Queensland Alliance for Agriculture and Food Innovation (QAAFI), The University of Queensland, Brisbane, QLD, 4072, Australia
| | - D M Barbulescu
- Agriculture Victoria, Grains Innovation Park, Horsham, VIC, 3401, Australia
| | - M M Malmberg
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3086, Australia.,Centre for AgriBioscience, Agriculture Victoria, AgriBio, Bundoora, VIC, 3083, Australia
| | - G C Spangenberg
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3086, Australia.,Centre for AgriBioscience, Agriculture Victoria, AgriBio, Bundoora, VIC, 3083, Australia
| | - N O I Cogan
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3086, Australia.,Centre for AgriBioscience, Agriculture Victoria, AgriBio, Bundoora, VIC, 3083, Australia
| | - H D Daetwyler
- School of Applied Systems Biology, La Trobe University, Bundoora, VIC, 3086, Australia. .,Centre for AgriBioscience, Agriculture Victoria, AgriBio, Bundoora, VIC, 3083, Australia.
| |
Collapse
|
16
|
Yao Z, You FM, N'Diaye A, Knox RE, McCartney C, Hiebert CW, Pozniak C, Xu W. Evaluation of variant calling tools for large plant genome re-sequencing. BMC Bioinformatics 2020; 21:360. [PMID: 32807073 PMCID: PMC7430858 DOI: 10.1186/s12859-020-03704-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Accepted: 07/28/2020] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Discovering single nucleotide polymorphisms (SNPs) from agriculture crop genome sequences has been a widely used strategy for developing genetic markers for several applications including marker-assisted breeding, population diversity studies for eco-geographical adaption, genotyping crop germplasm collections, and others. Accurately detecting SNPs from large polyploid crop genomes such as wheat is crucial and challenging. A few variant calling methods have been previously developed but they show a low concordance between their variant calls. A gold standard of variant sets generated from one human individual sample was established for variant calling tool evaluations, however hitherto no gold standard of crop variant set is available for wheat use. The intent of this study was to evaluate seven SNP variant calling tools (FreeBayes, GATK, Platypus, Samtools/mpileup, SNVer, VarScan, VarDict) with the two most popular mapping tools (BWA-mem and Bowtie2) on wheat whole exome capture (WEC) re-sequencing data from allohexaploid wheat. RESULTS We found the BWA-mem mapping tool had both a higher mapping rate and a higher accuracy rate than Bowtie2. With the same mapping quality (MQ) cutoff, BWA-mem detected more variant bases in mapping reads than Bowtie2. The reads preprocessed with quality trimming or duplicate removal did not significantly affect the final mapping performance in terms of mapped reads. Based on the concordance and receiver operating characteristic (ROC), the Samtools/mpileup variant calling tool with BWA-mem mapping of raw sequence reads outperformed other tests followed by FreeBayes and GATK in terms of specificity and sensitivity. VarDict and VarScan were the poorest performing variant calling tools with the wheat WEC sequence data. CONCLUSION The BWA-mem and Samtools/mpileup pipeline, with no need to preprocess the raw read data before mapping onto the reference genome, was ascertained the optimum for SNP calling for the complex wheat genome re-sequencing. These results also provide useful guidelines for reliable variant identification from deep sequencing of other large polyploid crop genomes.
Collapse
Affiliation(s)
- Zhen Yao
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, 101 Route 100, Morden, Manitoba, R6M 1Y5, Canada
| | - Frank M You
- Ottawa Research and Development Centre, Agriculture and Agri-Food Canada, 960 Carling Avenue, Ottawa, Ontario, K1A 0C6, Canada
| | - Amidou N'Diaye
- Department of Plant Sciences, University of Saskatchewan, Saskatoon, Saskatchewan, S7N 5A8, Canada
| | - Ron E Knox
- Swift Current Research and Development Centre, Agriculture and Agri-Food Canada, Box 1030, Swift Current, Saskatchewan, S9H 3X2, Canada
| | - Curt McCartney
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, 101 Route 100, Morden, Manitoba, R6M 1Y5, Canada
| | - Colin W Hiebert
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, 101 Route 100, Morden, Manitoba, R6M 1Y5, Canada
| | - Curtis Pozniak
- Department of Plant Sciences, University of Saskatchewan, Saskatoon, Saskatchewan, S7N 5A8, Canada
| | - Wayne Xu
- Morden Research and Development Centre, Agriculture and Agri-Food Canada, 101 Route 100, Morden, Manitoba, R6M 1Y5, Canada.
| |
Collapse
|
17
|
Construction of a High-Density Genetic Map and Mapping of Firmness in Grapes ( Vitis vinifera L.) Based on Whole-Genome Resequencing. Int J Mol Sci 2020; 21:ijms21030797. [PMID: 31991832 PMCID: PMC7037167 DOI: 10.3390/ijms21030797] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2020] [Revised: 01/22/2020] [Accepted: 01/22/2020] [Indexed: 12/14/2022] Open
Abstract
Berry firmness is one of the most important quality traits in table grapes. The underlying molecular and genetic mechanisms for berry firmness remain unclear. We constructed a high-density genetic map based on whole-genome resequencing to identify loci associated with berry firmness. The genetic map had 19 linkage groups, including 1662 bin markers (26,039 SNPs), covering 1463.38 cM, and the average inter-marker distance was 0.88 cM. An analysis of berry firmness in the F1 population and both parents for three consecutive years revealed continuous variability in F1, with a distribution close to the normal distribution. Based on the genetic map and phenotypic data, three potentially significant quantitative trait loci (QTLs) related to berry firmness were identified by composite interval mapping. The contribution rate of each QTL ranged from 21.5% to 28.6%. We identified four candidate genes associated with grape firmness, which are related to endoglucanase, abscisic acid (ABA), and transcription factors. A qRT-PCR analysis revealed that the expression of abscisic-aldehyde oxidase-like gene (VIT_18s0041g02410) and endoglucanase 3 gene (VIT_18s0089g00210) in Muscat Hamburg was higher than in Crimson Seedless at the veraison stage, which was consistent with that of parent berry firmness. These results confirmed that VIT_18s0041g02410 and VIT_18s0089g00210 are candidate genes associated with berry firmness.
Collapse
|
18
|
Malmberg MM, Spangenberg GC, Daetwyler HD, Cogan NOI. Assessment of low-coverage nanopore long read sequencing for SNP genotyping in doubled haploid canola (Brassica napus L.). Sci Rep 2019; 9:8688. [PMID: 31213642 PMCID: PMC6582154 DOI: 10.1038/s41598-019-45131-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 05/28/2019] [Indexed: 11/16/2022] Open
Abstract
Despite the high accuracy of short read sequencing (SRS), there are still issues with attaining accurate single nucleotide polymorphism (SNP) genotypes at low sequencing coverage and in highly duplicated genomes due to misalignment. Long read sequencing (LRS) systems, including the Oxford Nanopore Technologies (ONT) minION, have become popular options for de novo genome assembly and structural variant characterisation. The current high error rate often requires substantial post-sequencing correction and would appear to prevent the adoption of this system for SNP genotyping, but nanopore sequencing errors are largely random. Using low coverage ONT minION sequencing for genotyping of pre-validated SNP loci was examined in 9 canola doubled haploids. The minION genotypes were compared to the Illumina sequences to determine the extent and nature of genotype discrepancies between the two systems. The significant increase in read length improved alignment to the genome and the absence of classical SRS biases results in a more even representation of the genome. Sequencing errors are present, primarily in the form of heterozygous genotypes, which can be removed in completely homozygous backgrounds but requires more advanced bioinformatics in heterozygous genomes. Developments in this technology are promising for routine genotyping in the future.
Collapse
Affiliation(s)
- M M Malmberg
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Bundoora, Victoria, 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, 3086, Australia
| | - G C Spangenberg
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Bundoora, Victoria, 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, 3086, Australia
| | - H D Daetwyler
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Bundoora, Victoria, 3083, Australia.,School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, 3086, Australia
| | - N O I Cogan
- Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Bundoora, Victoria, 3083, Australia. .,School of Applied Systems Biology, La Trobe University, Bundoora, Victoria, 3086, Australia.
| |
Collapse
|