1
|
Bilton TP, Sharma SK, Schofield MR, Black MA, Jacobs JME, Bryan GJ, Dodds KG. Construction of relatedness matrices in autopolyploid populations using low-depth high-throughput sequencing data. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2024; 137:64. [PMID: 38430392 PMCID: PMC10908621 DOI: 10.1007/s00122-024-04568-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 01/30/2024] [Indexed: 03/03/2024]
Abstract
KEY MESSAGE An improved estimator of genomic relatedness using low-depth high-throughput sequencing data for autopolyploids is developed. Its outputs strongly correlate with SNP array-based estimates and are available in the package GUSrelate. High-throughput sequencing (HTS) methods have reduced sequencing costs and resources compared to array-based tools, facilitating the investigation of many non-model polyploid species. One important quantity that can be computed from HTS data is the genetic relatedness between all individuals in a population. However, HTS data are often messy, with multiple sources of errors (i.e. sequencing errors or missing parental alleles) which, if not accounted for, can lead to bias in genomic relatedness estimates. We derive a new estimator for constructing a genomic relationship matrix (GRM) from HTS data for autopolyploid species that accounts for errors associated with low sequencing depths, implemented in the R package GUSrelate. Simulations revealed that GUSrelate performed similarly to existing GRM methods at high depth but reduced bias in self-relatedness estimates when the sequencing depth was low. Using a panel consisting of 351 tetraploid potato genotypes, we found that GUSrelate produced GRMs from genotyping-by-sequencing (GBS) data that were highly correlated with a GRM computed from SNP array data, and less biased than existing methods when benchmarking against the array-based GRM estimates. GUSrelate provides researchers with a tool to reliably construct GRMs from low-depth HTS data.
Collapse
Affiliation(s)
- Timothy P Bilton
- AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand.
- Department of Mathematics and Statistics, University of Otago, Dunedin, New Zealand.
| | - Sanjeev Kumar Sharma
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, UK
| | - Matthew R Schofield
- Department of Mathematics and Statistics, University of Otago, Dunedin, New Zealand
| | - Michael A Black
- Department of Biochemistry, University of Otago, Dunedin, New Zealand
| | | | - Glenn J Bryan
- Cell and Molecular Sciences, The James Hutton Institute, Invergowrie, Dundee, UK
| | - Ken G Dodds
- AgResearch, Invermay Agricultural Centre, Mosgiel, New Zealand
| |
Collapse
|
2
|
Boocock J, Alexander N, Tapia LA, Walter-McNeill L, Munugala C, Bloom JS, Kruglyak L. Single-cell eQTL mapping in yeast reveals a tradeoff between growth and reproduction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.07.570640. [PMID: 38106186 PMCID: PMC10723400 DOI: 10.1101/2023.12.07.570640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Expression quantitative trait loci (eQTLs) provide a key bridge between noncoding DNA sequence variants and organismal traits. The effects of eQTLs can differ among tissues, cell types, and cellular states, but these differences are obscured by gene expression measurements in bulk populations. We developed a one-pot approach to map eQTLs in Saccharomyces cerevisiae by single-cell RNA sequencing (scRNA-seq) and applied it to over 100,000 single cells from three crosses. We used scRNA-seq data to genotype each cell, measure gene expression, and classify the cells by cell-cycle stage. We mapped thousands of local and distant eQTLs and identified interactions between eQTL effects and cell-cycle stages. We took advantage of single-cell expression information to identify hundreds of genes with allele-specific effects on expression noise. We used cell-cycle stage classification to map 20 loci that influence cell-cycle progression. One of these loci influenced the expression of genes involved in the mating response. We showed that the effects of this locus arise from a common variant (W82R) in the gene GPA1, which encodes a signaling protein that negatively regulates the mating pathway. The 82R allele increases mating efficiency at the cost of slower cell-cycle progression and is associated with a higher rate of outcrossing in nature. Our results provide a more granular picture of the effects of genetic variants on gene expression and downstream traits.
Collapse
Affiliation(s)
- James Boocock
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Noah Alexander
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Leslie Alamo Tapia
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Laura Walter-McNeill
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Chetan Munugala
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Joshua S Bloom
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Leonid Kruglyak
- Department of Human Genetics, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Biological Chemistry, University of California, Los Angeles, Los Angeles, CA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| |
Collapse
|
3
|
Thérèse Navarro A, Bourke PM, van de Weg E, Clot CR, Arens P, Finkers R, Maliepaard C. Smooth Descent: A ploidy-aware algorithm to improve linkage mapping in the presence of genotyping errors. Front Genet 2023; 14:1049988. [PMID: 36936433 PMCID: PMC10014611 DOI: 10.3389/fgene.2023.1049988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 02/20/2023] [Indexed: 03/08/2023] Open
Abstract
Linkage mapping is an approach to order markers based on recombination events. Mapping algorithms cannot easily handle genotyping errors, which are common in high-throughput genotyping data. To solve this issue, strategies have been developed, aimed mostly at identifying and eliminating these errors. One such strategy is SMOOTH, an iterative algorithm to detect genotyping errors. Unlike other approaches, SMOOTH can also be used to impute the most probable alternative genotypes, but its application is limited to diploid species and to markers heterozygous in only one of the parents. In this study we adapted SMOOTH to expand its use to any marker type and to autopolyploids with the use of identity-by-descent probabilities, naming the updated algorithm Smooth Descent (SD). We applied SD to real and simulated data, showing that in the presence of genotyping errors this method produces better genetic maps in terms of marker order and map length. SD is particularly useful for error rates between 5% and 20% and when error rates are not homogeneous among markers or individuals. With a starting error rate of 10%, SD reduced it to ∼5% in diploids, ∼7% in tetraploids and ∼8.5% in hexaploids. Conversely, the correlation between true and estimated genetic maps increased by 0.03 in tetraploids and by 0.2 in hexaploids, while worsening slightly in diploids (∼0.0011). We also show that the combination of genotype curation and map re-estimation allowed us to obtain better genetic maps while correcting wrong genotypes. We have implemented this algorithm in the R package Smooth Descent.
Collapse
|
4
|
Lázaro-Guevara JM, Flores-Robles BJ, Garrido-Lopez KM, McKeown RJ, Flores-Morán AE, Labrador-Sánchez E, Pinillos-Aransay V, Trasahedo EA, López-Martín JA, Soberanis LSR, Melgar MY, Téllez-Arreola JL, Thébault SC. Identification of RP1 as the genetic cause of retinitis pigmentosa in a multi-generational pedigree using Extremely Low-Coverage Whole Genome Sequencing (XLC-WGS). Gene X 2023; 851:146956. [DOI: 10.1016/j.gene.2022.146956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2022] [Revised: 09/25/2022] [Accepted: 10/03/2022] [Indexed: 11/04/2022] Open
|
5
|
Taniguti CH, Taniguti LM, Amadeu RR, Lau J, Gesteira GDS, Oliveira TDP, Ferreira GC, Pereira GDS, Byrne D, Mollinari M, Riera-Lizarazu O, Garcia AAF. Developing best practices for genotyping-by-sequencing analysis in the construction of linkage maps. Gigascience 2022; 12:giad092. [PMID: 37889010 PMCID: PMC10603770 DOI: 10.1093/gigascience/giad092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 06/27/2023] [Accepted: 10/06/2023] [Indexed: 10/28/2023] Open
Abstract
BACKGROUND Genotyping-by-sequencing (GBS) provides affordable methods for genotyping hundreds of individuals using millions of markers. However, this challenges bioinformatic procedures that must overcome possible artifacts such as the bias generated by polymerase chain reaction duplicates and sequencing errors. Genotyping errors lead to data that deviate from what is expected from regular meiosis. This, in turn, leads to difficulties in grouping and ordering markers, resulting in inflated and incorrect linkage maps. Therefore, genotyping errors can be easily detected by linkage map quality evaluations. RESULTS We developed and used the Reads2Map workflow to build linkage maps with simulated and empirical GBS data of diploid outcrossing populations. The workflows run GATK, Stacks, TASSEL, and Freebayes for single-nucleotide polymorphism calling and updog, polyRAD, and SuperMASSA for genotype calling, as well as OneMap and GUSMap to build linkage maps. Using simulated data, we observed which genotype call software fails in identifying common errors in GBS sequencing data and proposed specific filters to better handle them. We tested whether it is possible to overcome errors in a linkage map using genotype probabilities from each software or global error rates to estimate genetic distances with an updated version of OneMap. We also evaluated the impact of segregation distortion, contaminant samples, and haplotype-based multiallelic markers in the final linkage maps. Through our evaluations, we observed that some of the approaches produce different results depending on the dataset (dataset dependent) and others produce consistent advantageous results among them (dataset independent). CONCLUSIONS We set as default in the Reads2Map workflows the approaches that showed to be dataset independent for GBS datasets according to our results. This reduces the number of required tests to identify optimal pipelines and parameters for other empirical datasets. Using Reads2Map, users can select the pipeline and parameters that best fit their data context. The Reads2MapApp shiny app provides a graphical representation of the results to facilitate their interpretation.
Collapse
Affiliation(s)
- Cristiane Hayumi Taniguti
- Department of Genetics, University of São Paulo, São Paulo 13418-900, Brazil
- Department of Horticultural Sciences, Texas A&M University, College Station, TX 77843-0001, USA
| | - Lucas Mitsuo Taniguti
- Department of Genetics, University of São Paulo, São Paulo 13418-900, Brazil
- Mendelics Genomic Analysis, São Paulo 02511-000, Brazil
| | | | - Jeekin Lau
- Department of Horticultural Sciences, Texas A&M University, College Station, TX 77843-0001, USA
| | - Gabriel de Siqueira Gesteira
- Department of Genetics, University of São Paulo, São Paulo 13418-900, Brazil
- Bioinformatics Research Center, Department of Horticultural Sciences, North Carolina State University, Raleigh, NC 27695-7566, USA
| | | | | | | | - David Byrne
- Department of Horticultural Sciences, Texas A&M University, College Station, TX 77843-0001, USA
| | - Marcelo Mollinari
- Bioinformatics Research Center, Department of Horticultural Sciences, North Carolina State University, Raleigh, NC 27695-7566, USA
| | - Oscar Riera-Lizarazu
- Department of Horticultural Sciences, Texas A&M University, College Station, TX 77843-0001, USA
| | | |
Collapse
|
6
|
Nguyen Ba AN, Lawrence KR, Rego-Costa A, Gopalakrishnan S, Temko D, Michor F, Desai MM. Barcoded Bulk QTL mapping reveals highly polygenic and epistatic architecture of complex traits in yeast. eLife 2022; 11:73983. [PMID: 35147078 PMCID: PMC8979589 DOI: 10.7554/elife.73983] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 02/11/2022] [Indexed: 11/25/2022] Open
Abstract
Mapping the genetic basis of complex traits is critical to uncovering the biological mechanisms that underlie disease and other phenotypes. Genome-wide association studies (GWAS) in humans and quantitative trait locus (QTL) mapping in model organisms can now explain much of the observed heritability in many traits, allowing us to predict phenotype from genotype. However, constraints on power due to statistical confounders in large GWAS and smaller sample sizes in QTL studies still limit our ability to resolve numerous small-effect variants, map them to causal genes, identify pleiotropic effects across multiple traits, and infer non-additive interactions between loci (epistasis). Here, we introduce barcoded bulk quantitative trait locus (BB-QTL) mapping, which allows us to construct, genotype, and phenotype 100,000 offspring of a budding yeast cross, two orders of magnitude larger than the previous state of the art. We use this panel to map the genetic basis of eighteen complex traits, finding that the genetic architecture of these traits involves hundreds of small-effect loci densely spaced throughout the genome, many with widespread pleiotropic effects across multiple traits. Epistasis plays a central role, with thousands of interactions that provide insight into genetic networks. By dramatically increasing sample size, BB-QTL mapping demonstrates the potential of natural variants in high-powered QTL studies to reveal the highly polygenic, pleiotropic, and epistatic architecture of complex traits.
Collapse
Affiliation(s)
- Alex N Nguyen Ba
- Department of Organismic and Evolutionary Biology, Harvard University
| | | | - Artur Rego-Costa
- Department of Organismic and Evolutionary Biology, Harvard University
| | | | | | | | - Michael M Desai
- Department of Organismic and Evolutionary Biology, Harvard University
| |
Collapse
|
7
|
Olsen KM, Goad DM, Wright SJ, Dutta ML, Myers SR, Small LL, Li LF. Dual-species origin of an adaptive chemical defense polymorphism. THE NEW PHYTOLOGIST 2021; 232:1477-1487. [PMID: 34320221 DOI: 10.1111/nph.17654] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Accepted: 07/22/2021] [Indexed: 06/13/2023]
Abstract
Allopolyploid speciation and chemical defense diversification are two of the most characteristic features of plant evolution; although the former has likely shaped the latter, this has rarely been documented. Here we document allopolyploidy-mediated chemical defense evolution in the origin of cyanogenesis (HCN release upon tissue damage) in white clover (Trifolium repens). We combined linkage mapping of the loci that control cyanogenesis (Ac, controlling production of cyanogenic glucosides; and Li, controlling production of their hydrolyzing enzyme linamarase) with genome sequence comparisons between white clover, a recently evolved allotetraploid, and its diploid progenitors (Trifolium pallescens, Trifolium occidentale). The Ac locus (a three-gene cluster comprising the cyanogenic glucoside pathway) is derived from T. occidentale; it maps to linkage group 2O (occidentale subgenome) and is orthologous to a highly similar cluster in the T. occidentale reference genome. By contrast, Li maps to linkage group 4P (pallescens subgenome), indicating an origin in the other progenitor species. These results indicate that cyanogenesis evolved in white clover as a product of the interspecific hybridization that created the species. This allopolyploidization-derived chemical defense, together with subsequent selection on intraspecific cyanogenesis variation, appears to have contributed to white clover's ecological success as a globally distributed weed species.
Collapse
Affiliation(s)
- Kenneth M Olsen
- Department of Biology, Washington University in St. Louis, 1 Brookings Dr., St Louis, MO, 63130, USA
| | - David M Goad
- Department of Biology, Washington University in St. Louis, 1 Brookings Dr., St Louis, MO, 63130, USA
| | - Sara J Wright
- Department of Biology, Washington University in St. Louis, 1 Brookings Dr., St Louis, MO, 63130, USA
- Biological Sciences Department, Rowan University, Glassboro, NJ, 08028, USA
| | - Maya L Dutta
- Department of Biology, Washington University in St. Louis, 1 Brookings Dr., St Louis, MO, 63130, USA
| | - Samantha R Myers
- Department of Biology, Washington University in St. Louis, 1 Brookings Dr., St Louis, MO, 63130, USA
| | - Linda L Small
- Department of Biology, Washington University in St. Louis, 1 Brookings Dr., St Louis, MO, 63130, USA
| | - Lin-Feng Li
- Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China
| |
Collapse
|
8
|
Souza VFD, Pereira GDS, Pastina MM, Parrella RADC, Simeone MLF, Barros BDA, Noda RW, da Costa e Silva L, Magalhães JVD, Schaffert RE, Garcia AAF, Damasceno CMB. QTL mapping for bioenergy traits in sweet sorghum recombinant inbred lines. G3 GENES|GENOMES|GENETICS 2021; 11:6370150. [PMID: 34519766 PMCID: PMC8527507 DOI: 10.1093/g3journal/jkab314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 08/26/2021] [Indexed: 11/13/2022]
Abstract
Abstract
During the past decade, sweet sorghum (Sorghum bicolor Moench L.) has shown great potential for bioenergy production, especially biofuels. In this study, 223 recombinant inbred lines (RILs) derived from a cross between two sweet sorghum lines (Brandes × Wray) were evaluated in three trials. Single-nucleotide polymorphisms (SNPs) derived from genotyping by sequencing of 272 RILs were used to build a high-density genetic map comprising 3,767 SNPs spanning 1,368.83 cM. Multitrait multiple interval mapping (MT-MIM) was carried out to map quantitative trait loci (QTL) for eight bioenergy traits. A total of 33 QTLs were identified for flowering time, plant height, total soluble solids and sucrose (five QTLs each), fibers (four QTLs), and fresh biomass yield, juice extraction yield, and reducing sugars (three QTLs each). QTL hotspots were found on chromosomes 1, 3, 6, 9, and 10, in addition to other QTLs detected on chromosomes 4 and 8. We observed that 14 out of the 33 mapped QTLs were found in all three trials. Upon further development and validation in other crosses, the results provided by the present study have a great potential to be used in marker-assisted selection in sorghum breeding programs for biofuel production.
Collapse
Affiliation(s)
| | - Guilherme da Silva Pereira
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, SP, 13418-900, Brazil
| | | | | | | | | | | | | | | | | | - Antonio Augusto Franco Garcia
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo, Piracicaba, SP, 13418-900, Brazil
| | | |
Collapse
|
9
|
Wright SJ, Goad DM, Gross BL, Muñoz PR, Olsen KM. Genetic trade-offs underlie divergent life history strategies for local adaptation in white clover. Mol Ecol 2021; 31:3742-3760. [PMID: 34532899 DOI: 10.1111/mec.16180] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 08/25/2021] [Accepted: 09/02/2021] [Indexed: 01/26/2023]
Abstract
Local adaptation is common in plants, yet characterization of its underlying genetic basis is rare in herbaceous perennials. Moreover, while many plant species exhibit intraspecific chemical defence polymorphisms, their importance for local adaptation remains poorly understood. We examined the genetic architecture of local adaptation in a perennial, obligately-outcrossing herbaceous legume, white clover (Trifolium repens). This widespread species displays a well-studied chemical defence polymorphism for cyanogenesis (HCN release following tissue damage) and has evolved climate-associated cyanogenesis clines throughout its range. Two biparental F2 mapping populations, derived from three parents collected in environments spanning the U.S. latitudinal species range (Duluth, MN, St. Louis, MO and Gainesville, FL), were grown in triplicate for two years in reciprocal common garden experiments in the parental environments (6,012 total plants). Vegetative growth and reproductive fitness traits displayed trade-offs across reciprocal environments, indicating local adaptation. Genetic mapping of fitness traits revealed a genetic architecture characterized by allelic trade-offs between environments, with 100% and 80% of fitness QTL in the two mapping populations showing significant QTL×E interactions, consistent with antagonistic pleiotropy. Across the genome there were three hotspots of QTL colocalization. Unexpectedly, we found little evidence that the cyanogenesis polymorphism contributes to local adaptation. Instead, divergent life history strategies in reciprocal environments were major fitness determinants: selection favoured early investment in flowering at the cost of multiyear survival in the southernmost site versus delayed flowering and multiyear persistence in the northern environments. Our findings demonstrate that multilocus genetic trade-offs contribute to contrasting life history characteristics that allow for local adaptation in this outcrossing herbaceous perennial.
Collapse
Affiliation(s)
- Sara J Wright
- Department of Biology, Washington University, St. Louis, Missouri, USA
| | - David M Goad
- Department of Biology, Washington University, St. Louis, Missouri, USA
| | - Briana L Gross
- Biology Department, University of Minnesota-Duluth, Duluth, Minnesota, USA
| | - Patricio R Muñoz
- Horticultural Science Department, University of Florida, Gainesville, Florida, USA
| | - Kenneth M Olsen
- Department of Biology, Washington University, St. Louis, Missouri, USA
| |
Collapse
|
10
|
Liao Y, Voorrips RE, Bourke PM, Tumino G, Arens P, Visser RGF, Smulders MJM, Maliepaard C. Using probabilistic genotypes in linkage analysis of polyploids. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2021; 134:2443-2457. [PMID: 34032878 PMCID: PMC8277618 DOI: 10.1007/s00122-021-03834-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Accepted: 04/10/2021] [Indexed: 05/21/2023]
Abstract
In polyploids, linkage mapping is carried out using genotyping with discrete dosage scores. Here, we use probabilistic genotypes and we validate it for the construction of polyploid linkage maps. Marker genotypes are generally called as discrete values: homozygous versus heterozygous in the case of diploids, or an integer allele dosage in the case of polyploids. Software for linkage map construction and/or QTL analysis usually relies on such discrete genotypes. However, it may not always be possible, or desirable, to assign definite values to genotype observations in the presence of uncertainty in the genotype calling. Here, we present an approach that uses probabilistic marker dosages for linkage map construction in polyploids. We compare our method to an approach based on discrete dosages, using simulated SNP array and sequence reads data with varying levels of data quality. We validate our approach using experimental data from a potato (Solanum tuberosum L.) SNP array applied to an F1 mapping population. In comparison to the approach based on discrete dosages, we mapped an additional 562 markers. All but three of these were mapped to the expected chromosome and marker position. For the remaining three markers, no physical position was known. The use of dosage probabilities is of particular relevance for map construction in polyploids using sequencing data, as these often result in a higher level of uncertainty regarding allele dosage.
Collapse
Affiliation(s)
- Yanlin Liao
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Roeland E Voorrips
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Peter M Bourke
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Giorgio Tumino
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Paul Arens
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Richard G F Visser
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Marinus J M Smulders
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands
| | - Chris Maliepaard
- Wageningen University and Research Plant Breeding, P.O. Box 386, Wageningen, AJ, 6700, The Netherlands.
| |
Collapse
|
11
|
Popowski E, Thomson SJ, Knäbel M, Tahir J, Crowhurst RN, Davy M, Foster TM, Schaffer RJ, Tustin DS, Allan AC, McCallum J, Chagné D. Construction of a high density genetic map for hexaploid kiwifruit (Actinidia chinensis var. deliciosa) using genotyping by sequencing. G3-GENES GENOMES GENETICS 2021; 11:6261761. [PMID: 34009255 PMCID: PMC8495948 DOI: 10.1093/g3journal/jkab142] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Accepted: 03/07/2021] [Indexed: 11/19/2022]
Abstract
Commercially grown kiwifruit (genus Actinidia) are generally of two sub-species which have a base haploid genome of 29 chromosomes. The yellow-fleshed Actinidia chinensis var. chinensis, is either diploid (2n = 2x = 58) or tetraploid (2n = 4x = 116) and the green-fleshed cultivar A. chinensis var. deliciosa “Hayward,” is hexaploid (2n = 6x = 174). Advances in breeding green kiwifruit could be greatly sped up by the use of molecular resources for more efficient and faster selection, for example using marker-assisted selection (MAS). The key genetic marker that has been implemented for MAS in hexaploid kiwifruit is for gender testing. The limited marker-trait association has been reported for other polyploid kiwifruit for fruit and production traits. We have constructed a high-density linkage map for hexaploid green kiwifruit using genotyping-by-sequence (GBS). The linkage map obtained consists of 3686 and 3940 markers organized in 183 and 176 linkage groups for the female and male parents, respectively. Both parental linkage maps are co-linear with the A. chinensis “Red5” reference genome of kiwifruit. The linkage map was then used for quantitative trait locus (QTL) mapping, and successfully identified QTLs for king flower number, fruit number and weight, dry matter accumulation, and storage firmness. These are the first QTLs to be reported and discovered for complex traits in hexaploid kiwifruit.
Collapse
Affiliation(s)
- Elizabeth Popowski
- The New Zealand Institute for Plant and Food Research Ltd (Plant & Food Research), Te Puke, New Zealand
| | | | | | | | | | - Marcus Davy
- The New Zealand Institute for Plant and Food Research Ltd (Plant & Food Research), Te Puke, New Zealand
| | | | - Robert J Schaffer
- Plant & Food Research, Motueka, New Zealand.,School of Biological Sciences, University of Auckland, Private Bag 92019, Auckland 1142, New Zealand
| | | | - Andrew C Allan
- Plant & Food Research, Auckland, New Zealand.,School of Biological Sciences, University of Auckland, Private Bag 92019, Auckland 1142, New Zealand
| | | | - David Chagné
- Plant & Food Research, Palmerston North, New Zealand
| |
Collapse
|
12
|
Pappas F, Palaiokostas C. Genotyping Strategies Using ddRAD Sequencing in Farmed Arctic Charr ( Salvelinus alpinus). Animals (Basel) 2021; 11:899. [PMID: 33801139 PMCID: PMC8004150 DOI: 10.3390/ani11030899] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Revised: 03/13/2021] [Accepted: 03/16/2021] [Indexed: 12/17/2022] Open
Abstract
Incorporation of genomic technologies into fish breeding programs is a modern reality, promising substantial advances regarding the accuracy of selection, monitoring the genetic diversity and pedigree record verification. Single nucleotide polymorphism (SNP) arrays are the most commonly used genomic tool, but the investments required make them unsustainable for emerging species, such as Arctic charr (Salvelinus alpinus), where production volume is low. The requirement to genotype a large number of animals for breeding practices necessitates cost effective genotyping approaches. In the current study, we used double digest restriction site-associated DNA (ddRAD) sequencing of either high or low coverage to genotype Arctic charr from the Swedish national breeding program and performed analytical procedures to assess their utility in a range of tasks. SNPs were identified and used for deciphering the genetic structure of the studied population, estimating genomic relationships and implementing an association study for growth-related traits. Missing information and underestimation of heterozygosity in the low coverage set were limiting factors in genetic diversity and genomic relationship analyses, where high coverage performed notably better. On the other hand, the high coverage dataset proved to be valuable when it comes to identifying loci that are associated with phenotypic traits of interest. In general, both genotyping strategies offer sustainable alternatives to hybridization-based genotyping platforms and show potential for applications in aquaculture selective breeding.
Collapse
Affiliation(s)
| | - Christos Palaiokostas
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, P.O. Box 7090, 750 07 Uppsala, Sweden;
| |
Collapse
|
13
|
Ben-David E, Boocock J, Guo L, Zdraljevic S, Bloom JS, Kruglyak L. Whole-organism eQTL mapping at cellular resolution with single-cell sequencing. eLife 2021; 10:e65857. [PMID: 33734084 PMCID: PMC8062134 DOI: 10.7554/elife.65857] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 03/17/2021] [Indexed: 12/11/2022] Open
Abstract
Genetic regulation of gene expression underlies variation in disease risk and other complex traits. The effect of expression quantitative trait loci (eQTLs) varies across cell types; however, the complexity of mammalian tissues makes studying cell-type eQTLs highly challenging. We developed a novel approach in the model nematode Caenorhabditis elegans that uses single-cell RNA sequencing to map eQTLs at cellular resolution in a single one-pot experiment. We mapped eQTLs across cell types in an extremely large population of genetically distinct C. elegans individuals. We found cell-type-specific trans eQTL hotspots that affect the expression of core pathways in the relevant cell types. Finally, we found single-cell-specific eQTL effects in the nervous system, including an eQTL with opposite effects in two individual neurons. Our results show that eQTL effects can be specific down to the level of single cells.
Collapse
Affiliation(s)
- Eyal Ben-David
- Department of Human Genetics, Department of Biological Chemistry, and Howard Hughes Medical Institute, University of California, Los AngelesLos AngelesUnited States
- Department of Biochemistry and Molecular Biology, Institute for Medical Research Israel-Canada, The Hebrew University School of MedicineJerusalemIsrael
| | - James Boocock
- Department of Human Genetics, Department of Biological Chemistry, and Howard Hughes Medical Institute, University of California, Los AngelesLos AngelesUnited States
| | - Longhua Guo
- Department of Human Genetics, Department of Biological Chemistry, and Howard Hughes Medical Institute, University of California, Los AngelesLos AngelesUnited States
| | - Stefan Zdraljevic
- Department of Human Genetics, Department of Biological Chemistry, and Howard Hughes Medical Institute, University of California, Los AngelesLos AngelesUnited States
| | - Joshua S Bloom
- Department of Human Genetics, Department of Biological Chemistry, and Howard Hughes Medical Institute, University of California, Los AngelesLos AngelesUnited States
| | - Leonid Kruglyak
- Department of Human Genetics, Department of Biological Chemistry, and Howard Hughes Medical Institute, University of California, Los AngelesLos AngelesUnited States
| |
Collapse
|
14
|
Zhang J, Kadri NK, Mullaart E, Spelman R, Fritz S, Boichard D, Charlier C, Georges M, Druet T. Genetic architecture of individual variation in recombination rate on the X chromosome in cattle. Heredity (Edinb) 2020; 125:304-316. [PMID: 32651548 DOI: 10.1038/s41437-020-0341-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Revised: 07/01/2020] [Accepted: 07/01/2020] [Indexed: 12/26/2022] Open
Abstract
Meiotic recombination is an essential biological process that ensures proper chromosome segregation and creates genetic diversity. Individual variation in global recombination rates has been shown to be heritable in several species, and variants significantly associated with this trait have been identified. Recombination on the sex chromosome has often been ignored in these studies although this trait may be particularly interesting as it may correspond to a biological process distinct from that on autosomes. For instance, recombination in males is restricted to the pseudo-autosomal region (PAR). We herein used a large cattle pedigree with more than 100,000 genotyped animals to improve the genetic map of the X chromosome and to study the genetic architecture of individual variation in recombination rate on the sex chromosome (XRR). The length of the genetic map was 46.4 and 121.2 cM in males and females, respectively, but the recombination rate in the PAR was six times higher in males. The heritability of CO counts on the X chromosome was comparable to that of autosomes in males (0.011) but larger than that of autosomes in females (0.024). XRR was highly correlated (0.76) with global recombination rate (GRR) in females, suggesting that both traits might be governed by shared variants. In agreement, a set of eleven previously identified variants associated with GRR had correlated effects on female XRR (0.86). In males, XRR and GRR appeared to be distinct traits, although more accurate CO counts on the PAR would be valuable to confirm these results.
Collapse
Affiliation(s)
- Junjie Zhang
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, Liège, Belgium
| | - Naveen Kumar Kadri
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, Liège, Belgium.,Animal Genomics, Institute of Agricultural Science, ETH Zürich, Zürich, Switzerland
| | | | | | - Sébastien Fritz
- Université Paris-Saclay, INRAE, AgroParisTech, GABI, Jouy-en-Josas, France.,Allice, Paris, France
| | | | - Carole Charlier
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, Liège, Belgium
| | - Michel Georges
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, Liège, Belgium
| | - Tom Druet
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, Liège, Belgium.
| |
Collapse
|
15
|
Palaiokostas C, Clarke SM, Jeuthe H, Brauning R, Bilton TP, Dodds KG, McEwan JC, De Koning DJ. Application of Low Coverage Genotyping by Sequencing in Selectively Bred Arctic Charr ( Salvelinus alpinus). G3 (BETHESDA, MD.) 2020; 10:2069-2078. [PMID: 32312839 PMCID: PMC7263669 DOI: 10.1534/g3.120.401295] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 04/16/2020] [Indexed: 12/12/2022]
Abstract
Arctic charr (Salvelinus alpinus) is a species of high economic value for the aquaculture industry, and of high ecological value due to its Holarctic distribution in both marine and freshwater environments. Novel genome sequencing approaches enable the study of population and quantitative genetic parameters even on species with limited or no prior genomic resources. Low coverage genotyping by sequencing (GBS) was applied in a selected strain of Arctic charr in Sweden originating from a landlocked freshwater population. For the needs of the current study, animals from year classes 2013 (171 animals, parental population) and 2017 (759 animals; 13 full sib families) were used as a template for identifying genome wide single nucleotide polymorphisms (SNPs). GBS libraries were constructed using the PstI and MspI restriction enzymes. Approximately 14.5K SNPs passed quality control and were used for estimating a genomic relationship matrix. Thereafter a wide range of analyses were conducted in order to gain insights regarding genetic diversity and investigate the efficiency of the genomic information for parentage assignment and breeding value estimation. Heterozygosity estimates for both year classes suggested a slight excess of heterozygotes. Furthermore, FST estimates among the families of year class 2017 ranged between 0.009 - 0.066. Principal components analysis (PCA) and discriminant analysis of principal components (DAPC) were applied aiming to identify the existence of genetic clusters among the studied population. Results obtained were in accordance with pedigree records allowing the identification of individual families. Additionally, DNA parentage verification was performed, with results in accordance with the pedigree records with the exception of a putative dam where full sib genotypes suggested a potential recording error. Breeding value estimation for juvenile growth through the usage of the estimated genomic relationship matrix clearly outperformed the pedigree equivalent in terms of prediction accuracy (0.51 opposed to 0.31). Overall, low coverage GBS has proven to be a cost-effective genotyping platform that is expected to boost the selection efficiency of the Arctic charr breeding program.
Collapse
Affiliation(s)
- Christos Palaiokostas
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Box 7090, 750 07 Uppsala, Sweden,
| | - Shannon M Clarke
- Invermay Agricultural Centre, AgResearch, Private Bag 50034, Mosgiel 9053, New Zealand
| | - Henrik Jeuthe
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Box 7090, 750 07 Uppsala, Sweden
- Aquaculture Center North, Åvägen 17, 844 61 Kälarne, Sweden, and
| | - Rudiger Brauning
- Invermay Agricultural Centre, AgResearch, Private Bag 50034, Mosgiel 9053, New Zealand
| | - Timothy P Bilton
- Invermay Agricultural Centre, AgResearch, Private Bag 50034, Mosgiel 9053, New Zealand
- Department of Mathematics and Statistics, University of Otago, Dunedin 9054, New Zealand
| | - Ken G Dodds
- Invermay Agricultural Centre, AgResearch, Private Bag 50034, Mosgiel 9053, New Zealand
| | - John C McEwan
- Invermay Agricultural Centre, AgResearch, Private Bag 50034, Mosgiel 9053, New Zealand
| | - Dirk-Jan De Koning
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, Box 7090, 750 07 Uppsala, Sweden
| |
Collapse
|
16
|
Ostevik KL, Samuk K, Rieseberg LH. Ancestral Reconstruction of Karyotypes Reveals an Exceptional Rate of Nonrandom Chromosomal Evolution in Sunflower. Genetics 2020; 214:1031-1045. [PMID: 32033968 PMCID: PMC7153943 DOI: 10.1534/genetics.120.303026] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Accepted: 02/03/2020] [Indexed: 12/20/2022] Open
Abstract
Mapping the chromosomal rearrangements between species can inform our understanding of genome evolution, reproductive isolation, and speciation. Here, we present a novel algorithm for identifying regions of synteny in pairs of genetic maps, which is implemented in the accompanying R package syntR. The syntR algorithm performs as well as previous ad hoc methods while being systematic, repeatable, and applicable to mapping chromosomal rearrangements in any group of species. In addition, we present a systematic survey of chromosomal rearrangements in the annual sunflowers, which is a group known for extreme karyotypic diversity. We build high-density genetic maps for two subspecies of the prairie sunflower, Helianthus petiolaris ssp. petiolaris and H. petiolaris ssp. fallax Using syntR, we identify blocks of synteny between these two subspecies and previously published high-density genetic maps. We reconstruct ancestral karyotypes for annual sunflowers using those synteny blocks and conservatively estimate that there have been 7.9 chromosomal rearrangements per million years, a high rate of chromosomal evolution. Although the rate of inversion is even higher than the rate of translocation in this group, we further find that every extant karyotype is distinguished by between one and three translocations involving only 8 of the 17 chromosomes. This nonrandom exchange suggests that specific chromosomes are prone to translocation and may thus contribute disproportionately to widespread hybrid sterility in sunflowers. These data deepen our understanding of chromosome evolution and confirm that Helianthus has an exceptional rate of chromosomal rearrangement that may facilitate similarly rapid diversification.
Collapse
Affiliation(s)
- Kate L Ostevik
- Department of Biology, Duke University, Durham, North Carolina 27701
- Department of Botany, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada
| | - Kieran Samuk
- Department of Biology, Duke University, Durham, North Carolina 27701
| | - Loren H Rieseberg
- Department of Botany, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada
| |
Collapse
|
17
|
Mollinari M, Olukolu BA, Pereira GDS, Khan A, Gemenet D, Yencho GC, Zeng ZB. Unraveling the Hexaploid Sweetpotato Inheritance Using Ultra-Dense Multilocus Mapping. G3 (BETHESDA, MD.) 2020; 10:281-292. [PMID: 31732504 PMCID: PMC6945028 DOI: 10.1534/g3.119.400620] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Accepted: 11/12/2019] [Indexed: 12/18/2022]
Abstract
The hexaploid sweetpotato (Ipomoea batatas (L.) Lam., 2n = 6x = 90) is an important staple food crop worldwide and plays a vital role in alleviating famine in developing countries. Due to its high ploidy level, genetic studies in sweetpotato lag behind major diploid crops significantly. We built an ultra-dense multilocus integrated genetic map and characterized the inheritance system in a sweetpotato full-sib family using our newly developed software, MAPpoly. The resulting genetic map revealed 96.5% collinearity between I. batatas and its diploid relative I. trifida We computed the genotypic probabilities across the whole genome for all individuals in the mapping population and inferred their complete hexaploid haplotypes. We provide evidence that most of the meiotic configurations (73.3%) were resolved in bivalents, although a small portion of multivalent signatures (15.7%), among other inconclusive configurations (11.0%), were also observed. Except for low levels of preferential pairing in linkage group 2, we observed a hexasomic inheritance mechanism in all linkage groups. We propose that the hexasomic-bivalent inheritance promotes stability to the allelic transmission in sweetpotato.
Collapse
Affiliation(s)
- Marcelo Mollinari
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina,
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| | - Bode A Olukolu
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, Tennessee
| | - Guilherme da S Pereira
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| | - Awais Khan
- Plant Pathology and Plant-Microbe Biology Section, Cornell University, Geneva, New York, and
| | - Dorcus Gemenet
- International Potato Center, ILRI Campus, Nairobi, Kenya
| | - G Craig Yencho
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| | - Zhao-Bang Zeng
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| |
Collapse
|
18
|
Mollinari M, Olukolu BA, Pereira GDS, Khan A, Gemenet D, Yencho GC, Zeng ZB. Unraveling the Hexaploid Sweetpotato Inheritance Using Ultra-Dense Multilocus Mapping. G3 (BETHESDA, MD.) 2020. [PMID: 31732504 DOI: 10.25387/g3.10255844] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
Abstract
The hexaploid sweetpotato (Ipomoea batatas (L.) Lam., 2n = 6x = 90) is an important staple food crop worldwide and plays a vital role in alleviating famine in developing countries. Due to its high ploidy level, genetic studies in sweetpotato lag behind major diploid crops significantly. We built an ultra-dense multilocus integrated genetic map and characterized the inheritance system in a sweetpotato full-sib family using our newly developed software, MAPpoly. The resulting genetic map revealed 96.5% collinearity between I. batatas and its diploid relative I. trifida We computed the genotypic probabilities across the whole genome for all individuals in the mapping population and inferred their complete hexaploid haplotypes. We provide evidence that most of the meiotic configurations (73.3%) were resolved in bivalents, although a small portion of multivalent signatures (15.7%), among other inconclusive configurations (11.0%), were also observed. Except for low levels of preferential pairing in linkage group 2, we observed a hexasomic inheritance mechanism in all linkage groups. We propose that the hexasomic-bivalent inheritance promotes stability to the allelic transmission in sweetpotato.
Collapse
Affiliation(s)
- Marcelo Mollinari
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina,
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| | - Bode A Olukolu
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, Tennessee
| | - Guilherme da S Pereira
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| | - Awais Khan
- Plant Pathology and Plant-Microbe Biology Section, Cornell University, Geneva, New York, and
| | - Dorcus Gemenet
- International Potato Center, ILRI Campus, Nairobi, Kenya
| | - G Craig Yencho
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| | - Zhao-Bang Zeng
- Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina
- Department of Horticultural Science, North Carolina State University, Raleigh, North Carolina
| |
Collapse
|
19
|
Matias FI, Xavier Meireles KG, Nagamatsu ST, Lima Barrios SC, Borges do Valle C, Carazzolle MF, Fritsche-Neto R, Endelman JB. Expected Genotype Quality and Diploidized Marker Data from Genotyping-by-Sequencing of Urochloa spp. Tetraploids. THE PLANT GENOME 2019; 12:1-9. [PMID: 33016594 DOI: 10.3835/plantgenome2019.01.0002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/04/2019] [Accepted: 04/29/2019] [Indexed: 05/08/2023]
Abstract
Introduced concept of expected genotype quality (EGQ) and software to calculate it Provided read depth guidelines for GBS in tetraploids Developed software to generate diploidized genotype calls from VCF files Demonstrated value of aligning GBS reads to a mock reference genome for SNP discovery Recommend filtering based on GQ and read depth to prevent genotype bias Although genotyping-by-sequencing (GBS) is a well-established marker technology in diploids, the development of best practices for tetraploid species is a topic of current research. We determined the theoretical relationship between read depth and the phred-scaled probability of genotype misclassification conditioned on the true genotype, which we call expected genotype quality (EGQ). If the GBS method has 0.5% allelic error, then 17 reads are needed to classify simplex tetraploids as heterozygous with 95% accuracy (EGQ = 13) vs. 61 reads to determine allele dosage. We developed an R script to convert tetraploid GBS data in variant call format (VCF) into diploidized genotype calls and applied it to 267 interspecific hybrids of the tetraploid forage grass Urochloa. When reads were aligned to a mock reference genome created from GBS data of the Urochloa brizantha (Hochst. ex A. Rich.) R. D. Webster cultivar Marandu, 25,678 biallelic single nucleotide polymorphism (SNPs) were discovered, compared with ∼3000 SNPs when aligning to the closest true reference genomes, Setaria viridis (L.) P. Beauv. and S. italica (L.) P. Beauv. Cross-validation revealed that missing genotypes were imputed by the random forest method with a median accuracy of 0.85 regardless of heterozygote frequency. Using the Urochloa spp. hybrids, we illustrated how filtering samples based only on genotype quality (GQ) creates genotype bias; a depth threshold based on EGQ is also needed regardless of whether genotypes are called using a diploidized or allele dosage model.
Collapse
Affiliation(s)
- Filipe Inácio Matias
- Genetics Dep., Luiz de Queiroz College of Agriculture, Univ. of São Paulo, Av. Pádua Dias, 11, C. P. 9, 13.418-900, Piracicaba, São Paulo, Brazil
| | | | - Sheila Tiemi Nagamatsu
- Genetics and Evolution Dep., Univ. of Campinas, Cidade Univ. Zeferino Vaz, 13.083-970, Campinas, São Paulo, Brazil
| | | | - Cacilda Borges do Valle
- Embrapa Beef Cattle, Av. Rádio Maia, 830, Zona Rural, 79.106-550, Campo Grande, Mato Grosso do Sul, Brazil
| | | | - Roberto Fritsche-Neto
- Genetics Dep., Luiz de Queiroz College of Agriculture, Univ. of São Paulo, Av. Pádua Dias, 11, C. P. 9, 13.418-900, Piracicaba, São Paulo, Brazil
| | | |
Collapse
|
20
|
Exclusion and Genomic Relatedness Methods for Assignment of Parentage Using Genotyping-by-Sequencing Data. G3-GENES GENOMES GENETICS 2019; 9:3239-3247. [PMID: 31383721 PMCID: PMC6778805 DOI: 10.1534/g3.119.400501] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Genotypes are often used to assign parentage in agricultural and ecological settings. Sequencing can be used to obtain genotypes but does not provide unambiguous genotype calls, especially when sequencing depth is low in order to reduce costs. In that case, standard parentage analysis methods no longer apply. A strategy for using low-depth sequencing data for parentage assignment is developed here. It entails the use of relatedness estimates along with a metric termed excess mismatch rate which, for parent-offspring pairs or trios, is the difference between the observed mismatch rate and the rate expected under a model of inheritance and allele reads without error. When more than one putative parent has similar statistics, bootstrapping can provide a measure of the relatedness similarity. Putative parent-offspring trios can be further checked for consistency by comparing the offspring’s estimated inbreeding to half the parent relatedness. Suitable thresholds are required for each metric. These methods were applied to a deer breeding operation consisting of two herds of different breeds. Relatedness estimates were more in line with expectation when the herds were analyzed separately than when combined, although this did not alter which parents were the best matches with each offspring. Parentage results were largely consistent with those based on a microsatellite parentage panel with three discordant parent assignments out of 1561. Two models are investigated to allow the parentage metrics to be calculated with non-random selection of alleles. The tools and strategies given here allow parentage to be assigned from low-depth sequencing data.
Collapse
|
21
|
Kling D, Tillmar A. Forensic genealogy—A comparison of methods to infer distant relationships based on dense SNP data. Forensic Sci Int Genet 2019; 42:113-124. [DOI: 10.1016/j.fsigen.2019.06.019] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2019] [Revised: 05/15/2019] [Accepted: 06/24/2019] [Indexed: 12/16/2022]
|
22
|
Houston RD, Macqueen DJ. Atlantic salmon (Salmo salar L.) genetics in the 21st century: taking leaps forward in aquaculture and biological understanding. Anim Genet 2019; 50:3-14. [PMID: 30426521 PMCID: PMC6492011 DOI: 10.1111/age.12748] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/15/2018] [Indexed: 12/17/2022]
Abstract
Atlantic salmon (Salmo salar L.) is among the most iconic and economically important fish species and was the first member of Salmonidae to have a high-quality reference genome assembly published. Advances in genomics have become increasingly central to the genetic improvement of farmed Atlantic salmon as well as conservation of wild salmon stocks. The salmon genome has also been pivotal in shaping our understanding of the evolutionary and functional consequences arising from an ancestral whole-genome duplication event characterising all Salmonidae members. Here, we provide a review of the current status of Atlantic salmon genetics and genomics, focussed on progress made from genome-wide research aimed at improving aquaculture production and enhancing understanding of salmonid ecology, physiology and evolution. We present our views on the future direction of salmon genomics, including the role of emerging technologies (e.g. genome editing) in elucidating genetic features that underpin functional variation in traits of commercial and evolutionary importance.
Collapse
Affiliation(s)
- R. D. Houston
- The Roslin Institute and Royal (Dick) School of Veterinary StudiesThe University of EdinburghMidlothianEH25 9RGUK
| | - D. J. Macqueen
- School of Biological SciencesUniversity of AberdeenAberdeenAB24 2TZUK
| |
Collapse
|
23
|
Zenger KR, Khatkar MS, Jones DB, Khalilisamani N, Jerry DR, Raadsma HW. Genomic Selection in Aquaculture: Application, Limitations and Opportunities With Special Reference to Marine Shrimp and Pearl Oysters. Front Genet 2019; 9:693. [PMID: 30728827 PMCID: PMC6351666 DOI: 10.3389/fgene.2018.00693] [Citation(s) in RCA: 61] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2018] [Accepted: 12/11/2018] [Indexed: 11/20/2022] Open
Abstract
Within aquaculture industries, selection based on genomic information (genomic selection) has the profound potential to change genetic improvement programs and production systems. Genomic selection exploits the use of realized genomic relationships among individuals and information from genome-wide markers in close linkage disequilibrium with genes of biological and economic importance. We discuss the technical advances, practical requirements, and commercial applications that have made genomic selection feasible in a range of aquaculture industries, with a particular focus on molluscs (pearl oysters, Pinctada maxima) and marine shrimp (Litopenaeus vannamei and Penaeus monodon). The use of low-cost genome sequencing has enabled cost-effective genotyping on a large scale and is of particular value for species without a reference genome or access to commercial genotyping arrays. We highlight the pitfalls and offer the solutions to the genotyping by sequencing approach and the building of appropriate genetic resources to undertake genomic selection from first-hand experience. We describe the potential to capture large-scale commercial phenotypes based on image analysis and artificial intelligence through machine learning, as inputs for calculation of genomic breeding values. The application of genomic selection over traditional aquatic breeding programs offers significant advantages through being able to accurately predict complex polygenic traits including disease resistance; increasing rates of genetic gain; minimizing inbreeding; and negating potential limiting effects of genotype by environment interactions. Further practical advantages of genomic selection through the use of large-scale communal mating and rearing systems are highlighted, as well as presenting rate-limiting steps that impact on attaining maximum benefits from adopting genomic selection. Genomic selection is now at the tipping point where commercial applications can be readily adopted and offer significant short- and long-term solutions to sustainable and profitable aquaculture industries.
Collapse
Affiliation(s)
- Kyall R Zenger
- Centre for Sustainable Tropical Fisheries and Aquaculture, College of Science and Engineering, James Cook University, Townsville, QLD, Australia.,ARC Research Hub for Advanced Prawn Breeding, James Cook University, Townsville, QLD, Australia
| | - Mehar S Khatkar
- ARC Research Hub for Advanced Prawn Breeding, James Cook University, Townsville, QLD, Australia.,Sydney School of Veterinary Science, Faculty of Science, The University of Sydney, Camden, NSW, Australia
| | - David B Jones
- Centre for Sustainable Tropical Fisheries and Aquaculture, College of Science and Engineering, James Cook University, Townsville, QLD, Australia
| | - Nima Khalilisamani
- ARC Research Hub for Advanced Prawn Breeding, James Cook University, Townsville, QLD, Australia.,Sydney School of Veterinary Science, Faculty of Science, The University of Sydney, Camden, NSW, Australia
| | - Dean R Jerry
- Centre for Sustainable Tropical Fisheries and Aquaculture, College of Science and Engineering, James Cook University, Townsville, QLD, Australia.,ARC Research Hub for Advanced Prawn Breeding, James Cook University, Townsville, QLD, Australia.,Tropical Futures Institute, James Cook University Singapore, Singapore, Singapore
| | - Herman W Raadsma
- ARC Research Hub for Advanced Prawn Breeding, James Cook University, Townsville, QLD, Australia.,Sydney School of Veterinary Science, Faculty of Science, The University of Sydney, Camden, NSW, Australia
| |
Collapse
|
24
|
Linkage Disequilibrium Estimation in Low Coverage High-Throughput Sequencing Data. Genetics 2018; 209:389-400. [PMID: 29588288 PMCID: PMC5972415 DOI: 10.1534/genetics.118.300831] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2018] [Accepted: 03/22/2018] [Indexed: 12/31/2022] Open
Abstract
High-throughput sequencing methods provide a cost-effective approach for genotyping and are commonly used in population genetics studies. A drawback of these methods, however, is that sequencing and genotyping errors can arise... High-throughput sequencing methods that multiplex a large number of individuals have provided a cost-effective approach for discovering genome-wide genetic variation in large populations. These sequencing methods are increasingly being utilized in population genetic studies across a diverse range of species. Two side-effects of these methods, however, are (1) sequencing errors and (2) heterozygous genotypes called as homozygous due to only one allele at a particular locus being sequenced, which occurs when the sequencing depth is insufficient. Both of these errors have a profound effect on the estimation of linkage disequilibrium (LD) and, if not taken into account, lead to inaccurate estimates. We developed a new likelihood method, GUS-LD, to estimate pairwise linkage disequilibrium using low coverage sequencing data that accounts for undercalled heterozygous genotypes and sequencing errors. Our findings show that accurate estimates were obtained using GUS-LD, whereas underestimation of LD results if no adjustment is made for the errors.
Collapse
|