1
|
Burridge AJ, Winfield M, Przewieslik‐Allen A, Edwards KJ, Siddique I, Barral‐Arca R, Griffiths S, Cheng S, Huang Z, Feng C, Dreisigacker S, Bentley AR, Brown‐Guedira G, Barker GL. Development of a next generation SNP genotyping array for wheat. PLANT BIOTECHNOLOGY JOURNAL 2024; 22:2235-2247. [PMID: 38520342 PMCID: PMC11258986 DOI: 10.1111/pbi.14341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 03/05/2024] [Accepted: 03/06/2024] [Indexed: 03/25/2024]
Abstract
High-throughput genotyping arrays have provided a cost-effective, reliable and interoperable system for genotyping hexaploid wheat and its relatives. Existing, highly cited arrays including our 35K Wheat Breeder's array and the Illumina 90K array were designed based on a limited amount of varietal sequence diversity and with imperfect knowledge of SNP positions. Recent progress in wheat sequencing has given us access to a vast pool of SNP diversity, whilst technological improvements have allowed us to fit significantly more probes onto a 384-well format Axiom array than previously possible. Here we describe a novel Axiom genotyping array, the 'Triticum aestivum Next Generation' array (TaNG), largely derived from whole genome skim sequencing of 204 elite wheat lines and 111 wheat landraces taken from the Watkins 'Core Collection'. We used a novel haplotype optimization approach to select SNPs with the highest combined varietal discrimination and a design iteration step to test and replace SNPs which failed to convert to reliable markers. The final design with 43 372 SNPs contains a combination of haplotype-optimized novel SNPs and legacy cross-platform markers. We show that this design has an improved distribution of SNPs compared to previous arrays and can be used to generate genetic maps with a significantly higher number of distinct bins than our previous array. We also demonstrate the improved performance of TaNGv1.1 for Genome-wide association studies (GWAS) and its utility for Copy Number Variation (CNV) analysis. The array is commercially available with supporting marker annotations and initial genotyping results freely available.
Collapse
Affiliation(s)
| | - Mark Winfield
- School of Biological SciencesUniversity of BristolBristolUK
| | | | | | - Imteaz Siddique
- Thermo Fisher Scientific3450 Central ExpresswaySanta ClaraCAUSA
| | | | | | - Shifeng Cheng
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenChina
| | - Zejian Huang
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenChina
| | - Cong Feng
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenChina
| | | | | | - Gina Brown‐Guedira
- Plant Science Research UnitUSDA Agricultural Research ServiceRaleighNCUSA
| | - Gary L. Barker
- School of Biological SciencesUniversity of BristolBristolUK
| |
Collapse
|
2
|
Tumas H, Ilska JJ, Gérardi S, Laroche J, A’Hara S, Boyle B, Janes M, McLean P, Lopez G, Lee SJ, Cottrell J, Gorjanc G, Bousquet J, Woolliams JA, MacKay JJ. High-density genetic linkage mapping in Sitka spruce advances the integration of genomic resources in conifers. G3 (BETHESDA, MD.) 2024; 14:jkae020. [PMID: 38366548 PMCID: PMC10989875 DOI: 10.1093/g3journal/jkae020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Accepted: 01/03/2024] [Indexed: 02/18/2024]
Abstract
In species with large and complex genomes such as conifers, dense linkage maps are a useful resource for supporting genome assembly and laying the genomic groundwork at the structural, populational, and functional levels. However, most of the 600+ extant conifer species still lack extensive genotyping resources, which hampers the development of high-density linkage maps. In this study, we developed a linkage map relying on 21,570 single nucleotide polymorphism (SNP) markers in Sitka spruce (Picea sitchensis [Bong.] Carr.), a long-lived conifer from western North America that is widely planted for productive forestry in the British Isles. We used a single-step mapping approach to efficiently combine RAD-seq and genotyping array SNP data for 528 individuals from 2 full-sib families. As expected for spruce taxa, the saturated map contained 12 linkages groups with a total length of 2,142 cM. The positioning of 5,414 unique gene coding sequences allowed us to compare our map with that of other Pinaceae species, which provided evidence for high levels of synteny and gene order conservation in this family. We then developed an integrated map for P. sitchensis and Picea glauca based on 27,052 markers and 11,609 gene sequences. Altogether, these 2 linkage maps, the accompanying catalog of 286,159 SNPs and the genotyping chip developed, herein, open new perspectives for a variety of fundamental and more applied research objectives, such as for the improvement of spruce genome assemblies, or for marker-assisted sustainable management of genetic resources in Sitka spruce and related species.
Collapse
Affiliation(s)
- Hayley Tumas
- Department of Biology, University of Oxford, Oxford OX1 3RB, UK
| | - Joana J Ilska
- The Roslin Institute, Royal (Dick) School of Veterinary Science, University of Edinburgh, Midlothian EH25 9RG, UK
| | - Sebastien Gérardi
- Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, QC GIV 0A6, Canada
- Institute for Systems and Integrative Biology, Université Laval, Québec, QC GIV 0A6, Canada
| | - Jerome Laroche
- Institute for Systems and Integrative Biology, Université Laval, Québec, QC GIV 0A6, Canada
| | - Stuart A’Hara
- Forest Research, Northern Research Station, Midlothian EH25 9SY, UK
| | - Brian Boyle
- Institute for Systems and Integrative Biology, Université Laval, Québec, QC GIV 0A6, Canada
| | - Mateja Janes
- The Roslin Institute, Royal (Dick) School of Veterinary Science, University of Edinburgh, Midlothian EH25 9RG, UK
| | - Paul McLean
- Forest Research, Northern Research Station, Midlothian EH25 9SY, UK
| | - Gustavo Lopez
- Forest Research, Northern Research Station, Midlothian EH25 9SY, UK
| | - Steve J Lee
- Forest Research, Northern Research Station, Midlothian EH25 9SY, UK
| | - Joan Cottrell
- Forest Research, Northern Research Station, Midlothian EH25 9SY, UK
| | - Gregor Gorjanc
- The Roslin Institute, Royal (Dick) School of Veterinary Science, University of Edinburgh, Midlothian EH25 9RG, UK
| | - Jean Bousquet
- Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, QC GIV 0A6, Canada
- Institute for Systems and Integrative Biology, Université Laval, Québec, QC GIV 0A6, Canada
| | - John A Woolliams
- The Roslin Institute, Royal (Dick) School of Veterinary Science, University of Edinburgh, Midlothian EH25 9RG, UK
| | - John J MacKay
- Department of Biology, University of Oxford, Oxford OX1 3RB, UK
| |
Collapse
|
3
|
Lo T, Coombe L, Gagalova KK, Marr A, Warren RL, Kirk H, Pandoh P, Zhao Y, Moore RA, Mungall AJ, Ritland C, Pavy N, Jones SJM, Bohlmann J, Bousquet J, Birol I, Thomson A. Assembly and annotation of the black spruce genome provide insights on spruce phylogeny and evolution of stress response. G3 (BETHESDA, MD.) 2023; 14:jkad247. [PMID: 37875130 PMCID: PMC10755193 DOI: 10.1093/g3journal/jkad247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 05/17/2023] [Accepted: 10/09/2023] [Indexed: 10/26/2023]
Abstract
Black spruce (Picea mariana [Mill.] B.S.P.) is a dominant conifer species in the North American boreal forest that plays important ecological and economic roles. Here, we present the first genome assembly of P. mariana with a reconstructed genome size of 18.3 Gbp and NG50 scaffold length of 36.0 kbp. A total of 66,332 protein-coding sequences were predicted in silico and annotated based on sequence homology. We analyzed the evolutionary relationships between P. mariana and 5 other spruces for which complete nuclear and organelle genome sequences were available. The phylogenetic tree estimated from mitochondrial genome sequences agrees with biogeography; specifically, P. mariana was strongly supported as a sister lineage to P. glauca and 3 other taxa found in western North America, followed by the European Picea abies. We obtained mixed topologies with weaker statistical support in phylogenetic trees estimated from nuclear and chloroplast genome sequences, indicative of ancient reticulate evolution affecting these 2 genomes. Clustering of protein-coding sequences from the 6 Picea taxa and 2 Pinus species resulted in 34,776 orthogroups, 560 of which appeared to be specific to P. mariana. Analysis of these specific orthogroups and dN/dS analysis of positive selection signatures for 497 single-copy orthogroups identified gene functions mostly related to plant development and stress response. The P. mariana genome assembly and annotation provides a valuable resource for forest genetics research and applications in this broadly distributed species, especially in relation to climate adaptation.
Collapse
Affiliation(s)
- Theodora Lo
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC V5Z 4S6, Canada
| | - Lauren Coombe
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC V5Z 4S6, Canada
| | - Kristina K Gagalova
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC V5Z 4S6, Canada
| | - Alex Marr
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC V5Z 4S6, Canada
| | - René L Warren
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC V5Z 4S6, Canada
| | - Heather Kirk
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC V5Z 4S6, Canada
| | - Pawan Pandoh
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC V5Z 4S6, Canada
| | - Yongjun Zhao
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC V5Z 4S6, Canada
| | - Richard A Moore
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC V5Z 4S6, Canada
| | - Andrew J Mungall
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC V5Z 4S6, Canada
| | - Carol Ritland
- Department of Forest and Conservation Sciences, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - Nathalie Pavy
- Canada Research Chair in Forest Genomics, Laval University, Quebec City, QC G1V 0A6, Canada
| | - Steven J M Jones
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC V5Z 4S6, Canada
| | - Joerg Bohlmann
- Department of Forest and Conservation Sciences, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
- Department of Botany, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
| | - Jean Bousquet
- Canada Research Chair in Forest Genomics, Laval University, Quebec City, QC G1V 0A6, Canada
| | - Inanç Birol
- Canada’s Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, BC V5Z 4S6, Canada
| | - Ashley Thomson
- Faculty of Natural Resources Management, Lakehead University, Thunder Bay, ON P7B 5E1, Canada
| |
Collapse
|
4
|
Wood Formation under Changing Environment: Omics Approaches to Elucidate the Mechanisms Driving the Early-to-Latewood Transition in Conifers. FORESTS 2022. [DOI: 10.3390/f13040608] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
The global change scenarios highlight the urgency of clarifying the mechanisms driving the determination of wood traits in forest trees. Coniferous xylem is characterized by the alternation between earlywood (EW) and latewood (LW), on which proportions the wood density depend, one of the most important mechanical xylem qualities. However, the molecular mechanisms triggering the transition between the production of cells with the typical features of EW to the LW are still far from being completely elucidated. The increasing availability of omics resources for conifers, e.g., genomes and transcriptomes, would lay the basis for the comprehension of wood formation dynamics, boosting both breeding and gene-editing approaches. This review is intended to introduce the importance of wood formation dynamics and xylem traits of conifers in a changing environment. Then, an up-to-date overview of the omics resources available for conifers was reported, focusing on both genomes and transcriptomes. Later, an analysis of wood formation studies using omics approaches was conducted, with the aim of elucidating the main metabolic pathways involved in EW and LW determination. Finally, the future perspectives and the urgent needs on this research topic were highlighted.
Collapse
|
5
|
Chen ZQ, Zan Y, Milesi P, Zhou L, Chen J, Li L, Cui B, Niu S, Westin J, Karlsson B, García-Gil MR, Lascoux M, Wu HX. Leveraging breeding programs and genomic data in Norway spruce (Picea abies L. Karst) for GWAS analysis. Genome Biol 2021; 22:179. [PMID: 34120648 PMCID: PMC8201819 DOI: 10.1186/s13059-021-02392-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Accepted: 05/26/2021] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Genome-wide association studies (GWAS) identify loci underlying the variation of complex traits. One of the main limitations of GWAS is the availability of reliable phenotypic data, particularly for long-lived tree species. Although an extensive amount of phenotypic data already exists in breeding programs, accounting for its high heterogeneity is a great challenge. We combine spatial and factor-analytics analyses to standardize the heterogeneous data from 120 field experiments of 483,424 progenies of Norway spruce to implement the largest reported GWAS for trees using 134 605 SNPs from exome sequencing of 5056 parental trees. RESULTS We identify 55 novel quantitative trait loci (QTLs) that are associated with phenotypic variation. The largest number of QTLs is associated with the budburst stage, followed by diameter at breast height, wood quality, and frost damage. Two QTLs with the largest effect have a pleiotropic effect for budburst stage, frost damage, and diameter and are associated with MAP3K genes. Genotype data called from exome capture, recently developed SNP array and gene expression data indirectly support this discovery. CONCLUSION Several important QTLs associated with growth and frost damage have been verified in several southern and northern progeny plantations, indicating that these loci can be used in QTL-assisted genomic selection. Our study also demonstrates that existing heterogeneous phenotypic data from breeding programs, collected over several decades, is an important source for GWAS and that such integration into GWAS should be a major area of inquiry in the future.
Collapse
Affiliation(s)
- Zhi-Qiang Chen
- Umeå Plant Science Centre, Department Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, SE-90183, Umeå, Sweden
| | - Yanjun Zan
- Umeå Plant Science Centre, Department Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, SE-90183, Umeå, Sweden
| | - Pascal Milesi
- Program in Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre and SciLifeLab, Uppsala University, Uppsala, Sweden
| | - Linghua Zhou
- Umeå Plant Science Centre, Department Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, SE-90183, Umeå, Sweden
| | - Jun Chen
- Program in Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre and SciLifeLab, Uppsala University, Uppsala, Sweden
- College of Life Sciences, Zhejiang University, Zhejiang, 310058, Hangzhou, China
| | - Lili Li
- Program in Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre and SciLifeLab, Uppsala University, Uppsala, Sweden
| | - BinBin Cui
- College of Biochemistry and Environmental Engineering, Baoding University, Baoding, 071000, Hebei, China
| | - Shihui Niu
- Beijing Advanced Innovation Centre for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, China
| | - Johan Westin
- Skogforsk, Box 3, SE-91821, Sävar, Sweden
- Unit for Field-Based Forest Research, Swedish University of Agricultural Sciences, SE-90183, Umeå, Sweden
| | - Bo Karlsson
- Skogforsk, Ekebo, 2250, SE-26890, Svalöv, Sweden
| | - Maria Rosario García-Gil
- Umeå Plant Science Centre, Department Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, SE-90183, Umeå, Sweden
| | - Martin Lascoux
- Program in Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre and SciLifeLab, Uppsala University, Uppsala, Sweden
| | - Harry X Wu
- Umeå Plant Science Centre, Department Forest Genetics and Plant Physiology, Swedish University of Agricultural Sciences, SE-90183, Umeå, Sweden.
- Beijing Advanced Innovation Centre for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, China.
- CSIRO National Collection Research Australia, Black Mountain Laboratory, Canberra, ACT, 2601, Australia.
| |
Collapse
|
6
|
Caballero M, Lauer E, Bennett J, Zaman S, McEvoy S, Acosta J, Jackson C, Townsend L, Eckert A, Whetten RW, Loopstra C, Holliday J, Mandal M, Wegrzyn JL, Isik F. Toward genomic selection in Pinus taeda: Integrating resources to support array design in a complex conifer genome. APPLICATIONS IN PLANT SCIENCES 2021; 9:e11439. [PMID: 34268018 PMCID: PMC8272584 DOI: 10.1002/aps3.11439] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 05/21/2021] [Indexed: 05/13/2023]
Abstract
PREMISE An informatics approach was used for the construction of an Axiom genotyping array from heterogeneous, high-throughput sequence data to assess the complex genome of loblolly pine (Pinus taeda). METHODS High-throughput sequence data, sourced from exome capture and whole genome reduced-representation approaches from 2698 trees across five sequence populations, were analyzed with the improved genome assembly and annotation for the loblolly pine. A variant detection, filtering, and probe design pipeline was developed to detect true variants across and within populations. From 8.27 million variants, a total of 642,275 were evaluated and 423,695 of those were screened across a range-wide population. RESULTS The final informatics and screening approach delivered an Axiom array representing 46,439 high-confidence variants to the forest tree breeding and genetics community. Based on the annotated reference genome, 34% were located in or directly upstream or downstream of genic regions. DISCUSSION The Pita50K array represents a genome-wide resource developed from sequence data for an economically important conifer, loblolly pine. It uniquely integrates independent projects that assessed trees sampled across the native range. The challenges associated with the large and repetitive genome are addressed in the development of this resource.
Collapse
Affiliation(s)
- Madison Caballero
- Department of Ecology and Evolutionary BiologyUniversity of ConnecticutStorrsConnecticut06269USA
| | - Edwin Lauer
- Department of Forestry and Environmental ResourcesNorth Carolina State UniversityRaleighNorth Carolina27695USA
| | - Jeremy Bennett
- Department of Ecology and Evolutionary BiologyUniversity of ConnecticutStorrsConnecticut06269USA
| | - Sumaira Zaman
- Department of Ecology and Evolutionary BiologyUniversity of ConnecticutStorrsConnecticut06269USA
| | - Susan McEvoy
- Department of Ecology and Evolutionary BiologyUniversity of ConnecticutStorrsConnecticut06269USA
| | - Juan Acosta
- Department of Forestry and Environmental ResourcesNorth Carolina State UniversityRaleighNorth Carolina27695USA
| | - Colin Jackson
- Department of Forestry and Environmental ResourcesNorth Carolina State UniversityRaleighNorth Carolina27695USA
| | - Laura Townsend
- Department of Forestry and Environmental ResourcesNorth Carolina State UniversityRaleighNorth Carolina27695USA
| | - Andrew Eckert
- Department of BiologyVirginia Commonwealth UniversityRichmondVirginia23284USA
| | - Ross W. Whetten
- Department of Forestry and Environmental ResourcesNorth Carolina State UniversityRaleighNorth Carolina27695USA
| | - Carol Loopstra
- Department of Ecology and Conservation BiologyTexas A&M UniversityCollege StationTexas77843USA
| | - Jason Holliday
- Department of Forest Resources and Environmental ConservationVirginia Polytechnic Institute and State UniversityBlacksburgVirginia24061USA
| | - Mihir Mandal
- Department of BiologyClaflin UniversityOrangeburgSouth Carolina29115USA
| | - Jill L. Wegrzyn
- Department of Ecology and Evolutionary BiologyUniversity of ConnecticutStorrsConnecticut06269USA
| | - Fikret Isik
- Department of Forestry and Environmental ResourcesNorth Carolina State UniversityRaleighNorth Carolina27695USA
| |
Collapse
|
7
|
Bernhardsson C, Zan Y, Chen Z, Ingvarsson PK, Wu HX. Development of a highly efficient 50K single nucleotide polymorphism genotyping array for the large and complex genome of Norway spruce (Picea abies L. Karst) by whole genome resequencing and its transferability to other spruce species. Mol Ecol Resour 2020; 21:880-896. [PMID: 33179386 PMCID: PMC7984398 DOI: 10.1111/1755-0998.13292] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 10/23/2020] [Accepted: 11/04/2020] [Indexed: 12/30/2022]
Abstract
Norway spruce (Picea abies L. Karst) is one of the most important forest tree species with significant economic and ecological impact in Europe. For decades, genomic and genetic studies on Norway spruce have been challenging due to the large and repetitive genome (19.6 Gb with more than 70% being repetitive). To accelerate genomic studies, including population genetics, genome‐wide association studies (GWAS) and genomic selection (GS), in Norway spruce and related species, we here report on the design and performance of a 50K single nucleotide polymorphism (SNP) genotyping array for Norway spruce. The array is developed based on whole genome resequencing (WGS), making it the first WGS‐based SNP array in any conifer species so far. After identifying SNPs using genome resequencing data from 29 trees collected in northern Europe, we adopted a two‐step approach to design the array. First, we built a 450K screening array and used this to genotype a population of 480 trees sampled from both natural and breeding populations across the Norway spruce distribution range. These samples were then used to select high‐confidence probes that were put on the final 50K array. The SNPs selected are distributed over 45,552 scaffolds from the P. abies version 1.0 genome assembly and target 19,954 unique gene models with an even coverage of the 12 linkage groups in Norway spruce. We show that the array has a 99.5% probe specificity, >98% Mendelian allelic inheritance concordance, an average sample call rate of 96.30% and an SNP call rate of 98.90% in family trios and haploid tissues. We also observed that 23,797 probes (50%) could be identified with high confidence in three other spruce species (white spruce [Picea glauca], black spruce [P. mariana] and Sitka spruce [P. sitchensis]). The high‐quality genotyping array will be a valuable resource for genetic and genomic studies in Norway spruce as well as in other conifer species of the same genus.
Collapse
Affiliation(s)
- Carolina Bernhardsson
- Department of Ecology and Environmental Science, Umeå University, Umeå, Sweden.,Department of Organismal Biology, Uppsala University, Uppsala, Sweden
| | - Yanjun Zan
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Science, Umeå, Sweden
| | - Zhiqiang Chen
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Science, Umeå, Sweden
| | - Pär K Ingvarsson
- Linnean Centre for Plant Biology, Department of Plant Biology, Uppsala BioCenter, Swedish University of Agricultural Science, Uppsala, Sweden
| | - Harry X Wu
- Umeå Plant Science Centre, Department of Forest Genetics and Plant Physiology, Swedish University of Agricultural Science, Umeå, Sweden.,Beijing Advanced Innovation Centre for Tree Breeding by Molecular Design, Beijing Forestry University, Beijing, China.,Black Mountain Laboratory, CSIRO National Research Collection Australia, Canberra, ACT, Australia
| |
Collapse
|
8
|
Genomic prediction for hastening and improving efficiency of forward selection in conifer polycross mating designs: an example from white spruce. Heredity (Edinb) 2020; 124:562-578. [PMID: 31969718 PMCID: PMC7080810 DOI: 10.1038/s41437-019-0290-3] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Revised: 11/29/2019] [Accepted: 12/08/2019] [Indexed: 11/08/2022] Open
Abstract
Genomic selection (GS) has a large potential for improving the prediction accuracy of breeding values and significantly reducing the length of breeding cycles. In this context, the choice of mating designs becomes critical to improve the efficiency of breeding operations and to obtain the largest genetic gains per time unit. Polycross mating designs have been traditionally used in tree and plant breeding to perform backward selection of the female parents. The possibility to use genetic markers for paternity identification and for building genomic prediction models should allow for a broader use of polycross tests in forward selection schemes. We compared the accuracies of genomic predictions of offspring's breeding values from a polycross and a full-sib (partial diallel) mating design with similar genetic background in white spruce (Picea glauca). Trees were phenotyped for growth and wood quality traits, and genotyped for 4092 SNPs representing as many gene loci distributed across the 12 spruce chromosomes. For the polycross progeny test, heritability estimates were smaller, but more precise using the genomic BLUP (GBLUP) model as compared with pedigree-based models accounting for the maternal pedigree or for the reconstructed full pedigree. Cross-validations showed that GBLUP predictions were 22-52% more accurate than predictions based on the maternal pedigree, and 5-7% more accurate than predictions using the reconstructed full pedigree. The accuracies of GBLUP predictions were high and in the same range for most traits between the polycross (0.61-0.70) and full-sib progeny tests (0.61-0.74). However, higher genetic gains per time unit were expected from the polycross mating design given the shorter time needed to conduct crosses. Considering the operational advantages of the polycross design in terms of easier handling of crosses and lower associated costs for test establishment, we believe that this mating scheme offers great opportunities for the development and operational application of forward GS.
Collapse
|
9
|
Core set construction and association analysis of Pinus massoniana from Guangdong province in southern China using SLAF-seq. Sci Rep 2019; 9:13157. [PMID: 31511632 PMCID: PMC6739479 DOI: 10.1038/s41598-019-49737-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Accepted: 08/31/2019] [Indexed: 12/31/2022] Open
Abstract
Germplasm resource collection and utilization are important in forestry species breeding. High-through sequencing technologies have been playing increasing roles in forestry breeding. In this study, specific-locus amplified fragment sequencing (SLAF-seq) was employed to analyze 149 masson pine (Pinus massoniana) accessions collected from Guangdong in China. A large number of 471,660 SNPs in the total collection were identified from 599,164 polymorphic SLAF tags. Population structure analysis showed that 149 masson pines could not be obviously divided into subpopulations. Two core sets, containing 29 masson pine accessions for increasing resin and wood yield respectively, were obtained from the total collection. Phenotypic analyses of five traits showed abundant variations, 25 suggestive and 9 significant SNPs were associated with the resin-yielding capacity (RYC') and volume of wood (VW) using EMMAX and FaST-LMM; 22 suggestive and 11 significant SNPs were associated with RYC' and VW using mrMLM and FASTmrMLM. Moreover, a large number of associated SNPs were detected in trait HT, DBH, RW and RYC using mrMLM, FASTmrMLM, FASTmrEMMA and ISIS EM-BLASSO. The core germplasm sets would be a valuable resource for masson pine improvement and breeding. In addition, the associated SNP markers would be meaningful for masson pine resource selection.
Collapse
|
10
|
Liu J, Sun Z, Mao X, Gerken H, Wang X, Yang W. Multiomics analysis reveals a distinct mechanism of oleaginousness in the emerging model alga Chromochloris zofingiensis. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2019; 98:745-758. [PMID: 30828893 DOI: 10.1111/tpj.14270] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2018] [Revised: 12/24/2018] [Accepted: 01/28/2019] [Indexed: 05/03/2023]
Abstract
Chromochloris zofingiensis, featured due to its capability to simultaneously synthesize triacylglycerol (TAG) and astaxanthin, is emerging as a leading candidate alga for production uses. To better understand the oleaginous mechanism of this alga, we conducted a multiomics analysis by systematically integrating time-resolved transcriptomes, lipidomes and metabolomes in response to nitrogen deprivation. The data analysis unraveled the distinct mechanism of TAG accumulation, which involved coordinated stimulation of multiple biological processes including supply of energy and reductants, carbon reallocation from protein and starch, and 'pushing' and 'pulling' carbon to TAG synthesis. Unlike the model alga Chlamydomonas, de novo fatty acid synthesis in C. zofingiensis was promoted, together with enhanced turnover of both glycolipids and phospholipids, supporting the drastic need of acyls for TAG assembly. Moreover, genomewide analysis identified many key functional enzymes and transcription factors that had engineering potential for TAG modulation. Two genes encoding glycerol-3-phosphate acyltransferase (GPAT), the first committed enzyme for TAG assembly, were found in the C. zofingiensis genome; in vivo functional characterization revealed that extrachloroplastic GPAT instead of chloroplastic GPAT played a central role in TAG synthesis. These findings illuminate distinct oleaginousness mechanisms in C. zofingiensis and pave the way towards rational manipulation of this alga to becone an emerging model for trait improvements.
Collapse
Affiliation(s)
- Jin Liu
- Laboratory for Algae Biotechnology & Innovation, College of Engineering, Peking University, Beijing, 100871, China
| | - Zheng Sun
- International Research Center for Marine Biosciences, Ministry of Science and Technology, Shanghai Ocean University, Shanghai, 201306, China
| | - Xuemei Mao
- Laboratory for Algae Biotechnology & Innovation, College of Engineering, Peking University, Beijing, 100871, China
| | - Henri Gerken
- School of Sustainable Engineering and the Built Environment, Arizona State University Polytechnic campus, Mesa, AZ, 85212, USA
| | - Xiaofei Wang
- Laboratory for Algae Biotechnology & Innovation, College of Engineering, Peking University, Beijing, 100871, China
| | - Wenqiang Yang
- Photosynthesis Research Center, Key Laboratory of Photobiology, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
| |
Collapse
|
11
|
Azaiez A, Pavy N, Gérardi S, Laroche J, Boyle B, Gagnon F, Mottet MJ, Beaulieu J, Bousquet J. A catalog of annotated high-confidence SNPs from exome capture and sequencing reveals highly polymorphic genes in Norway spruce (Picea abies). BMC Genomics 2018; 19:942. [PMID: 30558528 PMCID: PMC6296092 DOI: 10.1186/s12864-018-5247-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2018] [Accepted: 11/14/2018] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Norway spruce [Picea abies (L.) Karst.] is ecologically and economically one of the most important conifer worldwide. Our main goal was to develop a large catalog of annotated high confidence gene SNPs that should sustain the development of genomic tools for the conservation of natural and domesticated genetic diversity resources, and hasten tree breeding efforts in this species. RESULTS Targeted sequencing was achieved by capturing P. abies exome with probes previously designed from the sequenced transcriptome of white spruce (Picea glauca (Moench) Voss). Capture efficiency was high (74.5%) given a high level of exome conservation between the two species. Using stringent criteria, we delimited a set of 61,771 high-confidence SNPs across 13,543 genes. To validate SNPs, a high-throughput genotyping array was developed for a subset of 5571 predicted SNPs representing as many different gene loci, and was used to genotype over 1000 trees. The estimated true positive rate of the resource was 84.2%, which was comparable with the genotyping success rate obtained for P. abies control SNPs recycled from previous genotyping efforts. We also analyzed SNP abundance across various gene functional categories. Several GO terms and gene families involved in stress response were found over-represented in highly polymorphic genes. CONCLUSION The annotated high-confidence SNP catalog developed herein represents a valuable genomic resource, being representative of over 13 K genes distributed across the P. abies genome. This resource should serve a variety of population genomics and breeding applications in Norway spruce.
Collapse
Affiliation(s)
- Aïda Azaiez
- Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, Québec G1V 0A6 Canada
- Institute of Integrative Biology and Systems, Université Laval, Québec, Québec G1V 0A6 Canada
| | - Nathalie Pavy
- Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, Québec G1V 0A6 Canada
- Institute of Integrative Biology and Systems, Université Laval, Québec, Québec G1V 0A6 Canada
| | - Sébastien Gérardi
- Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, Québec G1V 0A6 Canada
- Institute of Integrative Biology and Systems, Université Laval, Québec, Québec G1V 0A6 Canada
| | - Jérôme Laroche
- Institute of Integrative Biology and Systems, Université Laval, Québec, Québec G1V 0A6 Canada
| | - Brian Boyle
- Institute of Integrative Biology and Systems, Université Laval, Québec, Québec G1V 0A6 Canada
| | - France Gagnon
- Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, Québec G1V 0A6 Canada
- Institute of Integrative Biology and Systems, Université Laval, Québec, Québec G1V 0A6 Canada
| | - Marie-Josée Mottet
- Direction de la recherche forestière, Ministère des Forêts, de la Faune et des Parcs du Québec, 2700 Einstein, Québec, Québec G1P 3W8 Canada
| | - Jean Beaulieu
- Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, Québec G1V 0A6 Canada
- Institute of Integrative Biology and Systems, Université Laval, Québec, Québec G1V 0A6 Canada
| | - Jean Bousquet
- Canada Research Chair in Forest Genomics, Forest Research Centre, Université Laval, Québec, Québec G1V 0A6 Canada
- Institute of Integrative Biology and Systems, Université Laval, Québec, Québec G1V 0A6 Canada
| |
Collapse
|
12
|
Heer K, Ullrich KK, Hiss M, Liepelt S, Schulze Brüning R, Zhou J, Opgenoorth L, Rensing SA. Detection of somatic epigenetic variation in Norway spruce via targeted bisulfite sequencing. Ecol Evol 2018; 8:9672-9682. [PMID: 30386566 PMCID: PMC6202725 DOI: 10.1002/ece3.4374] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Revised: 06/18/2018] [Accepted: 06/22/2018] [Indexed: 12/24/2022] Open
Abstract
Epigenetic mechanisms represent a possible mechanism for achieving a rapid response of long-lived trees to changing environmental conditions. However, our knowledge on plant epigenetics is largely limited to a few model species. With increasing availability of genomic resources for many tree species, it is now possible to adopt approaches from model species that permit to obtain single-base pair resolution data on methylation at a reasonable cost. Here, we used targeted bisulfite sequencing (TBS) to study methylation patterns in the conifer species Norway spruce (Picea abies). To circumvent the challenge of disentangling epigenetic and genetic differences, we focused on four clone pairs, where clone members were growing in different climatic conditions for 24 years. We targeted >26.000 genes using TBS and determined the performance and reproducibility of this approach. We characterized gene body methylation and compared methylation patterns between environments. We found highly comparable capture efficiency and coverage across libraries. Methylation levels were relatively constant across gene bodies, with 21.3 ± 0.3%, 11.0 ± 0.4% and 1.3 ± 0.2% in the CG, CHG, and CHH context, respectively. The variance in methylation profiles did not reveal consistent changes between environments, yet we could identify 334 differentially methylated positions (DMPs) between environments. This supports that changes in methylation patterns are a possible pathway for a plant to respond to environmental change. After this successful application of TBS in Norway spruce, we are confident that this approach can contribute to broaden our knowledge of methylation patterns in natural tree populations.
Collapse
Affiliation(s)
- Katrin Heer
- Conservation BiologyFaculty of BiologyPhilipps University MarburgMarburgGermany
- Department of EcologyFaculty of BiologyPhilipps University MarburgMarburgGermany
| | - Kristian K. Ullrich
- Plant Cell BiologyFaculty of BiologyPhilipps University MarburgMarburgGermany
- Department of Evolutionary GeneticsMax Planck Institute for Evolutionary BiologyPloenGermany
| | - Manuel Hiss
- Plant Cell BiologyFaculty of BiologyPhilipps University MarburgMarburgGermany
| | - Sascha Liepelt
- Conservation BiologyFaculty of BiologyPhilipps University MarburgMarburgGermany
| | | | - Jiabin Zhou
- College of Life SciencesShaanxi Normal UniversityXi'anChina
- State Key Laboratory of Grassland Agro‐EcosystemsSchool of Life SciencesLanzhou UniversityLanzhouChina
| | - Lars Opgenoorth
- Department of EcologyFaculty of BiologyPhilipps University MarburgMarburgGermany
| | - Stefan A. Rensing
- Plant Cell BiologyFaculty of BiologyPhilipps University MarburgMarburgGermany
- BIOSS Biological Signaling StudiesUniversity of FreiburgFreiburgGermany
| |
Collapse
|
13
|
Mukrimin M, Kovalchuk A, Neves LG, Jaber EHA, Haapanen M, Kirst M, Asiegbu FO. Genome-Wide Exon-Capture Approach Identifies Genetic Variants of Norway Spruce Genes Associated With Susceptibility to Heterobasidion parviporum Infection. FRONTIERS IN PLANT SCIENCE 2018; 9:793. [PMID: 29946332 PMCID: PMC6005875 DOI: 10.3389/fpls.2018.00793] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2018] [Accepted: 05/24/2018] [Indexed: 05/03/2023]
Abstract
Root and butt rot caused by members of the Heterobasidion annosum species complex is the most economically important disease of conifer trees in boreal forests. Wood decay in the infected trees dramatically decreases their value and causes considerable losses to forest owners. Trees vary in their susceptibility to Heterobasidion infection, but the genetic determinants underlying the variation in the susceptibility are not well-understood. We performed the identification of Norway spruce genes associated with the resistance to Heterobasidion parviporum infection using genome-wide exon-capture approach. Sixty-four clonal Norway spruce lines were phenotyped, and their responses to H. parviporum inoculation were determined by lesion length measurements. Afterwards, the spruce lines were genotyped by targeted resequencing and identification of genetic variants (SNPs). Genome-wide association analysis identified 10 SNPs located within 8 genes as significantly associated with the larger necrotic lesions in response to H. parviporum inoculation. The genetic variants identified in our analysis are potential marker candidates for future screening programs aiming at the differentiation of disease-susceptible and resistant trees.
Collapse
Affiliation(s)
- Mukrimin Mukrimin
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
- Department of Forestry, Faculty of Forestry, Hasanuddin University, Makassar, Indonesia
| | - Andriy Kovalchuk
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
| | | | - Emad H. A. Jaber
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
| | - Matti Haapanen
- Natural Resources Institute Finland (LUKE), Helsinki, Finland
| | - Matias Kirst
- School of Forest Resources and Conservation, University of Florida, Gainesville, FL, United States
| | - Fred O. Asiegbu
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
| |
Collapse
|
14
|
Genotyping and Sequencing Technologies in Population Genetics and Genomics. POPULATION GENOMICS 2018. [DOI: 10.1007/13836_2017_5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
15
|
Morin PA, Foote AD, Hill CM, Simon-Bouhet B, Lang AR, Louis M. SNP Discovery from Single and Multiplex Genome Assemblies of Non-model Organisms. Methods Mol Biol 2018; 1712:113-144. [PMID: 29224072 DOI: 10.1007/978-1-4939-7514-3_9] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Population genetic studies of non-model organisms often rely on initial ascertainment of genetic markers from a single individual or a small pool of individuals. This initial screening has been a significant barrier to beginning population studies on non-model organisms (Aitken et al., Mol Ecol 13:1423-1431, 2004; Morin et al., Trends Ecol Evol 19:208-216, 2004). As genomic data become increasingly available for non-model species, SNP ascertainment from across the genome can be performed directly from published genome contigs and short-read archive data. Alternatively, low to medium genome coverage from shotgun NGS library sequencing of single or pooled samples, or from reduced-representation libraries (e.g., capture enrichment; see Ref. "Hancock-Hanser et al., Mol Ecol Resour 13:254-268, 2013") can produce sufficient new data for SNP discovery with limited investment. We describe protocols for assembly of short read data to reference or related species genome contig sequences, followed by SNP discovery and filtering to obtain an optimal set of SNPs for population genotyping using a variety of downstream high-throughput genotyping methods.
Collapse
Affiliation(s)
- Phillip A Morin
- Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, 8901 La Jolla Shores Drive, La Jolla, CA, 92037, USA.
| | - Andrew D Foote
- Molecular Ecology and Fisheries Genetics Laboratory, School of Biological Sciences, Bangor University, Bangor, Gwynedd, LL57 2UW, UK
| | - Christopher M Hill
- Department of Computer Science, University of Maryland, College Park, MD, 20742, USA
| | - Benoit Simon-Bouhet
- Centre d'Etudes Biologiques de Chizé, UMR 7372 CNRS-Université de La Rochelle, 79360, Villiers-en-Bois, France
| | - Aimee R Lang
- Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, 8901 La Jolla Shores Drive, La Jolla, CA, 92037, USA
| | - Marie Louis
- Scottish Oceans Institute, University of St Andrews, East Sands, KY16 8LB, St Andrews, UK
| |
Collapse
|
16
|
Konar A, Choudhury O, Bullis R, Fiedler L, Kruser JM, Stephens MT, Gailing O, Schlarbaum S, Coggeshall MV, Staton ME, Carlson JE, Emrich S, Romero-Severson J. High-quality genetic mapping with ddRADseq in the non-model tree Quercus rubra. BMC Genomics 2017; 18:417. [PMID: 28558688 PMCID: PMC5450186 DOI: 10.1186/s12864-017-3765-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2016] [Accepted: 05/04/2017] [Indexed: 11/10/2022] Open
Abstract
Background Restriction site associated DNA sequencing (RADseq) has the potential to be a broadly applicable, low-cost approach for high-quality genetic linkage mapping in forest trees lacking a reference genome. The statistical inference of linear order must be as accurate as possible for the correct ordering of sequence scaffolds and contigs to chromosomal locations. Accurate maps also facilitate the discovery of chromosome segments containing allelic variants conferring resistance to the biotic and abiotic stresses that threaten forest trees worldwide. We used ddRADseq for genetic mapping in the tree Quercus rubra, with an approach optimized to produce a high-quality map. Our study design also enabled us to model the results we would have obtained with less depth of coverage. Results Our sequencing design produced a high sequencing depth in the parents (248×) and a moderate sequencing depth (15×) in the progeny. The digital normalization method of generating a de novo reference and the SAMtools SNP variant caller yielded the most SNP calls (78,725). The major drivers of map inflation were multiple SNPs located within the same sequence (77% of SNPs called). The highest quality map was generated with a low level of missing data (5%) and a genome-wide threshold of 0.025 for deviation from Mendelian expectation. The final map included 849 SNP markers (1.8% of the 78,725 SNPs called). Downsampling the individual FASTQ files to model lower depth of coverage revealed that sequencing the progeny using 96 samples per lane would have yielded too few SNP markers to generate a map, even if we had sequenced the parents at depth 248×. Conclusions The ddRADseq technology produced enough high-quality SNP markers to make a moderately dense, high-quality map. The success of this project was due to high depth of coverage of the parents, moderate depth of coverage of the progeny, a good framework map, an optimized bioinformatics pipeline, and rigorous premapping filters. The ddRADseq approach is useful for the construction of high-quality genetic maps in organisms lacking a reference genome if the parents and progeny are sequenced at sufficient depth. Technical improvements in reduced representation sequencing (RRS) approaches are needed to reduce the amount of missing data. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3765-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Arpita Konar
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Olivia Choudhury
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Rebecca Bullis
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Lauren Fiedler
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA
| | | | - Melissa T Stephens
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Oliver Gailing
- School of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, 49931, USA
| | - Scott Schlarbaum
- Department of Forestry, Wildlife and Fisheries, University of Tennessee, Knoxville, TN, 37996, USA
| | - Mark V Coggeshall
- School of Natural Resources, University of Missouri-Columbia, Columbia, MO, 65211, USA.,Hardwood Tree Improvement and Regeneration Center, USDA Forest Service Northern Research Station, West Lafayette, IN, 47907, USA
| | - Margaret E Staton
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN, 37996, USA
| | - John E Carlson
- Department of Ecosystem Science and Management, Penn State, University Park, State College, PA, 16802, USA
| | - Scott Emrich
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA
| | - Jeanne Romero-Severson
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA.
| |
Collapse
|
17
|
Lenz PRN, Beaulieu J, Mansfield SD, Clément S, Desponts M, Bousquet J. Factors affecting the accuracy of genomic selection for growth and wood quality traits in an advanced-breeding population of black spruce (Picea mariana). BMC Genomics 2017; 18:335. [PMID: 28454519 PMCID: PMC5410046 DOI: 10.1186/s12864-017-3715-5] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2016] [Accepted: 04/21/2017] [Indexed: 11/11/2022] Open
Abstract
Background Genomic selection (GS) uses information from genomic signatures consisting of thousands of genetic markers to predict complex traits. As such, GS represents a promising approach to accelerate tree breeding, which is especially relevant for the genetic improvement of boreal conifers characterized by long breeding cycles. In the present study, we tested GS in an advanced-breeding population of the boreal black spruce (Picea mariana [Mill.] BSP) for growth and wood quality traits, and concurrently examined factors affecting GS model accuracy. Results The study relied on 734 25-year-old trees belonging to 34 full-sib families derived from 27 parents and that were established on two contrasting sites. Genomic profiles were obtained from 4993 Single Nucleotide Polymorphisms (SNPs) representative of as many gene loci distributed among the 12 linkage groups common to spruce. GS models were obtained for four growth and wood traits. Validation using independent sets of trees showed that GS model accuracy was high, related to trait heritability and equivalent to that of conventional pedigree-based models. In forward selection, gains per unit of time were three times higher with the GS approach than with conventional selection. In addition, models were also accurate across sites, indicating little genotype-by-environment interaction in the area investigated. Using information from half-sibs instead of full-sibs led to a significant reduction in model accuracy, indicating that the inclusion of relatedness in the model contributed to its higher accuracies. About 500 to 1000 markers were sufficient to obtain GS model accuracy almost equivalent to that obtained with all markers, whether they were well spread across the genome or from a single linkage group, further confirming the implication of relatedness and potential long-range linkage disequilibrium (LD) in the high accuracy estimates obtained. Only slightly higher model accuracy was obtained when using marker subsets that were identified to carry large effects, indicating a minor role for short-range LD in this population. Conclusions This study supports the integration of GS models in advanced-generation tree breeding programs, given that high genomic prediction accuracy was obtained with a relatively small number of markers due to high relatedness and family structure in the population. In boreal spruce breeding programs and similar ones with long breeding cycles, much larger gain per unit of time can be obtained from genomic selection at an early age than by the conventional approach. GS thus appears highly profitable, especially in the context of forward selection in species which are amenable to mass vegetative propagation of selected stock, such as spruces.
Collapse
Affiliation(s)
- Patrick R N Lenz
- Canadian Wood Fibre Centre, Canadian Forest Service, Natural Resources Canada, Government of Canada, 1055 du PEPS, P.O. Box 10380, Québec, Québec, G1V 4C7, Canada. .,Canada Research Chair in Forest Genomics, Institute of Systems and Integrative Biology and Centre for Forest Research, Université Laval, 1030, Avenue de la Médecine, Québec, Québec, G1V 0A6, Canada.
| | - Jean Beaulieu
- Canadian Wood Fibre Centre, Canadian Forest Service, Natural Resources Canada, Government of Canada, 1055 du PEPS, P.O. Box 10380, Québec, Québec, G1V 4C7, Canada.,Canada Research Chair in Forest Genomics, Institute of Systems and Integrative Biology and Centre for Forest Research, Université Laval, 1030, Avenue de la Médecine, Québec, Québec, G1V 0A6, Canada
| | - Shawn D Mansfield
- Department of Wood Science, Forest Sciences Centre, University of British Columbia, Vancouver, British Columbia, V6T 1Z4, Canada
| | - Sébastien Clément
- Canadian Wood Fibre Centre, Canadian Forest Service, Natural Resources Canada, Government of Canada, 1055 du PEPS, P.O. Box 10380, Québec, Québec, G1V 4C7, Canada
| | - Mireille Desponts
- Ministère des Forêts, de la Faune et des Parcs, Gouvernement du Québec, Direction de la recherche forestière, 2700 rue Einstein, Québec, Québec, G1P 3W8, Canada
| | - Jean Bousquet
- Canada Research Chair in Forest Genomics, Institute of Systems and Integrative Biology and Centre for Forest Research, Université Laval, 1030, Avenue de la Médecine, Québec, Québec, G1V 0A6, Canada
| |
Collapse
|
18
|
Zhao P, Zhang J, Qian C, Zhou Q, Zhao X, Chen G, Ma XF. SNP Discovery and Genetic Variation of Candidate Genes Relevant to Heat Tolerance and Agronomic Traits in Natural Populations of Sand Rice ( Agriophyllum squarrosum). FRONTIERS IN PLANT SCIENCE 2017; 8:536. [PMID: 28439286 PMCID: PMC5383723 DOI: 10.3389/fpls.2017.00536] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2016] [Accepted: 03/27/2017] [Indexed: 05/24/2023]
Abstract
The extreme stress tolerance and high nutritional value of sand rice (Agriophyllum squarrosum) make it attractive for use as an alternative crop in response to concerns about ongoing climate change and future food security. However, a lack of genetic information hinders understanding of the mechanisms underpinning the morphological and physiological adaptations of sand rice. In the present study, we sequenced and analyzed the transcriptomes of two individuals representing semi-arid [Naiman (NM)] and arid [Shapotou (SPT)] sand rice genotypes. A total of 105,868 pairwise single nucleotide polymorphisms (SNPs) distributed in 24,712 Unigenes were identified among SPT and NM samples; the average SNP frequency was 0.3% (one SNP per 333 base pair). Characterization of gene annotation demonstrated that variations in genes involved in DNA recombination were associated with the survival of the NM population in the semi-arid environment. A set of genes predicted to be relevant to heat stress response and agronomic traits was functionally annotated using the accumulated knowledge from Arabidopsis and several crop plants, including rice, barley, maize, and sorghum. Four candidate genes related to heat tolerance (heat-shock transcription factor, HsfA1d), seed size (DA1-Related, DAR1), and flowering (early flowering 3, ELF3 and late elongated hypocotyl, LHY) were subjected to analysis of the genetic diversity in 10 natural populations, representing the core germplasm resource across the area of sand rice distribution in China. Only one SNP was detected in each of HsfA1d and DAR1, among 60 genotypes, with two in ELF3 and four in LHY. Nucleotide diversity ranged from 0.00032 to 0.00118. Haplotype analysis indicated that the NM population carried a specific allele for all four genes, suggesting that divergence has occurred between NM and other populations. These four genes could be further analyzed to determine whether they are associated with phenotype variation and identify alleles favorable for sand rice breeding.
Collapse
Affiliation(s)
- Pengshan Zhao
- Key Laboratory of Stress Physiology and Ecology in Cold and Arid Regions, Northwest Institute of Eco-Environment and Resources, Chinese Academy of SciencesLanzhou, China
- Shapotou Desert Research and Experiment Station, Northwest Institute of Eco-Environment and Resources, Chinese Academy of SciencesLanzhou, China
| | - Jiwei Zhang
- Key Laboratory of Stress Physiology and Ecology in Cold and Arid Regions, Northwest Institute of Eco-Environment and Resources, Chinese Academy of SciencesLanzhou, China
| | - Chaoju Qian
- Key Laboratory of Stress Physiology and Ecology in Cold and Arid Regions, Northwest Institute of Eco-Environment and Resources, Chinese Academy of SciencesLanzhou, China
| | - Qin Zhou
- Key Laboratory of Stress Physiology and Ecology in Cold and Arid Regions, Northwest Institute of Eco-Environment and Resources, Chinese Academy of SciencesLanzhou, China
| | - Xin Zhao
- Key Laboratory of Stress Physiology and Ecology in Cold and Arid Regions, Northwest Institute of Eco-Environment and Resources, Chinese Academy of SciencesLanzhou, China
- Shapotou Desert Research and Experiment Station, Northwest Institute of Eco-Environment and Resources, Chinese Academy of SciencesLanzhou, China
| | - Guoxiong Chen
- Key Laboratory of Stress Physiology and Ecology in Cold and Arid Regions, Northwest Institute of Eco-Environment and Resources, Chinese Academy of SciencesLanzhou, China
- Shapotou Desert Research and Experiment Station, Northwest Institute of Eco-Environment and Resources, Chinese Academy of SciencesLanzhou, China
| | - Xiao-Fei Ma
- Key Laboratory of Stress Physiology and Ecology in Cold and Arid Regions, Northwest Institute of Eco-Environment and Resources, Chinese Academy of SciencesLanzhou, China
| |
Collapse
|
19
|
Scheben A, Batley J, Edwards D. Genotyping-by-sequencing approaches to characterize crop genomes: choosing the right tool for the right application. PLANT BIOTECHNOLOGY JOURNAL 2017; 15:149-161. [PMID: 27696619 PMCID: PMC5258866 DOI: 10.1111/pbi.12645] [Citation(s) in RCA: 124] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2016] [Revised: 09/24/2016] [Accepted: 09/28/2016] [Indexed: 05/18/2023]
Abstract
In the last decade, the revolution in sequencing technologies has deeply impacted crop genotyping practice. New methods allowing rapid, high-throughput genotyping of entire crop populations have proliferated and opened the door to wider use of molecular tools in plant breeding. These new genotyping-by-sequencing (GBS) methods include over a dozen reduced-representation sequencing (RRS) approaches and at least four whole-genome resequencing (WGR) approaches. The diversity of methods available, each often producing different types of data at different cost, can make selection of the best-suited method seem a daunting task. We review the most common genotyping methods used today and compare their suitability for linkage mapping, genomewide association studies (GWAS), marker-assisted and genomic selection and genome assembly and improvement in crops with various genome sizes and complexity. Furthermore, we give an outline of bioinformatics tools for analysis of genotyping data. WGR is well suited to genotyping biparental cross populations with complex, small- to moderate-sized genomes and provides the lowest cost per marker data point. RRS approaches differ in their suitability for various tasks, but demonstrate similar costs per marker data point. These approaches are generally better suited for de novo applications and more cost-effective when genotyping populations with large genomes or high heterozygosity. We expect that although RRS approaches will remain the most cost-effective for some time, WGR will become more widespread for crop genotyping as sequencing costs continue to decrease.
Collapse
Affiliation(s)
- Armin Scheben
- School of Plant Biology and Institute of AgricultureUniversity of Western AustraliaPerthWAAustralia
| | - Jacqueline Batley
- School of Plant Biology and Institute of AgricultureUniversity of Western AustraliaPerthWAAustralia
| | - David Edwards
- School of Plant Biology and Institute of AgricultureUniversity of Western AustraliaPerthWAAustralia
| |
Collapse
|
20
|
Syring JV, Tennessen JA, Jennings TN, Wegrzyn J, Scelfo-Dalbey C, Cronn R. Targeted Capture Sequencing in Whitebark Pine Reveals Range-Wide Demographic and Adaptive Patterns Despite Challenges of a Large, Repetitive Genome. FRONTIERS IN PLANT SCIENCE 2016; 7:484. [PMID: 27148310 PMCID: PMC4838605 DOI: 10.3389/fpls.2016.00484] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2016] [Accepted: 03/25/2016] [Indexed: 05/20/2023]
Abstract
Whitebark pine (Pinus albicaulis) inhabits an expansive range in western North America, and it is a keystone species of subalpine environments. Whitebark is susceptible to multiple threats - climate change, white pine blister rust, mountain pine beetle, and fire exclusion - and it is suffering significant mortality range-wide, prompting the tree to be listed as 'globally endangered' by the International Union for Conservation of Nature and 'endangered' by the Canadian government. Conservation collections (in situ and ex situ) are being initiated to preserve the genetic legacy of the species. Reliable, transferrable, and highly variable genetic markers are essential for quantifying the genetic profiles of seed collections relative to natural stands, and ensuring the completeness of conservation collections. We evaluated the use of hybridization-based target capture to enrich specific genomic regions from the 27 GB genome of whitebark pine, and to evaluate genetic variation across loci, trees, and geography. Probes were designed to capture 7,849 distinct genes, and screening was performed on 48 trees. Despite the inclusion of repetitive elements in the probe pool, the resulting dataset provided information on 4,452 genes and 32% of targeted positions (528,873 bp), and we were able to identify 12,390 segregating sites from 47 trees. Variations reveal strong geographic trends in heterozygosity and allelic richness, with trees from the southern Cascade and Sierra Range showing the greatest distinctiveness and differentiation. Our results show that even under non-optimal conditions (low enrichment efficiency; inclusion of repetitive elements in baits), targeted enrichment produces high quality, codominant genotypes from large genomes. The resulting data can be readily integrated into management and gene conservation activities for whitebark pine, and have the potential to be applied to other members of 5-needle pine group (Pinus subsect. Quinquefolia) due to their limited genetic divergence.
Collapse
Affiliation(s)
- John V. Syring
- Department of Biology, Linfield College, McMinnvilleOR, USA
- *Correspondence: John V. Syring,
| | - Jacob A. Tennessen
- Department of Integrative Biology, Oregon State University, CorvallisOR, USA
| | - Tara N. Jennings
- Department of Botany and Plant Pathology, Oregon State University, CorvallisOR, USA
| | - Jill Wegrzyn
- Department of Ecology and Evolutionary Biology, University of Connecticut, StorrsCT, USA
| | - Camille Scelfo-Dalbey
- Jack Baskin School of Engineering, University of California, Santa Cruz, Santa CruzCA, USA
| | - Richard Cronn
- Pacific Northwest Research Station, United States Department of Agriculture, Forest Service, CorvallisOR, USA
| |
Collapse
|