1
|
Weber SE, Chawla HS, Ehrig L, Hickey LT, Frisch M, Snowdon RJ. Accurate prediction of quantitative traits with failed SNP calls in canola and maize. FRONTIERS IN PLANT SCIENCE 2023; 14:1221750. [PMID: 37936929 PMCID: PMC10627008 DOI: 10.3389/fpls.2023.1221750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 10/05/2023] [Indexed: 11/09/2023]
Abstract
In modern plant breeding, genomic selection is becoming the gold standard to select superior genotypes in large breeding populations that are only partially phenotyped. Many breeding programs commonly rely on single-nucleotide polymorphism (SNP) markers to capture genome-wide data for selection candidates. For this purpose, SNP arrays with moderate to high marker density represent a robust and cost-effective tool to generate reproducible, easy-to-handle, high-throughput genotype data from large-scale breeding populations. However, SNP arrays are prone to technical errors that lead to failed allele calls. To overcome this problem, failed calls are often imputed, based on the assumption that failed SNP calls are purely technical. However, this ignores the biological causes for failed calls-for example: deletions-and there is increasing evidence that gene presence-absence and other kinds of genome structural variants can play a role in phenotypic expression. Because deletions are frequently not in linkage disequilibrium with their flanking SNPs, permutation of missing SNP calls can potentially obscure valuable marker-trait associations. In this study, we analyze published datasets for canola and maize using four parametric and two machine learning models and demonstrate that failed allele calls in genomic prediction are highly predictive for important agronomic traits. We present two statistical pipelines, based on population structure and linkage disequilibrium, that enable the filtering of failed SNP calls that are likely caused by biological reasons. For the population and trait examined, prediction accuracy based on these filtered failed allele calls was competitive to standard SNP-based prediction, underlying the potential value of missing data in genomic prediction approaches. The combination of SNPs with all failed allele calls or the filtered allele calls did not outperform predictions with only SNP-based prediction due to redundancy in genomic relationship estimates.
Collapse
Affiliation(s)
- Sven E. Weber
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | | | - Lennard Ehrig
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| | - Lee T. Hickey
- Centre for Crop Science, Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia, QLD, Australia
| | - Matthias Frisch
- Department of Biometry and Population Genetics, Justus Liebig University, Giessen, Germany
| | - Rod J. Snowdon
- Department of Plant Breeding, Justus Liebig University, Giessen, Germany
| |
Collapse
|
2
|
Dolatabadian A, Yuan Y, Bayer PE, Petereit J, Severn-Ellis A, Tirnaz S, Patel D, Edwards D, Batley J. Copy Number Variation among Resistance Genes Analogues in Brassica napus. Genes (Basel) 2022; 13:2037. [PMID: 36360273 PMCID: PMC9690292 DOI: 10.3390/genes13112037] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 10/26/2022] [Accepted: 10/31/2022] [Indexed: 10/31/2024] Open
Abstract
Copy number variations (CNVs) are defined as deletions, duplications and insertions among individuals of a species. There is growing evidence that CNV is a major factor underlining various autoimmune disorders and diseases in humans; however, in plants, especially oilseed crops, the role of CNVs in disease resistance is not well studied. Here, we investigate the genome-wide diversity and genetic properties of CNVs in resistance gene analogues (RGAs) across eight Brassica napus lines. A total of 1137 CNV events (704 deletions and 433 duplications) were detected across 563 RGAs. The results show CNVs are more likely to occur across clustered RGAs compared to singletons. In addition, 112 RGAs were linked to a blackleg resistance QTL, of which 25 were affected by CNV. Overall, we show that the presence and abundance of CNVs differ between lines, suggesting that in B. napus, the distribution of CNVs depends on genetic background. Our findings advance the understanding of CNV as an important type of genomic structural variation in B. napus and provide a resource to support breeding of advanced canola lines.
Collapse
Affiliation(s)
- Aria Dolatabadian
- School of Biological Sciences, University of Western Australia, Perth, WA 6009, Australia
| | - Yuxuan Yuan
- School of Life Sciences, The Chinese University of Hong Kong, Hong Kong, China
| | - Philipp Emanuel Bayer
- School of Biological Sciences, University of Western Australia, Perth, WA 6009, Australia
| | - Jakob Petereit
- School of Biological Sciences, University of Western Australia, Perth, WA 6009, Australia
| | - Anita Severn-Ellis
- School of Biological Sciences, University of Western Australia, Perth, WA 6009, Australia
| | - Soodeh Tirnaz
- School of Biological Sciences, University of Western Australia, Perth, WA 6009, Australia
| | - Dhwani Patel
- School of Biological Sciences, University of Western Australia, Perth, WA 6009, Australia
| | - David Edwards
- School of Biological Sciences, University of Western Australia, Perth, WA 6009, Australia
| | - Jacqueline Batley
- School of Biological Sciences, University of Western Australia, Perth, WA 6009, Australia
| |
Collapse
|
3
|
Mo Y, Jiao Y. Advances and applications of single-cell omics technologies in plant research. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 110:1551-1563. [PMID: 35426954 DOI: 10.1111/tpj.15772] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 04/08/2022] [Accepted: 04/11/2022] [Indexed: 06/14/2023]
Abstract
Single-cell sequencing approaches reveal the intracellular dynamics of individual cells and answer biological questions with high-dimensional catalogs of millions of cells, including genomics, transcriptomics, chromatin accessibility, epigenomics, and proteomics data across species. These emerging yet thriving technologies have been fully embraced by the field of plant biology, with a constantly expanding portfolio of applications. Here, we introduce the current technical advances used for single-cell omics, especially single-cell genome and transcriptome sequencing. Firstly, we overview methods for protoplast and nucleus isolation and genome and transcriptome amplification. Subsequently, we use well-executed benchmarking studies to highlight advances made through the application of single-cell omics techniques. Looking forward, we offer a glimpse of additional hurdles and future opportunities that will introduce broad adoption of single-cell sequencing with revolutionary perspectives in plant biology.
Collapse
Affiliation(s)
- Yajin Mo
- State Key Laboratory of Protein and Plant Gene Research, Peking-Tsinghua Center for Life Sciences, Center for Quantitative Biology, School of Life Sciences, Peking University, Beijing, 100871, China
- School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Yuling Jiao
- State Key Laboratory of Protein and Plant Gene Research, Peking-Tsinghua Center for Life Sciences, Center for Quantitative Biology, School of Life Sciences, Peking University, Beijing, 100871, China
- State Key Laboratory of Plant Genomics and National Center for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Beijing, 100101, China
| |
Collapse
|
4
|
Jobson E, Roberts R. Genomic structural variation in tomato and its role in plant immunity. MOLECULAR HORTICULTURE 2022; 2:7. [PMID: 37789472 PMCID: PMC10515242 DOI: 10.1186/s43897-022-00029-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Accepted: 02/22/2022] [Indexed: 10/05/2023]
Abstract
It is well known that large genomic variations can greatly impact the phenotype of an organism. Structural Variants (SVs) encompass any genomic variation larger than 30 base pairs, and include changes caused by deletions, inversions, duplications, transversions, and other genome modifications. Due to their size and complex nature, until recently, it has been difficult to truly capture these variations. Recent advances in sequencing technology and computational analyses now permit more extensive studies of SVs in plant genomes. In tomato, advances in sequencing technology have allowed researchers to sequence hundreds of genomes from tomatoes, and tomato relatives. These studies have identified SVs related to fruit size and flavor, as well as plant disease response, resistance/susceptibility, and the ability of plants to detect pathogens (immunity). In this review, we discuss the implications for genomic structural variation in plants with a focus on its role in tomato immunity. We also discuss how advances in sequencing technology have led to new discoveries of SVs in more complex genomes, the current evidence for the role of SVs in biotic and abiotic stress responses, and the outlook for genetic modification of SVs to advance plant breeding objectives.
Collapse
Affiliation(s)
- Emma Jobson
- Montana State University Extension, Montana State University, Bozeman, MT, 59717, United States
| | - Robyn Roberts
- Agricultural Biology Department, College of Agricultural Sciences, Colorado State University, Fort Collins, CO, USA.
| |
Collapse
|
5
|
Huang Y, Huang W, Meng Z, Braz GT, Li Y, Wang K, Wang H, Lai J, Jiang J, Dong Z, Jin W. Megabase-scale presence-absence variation with Tripsacum origin was under selection during maize domestication and adaptation. Genome Biol 2021; 22:237. [PMID: 34416918 PMCID: PMC8377971 DOI: 10.1186/s13059-021-02448-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 08/02/2021] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Structural variants (SVs) significantly drive genome diversity and environmental adaptation for diverse species. Unlike the prevalent small SVs (< kilobase-scale) in higher eukaryotes, large-size SVs rarely exist in the genome, but they function as one of the key evolutionary forces for speciation and adaptation. RESULTS In this study, we discover and characterize several megabase-scale presence-absence variations (PAVs) in the maize genome. Surprisingly, we identify a 3.2 Mb PAV fragment that shows high integrity and is present as complete presence or absence in the natural diversity panel. This PAV is embedded within the nucleolus organizer region (NOR), where the suppressed recombination is found to maintain the PAV against the evolutionary variation. Interestingly, by analyzing the sequence of this PAV, we not only reveal the domestication trace from teosinte to modern maize, but also the footprints of its origin from Tripsacum, shedding light on a previously unknown contribution from Tripsacum to the speciation of Zea species. The functional consequence of the Tripsacum segment migration is also investigated, and environmental fitness conferred by the PAV may explain the whole segment as a selection target during maize domestication and improvement. CONCLUSIONS These findings provide a novel perspective that Tripsacum contributes to Zea speciation, and also instantiate a strategy for evolutionary and functional analysis of the "fossil" structure variations during genome evolution and speciation.
Collapse
Affiliation(s)
- Yumin Huang
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China
| | - Wei Huang
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China
| | - Zhuang Meng
- Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps (MOE), Fujian Agriculture and Forestry University, Fuzhou, 350002, Fujian, China
| | - Guilherme Tomaz Braz
- Department of Plant Biology, Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
| | - Yunfei Li
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China
| | - Kai Wang
- Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps (MOE), Fujian Agriculture and Forestry University, Fuzhou, 350002, Fujian, China
| | - Hai Wang
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China
| | - Jinsheng Lai
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China
| | - Jiming Jiang
- Department of Plant Biology, Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
| | - Zhaobin Dong
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China.
| | - Weiwei Jin
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Key Laboratory of Crop Heterosis and Utilization (MOE), Joint International Research Laboratory of Crop Molecular Breeding (MOE), China Agricultural University, Beijing, 100193, China.
| |
Collapse
|
6
|
Schwartz C, Lenderts B, Feigenbutz L, Barone P, Llaca V, Fengler K, Svitashev S. CRISPR-Cas9-mediated 75.5-Mb inversion in maize. NATURE PLANTS 2020; 6:1427-1431. [PMID: 33299151 DOI: 10.1038/s41477-020-00817-6] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/24/2020] [Accepted: 11/04/2020] [Indexed: 05/11/2023]
Abstract
CRISPR-Cas is a powerful double-strand-break technology with wide-ranging applications from gene discovery to commercial product development. Thus far, this tool has been almost exclusively used for gene knockouts and deletions, with a few examples of gene edits and targeted gene insertions. Here, we demonstrate the application of CRISPR-Cas9 technology to mediate targeted 75.5-Mb pericentric inversion in chromosome 2 in one of the elite maize inbred lines from Corteva Agriscience. This inversion unlocks a large chromosomal region containing substantial genetic variance for recombination, thus providing opportunities for the development of new maize varieties with improved phenotypes.
Collapse
|
7
|
McNinch C, Chen K, Dennison T, Lopez M, Yandeau-Nelson MD, Lauter N. A multigenotype maize silk expression atlas reveals how exposure-related stresses are mitigated following emergence from husk leaves. THE PLANT GENOME 2020; 13:e20040. [PMID: 33090730 DOI: 10.1002/tpg2.20040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Revised: 05/11/2020] [Accepted: 06/05/2020] [Indexed: 06/11/2023]
Abstract
The extraordinarily long stigmatic silks of corn (Zea mays L.) are critical for grain production but the biology of their growth and emergence from husk leaves has remained underexplored. Accordingly, gene expression was assayed for inbreds 'B73' and 'Mo17' across five contiguous silk sections. Half of the maize genes (∼20,000) are expressed in silks, mostly in spatiotemporally dynamic patterns. In particular, emergence triggers strong differential expression of ∼1,500 genes collectively enriched for gene ontology terms associated with abiotic and biotic stress responses, hormone signaling, cell-cell communication, and defense metabolism. Further, a meta-analysis of published maize transcriptomic studies on seedling stress showed that silk emergence elicits an upregulated transcriptomic response that overlaps strongly with both abiotic and biotic stress responses. Although the two inbreds revealed similar silk transcriptomic programs overall, genotypic expression differences were observed for 5,643 B73-Mo17 syntenic gene pairs and collectively account for >50% of genome-wide expression variance. Coexpression clusters, including many based on genotypic divergence, were identified and interrogated via ontology-term enrichment analyses to generate biological hypotheses for future research. Ultimately, dissecting how gene expression changes along the length of silks and between husk-encased and emerged states offers testable models for silk development and plant response to environmental stresses.
Collapse
Affiliation(s)
- Colton McNinch
- Molecular, Cellular, and Developmental Biology Graduate Program, Iowa State Univ., Ames, IA, 50011, USA
| | - Keting Chen
- Bioinformatics & Computational Biology Graduate Program, Iowa State Univ., Ames, IA, 50011, USA
| | - Tesia Dennison
- Genetics & Genomics Graduate Program, Iowa State Univ., Ames, IA, 50011, USA
| | - Miriam Lopez
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State Univ., Ames, IA, 50011, USA
| | - Marna D Yandeau-Nelson
- Molecular, Cellular, and Developmental Biology Graduate Program, Iowa State Univ., Ames, IA, 50011, USA
- Bioinformatics & Computational Biology Graduate Program, Iowa State Univ., Ames, IA, 50011, USA
- Genetics & Genomics Graduate Program, Iowa State Univ., Ames, IA, 50011, USA
- Department of Genetics, Development and Cell Biology, Iowa State Univ., Ames, IA, 50011, USA
| | - Nick Lauter
- Molecular, Cellular, and Developmental Biology Graduate Program, Iowa State Univ., Ames, IA, 50011, USA
- Genetics & Genomics Graduate Program, Iowa State Univ., Ames, IA, 50011, USA
- USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State Univ., Ames, IA, 50011, USA
| |
Collapse
|
8
|
Zhou L, Wang C, Gao X, Ding Y, Cheng B, Zhang G, Cao N, Xu Y, Shao M, Zhang L. Genome-wide variations analysis of sorghum cultivar Hongyingzi for brewing Moutai liquor. Hereditas 2020; 157:19. [PMID: 32410666 PMCID: PMC7227080 DOI: 10.1186/s41065-020-00130-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 04/14/2020] [Indexed: 11/28/2022] Open
Abstract
Background Hongyingzi is a sorghum (Sorghum bicolor L. Moench) cultivar for brewing Moutai liquor. For an overall understanding of the whole genome of Hongyingzi, we performed whole-genome resequencing technology to reveal its comprehensive variations. Results Compared with the BTx623 reference genome, we uncovered 1,885,774 single nucleotide polymorphisms (SNPs), 309,381 small fragments insertions and deletions (Indels), 31,966 structural variations (SVs), and 217,273 copy number variations (CNVs). These alterations conferred 29,614 gene variations. It was also predicted that 35 gene variations were related to the multidrug and toxic efflux (MATE) transporter, chalcone synthase (CHS), ATPase isoform 10 (AHA10) transporter, dihydroflavonol-4-reductase (DFR), the laccase 15 (LAC15), flavonol 3′-hydroxylase (F3′H), flavanone 3-hydroxylase (F3H), O-methyltransferase (OMT), flavonoid 3′5′ hydroxylase (F3′5′H), UDP-glucose:sterol-glucosyltransferase (SGT), flavonol synthase (FLS), and chalcone isomerase (CHI) involved in the tannin synthesis. Conclusions These results would provide theoretical supports for the molecular markers developments and gene function studies related to the tannin synthesis, and the genetic improvement of liquor-making sorghum based on the genome editing technology.
Collapse
Affiliation(s)
- Lingbo Zhou
- Institute of Upland Food Crops, Guizhou Academy of Agricultural Sciences, Guiyang, 550006, Guizhou, China
| | - Can Wang
- Institute of Upland Food Crops, Guizhou Academy of Agricultural Sciences, Guiyang, 550006, Guizhou, China
| | - Xu Gao
- Institute of Upland Food Crops, Guizhou Academy of Agricultural Sciences, Guiyang, 550006, Guizhou, China
| | - Yanqing Ding
- Institute of Upland Food Crops, Guizhou Academy of Agricultural Sciences, Guiyang, 550006, Guizhou, China
| | - Bin Cheng
- Institute of Upland Food Crops, Guizhou Academy of Agricultural Sciences, Guiyang, 550006, Guizhou, China
| | - Guobing Zhang
- Institute of Upland Food Crops, Guizhou Academy of Agricultural Sciences, Guiyang, 550006, Guizhou, China
| | - Ning Cao
- Institute of Upland Food Crops, Guizhou Academy of Agricultural Sciences, Guiyang, 550006, Guizhou, China
| | - Yan Xu
- Institute of Upland Food Crops, Guizhou Academy of Agricultural Sciences, Guiyang, 550006, Guizhou, China
| | - Mingbo Shao
- Institute of Upland Food Crops, Guizhou Academy of Agricultural Sciences, Guiyang, 550006, Guizhou, China
| | - Liyi Zhang
- Institute of Upland Food Crops, Guizhou Academy of Agricultural Sciences, Guiyang, 550006, Guizhou, China.
| |
Collapse
|
9
|
Gabur I, Chawla HS, Lopisso DT, von Tiedemann A, Snowdon RJ, Obermeier C. Gene presence-absence variation associates with quantitative Verticillium longisporum disease resistance in Brassica napus. Sci Rep 2020; 10:4131. [PMID: 32139810 PMCID: PMC7057980 DOI: 10.1038/s41598-020-61228-3] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Accepted: 02/07/2020] [Indexed: 12/31/2022] Open
Abstract
Although copy number variation (CNV) and presence-absence variation (PAV) have been discovered in selected gene families in most crop species, the global prevalence of these polymorphisms in most complex genomes is still unclear and their influence on quantitatively inherited agronomic traits is still largely unknown. Here we analyze the association of gene PAV with resistance of oilseed rape (Brassica napus) against the important fungal pathogen Verticillium longisporum, as an example for a complex, quantitative disease resistance in the strongly rearranged genome of a recent allopolyploid crop species. Using Single Nucleotide absence Polymorphism (SNaP) markers to efficiently trace PAV in breeding populations, we significantly increased the resolution of loci influencing V. longisporum resistance in biparental and multi-parental mapping populations. Gene PAV, assayed by resequencing mapping parents, was observed in 23-51% of the genes within confidence intervals of quantitative trait loci (QTL) for V. longisporum resistance, and high-priority candidate genes identified within QTL were all affected by PAV. The results demonstrate the prominent role of gene PAV in determining agronomic traits, suggesting that this important class of polymorphism should be exploited more systematically in future plant breeding.
Collapse
Affiliation(s)
- Iulian Gabur
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University Giessen, 35392, Giessen, Germany
| | - Harmeet Singh Chawla
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University Giessen, 35392, Giessen, Germany
| | - Daniel Teshome Lopisso
- Section of General Plant Pathology and Crop Protection, Georg August University Göttingen, 37077, Göttingen, Germany
- College of Agriculture and Veterinary Medicine, Jimma University, Jimma, Ethiopia
| | - Andreas von Tiedemann
- Section of General Plant Pathology and Crop Protection, Georg August University Göttingen, 37077, Göttingen, Germany
| | - Rod J Snowdon
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University Giessen, 35392, Giessen, Germany
| | - Christian Obermeier
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University Giessen, 35392, Giessen, Germany.
| |
Collapse
|
10
|
Kyriakidou M, Achakkagari SR, Gálvez López JH, Zhu X, Tang CY, Tai HH, Anglin NL, Ellis D, Strömvik MV. Structural genome analysis in cultivated potato taxa. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2020; 133:951-966. [PMID: 31893289 PMCID: PMC7021743 DOI: 10.1007/s00122-019-03519-6] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2019] [Accepted: 12/18/2019] [Indexed: 05/13/2023]
Abstract
KEY MESSAGE Twelve potato accessions were selected to represent two principal views on potato taxonomy. The genomes were sequenced and analyzed for structural variation (copy number variation) against three published potato genomes. The common potato (Solanum tuberosum L.) is an important staple crop with a highly heterozygous and complex tetraploid genome. The other taxa of cultivated potato contain varying ploidy levels (2X-5X), and structural variations are common in the genomes of these species, likely contributing to the diversification or agronomic traits during domestication. Increased understanding of the genomes and genomic variation will aid in the exploration of novel agronomic traits. Thus, sequencing data from twelve potato landraces, representing the four ploidy levels, were used to identify structural genomic variation compared to the two currently available reference genomes, a double monoploid potato genome and a diploid inbred clone of S. chacoense. The results of a copy number variation analysis showed that in the majority of the genomes, while the number of deletions is greater than the number of duplications, the number of duplicated genes is greater than the number of deleted ones. Specific regions in the twelve potato genomes have a high density of CNV events. Further, the auxin-induced SAUR genes (involved in abiotic stress), disease resistance genes and the 2-oxoglutarate/Fe(II)-dependent oxygenase superfamily proteins, among others, had increased copy numbers in these sequenced genomes relative to the references.
Collapse
Affiliation(s)
- Maria Kyriakidou
- Department of Plant Science, McGill University, Sainte-Anne-de-Bellevue, Montreal, QC, H9X 3V9, Canada
| | - Sai Reddy Achakkagari
- Department of Plant Science, McGill University, Sainte-Anne-de-Bellevue, Montreal, QC, H9X 3V9, Canada
| | - José Héctor Gálvez López
- Department of Plant Science, McGill University, Sainte-Anne-de-Bellevue, Montreal, QC, H9X 3V9, Canada
| | - Xinyi Zhu
- Department of Plant Science, McGill University, Sainte-Anne-de-Bellevue, Montreal, QC, H9X 3V9, Canada
| | - Chen Yu Tang
- Department of Plant Science, McGill University, Sainte-Anne-de-Bellevue, Montreal, QC, H9X 3V9, Canada
| | - Helen H Tai
- Fredericton Research and Development Centre, Agriculture and Agri-Food Canada, Fredericton, Canada
| | | | | | - Martina V Strömvik
- Department of Plant Science, McGill University, Sainte-Anne-de-Bellevue, Montreal, QC, H9X 3V9, Canada.
| |
Collapse
|
11
|
Falque M, Jebreen K, Paux E, Knaak C, Mezmouk S, Martin OC. CNVmap: A Method and Software To Detect and Map Copy Number Variants from Segregation Data. Genetics 2020; 214:561-576. [PMID: 31882400 PMCID: PMC7054022 DOI: 10.1534/genetics.119.302881] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Accepted: 12/23/2019] [Indexed: 01/22/2023] Open
Abstract
Single nucleotide polymorphisms (SNPs) are used widely for detecting quantitative trait loci, or for searching for causal variants of diseases. Nevertheless, structural variations such as copy-number variants (CNVs) represent a large part of natural genetic diversity, and contribute significantly to trait variation. Numerous methods and softwares based on different technologies (amplicons, CGH, tiling, or SNP arrays, or sequencing) have already been developed to detect CNVs, but they bypass a wealth of information such as genotyping data from segregating populations, produced, e.g., for QTL mapping. Here, we propose an original method to both detect and genetically map CNVs using mapping panels. Specifically, we exploit the apparent heterozygous state of duplicated loci: peaks in appropriately defined genome-wide allelic profiles provide highly specific signatures that identify the nature and position of the CNVs. Our original method and software can detect and map automatically up to 33 different predefined types of CNVs based on segregation data only. We validate this approach on simulated and experimental biparental mapping panels in two maize populations and one wheat population. Most of the events found correspond to having just one extra copy in one of the parental lines, but the corresponding allelic value can be that of either parent. We also find cases with two or more additional copies, especially in wheat, where these copies locate to homeologues. More generally, our computational tool can be used to give additional value, at no cost, to many datasets produced over the past decade from genetic mapping panels.
Collapse
Affiliation(s)
- Matthieu Falque
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190 Gif-sur-Yvette, France
| | - Kamel Jebreen
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190 Gif-sur-Yvette, France
- Department of Mathematics, An-Najah National University, Nablus, Palestine
| | - Etienne Paux
- Université Clermont Auvergne, INRAE, GDEC, 63000 Clermont-Ferrand, France
| | | | | | - Olivier C Martin
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190 Gif-sur-Yvette, France
| |
Collapse
|
12
|
Mabire C, Duarte J, Darracq A, Pirani A, Rimbert H, Madur D, Combes V, Vitte C, Praud S, Rivière N, Joets J, Pichon JP, Nicolas SD. High throughput genotyping of structural variations in a complex plant genome using an original Affymetrix® axiom® array. BMC Genomics 2019; 20:848. [PMID: 31722668 PMCID: PMC6854671 DOI: 10.1186/s12864-019-6136-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Accepted: 09/23/2019] [Indexed: 12/19/2022] Open
Abstract
Background Insertions/deletions (InDels) and more specifically presence/absence variations (PAVs) are pervasive in several species and have strong functional and phenotypic effect by removing or drastically modifying genes. Genotyping of such variants on large panels remains poorly addressed, while necessary for approaches such as association mapping or genomic selection. Results We have developed, as a proof of concept, a new high-throughput and affordable approach to genotype InDels. We first identified 141,000 InDels by aligning reads from the B73 line against the genome of three temperate maize inbred lines (F2, PH207, and C103) and reciprocally. Next, we designed an Affymetrix® Axiom® array to target these InDels, with a combination of probes selected at breakpoint sites (13%) or within the InDel sequence, either at polymorphic (25%) or non-polymorphic sites (63%) sites. The final array design is composed of 662,772 probes and targets 105,927 InDels, including PAVs ranging from 35 bp to 129kbp. After Affymetrix® quality control, we successfully genotyped 86,648 polymorphic InDels (82% of all InDels interrogated by the array) on 445 maize DNA samples with 422,369 probes. Genotyping InDels using this approach produced a highly reliable dataset, with low genotyping error (~ 3%), high call rate (~ 98%), and high reproducibility (> 95%). This reliability can be further increased by combining genotyping of several probes calling the same InDels (< 0.1% error rate and > 99.9% of call rate for 5 probes). This “proof of concept” tool was used to estimate the kinship matrix between 362 maize lines with 57,824 polymorphic InDels. This InDels kinship matrix was highly correlated with kinship estimated using SNPs from Illumina 50 K SNP arrays. Conclusions We efficiently genotyped thousands of small to large InDels on a sizeable number of individuals using a new Affymetrix® Axiom® array. This powerful approach opens the way to studying the contribution of InDels to trait variation and heterosis in maize. The approach is easily extendable to other species and should contribute to decipher the biological impact of InDels at a larger scale.
Collapse
Affiliation(s)
- Clément Mabire
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Jorge Duarte
- Biogemma - Centre de Recherche de Chappes, CS 90126, 63720, Chappes, France
| | - Aude Darracq
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Ali Pirani
- Thermo Fisher Scientific, 3450 Central Expressway, Santa Clara, CA, 95051, USA
| | - Hélène Rimbert
- Biogemma - Centre de Recherche de Chappes, CS 90126, 63720, Chappes, France.,Present address: GDEC, INRA, Université Clermont Auvergne, 63000, Clermont-Ferrand, France
| | - Delphine Madur
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Valérie Combes
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Clémentine Vitte
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Sébastien Praud
- Biogemma - Centre de Recherche de Chappes, CS 90126, 63720, Chappes, France
| | - Nathalie Rivière
- Biogemma - Centre de Recherche de Chappes, CS 90126, 63720, Chappes, France
| | - Johann Joets
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | | | - Stéphane D Nicolas
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France.
| |
Collapse
|
13
|
Penning BW, Shiga TM, Klimek JF, SanMiguel PJ, Shreve J, Thimmapuram J, Sykes RW, Davis MF, McCann MC, Carpita NC. Expression profiles of cell-wall related genes vary broadly between two common maize inbreds during stem development. BMC Genomics 2019; 20:785. [PMID: 31664907 PMCID: PMC6819468 DOI: 10.1186/s12864-019-6117-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Accepted: 09/20/2019] [Indexed: 11/23/2022] Open
Abstract
Background The cellular machinery for cell wall synthesis and metabolism is encoded by members of large multi-gene families. Maize is both a genetic model for grass species and a potential source of lignocellulosic biomass from crop residues. Genetic improvement of maize for its utility as a bioenergy feedstock depends on identification of the specific gene family members expressed during secondary wall development in stems. Results High-throughput sequencing of transcripts expressed in developing rind tissues of stem internodes provided a comprehensive inventory of cell wall-related genes in maize (Zea mays, cultivar B73). Of 1239 of these genes, 854 were expressed among the internodes at ≥95 reads per 20 M, and 693 of them at ≥500 reads per 20 M. Grasses have cell wall compositions distinct from non-commelinid species; only one-quarter of maize cell wall-related genes expressed in stems were putatively orthologous with those of the eudicot Arabidopsis. Using a slope-metric algorithm, five distinct patterns for sub-sets of co-expressed genes were defined across a time course of stem development. For the subset of genes associated with secondary wall formation, fifteen sequence motifs were found in promoter regions. The same members of gene families were often expressed in two maize inbreds, B73 and Mo17, but levels of gene expression between them varied, with 30% of all genes exhibiting at least a 5-fold difference at any stage. Although presence-absence and copy-number variation might account for much of these differences, fold-changes of expression of a CADa and a FLA11 gene were attributed to polymorphisms in promoter response elements. Conclusions Large genetic variation in maize as a species precludes the extrapolation of cell wall-related gene expression networks even from one common inbred line to another. Elucidation of genotype-specific expression patterns and their regulatory controls will be needed for association panels of inbreds and landraces to fully exploit genetic variation in maize and other bioenergy grass species.
Collapse
Affiliation(s)
- Bryan W Penning
- Department of Botany & Plant Pathology, Purdue University, 915 West State Street, West Lafayette, IN, 47907, USA.,Department of Biological Sciences, Purdue University, 915 West State Street, West Lafayette, IN, 47907, USA.,, Present Address: USDA-ARS, Wheat Quality Research Unit, 1680 Madison Avenue, Wooster, OH, 44691, USA
| | - Tânia M Shiga
- Department of Botany & Plant Pathology, Purdue University, 915 West State Street, West Lafayette, IN, 47907, USA.,Present Address: Departamento de Alimentos e Nutrição Experimental, FCF-USP F, 3091-3647 / 3091-3007, Av. Prof. Lineu Prestes, 580 - BL-14 CEP 05508-000, Butantã, Sâo Paulo, SP, Brazil
| | - John F Klimek
- Department of Botany & Plant Pathology, Purdue University, 915 West State Street, West Lafayette, IN, 47907, USA
| | - Philip J SanMiguel
- Genomics Core Facility, Purdue University, 170 South University Street, Purdue University, West Lafayette, IN, 47907, USA
| | - Jacob Shreve
- Bioinformatics Core Facility, Purdue University, 155 South Grant Street, West Lafayette, IN, 47907, USA.,, Present Address: Department of Internal Medicine, Cleveland Clinic, 9500 Euclid Ave, Cleveland, OH, 44195, USA
| | - Jyothi Thimmapuram
- Present Address: Departamento de Alimentos e Nutrição Experimental, FCF-USP F, 3091-3647 / 3091-3007, Av. Prof. Lineu Prestes, 580 - BL-14 CEP 05508-000, Butantã, Sâo Paulo, SP, Brazil.,Bioinformatics Core Facility, Purdue University, 155 South Grant Street, West Lafayette, IN, 47907, USA
| | - Robert W Sykes
- National Bioenergy Center, National Renewable Energy Laboratory, Golden, CO, 80401, USA.,, Present Address: Los Alamos National Laboratory, P.O. Box 1663, Los Alamos, NM, Los Alamos, NM, 87545, USA
| | - Mark F Davis
- National Bioenergy Center, National Renewable Energy Laboratory, Golden, CO, 80401, USA
| | - Maureen C McCann
- Department of Biological Sciences, Purdue University, 915 West State Street, West Lafayette, IN, 47907, USA.,Purdue Center for Plant Biology, West Lafayette, USA
| | - Nicholas C Carpita
- Department of Botany & Plant Pathology, Purdue University, 915 West State Street, West Lafayette, IN, 47907, USA. .,Department of Biological Sciences, Purdue University, 915 West State Street, West Lafayette, IN, 47907, USA. .,Purdue Center for Plant Biology, West Lafayette, USA.
| |
Collapse
|
14
|
Gabur I, Chawla HS, Snowdon RJ, Parkin IAP. Connecting genome structural variation with complex traits in crop plants. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:733-750. [PMID: 30448864 DOI: 10.1007/s00122-018-3233-0] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Accepted: 11/07/2018] [Indexed: 05/05/2023]
Abstract
Structural genome variation is a major determinant of useful trait diversity. We describe how genome analysis methods are enabling discovery of trait-associated structural variants and their potential impact on breeding. As our understanding of complex crop genomes continues to grow, there is growing evidence that structural genome variation plays a major role in determining traits important for breeding and agriculture. Identifying the extent and impact of structural variants in crop genomes is becoming increasingly feasible with ongoing advances in the sophistication of genome sequencing technologies, particularly as it becomes easier to generate accurate long sequence reads on a genome-wide scale. In this article, we discuss the origins of structural genome variation in crops from ancient and recent genome duplication and polyploidization events and review high-throughput methods to assay such variants in crop populations in order to find associations with phenotypic traits. There is increasing evidence from such studies that gene presence-absence and copy number variation resulting from segmental chromosome exchanges may be at the heart of adaptive variation of crops to counter abiotic and biotic stress factors. We present examples from major crops that demonstrate the potential of pangenomic diversity as a key resource for future plant breeding for resilience and sustainability.
Collapse
Affiliation(s)
- Iulian Gabur
- Department of Plant Breeding, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
| | - Harmeet Singh Chawla
- Department of Plant Breeding, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
| | - Rod J Snowdon
- Department of Plant Breeding, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany.
| | - Isobel A P Parkin
- Agriculture and Agri-Food Canada, 107 Science Place, Saskatoon, SK, S7N OX2, Canada
| |
Collapse
|
15
|
Wang X, Lebarbier E, Aubert J, Robin S. Variational Inference for Coupled Hidden Markov Models Applied to the Joint Detection of Copy Number Variations. Int J Biostat 2019; 15:/j/ijb.ahead-of-print/ijb-2018-0023/ijb-2018-0023.xml. [PMID: 30779702 DOI: 10.1515/ijb-2018-0023] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2018] [Accepted: 11/21/2018] [Indexed: 02/04/2023]
Abstract
Hidden Markov models provide a natural statistical framework for the detection of the copy number variations (CNV) in genomics. In this context, we define a hidden Markov process that underlies all individuals jointly in order to detect and to classify genomics regions in different states (typically, deletion, normal or amplification). Structural variations from different individuals may be dependent. It is the case in agronomy where varietal selection program exists and species share a common phylogenetic past. We propose to take into account these dependencies inthe HMM model. When dealing with a large number of series, maximum likelihood inference (performed classically using the EM algorithm) becomes intractable. We thus propose an approximate inference algorithm based on a variational approach (VEM), implemented in the CHMM R package. A simulation study is performed to assess the performance of the proposed method and an application to the detection of structural variations in plant genomes is presented.
Collapse
Affiliation(s)
- Xiaoqiang Wang
- School of Mathematics and Statistics, Shandong University (Weihai), Weihai,Shandong, China
| | - Emilie Lebarbier
- UMR MIA-Paris, AgroParisTech, INRA, Université Paris-Saclay, Paris, France
| | - Julie Aubert
- UMR MIA-Paris, AgroParisTech, INRA, Université Paris-Saclay, Paris, France
| | - Stéphane Robin
- UMR MIA-Paris, AgroParisTech, INRA, Université Paris-Saclay, Paris, France
| |
Collapse
|
16
|
Lyra DH, Galli G, Alves FC, Granato ÍSC, Vidotti MS, Bandeira E Sousa M, Morosini JS, Crossa J, Fritsche-Neto R. Modeling copy number variation in the genomic prediction of maize hybrids. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:273-288. [PMID: 30382311 DOI: 10.1007/s00122-018-3215-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Accepted: 10/20/2018] [Indexed: 06/08/2023]
Abstract
Our study indicates that copy variants may play an essential role in the phenotypic variation of complex traits in maize hybrids. Moreover, predicting hybrid phenotypes by combining additive-dominance effects with copy variants has the potential to be a viable predictive model. Non-additive effects resulting from the actions of multiple loci may influence trait variation in single-cross hybrids. In addition, complementation of allelic variation could be a valuable contributor to hybrid genetic variation, especially when crossing inbred lines with higher contents of copy gains. With this in mind, we aimed (1) to study the association between copy number variation (CNV) and hybrid phenotype, and (2) to compare the predictive ability (PA) of additive and additive-dominance genomic best linear unbiased prediction model when combined with the effects of CNV in two datasets of maize hybrids (USP and HELIX). In the USP dataset, we observed a significant negative phenotypic correlation of low magnitude between copy number loss and plant height, revealing a tendency that more copy losses lead to lower plants. In the same set, when CNV was combined with the additive plus dominance effects, the PA significantly increased only for plant height under low nitrogen. In this case, CNV effects explicitly capture relatedness between individuals and add extra information to the model. In the HELIX dataset, we observed a pronounced difference in PA between additive (0.50) and additive-dominance (0.71) models for predicting grain yield, suggesting a significant contribution of dominance. We conclude that copy variants may play an essential role in the phenotypic variation of complex traits in maize hybrids, although the inclusion of CNVs into datasets does not return significant gains concerning PA.
Collapse
Affiliation(s)
- Danilo Hottis Lyra
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil.
- Department of Computational and Analytical Sciences, Rothamsted Research, West Common, Harpenden, AL52JQ, UK.
| | - Giovanni Galli
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| | - Filipe Couto Alves
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| | - Ítalo Stefanine Correia Granato
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| | - Miriam Suzane Vidotti
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| | - Massaine Bandeira E Sousa
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| | - Júlia Silva Morosini
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| | - José Crossa
- Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), 06600, Texcoco, D.F, Mexico
| | - Roberto Fritsche-Neto
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| |
Collapse
|
17
|
Razali R, Bougouffa S, Morton MJL, Lightfoot DJ, Alam I, Essack M, Arold ST, Kamau AA, Schmöckel SM, Pailles Y, Shahid M, Michell CT, Al-Babili S, Ho YS, Tester M, Bajic VB, Negrão S. The Genome Sequence of the Wild Tomato Solanum pimpinellifolium Provides Insights Into Salinity Tolerance. FRONTIERS IN PLANT SCIENCE 2018; 9:1402. [PMID: 30349549 PMCID: PMC6186997 DOI: 10.3389/fpls.2018.01402] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2018] [Accepted: 09/04/2018] [Indexed: 05/19/2023]
Abstract
Solanum pimpinellifolium, a wild relative of cultivated tomato, offers a wealth of breeding potential for desirable traits such as tolerance to abiotic and biotic stresses. Here, we report the genome assembly and annotation of S. pimpinellifolium 'LA0480.' Moreover, we present phenotypic data from one field experiment that demonstrate a greater salinity tolerance for fruit- and yield-related traits in S. pimpinellifolium compared with cultivated tomato. The 'LA0480' genome assembly size (811 Mb) and the number of annotated genes (25,970) are within the range observed for other sequenced tomato species. We developed and utilized the Dragon Eukaryotic Analyses Platform (DEAP) to functionally annotate the 'LA0480' protein-coding genes. Additionally, we used DEAP to compare protein function between S. pimpinellifolium and cultivated tomato. Our data suggest enrichment in genes involved in biotic and abiotic stress responses. To understand the genomic basis for these differences in S. pimpinellifolium and S. lycopersicum, we analyzed 15 genes that have previously been shown to mediate salinity tolerance in plants. We show that S. pimpinellifolium has a higher copy number of the inositol-3-phosphate synthase and phosphatase genes, which are both key enzymes in the production of inositol and its derivatives. Moreover, our analysis indicates that changes occurring in the inositol phosphate pathway may contribute to the observed higher salinity tolerance in 'LA0480.' Altogether, our work provides essential resources to understand and unlock the genetic and breeding potential of S. pimpinellifolium, and to discover the genomic basis underlying its environmental robustness.
Collapse
Affiliation(s)
- Rozaimi Razali
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Salim Bougouffa
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Mitchell J. L. Morton
- Division of Biological and Environmental Sciences and Engineering, The Bioactives Lab, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Damien J. Lightfoot
- Division of Biological and Environmental Sciences and Engineering, The Bioactives Lab, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Intikhab Alam
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
- Division of Computer, Electrical and Mathematical Science and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Magbubah Essack
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Stefan T. Arold
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
- Division of Biological and Environmental Sciences and Engineering, The Bioactives Lab, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Allan A. Kamau
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
- Division of Computer, Electrical and Mathematical Science and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Sandra M. Schmöckel
- Division of Biological and Environmental Sciences and Engineering, The Bioactives Lab, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Yveline Pailles
- Division of Biological and Environmental Sciences and Engineering, The Bioactives Lab, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Mohammed Shahid
- International Center for Biosaline Agriculture, Dubai, United Arab Emirates
| | - Craig T. Michell
- Red Sea Research Center, Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Salim Al-Babili
- Division of Biological and Environmental Sciences and Engineering, The Bioactives Lab, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Yung Shwen Ho
- Division of Biological and Environmental Sciences and Engineering, The Bioactives Lab, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Mark Tester
- Division of Biological and Environmental Sciences and Engineering, The Bioactives Lab, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Vladimir B. Bajic
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
- Division of Computer, Electrical and Mathematical Science and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Sónia Negrão
- Division of Biological and Environmental Sciences and Engineering, The Bioactives Lab, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| |
Collapse
|
18
|
Prunier J, Giguère I, Ryan N, Guy R, Soolanayakanahally R, Isabel N, MacKay J, Porth I. Gene copy number variations involved in balsam poplar (Populus balsamifera L.) adaptive variations. Mol Ecol 2018; 28:1476-1490. [PMID: 30270494 DOI: 10.1111/mec.14836] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Revised: 07/26/2018] [Accepted: 07/30/2018] [Indexed: 12/12/2022]
Abstract
Gene copy number variations (CNVs) involved in phenotypic variations have already been shown in plants, but genomewide testing of CNVs for adaptive variation was not doable until recent technological developments. Thus, reports of the genomic architecture of adaptation involving CNVs remain scarce to date. Here, we investigated F1 progenies of an intraprovenance cross (north-north cross, 58th parallel) and an interprovenances cross (north-south cross, 58th/49th parallels) for CNVs using comparative genomic hybridization on arrays of probes targeting gene sequences in balsam poplar (Populus balsamifera L.), a widespread North American forest tree. A total of 1,721 genes were found in varying copy numbers over the set of 19,823 tested genes. These gene CNVs presented an estimated average size of 8.3 kb and were distributed over poplar's 19 chromosomes including 22 hotspot regions. Gene CNVs number was higher for the interprovenance progeny in accordance with an expected higher genetic diversity related to the composite origin of this family. Regression analyses between gene CNVs and seven adaptive trait variations resulted in 23 significant links; among these adaptive gene CNVs, 30% were located in hotspots. One-to-five gene CNVs were found related to each of the measured adaptive traits and annotated for both biotic and abiotic stress responses. These annotations can be related to the occurrence of a higher pathogenic pressure in the southern parts of balsam poplar's distribution, and higher photosynthetic assimilation rates and water-use efficiency at high latitudes. Overall, our findings suggest that gene CNVs typically having higher mutation rates than SNPs may in fact represent efficient adaptive variations against fast-evolving pathogens.
Collapse
Affiliation(s)
- Julien Prunier
- Institute for System and Integrated Biology (IBIS), Université Laval, Québec, Québec, Canada.,Centre for Forest Research, Université Laval, Québec, Quebec, Canada
| | - Isabelle Giguère
- Institute for System and Integrated Biology (IBIS), Université Laval, Québec, Québec, Canada.,Centre for Forest Research, Université Laval, Québec, Quebec, Canada
| | - Natalie Ryan
- Department of Forest and Conservation Sciences, University of British Columbia, Vancouver, British Columbia, Canada
| | - Robert Guy
- Department of Forest and Conservation Sciences, University of British Columbia, Vancouver, British Columbia, Canada
| | - Raju Soolanayakanahally
- Indian Head Research Farm, Agriculture and Agri-Food Canada, Indian Head, Saskatchewan, Canada
| | - Nathalie Isabel
- Laurentian Forest Centre, Canadian Forest Service, Natural Resources Canada, Québec, Quebec, Canada
| | - John MacKay
- Department of Plant Sciences, University of Oxford, Oxford, UK
| | - Ilga Porth
- Institute for System and Integrated Biology (IBIS), Université Laval, Québec, Québec, Canada.,Centre for Forest Research, Université Laval, Québec, Quebec, Canada
| |
Collapse
|
19
|
Extensive intraspecific gene order and gene structural variations between Mo17 and other maize genomes. Nat Genet 2018; 50:1289-1295. [PMID: 30061735 DOI: 10.1038/s41588-018-0182-0] [Citation(s) in RCA: 233] [Impact Index Per Article: 38.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2017] [Accepted: 06/05/2018] [Indexed: 12/24/2022]
Abstract
Maize is an important crop with a high level of genome diversity and heterosis. The genome sequence of a typical female line, B73, was previously released. Here, we report a de novo genome assembly of a corresponding male representative line, Mo17. More than 96.4% of the 2,183 Mb assembled genome can be accounted for by 362 scaffolds in ten pseudochromosomes with 38,620 annotated protein-coding genes. Comparative analysis revealed large gene-order and gene structural variations: approximately 10% of the annotated genes were mutually nonsyntenic, and more than 20% of the predicted genes had either large-effect mutations or large structural variations, which might cause considerable protein divergence between the two inbred lines. Our study provides a high-quality reference-genome sequence of an important maize germplasm, and the intraspecific gene order and gene structural variations identified should have implications for heterosis and genome evolution.
Collapse
|
20
|
Hu Y, Ren J, Peng Z, Umana AA, Le H, Danilova T, Fu J, Wang H, Robertson A, Hulbert SH, White FF, Liu S. Analysis of Extreme Phenotype Bulk Copy Number Variation (XP-CNV) Identified the Association of rp1 with Resistance to Goss's Wilt of Maize. FRONTIERS IN PLANT SCIENCE 2018; 9:110. [PMID: 29479358 PMCID: PMC5812337 DOI: 10.3389/fpls.2018.00110] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Accepted: 01/19/2018] [Indexed: 05/19/2023]
Abstract
Goss's wilt (GW) of maize is caused by the Gram-positive bacterium Clavibacter michiganensis subsp. nebraskensis (Cmn) and has spread in recent years throughout the Great Plains, posing a threat to production. The genetic basis of plant resistance is unknown. Here, a simple method for quantifying disease symptoms was developed and used to select cohorts of highly resistant and highly susceptible lines known as extreme phenotypes (XP). Copy number variation (CNV) analyses using whole genome sequences of bulked XP revealed 141 genes containing CNV between the two XP groups. The CNV genes include the previously identified common rust resistant locus rp1. Multiple Rp1 accessions with distinct rp1 haplotypes in an otherwise susceptible accession exhibited hypersensitive responses upon inoculation. GW provides an excellent system for the genetic dissection of diseases caused by closely related subspecies of C. michiganesis. Further work will facilitate breeding strategies to control GW and provide needed insight into the resistance mechanism of important related diseases such as bacterial canker of tomato and bacterial ring rot of potato.
Collapse
Affiliation(s)
- Ying Hu
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| | - Jie Ren
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| | - Zhao Peng
- Department of Plant Pathology, University of Florida, Gainesville, FL, United States
| | - Arnoldo A. Umana
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| | - Ha Le
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| | - Tatiana Danilova
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| | - Junjie Fu
- Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Haiyan Wang
- Department of Statistics, Kansas State University, Manhattan, KS, United States
| | - Alison Robertson
- Department of Plant Pathology and Microbiology, Iowa State University, Ames, IA, United States
| | - Scot H. Hulbert
- Department of Plant Pathology, Washington State University, Pullman, WA, United States
| | - Frank F. White
- Department of Plant Pathology, University of Florida, Gainesville, FL, United States
| | - Sanzhen Liu
- Department of Plant Pathology, Kansas State University, Manhattan, KS, United States
| |
Collapse
|
21
|
Darracq A, Vitte C, Nicolas S, Duarte J, Pichon JP, Mary-Huard T, Chevalier C, Bérard A, Le Paslier MC, Rogowsky P, Charcosset A, Joets J. Sequence analysis of European maize inbred line F2 provides new insights into molecular and chromosomal characteristics of presence/absence variants. BMC Genomics 2018; 19:119. [PMID: 29402214 PMCID: PMC5800051 DOI: 10.1186/s12864-018-4490-7] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2017] [Accepted: 01/22/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Maize is well known for its exceptional structural diversity, including copy number variants (CNVs) and presence/absence variants (PAVs), and there is growing evidence for the role of structural variation in maize adaptation. While PAVs have been described in this important crop species, they have been only scarcely characterized at the sequence level and the extent of presence/absence variation and relative chromosomal landscape of inbred-specific regions remain to be elucidated. RESULTS De novo genome sequencing of the French F2 maize inbred line revealed 10,044 novel genomic regions larger than 1 kb, making up 88 Mb of DNA, that are present in F2 but not in B73 (PAV). This set of maize PAV sequences allowed us to annotate PAV content and to analyze sequence breakpoints. Using PAV genotyping on a collection of 25 temperate lines, we also analyzed Linkage Disequilibrium in PAVs and flanking regions, and PAV frequencies within maize genetic groups. CONCLUSIONS We highlight the possible role of MMEJ-type double strand break repair in maize PAV formation and discover 395 new genes with transcriptional support. Pattern of linkage disequilibrium within PAVs strikingly differs from this of flanking regions and is in accordance with the intuition that PAVs may recombine less than other genomic regions. We show that most PAVs are ancient, while some are found only in European Flint material, thus pinpointing structural features that may be at the origin of adaptive traits involved in the success of this material. Characterization of such PAVs will provide useful material for further association genetic studies in European and temperate maize.
Collapse
Affiliation(s)
- Aude Darracq
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Clémentine Vitte
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Stéphane Nicolas
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | | | | | - Tristan Mary-Huard
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
- MIA, INRA, AgroParisTech, Université Paris-Saclay, Paris, France
| | - Céline Chevalier
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Aurélie Bérard
- EPGV US 1279, INRA, CEA, IG-CNG, Université Paris-Saclay, Evry, France
| | | | - Peter Rogowsky
- Laboratoire Reproduction et Développement des Plantes, Univ Lyon, ENS de Lyon, UCB Lyon 1, CNRS, INRA, Lyon, France
| | - Alain Charcosset
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Johann Joets
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| |
Collapse
|
22
|
Chen L, Zhang P, Fan Y, Lu Q, Li Q, Yan J, Muehlbauer GJ, Schnable PS, Dai M, Li L. Circular RNAs mediated by transposons are associated with transcriptomic and phenotypic variation in maize. THE NEW PHYTOLOGIST 2018; 217:1292-1306. [PMID: 29155438 DOI: 10.1111/nph.14901] [Citation(s) in RCA: 69] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2017] [Accepted: 10/18/2017] [Indexed: 05/21/2023]
Abstract
Circular RNAs (circRNAs) are covalently closed RNA molecules. Recent studies have shown that circRNAs can arise from the transcripts of transposons. Given the prevalence of transposons in the maize genome and dramatic genomic variation driven by transposons, we hypothesize that transposons in maize may be involved in the formation of circRNAs and further modulate phenotypic variation. We performed circRNA-Seq on B73 seedling leaves and uncovered 2804 high-confidence maize circRNAs, which show distinct genomic features. Comprehensive analyses demonstrated that sequences related to LINE1-like elements (LLEs) and their Reverse Complementary Pairs (LLERCPs) are significantly enriched in the flanking regions of circRNAs. Interestingly, as the number of LLERCPs increase, the accumulation of circRNAs varies, whereas that of linear transcripts decreases. Furthermore, genes with LLERCP-mediated circRNAs are enriched among loci that are associated with phenotypic variation. These results suggest that circRNAs are likely to be involved in the modulation of phenotypic variation by LLERCPs. Further, we showed that the presence/absence variation of LLERCPs was associated with expression variation of circRNA-circ1690 and was related to ear height, potentially through the interplay between circRNAs and functional linear transcripts. Our first study of maize circRNAs uncovers a potential new way for transposons to modulate transcriptomic and phenotypic variations.
Collapse
Affiliation(s)
- Lu Chen
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Pei Zhang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Yuan Fan
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Qiong Lu
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Qing Li
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Jianbing Yan
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Gary J Muehlbauer
- Department of Agronomy and Plant Genetics, University of Minnesota, St Paul, MN, 55108, USA
- Department of Plant and Microbial Biology, University of Minnesota, St Paul, MN, 55108, USA
| | | | - Mingqiu Dai
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| | - Lin Li
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, China
| |
Collapse
|
23
|
|
24
|
Dolatabadian A, Patel DA, Edwards D, Batley J. Copy number variation and disease resistance in plants. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2017; 130:2479-2490. [PMID: 29043379 DOI: 10.1007/s00122-017-2993-2] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2017] [Accepted: 09/27/2017] [Indexed: 05/06/2023]
Abstract
Plant genome diversity varies from single nucleotide polymorphisms to large-scale deletions, insertions, duplications, or re-arrangements. These re-arrangements of sequences resulting from duplication, gains or losses of DNA segments are termed copy number variations (CNVs). During the last decade, numerous studies have emphasized the importance of CNVs as a factor affecting human phenotype; in particular, CNVs have been associated with risks for several severe diseases. In plants, the exploration of the extent and role of CNVs in resistance against pathogens and pests is just beginning. Since CNVs are likely to be associated with disease resistance in plants, an understanding of the distribution of CNVs could assist in the identification of novel plant disease-resistance genes. In this paper, we review existing information about CNVs; their importance, role and function, as well as their association with disease resistance in plants.
Collapse
Affiliation(s)
- Aria Dolatabadian
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Crawley, WA, 6009, Australia
| | - Dhwani Apurva Patel
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Crawley, WA, 6009, Australia
| | - David Edwards
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Crawley, WA, 6009, Australia
| | - Jacqueline Batley
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Crawley, WA, 6009, Australia.
| |
Collapse
|
25
|
Prunier J, Caron S, Lamothe M, Blais S, Bousquet J, Isabel N, MacKay J. Gene copy number variations in adaptive evolution: The genomic distribution of gene copy number variations revealed by genetic mapping and their adaptive role in an undomesticated species, white spruce (Picea glauca). Mol Ecol 2017; 26:5989-6001. [PMID: 28833771 DOI: 10.1111/mec.14337] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2017] [Revised: 07/26/2017] [Accepted: 08/05/2017] [Indexed: 01/09/2023]
Abstract
Gene copy number variation (CNV) has been associated with phenotypic variability in animals and plants, but a genomewide understanding of their impacts on phenotypes is largely restricted to human and agricultural systems. As such, CNVs have rarely been considered in investigations of the genomic architecture of adaptation in wild species. Here, we report on the genetic mapping of gene CNVs in white spruce, which lacks a contiguous assembly of its large genome (~20 Gb), and their relationships with adaptive phenotypic variation. We detected 3,911 gene CNVs including de novo structural variations using comparative genome hybridization on arrays (aCGH) in a large progeny set. We inferred the heterozygosity at CNV loci within parents by comparing haploid and diploid tissues and genetically mapped 82 gene CNVs. Our analysis showed that CNVs were distributed over 10 linkage groups and identified four CNV hotspots that we predict to occur in other species of the Pinaceae. Significant relationships were found between 29 of the gene CNVs and adaptive traits based on regression analyses with timings of bud set and bud flush, and height growth, suggesting a role for CNVs in climate adaptation. The importance of CNVs in adaptive evolution of white spruce was also indicated by functional gene annotations and the clustering of 31% of the mapped adaptive gene CNVs in CNV hotspots. Taken together, these results illustrate the feasibility of studying CNVs in undomesticated species and represent a major step towards a better understanding of the roles of CNVs in adaptive evolution.
Collapse
Affiliation(s)
- Julien Prunier
- Institute for System and Integrative Biology (IBIS), Université Laval, Québec, QC, Canada.,Centre for Forest Research, Université Laval, Québec, QC, Canada
| | - Sébastien Caron
- Institute for System and Integrative Biology (IBIS), Université Laval, Québec, QC, Canada.,Centre for Forest Research, Université Laval, Québec, QC, Canada
| | - Manuel Lamothe
- Laurentian Forest Centre, Canadian Forest Service, Natural Resources Canada, Quebec, QC, Canada.,Canada Research Chair in Forest Genomics, Université Laval, Quebec, QC, Canada
| | - Sylvie Blais
- Institute for System and Integrative Biology (IBIS), Université Laval, Québec, QC, Canada.,Centre for Forest Research, Université Laval, Québec, QC, Canada.,Canada Research Chair in Forest Genomics, Université Laval, Quebec, QC, Canada
| | - Jean Bousquet
- Institute for System and Integrative Biology (IBIS), Université Laval, Québec, QC, Canada.,Centre for Forest Research, Université Laval, Québec, QC, Canada.,Canada Research Chair in Forest Genomics, Université Laval, Quebec, QC, Canada
| | - Nathalie Isabel
- Laurentian Forest Centre, Canadian Forest Service, Natural Resources Canada, Quebec, QC, Canada.,Canada Research Chair in Forest Genomics, Université Laval, Quebec, QC, Canada
| | - John MacKay
- Centre for Forest Research, Université Laval, Québec, QC, Canada.,Canada Research Chair in Forest Genomics, Université Laval, Quebec, QC, Canada.,Department of Plant Sciences, University of Oxford, Oxford, UK
| |
Collapse
|
26
|
Rasheed A, Hao Y, Xia X, Khan A, Xu Y, Varshney RK, He Z. Crop Breeding Chips and Genotyping Platforms: Progress, Challenges, and Perspectives. MOLECULAR PLANT 2017; 10:1047-1064. [PMID: 28669791 DOI: 10.1016/j.molp.2017.06.008] [Citation(s) in RCA: 237] [Impact Index Per Article: 33.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2017] [Revised: 05/29/2017] [Accepted: 06/19/2017] [Indexed: 05/18/2023]
Abstract
There is a rapidly rising trend in the development and application of molecular marker assays for gene mapping and discovery in field crops and trees. Thus far, more than 50 SNP arrays and 15 different types of genotyping-by-sequencing (GBS) platforms have been developed in over 25 crop species and perennial trees. However, much less effort has been made on developing ultra-high-throughput and cost-effective genotyping platforms for applied breeding programs. In this review, we discuss the scientific bottlenecks in existing SNP arrays and GBS technologies and the strategies to develop targeted platforms for crop molecular breeding. We propose that future practical breeding platforms should adopt automated genotyping technologies, either array or sequencing based, target functional polymorphisms underpinning economic traits, and provide desirable prediction accuracy for quantitative traits, with universal applications under wide genetic backgrounds in crops. The development of such platforms faces serious challenges at both the technological level due to cost ineffectiveness, and the knowledge level due to large genotype-phenotype gaps in crop plants. It is expected that such genotyping platforms will be achieved in the next ten years in major crops in consideration of (a) rapid development in gene discovery of important traits, (b) deepened understanding of quantitative traits through new analytical models and population designs, (c) integration of multi-layer -omics data leading to identification of genes and pathways responsible for important breeding traits, and (d) improvement in cost effectiveness of large-scale genotyping. Crop breeding chips and genotyping platforms will provide unprecedented opportunities to accelerate the development of cultivars with desired yield potential, quality, and enhanced adaptation to mitigate the effects of climate change.
Collapse
Affiliation(s)
- Awais Rasheed
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100081, China; International Maize and Wheat Improvement Center (CIMMYT), c/o CAAS, Beijing 100081, China
| | - Yuanfeng Hao
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100081, China
| | - Xianchun Xia
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100081, China
| | - Awais Khan
- Department of Plant Pathology and Plant-Microbe Biology, Cornell University, Geneva, NY, USA
| | - Yunbi Xu
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100081, China; International Maize and Wheat Improvement Center (CIMMYT), c/o CAAS, Beijing 100081, China
| | - Rajeev K Varshney
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru 502324, India
| | - Zhonghu He
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100081, China; International Maize and Wheat Improvement Center (CIMMYT), c/o CAAS, Beijing 100081, China.
| |
Collapse
|
27
|
Ruggieri V, Anzar I, Paytuvi A, Calafiore R, Cigliano RA, Sanseverino W, Barone A. Exploiting the great potential of Sequence Capture data by a new tool, SUPER-CAP. DNA Res 2017; 24:81-91. [PMID: 28011720 PMCID: PMC5381350 DOI: 10.1093/dnares/dsw050] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2016] [Accepted: 10/26/2016] [Indexed: 01/08/2023] Open
Abstract
The recent development of Sequence Capture methodology represents a powerful strategy for enhancing data generation to assess genetic variation of targeted genomic regions. Here, we present SUPER-CAP, a bioinformatics web tool aimed at handling Sequence Capture data, fine calculating the allele frequency of variations and building genotype-specific sequence of captured genes. The dataset used to develop this in silico strategy consists of 378 loci and related regulative regions in a collection of 44 tomato landraces. About 14,000 high-quality variants were identified. The high depth (>40×) of coverage and adopting the correct filtering criteria allowed identification of about 4,000 rare variants and 10 genes with a different copy number variation. We also show that the tool is capable to reconstruct genotype-specific sequences for each genotype by using the detected variants. This allows evaluating the combined effect of multiple variants in the same protein. The architecture and functionality of SUPER-CAP makes the software appropriate for a broad set of analyses including SNP discovery and mining. Its functionality, together with the capability to process large data sets and efficient detection of sequence variation, makes SUPER-CAP a valuable bioinformatics tool for genomics and breeding purposes.
Collapse
Affiliation(s)
- Valentino Ruggieri
- Department of Agricultural Sciences, University of Naples Federico II, Via Università 100, 80055 Portici (NA), Italy.,Sequentia Biotech SL, Calle Compte d'Urgell, 240, 08035 Barcelona, Spain
| | - Irantzu Anzar
- Sequentia Biotech SL, Calle Compte d'Urgell, 240, 08035 Barcelona, Spain
| | - Andreu Paytuvi
- Sequentia Biotech SL, Calle Compte d'Urgell, 240, 08035 Barcelona, Spain
| | - Roberta Calafiore
- Department of Agricultural Sciences, University of Naples Federico II, Via Università 100, 80055 Portici (NA), Italy
| | | | - Walter Sanseverino
- Sequentia Biotech SL, Calle Compte d'Urgell, 240, 08035 Barcelona, Spain
| | - Amalia Barone
- Department of Agricultural Sciences, University of Naples Federico II, Via Università 100, 80055 Portici (NA), Italy
| |
Collapse
|
28
|
Samelak-Czajka A, Marszalek-Zenczak M, Marcinkowska-Swojak M, Kozlowski P, Figlerowicz M, Zmienko A. MLPA-Based Analysis of Copy Number Variation in Plant Populations. FRONTIERS IN PLANT SCIENCE 2017; 8:222. [PMID: 28270823 PMCID: PMC5318451 DOI: 10.3389/fpls.2017.00222] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2016] [Accepted: 02/06/2017] [Indexed: 05/18/2023]
Abstract
Copy number variants (CNVs) are intraspecies duplications/deletions of large DNA segments (>1 kb). A growing number of reports highlight the functional and evolutionary impact of CNV in plants, increasing the need for appropriate tools that enable locus-specific CNV genotyping on a population scale. Multiplex ligation-dependent probe amplification (MLPA) is considered a gold standard in genotyping CNV in humans. Consequently, numerous commercial MLPA assays for CNV-related human diseases have been created. We routinely genotype complex multiallelic CNVs in human and plant genomes using the modified MLPA procedure based on fully synthesized oligonucleotide probes (90-200 nt), which greatly simplifies the design process and allows for the development of custom assays. Here, we present a step-by-step protocol for gene-specific MLPA probe design, multiplexed assay setup and data analysis in a copy number genotyping experiment in plants. As a case study, we present the results of a custom assay designed to genotype the copy number status of 12 protein coding genes in a population of 80 Arabidopsis accessions. The genes were pre-selected based on whole genome sequencing data and are localized in the genomic regions that display different levels of population-scale variation (non-variable, biallelic, or multiallelic, as well as CNVs overlapping whole genes or their fragments). The presented approach is suitable for population-scale validation of the CNV regions inferred from whole genome sequencing data analysis and for focused analysis of selected genes of interest. It can also be very easily adopted for any plant species, following optimization of the template amount and design of the appropriate control probes, according to the general guidelines presented in this paper.
Collapse
Affiliation(s)
- Anna Samelak-Czajka
- Institute of Computing Science, Faculty of Computing, Poznan University of TechnologyPoznan, Poland
| | - Malgorzata Marszalek-Zenczak
- Department of Molecular and Systems Biology, Institute of Bioorganic Chemistry, Polish Academy of SciencesPoznan, Poland
| | | | - Piotr Kozlowski
- Department of Molecular Genetics, Institute of Bioorganic Chemistry, Polish Academy of SciencesPoznan, Poland
| | - Marek Figlerowicz
- Institute of Computing Science, Faculty of Computing, Poznan University of TechnologyPoznan, Poland
- Department of Molecular and Systems Biology, Institute of Bioorganic Chemistry, Polish Academy of SciencesPoznan, Poland
| | - Agnieszka Zmienko
- Institute of Computing Science, Faculty of Computing, Poznan University of TechnologyPoznan, Poland
- Department of Molecular and Systems Biology, Institute of Bioorganic Chemistry, Polish Academy of SciencesPoznan, Poland
| |
Collapse
|
29
|
Liu S, Zheng J, Migeon P, Ren J, Hu Y, He C, Liu H, Fu J, White FF, Toomajian C, Wang G. Unbiased K-mer Analysis Reveals Changes in Copy Number of Highly Repetitive Sequences During Maize Domestication and Improvement. Sci Rep 2017; 7:42444. [PMID: 28186206 PMCID: PMC5301235 DOI: 10.1038/srep42444] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2016] [Accepted: 01/10/2017] [Indexed: 12/15/2022] Open
Abstract
The major component of complex genomes is repetitive elements, which remain recalcitrant to characterization. Using maize as a model system, we analyzed whole genome shotgun (WGS) sequences for the two maize inbred lines B73 and Mo17 using k-mer analysis to quantify the differences between the two genomes. Significant differences were identified in highly repetitive sequences, including centromere, 45S ribosomal DNA (rDNA), knob, and telomere repeats. Genotype specific 45S rDNA sequences were discovered. The B73 and Mo17 polymorphic k-mers were used to examine allele-specific expression of 45S rDNA in the hybrids. Although Mo17 contains higher copy number than B73, equivalent levels of overall 45S rDNA expression indicates that transcriptional or post-transcriptional regulation mechanisms operate for the 45S rDNA in the hybrids. Using WGS sequences of B73xMo17 doubled haploids, genomic locations showing differential repetitive contents were genetically mapped, which displayed different organization of highly repetitive sequences in the two genomes. In an analysis of WGS sequences of HapMap2 lines, including maize wild progenitor, landraces, and improved lines, decreases and increases in abundance of additional sets of k-mers associated with centromere, 45S rDNA, knob, and retrotransposons were found among groups, revealing global evolutionary trends of genomic repeats during maize domestication and improvement.
Collapse
Affiliation(s)
- Sanzhen Liu
- Department of Plant Pathology, Kansas State University, Manhattan, KS, 66506, USA
| | - Jun Zheng
- Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, P.R.China
| | - Pierre Migeon
- Department of Plant Pathology, Kansas State University, Manhattan, KS, 66506, USA
| | - Jie Ren
- Department of Plant Pathology, Kansas State University, Manhattan, KS, 66506, USA
| | - Ying Hu
- Department of Plant Pathology, Kansas State University, Manhattan, KS, 66506, USA
| | - Cheng He
- Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, P.R.China
| | - Hongjun Liu
- State Key Laboratory of Crop Biology, Shandong Key Laboratory of Crop Biology, Taian 271018, P.R. China.,College of Life Sciences, Shandong Agricultural University, Taian 271018, P.R. China
| | - Junjie Fu
- Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, P.R.China
| | - Frank F White
- Department of Plant Pathology, University of Florida, Gainesville, FL, 32611, USA
| | | | - Guoying Wang
- Institute of Crop Science, Chinese Academy of Agricultural Sciences, Beijing 100081, P.R.China
| |
Collapse
|
30
|
Dell'Acqua M, Gatti DM, Pea G, Cattonaro F, Coppens F, Magris G, Hlaing AL, Aung HH, Nelissen H, Baute J, Frascaroli E, Churchill GA, Inzé D, Morgante M, Pè ME. Genetic properties of the MAGIC maize population: a new platform for high definition QTL mapping in Zea mays. Genome Biol 2015; 16:167. [PMID: 26357913 PMCID: PMC4566846 DOI: 10.1186/s13059-015-0716-z] [Citation(s) in RCA: 138] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2015] [Accepted: 07/03/2015] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Maize (Zea mays) is a globally produced crop with broad genetic and phenotypic variation. New tools that improve our understanding of the genetic basis of quantitative traits are needed to guide predictive crop breeding. We have produced the first balanced multi-parental population in maize, a tool that provides high diversity and dense recombination events to allow routine quantitative trait loci (QTL) mapping in maize. RESULTS We produced 1,636 MAGIC maize recombinant inbred lines derived from eight genetically diverse founder lines. The characterization of 529 MAGIC maize lines shows that the population is a balanced, evenly differentiated mosaic of the eight founders, with mapping power and resolution strengthened by high minor allele frequencies and a fast decay of linkage disequilibrium. We show how MAGIC maize may find strong candidate genes by incorporating genome sequencing and transcriptomics data. We discuss three QTL for grain yield and three for flowering time, reporting candidate genes. Power simulations show that subsets of MAGIC maize might achieve high-power and high-definition QTL mapping. CONCLUSIONS We demonstrate MAGIC maize's value in identifying the genetic bases of complex traits of agronomic relevance. The design of MAGIC maize allows the accumulation of sequencing and transcriptomics layers to guide the identification of candidate genes for a number of maize traits at different developmental stages. The characterization of the full MAGIC maize population will lead to higher power and definition in QTL mapping, and lay the basis for improved understanding of maize phenotypes, heterosis included. MAGIC maize is available to researchers.
Collapse
Affiliation(s)
- Matteo Dell'Acqua
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Pisa, Italy.
| | | | - Giorgio Pea
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Pisa, Italy.
- Current address: Thermo Fisher Scientific, Via G.B Tiepolo 18, 20900, Monza, MB, Italy.
| | | | - Frederik Coppens
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Gent, Belgium.
| | - Gabriele Magris
- Institute of Applied Genomics, Udine, Italy.
- Department of Agricultural and Environmental Sciences, University of Udine, Udine, Italy.
| | - Aye L Hlaing
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Pisa, Italy.
- Current address: Department of Agricultural Research, Nay Pyi Taw, Myanmar.
| | - Htay H Aung
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Pisa, Italy.
- Current address: Plant Biotechnology Center, Yangon, Myanmar.
| | - Hilde Nelissen
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Gent, Belgium.
- Department of Plant Systems Biology, VIB, Gent, Belgium.
| | - Joke Baute
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Gent, Belgium.
- Department of Plant Systems Biology, VIB, Gent, Belgium.
| | | | | | - Dirk Inzé
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Gent, Belgium.
- Department of Plant Systems Biology, VIB, Gent, Belgium.
| | - Michele Morgante
- Institute of Applied Genomics, Udine, Italy.
- Department of Agricultural and Environmental Sciences, University of Udine, Udine, Italy.
| | - Mario Enrico Pè
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Pisa, Italy.
| |
Collapse
|
31
|
Muraya MM, Schmutzer T, Ulpinnis C, Scholz U, Altmann T. Targeted Sequencing Reveals Large-Scale Sequence Polymorphism in Maize Candidate Genes for Biomass Production and Composition. PLoS One 2015; 10:e0132120. [PMID: 26151830 PMCID: PMC4495061 DOI: 10.1371/journal.pone.0132120] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2015] [Accepted: 06/10/2015] [Indexed: 12/30/2022] Open
Abstract
A major goal of maize genomic research is to identify sequence polymorphisms responsible for phenotypic variation in traits of economic importance. Large-scale detection of sequence variation is critical for linking genes, or genomic regions, to phenotypes. However, due to its size and complexity, it remains expensive to generate whole genome sequences of sufficient coverage for divergent maize lines, even with access to next generation sequencing (NGS) technology. Because methods involving reduction of genome complexity, such as genotyping-by-sequencing (GBS), assess only a limited fraction of sequence variation, targeted sequencing of selected genomic loci offers an attractive alternative. We therefore designed a sequence capture assay to target 29 Mb genomic regions and surveyed a total of 4,648 genes possibly affecting biomass production in 21 diverse inbred maize lines (7 flints, 14 dents). Captured and enriched genomic DNA was sequenced using the 454 NGS platform to 19.6-fold average depth coverage, and a broad evaluation of read alignment and variant calling methods was performed to select optimal procedures for variant discovery. Sequence alignment with the B73 reference and de novo assembly identified 383,145 putative single nucleotide polymorphisms (SNPs), of which 42,685 were non-synonymous alterations and 7,139 caused frameshifts. Presence/absence variation (PAV) of genes was also detected. We found that substantial sequence variation exists among genomic regions targeted in this study, which was particularly evident within coding regions. This diversification has the potential to broaden functional diversity and generate phenotypic variation that may lead to new adaptations and the modification of important agronomic traits. Further, annotated SNPs identified here will serve as useful genetic tools and as candidates in searches for phenotype-altering DNA variation. In summary, we demonstrated that sequencing of captured DNA is a powerful approach for variant discovery in maize genes.
Collapse
Affiliation(s)
- Moses M. Muraya
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, D-06466, Stadt Seeland, Germany
- Department of Plant Science, Chuka University, P.O. Box, 109–60400, Chuka, Kenya
| | - Thomas Schmutzer
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, D-06466, Stadt Seeland, Germany
- * E-mail:
| | - Chris Ulpinnis
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, D-06466, Stadt Seeland, Germany
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, D-06466, Stadt Seeland, Germany
| | - Thomas Altmann
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Corrensstraße 3, D-06466, Stadt Seeland, Germany
| |
Collapse
|
32
|
|
33
|
Jiang L, Lv Y, Li T, Zhao H, Zhang T. Identification and characterization of presence/absence variation in maize genotype Mo17. Genes Genomics 2015. [DOI: 10.1007/s13258-015-0272-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
34
|
Abstract
Differences between plant genomes range from single nucleotide polymorphisms to large-scale duplications, deletions and rearrangements. The large polymorphisms are termed structural variants (SVs). SVs have received significant attention in human genetics and were found to be responsible for various chronic diseases. However, little effort has been directed towards understanding the role of SVs in plants. Many recent advances in plant genetics have resulted from improvements in high-resolution technologies for measuring SVs, including microarray-based techniques, and more recently, high-throughput DNA sequencing. In this review we describe recent reports of SV in plants and describe the genomic technologies currently used to measure these SVs.
Collapse
|
35
|
Wang H, Nettleton D, Ying K. Copy number variation detection using next generation sequencing read counts. BMC Bioinformatics 2014; 15:109. [PMID: 24731174 PMCID: PMC4021345 DOI: 10.1186/1471-2105-15-109] [Citation(s) in RCA: 63] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2013] [Accepted: 04/02/2014] [Indexed: 12/11/2022] Open
Abstract
Background A copy number variation (CNV) is a difference between genotypes in the number of copies of a genomic region. Next generation sequencing (NGS) technologies provide sensitive and accurate tools for detecting genomic variations that include CNVs. However, statistical approaches for CNV identification using NGS are limited. We propose a new methodology for detecting CNVs using NGS data. This method (henceforth denoted by m-HMM) is based on a hidden Markov model with emission probabilities that are governed by mixture distributions. We use the Expectation-Maximization (EM) algorithm to estimate the parameters in the model. Results A simulation study demonstrates that our proposed m-HMM approach has greater power for detecting copy number gains and losses relative to existing methods. Furthermore, application of our m-HMM to DNA sequencing data from the two maize inbred lines B73 and Mo17 to identify CNVs that may play a role in creating phenotypic differences between these inbred lines provides results concordant with previous array-based efforts to identify CNVs. Conclusions The new m-HMM method is a powerful and practical approach for identifying CNVs from NGS data.
Collapse
Affiliation(s)
- Heng Wang
- Lyman Briggs College, Michigan State University, East Lansing, USA.
| | | | | |
Collapse
|
36
|
A high-density gene map of loblolly pine (Pinus taeda L.) based on exome sequence capture genotyping. G3-GENES GENOMES GENETICS 2014; 4:29-37. [PMID: 24192835 PMCID: PMC3887537 DOI: 10.1534/g3.113.008714] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Loblolly pine (Pinus taeda L.) is an economically and ecologically important conifer for which a suite of genomic resources is being generated. Despite recent attempts to sequence the large genome of conifers, their assembly and the positioning of genes remains largely incomplete. The interspecific synteny in pines suggests that a gene-based map would be useful to support genome assemblies and analysis of conifers. To establish a reference gene-based genetic map, we performed exome sequencing of 14729 genes on a mapping population of 72 haploid samples, generating a resource of 7434 sequence variants segregating for 3787 genes. Most markers are single-nucleotide polymorphisms, although short insertions/deletions and multiple nucleotide polymorphisms also were used. Marker segregation in the population was used to generate a high-density, gene-based genetic map. A total of 2841 genes were mapped to pine’s 12 linkage groups with an average of one marker every 0.58 cM. Capture data were used to detect gene presence/absence variations and position 65 genes on the map. We compared the marker order of genes previously mapped in loblolly pine and found high agreement. We estimated that 4123 genes had enough sequencing depth for reliable detection of markers, suggesting a high marker conversation rate of 92% (3787/4123). This is possible because a significant portion of the gene is captured and sequenced, increasing the chances of identifying a polymorphic site for characterization and mapping. This sub-centiMorgan genetic map provides a valuable resource for gene positioning on chromosomes and guide for the assembly of a reference pine genome.
Collapse
|
37
|
Żmieńko A, Samelak A, Kozłowski P, Figlerowicz M. Copy number polymorphism in plant genomes. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2014; 127:1-18. [PMID: 23989647 PMCID: PMC4544587 DOI: 10.1007/s00122-013-2177-7] [Citation(s) in RCA: 120] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2013] [Accepted: 08/12/2013] [Indexed: 05/18/2023]
Abstract
Copy number variants (CNVs) are genomic rearrangements resulting from gains or losses of DNA segments. Typically, the term refers to rearrangements of sequences larger than 1 kb. This type of polymorphism has recently been shown to be a key contributor to intra-species genetic variation, along with single-nucleotide polymorphisms and short insertion-deletion polymorphisms. Over the last decade, a growing number of studies have highlighted the importance of copy number variation (CNV) as a factor affecting human phenotype and individual CNVs have been linked to risks for severe diseases. In plants, the exploration of the extent and role of CNV is still just beginning. Initial genomic analyses indicate that CNVs are prevalent in plants and have greatly affected plant genome evolution. Many CNV events have been observed in outcrossing and autogamous species. CNVs are usually found on all chromosomes, with CNV hotspots interspersed with regions of very low genetic variation. Although CNV is mainly associated with intergenic regions, many CNVs encompass protein-coding genes. The collected data suggest that CNV mainly affects the members of large families of functionally redundant genes. Thus, the effects of individual CNV events on phenotype are usually modest. Nevertheless, there are many cases in which CNVs for specific genes have been linked to important traits such as flowering time, plant height and resistance to biotic and abiotic stress. Recent reports suggest that CNVs may form rapidly in response to stress.
Collapse
Affiliation(s)
- Agnieszka Żmieńko
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, Poznan, Poland
| | - Anna Samelak
- Institute of Computing Science, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland
| | - Piotr Kozłowski
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, Poznan, Poland
| | - Marek Figlerowicz
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, Poznan, Poland
- Institute of Computing Science, Poznan University of Technology, Piotrowo 2, 60-965 Poznan, Poland
| |
Collapse
|
38
|
Zhang LM, Luo H, Liu ZQ, Zhao Y, Luo JC, Hao DY, Jing HC. Genome-wide patterns of large-size presence/absence variants in sorghum. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2014; 56:24-37. [PMID: 24428208 DOI: 10.1111/jipb.12121] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2013] [Accepted: 10/10/2013] [Indexed: 06/03/2023]
Abstract
The presence/absence variants (PAVs) are a major source of genome structural variation and have profound effects on phenotypic and genomic variation in animals and humans. However, little is understood about PAVs in plant genomes. Our previous resequencing effort on three sorghum (Sorghum bicolour L.) genomes, each 12× coverage, uncovered 5 364 PAVs. Here, we report a detailed characterization of 51 large-size (>30 kb) PAVs. These PAVs spanned a total size of 2.92 Mb of the sorghum genome containing 202 known and predicted genes, including 38 genes annotated to encode cell death and stress response genes. The PAVs varied considerably for repeat sequences and mobile elements with DNA transposons as the major components. The frequency and distribution of these PAVs differed substantially across 96 sorghum inbred lines, and the low- and high frequency PAVs differed in their gene categories. This report shed new light on the occurrence and diversity of PAVs in sorghum genomes. Our research exemplifies a new perspective to explore genome structural variation for genetic improvement in plant breeding.
Collapse
Affiliation(s)
- Li-Min Zhang
- The Key Laboratory of Plant Resources, Institute of Botany, The Chinese Academy of Sciences, Beijing, 100093, China; Key Laboratory for Molecular Enzymology and Engineering of the Ministry of Education, Jilin University, Changchun, 130012, China
| | | | | | | | | | | | | |
Collapse
|
39
|
Kanamori H, Fujisawa M, Katagiri S, Oono Y, Fujisawa H, Karasawa W, Kurita K, Sasaki H, Mori S, Hamada M, Mukai Y, Yazawa T, Mizuno H, Namiki N, Sasaki T, Katayose Y, Matsumoto T, Wu J. A BAC physical map of aus rice cultivar 'Kasalath', and the map-based genomic sequence of 'Kasalath' chromosome 1. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2013; 76:699-708. [PMID: 23980637 DOI: 10.1111/tpj.12317] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2013] [Revised: 07/11/2013] [Accepted: 08/20/2013] [Indexed: 05/22/2023]
Abstract
Comparative analysis using available genomic resources within closely related species is an effective way to investigate genomic sequence and structural diversity. Rice (Oryza sativa L.) has undergone significant physiological and morphological changes during its domestication and local adaptation. We present a complete bacterial artificial chromosome (BAC) physical map for the aus rice cultivar 'Kasalath', which covers 90% of the sequence of temperate japonica rice cultivar 'Nipponbare'. Examination of physical distances between computational and experimental measurements of 'Kasalath' BAC insert size revealed the presence of more than 500 genomic regions that appear to have significant chromosome structural changes between the two cultivars. In particular, a genomic region on the long arm of 'Kasalath' chromosome 11 carrying a disease-resistance gene cluster was greatly expanded relative to the 'Nipponbare' genome. We also decoded 41.37 Mb of high-quality genomic sequence from 'Kasalath' chromosome 1. Extensive comparisons of chromosome 1 between 'Kasalath' and 'Nipponbare' led to the discovery of 317,843 single-nucleotide polymorphisms (SNPs) and 66,331 insertion/deletion (indel) sites. Nearly two-thirds of the expressed genes on rice chromosome 1 carried natural variations involving SNPs and/or indels that resulted in substitutions, insertions or deletions of amino acids in one cultivar relative to the other. We also observed gain and loss of genes caused by large indels. This study provides an important framework and an invaluable dataset for further understanding of the molecular mechanisms underlying the evolution and functions of the rice genome.
Collapse
Affiliation(s)
- Hiroyuki Kanamori
- Agrogenomics Research Center, National Institute of Agrobiological Sciences, Tsukuba, Ibaraki , 305-8602, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
40
|
Yu P, Wang CH, Xu Q, Feng Y, Yuan XP, Yu HY, Wang YP, Tang SX, Wei XH. Genome-wide copy number variations in Oryza sativa L. BMC Genomics 2013; 14:649. [PMID: 24059626 PMCID: PMC3856455 DOI: 10.1186/1471-2164-14-649] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2013] [Accepted: 09/16/2013] [Indexed: 01/16/2023] Open
Abstract
Background Copy number variation (CNV) can lead to intra-specific genome variations. It is not only part of normal genetic variation, but also is the source of phenotypic differences. Rice (Oryza sativa L.) is a model organism with a well-annotated genome, but investigation of CNVs in rice lags behind its mammalian counterparts. Results We comprehensively assayed CNVs using high-density array comparative genomic hybridization in a panel of 20 Asian cultivated rice comprising six indica, three aus, two rayada, two aromatic, three tropical japonica, and four temperate japonica varieties. We used a stringent criterion to identify a total of 2886 high-confidence copy number variable regions (CNVRs), which span 10.28 Mb (or 2.69%) of the rice genome, overlapping 1321 genes. These genes were significantly enriched for specific biological functions involved in cell death, protein phosphorylation, and defense response. Transposable elements (TEs) and other repetitive sequences were identified in the majority of CNVRs. Chromosome 11 showed the greatest enrichment for CNVs. Of subspecies-specific CNVRs, 55.75% and 61.96% were observed in only one cultivar of ssp. indica and ssp. japonica, respectively. Some CNVs with high frequency differences among groups resided in genes underlying rice adaptation. Conclusions Higher recombination rates and the presence of homologous gene clusters are probably predispositions for generation of the higher number of CNVs on chromosome 11 by non-allelic homologous recombination events. The subspecies-specific variants are enriched for rare alleles, which suggests that CNVs are relatively recent events that have arisen within breeding populations. A number of the CNVs identified in this study are candidates for generation of group-specific phenotypes.
Collapse
Affiliation(s)
- Ping Yu
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Hangzhou, China.
| | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Muñoz-Amatriaín M, Eichten SR, Wicker T, Richmond TA, Mascher M, Steuernagel B, Scholz U, Ariyadasa R, Spannagl M, Nussbaumer T, Mayer KFX, Taudien S, Platzer M, Jeddeloh JA, Springer NM, Muehlbauer GJ, Stein N. Distribution, functional impact, and origin mechanisms of copy number variation in the barley genome. Genome Biol 2013; 14:R58. [PMID: 23758725 PMCID: PMC3706897 DOI: 10.1186/gb-2013-14-6-r58] [Citation(s) in RCA: 93] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2013] [Accepted: 06/12/2013] [Indexed: 12/20/2022] Open
Abstract
Background There is growing evidence for the prevalence of copy number variation (CNV) and its role in phenotypic variation in many eukaryotic species. Here we use array comparative genomic hybridization to explore the extent of this type of structural variation in domesticated barley cultivars and wild barleys. Results A collection of 14 barley genotypes including eight cultivars and six wild barleys were used for comparative genomic hybridization. CNV affects 14.9% of all the sequences that were assessed. Higher levels of CNV diversity are present in the wild accessions relative to cultivated barley. CNVs are enriched near the ends of all chromosomes except 4H, which exhibits the lowest frequency of CNVs. CNV affects 9.5% of the coding sequences represented on the array and the genes affected by CNV are enriched for sequences annotated as disease-resistance proteins and protein kinases. Sequence-based comparisons of CNV between cultivars Barke and Morex provided evidence that DNA repair mechanisms of double-strand breaks via single-stranded annealing and synthesis-dependent strand annealing play an important role in the origin of CNV in barley. Conclusions We present the first catalog of CNVs in a diploid Triticeae species, which opens the door for future genome diversity research in a tribe that comprises the economically important cereal species wheat, barley, and rye. Our findings constitute a valuable resource for the identification of CNV affecting genes of agronomic importance. We also identify potential mechanisms that can generate variation in copy number in plant genomes.
Collapse
|
42
|
Downs GS, Bi YM, Colasanti J, Wu W, Chen X, Zhu T, Rothstein SJ, Lukens LN. A developmental transcriptional network for maize defines coexpression modules. PLANT PHYSIOLOGY 2013; 161:1830-43. [PMID: 23388120 PMCID: PMC3613459 DOI: 10.1104/pp.112.213231] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Here, we present a genome-wide overview of transcriptional circuits in the agriculturally significant crop species maize (Zea mays). We examined transcript abundance data at 50 developmental stages, from embryogenesis to senescence, for 34,876 gene models and classified genes into 24 robust coexpression modules. Modules were strongly associated with tissue types and related biological processes. Sixteen of the 24 modules (67%) have preferential transcript abundance within specific tissues. One-third of modules had an absence of gene expression in specific tissues. Genes within a number of modules also correlated with the developmental age of tissues. Coexpression of genes is likely due to transcriptional control. For a number of modules, key genes involved in transcriptional control have expression profiles that mimic the expression profiles of module genes, although the expression of transcriptional control genes is not unusually representative of module gene expression. Known regulatory motifs are enriched in several modules. Finally, of the 13 network modules with more than 200 genes, three contain genes that are notably clustered (P < 0.05) within the genome. This work, based on a carefully selected set of major tissues representing diverse stages of maize development, demonstrates the remarkable power of transcript-level coexpression networks to identify underlying biological processes and their molecular components.
Collapse
|
43
|
Appels R, Barrero R, Bellgard M. Advances in biotechnology and informatics to link variation in the genome to phenotypes in plants and animals. Funct Integr Genomics 2013; 13:1-9. [PMID: 23494190 PMCID: PMC3605488 DOI: 10.1007/s10142-013-0319-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2013] [Revised: 03/02/2013] [Accepted: 03/03/2013] [Indexed: 11/27/2022]
Abstract
Advances in our understanding of genome structure provide consistent evidence for the existence of a core genome representing species classically defined by phenotype, as well as conditionally dispensable components of the genome that shows extensive variation between individuals of a given species. Generally, conservation of phenotypic features between species reflects conserved features of the genome; however, this is evidently not necessarily always the case as demonstrated by the analysis of the tunicate chordate Oikopleura dioica. In both plants and animals, the methylation activity of DNA and histones continues to present new variables for modifying (eventually) the phenotype of an organism and provides for structural variation that builds on the point mutations, rearrangements, indels, and amplification of retrotransposable elements traditionally considered. The translation of the advances in the structure/function analysis of the genome to industry is facilitated through the capture of research outputs in "toolboxes" that remain accessible in the public domain.
Collapse
Affiliation(s)
- R. Appels
- Centre for Comparative Genomics, Murdoch University, Perth, WA 6150 Australia
| | - R. Barrero
- Centre for Comparative Genomics, Murdoch University, Perth, WA 6150 Australia
| | - M. Bellgard
- Centre for Comparative Genomics, Murdoch University, Perth, WA 6150 Australia
| |
Collapse
|
44
|
Li L, Petsch K, Shimizu R, Liu S, Xu WW, Ying K, Yu J, Scanlon MJ, Schnable PS, Timmermans MCP, Springer NM, Muehlbauer GJ. Mendelian and non-Mendelian regulation of gene expression in maize. PLoS Genet 2013; 9:e1003202. [PMID: 23341782 PMCID: PMC3547793 DOI: 10.1371/journal.pgen.1003202] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2012] [Accepted: 11/14/2012] [Indexed: 11/25/2022] Open
Abstract
Transcriptome variation plays an important role in affecting the phenotype of an organism. However, an understanding of the underlying mechanisms regulating transcriptome variation in segregating populations is still largely unknown. We sought to assess and map variation in transcript abundance in maize shoot apices in the intermated B73×Mo17 recombinant inbred line population. RNA–based sequencing (RNA–seq) allowed for the detection and quantification of the transcript abundance derived from 28,603 genes. For a majority of these genes, the population mean, coefficient of variation, and segregation patterns could be predicted by the parental expression levels. Expression quantitative trait loci (eQTL) mapping identified 30,774 eQTL including 96 trans-eQTL “hotspots,” each of which regulates the expression of a large number of genes. Interestingly, genes regulated by a trans-eQTL hotspot tend to be enriched for a specific function or act in the same genetic pathway. Also, genomic structural variation appeared to contribute to cis-regulation of gene expression. Besides genes showing Mendelian inheritance in the RIL population, we also found genes whose expression level and variation in the progeny could not be predicted based on parental difference, indicating that non-Mendelian factors also contribute to expression variation. Specifically, we found 145 genes that show patterns of expression reminiscent of paramutation such that all the progeny had expression levels similar to one of the two parents. Furthermore, we identified another 210 genes that exhibited unexpected patterns of transcript presence/absence. Many of these genes are likely to be gene fragments resulting from transposition, and the presence/absence of their transcripts could influence expression levels of their ancestral syntenic genes. Overall, our results contribute to the identification of novel expression patterns and broaden the understanding of transcriptional variation in plants. Phenotypes are determined by the expression of genes, the environment, and the interaction of gene expression and the environment. However, a complete understanding of the inheritance of and genome-wide regulation of gene expression is lacking. One approach, called expression quantitative trait locus (eQTL) mapping provides the opportunity to examine the genome-wide inheritance and regulation of gene expression. In this paper, we conducted high-throughput sequencing of gene transcripts to examine gene expression in the shoot apex of a maize biparental mapping population. We quantified expression levels from 28,603 genes in the population and showed that the vast majority of genes exhibited the expected pattern of Mendelian inheritance. We genetically mapped the expression patterns and identified genomic regions associated with gene expression. Notably, we detected gene expression patterns that exhibited non-Mendelian inheritance. These included 145 genes that exhibited expression patterns in the progeny that were similar to only one of the parents and 210 genes with unexpected presence/absence expression patterns. The findings of non-Mendelian inheritance underscore the complexity of gene expression and provide a framework for understanding these complexities.
Collapse
Affiliation(s)
- Lin Li
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Katherine Petsch
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
| | - Rena Shimizu
- Department of Plant Biology, Cornell University, Ithaca, New York, United States of America
| | - Sanzhen Liu
- Department of Genetics, Development, and Cell Biology, and Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
| | - Wayne Wenzhong Xu
- Supercomputing Institute for Advanced Computational Research, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Kai Ying
- Department of Genetics, Development, and Cell Biology, and Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
| | - Jianming Yu
- Department of Agronomy, Kansas State University, Manhattan, Kansas, United States of America
| | - Michael J. Scanlon
- Department of Plant Biology, Cornell University, Ithaca, New York, United States of America
| | - Patrick S. Schnable
- Department of Genetics, Development, and Cell Biology, and Department of Agronomy, Iowa State University, Ames, Iowa, United States of America
| | | | - Nathan M. Springer
- Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Gary J. Muehlbauer
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, Minnesota, United States of America
- Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
- * E-mail:
| |
Collapse
|
45
|
Olsen KM, Wendel JF. A bountiful harvest: genomic insights into crop domestication phenotypes. ANNUAL REVIEW OF PLANT BIOLOGY 2013; 64:47-70. [PMID: 23451788 DOI: 10.1146/annurev-arplant-050312-120048] [Citation(s) in RCA: 221] [Impact Index Per Article: 20.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
Human selection during crop domestication has resulted in remarkable transformations of plant phenotypes, providing a window into the genetic basis of morphological evolution. Recent progress in our understanding of the genetic architecture of novel plant traits has emerged from combining advanced molecular technologies with improved experimental designs, including nested association mapping, genome-wide association studies, population genetic screens for signatures of selection, and candidate gene approaches. These studies reveal a diversity of underlying causative mutations affecting phenotypes important in plant domestication and crop improvement, including coding sequence substitutions, presence/absence and copy number variation, transposon activation leading to novel gene structures and expression patterns, diversification following gene duplication, and polyploidy leading to altered combinatorial capabilities. The genomic regions unknowingly targeted by human selection include both structural and regulatory genes, often with results that propagate through the transcriptome as well as to other levels in the biosynthetic and morphogenetic networks.
Collapse
Affiliation(s)
- Kenneth M Olsen
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA.
| | | |
Collapse
|
46
|
Liu S, Ying K, Yeh CT, Yang J, Swanson-Wagner R, Wu W, Richmond T, Gerhardt DJ, Lai J, Springer N, Nettleton D, Jeddeloh JA, Schnable PS. Changes in genome content generated via segregation of non-allelic homologs. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2012; 72:390-399. [PMID: 22731681 DOI: 10.1111/j.1365-313x.2012.05087.x] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
A careful analysis of two maize recombinant inbred lines (RILs) relative to their inbred parents revealed the presence of several hundred apparently de novo copy number variants (CNVs). These changes in genome content were validated via both PCR and whole exome-array capture-and-sequencing experiments. One hundred and eighty-five genomic regions, which overlap with 38 high-confidence genes, exhibited apparently de novo copy number variation (CNV) in these two RILs and in many instances the same apparently de novo CNV events were observed in multiple RILs. Further analyses revealed that these recurrent apparently de novo CNVs were caused by segregation of single-copy homologous sequences that are located in non-allelic positions in the two parental inbred lines. F(1) individuals derived from these inbred lines will be hemizygous for each of these non-allelic homologs but RIL genotypes will contain these sequences at zero, one or two genomic loci. Hence, the segregation of non-allelic homologs may contribute to transgressive segregation. Indeed, statistical associations between phenotypic quantitative trait loci and genomic losses were observed for two of 14 tested pairs of non-allelic homologs.
Collapse
Affiliation(s)
- Sanzhen Liu
- Department of Agronomy, 2035 Roy J. Carver Co-Lab, Iowa State University, Ames, IA 50011, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
47
|
McHale LK, Haun WJ, Xu WW, Bhaskar PB, Anderson JE, Hyten DL, Gerhardt DJ, Jeddeloh JA, Stupar RM. Structural variants in the soybean genome localize to clusters of biotic stress-response genes. PLANT PHYSIOLOGY 2012; 159:1295-308. [PMID: 22696021 PMCID: PMC3425179 DOI: 10.1104/pp.112.194605] [Citation(s) in RCA: 128] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/26/2012] [Accepted: 06/12/2012] [Indexed: 05/19/2023]
Abstract
Genome-wide structural and gene content variations are hypothesized to drive important phenotypic variation within a species. Structural and gene content variations were assessed among four soybean (Glycine max) genotypes using array hybridization and targeted resequencing. Many chromosomes exhibited relatively low rates of structural variation (SV) among genotypes. However, several regions exhibited both copy number and presence-absence variation, the most prominent found on chromosomes 3, 6, 7, 16, and 18. Interestingly, the regions most enriched for SV were specifically localized to gene-rich regions that harbor clustered multigene families. The most abundant classes of gene families associated with these regions were the nucleotide-binding and receptor-like protein classes, both of which are important for plant biotic defense. The colocalization of SV with plant defense response signal transduction pathways provides insight into the mechanisms of soybean resistance gene evolution and may inform the development of new approaches to resistance gene cloning.
Collapse
Affiliation(s)
- Leah K. McHale
- Department of Horticulture and Crop Science, Ohio State University, Columbus, Ohio 43210 (L.K.M.)
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108 (W.J.H., P.B.B., J.E.A., R.M.S.)
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455 (W.W.X.)
- Soybean Genomics and Improvement Laboratory, Agricultural Research Service, Beltsville, Maryland 20705 (D.L.H.); and
- Roche NimbleGen, Research and Development, Madison, Wisconsin 53719 (D.J.G., J.A.J.)
| | | | - Wayne W. Xu
- Department of Horticulture and Crop Science, Ohio State University, Columbus, Ohio 43210 (L.K.M.)
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108 (W.J.H., P.B.B., J.E.A., R.M.S.)
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455 (W.W.X.)
- Soybean Genomics and Improvement Laboratory, Agricultural Research Service, Beltsville, Maryland 20705 (D.L.H.); and
- Roche NimbleGen, Research and Development, Madison, Wisconsin 53719 (D.J.G., J.A.J.)
| | | | - Justin E. Anderson
- Department of Horticulture and Crop Science, Ohio State University, Columbus, Ohio 43210 (L.K.M.)
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108 (W.J.H., P.B.B., J.E.A., R.M.S.)
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455 (W.W.X.)
- Soybean Genomics and Improvement Laboratory, Agricultural Research Service, Beltsville, Maryland 20705 (D.L.H.); and
- Roche NimbleGen, Research and Development, Madison, Wisconsin 53719 (D.J.G., J.A.J.)
| | | | - Daniel J. Gerhardt
- Department of Horticulture and Crop Science, Ohio State University, Columbus, Ohio 43210 (L.K.M.)
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108 (W.J.H., P.B.B., J.E.A., R.M.S.)
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455 (W.W.X.)
- Soybean Genomics and Improvement Laboratory, Agricultural Research Service, Beltsville, Maryland 20705 (D.L.H.); and
- Roche NimbleGen, Research and Development, Madison, Wisconsin 53719 (D.J.G., J.A.J.)
| | - Jeffrey A. Jeddeloh
- Department of Horticulture and Crop Science, Ohio State University, Columbus, Ohio 43210 (L.K.M.)
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, Minnesota 55108 (W.J.H., P.B.B., J.E.A., R.M.S.)
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455 (W.W.X.)
- Soybean Genomics and Improvement Laboratory, Agricultural Research Service, Beltsville, Maryland 20705 (D.L.H.); and
- Roche NimbleGen, Research and Development, Madison, Wisconsin 53719 (D.J.G., J.A.J.)
| | | |
Collapse
|
48
|
Frascaroli E, Canè MA, Pè ME, Pea G, Landi P. Characterization of heterotic quantitative trait loci in maize by evaluation of near-isogenic lines and their crosses at two competition levels. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012; 124:35-47. [PMID: 21953207 DOI: 10.1007/s00122-011-1685-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2010] [Accepted: 08/12/2011] [Indexed: 05/02/2023]
Abstract
In a previous study on a maize (Zea mays L.) population of recombinant inbreds derived from B73 × H99, we identified several quantitative trait loci (QTL) for agronomic traits with high dominance-additive ratio. Then, for four of these QTL, we developed families of near-isogenic lines (NILs) homozygous either for the QTL allele from B73 (BB) or from H99 (HH); for two of these QTL, the NILs' families were produced in two different genetic backgrounds. The present study was conducted to: (1) characterize these QTL for agronomic traits and (2) verify whether their effects were influenced by the genetic background, inbreeding level and plant density (PD). The six NILs' families were tested across 3 years and in three experiments at different inbreeding levels as NILs per se and their reciprocal crosses (Experiment 1), NILs crossed to related inbreds B73 and H99 (Experiment 2) and NILs crossed to four unrelated inbreds (Experiment 3). Experiment 2 was conducted at two PDs (4.5 and 9.0 plants m(-2)). Results of Experiments 1 and 2 confirmed previous findings as to QTL effects, with dominance-additive ratio superior to 1 for several traits; as a tendency, dominance effects were more pronounced in Experiment 1. The QTL effects were also confirmed in Experiment 3. The interactions involving QTL effects, families and PD were generally negligible, suggesting a certain stability of the QTL. Results emphasize the importance of dominance effects for these QTL, suggesting that they might deserve further studies, using the NILs' families and their crosses as base materials.
Collapse
Affiliation(s)
- Elisabetta Frascaroli
- Department of Agroenvironmental Sciences and Technologies, University of Bologna, Viale Fanin 44, 40127, Bologna, Italy.
| | | | | | | | | |
Collapse
|
49
|
Zheng LY, Guo XS, He B, Sun LJ, Peng Y, Dong SS, Liu TF, Jiang S, Ramachandran S, Liu CM, Jing HC. Genome-wide patterns of genetic variation in sweet and grain sorghum (Sorghum bicolor). Genome Biol 2011; 12:R114. [PMID: 22104744 PMCID: PMC3334600 DOI: 10.1186/gb-2011-12-11-r114] [Citation(s) in RCA: 167] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2011] [Revised: 11/04/2011] [Accepted: 11/21/2011] [Indexed: 01/22/2023] Open
Abstract
Background Sorghum (Sorghum bicolor) is globally produced as a source of food, feed, fiber and fuel. Grain and sweet sorghums differ in a number of important traits, including stem sugar and juice accumulation, plant height as well as grain and biomass production. The first whole genome sequence of a grain sorghum is available, but additional genome sequences are required to study genome-wide and intraspecific variation for dissecting the genetic basis of these important traits and for tailor-designed breeding of this important C4 crop. Results We resequenced two sweet and one grain sorghum inbred lines, and identified a set of nearly 1,500 genes differentiating sweet and grain sorghum. These genes fall into ten major metabolic pathways involved in sugar and starch metabolisms, lignin and coumarin biosynthesis, nucleic acid metabolism, stress responses and DNA damage repair. In addition, we uncovered 1,057,018 SNPs, 99,948 indels of 1 to 10 bp in length and 16,487 presence/absence variations as well as 17,111 copy number variations. The majority of the large-effect SNPs, indels and presence/absence variations resided in the genes containing leucine rich repeats, PPR repeats and disease resistance R genes possessing diverse biological functions or under diversifying selection, but were absent in genes that are essential for life. Conclusions This is a first report of the identification of genome-wide patterns of genetic variation in sorghum. High-density SNP and indel markers reported here will be a valuable resource for future gene-phenotype studies and the molecular breeding of this important crop and related species.
Collapse
Affiliation(s)
- Lei-Ying Zheng
- Institute of Botany, Chinese Academy of Sciences, Beijing, China.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Eichten SR, Swanson-Wagner RA, Schnable JC, Waters AJ, Hermanson PJ, Liu S, Yeh CT, Jia Y, Gendler K, Freeling M, Schnable PS, Vaughn MW, Springer NM. Heritable epigenetic variation among maize inbreds. PLoS Genet 2011; 7:e1002372. [PMID: 22125494 PMCID: PMC3219600 DOI: 10.1371/journal.pgen.1002372] [Citation(s) in RCA: 128] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2011] [Accepted: 09/20/2011] [Indexed: 11/26/2022] Open
Abstract
Epigenetic variation describes heritable differences that are not attributable to changes in DNA sequence. There is the potential for pure epigenetic variation that occurs in the absence of any genetic change or for more complex situations that involve both genetic and epigenetic differences. Methylation of cytosine residues provides one mechanism for the inheritance of epigenetic information. A genome-wide profiling of DNA methylation in two different genotypes of Zea mays (ssp. mays), an organism with a complex genome of interspersed genes and repetitive elements, allowed the identification and characterization of examples of natural epigenetic variation. The distribution of DNA methylation was profiled using immunoprecipitation of methylated DNA followed by hybridization to a high-density tiling microarray. The comparison of the DNA methylation levels in the two genotypes, B73 and Mo17, allowed for the identification of approximately 700 differentially methylated regions (DMRs). Several of these DMRs occur in genomic regions that are apparently identical by descent in B73 and Mo17 suggesting that they may be examples of pure epigenetic variation. The methylation levels of the DMRs were further studied in a panel of near-isogenic lines to evaluate the stable inheritance of the methylation levels and to assess the contribution of cis- and trans- acting information to natural epigenetic variation. The majority of DMRs that occur in genomic regions without genetic variation are controlled by cis-acting differences and exhibit relatively stable inheritance. This study provides evidence for naturally occurring epigenetic variation in maize, including examples of pure epigenetic variation that is not conditioned by genetic differences. The epigenetic differences are variable within maize populations and exhibit relatively stable trans-generational inheritance. The detected examples of epigenetic variation, including some without tightly linked genetic variation, may contribute to complex trait variation. Heritable variation within a species provides the basis for natural and artificial selection. A substantial portion of heritable variation is based on alterations in DNA sequence among individuals and is termed genetic variation. There is also evidence for epigenetic variation, which refers to heritable differences that are not caused by DNA sequence changes. Methylation of cytosine residues provides one molecular mechanism for epigenetic variation in many eukaryotic species. The genome-wide distribution of DNA methylation was assessed in two different inbred genotypes of maize to identify differentially methylated regions that may contribute to epigenetic variation. There are hundreds of genomic regions that have differences in DNA methylation levels in these two different genotypes, including methylation differences in regions without genetic variation. By studying the inheritance of the differential methylation in near-isogenic progeny of the two inbred lines, it is possible to demonstrate relatively stable inheritance of epigenetic variation, even in the absence of DNA sequence changes. The epigenetic variation among individuals of the same species may provide important contributions to phenotypic variation within a species even in the absence of genetic differences.
Collapse
Affiliation(s)
- Steve R. Eichten
- Microbial and Plant Genomics Institute, Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Ruth A. Swanson-Wagner
- Microbial and Plant Genomics Institute, Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - James C. Schnable
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, United States of America
| | - Amanda J. Waters
- Microbial and Plant Genomics Institute, Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Peter J. Hermanson
- Microbial and Plant Genomics Institute, Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - Sanzhen Liu
- Iowa State University, Ames, Iowa, United States of America
| | - Cheng-Ting Yeh
- Iowa State University, Ames, Iowa, United States of America
| | - Yi Jia
- Iowa State University, Ames, Iowa, United States of America
| | - Karla Gendler
- Texas Advanced Computing Center, University of Texas–Austin, Austin, Texas, United States of America
| | - Michael Freeling
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, California, United States of America
| | | | - Matthew W. Vaughn
- Texas Advanced Computing Center, University of Texas–Austin, Austin, Texas, United States of America
- * E-mail: (MWV); (NMS)
| | - Nathan M. Springer
- Microbial and Plant Genomics Institute, Department of Plant Biology, University of Minnesota, Saint Paul, Minnesota, United States of America
- * E-mail: (MWV); (NMS)
| |
Collapse
|