1
|
Sun L, Lai M, Ghouri F, Nawaz MA, Ali F, Baloch FS, Nadeem MA, Aasim M, Shahid MQ. Modern Plant Breeding Techniques in Crop Improvement and Genetic Diversity: From Molecular Markers and Gene Editing to Artificial Intelligence-A Critical Review. PLANTS (BASEL, SWITZERLAND) 2024; 13:2676. [PMID: 39409546 PMCID: PMC11478383 DOI: 10.3390/plants13192676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Revised: 09/08/2024] [Accepted: 09/22/2024] [Indexed: 10/20/2024]
Abstract
With the development of new technologies in recent years, researchers have made significant progress in crop breeding. Modern breeding differs from traditional breeding because of great changes in technical means and breeding concepts. Whereas traditional breeding initially focused on high yields, modern breeding focuses on breeding orientations based on different crops' audiences or by-products. The process of modern breeding starts from the creation of material populations, which can be constructed by natural mutagenesis, chemical mutagenesis, physical mutagenesis transfer DNA (T-DNA), Tos17 (endogenous retrotransposon), etc. Then, gene function can be mined through QTL mapping, Bulked-segregant analysis (BSA), Genome-wide association studies (GWASs), RNA interference (RNAi), and gene editing. Then, at the transcriptional, post-transcriptional, and translational levels, the functions of genes are described in terms of post-translational aspects. This article mainly discusses the application of the above modern scientific and technological methods of breeding and the advantages and limitations of crop breeding and diversity. In particular, the development of gene editing technology has contributed to modern breeding research.
Collapse
Affiliation(s)
- Lixia Sun
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou 510642, China; (L.S.); (M.L.); (F.G.)
- Guangdong Provincial Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Mingyu Lai
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou 510642, China; (L.S.); (M.L.); (F.G.)
- Guangdong Provincial Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Fozia Ghouri
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou 510642, China; (L.S.); (M.L.); (F.G.)
- Guangdong Provincial Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| | - Muhammad Amjad Nawaz
- Education Scientific Center of Nanotechnology, Far Eastern Federal University, 690091 Vladivostok, Russia;
| | - Fawad Ali
- School of Tropical Agriculture and Forestry, Hainan University, Sanya 572025, China;
| | - Faheem Shehzad Baloch
- Dapartment of Biotechnology, Faculty of Science, Mersin University, Mersin 33343, Türkiye;
| | - Muhammad Azhar Nadeem
- Faculty of Agricultural Sciences and Technologies, Sivas University of Science and Technology, Sivas 58140, Türkiye; (M.A.N.); (M.A.)
| | - Muhammad Aasim
- Faculty of Agricultural Sciences and Technologies, Sivas University of Science and Technology, Sivas 58140, Türkiye; (M.A.N.); (M.A.)
| | - Muhammad Qasim Shahid
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Laboratory for Lingnan Modern Agriculture, South China Agricultural University, Guangzhou 510642, China; (L.S.); (M.L.); (F.G.)
- Guangdong Provincial Key Laboratory of Plant Molecular Breeding, South China Agricultural University, Guangzhou 510642, China
| |
Collapse
|
2
|
Rai M, Tyagi W. Haplotype breeding for unlocking and utilizing plant genomics data. Front Genet 2022; 13:1006288. [DOI: 10.3389/fgene.2022.1006288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 10/28/2022] [Indexed: 11/17/2022] Open
|
3
|
Ling K, Yi-ning D, Majeed A, Zi-jiang Y, Jun-wen C, Li-lian H, Xian-hong W, Lu-feng L, Zhen-feng Q, Dan Z, Shu-jie G, Rong X, Lin-yan X, Fu X, Yang D, Fu-sheng L. Evaluation of genome size and phylogenetic relationships of the Saccharum complex species. 3 Biotech 2022; 12:327. [PMID: 36276474 PMCID: PMC9582063 DOI: 10.1007/s13205-022-03338-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Accepted: 08/16/2022] [Indexed: 11/29/2022] Open
Abstract
"Saccharum complex" is a hypothetical group of species, which is supposed to be involved in the origin of modern sugarcane, and displays large genomes and complex chromosomal alterations. The utilization of restricted parents in breeding programs of modern cultivated sugarcane has resulted in a genetic blockage, which controlled its improvement because of the limited genetic diversity. The use of wild relatives is an effective way to broaden the genetic composition of cultivated sugarcane. Due to the infrequent characterization of genomes, the potential of wild relatives is diffused in improving the cultivated sugarcane. To characterize the genomes of the wild relatives, the genome size and phylogenetic relationships among eight species, including Saccharum spontaneum, Erianthus arundinaceus, E. fulvus, E. rockii, Narenga porphyrocoma, Miscanthus floridulus, Eulalia quadrinervis, and M. sinensis were evaluated based on flow cytometry, genome surveys, K-mer analysis, chloroplast genome sequencing, and whole-genome SNPs analysis. We observed highly heterozygous genomes of S. spontaneum, E. rockii, and E. arundinaceus and the highly repetitive genome of E. fulvus. The genomes of Eulalia quadrinervis, N. porphyrocoma, M. sinensis, and M. floridulus were highly complex. Phylogenetic results of the two approaches were dissimilar, however, both indicate E. fulvus displayed closer relationships to Miscanthus and Saccharum than other species of Saccharum complex. Eulalia quadrinervis was more closely related to M. floridulus than M. sinensis; E. arundinaceus differ significantly from Miscanthus, Narenga, and Saccharum, but was relatively close to Erianthus. We proved the point of E. rockii and E. fulvus should not be classified as one genus, and E. fulvus should be classified as the Saccharum genus. Supplementary Information The online version contains supplementary material available at 10.1007/s13205-022-03338-5.
Collapse
Affiliation(s)
- Kui Ling
- The Key Laboratory for Crop Production and Intelligent Agriculture of Yunnan Province, Kunming, 650201 Yunnan China
- Shenzhen Qianhai Shekou Free Trade Zone Hospital, Shenzhen, 518067 China
| | - Di Yi-ning
- The Key Laboratory for Crop Production and Intelligent Agriculture of Yunnan Province, Kunming, 650201 Yunnan China
| | - Aasim Majeed
- School of Agricultural Biotechnology, Punjab Agriculture University, Ludhiana, 141004 India
| | - Yang Zi-jiang
- Applied Genomics Technology Laboratory, Yunnan Agricultural University, Kunming, 650201 Yunnan China
| | - Chen Jun-wen
- The Key Laboratory for Crop Production and Intelligent Agriculture of Yunnan Province, Kunming, 650201 Yunnan China
| | - He Li-lian
- College of Agronomy and Biotechnology, Yunnan Agricultural University, Kunming, 650201 Yunnan China
| | - Wang Xian-hong
- The Key Laboratory for Crop Production and Intelligent Agriculture of Yunnan Province, Kunming, 650201 Yunnan China
| | - Liu Lu-feng
- Sugarcane Research Institute, Yunnan Agricultural University, Kunming, 650201 Yunnan China
| | - Qian Zhen-feng
- The Key Laboratory for Crop Production and Intelligent Agriculture of Yunnan Province, Kunming, 650201 Yunnan China
| | - Zeng Dan
- College of Agronomy and Biotechnology, Yunnan Agricultural University, Kunming, 650201 Yunnan China
| | - Gu Shu-jie
- The Key Laboratory for Crop Production and Intelligent Agriculture of Yunnan Province, Kunming, 650201 Yunnan China
| | - Xu Rong
- College of Agronomy and Biotechnology, Yunnan Agricultural University, Kunming, 650201 Yunnan China
| | - Xie Lin-yan
- The Key Laboratory for Crop Production and Intelligent Agriculture of Yunnan Province, Kunming, 650201 Yunnan China
| | - Xu Fu
- Sugarcane Research Institute, Yunnan Agricultural University, Kunming, 650201 Yunnan China
| | - Dong Yang
- Applied Genomics Technology Laboratory, Yunnan Agricultural University, Kunming, 650201 Yunnan China
| | - Li Fu-sheng
- The Key Laboratory for Crop Production and Intelligent Agriculture of Yunnan Province, Kunming, 650201 Yunnan China
- Sugarcane Research Institute, Yunnan Agricultural University, Kunming, 650201 Yunnan China
- College of Agronomy and Biotechnology, Yunnan Agricultural University, Kunming, 650201 Yunnan China
| |
Collapse
|
4
|
Naqvi RZ, Siddiqui HA, Mahmood MA, Najeebullah S, Ehsan A, Azhar M, Farooq M, Amin I, Asad S, Mukhtar Z, Mansoor S, Asif M. Smart breeding approaches in post-genomics era for developing climate-resilient food crops. FRONTIERS IN PLANT SCIENCE 2022; 13:972164. [PMID: 36186056 PMCID: PMC9523482 DOI: 10.3389/fpls.2022.972164] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 08/15/2022] [Indexed: 06/16/2023]
Abstract
Improving the crop traits is highly required for the development of superior crop varieties to deal with climate change and the associated abiotic and biotic stress challenges. Climate change-driven global warming can trigger higher insect pest pressures and plant diseases thus affecting crop production sternly. The traits controlling genes for stress or disease tolerance are economically imperative in crop plants. In this scenario, the extensive exploration of available wild, resistant or susceptible germplasms and unraveling the genetic diversity remains vital for breeding programs. The dawn of next-generation sequencing technologies and omics approaches has accelerated plant breeding by providing the genome sequences and transcriptomes of several plants. The availability of decoded plant genomes offers an opportunity at a glance to identify candidate genes, quantitative trait loci (QTLs), molecular markers, and genome-wide association studies that can potentially aid in high throughput marker-assisted breeding. In recent years genomics is coupled with marker-assisted breeding to unravel the mechanisms to harness better better crop yield and quality. In this review, we discuss the aspects of marker-assisted breeding and recent perspectives of breeding approaches in the era of genomics, bioinformatics, high-tech phonemics, genome editing, and new plant breeding technologies for crop improvement. In nutshell, the smart breeding toolkit in the post-genomics era can steadily help in developing climate-smart future food crops.
Collapse
|
5
|
Tay Fernandez CG, Nestor BJ, Danilevicz MF, Gill M, Petereit J, Bayer PE, Finnegan PM, Batley J, Edwards D. Pangenomes as a Resource to Accelerate Breeding of Under-Utilised Crop Species. Int J Mol Sci 2022; 23:2671. [PMID: 35269811 PMCID: PMC8910360 DOI: 10.3390/ijms23052671] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 02/21/2022] [Accepted: 02/21/2022] [Indexed: 02/01/2023] Open
Abstract
Pangenomes are a rich resource to examine the genomic variation observed within a species or genera, supporting population genetics studies, with applications for the improvement of crop traits. Major crop species such as maize (Zea mays), rice (Oryza sativa), Brassica (Brassica spp.), and soybean (Glycine max) have had pangenomes constructed and released, and this has led to the discovery of valuable genes associated with disease resistance and yield components. However, pangenome data are not available for many less prominent crop species that are currently under-utilised. Despite many under-utilised species being important food sources in regional populations, the scarcity of genomic data for these species hinders their improvement. Here, we assess several under-utilised crops and review the pangenome approaches that could be used to build resources for their improvement. Many of these under-utilised crops are cultivated in arid or semi-arid environments, suggesting that novel genes related to drought tolerance may be identified and used for introgression into related major crop species. In addition, we discuss how previously collected data could be used to enrich pangenome functional analysis in genome-wide association studies (GWAS) based on studies in major crops. Considering the technological advances in genome sequencing, pangenome references for under-utilised species are becoming more obtainable, offering the opportunity to identify novel genes related to agro-morphological traits in these species.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | - David Edwards
- School of Biological Sciences, The University of Western Australia, Perth, WA 6009, Australia; (C.G.T.F.); (B.J.N.); (M.F.D.); (M.G.); (J.P.); (P.E.B.); (P.M.F.); (J.B.)
| |
Collapse
|
6
|
Tay Fernandez CG, Nestor BJ, Danilevicz MF, Marsh JI, Petereit J, Bayer PE, Batley J, Edwards D. Expanding Gene-Editing Potential in Crop Improvement with Pangenomes. Int J Mol Sci 2022; 23:ijms23042276. [PMID: 35216392 PMCID: PMC8879065 DOI: 10.3390/ijms23042276] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 02/14/2022] [Accepted: 02/15/2022] [Indexed: 02/01/2023] Open
Abstract
Pangenomes aim to represent the complete repertoire of the genome diversity present within a species or cohort of species, capturing the genomic structural variance between individuals. This genomic information coupled with phenotypic data can be applied to identify genes and alleles involved with abiotic stress tolerance, disease resistance, and other desirable traits. The characterisation of novel structural variants from pangenomes can support genome editing approaches such as Clustered Regularly Interspaced Short Palindromic Repeats and CRISPR associated protein Cas (CRISPR-Cas), providing functional information on gene sequences and new target sites in variant-specific genes with increased efficiency. This review discusses the application of pangenomes in genome editing and crop improvement, focusing on the potential of pangenomes to accurately identify target genes for CRISPR-Cas editing of plant genomes while avoiding adverse off-target effects. We consider the limitations of applying CRISPR-Cas editing with pangenome references and potential solutions to overcome these limitations.
Collapse
|
7
|
Ashraf MF, Hou D, Hussain Q, Imran M, Pei J, Ali M, Shehzad A, Anwar M, Noman A, Waseem M, Lin X. Entailing the Next-Generation Sequencing and Metabolome for Sustainable Agriculture by Improving Plant Tolerance. Int J Mol Sci 2022; 23:651. [PMID: 35054836 PMCID: PMC8775971 DOI: 10.3390/ijms23020651] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Revised: 12/23/2021] [Accepted: 12/29/2021] [Indexed: 02/07/2023] Open
Abstract
Crop production is a serious challenge to provide food for the 10 billion individuals forecasted to live across the globe in 2050. The scientists' emphasize establishing an equilibrium among diversity and quality of crops by enhancing yield to fulfill the increasing demand for food supply sustainably. The exploitation of genetic resources using genomics and metabolomics strategies can help generate resilient plants against stressors in the future. The innovation of the next-generation sequencing (NGS) strategies laid the foundation to unveil various plants' genetic potential and help us to understand the domestication process to unmask the genetic potential among wild-type plants to utilize for crop improvement. Nowadays, NGS is generating massive genomic resources using wild-type and domesticated plants grown under normal and harsh environments to explore the stress regulatory factors and determine the key metabolites. Improved food nutritional value is also the key to eradicating malnutrition problems around the globe, which could be attained by employing the knowledge gained through NGS and metabolomics to achieve suitability in crop yield. Advanced technologies can further enhance our understanding in defining the strategy to obtain a specific phenotype of a crop. Integration among bioinformatic tools and molecular techniques, such as marker-assisted, QTLs mapping, creation of reference genome, de novo genome assembly, pan- and/or super-pan-genomes, etc., will boost breeding programs. The current article provides sequential progress in NGS technologies, a broad application of NGS, enhancement of genetic manipulation resources, and understanding the crop response to stress by producing plant metabolites. The NGS and metabolomics utilization in generating stress-tolerant plants/crops without deteriorating a natural ecosystem is considered a sustainable way to improve agriculture production. This highlighted knowledge also provides useful research that explores the suitable resources for agriculture sustainability.
Collapse
Affiliation(s)
- Muhammad Furqan Ashraf
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, 666 Wusu Street, Lin’An, Hangzhou 311300, China; (M.F.A.); (D.H.); (Q.H.); (J.P.)
| | - Dan Hou
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, 666 Wusu Street, Lin’An, Hangzhou 311300, China; (M.F.A.); (D.H.); (Q.H.); (J.P.)
| | - Quaid Hussain
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, 666 Wusu Street, Lin’An, Hangzhou 311300, China; (M.F.A.); (D.H.); (Q.H.); (J.P.)
| | - Muhammad Imran
- Colleges of Agriculture and Horticulture, South China Agricultural University, Guangzhou 510642, China; (M.I.); (M.W.)
| | - Jialong Pei
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, 666 Wusu Street, Lin’An, Hangzhou 311300, China; (M.F.A.); (D.H.); (Q.H.); (J.P.)
| | - Mohsin Ali
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing 100101, China;
| | - Aamar Shehzad
- Maize Research Station, AARI, Faisalabad 38000, Pakistan;
| | - Muhammad Anwar
- Guangdong Technology Research Center for Marine Algal Bioengineering, Guangdong Key Laboratory of Plant Epigenetics, College of Life Sciences and Oceanography, Shenzhen University, Shenzhen 518055, China;
| | - Ali Noman
- Department of Botany, Government College University, Faisalabad 38000, Pakistan;
| | - Muhammad Waseem
- Colleges of Agriculture and Horticulture, South China Agricultural University, Guangzhou 510642, China; (M.I.); (M.W.)
| | - Xinchun Lin
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, 666 Wusu Street, Lin’An, Hangzhou 311300, China; (M.F.A.); (D.H.); (Q.H.); (J.P.)
| |
Collapse
|
8
|
Razzaq A, Saleem F, Wani SH, Abdelmohsen SAM, Alyousef HA, Abdelbacki AMM, Alkallas FH, Tamam N, Elansary HO. De-novo Domestication for Improving Salt Tolerance in Crops. FRONTIERS IN PLANT SCIENCE 2021; 12:681367. [PMID: 34603347 PMCID: PMC8481614 DOI: 10.3389/fpls.2021.681367] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Accepted: 07/12/2021] [Indexed: 05/21/2023]
Abstract
Global agriculture production is under serious threat from rapidly increasing population and adverse climate changes. Food security is currently a huge challenge to feed 10 billion people by 2050. Crop domestication through conventional approaches is not good enough to meet the food demands and unable to fast-track the crop yields. Also, intensive breeding and rigorous selection of superior traits causes genetic erosion and eliminates stress-responsive genes, which makes crops more prone to abiotic stresses. Salt stress is one of the most prevailing abiotic stresses that poses severe damages to crop yield around the globe. Recent innovations in state-of-the-art genomics and transcriptomics technologies have paved the way to develop salinity tolerant crops. De novo domestication is one of the promising strategies to produce superior new crop genotypes through exploiting the genetic diversity of crop wild relatives (CWRs). Next-generation sequencing (NGS) technologies open new avenues to identifying the unique salt-tolerant genes from the CWRs. It has also led to the assembly of highly annotated crop pan-genomes to snapshot the full landscape of genetic diversity and recapture the huge gene repertoire of a species. The identification of novel genes alongside the emergence of cutting-edge genome editing tools for targeted manipulation renders de novo domestication a way forward for developing salt-tolerance crops. However, some risk associated with gene-edited crops causes hurdles for its adoption worldwide. Halophytes-led breeding for salinity tolerance provides an alternative strategy to identify extremely salt tolerant varieties that can be used to develop new crops to mitigate salinity stress.
Collapse
Affiliation(s)
- Ali Razzaq
- Centre of Agricultural Biochemistry and Biotechnology, University of Agriculture, Faisalabad, Pakistan
| | - Fozia Saleem
- Centre of Agricultural Biochemistry and Biotechnology, University of Agriculture, Faisalabad, Pakistan
| | - Shabir Hussain Wani
- Division of Genetics and Plant Breeding, Sher-E-Kashmir University of Agricultural Sciences and Technology of Kashmir, Srinagar, India
| | - Shaimaa A. M. Abdelmohsen
- Physics Department, Faculty of Science, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
| | - Haifa A. Alyousef
- Physics Department, Faculty of Science, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
| | | | - Fatemah H. Alkallas
- Physics Department, Faculty of Science, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
| | - Nissren Tamam
- Physics Department, Faculty of Science, Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia
| | - Hosam O. Elansary
- Plant Production Department, College of Food and Agriculture Sciences, King Saud University, Riyadh, Saudi Arabia
| |
Collapse
|
9
|
Razzaq A, Kaur P, Akhter N, Wani SH, Saleem F. Next-Generation Breeding Strategies for Climate-Ready Crops. FRONTIERS IN PLANT SCIENCE 2021; 12:620420. [PMID: 34367194 PMCID: PMC8336580 DOI: 10.3389/fpls.2021.620420] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 06/14/2021] [Indexed: 05/17/2023]
Abstract
Climate change is a threat to global food security due to the reduction of crop productivity around the globe. Food security is a matter of concern for stakeholders and policymakers as the global population is predicted to bypass 10 billion in the coming years. Crop improvement via modern breeding techniques along with efficient agronomic practices innovations in microbiome applications, and exploiting the natural variations in underutilized crops is an excellent way forward to fulfill future food requirements. In this review, we describe the next-generation breeding tools that can be used to increase crop production by developing climate-resilient superior genotypes to cope with the future challenges of global food security. Recent innovations in genomic-assisted breeding (GAB) strategies allow the construction of highly annotated crop pan-genomes to give a snapshot of the full landscape of genetic diversity (GD) and recapture the lost gene repertoire of a species. Pan-genomes provide new platforms to exploit these unique genes or genetic variation for optimizing breeding programs. The advent of next-generation clustered regularly interspaced short palindromic repeat/CRISPR-associated (CRISPR/Cas) systems, such as prime editing, base editing, and de nova domestication, has institutionalized the idea that genome editing is revamped for crop improvement. Also, the availability of versatile Cas orthologs, including Cas9, Cas12, Cas13, and Cas14, improved the editing efficiency. Now, the CRISPR/Cas systems have numerous applications in crop research and successfully edit the major crop to develop resistance against abiotic and biotic stress. By adopting high-throughput phenotyping approaches and big data analytics tools like artificial intelligence (AI) and machine learning (ML), agriculture is heading toward automation or digitalization. The integration of speed breeding with genomic and phenomic tools can allow rapid gene identifications and ultimately accelerate crop improvement programs. In addition, the integration of next-generation multidisciplinary breeding platforms can open exciting avenues to develop climate-ready crops toward global food security.
Collapse
Affiliation(s)
- Ali Razzaq
- Centre of Agricultural Biochemistry and Biotechnology (CABB), University of Agriculture, Faisalabad, Pakistan
| | - Parwinder Kaur
- UWA School of Agriculture and Environment, The University of Western Australia, Perth, WA, Australia
| | - Naheed Akhter
- College of Allied Health Professional, Faculty of Medical Sciences, Government College University Faisalabad, Faisalabad, Pakistan
| | - Shabir Hussain Wani
- Mountain Research Center for Field Crops, Khudwani, Sher-e-Kashmir University of Agricultural Sciences and Technology of Kashmir, Srinagar, India
| | - Fozia Saleem
- Centre of Agricultural Biochemistry and Biotechnology (CABB), University of Agriculture, Faisalabad, Pakistan
| |
Collapse
|
10
|
Jayakodi M, Schreiber M, Stein N, Mascher M. Building pan-genome infrastructures for crop plants and their use in association genetics. DNA Res 2021; 28:6117190. [PMID: 33484244 PMCID: PMC7934568 DOI: 10.1093/dnares/dsaa030] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Indexed: 12/20/2022] Open
Abstract
Pan-genomic studies aim at representing the entire sequence diversity within a species to provide useful resources for evolutionary studies, functional genomics and breeding of cultivated plants. Cost reductions in high-throughput sequencing and advances in sequence assembly algorithms have made it possible to create multiple reference genomes along with a catalogue of all forms of genetic variations in plant species with large and complex or polyploid genomes. In this review, we summarize the current approaches to building pan-genomes as an in silico representation of plant sequence diversity and outline relevant methods for their effective utilization in linking structural with phenotypic variation. We propose as future research avenues (i) transcriptomic and epigenomic studies across multiple reference genomes and (ii) the development of user-friendly and feature-rich pan-genome browsers.
Collapse
Affiliation(s)
- Murukarthick Jayakodi
- Department of Genebank, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Mona Schreiber
- Department of Genebank, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Nils Stein
- Department of Genebank, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany.,Center for Integrated Breeding Research (CiBreed), Georg-August-University Göttingen, Göttingen, Germany
| | - Martin Mascher
- Department of Genebank, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany.,German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Saxony, Germany
| |
Collapse
|
11
|
Zenda T, Liu S, Dong A, Duan H. Advances in Cereal Crop Genomics for Resilience under Climate Change. Life (Basel) 2021; 11:502. [PMID: 34072447 PMCID: PMC8228855 DOI: 10.3390/life11060502] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Revised: 05/21/2021] [Accepted: 05/25/2021] [Indexed: 12/12/2022] Open
Abstract
Adapting to climate change, providing sufficient human food and nutritional needs, and securing sufficient energy supplies will call for a radical transformation from the current conventional adaptation approaches to more broad-based and transformative alternatives. This entails diversifying the agricultural system and boosting productivity of major cereal crops through development of climate-resilient cultivars that can sustainably maintain higher yields under climate change conditions, expanding our focus to crop wild relatives, and better exploitation of underutilized crop species. This is facilitated by the recent developments in plant genomics, such as advances in genome sequencing, assembly, and annotation, as well as gene editing technologies, which have increased the availability of high-quality reference genomes for various model and non-model plant species. This has necessitated genomics-assisted breeding of crops, including underutilized species, consequently broadening genetic variation of the available germplasm; improving the discovery of novel alleles controlling important agronomic traits; and enhancing creation of new crop cultivars with improved tolerance to biotic and abiotic stresses and superior nutritive quality. Here, therefore, we summarize these recent developments in plant genomics and their application, with particular reference to cereal crops (including underutilized species). Particularly, we discuss genome sequencing approaches, quantitative trait loci (QTL) mapping and genome-wide association (GWAS) studies, directed mutagenesis, plant non-coding RNAs, precise gene editing technologies such as CRISPR-Cas9, and complementation of crop genotyping by crop phenotyping. We then conclude by providing an outlook that, as we step into the future, high-throughput phenotyping, pan-genomics, transposable elements analysis, and machine learning hold much promise for crop improvements related to climate resilience and nutritional superiority.
Collapse
Affiliation(s)
- Tinashe Zenda
- State Key Laboratory of North China Crop Improvement and Regulation, Hebei Agricultural University, Baoding 071001, China; (S.L.); (A.D.)
- North China Key Laboratory for Crop Germplasm Resources of the Education Ministry, Hebei Agricultural University, Baoding 071001, China
- Department of Crop Genetics and Breeding, College of Agronomy, Hebei Agricultural University, Baoding 071001, China
- Department of Crop Science, Faculty of Agriculture and Environmental Science, Bindura University of Science Education, Bindura P. Bag 1020, Zimbabwe
| | - Songtao Liu
- State Key Laboratory of North China Crop Improvement and Regulation, Hebei Agricultural University, Baoding 071001, China; (S.L.); (A.D.)
- North China Key Laboratory for Crop Germplasm Resources of the Education Ministry, Hebei Agricultural University, Baoding 071001, China
- Department of Crop Genetics and Breeding, College of Agronomy, Hebei Agricultural University, Baoding 071001, China
| | - Anyi Dong
- State Key Laboratory of North China Crop Improvement and Regulation, Hebei Agricultural University, Baoding 071001, China; (S.L.); (A.D.)
- North China Key Laboratory for Crop Germplasm Resources of the Education Ministry, Hebei Agricultural University, Baoding 071001, China
- Department of Crop Genetics and Breeding, College of Agronomy, Hebei Agricultural University, Baoding 071001, China
| | - Huijun Duan
- State Key Laboratory of North China Crop Improvement and Regulation, Hebei Agricultural University, Baoding 071001, China; (S.L.); (A.D.)
- North China Key Laboratory for Crop Germplasm Resources of the Education Ministry, Hebei Agricultural University, Baoding 071001, China
- Department of Crop Genetics and Breeding, College of Agronomy, Hebei Agricultural University, Baoding 071001, China
| |
Collapse
|
12
|
Martina M, Tikunov Y, Portis E, Bovy AG. The Genetic Basis of Tomato Aroma. Genes (Basel) 2021; 12:genes12020226. [PMID: 33557308 PMCID: PMC7915847 DOI: 10.3390/genes12020226] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 01/29/2021] [Accepted: 02/01/2021] [Indexed: 02/06/2023] Open
Abstract
Tomato (Solanum lycopersicum L.) aroma is determined by the interaction of volatile compounds (VOCs) released by the tomato fruits with receptors in the nose, leading to a sensorial impression, such as "sweet", "smoky", or "fruity" aroma. Of the more than 400 VOCs released by tomato fruits, 21 have been reported as main contributors to the perceived tomato aroma. These VOCs can be grouped in five clusters, according to their biosynthetic origins. In the last decades, a vast array of scientific studies has investigated the genetic component of tomato aroma in modern tomato cultivars and their relatives. In this paper we aim to collect, compare, integrate and summarize the available literature on flavour-related QTLs in tomato. Three hundred and 5ifty nine (359) QTLs associated with tomato fruit VOCs were physically mapped on the genome and investigated for the presence of potential candidate genes. This review makes it possible to (i) pinpoint potential donors described in literature for specific traits, (ii) highlight important QTL regions by combining information from different populations, and (iii) pinpoint potential candidate genes. This overview aims to be a valuable resource for researchers aiming to elucidate the genetics underlying tomato flavour and for breeders who aim to improve tomato aroma.
Collapse
Affiliation(s)
- Matteo Martina
- DISAFA, Plant Genetics and Breeding, University of Turin, 10095 Grugliasco, Italy;
| | - Yury Tikunov
- Plant Breeding, Wageningen University & Research, P.O. Box 386, 6700 AJ Wageningen, The Netherlands;
| | - Ezio Portis
- DISAFA, Plant Genetics and Breeding, University of Turin, 10095 Grugliasco, Italy;
- Correspondence: (E.P.); (A.G.B.); Tel.: +39-011-6708807 (E.P.); +31-317-480762 (A.G.B.)
| | - Arnaud G. Bovy
- Plant Breeding, Wageningen University & Research, P.O. Box 386, 6700 AJ Wageningen, The Netherlands;
- Correspondence: (E.P.); (A.G.B.); Tel.: +39-011-6708807 (E.P.); +31-317-480762 (A.G.B.)
| |
Collapse
|
13
|
Della Coletta R, Qiu Y, Ou S, Hufford MB, Hirsch CN. How the pan-genome is changing crop genomics and improvement. Genome Biol 2021; 22:3. [PMID: 33397434 PMCID: PMC7780660 DOI: 10.1186/s13059-020-02224-8] [Citation(s) in RCA: 101] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Accepted: 12/07/2020] [Indexed: 01/13/2023] Open
Abstract
Crop genomics has seen dramatic advances in recent years due to improvements in sequencing technology, assembly methods, and computational resources. These advances have led to the development of new tools to facilitate crop improvement. The study of structural variation within species and the characterization of the pan-genome has revealed extensive genome content variation among individuals within a species that is paradigm shifting to crop genomics and improvement. Here, we review advances in crop genomics and how utilization of these tools is shifting in light of pan-genomes that are becoming available for many crop species.
Collapse
Affiliation(s)
- Rafael Della Coletta
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108 USA
| | - Yinjie Qiu
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108 USA
| | - Shujun Ou
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011 USA
| | - Matthew B. Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011 USA
| | - Candice N. Hirsch
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108 USA
| |
Collapse
|
14
|
Yang Y, Saand MA, Huang L, Abdelaal WB, Zhang J, Wu Y, Li J, Sirohi MH, Wang F. Applications of Multi-Omics Technologies for Crop Improvement. FRONTIERS IN PLANT SCIENCE 2021; 12:563953. [PMID: 34539683 PMCID: PMC8446515 DOI: 10.3389/fpls.2021.563953] [Citation(s) in RCA: 69] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 08/06/2021] [Indexed: 05/19/2023]
Abstract
Multiple "omics" approaches have emerged as successful technologies for plant systems over the last few decades. Advances in next-generation sequencing (NGS) have paved a way for a new generation of different omics, such as genomics, transcriptomics, and proteomics. However, metabolomics, ionomics, and phenomics have also been well-documented in crop science. Multi-omics approaches with high throughput techniques have played an important role in elucidating growth, senescence, yield, and the responses to biotic and abiotic stress in numerous crops. These omics approaches have been implemented in some important crops including wheat (Triticum aestivum L.), soybean (Glycine max), tomato (Solanum lycopersicum), barley (Hordeum vulgare L.), maize (Zea mays L.), millet (Setaria italica L.), cotton (Gossypium hirsutum L.), Medicago truncatula, and rice (Oryza sativa L.). The integration of functional genomics with other omics highlights the relationships between crop genomes and phenotypes under specific physiological and environmental conditions. The purpose of this review is to dissect the role and integration of multi-omics technologies for crop breeding science. We highlight the applications of various omics approaches, such as genomics, transcriptomics, proteomics, metabolomics, phenomics, and ionomics, and the implementation of robust methods to improve crop genetics and breeding science. Potential challenges that confront the integration of multi-omics with regard to the functional analysis of genes and their networks as well as the development of potential traits for crop improvement are discussed. The panomics platform allows for the integration of complex omics to construct models that can be used to predict complex traits. Systems biology integration with multi-omics datasets can enhance our understanding of molecular regulator networks for crop improvement. In this context, we suggest the integration of entire omics by employing the "phenotype to genotype" and "genotype to phenotype" concept. Hence, top-down (phenotype to genotype) and bottom-up (genotype to phenotype) model through integration of multi-omics with systems biology may be beneficial for crop breeding improvement under conditions of environmental stresses.
Collapse
Affiliation(s)
- Yaodong Yang
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
- *Correspondence: Yaodong Yang
| | - Mumtaz Ali Saand
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
- Department of Botany, Shah Abdul Latif University, Khairpur, Pakistan
| | - Liyun Huang
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | - Walid Badawy Abdelaal
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | - Jun Zhang
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | - Yi Wu
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | - Jing Li
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| | | | - Fuyou Wang
- Hainan Key Laboratory of Tropical Oil Crops Biology/Coconut Research Institute, Chinese Academy of Tropical Agricultural Sciences, Wenchang, China
| |
Collapse
|
15
|
Ruperao P, Thirunavukkarasu N, Gandham P, Selvanayagam S, Govindaraj M, Nebie B, Manyasa E, Gupta R, Das RR, Odeny DA, Gandhi H, Edwards D, Deshpande SP, Rathore A. Sorghum Pan-Genome Explores the Functional Utility for Genomic-Assisted Breeding to Accelerate the Genetic Gain. FRONTIERS IN PLANT SCIENCE 2021; 12:666342. [PMID: 34140962 PMCID: PMC8204017 DOI: 10.3389/fpls.2021.666342] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Accepted: 04/28/2021] [Indexed: 05/05/2023]
Abstract
Sorghum (Sorghum bicolor L.) is a staple food crops in the arid and rainfed production ecologies. Sorghum plays a critical role in resilient farming and is projected as a smart crop to overcome the food and nutritional insecurity in the developing world. The development and characterisation of the sorghum pan-genome will provide insight into genome diversity and functionality, supporting sorghum improvement. We built a sorghum pan-genome using reference genomes as well as 354 genetically diverse sorghum accessions belonging to different races. We explored the structural and functional characteristics of the pan-genome and explain its utility in supporting genetic gain. The newly-developed pan-genome has a total of 35,719 genes, a core genome of 16,821 genes and an average of 32,795 genes in each cultivar. The variable genes are enriched with environment responsive genes and classify the sorghum accessions according to their race. We show that 53% of genes display presence-absence variation, and some of these variable genes are predicted to be functionally associated with drought adaptation traits. Using more than two million SNPs from the pan-genome, association analysis identified 398 SNPs significantly associated with important agronomic traits, of which, 92 were in genes. Drought gene expression analysis identified 1,788 genes that are functionally linked to different conditions, of which 79 were absent from the reference genome assembly. This study provides comprehensive genomic diversity resources in sorghum which can be used in genome assisted crop improvement.
Collapse
Affiliation(s)
- Pradeep Ruperao
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
| | | | - Prasad Gandham
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
| | | | | | - Baloua Nebie
- Sorghum Breeding Program, International Crops Research Institute for the Semi-Arid Tropics, Bamako, Mali
| | - Eric Manyasa
- Sorghum Breeding Program, International Crops Research Institute for the Semi-Arid Tropics, Nairobi, Kenya
| | - Rajeev Gupta
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
| | - Roma Rani Das
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
| | - Damaris A. Odeny
- Sorghum Breeding Program, International Crops Research Institute for the Semi-Arid Tropics, Nairobi, Kenya
| | - Harish Gandhi
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
| | - David Edwards
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Perth, WA, Australia
| | - Santosh P. Deshpande
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
- Santosh P. Deshpande
| | - Abhishek Rathore
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
- *Correspondence: Abhishek Rathore
| |
Collapse
|
16
|
Scott MF, Ladejobi O, Amer S, Bentley AR, Biernaskie J, Boden SA, Clark M, Dell'Acqua M, Dixon LE, Filippi CV, Fradgley N, Gardner KA, Mackay IJ, O'Sullivan D, Percival-Alwyn L, Roorkiwal M, Singh RK, Thudi M, Varshney RK, Venturini L, Whan A, Cockram J, Mott R. Multi-parent populations in crops: a toolbox integrating genomics and genetic mapping with breeding. Heredity (Edinb) 2020; 125:396-416. [PMID: 32616877 PMCID: PMC7784848 DOI: 10.1038/s41437-020-0336-6] [Citation(s) in RCA: 91] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2020] [Revised: 06/16/2020] [Accepted: 06/16/2020] [Indexed: 11/21/2022] Open
Abstract
Crop populations derived from experimental crosses enable the genetic dissection of complex traits and support modern plant breeding. Among these, multi-parent populations now play a central role. By mixing and recombining the genomes of multiple founders, multi-parent populations combine many commonly sought beneficial properties of genetic mapping populations. For example, they have high power and resolution for mapping quantitative trait loci, high genetic diversity and minimal population structure. Many multi-parent populations have been constructed in crop species, and their inbred germplasm and associated phenotypic and genotypic data serve as enduring resources. Their utility has grown from being a tool for mapping quantitative trait loci to a means of providing germplasm for breeding programmes. Genomics approaches, including de novo genome assemblies and gene annotations for the population founders, have allowed the imputation of rich sequence information into the descendent population, expanding the breadth of research and breeding applications of multi-parent populations. Here, we report recent successes from crop multi-parent populations in crops. We also propose an ideal genotypic, phenotypic and germplasm 'package' that multi-parent populations should feature to optimise their use as powerful community resources for crop research, development and breeding.
Collapse
Affiliation(s)
| | | | - Samer Amer
- University of Reading, Reading, RG6 6AH, UK
- Faculty of Agriculture, Alexandria University, Alexandria, 23714, Egypt
| | - Alison R Bentley
- The John Bingham Laboratory, NIAB, 93 Lawrence Weaver Road, Cambridge, CB3 0LE, UK
| | - Jay Biernaskie
- Department of Plant Sciences, University of Oxford, South Parks Road, Oxford, OX1 3RB, UK
| | - Scott A Boden
- School of Agriculture, Food and Wine, University of Adelaide, Glen Osmond, SA, 5064, Australia
| | | | | | - Laura E Dixon
- Faculty of Biological Sciences, University of Leeds, Leeds, LS2 9JT, UK
| | - Carla V Filippi
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), INTA-CONICET, Nicolas Repetto y Los Reseros s/n, 1686, Hurlingham, Buenos Aires, Argentina
| | - Nick Fradgley
- The John Bingham Laboratory, NIAB, 93 Lawrence Weaver Road, Cambridge, CB3 0LE, UK
| | - Keith A Gardner
- The John Bingham Laboratory, NIAB, 93 Lawrence Weaver Road, Cambridge, CB3 0LE, UK
| | - Ian J Mackay
- SRUC, West Mains Road, Kings Buildings, Edinburgh, EH9 3JG, UK
| | | | | | - Manish Roorkiwal
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Rakesh Kumar Singh
- International Center for Biosaline Agriculture, Academic City, Dubai, United Arab Emirates
| | - Mahendar Thudi
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Rajeev Kumar Varshney
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | | | - Alex Whan
- CSIRO, GPO Box 1700, Canberra, ACT, 2601, Australia
| | - James Cockram
- The John Bingham Laboratory, NIAB, 93 Lawrence Weaver Road, Cambridge, CB3 0LE, UK
| | - Richard Mott
- UCL Genetics Institute, Gower Street, London, WC1E 6BT, UK
| |
Collapse
|
17
|
Abstract
The giant sequoia (Sequoiadendron giganteum) of California are massive, long-lived trees that grow along the U.S. Sierra Nevada mountains. Genomic data are limited in giant sequoia and producing a reference genome sequence has been an important goal to allow marker development for restoration and management. Using deep-coverage Illumina and Oxford Nanopore sequencing, combined with Dovetail chromosome conformation capture libraries, the genome was assembled into eleven chromosome-scale scaffolds containing 8.125 Gbp of sequence. Iso-Seq transcripts, assembled from three distinct tissues, was used as evidence to annotate a total of 41,632 protein-coding genes. The genome was found to contain, distributed unevenly across all 11 chromosomes and in 63 orthogroups, over 900 complete or partial predicted NLR genes, of which 375 are supported by annotation derived from protein evidence and gene modeling. This giant sequoia reference genome sequence represents the first genome sequenced in the Cupressaceae family, and lays a foundation for using genomic tools to aid in giant sequoia conservation and management.
Collapse
|
18
|
Kumar J, Sen Gupta D. Prospects of next generation sequencing in lentil breeding. Mol Biol Rep 2020; 47:9043-9053. [PMID: 33037962 DOI: 10.1007/s11033-020-05891-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2020] [Accepted: 10/03/2020] [Indexed: 11/28/2022]
Abstract
Lentil is an important food legume crop that has large and complex genome. During past years, considerable attention has been given on the use of next generation sequencing for enriching the genomic resources including identification of SSR and SNP markers, development of unigenes, transcripts, and identification of candidate genes for biotic and abiotic stresses, analysis of genetic diversity and identification of genes/ QTLs for agronomically important traits. However, in other crops including pulses, next generation sequencing has revolutionized the genomic research and helped in genomic assisted breeding rapidly and cost effectively. The present review discuss current status and future prospects of the use NGS based breeding in lentil.
Collapse
Affiliation(s)
- Jitendra Kumar
- Division of Crop Improvement, ICAR-Indian Institute of Pulses Research, Kalyanpur, Kanpur, 208024, India.
| | - Debjyoti Sen Gupta
- Division of Crop Improvement, ICAR-Indian Institute of Pulses Research, Kalyanpur, Kanpur, 208024, India
| |
Collapse
|
19
|
Krasnov GS, Pushkova EN, Novakovskiy RO, Kudryavtseva LP, Rozhmina TA, Dvorianinova EM, Povkhova LV, Kudryavtseva AV, Dmitriev AA, Melnikova NV. High-Quality Genome Assembly of Fusarium oxysporum f. sp. lini. Front Genet 2020; 11:959. [PMID: 33193577 PMCID: PMC7481384 DOI: 10.3389/fgene.2020.00959] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2019] [Accepted: 07/30/2020] [Indexed: 12/31/2022] Open
Abstract
In the present work, a highly pathogenic isolate of Fusarium oxysporum f. sp. lini, which is the most harmful pathogen affecting flax (Linum usitatissimum L.), was sequenced for the first time. To achieve a high-quality genome assembly, we used the combination of two sequencing platforms - Oxford Nanopore Technologies (MinION system) with long noisy reads and Illumina (HiSeq 2500 instrument) with short accurate reads. Given the quality of DNA is crucial for Nanopore sequencing, we developed the protocol for extraction of pure high-molecular-weight DNA from fungi. Sequencing of DNA extracted using this protocol allowed us to obtain about 85x genome coverage with long (N50 = 29 kb) MinION reads and 30x coverage with 2 × 250 bp HiSeq reads. Several tools were developed for genome assembly; however, they provide different results depending on genome complexity, sequencing data volume, read length and quality. We benchmarked the most requested assemblers (Canu, Flye, Shasta, wtdbg2, and MaSuRCA), Nanopore polishers (Medaka and Racon), and Illumina polishers (Pilon and POLCA) on our sequencing data. The assembly performed with Canu and polished with Medaka and POLCA was considered the most full and accurate. After further elimination of redundant contigs using Purge Haplotigs, we achieved a high-quality genome of F. oxysporum f. sp. lini with a total length of 59 Mb, N50 of 3.3 Mb, and 99.5% completeness according to BUSCO. We also obtained a complete circular mitochondrial genome with a length of 38.7 kb. The achieved assembly expands studies on F. oxysporum and plant-pathogen interaction in flax.
Collapse
Affiliation(s)
- George S. Krasnov
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Elena N. Pushkova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Roman O. Novakovskiy
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | | | - Tatiana A. Rozhmina
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
- Federal Research Center for Bast Fiber Crops, Torzhok, Russia
| | - Ekaterina M. Dvorianinova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
- Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| | - Liubov V. Povkhova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
- Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| | - Anna V. Kudryavtseva
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Alexey A. Dmitriev
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Nataliya V. Melnikova
- Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
20
|
A chromosome-level genome assembly of the wild rice Oryza rufipogon facilitates tracing the origins of Asian cultivated rice. SCIENCE CHINA-LIFE SCIENCES 2020; 64:282-293. [PMID: 32737856 DOI: 10.1007/s11427-020-1738-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2020] [Accepted: 05/19/2020] [Indexed: 01/15/2023]
Abstract
Oryza rufipogon Griff. is a wild progenitor of the Asian cultivated rice Oryza sativa. To better understand the genomic diversity of the wild rice, high-quality reference genomes of O. rufipogon populations are needed, which also facilitate utilization of the wild genetic resources in rice breeding. In this study, we generated a chromosome-level genome assembly of O. rufipogon using a combination of short-read sequencing, single-molecule sequencing, BioNano and Hi-C platforms. The genome sequence (399.8 Mb) was assembled into 46 scaffolds on the 12 chromosomes, with contig N50 and scaffold N50 of 13.2 Mb and 20.3 Mb, respectively. The genome contains 36,520 protein-coding genes, and 49.37% of the genome consists of repetitive elements. The genome has strong synteny with those of the O. sativa subspecies indica and japonica, but containing some large structural variations. Evolutionary analysis unveiled the polyphyletic origins of O. sativa, in which the japonica and indica genome formations involved different divergent O. rufipogon (including O. nivara) lineages, accompanied by introgression of genomic regions between japonica and indica. This high-quality reference genome provides insight on the genome evolution of the wild rice and the origins of the O. sativa subspecies, and valuable information for basic research and rice breeding.
Collapse
|
21
|
Campbell MT, Du Q, Liu K, Sharma S, Zhang C, Walia H. Characterization of the transcriptional divergence between the subspecies of cultivated rice (Oryza sativa). BMC Genomics 2020; 21:394. [PMID: 32513103 PMCID: PMC7278148 DOI: 10.1186/s12864-020-06786-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2019] [Accepted: 05/19/2020] [Indexed: 01/24/2023] Open
Abstract
Background Cultivated rice consists of two subspecies, Indica and Japonica, that exhibit well-characterized differences at the morphological and genetic levels. However, the differences between these subspecies at the transcriptome level remains largely unexamined. Here, we provide a comprehensive characterization of transcriptome divergence and cis-regulatory variation within rice using transcriptome data from 91 accessions from a rice diversity panel (RDP1). Results The transcriptomes of the two subspecies of rice are highly divergent. Japonica have significantly lower expression and genetic diversity relative to Indica, which is likely a consequence of a population bottleneck during Japonica domestication. We leveraged high-density genotypic data and transcript levels to identify cis-regulatory variants that may explain the genetic divergence between the subspecies. We identified significantly more eQTL that were specific to the Indica subspecies compared to Japonica, suggesting that the observed differences in expression and genetic variability also extends to cis-regulatory variation. Conclusions Using RNA sequencing data for 91diverse rice accessions and high-density genotypic data, we show that the two species are highly divergent with respect to gene expression levels, as well as the genetic regulation of expression. The data generated by this study provide, to date, the largest collection of genome-wide transcriptional levels for rice, and provides a community resource to accelerate functional genomic studies in rice.
Collapse
Affiliation(s)
- Malachy T Campbell
- Department of Agronomy and Horticulture, University of Nebraska Lincoln, 1825 N 38th St., Lincoln, 68583, NE, USA. .,Department of Animal and Poultry Sciences, Virginia Polytechnic Institute and State University, 175 West Campus Drive, Blacksburg, 24060, VA, USA.
| | - Qian Du
- School of Biological Sciences, University of Nebraska Lincoln, 1901 Vine St., Lincoln, 68503, NE, USA
| | - Kan Liu
- School of Biological Sciences, University of Nebraska Lincoln, 1901 Vine St., Lincoln, 68503, NE, USA
| | - Sandeep Sharma
- Department of Agronomy and Horticulture, University of Nebraska Lincoln, 1825 N 38th St., Lincoln, 68583, NE, USA.,Marine Biotechnology and Ecology Division, CSIR-CSMCRI, Bhavnagar, Gujarat, India
| | - Chi Zhang
- School of Biological Sciences, University of Nebraska Lincoln, 1901 Vine St., Lincoln, 68503, NE, USA
| | - Harkamal Walia
- Department of Agronomy and Horticulture, University of Nebraska Lincoln, 1825 N 38th St., Lincoln, 68583, NE, USA.
| |
Collapse
|
22
|
Seo J, Lee SM, Han JH, Shin NH, Lee YK, Kim B, Chin JH, Koh HJ. Characterization of the Common Japonica-Originated Genomic Regions in the High-Yielding Varieties Developed from Inter-Subspecific Crosses in Temperate Rice ( Oryza sativa L.). Genes (Basel) 2020; 11:genes11050562. [PMID: 32443496 PMCID: PMC7290844 DOI: 10.3390/genes11050562] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Revised: 05/11/2020] [Accepted: 05/14/2020] [Indexed: 01/18/2023] Open
Abstract
The inter-subspecific crossing between indica and japonica subspecies in rice have been utilized to improve the yield potential of temperate rice. In this study, a comparative study of the genomic regions in the eight high-yielding varieties (HYVs) was conducted with those of the four non-HYVs. The Next-Generation Sequencing (NGS) mapping on the Nipponbare reference genome identified a total of 14 common genomic regions of japonica-originated alleles. Interestingly, the HYVs shared japonica-originated genomic regions on nine chromosomes, although they were developed through different breeding programs. A panel of 94 varieties was classified into four varietal groups with 38 single nucleotide polymorphism (SNP) markers from 38 genes residing in the japonica-originated genomic regions and 16 additional trait-specific SNPs. As expected, the japonica-originated genomic regions were only present in the japonica (JAP) and HYV groups, except for Chr4-1 and Chr4-2. The Wx gene, located within Chr6-1, was present in the HYV and JAP variety groups, while the yield-related genes were conserved as indica alleles in HYVs. The japonica-originated genomic regions and alleles shared by HYVs can be employed in molecular breeding programs to further develop the HYVs in temperate rice.
Collapse
Affiliation(s)
- Jeonghwan Seo
- Department of Plant Science and Research Institute for Agriculture and Life Sciences, and Plant Genomics and Breeding Institute, Seoul National University, Seoul 08826, Korea; (J.S.); (S.-M.L.); (Y.K.L.); (B.K.)
| | - So-Myeong Lee
- Department of Plant Science and Research Institute for Agriculture and Life Sciences, and Plant Genomics and Breeding Institute, Seoul National University, Seoul 08826, Korea; (J.S.); (S.-M.L.); (Y.K.L.); (B.K.)
- Department of Southern Area Crop Science, National Institute of Crop Science, RDA, Miryang 50424, Korea
| | - Jae-Hyuk Han
- Department of Integrative Biological Sciences and Industry, Sejong University, 209, Neungdong-ro, Gwangjin-gu, Seoul 05006, Korea; (J.-H.H.); (N.-H.S.)
| | - Na-Hyun Shin
- Department of Integrative Biological Sciences and Industry, Sejong University, 209, Neungdong-ro, Gwangjin-gu, Seoul 05006, Korea; (J.-H.H.); (N.-H.S.)
| | - Yoon Kyung Lee
- Department of Plant Science and Research Institute for Agriculture and Life Sciences, and Plant Genomics and Breeding Institute, Seoul National University, Seoul 08826, Korea; (J.S.); (S.-M.L.); (Y.K.L.); (B.K.)
| | - Backki Kim
- Department of Plant Science and Research Institute for Agriculture and Life Sciences, and Plant Genomics and Breeding Institute, Seoul National University, Seoul 08826, Korea; (J.S.); (S.-M.L.); (Y.K.L.); (B.K.)
| | - Joong Hyoun Chin
- Department of Integrative Biological Sciences and Industry, Sejong University, 209, Neungdong-ro, Gwangjin-gu, Seoul 05006, Korea; (J.-H.H.); (N.-H.S.)
- Correspondence: (J.H.C.); (H.-J.K.)
| | - Hee-Jong Koh
- Department of Plant Science and Research Institute for Agriculture and Life Sciences, and Plant Genomics and Breeding Institute, Seoul National University, Seoul 08826, Korea; (J.S.); (S.-M.L.); (Y.K.L.); (B.K.)
- Correspondence: (J.H.C.); (H.-J.K.)
| |
Collapse
|
23
|
De novo Genome Assembly of the indica Rice Variety IR64 Using Linked-Read Sequencing and Nanopore Sequencing. G3-GENES GENOMES GENETICS 2020; 10:1495-1501. [PMID: 32184372 PMCID: PMC7202035 DOI: 10.1534/g3.119.400871] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
IR64 is a rice variety with high-yield that has been widely cultivated around the world. IR64 has been replaced by modern varieties in most growing areas. Given that modern varieties are mostly progenies or relatives of IR64, genetic analysis of IR64 is valuable for rice functional genomics. However, chromosome-level genome sequences of IR64 have not been available previously. Here, we sequenced the IR64 genome using synthetic long reads obtained by linked-read sequencing and ultra-long reads obtained by nanopore sequencing. We integrated these data and generated the de novo assembly of the IR64 genome of 367 Mb, equivalent to 99% of the estimated size. Continuity of the IR64 genome assembly was improved compared with that of a publicly available IR64 genome assembly generated by short reads only. We annotated 41,458 protein-coding genes, including 657 IR64-specific genes, that are missing in other high-quality rice genome assemblies IRGSP-1.0 of japonica cultivar Nipponbare or R498 of indica cultivar Shuhui498. The IR64 genome assembly will serve as a genome resource for rice functional genomics as well as genomics-driven and/or molecular breeding.
Collapse
|
24
|
Abstract
Since the early days of the genome era, the scientific community has relied on a single 'reference' genome for each species, which is used as the basis for a wide range of genetic analyses, including studies of variation within and across species. As sequencing costs have dropped, thousands of new genomes have been sequenced, and scientists have come to realize that a single reference genome is inadequate for many purposes. By sampling a diverse set of individuals, one can begin to assemble a pan-genome: a collection of all the DNA sequences that occur in a species. Here we review efforts to create pan-genomes for a range of species, from bacteria to humans, and we further consider the computational methods that have been proposed in order to capture, interpret and compare pan-genome data. As scientists continue to survey and catalogue the genomic variation across human populations and begin to assemble a human pan-genome, these efforts will increase our power to connect variation to human diversity, disease and beyond.
Collapse
Affiliation(s)
- Rachel M Sherman
- Department of Computer Science, Whiting School of Engineering, Johns Hopkins University, Baltimore, MD, USA.
- Center for Computational Biology, Whiting School of Engineering, Johns Hopkins University, Baltimore, MD, USA.
| | - Steven L Salzberg
- Department of Computer Science, Whiting School of Engineering, Johns Hopkins University, Baltimore, MD, USA
- Center for Computational Biology, Whiting School of Engineering, Johns Hopkins University, Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, MD, USA
- Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, USA
| |
Collapse
|
25
|
Choi JY, Lye ZN, Groen SC, Dai X, Rughani P, Zaaijer S, Harrington ED, Juul S, Purugganan MD. Nanopore sequencing-based genome assembly and evolutionary genomics of circum-basmati rice. Genome Biol 2020; 21:21. [PMID: 32019604 PMCID: PMC7001208 DOI: 10.1186/s13059-020-1938-2] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2019] [Accepted: 01/17/2020] [Indexed: 01/23/2023] Open
Abstract
Background The circum-basmati group of cultivated Asian rice (Oryza sativa) contains many iconic varieties and is widespread in the Indian subcontinent. Despite its economic and cultural importance, a high-quality reference genome is currently lacking, and the group’s evolutionary history is not fully resolved. To address these gaps, we use long-read nanopore sequencing and assemble the genomes of two circum-basmati rice varieties. Results We generate two high-quality, chromosome-level reference genomes that represent the 12 chromosomes of Oryza. The assemblies show a contig N50 of 6.32 Mb and 10.53 Mb for Basmati 334 and Dom Sufid, respectively. Using our highly contiguous assemblies, we characterize structural variations segregating across circum-basmati genomes. We discover repeat expansions not observed in japonica—the rice group most closely related to circum-basmati—as well as the presence and absence variants of over 20 Mb, one of which is a circum-basmati-specific deletion of a gene regulating awn length. We further detect strong evidence of admixture between the circum-basmati and circum-aus groups. This gene flow has its greatest effect on chromosome 10, causing both structural variation and single-nucleotide polymorphism to deviate from genome-wide history. Lastly, population genomic analysis of 78 circum-basmati varieties shows three major geographically structured genetic groups: Bhutan/Nepal, India/Bangladesh/Myanmar, and Iran/Pakistan. Conclusion The availability of high-quality reference genomes allows functional and evolutionary genomic analyses providing genome-wide evidence for gene flow between circum-aus and circum-basmati, describes the nature of circum-basmati structural variation, and reveals the presence/absence variation in this important and iconic rice variety group.
Collapse
Affiliation(s)
- Jae Young Choi
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA.
| | - Zoe N Lye
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA
| | - Simon C Groen
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA
| | | | | | | | | | - Sissel Juul
- Oxford Nanopore Technologies, New York, NY, USA
| | - Michael D Purugganan
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA. .,Center for Genomics and Systems Biology, NYU Abu Dhabi Research Institute, New York University Abu Dhabi, Abu Dhabi, United Arab Emirates.
| |
Collapse
|
26
|
Read AC, Moscou MJ, Zimin AV, Pertea G, Meyer RS, Purugganan MD, Leach JE, Triplett LR, Salzberg SL, Bogdanove AJ. Genome assembly and characterization of a complex zfBED-NLR gene-containing disease resistance locus in Carolina Gold Select rice with Nanopore sequencing. PLoS Genet 2020; 16:e1008571. [PMID: 31986137 PMCID: PMC7004385 DOI: 10.1371/journal.pgen.1008571] [Citation(s) in RCA: 72] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2019] [Revised: 02/06/2020] [Accepted: 12/16/2019] [Indexed: 12/26/2022] Open
Abstract
Long-read sequencing facilitates assembly of complex genomic regions. In plants, loci containing nucleotide-binding, leucine-rich repeat (NLR) disease resistance genes are an important example of such regions. NLR genes constitute one of the largest gene families in plants and are often clustered, evolving via duplication, contraction, and transposition. We recently mapped the Xo1 locus for resistance to bacterial blight and bacterial leaf streak, found in the American heirloom rice variety Carolina Gold Select, to a region that in the Nipponbare reference genome is NLR gene-rich. Here, toward identification of the Xo1 gene, we combined Nanopore and Illumina reads and generated a high-quality Carolina Gold Select genome assembly. We identified 529 complete or partial NLR genes and discovered, relative to Nipponbare, an expansion of NLR genes at the Xo1 locus. One of these has high sequence similarity to the cloned, functionally similar Xa1 gene. Both harbor an integrated zfBED domain, and the repeats within each protein are nearly perfect. Across diverse Oryzeae, we identified two sub-clades of NLR genes with these features, varying in the presence of the zfBED domain and the number of repeats. The Carolina Gold Select genome assembly also uncovered at the Xo1 locus a rice blast resistance gene and a gene encoding a polyphenol oxidase (PPO). PPO activity has been used as a marker for blast resistance at the locus in some varieties; however, the Carolina Gold Select sequence revealed a loss-of-function mutation in the PPO gene that breaks this association. Our results demonstrate that whole genome sequencing combining Nanopore and Illumina reads effectively resolves NLR gene loci. Our identification of an Xo1 candidate is an important step toward mechanistic characterization, including the role(s) of the zfBED domain. Finally, the Carolina Gold Select genome assembly will facilitate identification of other useful traits in this historically important variety. Plants lack adaptive immunity, and instead contain repeat-rich, disease resistance genes that evolve rapidly through duplication, recombination, and transposition. The number, variation, and often clustered arrangement of these genes make them challenging to sequence and catalog. The US heirloom rice variety Carolina Gold Select has resistance to two important bacterial diseases. Toward identifying the responsible gene(s), we combined long- and short-read sequencing technologies to assemble the whole genome and identify the resistance gene repertoire. We previously narrowed the location of the gene(s) to a region on chromosome four. The region in Carolina Gold Select is larger than in the rice reference genome (Nipponbare) and contains twice as many resistance genes. One shares unusual features with a known bacterial disease resistance gene, suggesting that it confers the resistance. Across diverse varieties and related species, we identified two widely-distributed groups of such genes. The results are an important step toward mechanistic characterization and deployment of the bacterial disease resistance. The genome assembly also identified a resistance gene for a fungal disease and predicted a marker phenotype used in breeding for resistance. Thus, the Carolina Gold Select genome assembly can be expected to aid in the identification and deployment of other valuable traits.
Collapse
Affiliation(s)
- Andrew C. Read
- Plant Pathology and Plant Microbe Biology Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States of America
| | - Matthew J. Moscou
- The Sainsbury Laboratory, University of East Anglia, Norwich, United Kingdom
| | - Aleksey V. Zimin
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD, United States of America
| | - Geo Pertea
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD, United States of America
| | - Rachel S. Meyer
- Center for Genomics and Systems Biology, New York University, New York, NY, United States of America
| | - Michael D. Purugganan
- Center for Genomics and Systems Biology, New York University, New York, NY, United States of America
- Center for Genomics and Biology, New York University Abu Dhabi, Saadiyat Island, Abu Dhabi, United Arab Emirates
| | - Jan E. Leach
- Department of Bioagricultural Sciences and Pest Management, Colorado State University, Fort Collins, CO, United States of America
| | - Lindsay R. Triplett
- Department of Bioagricultural Sciences and Pest Management, Colorado State University, Fort Collins, CO, United States of America
| | - Steven L. Salzberg
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD, United States of America
- Departments of Biomedical Engineering, Computer Science, and Biostatistics, Johns Hopkins University, Baltimore, MD, United States of America
| | - Adam J. Bogdanove
- Plant Pathology and Plant Microbe Biology Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States of America
- * E-mail:
| |
Collapse
|
27
|
Jain R, Jenkins J, Shu S, Chern M, Martin JA, Copetti D, Duong PQ, Pham NT, Kudrna DA, Talag J, Schackwitz WS, Lipzen AM, Dilworth D, Bauer D, Grimwood J, Nelson CR, Xing F, Xie W, Barry KW, Wing RA, Schmutz J, Li G, Ronald PC. Genome sequence of the model rice variety KitaakeX. BMC Genomics 2019; 20:905. [PMID: 31775618 PMCID: PMC6882167 DOI: 10.1186/s12864-019-6262-4] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2019] [Accepted: 11/05/2019] [Indexed: 01/02/2023] Open
Abstract
BACKGROUND The availability of thousands of complete rice genome sequences from diverse varieties and accessions has laid the foundation for in-depth exploration of the rice genome. One drawback to these collections is that most of these rice varieties have long life cycles, and/or low transformation efficiencies, which limits their usefulness as model organisms for functional genomics studies. In contrast, the rice variety Kitaake has a rapid life cycle (9 weeks seed to seed) and is easy to transform and propagate. For these reasons, Kitaake has emerged as a model for studies of diverse monocotyledonous species. RESULTS Here, we report the de novo genome sequencing and analysis of Oryza sativa ssp. japonica variety KitaakeX, a Kitaake plant carrying the rice XA21 immune receptor. Our KitaakeX sequence assembly contains 377.6 Mb, consisting of 33 scaffolds (476 contigs) with a contig N50 of 1.4 Mb. Complementing the assembly are detailed gene annotations of 35,594 protein coding genes. We identified 331,335 genomic variations between KitaakeX and Nipponbare (ssp. japonica), and 2,785,991 variations between KitaakeX and Zhenshan97 (ssp. indica). We also compared Kitaake resequencing reads to the KitaakeX assembly and identified 219 small variations. The high-quality genome of the model rice plant KitaakeX will accelerate rice functional genomics. CONCLUSIONS The high quality, de novo assembly of the KitaakeX genome will serve as a useful reference genome for rice and will accelerate functional genomics studies of rice and other species.
Collapse
Affiliation(s)
- Rashmi Jain
- Department of Plant Pathology and the Genome Center, University of California, One Shields Avenue, Davis, CA, 95616, USA.,Feedstocks Division, Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Jerry Jenkins
- U.S. Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA.,HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Shengqiang Shu
- U.S. Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Mawsheng Chern
- Department of Plant Pathology and the Genome Center, University of California, One Shields Avenue, Davis, CA, 95616, USA.,Feedstocks Division, Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Joel A Martin
- U.S. Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Dario Copetti
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA.,Molecular Plant Breeding, Institute of Agricultural Sciences, ETH Zurich, Universitaetstrasse 2, 8092, Zurich, Switzerland.,Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, 8057, Zurich, Switzerland
| | - Phat Q Duong
- Department of Plant Pathology and the Genome Center, University of California, One Shields Avenue, Davis, CA, 95616, USA.,Feedstocks Division, Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Nikki T Pham
- Department of Plant Pathology and the Genome Center, University of California, One Shields Avenue, Davis, CA, 95616, USA
| | - David A Kudrna
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA.,BIO5 Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Jayson Talag
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA.,BIO5 Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Wendy S Schackwitz
- U.S. Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Anna M Lipzen
- U.S. Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - David Dilworth
- U.S. Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Diane Bauer
- U.S. Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Jane Grimwood
- U.S. Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA.,HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Catherine R Nelson
- Department of Plant Pathology and the Genome Center, University of California, One Shields Avenue, Davis, CA, 95616, USA
| | - Feng Xing
- National Key Laboratory of Crop Genetic Improvement, National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan, 430070, China
| | - Weibo Xie
- National Key Laboratory of Crop Genetic Improvement, National Center of Plant Gene Research (Wuhan), Huazhong Agricultural University, Wuhan, 430070, China
| | - Kerrie W Barry
- U.S. Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Rod A Wing
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA.,BIO5 Institute, School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA.,International Rice Research Institute, Genetic Resource Center, Los Baños, Laguna, Philippines
| | - Jeremy Schmutz
- U.S. Department of Energy, Joint Genome Institute, Walnut Creek, CA, 94598, USA.,HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Guotian Li
- Department of Plant Pathology and the Genome Center, University of California, One Shields Avenue, Davis, CA, 95616, USA. .,Feedstocks Division, Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA. .,The Provincial Key Lab of Plant Pathology of Hubei Province and College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, 430070, Hubei, China.
| | - Pamela C Ronald
- Department of Plant Pathology and the Genome Center, University of California, One Shields Avenue, Davis, CA, 95616, USA. .,Feedstocks Division, Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA.
| |
Collapse
|
28
|
Kim B. Classifying Oryza sativa accessions into Indica and Japonica using logistic regression model with phenotypic data. PeerJ 2019; 7:e7259. [PMID: 31720092 PMCID: PMC6842562 DOI: 10.7717/peerj.7259] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Accepted: 06/05/2019] [Indexed: 11/26/2022] Open
Abstract
In Oryza sativa, indica and japonica are pivotal subpopulations, and other subpopulations such as aus and aromatic are considered to be derived from indica or japonica. In this regard, Oryza sativa accessions are frequently viewed from the indica/japonica perspective. This study introduces a computational method for indica/japonica classification by applying phenotypic variables to the logistic regression model (LRM). The population used in this study included 413 Oryza sativa accessions, of which 280 accessions were indica or japonica. Out of 24 phenotypic variables, a set of seven phenotypic variables was identified to collectively generate the fully accurate indica/japonica separation power of the LRM. The resulting parameters were used to define the customized LRM. Given the 280 indica/japonica accessions, the classification accuracy of the customized LRM along with the set of seven phenotypic variables was estimated by 100 iterations of ten-fold cross-validations. As a result, the classification accuracy of 100% was achieved. This suggests that the LRM can be an effective tool to analyze the indica/japonica classification with phenotypic variables in Oryza sativa.
Collapse
Affiliation(s)
- Bongsong Kim
- Noble Research Institute LLC, Ardmore, OK, Carter, United States of America
| |
Collapse
|
29
|
Verma H, Borah JL, Sarma RN. Variability Assessment for Root and Drought Tolerance Traits and Genetic Diversity Analysis of Rice Germplasm using SSR Markers. Sci Rep 2019; 9:16513. [PMID: 31712622 PMCID: PMC6848176 DOI: 10.1038/s41598-019-52884-1] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2019] [Accepted: 10/23/2019] [Indexed: 11/16/2022] Open
Abstract
The studies on genetic variation, diversity and population structure of rice germplasm of North East India could be an important step for improvements of abiotic and biotic stress tolerance in rice. Genetic diversity and genetic relatedness among 114 rice genotypes of North East India were assessed using genotypic data of 65 SSR markers and phenotypic data. The phenotypic diversity analysis showed the considerable variation across genotypes for root, shoot and drought tolerance traits. The principal component analysis (PCA) revealed the fresh shoot weight, root volume, dry shoot weight, fresh root weight and drought score as a major contributor to diversity. Genotyping of 114 rice genotypes using 65 SSR markers detected 147 alleles with the average polymorphic information content (PIC) value of 0.51. Population structure analysis using the Bayesian clustering model approach, distance-based neighbor-joining cluster and principal coordinate analysis using genotypic data grouped the accession into three sub-populations. Population structure analysis revealed that rice accession was moderately structured based on FST value estimates. Analysis of molecular variance (AMOVA) and pairwise FST values showed significant differentiation among all the pairs of sub-population ranging from 0.152 to 0.222 suggesting that all the three subpopulations were significantly different from each other. AMOVA revealed that most of the variation in rice accession mainly occurred among individuals. The present study suggests that diverse germplasm of NE India could be used for the improvement of root and drought tolerance in rice breeding programmes.
Collapse
Affiliation(s)
- H Verma
- Department of Plant Breeding & Genetics, Assam Agricultural University, Jorhat, 785013, Assam, India.
| | - J L Borah
- Department of Plant Breeding & Genetics, Assam Agricultural University, Jorhat, 785013, Assam, India
| | - R N Sarma
- Department of Plant Breeding & Genetics, Assam Agricultural University, Jorhat, 785013, Assam, India.
| |
Collapse
|
30
|
Oliva R, Ji C, Atienza-Grande G, Huguet-Tapia JC, Perez-Quintero A, Li T, Eom JS, Li C, Nguyen H, Liu B, Auguy F, Sciallano C, Luu VT, Dossa GS, Cunnac S, Schmidt SM, Slamet-Loedin IH, Vera Cruz C, Szurek B, Frommer WB, White FF, Yang B. Broad-spectrum resistance to bacterial blight in rice using genome editing. Nat Biotechnol 2019; 37:1344-1350. [PMID: 31659337 PMCID: PMC6831514 DOI: 10.1038/s41587-019-0267-z] [Citation(s) in RCA: 341] [Impact Index Per Article: 56.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Accepted: 08/28/2019] [Indexed: 02/01/2023]
Abstract
Bacterial blight of rice is an important disease in Asia and Africa. The pathogen, Xanthomonas oryzae pv. oryzae (Xoo), secretes one or more of six known transcription-activator-like effectors (TALes) that bind specific promoter sequences and induce, at minimum, one of the three host sucrose transporter genes SWEET11, SWEET13 and SWEET14, the expression of which is required for disease susceptibility. We used CRISPR-Cas9-mediated genome editing to introduce mutations in all three SWEET gene promoters. Editing was further informed by sequence analyses of TALe genes in 63 Xoo strains, which revealed multiple TALe variants for SWEET13 alleles. Mutations were also created in SWEET14, which is also targeted by two TALes from an African Xoo lineage. A total of five promoter mutations were simultaneously introduced into the rice line Kitaake and the elite mega varieties IR64 and Ciherang-Sub1. Paddy trials showed that genome-edited SWEET promoters endow rice lines with robust, broad-spectrum resistance.
Collapse
Affiliation(s)
- Ricardo Oliva
- International Rice Research Institute, Metro Manila, Philippines.
| | - Chonghui Ji
- Division of Plant Sciences, Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Genelou Atienza-Grande
- International Rice Research Institute, Metro Manila, Philippines
- College of Agriculture and Food Science, University of the Philippines Los Baños, Los Baños, Philippines
| | | | - Alvaro Perez-Quintero
- IRD, CIRAD, Université Montpellier, IPME, Montpellier, France
- Department of Bioagricultural Sciences and Pest Management, Colorado State University, Fort Collins, CO, USA
| | - Ting Li
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, USA
| | - Joon-Seob Eom
- Institute for Molecular Physiology and Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich Heine Universität Düsseldorf and Max Planck Institute for Plant Breeding Research, Köln, Germany
| | - Chenhao Li
- Division of Plant Sciences, Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Hanna Nguyen
- International Rice Research Institute, Metro Manila, Philippines
| | - Bo Liu
- Division of Plant Sciences, Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Florence Auguy
- IRD, CIRAD, Université Montpellier, IPME, Montpellier, France
| | | | - Van T Luu
- Institute for Molecular Physiology and Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich Heine Universität Düsseldorf and Max Planck Institute for Plant Breeding Research, Köln, Germany
| | | | | | - Sarah M Schmidt
- Institute for Molecular Physiology and Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich Heine Universität Düsseldorf and Max Planck Institute for Plant Breeding Research, Köln, Germany
| | | | | | - Boris Szurek
- IRD, CIRAD, Université Montpellier, IPME, Montpellier, France
| | - Wolf B Frommer
- Institute for Molecular Physiology and Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich Heine Universität Düsseldorf and Max Planck Institute for Plant Breeding Research, Köln, Germany.
- Institute of Transformative Bio-Molecules (WPI-ITbM), Nagoya University, Aichi, Japan.
| | - Frank F White
- Department of Plant Pathology, University of Florida, Gainesville, FL, USA
| | - Bing Yang
- Division of Plant Sciences, Bond Life Sciences Center, University of Missouri, Columbia, MO, USA.
- Donald Danforth Plant Science Center, St. Louis, MO, USA.
| |
Collapse
|
31
|
Ou S, Chen J, Jiang N. Assessing genome assembly quality using the LTR Assembly Index (LAI). Nucleic Acids Res 2019; 46:e126. [PMID: 30107434 PMCID: PMC6265445 DOI: 10.1093/nar/gky730] [Citation(s) in RCA: 286] [Impact Index Per Article: 47.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2018] [Accepted: 07/31/2018] [Indexed: 12/15/2022] Open
Abstract
Assembling a plant genome is challenging due to the abundance of repetitive sequences, yet no standard is available to evaluate the assembly of repeat space. LTR retrotransposons (LTR-RTs) are the predominant interspersed repeat that is poorly assembled in draft genomes. Here, we propose a reference-free genome metric called LTR Assembly Index (LAI) that evaluates assembly continuity using LTR-RTs. After correcting for LTR-RT amplification dynamics, we show that LAI is independent of genome size, genomic LTR-RT content, and gene space evaluation metrics (i.e., BUSCO and CEGMA). By comparing genomic sequences produced by various sequencing techniques, we reveal the significant gain of assembly continuity by using long-read-based techniques over short-read-based methods. Moreover, LAI can facilitate iterative assembly improvement with assembler selection and identify low-quality genomic regions. To apply LAI, intact LTR-RTs and total LTR-RTs should contribute at least 0.1% and 5% to the genome size, respectively. The LAI program is freely available on GitHub: https://github.com/oushujun/LTR_retriever.
Collapse
Affiliation(s)
- Shujun Ou
- Department of Horticulture, Michigan State University, East Lansing, MI 48824, USA.,Program in Ecology, Evolutionary Biology and Behavior, Michigan State University, East Lansing, MI 48824, USA
| | - Jinfeng Chen
- Department of Plant Pathology and Microbiology, University of California, Riverside, CA 92507, USA
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, MI 48824, USA.,Program in Ecology, Evolutionary Biology and Behavior, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
32
|
Abstract
Motivation Whole-genome alignment (WGA) methods show insufficient scalability toward the generation of large-scale WGAs. Profile alignment-based approaches revolutionized the fields of multiple sequence alignment construction methods by significantly reducing computational complexity and runtime. However, WGAs need to consider genomic rearrangements between genomes, which make the profile-based extension of several whole-genomes challenging. Currently, none of the available methods offer the possibility to align or extend WGA profiles. Results Here, we present genome profile alignment, an approach that aligns the profiles of WGAs and that is capable of producing large-scale WGAs many times faster than conventional methods. Our concept relies on already available whole-genome aligners, which are used to compute several smaller sets of aligned genomes that are combined to a full WGA with a divide and conquer approach. To align or extend WGA profiles, we make use of the SuperGenome data structure, which features a bidirectional mapping between individual sequence and alignment coordinates. This data structure is used to efficiently transfer different coordinate systems into a common one based on the principles of profiles alignments. The approach allows the computation of a WGA where alignments are subsequently merged along a guide tree. The current implementation uses progressiveMauve and offers the possibility for parallel computation of independent genome alignments. Our results based on various bacterial datasets up to several hundred genomes show that we can reduce the runtime from months to hours with a quality that is negligibly worse than the WGA computed with the conventional progressiveMauve tool. Availability and implementation GPA is freely available at https://lambda.informatik.uni-tuebingen.de/gitlab/ahennig/GPA. GPA is implemented in Java, uses progressiveMauve and offers a parallel computation of WGAs. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- André Hennig
- Center for Bioinformatics (ZBIT), Integrative Transcriptomics, Eberhard Karls University of Tübingen, Tübingen, Germany
| | - Kay Nieselt
- Center for Bioinformatics (ZBIT), Integrative Transcriptomics, Eberhard Karls University of Tübingen, Tübingen, Germany
| |
Collapse
|
33
|
Cooper EA, Brenton ZW, Flinn BS, Jenkins J, Shu S, Flowers D, Luo F, Wang Y, Xia P, Barry K, Daum C, Lipzen A, Yoshinaga Y, Schmutz J, Saski C, Vermerris W, Kresovich S. A new reference genome for Sorghum bicolor reveals high levels of sequence similarity between sweet and grain genotypes: implications for the genetics of sugar metabolism. BMC Genomics 2019; 20:420. [PMID: 31133004 PMCID: PMC6537160 DOI: 10.1186/s12864-019-5734-x] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2018] [Accepted: 04/24/2019] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND The process of crop domestication often consists of two stages: initial domestication, where the wild species is first cultivated by humans, followed by diversification, when the domesticated species are subsequently adapted to more environments and specialized uses. Selective pressure to increase sugar accumulation in certain varieties of the cereal crop Sorghum bicolor is an excellent example of the latter; this has resulted in pronounced phenotypic divergence between sweet and grain-type sorghums, but the genetic mechanisms underlying these differences remain poorly understood. RESULTS Here we present a new reference genome based on an archetypal sweet sorghum line and compare it to the current grain sorghum reference, revealing a high rate of nonsynonymous and potential loss of function mutations, but few changes in gene content or overall genome structure. We also use comparative transcriptomics to highlight changes in gene expression correlated with high stalk sugar content and show that changes in the activity and possibly localization of transporters, along with the timing of sugar metabolism play a critical role in the sweet phenotype. CONCLUSIONS The high level of genomic similarity between sweet and grain sorghum reflects their historical relatedness, rather than their current phenotypic differences, but we find key changes in signaling molecules and transcriptional regulators that represent new candidates for understanding and improving sugar metabolism in this important crop.
Collapse
Affiliation(s)
- Elizabeth A. Cooper
- Advanced Plant Technology Program, Clemson University, Clemson, SC USA
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC USA
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC USA
| | - Zachary W. Brenton
- Advanced Plant Technology Program, Clemson University, Clemson, SC USA
- Department of Plant and Environmental Sciences, Clemson University, Clemson, SC USA
| | - Barry S. Flinn
- Advanced Plant Technology Program, Clemson University, Clemson, SC USA
| | - Jerry Jenkins
- HudsonAlpha Institute for Biotechnology, Huntsville, AL USA
| | - Shengqiang Shu
- Department of Energy, Joint Genome Institute, Walnut Creek, CA 94598 USA
| | - Dave Flowers
- HudsonAlpha Institute for Biotechnology, Huntsville, AL USA
| | - Feng Luo
- School of Computing, Clemson University, Clemson, SC USA
| | - Yunsheng Wang
- School of Computing, Clemson University, Clemson, SC USA
- School of Plant Protection, Hunan Agricultural University, Changsha, 410128 China
| | - Penny Xia
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC USA
| | - Kerrie Barry
- Department of Energy, Joint Genome Institute, Walnut Creek, CA 94598 USA
| | - Chris Daum
- Department of Energy, Joint Genome Institute, Walnut Creek, CA 94598 USA
| | - Anna Lipzen
- Department of Energy, Joint Genome Institute, Walnut Creek, CA 94598 USA
| | - Yuko Yoshinaga
- Department of Energy, Joint Genome Institute, Walnut Creek, CA 94598 USA
| | - Jeremy Schmutz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL USA
- Department of Energy, Joint Genome Institute, Walnut Creek, CA 94598 USA
| | - Christopher Saski
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC USA
- Department of Plant and Environmental Sciences, Clemson University, Clemson, SC USA
| | - Wilfred Vermerris
- Department of Microbiology and Cell Science and UF Genetics Institute, University of Florida, Gainesville, FL USA
| | - Stephen Kresovich
- Advanced Plant Technology Program, Clemson University, Clemson, SC USA
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC USA
- Department of Plant and Environmental Sciences, Clemson University, Clemson, SC USA
| |
Collapse
|
34
|
Juanillas V, Dereeper A, Beaume N, Droc G, Dizon J, Mendoza JR, Perdon JP, Mansueto L, Triplett L, Lang J, Zhou G, Ratharanjan K, Plale B, Haga J, Leach JE, Ruiz M, Thomson M, Alexandrov N, Larmande P, Kretzschmar T, Mauleon RP. Rice Galaxy: an open resource for plant science. Gigascience 2019; 8:giz028. [PMID: 31107941 PMCID: PMC6527052 DOI: 10.1093/gigascience/giz028] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2018] [Revised: 08/29/2018] [Accepted: 02/12/2019] [Indexed: 01/16/2023] Open
Abstract
BACKGROUND Rice molecular genetics, breeding, genetic diversity, and allied research (such as rice-pathogen interaction) have adopted sequencing technologies and high-density genotyping platforms for genome variation analysis and gene discovery. Germplasm collections representing rice diversity, improved varieties, and elite breeding materials are accessible through rice gene banks for use in research and breeding, with many having genome sequences and high-density genotype data available. Combining phenotypic and genotypic information on these accessions enables genome-wide association analysis, which is driving quantitative trait loci discovery and molecular marker development. Comparative sequence analyses across quantitative trait loci regions facilitate the discovery of novel alleles. Analyses involving DNA sequences and large genotyping matrices for thousands of samples, however, pose a challenge to non-computer savvy rice researchers. FINDINGS The Rice Galaxy resource has shared datasets that include high-density genotypes from the 3,000 Rice Genomes project and sequences with corresponding annotations from 9 published rice genomes. The Rice Galaxy web server and deployment installer includes tools for designing single-nucleotide polymorphism assays, analyzing genome-wide association studies, population diversity, rice-bacterial pathogen diagnostics, and a suite of published genomic prediction methods. A prototype Rice Galaxy compliant to Open Access, Open Data, and Findable, Accessible, Interoperable, and Reproducible principles is also presented. CONCLUSIONS Rice Galaxy is a freely available resource that empowers the plant research community to perform state-of-the-art analyses and utilize publicly available big datasets for both fundamental and applied science.
Collapse
Affiliation(s)
- Venice Juanillas
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
| | - Alexis Dereeper
- Institut de recherche pour le développement (IRD), University of Montpellier, DIADE, IPME, Montpellier, France
| | - Nicolas Beaume
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
| | - Gaetan Droc
- CIRAD, UMR AGAP, F-34398 Montpellier, France
| | - Joshua Dizon
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
| | - John Robert Mendoza
- Advanced Science and Technology Institute, Department of Science and Technology, Quezon City, Philippines
| | - Jon Peter Perdon
- Advanced Science and Technology Institute, Department of Science and Technology, Quezon City, Philippines
| | - Locedie Mansueto
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
| | - Lindsay Triplett
- Department of Bioagricultural Sciences and Pest Management, Colorado State University, Fort Collins, CO 80523-1177, USA
| | - Jillian Lang
- Department of Bioagricultural Sciences and Pest Management, Colorado State University, Fort Collins, CO 80523-1177, USA
| | - Gabriel Zhou
- Indiana University, 107 S Indiana Ave, Bloomington, IN 47405, USA
| | | | - Beth Plale
- Indiana University, 107 S Indiana Ave, Bloomington, IN 47405, USA
| | - Jason Haga
- National Institute of Advanced Industrial Science and Technology, AIST Tsukuba Central 1,1-1-1 Umezono, Tsukuba, Ibaraki 305-8560, Japan
| | - Jan E Leach
- Department of Bioagricultural Sciences and Pest Management, Colorado State University, Fort Collins, CO 80523-1177, USA
| | - Manuel Ruiz
- CIRAD, UMR AGAP, F-34398 Montpellier, France
| | - Michael Thomson
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
- Department of Soil and Crop Sciences, Texas A&M University, Houston, TX, USA
| | - Nickolai Alexandrov
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
| | - Pierre Larmande
- Institut de recherche pour le développement (IRD), University of Montpellier, DIADE, IPME, Montpellier, France
| | - Tobias Kretzschmar
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
- Southern Cross Plant Science, Southern Cross University, Lismore, Australia
| | - Ramil P Mauleon
- International Rice Research Institute, DAPO Box 7777, Metro Manila 1301, Philippines
- Southern Cross Plant Science, Southern Cross University, Lismore, Australia
| |
Collapse
|
35
|
Gang H, Liu G, Zhang M, Zhao Y, Jiang J, Chen S. Comprehensive characterization of T-DNA integration induced chromosomal rearrangement in a birch T-DNA mutant. BMC Genomics 2019; 20:311. [PMID: 31014254 PMCID: PMC6480916 DOI: 10.1186/s12864-019-5636-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Accepted: 03/24/2019] [Indexed: 11/29/2022] Open
Abstract
Background Integration of T-DNA into plant genomes via Agrobacterium may interrupt gene structure and generate numerous mutants. The T-DNA caused mutants are valuable materials for understanding T-DNA integration model in plant research. T-DNA integration in plants is complex and still largely unknown. In this work, we reported that multiple T-DNA fragments caused chromosomal translocation and deletion in a birch (Betula platyphylla × B. pendula) T-DNA mutant yl. Results We performed PacBio genome resequencing for yl and the result revealed that two ends of a T-DNA can be integrated into plant genome independently because the two ends can be linked to different chromosomes and cause chromosomal translocation. We also found that these T-DNA were connected into tandem fragment regardless of direction before integrating into plant genome. In addition, the integration of T-DNA in yl genome also caused several chromosomal fragments deletion. We then summarized three cases for T-DNA integration model in the yl genome. (1) A T-DNA fragment is linked to the two ends of a double-stranded break (DSB); (2) Only one end of a T-DNA fragment is linked to a DSB; (3) A T-DNA fragment is linked to the ends of different DSBs. All the observations in the yl genome supported the DSB repair model. Conclusions In this study, we showed a comprehensive genome analysis of a T-DNA mutant and provide a new insight into T-DNA integration in plants. These findings would be helpful for the analysis of T-DNA mutants with special phenotypes. Electronic supplementary material The online version of this article (10.1186/s12864-019-5636-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Huixin Gang
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, Harbin, 150040, China
| | - Guifeng Liu
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, Harbin, 150040, China
| | - Manman Zhang
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, Harbin, 150040, China
| | - Yuming Zhao
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, Harbin, 150040, China
| | - Jing Jiang
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, Harbin, 150040, China.
| | - Su Chen
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, 26 Hexing Road, Harbin, 150040, China.
| |
Collapse
|
36
|
Fuentes RR, Chebotarov D, Duitama J, Smith S, De la Hoz JF, Mohiyuddin M, Wing RA, McNally KL, Tatarinova T, Grigoriev A, Mauleon R, Alexandrov N. Structural variants in 3000 rice genomes. Genome Res 2019; 29:870-880. [PMID: 30992303 PMCID: PMC6499320 DOI: 10.1101/gr.241240.118] [Citation(s) in RCA: 83] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2018] [Accepted: 03/11/2019] [Indexed: 12/24/2022]
Abstract
Investigation of large structural variants (SVs) is a challenging yet important task in understanding trait differences in highly repetitive genomes. Combining different bioinformatic approaches for SV detection, we analyzed whole-genome sequencing data from 3000 rice genomes and identified 63 million individual SV calls that grouped into 1.5 million allelic variants. We found enrichment of long SVs in promoters and an excess of shorter variants in 5′ UTRs. Across the rice genomes, we identified regions of high SV frequency enriched in stress response genes. We demonstrated how SVs may help in finding causative variants in genome-wide association analysis. These new insights into rice genome biology are valuable for understanding the effects SVs have on gene function, with the prospect of identifying novel agronomically important alleles that can be utilized to improve cultivated rice.
Collapse
Affiliation(s)
- Roven Rommel Fuentes
- International Rice Research Institute, Laguna 4031, Philippines.,Bioinformatics Group, Wageningen University and Research, 6708 PB Wageningen, the Netherlands
| | | | - Jorge Duitama
- Systems and Computing Engineering Department, Universidad de Los Andes, Bogotá 111711, Colombia.,Agrobiodiversity Research Area, International Center for Tropical Agriculture (CIAT), Cali 6713, Colombia
| | - Sean Smith
- Biology Department, Center for Computational and Integrative Biology, Rutgers University, Camden, New Jersey 08102, USA
| | - Juan Fernando De la Hoz
- Agrobiodiversity Research Area, International Center for Tropical Agriculture (CIAT), Cali 6713, Colombia
| | | | - Rod A Wing
- International Rice Research Institute, Laguna 4031, Philippines.,Arizona Genomics Institute, University of Arizona, Tucson, Arizona 85721, USA.,King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia
| | | | - Tatiana Tatarinova
- Department of Biology, University of La Verne, La Verne, California 91750, USA.,Vavilov Institute of General Genetics, Moscow 119333, Russia.,A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127051, Russia.,Laboratory of Forest Genomics, Siberian Federal University, Krasnoyarsk 660041, Russia
| | - Andrey Grigoriev
- Biology Department, Center for Computational and Integrative Biology, Rutgers University, Camden, New Jersey 08102, USA
| | - Ramil Mauleon
- International Rice Research Institute, Laguna 4031, Philippines
| | | |
Collapse
|
37
|
Tracking the origin of two genetic components associated with transposable element bursts in domesticated rice. Nat Commun 2019; 10:641. [PMID: 30733435 PMCID: PMC6367367 DOI: 10.1038/s41467-019-08451-3] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2018] [Accepted: 01/09/2019] [Indexed: 11/08/2022] Open
Abstract
Transposable elements (TEs) shape genome evolution through periodic bursts of amplification. In this study prior knowledge of the mPing/Ping/Pong TE family is exploited to track their copy numbers and distribution in genome sequences from 3,000 accessions of domesticated Oryza sativa (rice) and the wild progenitor Oryza rufipogon. We find that mPing bursts are restricted to recent domestication and is likely due to the accumulation of two TE components, Ping16A and Ping16A_Stow, that appear to be critical for mPing hyperactivity. Ping16A is a variant of the autonomous element with reduced activity as shown in a yeast transposition assay. Transposition of Ping16A into a Stowaway element generated Ping16A_Stow, the only Ping locus shared by all bursting accessions, and shown here to correlate with high mPing copies. Finally, we show that sustained activity of the mPing/Ping family in domesticated rice produced the components necessary for mPing bursts, not the loss of epigenetic regulation.
Collapse
|
38
|
Carpentier MC, Manfroi E, Wei FJ, Wu HP, Lasserre E, Llauro C, Debladis E, Akakpo R, Hsing YI, Panaud O. Retrotranspositional landscape of Asian rice revealed by 3000 genomes. Nat Commun 2019; 10:24. [PMID: 30604755 PMCID: PMC6318337 DOI: 10.1038/s41467-018-07974-5] [Citation(s) in RCA: 75] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Accepted: 12/05/2018] [Indexed: 12/21/2022] Open
Abstract
The recent release of genomic sequences for 3000 rice varieties provides access to the genetic diversity at species level for this crop. We take advantage of this resource to unravel some features of the retrotranspositional landscape of rice. We develop software TRACKPOSON specifically for the detection of transposable elements insertion polymorphisms (TIPs) from large datasets. We apply this tool to 32 families of retrotransposons and identify more than 50,000 TIPs in the 3000 rice genomes. Most polymorphisms are found at very low frequency, suggesting that they may have occurred recently in agro. A genome-wide association study shows that these activations in rice may be triggered by external stimuli, rather than by the alteration of genetic factors involved in transposable element silencing pathways. Finally, the TIPs dataset is used to trace the origin of rice domestication. Our results suggest that rice originated from three distinct domestication events.
Collapse
Affiliation(s)
- Marie-Christine Carpentier
- Laboratoire Génome et Développement des Plantes, UMR CNRS/UPVD 5096, Université de Perpignan Via Domitia, 52 Avenue Paul Alduy., 66860, Perpignan Cedex, France
| | - Ernandes Manfroi
- Faculdade de Agronomia, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, 90040-060, Brazil
| | - Fu-Jin Wei
- Institute of Plant and Microbial Biology, Academia Sinica, 128, Section 2, Yien-chu-yuan Road, Nankang, 115, Taipei, Taiwan
- Department of Forest Molecular Genetics and Biotechnology, Forestry and Forest Products Research Institute, 1 Matsunosato, Tsukuba, 305-8687, Ibaraki, Japan
| | - Hshin-Ping Wu
- Institute of Plant and Microbial Biology, Academia Sinica, 128, Section 2, Yien-chu-yuan Road, Nankang, 115, Taipei, Taiwan
| | - Eric Lasserre
- Laboratoire Génome et Développement des Plantes, UMR CNRS/UPVD 5096, Université de Perpignan Via Domitia, 52 Avenue Paul Alduy., 66860, Perpignan Cedex, France
| | - Christel Llauro
- Laboratoire Génome et Développement des Plantes, UMR CNRS/UPVD 5096, Université de Perpignan Via Domitia, 52 Avenue Paul Alduy., 66860, Perpignan Cedex, France
| | - Emilie Debladis
- Laboratoire Génome et Développement des Plantes, UMR CNRS/UPVD 5096, Université de Perpignan Via Domitia, 52 Avenue Paul Alduy., 66860, Perpignan Cedex, France
| | - Roland Akakpo
- Laboratoire Génome et Développement des Plantes, UMR CNRS/UPVD 5096, Université de Perpignan Via Domitia, 52 Avenue Paul Alduy., 66860, Perpignan Cedex, France
| | - Yue-Ie Hsing
- Institute of Plant and Microbial Biology, Academia Sinica, 128, Section 2, Yien-chu-yuan Road, Nankang, 115, Taipei, Taiwan
| | - Olivier Panaud
- Laboratoire Génome et Développement des Plantes, UMR CNRS/UPVD 5096, Université de Perpignan Via Domitia, 52 Avenue Paul Alduy., 66860, Perpignan Cedex, France.
- Institut Universitaire de France, 1 rue Descartes, 75231, Paris Cedex 05, France.
| |
Collapse
|
39
|
Grover CE, Arick MA, Thrash A, Conover JL, Sanders WS, Peterson DG, Frelichowski JE, Scheffler JA, Scheffler BE, Wendel JF. Insights into the Evolution of the New World Diploid Cottons (Gossypium, Subgenus Houzingenia) Based on Genome Sequencing. Genome Biol Evol 2019; 11:53-71. [PMID: 30476109 PMCID: PMC6320677 DOI: 10.1093/gbe/evy256] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/20/2018] [Indexed: 12/24/2022] Open
Abstract
We employed phylogenomic methods to study molecular evolutionary processes and phylogeny in the geographically widely dispersed New World diploid cottons (Gossypium, subg. Houzingenia). Whole genome resequencing data (average of 33× genomic coverage) were generated to reassess the phylogenetic history of the subgenus and provide a temporal framework for its diversification. Phylogenetic analyses indicate that the subgenus likely originated following transoceanic dispersal from Africa about 6.6 Ma, but that nearly all of the biodiversity evolved following rapid diversification in the mid-Pleistocene (0.5-2.0 Ma), with multiple long-distance dispersals required to account for range expansion to Arizona, the Galapagos Islands, and Peru. Comparative analyses of cpDNAversus nuclear data indicate that this history was accompanied by several clear cases of interspecific introgression. Repetitive DNAs contribute roughly half of the total 880 Mb genome, but most transposable element families are relatively old and stable among species. In the genic fraction, pairwise synonymous mutation rates average 1% per Myr, with nonsynonymous changes being about seven times less frequent. Over 1.1 million indels were detected and phylogenetically polarized, revealing a 2-fold bias toward deletions over small insertions. We suggest that this genome down-sizing bias counteracts genome size growth by TE amplification and insertions, and helps explain the relatively small genomes that are restricted to this subgenus. Compared with the rate of nucleotide substitution, the rate of indel occurrence is much lower averaging about 17 nucleotide substitutions per indel event.
Collapse
Affiliation(s)
- Corrinne E Grover
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University
| | - Mark A Arick
- Institute for Genomics, Biocomputing, and Biotechnology, Mississippi State University
| | - Adam Thrash
- Institute for Genomics, Biocomputing, and Biotechnology, Mississippi State University
| | - Justin L Conover
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University
| | - William S Sanders
- Institute for Genomics, Biocomputing, and Biotechnology, Mississippi State University
- Department of Computer Science & Engineering, Mississippi State University
- The Jackson Laboratory, Connecticut
| | - Daniel G Peterson
- Institute for Genomics, Biocomputing, and Biotechnology, Mississippi State University
| | | | | | - Brian E Scheffler
- USDA, Genomics and Bioinformatics Research Unit, Stoneville, Mississippi
| | - Jonathan F Wendel
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University
| |
Collapse
|
40
|
Song S, Tian D, Zhang Z, Hu S, Yu J. Rice Genomics: over the Past Two Decades and into the Future. GENOMICS, PROTEOMICS & BIOINFORMATICS 2018; 16:397-404. [PMID: 30771506 PMCID: PMC6411948 DOI: 10.1016/j.gpb.2019.01.001] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Revised: 01/14/2019] [Accepted: 01/23/2019] [Indexed: 01/08/2023]
Abstract
Domestic rice (Oryza sativa L.) is one of the most important cereal crops, feeding a large number of worldwide populations. Along with various high-throughput genome sequencing projects, rice genomics has been making great headway toward direct field applications of basic research advances in understanding the molecular mechanisms of agronomical traits and utilizing diverse germplasm resources. Here, we briefly review its achievements over the past two decades and present the potential for its bright future.
Collapse
Affiliation(s)
- Shuhui Song
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| | - Dongmei Tian
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhang Zhang
- BIG Data Center, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Songnian Hu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jun Yu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| |
Collapse
|
41
|
Assembling the genome of the African wild rice Oryza longistaminata by exploiting synteny in closely related Oryza species. Commun Biol 2018; 1:162. [PMID: 30320230 PMCID: PMC6173730 DOI: 10.1038/s42003-018-0171-y] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2017] [Accepted: 09/13/2018] [Indexed: 01/30/2023] Open
Abstract
The African wild rice species Oryza longistaminata has several beneficial traits compared to cultivated rice species, such as resistance to biotic stresses, clonal propagation via rhizomes, and increased biomass production. To facilitate breeding efforts and functional genomics studies, we de-novo assembled a high-quality, haploid-phased genome. Here, we present our assembly, with a total length of 351 Mb, of which 92.2% was anchored onto 12 chromosomes. We detected 34,389 genes and 38.1% of the genome consisted of repetitive content. We validated our assembly by a comparative linkage analysis and by examining well-characterized gene families. This genome assembly will be a useful resource to exploit beneficial alleles found in O. longistaminata. Our results also show that it is possible to generate a high-quality, functionally complete rice genome assembly from moderate SMRT read coverage by exploiting synteny in a closely related Oryza species. Stefan Reuscher et al. assembled the genome of an African wild rice species to facilitate breeding efforts and functional genomic studies. They used SMRT sequencing, chromosomal synteny between rice species, and a linkage map to assemble the 351 Mb genome into 12 chromosomes.
Collapse
|
42
|
Darracq A, Vitte C, Nicolas S, Duarte J, Pichon JP, Mary-Huard T, Chevalier C, Bérard A, Le Paslier MC, Rogowsky P, Charcosset A, Joets J. Sequence analysis of European maize inbred line F2 provides new insights into molecular and chromosomal characteristics of presence/absence variants. BMC Genomics 2018; 19:119. [PMID: 29402214 PMCID: PMC5800051 DOI: 10.1186/s12864-018-4490-7] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2017] [Accepted: 01/22/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Maize is well known for its exceptional structural diversity, including copy number variants (CNVs) and presence/absence variants (PAVs), and there is growing evidence for the role of structural variation in maize adaptation. While PAVs have been described in this important crop species, they have been only scarcely characterized at the sequence level and the extent of presence/absence variation and relative chromosomal landscape of inbred-specific regions remain to be elucidated. RESULTS De novo genome sequencing of the French F2 maize inbred line revealed 10,044 novel genomic regions larger than 1 kb, making up 88 Mb of DNA, that are present in F2 but not in B73 (PAV). This set of maize PAV sequences allowed us to annotate PAV content and to analyze sequence breakpoints. Using PAV genotyping on a collection of 25 temperate lines, we also analyzed Linkage Disequilibrium in PAVs and flanking regions, and PAV frequencies within maize genetic groups. CONCLUSIONS We highlight the possible role of MMEJ-type double strand break repair in maize PAV formation and discover 395 new genes with transcriptional support. Pattern of linkage disequilibrium within PAVs strikingly differs from this of flanking regions and is in accordance with the intuition that PAVs may recombine less than other genomic regions. We show that most PAVs are ancient, while some are found only in European Flint material, thus pinpointing structural features that may be at the origin of adaptive traits involved in the success of this material. Characterization of such PAVs will provide useful material for further association genetic studies in European and temperate maize.
Collapse
Affiliation(s)
- Aude Darracq
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Clémentine Vitte
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Stéphane Nicolas
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | | | | | - Tristan Mary-Huard
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
- MIA, INRA, AgroParisTech, Université Paris-Saclay, Paris, France
| | - Céline Chevalier
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Aurélie Bérard
- EPGV US 1279, INRA, CEA, IG-CNG, Université Paris-Saclay, Evry, France
| | | | - Peter Rogowsky
- Laboratoire Reproduction et Développement des Plantes, Univ Lyon, ENS de Lyon, UCB Lyon 1, CNRS, INRA, Lyon, France
| | - Alain Charcosset
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Johann Joets
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| |
Collapse
|
43
|
Monat C, Pera B, Ndjiondjop MN, Sow M, Tranchant-Dubreuil C, Bastianelli L, Ghesquière A, Sabot F. De Novo Assemblies of Three Oryza glaberrima Accessions Provide First Insights about Pan-Genome of African Rices. Genome Biol Evol 2018; 9:1-6. [PMID: 28173009 PMCID: PMC5381527 DOI: 10.1093/gbe/evw253] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/12/2016] [Indexed: 11/12/2022] Open
Abstract
Oryza glaberrima is one of the two cultivated species of rice, and harbors various interesting agronomic traits, especially in biotic and abiotic resistance, compared with its Asian cousin O. sativa. A previous reference genome was published but newer studies highlighted some missing parts. Moreover, global species diversity is known nowadays to be represented by more than one single individual. For that purpose, we sequenced, assembled and annotated de novo three different cultivars from O. glaberrima. After validating our assemblies, we were able to better solve complex regions than the previous assembly and to provide a first insight in pan-genomic divergence between individuals. The three assemblies shown large common regions, but almost 25% of the genome present collinearity breakpoints or are even individual specific.
Collapse
Affiliation(s)
- Cécile Monat
- RICE Team, DIADE UMR 232 IRD/UM, IRD France Sud, Montpellier, France
| | - Bérengère Pera
- RICE Team, DIADE UMR 232 IRD/UM, IRD France Sud, Montpellier, France.,CEA//Genoscope, Evry, France
| | | | | | | | - Leila Bastianelli
- Montpellier GenomiX, c/o Institut de Génomique Fonctionnelle, Montpellier, France
| | - Alain Ghesquière
- RICE Team, DIADE UMR 232 IRD/UM, IRD France Sud, Montpellier, France
| | - Francois Sabot
- RICE Team, DIADE UMR 232 IRD/UM, IRD France Sud, Montpellier, France
| |
Collapse
|
44
|
Nie SJ, Liu YQ, Wang CC, Gao SW, Xu TT, Liu Q, Chang HL, Chen YB, Yan PC, Peng W, Zheng TQ, Xu JL, Li ZK. Assembly of an early-matured japonica (Geng) rice genome, Suijing18, based on PacBio and Illumina sequencing. Sci Data 2017; 4:170195. [PMID: 29257136 PMCID: PMC5735919 DOI: 10.1038/sdata.2017.195] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2017] [Accepted: 11/16/2017] [Indexed: 11/24/2022] Open
Abstract
The early-matured japonica (Geng) rice variety, Suijing18 (SJ18), carries multiple elite traits including durable blast resistance, good grain quality, and high yield. Using PacBio SMRT technology, we produced over 25 Gb of long-read sequencing raw data from SJ18 with a coverage of 62×. Using Illumina paired-end whole-genome shotgun sequencing technology, we generated 59 Gb of short-read sequencing data from SJ18 (23.6 Gb from a 200 bp library with a coverage of 59× and 35.4 Gb from an 800 bp library with a coverage of 88×). With these data, we assembled a single SJ18 genome and then generated a set of annotation data. These data sets can be used to test new programs for variation deep mining, and will provide new insights into the genome structure, function, and evolution of SJ18, and will provide essential support for biological research in general.
Collapse
Affiliation(s)
- Shou-Jun Nie
- Suihua Branch Institute, Heilongjiang Academy of Agricultural Sciences, 420 Gong-Nong West Road, Suihua, Heilongjiang 152000, China
| | - Yu-Qiang Liu
- Suihua Branch Institute, Heilongjiang Academy of Agricultural Sciences, 420 Gong-Nong West Road, Suihua, Heilongjiang 152000, China
| | - Chun-Chao Wang
- Institute of Crop Sciences/National Key Facility for Crop Gene Resources and Genetic Improvement, Chinese Academy of Agricultural Sciences, 12 South Zhong-Guan-Cun Street, Beijing 100081, China
| | - Shi-Wei Gao
- Suihua Branch Institute, Heilongjiang Academy of Agricultural Sciences, 420 Gong-Nong West Road, Suihua, Heilongjiang 152000, China
| | - Tian-Tian Xu
- Institute of Crop Sciences/National Key Facility for Crop Gene Resources and Genetic Improvement, Chinese Academy of Agricultural Sciences, 12 South Zhong-Guan-Cun Street, Beijing 100081, China
| | - Qing Liu
- Suihua Branch Institute, Heilongjiang Academy of Agricultural Sciences, 420 Gong-Nong West Road, Suihua, Heilongjiang 152000, China
| | - Hui-Lin Chang
- Suihua Branch Institute, Heilongjiang Academy of Agricultural Sciences, 420 Gong-Nong West Road, Suihua, Heilongjiang 152000, China
| | - Yu-Bao Chen
- Beijing Computing Center, No. 7 Mid, Fengxian Rd. Yongfeng Industry Base, Beijing 100094, China
| | - Peng-Cheng Yan
- Beijing Computing Center, No. 7 Mid, Fengxian Rd. Yongfeng Industry Base, Beijing 100094, China
| | - Wei Peng
- Beijing Computing Center, No. 7 Mid, Fengxian Rd. Yongfeng Industry Base, Beijing 100094, China
| | - Tian-Qing Zheng
- Institute of Crop Sciences/National Key Facility for Crop Gene Resources and Genetic Improvement, Chinese Academy of Agricultural Sciences, 12 South Zhong-Guan-Cun Street, Beijing 100081, China.,Shenzhen Institute of Breeding for Innovation, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Jian-Long Xu
- Institute of Crop Sciences/National Key Facility for Crop Gene Resources and Genetic Improvement, Chinese Academy of Agricultural Sciences, 12 South Zhong-Guan-Cun Street, Beijing 100081, China.,Shenzhen Institute of Breeding for Innovation, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Zhi-Kang Li
- Institute of Crop Sciences/National Key Facility for Crop Gene Resources and Genetic Improvement, Chinese Academy of Agricultural Sciences, 12 South Zhong-Guan-Cun Street, Beijing 100081, China.,Shenzhen Institute of Breeding for Innovation, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| |
Collapse
|
45
|
Misra G, Badoni S, Anacleto R, Graner A, Alexandrov N, Sreenivasulu N. Whole genome sequencing-based association study to unravel genetic architecture of cooked grain width and length traits in rice. Sci Rep 2017; 7:12478. [PMID: 28963534 PMCID: PMC5622062 DOI: 10.1038/s41598-017-12778-6] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2017] [Accepted: 09/14/2017] [Indexed: 12/13/2022] Open
Abstract
In this study, we used 2.9 million single nucleotide polymorphisms (SNP) and 393,429 indels derived from whole genome sequences of 591 rice landraces to determine the genetic basis of cooked and raw grain length, width and shape using genome-wide association study (GWAS). We identified a unique fine-mapped genetic region GWi7.1 significantly associated with cooked and raw grain width. Additionally, GWi7.2 that harbors GL7/GW7 a cloned gene for grain dimension was found. Novel regions in chromosomes 10 and 11 were also found to be associated with cooked grain shape and raw grain width, respectively. The indel-based GWAS identified fine-mapped genetic regions GL3.1 and GWi5.1 that matched synteny breakpoints between indica and japonica. GL3.1 was positioned a few kilobases away from GS3, a cloned gene for cooked and raw grain lengths in indica. GWi5.1 found to be significantly associated with cooked and raw grain width. It anchors upstream of cloned gene GW5, which varied between indica and japonica accessions. GWi11.1 is present inside the 3'-UTR of a functional gene in indica that corresponds to a syntenic break in chromosome 11 of japonica. Our results identified novel allelic structural variants and haplotypes confirmed using single locus and multilocus SNP and indel-based GWAS.
Collapse
Affiliation(s)
- Gopal Misra
- Grain Quality and Nutrition Center, Plant Breeding Division, International Rice Research Institute, DAPO Box 7777, Metro Manila, 1301, Philippines
| | - Saurabh Badoni
- Grain Quality and Nutrition Center, Plant Breeding Division, International Rice Research Institute, DAPO Box 7777, Metro Manila, 1301, Philippines
| | - Roslen Anacleto
- Grain Quality and Nutrition Center, Plant Breeding Division, International Rice Research Institute, DAPO Box 7777, Metro Manila, 1301, Philippines
| | - Andreas Graner
- Leibniz institute of Plant Genetics and Crop Plant Research (IPK), Corrensstrasse 03, 06466, Gatersleben, Germany
| | - Nickolai Alexandrov
- Genetics and Biotechnology Division, International Rice Research Institute, DAPO Box 7777, Metro Manila, 1301, Philippines
| | - Nese Sreenivasulu
- Grain Quality and Nutrition Center, Plant Breeding Division, International Rice Research Institute, DAPO Box 7777, Metro Manila, 1301, Philippines.
| |
Collapse
|
46
|
Misra G, Badoni S, Anacleto R, Graner A, Alexandrov N, Sreenivasulu N. Whole genome sequencing-based association study to unravel genetic architecture of cooked grain width and length traits in rice. Sci Rep 2017. [PMID: 28963534 DOI: 10.1038/s41598‐017‐12778‐6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
In this study, we used 2.9 million single nucleotide polymorphisms (SNP) and 393,429 indels derived from whole genome sequences of 591 rice landraces to determine the genetic basis of cooked and raw grain length, width and shape using genome-wide association study (GWAS). We identified a unique fine-mapped genetic region GWi7.1 significantly associated with cooked and raw grain width. Additionally, GWi7.2 that harbors GL7/GW7 a cloned gene for grain dimension was found. Novel regions in chromosomes 10 and 11 were also found to be associated with cooked grain shape and raw grain width, respectively. The indel-based GWAS identified fine-mapped genetic regions GL3.1 and GWi5.1 that matched synteny breakpoints between indica and japonica. GL3.1 was positioned a few kilobases away from GS3, a cloned gene for cooked and raw grain lengths in indica. GWi5.1 found to be significantly associated with cooked and raw grain width. It anchors upstream of cloned gene GW5, which varied between indica and japonica accessions. GWi11.1 is present inside the 3'-UTR of a functional gene in indica that corresponds to a syntenic break in chromosome 11 of japonica. Our results identified novel allelic structural variants and haplotypes confirmed using single locus and multilocus SNP and indel-based GWAS.
Collapse
Affiliation(s)
- Gopal Misra
- Grain Quality and Nutrition Center, Plant Breeding Division, International Rice Research Institute, DAPO Box 7777, Metro Manila, 1301, Philippines
| | - Saurabh Badoni
- Grain Quality and Nutrition Center, Plant Breeding Division, International Rice Research Institute, DAPO Box 7777, Metro Manila, 1301, Philippines
| | - Roslen Anacleto
- Grain Quality and Nutrition Center, Plant Breeding Division, International Rice Research Institute, DAPO Box 7777, Metro Manila, 1301, Philippines
| | - Andreas Graner
- Leibniz institute of Plant Genetics and Crop Plant Research (IPK), Corrensstrasse 03, 06466, Gatersleben, Germany
| | - Nickolai Alexandrov
- Genetics and Biotechnology Division, International Rice Research Institute, DAPO Box 7777, Metro Manila, 1301, Philippines
| | - Nese Sreenivasulu
- Grain Quality and Nutrition Center, Plant Breeding Division, International Rice Research Institute, DAPO Box 7777, Metro Manila, 1301, Philippines.
| |
Collapse
|
47
|
Lv Q, Huang Z, Xu X, Tang L, Liu H, Wang C, Zhou Z, Xin Y, Xing J, Peng Z, Li X, Zheng T, Zhu L. Allelic variation of the rice blast resistance gene Pid3 in cultivated rice worldwide. Sci Rep 2017; 7:10362. [PMID: 28871108 PMCID: PMC5583387 DOI: 10.1038/s41598-017-10617-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2017] [Accepted: 08/11/2017] [Indexed: 11/12/2022] Open
Abstract
In this study, the re-sequencing data from 3,000 rice genomes project (3 K RGP) was used to analyze the allelic variation at the rice blast resistance (R) Pid3 locus. A total of 40 haplotypes were identified based on 71 nucleotide polymorphic sites among 2621 Pid3 homozygous alleles in the 3k genomes. Pid3 alleles in most japonica rice accessions were pseudogenes due to premature stop mutations, while those in most indica rice accessions were identical to the functional haplotype Hap_6, which had a similar resistance spectrum as the previously reported Pid3 gene. By sequencing and CAPS marker analyzing the Pid3 alleles in widespread cultivars in China, we verified that Hap_6 had been widely deployed in indica rice breeding of China. Thus, we suggest that the priority for utilization of the Pid3 locus in rice breeding should be on introducing the functional Pid3 alleles into japonica rice cultivars and the functional alleles of non-Hap_6 haplotypes into indica rice cultivars for increasing genetic diversity.
Collapse
Affiliation(s)
- Qiming Lv
- State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Changsha, 410125, China.,State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Zhiyuan Huang
- State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Changsha, 410125, China
| | - Xiao Xu
- State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Li Tang
- State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Changsha, 410125, China
| | - Hai Liu
- State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Changsha, 410125, China
| | - Chunchao Wang
- Institute of Crop Sciences/National Key Facility for Crop Gene Resources and Genetic Improvement, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Zhuangzhi Zhou
- State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Yeyun Xin
- State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Changsha, 410125, China
| | - Junjie Xing
- State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Changsha, 410125, China
| | - Zhirong Peng
- State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Changsha, 410125, China
| | - Xiaobing Li
- State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Tianqing Zheng
- Institute of Crop Sciences/National Key Facility for Crop Gene Resources and Genetic Improvement, Chinese Academy of Agricultural Sciences, Beijing, 100081, China.
| | - Lihuang Zhu
- State Key Laboratory of Hybrid Rice, Hunan Hybrid Rice Research Center, Changsha, 410125, China. .,State Key Laboratory of Plant Genomics and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China.
| |
Collapse
|
48
|
Moll KM, Zhou P, Ramaraj T, Fajardo D, Devitt NP, Sadowsky MJ, Stupar RM, Tiffin P, Miller JR, Young ND, Silverstein KAT, Mudge J. Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula. BMC Genomics 2017; 18:578. [PMID: 28778149 PMCID: PMC5545040 DOI: 10.1186/s12864-017-3971-4] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2017] [Accepted: 07/31/2017] [Indexed: 12/16/2022] Open
Abstract
Background Third generation sequencing technologies, with sequencing reads in the tens- of kilo-bases, facilitate genome assembly by spanning ambiguous regions and improving continuity. This has been critical for plant genomes, which are difficult to assemble due to high repeat content, gene family expansions, segmental and tandem duplications, and polyploidy. Recently, high-throughput mapping and scaffolding strategies have further improved continuity. Together, these long-range technologies enable quality draft assemblies of complex genomes in a cost-effective and timely manner. Results Here, we present high quality genome assemblies of the model legume plant, Medicago truncatula (R108) using PacBio, Dovetail Chicago (hereafter, Dovetail) and BioNano technologies. To test these technologies for plant genome assembly, we generated five assemblies using all possible combinations and ordering of these three technologies in the R108 assembly. While the BioNano and Dovetail joins overlapped, they also showed complementary gains in continuity and join numbers. Both technologies spanned repetitive regions that PacBio alone was unable to bridge. Combining technologies, particularly Dovetail followed by BioNano, resulted in notable improvements compared to Dovetail or BioNano alone. A combination of PacBio, Dovetail, and BioNano was used to generate a high quality draft assembly of R108, a M. truncatula accession widely used in studies of functional genomics. As a test for the usefulness of the resulting genome sequence, the new R108 assembly was used to pinpoint breakpoints and characterize flanking sequence of a previously identified translocation between chromosomes 4 and 8, identifying more than 22.7 Mb of novel sequence not present in the earlier A17 reference assembly. Conclusions Adding Dovetail followed by BioNano data yielded complementary improvements in continuity over the original PacBio assembly. This strategy proved efficient and cost-effective for developing a quality draft assembly compared to traditional reference assemblies. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3971-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Karen M Moll
- National Center for Genome Resources, 2935 Rodeo Park Drive East, Santa Fe, NM, 87505, USA.,Montana State University, Center for Biofilm Engineering, Bozeman, MT, 59717, USA
| | - Peng Zhou
- Department of Plant Biology, University of Minnesota, Saint Paul, MN, USA
| | - Thiruvarangan Ramaraj
- National Center for Genome Resources, 2935 Rodeo Park Drive East, Santa Fe, NM, 87505, USA
| | - Diego Fajardo
- National Center for Genome Resources, 2935 Rodeo Park Drive East, Santa Fe, NM, 87505, USA
| | - Nicholas P Devitt
- National Center for Genome Resources, 2935 Rodeo Park Drive East, Santa Fe, NM, 87505, USA
| | - Michael J Sadowsky
- Department of Soil, Water & Climate, Plant and Microbial Biology and BioTechnology Institute, University of Minnesota, St. Paul, MN, USA
| | - Robert M Stupar
- Department of Agronomy and Plant Genetics, University of Minnesota, Saint Paul, MN, USA
| | - Peter Tiffin
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN, USA
| | | | - Nevin D Young
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, MN, USA
| | | | - Joann Mudge
- National Center for Genome Resources, 2935 Rodeo Park Drive East, Santa Fe, NM, 87505, USA.
| |
Collapse
|
49
|
Miller JR, Zhou P, Mudge J, Gurtowski J, Lee H, Ramaraj T, Walenz BP, Liu J, Stupar RM, Denny R, Song L, Singh N, Maron LG, McCouch SR, McCombie WR, Schatz MC, Tiffin P, Young ND, Silverstein KAT. Hybrid assembly with long and short reads improves discovery of gene family expansions. BMC Genomics 2017; 18:541. [PMID: 28724409 PMCID: PMC5518131 DOI: 10.1186/s12864-017-3927-8] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2016] [Accepted: 07/06/2017] [Indexed: 11/25/2022] Open
Abstract
Background Long-read and short-read sequencing technologies offer competing advantages for eukaryotic genome sequencing projects. Combinations of both may be appropriate for surveys of within-species genomic variation. Methods We developed a hybrid assembly pipeline called “Alpaca” that can operate on 20X long-read coverage plus about 50X short-insert and 50X long-insert short-read coverage. To preclude collapse of tandem repeats, Alpaca relies on base-call-corrected long reads for contig formation. Results Compared to two other assembly protocols, Alpaca demonstrated the most reference agreement and repeat capture on the rice genome. On three accessions of the model legume Medicago truncatula, Alpaca generated the most agreement to a conspecific reference and predicted tandemly repeated genes absent from the other assemblies. Conclusion Our results suggest Alpaca is a useful tool for investigating structural and copy number variation within de novo assemblies of sampled populations. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3927-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jason R Miller
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA.
| | - Peng Zhou
- Department of Plant Biology, University of Minnesota, Saint Paul, MN, USA
| | - Joann Mudge
- National Center for Genome Resources, Santa Fe, NM, USA
| | | | - Hayan Lee
- Stanford School of Medicine, Stanford, CA, USA
| | | | - Brian P Walenz
- National Human Genome Research Institute, Bethesda, MD, USA
| | - Junqi Liu
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, USA
| | - Robert M Stupar
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, USA
| | - Roxanne Denny
- Department of Plant Pathology, University of Minnesota, St. Paul, MN, USA
| | - Li Song
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Namrata Singh
- School of Integrative Plant Sciences, Plant Breeding and Genetics section, Cornell University, Ithaca, NY, 14850, USA
| | - Lyza G Maron
- School of Integrative Plant Sciences, Plant Breeding and Genetics section, Cornell University, Ithaca, NY, 14850, USA
| | - Susan R McCouch
- School of Integrative Plant Sciences, Plant Breeding and Genetics section, Cornell University, Ithaca, NY, 14850, USA
| | | | - Michael C Schatz
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Peter Tiffin
- Department of Plant Biology, University of Minnesota, Saint Paul, MN, USA
| | - Nevin D Young
- Department of Plant Biology, University of Minnesota, Saint Paul, MN, USA
| | | |
Collapse
|
50
|
Fragoso CA, Moreno M, Wang Z, Heffelfinger C, Arbelaez LJ, Aguirre JA, Franco N, Romero LE, Labadie K, Zhao H, Dellaporta SL, Lorieux M. Genetic Architecture of a Rice Nested Association Mapping Population. G3 (BETHESDA, MD.) 2017; 7:1913-1926. [PMID: 28450374 PMCID: PMC5473768 DOI: 10.1534/g3.117.041608] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/06/2017] [Accepted: 04/14/2017] [Indexed: 12/21/2022]
Abstract
Describing the genetic diversity in the gene pool of crops will provide breeders with novel resources for varietal improvement. Nested Association Mapping (NAM) populations are uniquely suited for characterizing parental diversity through the shuffling and fixation of parental haplotypes. Here, we describe a set of 1879 rice NAM lines created through the selfing and single-seed descent of F1 hybrids derived from elite IR64 indica crossed with 10 diverse tropical japonica lines. Genotyping data indicated tropical japonica alleles were captured at every queried locus despite the presence of segregation distortion factors. Several distortion loci were mapped, both shared and unique, among the 10 populations. Using two-point and multi-point genetic map calculations, our datasets achieved the ∼1500 cM expected map size in rice. Finally, we highlighted the utility of the NAM lines for QTL mapping, including joint analysis across the 10 populations, by confirming known QTL locations for the trait days to heading.
Collapse
Affiliation(s)
- Christopher A Fragoso
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06511
- Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, Connecticut 06511
| | - Maria Moreno
- Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, Connecticut 06511
| | - Zuoheng Wang
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06511
- Department of Biostatistics, Yale University, New Haven, Connecticut 06511
| | - Christopher Heffelfinger
- Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, Connecticut 06511
| | - Lady J Arbelaez
- Rice Genetics and Genomics Laboratory, International Center for Tropical Agriculture, Cali 6713, Colombia
| | - John A Aguirre
- Rice Genetics and Genomics Laboratory, International Center for Tropical Agriculture, Cali 6713, Colombia
| | - Natalia Franco
- Rice Genetics and Genomics Laboratory, International Center for Tropical Agriculture, Cali 6713, Colombia
| | - Luz E Romero
- Rice Genetics and Genomics Laboratory, International Center for Tropical Agriculture, Cali 6713, Colombia
| | - Karine Labadie
- Commissariat à L'énergie Atomique et aux Énergies Alternatives, Institut de Génomique, Genoscope, 91000 Evry, France
| | - Hongyu Zhao
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06511
- Department of Biostatistics, Yale University, New Haven, Connecticut 06511
| | - Stephen L Dellaporta
- Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, Connecticut 06511
| | - Mathias Lorieux
- Rice Genetics and Genomics Laboratory, International Center for Tropical Agriculture, Cali 6713, Colombia
- Diversité, Adaptation, Développement des Plantes Research Unit, Institut de Recherche pour le Développement, F-34394 Montpellier, France
| |
Collapse
|