51
|
Tian Z, Wang JW, Li J, Han B. Designing future crops: challenges and strategies for sustainable agriculture. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 105:1165-1178. [PMID: 33258137 DOI: 10.1111/tpj.15107] [Citation(s) in RCA: 73] [Impact Index Per Article: 24.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 11/22/2020] [Accepted: 11/26/2020] [Indexed: 05/26/2023]
Abstract
Crop production is facing unprecedented challenges. Despite the fact that the food supply has significantly increased over the past half-century, ~8.9 and 14.3% people are still suffering from hunger and malnutrition, respectively. Agricultural environments are continuously threatened by a booming world population, a shortage of arable land, and rapid changes in climate. To ensure food and ecosystem security, there is a need to design future crops for sustainable agriculture development by maximizing net production and minimalizing undesirable effects on the environment. The future crops design projects, recently launched by the National Natural Science Foundation of China and Chinese Academy of Sciences (CAS), aim to develop a roadmap for rapid design of customized future crops using cutting-edge technologies in the Breeding 4.0 era. In this perspective, we first introduce the background and missions of these projects. We then outline strategies to design future crops, such as improvement of current well-cultivated crops, de novo domestication of wild species and redomestication of current cultivated crops. We further discuss how these ambitious goals can be achieved by the recent development of new integrative omics tools, advanced genome-editing tools and synthetic biology approaches. Finally, we summarize related opportunities and challenges in these projects.
Collapse
Affiliation(s)
- Zhixi Tian
- State Key Laboratory of Plant Cell and Chromosome Engineering, Innovation Academy for Seed Design, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Jia-Wei Wang
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China
- ShanghaiTech University, Shanghai, 200031, China
| | - Jiayang Li
- University of Chinese Academy of Sciences, Beijing, 100049, China
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research (Beijing), Innovation Academy for Seed Design, Institute of Genetics and Developmental Biology Chinese Academy of Sciences, Beijing, 100101, China
| | - Bin Han
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Chinese Academy of Sciences, Shanghai, 200032, China
- ShanghaiTech University, Shanghai, 200031, China
- National Center for Gene Research, Shanghai, 200233, China
| |
Collapse
|
52
|
Ruperao P, Thirunavukkarasu N, Gandham P, Selvanayagam S, Govindaraj M, Nebie B, Manyasa E, Gupta R, Das RR, Odeny DA, Gandhi H, Edwards D, Deshpande SP, Rathore A. Sorghum Pan-Genome Explores the Functional Utility for Genomic-Assisted Breeding to Accelerate the Genetic Gain. FRONTIERS IN PLANT SCIENCE 2021; 12:666342. [PMID: 34140962 PMCID: PMC8204017 DOI: 10.3389/fpls.2021.666342] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Accepted: 04/28/2021] [Indexed: 05/05/2023]
Abstract
Sorghum (Sorghum bicolor L.) is a staple food crops in the arid and rainfed production ecologies. Sorghum plays a critical role in resilient farming and is projected as a smart crop to overcome the food and nutritional insecurity in the developing world. The development and characterisation of the sorghum pan-genome will provide insight into genome diversity and functionality, supporting sorghum improvement. We built a sorghum pan-genome using reference genomes as well as 354 genetically diverse sorghum accessions belonging to different races. We explored the structural and functional characteristics of the pan-genome and explain its utility in supporting genetic gain. The newly-developed pan-genome has a total of 35,719 genes, a core genome of 16,821 genes and an average of 32,795 genes in each cultivar. The variable genes are enriched with environment responsive genes and classify the sorghum accessions according to their race. We show that 53% of genes display presence-absence variation, and some of these variable genes are predicted to be functionally associated with drought adaptation traits. Using more than two million SNPs from the pan-genome, association analysis identified 398 SNPs significantly associated with important agronomic traits, of which, 92 were in genes. Drought gene expression analysis identified 1,788 genes that are functionally linked to different conditions, of which 79 were absent from the reference genome assembly. This study provides comprehensive genomic diversity resources in sorghum which can be used in genome assisted crop improvement.
Collapse
Affiliation(s)
- Pradeep Ruperao
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
| | | | - Prasad Gandham
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
| | | | | | - Baloua Nebie
- Sorghum Breeding Program, International Crops Research Institute for the Semi-Arid Tropics, Bamako, Mali
| | - Eric Manyasa
- Sorghum Breeding Program, International Crops Research Institute for the Semi-Arid Tropics, Nairobi, Kenya
| | - Rajeev Gupta
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
| | - Roma Rani Das
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
| | - Damaris A. Odeny
- Sorghum Breeding Program, International Crops Research Institute for the Semi-Arid Tropics, Nairobi, Kenya
| | - Harish Gandhi
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
| | - David Edwards
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Perth, WA, Australia
| | - Santosh P. Deshpande
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
- Santosh P. Deshpande
| | - Abhishek Rathore
- International Crops Research Institute for the Semi-Arid Tropics, Patancheru, India
- *Correspondence: Abhishek Rathore
| |
Collapse
|
53
|
Lang D, Zhang S, Ren P, Liang F, Sun Z, Meng G, Tan Y, Li X, Lai Q, Han L, Wang D, Hu F, Wang W, Liu S. Comparison of the two up-to-date sequencing technologies for genome assembly: HiFi reads of Pacific Biosciences Sequel II system and ultralong reads of Oxford Nanopore. Gigascience 2020; 9:giaa123. [PMID: 33319909 PMCID: PMC7736813 DOI: 10.1093/gigascience/giaa123] [Citation(s) in RCA: 63] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Revised: 07/02/2020] [Accepted: 10/08/2020] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND The availability of reference genomes has revolutionized the study of biology. Multiple competing technologies have been developed to improve the quality and robustness of genome assemblies during the past decade. The 2 widely used long-read sequencing providers-Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT)-have recently updated their platforms: PacBio enables high-throughput HiFi reads with base-level resolution of >99%, and ONT generated reads as long as 2 Mb. We applied the 2 up-to-date platforms to a single rice individual and then compared the 2 assemblies to investigate the advantages and limitations of each. RESULTS The results showed that ONT ultralong reads delivered higher contiguity, producing a total of 18 contigs of which 10 were assembled into a single chromosome compared to 394 contigs and 3 chromosome-level contigs for the PacBio assembly. The ONT ultralong reads also prevented assembly errors caused by long repetitive regions, for which we observed a total of 44 genes of false redundancies and 10 genes of false losses in the PacBio assembly, leading to over- or underestimation of the gene families in those long repetitive regions. We also noted that the PacBio HiFi reads generated assemblies with considerably fewer errors at the level of single nucleotides and small insertions and deletions than those of the ONT assembly, which generated an average 1.06 errors per kb and finally engendered 1,475 incorrect gene annotations via altered or truncated protein predictions. CONCLUSIONS It shows that both PacBio HiFi reads and ONT ultralong reads had their own merits. Further genome reference constructions could leverage both techniques to lessen the impact of assembly errors and subsequent annotation mistakes rooted in each.
Collapse
Affiliation(s)
- Dandan Lang
- GrandOmics Biosciences, No.1, East Nengyuan Road, Beijing 102200, China
| | - Shilai Zhang
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Research Center for Perennial Rice Engineering and Technology of Yunnan, School of Agriculture, Yunnan University, No.2, North Cuihu Road, Kunming, Yunnan 650091, China
| | - Pingping Ren
- GrandOmics Biosciences, No.1, East Nengyuan Road, Beijing 102200, China
| | - Fan Liang
- GrandOmics Biosciences, No.1, East Nengyuan Road, Beijing 102200, China
| | - Zongyi Sun
- GrandOmics Biosciences, No.1, East Nengyuan Road, Beijing 102200, China
| | - Guanliang Meng
- GrandOmics Biosciences, No.1, East Nengyuan Road, Beijing 102200, China
| | - Yuntao Tan
- GrandOmics Biosciences, No.1, East Nengyuan Road, Beijing 102200, China
| | - Xiaokang Li
- GrandOmics Biosciences, No.1, East Nengyuan Road, Beijing 102200, China
| | - Qihua Lai
- GrandOmics Biosciences, No.1, East Nengyuan Road, Beijing 102200, China
| | - Lingling Han
- GrandOmics Biosciences, No.1, East Nengyuan Road, Beijing 102200, China
| | - Depeng Wang
- GrandOmics Biosciences, No.1, East Nengyuan Road, Beijing 102200, China
| | - Fengyi Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Research Center for Perennial Rice Engineering and Technology of Yunnan, School of Agriculture, Yunnan University, No.2, North Cuihu Road, Kunming, Yunnan 650091, China
| | - Wen Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, No.32, East Jiaochang Road, Kunming, Yunnan 650223, China
- Center for Ecological and Environmental Sciences, Key Laboratory for Space Bioscience & Biotechnology, Northwestern Polytechnical University, No.127, West Youyi Road, Xi'an, Shanxi 710072, China
| | - Shanlin Liu
- GrandOmics Biosciences, No.1, East Nengyuan Road, Beijing 102200, China
- Department of Entomology, College of Plant Protection, China Agricultural University, No.2, West Yuanmingyuan Road, Beijing 100193, China
| |
Collapse
|
54
|
Multiple wheat genomes reveal global variation in modern breeding. Nature 2020; 588:277-283. [PMID: 33239791 PMCID: PMC7759465 DOI: 10.1038/s41586-020-2961-x] [Citation(s) in RCA: 402] [Impact Index Per Article: 100.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Accepted: 09/09/2020] [Indexed: 12/24/2022]
Abstract
Advances in genomics have expedited the improvement of several agriculturally important crops but similar efforts in wheat (Triticum spp.) have been more challenging. This is largely owing to the size and complexity of the wheat genome1, and the lack of genome-assembly data for multiple wheat lines2,3. Here we generated ten chromosome pseudomolecule and five scaffold assemblies of hexaploid wheat to explore the genomic diversity among wheat lines from global breeding programs. Comparative analysis revealed extensive structural rearrangements, introgressions from wild relatives and differences in gene content resulting from complex breeding histories aimed at improving adaptation to diverse environments, grain yield and quality, and resistance to stresses4,5. We provide examples outlining the utility of these genomes, including a detailed multi-genome-derived nucleotide-binding leucine-rich repeat protein repertoire involved in disease resistance and the characterization of Sm16, a gene associated with insect resistance. These genome assemblies will provide a basis for functional gene discovery and breeding to deliver the next generation of modern wheat cultivars. Comparison of multiple genome assemblies from wheat reveals extensive diversity that results from the complex breeding history of wheat and provides a basis for further potential improvements to this important food crop.
Collapse
|
55
|
Tahir Ul Qamar M, Zhu X, Khan MS, Xing F, Chen LL. Pan-genome: A promising resource for noncoding RNA discovery in plants. THE PLANT GENOME 2020; 13:e20046. [PMID: 33217199 DOI: 10.1002/tpg2.20046] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 06/08/2020] [Accepted: 06/22/2020] [Indexed: 05/05/2023]
Abstract
Plant genomes contain both protein-coding and noncoding sequences including transposable elements (TEs) and noncoding RNAs (ncRNAs). The ncRNAs are recognized as important elements that play fundamental roles in the structural organization and function of plant genomes. Despite various hypotheses, TEs are believed to be a major precursor of ncRNAs. Transposable elements are also prime factors that cause genomic variation among members of a species. Hence, TEs pose a major challenge in the discovery and analysis of ncRNAs. With the increase in the number of sequenced plant genomes, it is now accepted that a single reference genome is insufficient to represent the complete genomic diversity and contents of a species, and exploring the pan-genome of a species is critical. In this review, we summarize the recent progress in the field of plant pan-genomes. We also discuss TEs and their roles in ncRNA biogenesis and present our perspectives on the application of pan-genomes for the discovery of ncRNAs to fully explore and exploit their biological roles in plants.
Collapse
Affiliation(s)
- Muhammad Tahir Ul Qamar
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, Nanning, 530004, P. R. China
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Xitong Zhu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| | - Muhammad Sarwar Khan
- Center of Agricultural Biochemistry and Biotechnology, University of Agriculture, Faisalabad, 38000, Pakistan
| | - Feng Xing
- College of Life Science, Xinyang Normal University, Xinyang, 464000, P. R. China
| | - Ling-Ling Chen
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, Nanning, 530004, P. R. China
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, 430070, P. R. China
| |
Collapse
|
56
|
Nieto Feliner G, Casacuberta J, Wendel JF. Genomics of Evolutionary Novelty in Hybrids and Polyploids. Front Genet 2020; 11:792. [PMID: 32849797 PMCID: PMC7399645 DOI: 10.3389/fgene.2020.00792] [Citation(s) in RCA: 63] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Accepted: 07/03/2020] [Indexed: 12/15/2022] Open
Abstract
It has long been recognized that hybridization and polyploidy are prominent processes in plant evolution. Although classically recognized as significant in speciation and adaptation, recognition of the importance of interspecific gene flow has dramatically increased during the genomics era, concomitant with an unending flood of empirical examples, with or without genome doubling. Interspecific gene flow is thus increasingly thought to lead to evolutionary innovation and diversification, via adaptive introgression, homoploid hybrid speciation and allopolyploid speciation. Less well understood, however, are the suite of genetic and genomic mechanisms set in motion by the merger of differentiated genomes, and the temporal scale over which recombinational complexity mediated by gene flow might be expressed and exposed to natural selection. We focus on these issues here, considering the types of molecular genetic and genomic processes that might be set in motion by the saltational event of genome merger between two diverged species, either with or without genome doubling, and how these various processes can contribute to novel phenotypes. Genetic mechanisms include the infusion of new alleles and the genesis of novel structural variation including translocations and inversions, homoeologous exchanges, transposable element mobilization and novel insertional effects, presence-absence variation and copy number variation. Polyploidy generates massive transcriptomic and regulatory alteration, presumably set in motion by disrupted stoichiometries of regulatory factors, small RNAs and other genome interactions that cascade from single-gene expression change up through entire networks of transformed regulatory modules. We highlight both these novel combinatorial possibilities and the range of temporal scales over which such complexity might be generated, and thus exposed to natural selection and drift.
Collapse
Affiliation(s)
- Gonzalo Nieto Feliner
- Department of Biodiversity and Conservation, Real Jardín Botánico, CSIC, Madrid, Spain
| | - Josep Casacuberta
- Center for Research in Agricultural Genomics, CRAG (CSIC-IRTA-UAB-UB), Barcelona, Spain
| | - Jonathan F. Wendel
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA, United States
| |
Collapse
|
57
|
Liu Y, Du H, Li P, Shen Y, Peng H, Liu S, Zhou GA, Zhang H, Liu Z, Shi M, Huang X, Li Y, Zhang M, Wang Z, Zhu B, Han B, Liang C, Tian Z. Pan-Genome of Wild and Cultivated Soybeans. Cell 2020; 182:162-176.e13. [PMID: 32553274 DOI: 10.1016/j.cell.2020.05.023] [Citation(s) in RCA: 403] [Impact Index Per Article: 100.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2019] [Revised: 04/07/2020] [Accepted: 05/12/2020] [Indexed: 12/21/2022]
Abstract
Soybean is one of the most important vegetable oil and protein feed crops. To capture the entire genomic diversity, it is needed to construct a complete high-quality pan-genome from diverse soybean accessions. In this study, we performed individual de novo genome assemblies for 26 representative soybeans that were selected from 2,898 deeply sequenced accessions. Using these assembled genomes together with three previously reported genomes, we constructed a graph-based genome and performed pan-genome analysis, which identified numerous genetic variations that cannot be detected by direct mapping of short sequence reads onto a single reference genome. The structural variations from the 2,898 accessions that were genotyped based on the graph-based genome and the RNA sequencing (RNA-seq) data from the representative 26 accessions helped to link genetic variations to candidate genes that are responsible for important traits. This pan-genome resource will promote evolutionary and functional genomics studies in soybean.
Collapse
Affiliation(s)
- Yucheng Liu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; College of Advanced Agriculture Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Huilong Du
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; College of Advanced Agriculture Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Pengcheng Li
- Berry Genomics Corporation, Beijing 100015, China
| | - Yanting Shen
- School of Pharmaceutical Sciences, Guangzhou University of Chinese Medicine, Guangzhou 510006, China
| | - Hua Peng
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; College of Advanced Agriculture Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Shulin Liu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Guo-An Zhou
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | | | - Zhi Liu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; College of Advanced Agriculture Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Miao Shi
- Berry Genomics Corporation, Beijing 100015, China
| | - Xuehui Huang
- College of Life Sciences, Shanghai Normal University, Shanghai 200234, China
| | - Yan Li
- National Center for Gene Research, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200032, China
| | - Min Zhang
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Zheng Wang
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Baoge Zhu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Bin Han
- National Center for Gene Research, CAS Center for Excellence in Molecular Plant Sciences, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200032, China
| | - Chengzhi Liang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; College of Advanced Agriculture Sciences, University of Chinese Academy of Sciences, Beijing 100049, China.
| | - Zhixi Tian
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; College of Advanced Agriculture Sciences, University of Chinese Academy of Sciences, Beijing 100049, China.
| |
Collapse
|
58
|
Wang J, Lu N, Yi F, Xiao Y. Identification of Transposable Elements in Conifer and Their Potential Application in Breeding. Evol Bioinform Online 2020; 16:1176934320930263. [PMID: 32595272 PMCID: PMC7297469 DOI: 10.1177/1176934320930263] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Accepted: 05/04/2020] [Indexed: 12/20/2022] Open
Abstract
Transposable elements (TEs) are known to play a role in genome evolution, gene regulation, and epigenetics, representing potential tools for genetics research in and breeding of conifers. Recently, thanks to the development of high-throughput sequencing, more conifer genomes have been reported. Using bioinformatics tools, the TEs of 3 important conifers (Picea abies, Picea glauce, and Pinus taeda) were identified in our previous study, which provided a foundation for accelerating the use of TEs in conifer breeding and genetic study. Here, we review recent studies on the functional biology of TEs and discuss the potential applications for TEs in conifers.
Collapse
Affiliation(s)
- Junhui Wang
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation of State Forestry Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, China
| | - Nan Lu
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation of State Forestry Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, China
| | - Fei Yi
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation of State Forestry Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, China
| | - Yao Xiao
- State Key Laboratory of Tree Genetics and Breeding, Key Laboratory of Tree Breeding and Cultivation of State Forestry Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, China
| |
Collapse
|
59
|
K. Srivastava R, Bollam S, Pujarula V, Pusuluri M, Singh RB, Potupureddi G, Gupta R. Exploitation of Heterosis in Pearl Millet: A Review. PLANTS (BASEL, SWITZERLAND) 2020; 9:E807. [PMID: 32605134 PMCID: PMC7412370 DOI: 10.3390/plants9070807] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/06/2020] [Revised: 06/23/2020] [Accepted: 06/23/2020] [Indexed: 01/06/2023]
Abstract
The phenomenon of heterosis has fascinated plant breeders ever since it was first described by Charles Darwin in 1876 in the vegetable kingdom and later elaborated by George H Shull and Edward M East in maize during 1908. Heterosis is the phenotypic and functional superiority manifested in the F1 crosses over the parents. Various classical complementation mechanisms gave way to the study of the underlying potential cellular and molecular mechanisms responsible for heterosis. In cereals, such as maize, heterosis has been exploited very well, with the development of many single-cross hybrids that revolutionized the yield and productivity enhancements. Pearl millet (Pennisetum glaucum (L.) R. Br.) is one of the important cereal crops with nutritious grains and lower water and energy footprints in addition to the capability of growing in some of the harshest and most marginal environments of the world. In this highly cross-pollinating crop, heterosis was exploited by the development of a commercially viable cytoplasmic male-sterility (CMS) system involving a three-lines breeding system (A-, B- and R-lines). The first set of male-sterile lines, i.e., Tift 23A and Tift18A, were developed in the early 1960s in Tifton, Georgia, USA. These provided a breakthrough in the development of hybrids worldwide, e.g., Tift 23A was extensively used by Punjab Agricultural University (PAU), Ludhiana, India, for the development of the first single-cross pearl millet hybrid, named Hybrid Bajra 1 (HB 1), in 1965. Over the past five decades, the pearl millet community has shown tremendous improvement in terms of cytoplasmic and nuclear diversification of the hybrid parental lines, which led to a progressive increase in the yield and adaptability of the hybrids that were developed, resulting in significant genetic gains. Lately, the whole genome sequencing of Tift 23D2B1 and re-sequencing of circa 1000 genomes by a consortium led by the International Crops Research Institute for the Semi-Arid Tropics (ICRISAT) has been a significant milestone in the development of cutting-edge genetic and genomic resources in pearl millet. Recently, the application of genomics and molecular technologies has provided better insights into genetic architecture and patterns of heterotic gene pools. Development of whole-genome prediction models incorporating heterotic gene pool models, mapped traits and markers have the potential to take heterosis breeding to a new level in pearl millet. This review discusses advances and prospects in various fronts of heterosis for pearl millet.
Collapse
Affiliation(s)
- Rakesh K. Srivastava
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad TS 502324, India; (S.B.); (V.P.); (M.P.); (R.B.S.); (G.P.)
| | | | | | | | | | | | - Rajeev Gupta
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad TS 502324, India; (S.B.); (V.P.); (M.P.); (R.B.S.); (G.P.)
| |
Collapse
|
60
|
Heller D, Vingron M. SVIM: structural variant identification using mapped long reads. Bioinformatics 2020; 35:2907-2915. [PMID: 30668829 PMCID: PMC6735718 DOI: 10.1093/bioinformatics/btz041] [Citation(s) in RCA: 166] [Impact Index Per Article: 41.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2018] [Revised: 01/04/2019] [Accepted: 01/22/2019] [Indexed: 02/07/2023] Open
Abstract
Motivation Structural variants are defined as genomic variants larger than 50 bp. They have been shown to affect more bases in any given genome than single-nucleotide polymorphisms or small insertions and deletions. Additionally, they have great impact on human phenotype and diversity and have been linked to numerous diseases. Due to their size and association with repeats, they are difficult to detect by shotgun sequencing, especially when based on short reads. Long read, single-molecule sequencing technologies like those offered by Pacific Biosciences or Oxford Nanopore Technologies produce reads with a length of several thousand base pairs. Despite the higher error rate and sequencing cost, long-read sequencing offers many advantages for the detection of structural variants. Yet, available software tools still do not fully exploit the possibilities. Results We present SVIM, a tool for the sensitive detection and precise characterization of structural variants from long-read data. SVIM consists of three components for the collection, clustering and combination of structural variant signatures from read alignments. It discriminates five different variant classes including similar types, such as tandem and interspersed duplications and novel element insertions. SVIM is unique in its capability of extracting both the genomic origin and destination of duplications. It compares favorably with existing tools in evaluations on simulated data and real datasets from Pacific Biosciences and Nanopore sequencing machines. Availability and implementation The source code and executables of SVIM are available on Github: github.com/eldariont/svim. SVIM has been implemented in Python 3 and published on bioconda and the Python Package Index. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- David Heller
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Martin Vingron
- Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany
| |
Collapse
|
61
|
Howlader J, Robin AHK, Natarajan S, Biswas MK, Sumi KR, Song CY, Park JI, Nou IS. Transcriptome Analysis by RNA-Seq Reveals Genes Related to Plant Height in Two Sets of Parent-hybrid Combinations in Easter lily (Lilium longiflorum). Sci Rep 2020; 10:9082. [PMID: 32494055 PMCID: PMC7270119 DOI: 10.1038/s41598-020-65909-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Accepted: 05/12/2020] [Indexed: 11/23/2022] Open
Abstract
In this study, two different hybrids of Easter lily (Lilium longiflorum), obtained from two cross combinations, along with their four parents were sequenced by high–throughput RNA–sequencing (RNA–Seq) to find out differentially expressed gene in parent-hybrid combinations. The leaf mRNA profiles of two hybrids and their four parents were RNA–sequenced with a view to identify the potential candidate genes related to plant height heterosis. In both cross combinations, based to morphological traits mid–parent heterosis (MPH) was higher than high–parent heterosis (HPH) for plant height, leaf length, and number of flowers whereas HPH was higher than MPH for flowering time. A total of 4,327 differentially expressed genes (DEGs) were identified through RNA–Seq between the hybrids and their parents based on fold changes (FC) ≥ 2 for up– and ≤ –2 for down–regulation. Venn diagram analysis revealed that there were 703 common DEGs in two hybrid combinations, those were either up– or down–regulated. Most of the commonly expressed DEGs exhibited higher non–additive effects especially overdominance (75.9%) rather than additive (19.4%) and dominance (4.76%) effects. Among the 384 functionally annotated DEGs identified through Blast2GO tool, 12 DEGs were up–regulated and 16 of them were down–regulated in a similar fashion in both hybrids as revealed by heat map analysis. These 28 universally expressed DEGs were found to encode different types of proteins and enzymes those might regulate heterosis by modulating growth, development and stress–related functions in lily. In addition, gene ontology (GO) analysis of 260 annotated DEGs revealed that biological process might play dominant role in heterotic expression. In this first report of transcriptome sequencing in Easter lily, the notable universally up-regulated DEGs annotated ABC transporter A family member–like, B3 domain–containing, disease resistance RPP13/1, auxin–responsive SAUR68–like, and vicilin–like antimicrobial peptides 2–2 proteins those were perhaps associated with plant height heterosis. The genes expressed universally due to their overdominace function perhaps influenced MPH for greater plant height― largely by modulating biological processes involved therein. The genes identified in this study might be exploited in heterosis breeding for plant height of L. longiflorum.
Collapse
Affiliation(s)
- Jewel Howlader
- Department of Horticulture, Sunchon National University, 255, Jungang-ro, Suncheon, Jeonnam, 57922, Republic of Korea.,Department of Horticulture, Patuakhali Science and Technology University, Dumki, Patuakhali, 8602, Bangladesh
| | - Arif Hasan Khan Robin
- Department of Horticulture, Sunchon National University, 255, Jungang-ro, Suncheon, Jeonnam, 57922, Republic of Korea.,Department of Genetics and Plant Breeding, Bangladesh Agricultural University, Mymensingh, 2202, Bangladesh
| | - Sathishkumar Natarajan
- Department of Horticulture, Sunchon National University, 255, Jungang-ro, Suncheon, Jeonnam, 57922, Republic of Korea
| | - Manosh Kumar Biswas
- Department of Horticulture, Sunchon National University, 255, Jungang-ro, Suncheon, Jeonnam, 57922, Republic of Korea
| | - Kanij Rukshana Sumi
- Department of Fisheries Science, Chonnam National University, 50, Daehak-ro, Yeosu, Jeonnam, 59626, Republic of Korea.,Department of Aquaculture, Patuakhali Science and Technology University, Dumki, Patuakhali, 8602, Bangladesh
| | - Cheon Young Song
- Department of Floriculture, Korea National College of Agriculture and Fisheries, 1515, Kongjwipatjwi-ro, Wansan-gu, Jeonju-si, Jeollabuk-do, 54874, Republic of Korea
| | - Jong-In Park
- Department of Horticulture, Sunchon National University, 255, Jungang-ro, Suncheon, Jeonnam, 57922, Republic of Korea
| | - Ill-Sup Nou
- Department of Horticulture, Sunchon National University, 255, Jungang-ro, Suncheon, Jeonnam, 57922, Republic of Korea.
| |
Collapse
|
62
|
Identifying genetic variants underlying phenotypic variation in plants without complete genomes. Nat Genet 2020; 52:534-540. [PMID: 32284578 PMCID: PMC7610390 DOI: 10.1038/s41588-020-0612-7] [Citation(s) in RCA: 72] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Accepted: 03/10/2020] [Indexed: 12/11/2022]
Abstract
Structural variants and presence/absence polymorphisms are common in plant genomes, yet they are routinely overlooked in genome-wide association studies (GWAS). Here, we expand the type of genetic variants detected in GWAS to include major deletions, insertions and rearrangements. We first use raw sequencing data directly to derive short sequences, k-mers, that mark a broad range of polymorphisms independently of a reference genome. We then link k-mers associated with phenotypes to specific genomic regions. Using this approach, we reanalyzed 2,000 traits in Arabidopsis thaliana, tomato and maize populations. Associations identified with k-mers recapitulate those found with SNPs, but with stronger statistical support. Importantly, we discovered new associations with structural variants and with regions missing from reference genomes. Our results demonstrate the power of performing GWAS before linking sequence reads to specific genomic regions, which allows the detection of a wider range of genetic variants responsible for phenotypic variation.
Collapse
|
63
|
Dolatabadian A, Bayer PE, Tirnaz S, Hurgobin B, Edwards D, Batley J. Characterization of disease resistance genes in the Brassica napus pangenome reveals significant structural variation. PLANT BIOTECHNOLOGY JOURNAL 2020; 18:969-982. [PMID: 31553100 PMCID: PMC7061875 DOI: 10.1111/pbi.13262] [Citation(s) in RCA: 66] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Revised: 08/30/2019] [Accepted: 09/13/2019] [Indexed: 05/18/2023]
Abstract
Methods based on single nucleotide polymorphism (SNP), copy number variation (CNV) and presence/absence variation (PAV) discovery provide a valuable resource to study gene structure and evolution. However, as a result of these structural variations, a single reference genome is unable to cover the entire gene content of a species. Therefore, pangenomics analysis is needed to ensure that the genomic diversity within a species is fully represented. Brassica napus is one of the most important oilseed crops in the world and exhibits variability in its resistance genes across different cultivars. Here, we characterized resistance gene distribution across 50 B. napus lines. We identified a total of 1749 resistance gene analogs (RGAs), of which 996 are core and 753 are variable, 368 of which are not present in the reference genome (cv. Darmor-bzh). In addition, a total of 15 318 SNPs were predicted within 1030 of the RGAs. The results showed that core R-genes harbour more SNPs than variable genes. More nucleotide binding site-leucine-rich repeat (NBS-LRR) genes were located in clusters than as singletons, with variable genes more likely to be found in clusters. We identified 106 RGA candidates linked to blackleg resistance quantitative trait locus (QTL). This study provides a better understanding of resistance genes to target for genomics-based improvement and improved disease resistance.
Collapse
Affiliation(s)
- Aria Dolatabadian
- UWA School of Biological Sciences and the UWA Institute of AgricultureFaculty of ScienceThe University of Western AustraliaCrawleyWAAustralia
| | - Philipp E. Bayer
- UWA School of Biological Sciences and the UWA Institute of AgricultureFaculty of ScienceThe University of Western AustraliaCrawleyWAAustralia
| | - Soodeh Tirnaz
- UWA School of Biological Sciences and the UWA Institute of AgricultureFaculty of ScienceThe University of Western AustraliaCrawleyWAAustralia
| | - Bhavna Hurgobin
- UWA School of Biological Sciences and the UWA Institute of AgricultureFaculty of ScienceThe University of Western AustraliaCrawleyWAAustralia
| | - David Edwards
- UWA School of Biological Sciences and the UWA Institute of AgricultureFaculty of ScienceThe University of Western AustraliaCrawleyWAAustralia
| | - Jacqueline Batley
- UWA School of Biological Sciences and the UWA Institute of AgricultureFaculty of ScienceThe University of Western AustraliaCrawleyWAAustralia
| |
Collapse
|
64
|
Maximising recombination across macadamia populations to generate linkage maps for genome anchoring. Sci Rep 2020; 10:5048. [PMID: 32193408 PMCID: PMC7081209 DOI: 10.1038/s41598-020-61708-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Accepted: 02/24/2020] [Indexed: 01/02/2023] Open
Abstract
The Proteaceae genus Macadamia has a recent history of domestication as a commercial nut crop. We aimed to establish the first sequence-based haploid-correlated reference genetic linkage maps for this primarily outcrossing perennial tree crop, with marker density suitable for genome anchoring. Four first generation populations were used to maximise the segregation patterns available within full-sib, biparental and self-pollinated progeny. This allowed us to combine segregation data from overlapping subsets of >4,000 informative sequence-tagged markers to increase the effective coverage of the karyotype represented by the recombinant crossover events detected. All maps had 14 linkage groups, corresponding to the Macadamia haploid chromosome number, and enabled the anchoring and orientation of sequence scaffolds to construct a pseudo-chromosomal genome assembly for macadamia. Comparison of individual maps indicated a high level of congruence, with minor discrepancies satisfactorily resolved within the integrated maps. The combined set of maps significantly improved marker density and the proportion (70%) of the genome sequence assembly anchored. Overall, increasing our understanding of the genetic landscape and genome for this nut crop represents a substantial advance in macadamia genetics and genomics. The set of maps, large number of sequence-based markers and the reconstructed genome provide a toolkit to underpin future breeding that should help to extend the macadamia industry as well as provide resources for the long term conservation of natural populations in eastern Australia of this unique genus.
Collapse
|
65
|
Gabur I, Chawla HS, Lopisso DT, von Tiedemann A, Snowdon RJ, Obermeier C. Gene presence-absence variation associates with quantitative Verticillium longisporum disease resistance in Brassica napus. Sci Rep 2020; 10:4131. [PMID: 32139810 PMCID: PMC7057980 DOI: 10.1038/s41598-020-61228-3] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Accepted: 02/07/2020] [Indexed: 12/31/2022] Open
Abstract
Although copy number variation (CNV) and presence-absence variation (PAV) have been discovered in selected gene families in most crop species, the global prevalence of these polymorphisms in most complex genomes is still unclear and their influence on quantitatively inherited agronomic traits is still largely unknown. Here we analyze the association of gene PAV with resistance of oilseed rape (Brassica napus) against the important fungal pathogen Verticillium longisporum, as an example for a complex, quantitative disease resistance in the strongly rearranged genome of a recent allopolyploid crop species. Using Single Nucleotide absence Polymorphism (SNaP) markers to efficiently trace PAV in breeding populations, we significantly increased the resolution of loci influencing V. longisporum resistance in biparental and multi-parental mapping populations. Gene PAV, assayed by resequencing mapping parents, was observed in 23-51% of the genes within confidence intervals of quantitative trait loci (QTL) for V. longisporum resistance, and high-priority candidate genes identified within QTL were all affected by PAV. The results demonstrate the prominent role of gene PAV in determining agronomic traits, suggesting that this important class of polymorphism should be exploited more systematically in future plant breeding.
Collapse
Affiliation(s)
- Iulian Gabur
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University Giessen, 35392, Giessen, Germany
| | - Harmeet Singh Chawla
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University Giessen, 35392, Giessen, Germany
| | - Daniel Teshome Lopisso
- Section of General Plant Pathology and Crop Protection, Georg August University Göttingen, 37077, Göttingen, Germany
- College of Agriculture and Veterinary Medicine, Jimma University, Jimma, Ethiopia
| | - Andreas von Tiedemann
- Section of General Plant Pathology and Crop Protection, Georg August University Göttingen, 37077, Göttingen, Germany
| | - Rod J Snowdon
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University Giessen, 35392, Giessen, Germany
| | - Christian Obermeier
- Department of Plant Breeding, IFZ Research Centre for Biosystems, Land Use and Nutrition, Justus Liebig University Giessen, 35392, Giessen, Germany.
| |
Collapse
|
66
|
Khan AW, Garg V, Roorkiwal M, Golicz AA, Edwards D, Varshney RK. Super-Pangenome by Integrating the Wild Side of a Species for Accelerated Crop Improvement. TRENDS IN PLANT SCIENCE 2020; 25:148-158. [PMID: 31787539 PMCID: PMC6988109 DOI: 10.1016/j.tplants.2019.10.012] [Citation(s) in RCA: 131] [Impact Index Per Article: 32.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Revised: 10/28/2019] [Accepted: 10/29/2019] [Indexed: 05/19/2023]
Abstract
The pangenome provides genomic variations in the cultivated gene pool for a given species. However, as the crop's gene pool comprises many species, especially wild relatives with diverse genetic stock, here we suggest using accessions from all available species of a given genus for the development of a more comprehensive and complete pangenome, which we refer to as a super-pangenome. The super-pangenome provides a complete genomic variation repertoire of a genus and offers unprecedented opportunities for crop improvement. This opinion article focuses on recent developments in crop pangenomics, the need for a super-pangenome that should include wild species, and its application for crop improvement.
Collapse
Affiliation(s)
- Aamir W Khan
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India; School of Biological Sciences, The University of Western Australia (UWA), Crawley, WA, Australia
| | - Vanika Garg
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Manish Roorkiwal
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India
| | - Agnieszka A Golicz
- Plant Molecular Biology and Biotechnology Laboratory, Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Parkville, Melbourne, VIC, Australia
| | - David Edwards
- School of Biological Sciences, The University of Western Australia (UWA), Crawley, WA, Australia
| | - Rajeev K Varshney
- Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India.
| |
Collapse
|
67
|
An Y, Mi X, Zhao S, Guo R, Xia X, Liu S, Wei C. Revealing Distinctions in Genetic Diversity and Adaptive Evolution Between Two Varieties of Camellia sinensis by Whole-Genome Resequencing. FRONTIERS IN PLANT SCIENCE 2020; 11:603819. [PMID: 33329675 PMCID: PMC7732639 DOI: 10.3389/fpls.2020.603819] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Accepted: 11/03/2020] [Indexed: 05/07/2023]
Abstract
Camellia sinensis var. sinensis (CSS) and C. sinensis var. assamica (CSA) are the two most economically important tea varieties. They have different characteristics and geographical distribution. Their genetic diversity and differentiation are unclear. Here, we identified 18,903,625 single nucleotide polymorphisms (SNPs) and 7,314,133 insertion-deletion mutations (indels) by whole-genome resequencing of 30 cultivated and three wild related species. Population structure and phylogenetic tree analyses divided the cultivated accessions into CSS and CSA containing 6,440,419 and 6,176,510 unique variations, respectively. The CSS subgroup possessed higher genetic diversity and was enriched for rare alleles. The CSA subgroup had more non-synonymous mutations and might have experienced a greater degree of balancing selection. The evolution rate (dN/dS) and KEGG enrichment indicated that genes involved in the synthesis and metabolism of flavor substances were positively selected in both CSS and CSA subpopulations. However, there are extensive genome differentiation regions (2959 bins and approximately 148 M in size) between the two subgroups. Compared with CSA (141 selected regions containing 124 genes), the CSS subgroup (830 selected regions containing 687 genes) displayed more selection regions potentially related to environmental adaptability. Fifty-three pairs of polymorphic indel markers were developed. Some markers were located in hormone-related genes with distinct alleles in the two cultivated subgroups. These identified variations and selected regions provide clues for the differentiation and adaptive evolution of tea varieties. The newly developed indel markers will be valuable in further genetic research on tea plants.
Collapse
|
68
|
Golicz AA, Bayer PE, Bhalla PL, Batley J, Edwards D. Pangenomics Comes of Age: From Bacteria to Plant and Animal Applications. Trends Genet 2019; 36:132-145. [PMID: 31882191 DOI: 10.1016/j.tig.2019.11.006] [Citation(s) in RCA: 109] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 11/09/2019] [Accepted: 11/12/2019] [Indexed: 02/01/2023]
Abstract
The pangenome refers to a collection of genomic sequence found in the entire species or population rather than in a single individual; the sequence can be core, present in all individuals, or accessory (variable or dispensable), found in a subset of individuals only. While pangenomic studies were first undertaken in bacterial species, developments in genome sequencing and assembly approaches have allowed construction of pangenomes for eukaryotic organisms, fungi, plants, and animals, including two large-scale human pangenome projects. Analysis of the these pangenomes revealed key differences, most likely stemming from divergent evolutionary histories, but also surprising similarities.
Collapse
Affiliation(s)
- Agnieszka A Golicz
- Plant Molecular Biology and Biotechnology Laboratory, Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Melbourne, VIC, Australia.
| | - Philipp E Bayer
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Crawley, WA, Australia
| | - Prem L Bhalla
- Plant Molecular Biology and Biotechnology Laboratory, Faculty of Veterinary and Agricultural Sciences, University of Melbourne, Melbourne, VIC, Australia
| | - Jacqueline Batley
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Crawley, WA, Australia
| | - David Edwards
- School of Biological Sciences and Institute of Agriculture, The University of Western Australia, Crawley, WA, Australia.
| |
Collapse
|
69
|
Mabire C, Duarte J, Darracq A, Pirani A, Rimbert H, Madur D, Combes V, Vitte C, Praud S, Rivière N, Joets J, Pichon JP, Nicolas SD. High throughput genotyping of structural variations in a complex plant genome using an original Affymetrix® axiom® array. BMC Genomics 2019; 20:848. [PMID: 31722668 PMCID: PMC6854671 DOI: 10.1186/s12864-019-6136-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Accepted: 09/23/2019] [Indexed: 12/19/2022] Open
Abstract
Background Insertions/deletions (InDels) and more specifically presence/absence variations (PAVs) are pervasive in several species and have strong functional and phenotypic effect by removing or drastically modifying genes. Genotyping of such variants on large panels remains poorly addressed, while necessary for approaches such as association mapping or genomic selection. Results We have developed, as a proof of concept, a new high-throughput and affordable approach to genotype InDels. We first identified 141,000 InDels by aligning reads from the B73 line against the genome of three temperate maize inbred lines (F2, PH207, and C103) and reciprocally. Next, we designed an Affymetrix® Axiom® array to target these InDels, with a combination of probes selected at breakpoint sites (13%) or within the InDel sequence, either at polymorphic (25%) or non-polymorphic sites (63%) sites. The final array design is composed of 662,772 probes and targets 105,927 InDels, including PAVs ranging from 35 bp to 129kbp. After Affymetrix® quality control, we successfully genotyped 86,648 polymorphic InDels (82% of all InDels interrogated by the array) on 445 maize DNA samples with 422,369 probes. Genotyping InDels using this approach produced a highly reliable dataset, with low genotyping error (~ 3%), high call rate (~ 98%), and high reproducibility (> 95%). This reliability can be further increased by combining genotyping of several probes calling the same InDels (< 0.1% error rate and > 99.9% of call rate for 5 probes). This “proof of concept” tool was used to estimate the kinship matrix between 362 maize lines with 57,824 polymorphic InDels. This InDels kinship matrix was highly correlated with kinship estimated using SNPs from Illumina 50 K SNP arrays. Conclusions We efficiently genotyped thousands of small to large InDels on a sizeable number of individuals using a new Affymetrix® Axiom® array. This powerful approach opens the way to studying the contribution of InDels to trait variation and heterosis in maize. The approach is easily extendable to other species and should contribute to decipher the biological impact of InDels at a larger scale.
Collapse
Affiliation(s)
- Clément Mabire
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Jorge Duarte
- Biogemma - Centre de Recherche de Chappes, CS 90126, 63720, Chappes, France
| | - Aude Darracq
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Ali Pirani
- Thermo Fisher Scientific, 3450 Central Expressway, Santa Clara, CA, 95051, USA
| | - Hélène Rimbert
- Biogemma - Centre de Recherche de Chappes, CS 90126, 63720, Chappes, France.,Present address: GDEC, INRA, Université Clermont Auvergne, 63000, Clermont-Ferrand, France
| | - Delphine Madur
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Valérie Combes
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Clémentine Vitte
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Sébastien Praud
- Biogemma - Centre de Recherche de Chappes, CS 90126, 63720, Chappes, France
| | - Nathalie Rivière
- Biogemma - Centre de Recherche de Chappes, CS 90126, 63720, Chappes, France
| | - Johann Joets
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | | | - Stéphane D Nicolas
- GQE - Le Moulon, INRA, Univ. Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, 91190, Gif-sur-Yvette, France.
| |
Collapse
|
70
|
A high-quality cucumber genome assembly enhances computational comparative genomics. Mol Genet Genomics 2019; 295:177-193. [PMID: 31620884 DOI: 10.1007/s00438-019-01614-3] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Accepted: 09/30/2019] [Indexed: 01/12/2023]
Abstract
Genetic variation is expressed by the presence of polymorphisms in compared genomes of individuals that can be transferred to next generations. The aim of this work was to reveal genome dynamics by predicting polymorphisms among the genomes of three individuals of the highly inbred B10 cucumber (Cucumis sativus L.) line. In this study, bioinformatic comparative genomics was used to uncover cucumber genome dynamics (also called real-time evolution). We obtained a new genome draft assembly from long single molecule real-time (SMRT) sequencing reads and used short paired-end read data from three individuals to analyse the polymorphisms. Using this approach, we uncovered differentiation aspects in the genomes of the inbred B10 line. The newly assembled genome sequence (B10v3) has the highest contiguity and quality characteristics among the currently available cucumber genome draft sequences. Standard and newly designed approaches were used to predict single nucleotide and structural variants that were unique among the three individual genomes. Some of the variant predictions spanned protein-coding genes and their promoters, and some were in the neighbourhood of annotated interspersed repetitive elements, indicating that the highly inbred homozygous plants remained genetically dynamic. This is the first bioinformatic comparative genomics study of a single highly inbred plant line. For this project, we developed a polymorphism prediction method with optimized precision parameters, which allowed the effective detection of small nucleotide variants (SNVs). This methodology could significantly improve bioinformatic pipelines for comparative genomics and thus has great practical potential in genomic metadata handling.
Collapse
|
71
|
Wang Z, Miao H, Liu J, Xu B, Yao X, Xu C, Zhao S, Fang X, Jia C, Wang J, Zhang J, Li J, Xu Y, Wang J, Ma W, Wu Z, Yu L, Yang Y, Liu C, Guo Y, Sun S, Baurens FC, Martin G, Salmon F, Garsmeur O, Yahiaoui N, Hervouet C, Rouard M, Laboureau N, Habas R, Ricci S, Peng M, Guo A, Xie J, Li Y, Ding Z, Yan Y, Tie W, D'Hont A, Hu W, Jin Z. Musa balbisiana genome reveals subgenome evolution and functional divergence. NATURE PLANTS 2019; 5:810-821. [PMID: 31308504 PMCID: PMC6784884 DOI: 10.1038/s41477-019-0452-6] [Citation(s) in RCA: 89] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2018] [Accepted: 05/20/2019] [Indexed: 05/19/2023]
Abstract
Banana cultivars (Musa ssp.) are diploid, triploid and tetraploid hybrids derived from Musa acuminata and Musa balbisiana. We presented a high-quality draft genome assembly of M. balbisiana with 430 Mb (87%) assembled into 11 chromosomes. We identified that the recent divergence of M. acuminata (A-genome) and M. balbisiana (B-genome) occurred after lineage-specific whole-genome duplication, and that the B-genome may be more sensitive to the fractionation process compared to the A-genome. Homoeologous exchanges occurred frequently between A- and B-subgenomes in allopolyploids. Genomic variation within progenitors resulted in functional divergence of subgenomes. Global homoeologue expression dominance occurred between subgenomes of the allotriploid. Gene families related to ethylene biosynthesis and starch metabolism exhibited significant expansion at the pathway level and wide homoeologue expression dominance in the B-subgenome of the allotriploid. The independent origin of 1-aminocyclopropane-1-carboxylic acid oxidase (ACO) homoeologue gene pairs and tandem duplication-driven expansion of ACO genes in the B-subgenome contributed to rapid and major ethylene production post-harvest in allotriploid banana fruits. The findings of this study provide greater context for understanding fruit biology, and aid the development of tools for breeding optimal banana cultivars.
Collapse
Affiliation(s)
- Zhuo Wang
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Hongxia Miao
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Juhua Liu
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
- Key Laboratory of Genetic Improvement of Bananas, Hainan province, Haikou Experimental Station, China Academy of Tropical Agricultural Sciences, Haikou, China
| | - Biyu Xu
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | | | - Chunyan Xu
- BGI Genomics, BGI-Shenzhen, Shenzhen, China
| | - Shancen Zhao
- BGI Institute of Applied Agriculture, BGI-Shenzhen, Shenzhen, China
| | | | - Caihong Jia
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Jingyi Wang
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Jianbin Zhang
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Jingyang Li
- Key Laboratory of Genetic Improvement of Bananas, Hainan province, Haikou Experimental Station, China Academy of Tropical Agricultural Sciences, Haikou, China
| | - Yi Xu
- Key Laboratory of Genetic Improvement of Bananas, Hainan province, Haikou Experimental Station, China Academy of Tropical Agricultural Sciences, Haikou, China
| | - Jiashui Wang
- Key Laboratory of Genetic Improvement of Bananas, Hainan province, Haikou Experimental Station, China Academy of Tropical Agricultural Sciences, Haikou, China
| | - Weihong Ma
- Key Laboratory of Genetic Improvement of Bananas, Hainan province, Haikou Experimental Station, China Academy of Tropical Agricultural Sciences, Haikou, China
| | | | - Lili Yu
- BGI Genomics, BGI-Shenzhen, Shenzhen, China
| | - Yulan Yang
- BGI Genomics, BGI-Shenzhen, Shenzhen, China
| | - Chun Liu
- BGI Genomics, BGI-Shenzhen, Shenzhen, China
| | - Yu Guo
- BGI Genomics, BGI-Shenzhen, Shenzhen, China
| | - Silong Sun
- BGI Genomics, BGI-Shenzhen, Shenzhen, China
| | - Franc-Christophe Baurens
- CIRAD, UMR AGAP, Montpellier, France
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | - Guillaume Martin
- CIRAD, UMR AGAP, Montpellier, France
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | - Frederic Salmon
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
- CIRAD, UMR AGAP, Guadeloupe, France
| | - Olivier Garsmeur
- CIRAD, UMR AGAP, Montpellier, France
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | - Nabila Yahiaoui
- CIRAD, UMR AGAP, Montpellier, France
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | - Catherine Hervouet
- CIRAD, UMR AGAP, Montpellier, France
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | | | - Nathalie Laboureau
- CIRAD, UMR BGPI, Montpellier, France
- BGPI, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | - Remy Habas
- CIRAD, UMR BGPI, Montpellier, France
- BGPI, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
| | - Sebastien Ricci
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France
- CIRAD, UMR AGAP, Guadeloupe, France
| | - Ming Peng
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Anping Guo
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Jianghui Xie
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Yin Li
- Waksman Institute of Microbiology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Zehong Ding
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Yan Yan
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Weiwei Tie
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Angélique D'Hont
- CIRAD, UMR AGAP, Montpellier, France.
- AGAP, Univ Montpellier, CIRAD, INRA, Montpellier SupAgro, Montpellier, France.
| | - Wei Hu
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China.
| | - Zhiqiang Jin
- Key Laboratory of Biology and Genetic Resources of Tropical Crops, Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou, China.
- Key Laboratory of Genetic Improvement of Bananas, Hainan province, Haikou Experimental Station, China Academy of Tropical Agricultural Sciences, Haikou, China.
| |
Collapse
|
72
|
Genome assembly of a tropical maize inbred line provides insights into structural variation and crop improvement. Nat Genet 2019; 51:1052-1059. [DOI: 10.1038/s41588-019-0427-6] [Citation(s) in RCA: 133] [Impact Index Per Article: 26.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Accepted: 04/25/2019] [Indexed: 01/15/2023]
|
73
|
Balfourier F, Bouchet S, Robert S, De Oliveira R, Rimbert H, Kitt J, Choulet F, Paux E. Worldwide phylogeography and history of wheat genetic diversity. SCIENCE ADVANCES 2019; 5:eaav0536. [PMID: 31149630 PMCID: PMC6541461 DOI: 10.1126/sciadv.aav0536] [Citation(s) in RCA: 79] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Accepted: 04/22/2019] [Indexed: 05/05/2023]
Abstract
Since its domestication in the Fertile Crescent ~8000 to 10,000 years ago, wheat has undergone a complex history of spread, adaptation, and selection. To get better insights into the wheat phylogeography and genetic diversity, we describe allele distribution through time using a set of 4506 landraces and cultivars originating from 105 different countries genotyped with a high-density single-nucleotide polymorphism array. Although the genetic structure of landraces is collinear to ancient human migration roads, we observe a reshuffling through time, related to breeding programs, with the appearance of new alleles enriched with structural variations that may be the signature of introgressions from wild relatives after 1960.
Collapse
Affiliation(s)
- François Balfourier
- GDEC, INRA, Université Clermont Auvergne, 5 chemin de Beaulieu, 63000 Clermont-Ferrand, France
- Corresponding author. (F.B.); (E.P.)
| | - Sophie Bouchet
- GDEC, INRA, Université Clermont Auvergne, 5 chemin de Beaulieu, 63000 Clermont-Ferrand, France
| | - Sandra Robert
- GDEC, INRA, Université Clermont Auvergne, 5 chemin de Beaulieu, 63000 Clermont-Ferrand, France
| | - Romain De Oliveira
- GDEC, INRA, Université Clermont Auvergne, 5 chemin de Beaulieu, 63000 Clermont-Ferrand, France
| | - Hélène Rimbert
- GDEC, INRA, Université Clermont Auvergne, 5 chemin de Beaulieu, 63000 Clermont-Ferrand, France
| | - Jonathan Kitt
- GDEC, INRA, Université Clermont Auvergne, 5 chemin de Beaulieu, 63000 Clermont-Ferrand, France
| | - Frédéric Choulet
- GDEC, INRA, Université Clermont Auvergne, 5 chemin de Beaulieu, 63000 Clermont-Ferrand, France
| | | | | | - Etienne Paux
- GDEC, INRA, Université Clermont Auvergne, 5 chemin de Beaulieu, 63000 Clermont-Ferrand, France
- Corresponding author. (F.B.); (E.P.)
| |
Collapse
|
74
|
Fuentes RR, Chebotarov D, Duitama J, Smith S, De la Hoz JF, Mohiyuddin M, Wing RA, McNally KL, Tatarinova T, Grigoriev A, Mauleon R, Alexandrov N. Structural variants in 3000 rice genomes. Genome Res 2019; 29:870-880. [PMID: 30992303 PMCID: PMC6499320 DOI: 10.1101/gr.241240.118] [Citation(s) in RCA: 83] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2018] [Accepted: 03/11/2019] [Indexed: 12/24/2022]
Abstract
Investigation of large structural variants (SVs) is a challenging yet important task in understanding trait differences in highly repetitive genomes. Combining different bioinformatic approaches for SV detection, we analyzed whole-genome sequencing data from 3000 rice genomes and identified 63 million individual SV calls that grouped into 1.5 million allelic variants. We found enrichment of long SVs in promoters and an excess of shorter variants in 5′ UTRs. Across the rice genomes, we identified regions of high SV frequency enriched in stress response genes. We demonstrated how SVs may help in finding causative variants in genome-wide association analysis. These new insights into rice genome biology are valuable for understanding the effects SVs have on gene function, with the prospect of identifying novel agronomically important alleles that can be utilized to improve cultivated rice.
Collapse
Affiliation(s)
- Roven Rommel Fuentes
- International Rice Research Institute, Laguna 4031, Philippines.,Bioinformatics Group, Wageningen University and Research, 6708 PB Wageningen, the Netherlands
| | | | - Jorge Duitama
- Systems and Computing Engineering Department, Universidad de Los Andes, Bogotá 111711, Colombia.,Agrobiodiversity Research Area, International Center for Tropical Agriculture (CIAT), Cali 6713, Colombia
| | - Sean Smith
- Biology Department, Center for Computational and Integrative Biology, Rutgers University, Camden, New Jersey 08102, USA
| | - Juan Fernando De la Hoz
- Agrobiodiversity Research Area, International Center for Tropical Agriculture (CIAT), Cali 6713, Colombia
| | | | - Rod A Wing
- International Rice Research Institute, Laguna 4031, Philippines.,Arizona Genomics Institute, University of Arizona, Tucson, Arizona 85721, USA.,King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia
| | | | - Tatiana Tatarinova
- Department of Biology, University of La Verne, La Verne, California 91750, USA.,Vavilov Institute of General Genetics, Moscow 119333, Russia.,A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127051, Russia.,Laboratory of Forest Genomics, Siberian Federal University, Krasnoyarsk 660041, Russia
| | - Andrey Grigoriev
- Biology Department, Center for Computational and Integrative Biology, Rutgers University, Camden, New Jersey 08102, USA
| | - Ramil Mauleon
- International Rice Research Institute, Laguna 4031, Philippines
| | | |
Collapse
|
75
|
Gabur I, Chawla HS, Snowdon RJ, Parkin IAP. Connecting genome structural variation with complex traits in crop plants. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:733-750. [PMID: 30448864 DOI: 10.1007/s00122-018-3233-0] [Citation(s) in RCA: 64] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2018] [Accepted: 11/07/2018] [Indexed: 05/05/2023]
Abstract
Structural genome variation is a major determinant of useful trait diversity. We describe how genome analysis methods are enabling discovery of trait-associated structural variants and their potential impact on breeding. As our understanding of complex crop genomes continues to grow, there is growing evidence that structural genome variation plays a major role in determining traits important for breeding and agriculture. Identifying the extent and impact of structural variants in crop genomes is becoming increasingly feasible with ongoing advances in the sophistication of genome sequencing technologies, particularly as it becomes easier to generate accurate long sequence reads on a genome-wide scale. In this article, we discuss the origins of structural genome variation in crops from ancient and recent genome duplication and polyploidization events and review high-throughput methods to assay such variants in crop populations in order to find associations with phenotypic traits. There is increasing evidence from such studies that gene presence-absence and copy number variation resulting from segmental chromosome exchanges may be at the heart of adaptive variation of crops to counter abiotic and biotic stress factors. We present examples from major crops that demonstrate the potential of pangenomic diversity as a key resource for future plant breeding for resilience and sustainability.
Collapse
Affiliation(s)
- Iulian Gabur
- Department of Plant Breeding, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
| | - Harmeet Singh Chawla
- Department of Plant Breeding, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany
| | - Rod J Snowdon
- Department of Plant Breeding, Justus Liebig University, Heinrich-Buff-Ring 26-32, 35392, Giessen, Germany.
| | - Isobel A P Parkin
- Agriculture and Agri-Food Canada, 107 Science Place, Saskatoon, SK, S7N OX2, Canada
| |
Collapse
|
76
|
Sánchez-Martín J, Keller B. Contribution of recent technological advances to future resistance breeding. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:713-732. [PMID: 30756126 DOI: 10.1007/s00122-019-03297-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Accepted: 02/02/2019] [Indexed: 05/23/2023]
Abstract
The development of durable host resistance strategies to control crop diseases is a primary need for sustainable agricultural production in the future. This article highlights the potential of recent progress in the understanding of host resistance for future cereal breeding. Much of the novel work is based on advancements in large-scale sequencing and genomics, rapid gene isolation techniques and high-throughput molecular marker technologies. Moreover, emerging applications on the pathogen side like effector identification or field pathogenomics are discussed. The combination of knowledge from both sides of cereal pathosystems will result in new approaches for resistance breeding. We describe future applications and innovative strategies to implement effective and durable strategies to combat diseases of major cereal crops while reducing pesticide dependency.
Collapse
Affiliation(s)
- Javier Sánchez-Martín
- Department of Plant and Microbial Biology, University of Zürich, Zollikerstrasse 107, 8008, Zurich, Switzerland.
| | - Beat Keller
- Department of Plant and Microbial Biology, University of Zürich, Zollikerstrasse 107, 8008, Zurich, Switzerland
| |
Collapse
|
77
|
Lyra DH, Galli G, Alves FC, Granato ÍSC, Vidotti MS, Bandeira E Sousa M, Morosini JS, Crossa J, Fritsche-Neto R. Modeling copy number variation in the genomic prediction of maize hybrids. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2019; 132:273-288. [PMID: 30382311 DOI: 10.1007/s00122-018-3215-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Accepted: 10/20/2018] [Indexed: 06/08/2023]
Abstract
Our study indicates that copy variants may play an essential role in the phenotypic variation of complex traits in maize hybrids. Moreover, predicting hybrid phenotypes by combining additive-dominance effects with copy variants has the potential to be a viable predictive model. Non-additive effects resulting from the actions of multiple loci may influence trait variation in single-cross hybrids. In addition, complementation of allelic variation could be a valuable contributor to hybrid genetic variation, especially when crossing inbred lines with higher contents of copy gains. With this in mind, we aimed (1) to study the association between copy number variation (CNV) and hybrid phenotype, and (2) to compare the predictive ability (PA) of additive and additive-dominance genomic best linear unbiased prediction model when combined with the effects of CNV in two datasets of maize hybrids (USP and HELIX). In the USP dataset, we observed a significant negative phenotypic correlation of low magnitude between copy number loss and plant height, revealing a tendency that more copy losses lead to lower plants. In the same set, when CNV was combined with the additive plus dominance effects, the PA significantly increased only for plant height under low nitrogen. In this case, CNV effects explicitly capture relatedness between individuals and add extra information to the model. In the HELIX dataset, we observed a pronounced difference in PA between additive (0.50) and additive-dominance (0.71) models for predicting grain yield, suggesting a significant contribution of dominance. We conclude that copy variants may play an essential role in the phenotypic variation of complex traits in maize hybrids, although the inclusion of CNVs into datasets does not return significant gains concerning PA.
Collapse
Affiliation(s)
- Danilo Hottis Lyra
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil.
- Department of Computational and Analytical Sciences, Rothamsted Research, West Common, Harpenden, AL52JQ, UK.
| | - Giovanni Galli
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| | - Filipe Couto Alves
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| | - Ítalo Stefanine Correia Granato
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| | - Miriam Suzane Vidotti
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| | - Massaine Bandeira E Sousa
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| | - Júlia Silva Morosini
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| | - José Crossa
- Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), 06600, Texcoco, D.F, Mexico
| | - Roberto Fritsche-Neto
- Department of Genetics, Luiz de Queiroz College of Agriculture, University of São Paulo (ESALQ/USP), Piracicaba, São Paulo, Brazil
| |
Collapse
|
78
|
Gabur I, Chawla HS, Liu X, Kumar V, Faure S, von Tiedemann A, Jestin C, Dryzska E, Volkmann S, Breuer F, Delourme R, Snowdon R, Obermeier C. Finding invisible quantitative trait loci with missing data. PLANT BIOTECHNOLOGY JOURNAL 2018; 16:2102-2112. [PMID: 29729219 PMCID: PMC6230954 DOI: 10.1111/pbi.12942] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2018] [Revised: 04/26/2018] [Accepted: 04/28/2018] [Indexed: 05/21/2023]
Abstract
Evolutionary processes during plant polyploidization and speciation have led to extensive presence-absence variation (PAV) in crop genomes, and there is increasing evidence that PAV associates with important traits. Today, high-resolution genetic analysis in major crops frequently implements simple, cost-effective, high-throughput genotyping from single nucleotide polymorphism (SNP) hybridization arrays; however, these are normally not designed to distinguish PAV from failed SNP calls caused by hybridization artefacts. Here, we describe a strategy to recover valuable information from single nucleotide absence polymorphisms (SNaPs) by population-based quality filtering of SNP hybridization data to distinguish patterns associated with genuine deletions from those caused by technical failures. We reveal that including SNaPs in genetic analyses elucidate segregation of small to large-scale structural variants in nested association mapping populations of oilseed rape (Brassica napus), a recent polyploid crop with widespread structural variation. Including SNaP markers in genomewide association studies identified numerous quantitative trait loci, invisible using SNP markers alone, for resistance to two major fungal diseases of oilseed rape, Sclerotinia stem rot and blackleg disease. Our results indicate that PAV has a strong influence on quantitative disease resistance in B. napus and that SNaP analysis using cost-effective SNP array data can provide extensive added value from 'missing data'. This strategy might also be applicable for improving the precision of genetic mapping in many important crop species.
Collapse
Affiliation(s)
- Iulian Gabur
- Department of Plant BreedingJustus Liebig UniversityGiessenGermany
| | | | - Xiwei Liu
- Department of Plant BreedingJustus Liebig UniversityGiessenGermany
| | - Vinod Kumar
- IGEPP, INRA, AGROCAMPUS OUESTUniv RennesLe RheuFrance
| | | | - Andreas von Tiedemann
- Section of General Plant Pathology and Crop ProtectionGeorg August UniversityGöttingenGermany
| | | | | | | | | | | | - Rod Snowdon
- Department of Plant BreedingJustus Liebig UniversityGiessenGermany
| | | |
Collapse
|
79
|
A Megabase-Scale Deletion is Associated with Phenotypic Variation of Multiple Traits in Maize. Genetics 2018; 211:305-316. [PMID: 30389804 PMCID: PMC6325712 DOI: 10.1534/genetics.118.301567] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2018] [Accepted: 10/26/2018] [Indexed: 11/18/2022] Open
Abstract
Genomic deletions are pervasive in the maize (Zea mays L.) genome, and play important roles in phenotypic variation and adaptive evolution. However, little is known about the biological functions of these genomic deletions. Here, we report the biological function of a megabase-scale deletion, which we identified by position-based cloning of the multi-trait weakened (muw) mutant, which is inherited as a single recessive locus. MUW was mapped to a 5.16-Mb region on chromosome 2. The 5.16-Mb deletion in the muw mutant led to the loss of 48 genes and was responsible for a set of phenotypic abnormities, including wilting leaves, poor yield performance, reduced plant height, increased stomatal density, and rapid water loss. While muw appears to have resulted from double-stranded break repair that was not dependent on intragenomic DNA homology, extensive duplication of maize genes may have mitigated its effects and facilitated its survival.
Collapse
|
80
|
Lee T, Kim MY, Ha J, Lee SH. Detection of large sequence insertions by a hybrid approach that combine de novo assembly and resequencing of medium-coverage genome sequences. Genome 2018; 61:745-754. [PMID: 30227080 DOI: 10.1139/gen-2018-0027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2023]
Abstract
Large sequence insertion (LSI) is one of the structural variations (SVs) that may cause phenotypic differences in plants. To identify the LSIs using medium-coverage sequencing data of four wild soybean (Glycine soja) genotypes, we designed a hybrid approach combining de novo assembly and read mapping. Total reads and reads with both ends unmapped were independently assembled into "ordinary contigs" and "orphan contigs", respectively, and subjected to pairwise alignment and stringent filtering. This approach predicted 24 LSIs averaging 2682 bp in size, with no overlap with SVs detected by Pindel, BreakDancer, or ScanIndel, and they were validated by PCR. Compared with the soybean (Glycine max) reference genome, 20 LSIs were located outside genic regions. One of the four LSIs within a genic region, LSI05, is located in the coding DNA sequence region of a protein kinase superfamily gene (Glyma.08G123500). It caused delayed translation initiation and loss of 24 amino acids in the wild soybean genotype CW12. LSI05 was more frequently observed in 29 G. soja accessions than in 34 G. max accessions. Identified LSIs would be genomic resources harboring novel gene contents for studying SVs and improving crops. Moreover, our cost-efficient approach may be applicable to other plant species.
Collapse
Affiliation(s)
- Taeyoung Lee
- a Department of Plant Science and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul 08826, Republic of Korea
| | - Moon Young Kim
- a Department of Plant Science and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul 08826, Republic of Korea
- b Plant Genomics and Breeding Institute, Seoul National University, Seoul 08826, Republic of Korea
| | - Jungmin Ha
- a Department of Plant Science and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul 08826, Republic of Korea
- b Plant Genomics and Breeding Institute, Seoul National University, Seoul 08826, Republic of Korea
| | - Suk-Ha Lee
- a Department of Plant Science and Research Institute of Agriculture and Life Sciences, Seoul National University, Seoul 08826, Republic of Korea
- b Plant Genomics and Breeding Institute, Seoul National University, Seoul 08826, Republic of Korea
| |
Collapse
|
81
|
Ott A, Schnable JC, Yeh CT, Wu L, Liu C, Hu HC, Dalgard CL, Sarkar S, Schnable PS. Linked read technology for assembling large complex and polyploid genomes. BMC Genomics 2018; 19:651. [PMID: 30180802 PMCID: PMC6122573 DOI: 10.1186/s12864-018-5040-z] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2017] [Accepted: 08/27/2018] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Short read DNA sequencing technologies have revolutionized genome assembly by providing high accuracy and throughput data at low cost. But it remains challenging to assemble short read data, particularly for large, complex and polyploid genomes. The linked read strategy has the potential to enhance the value of short reads for genome assembly because all reads originating from a single long molecule of DNA share a common barcode. However, the majority of studies to date that have employed linked reads were focused on human haplotype phasing and genome assembly. RESULTS Here we describe a de novo maize B73 genome assembly generated via linked read technology which contains ~ 172,000 scaffolds with an N50 of 89 kb that cover 50% of the genome. Based on comparisons to the B73 reference genome, 91% of linked read contigs are accurately assembled. Because it was possible to identify errors with > 76% accuracy using machine learning, it may be possible to identify and potentially correct systematic errors. Complex polyploids represent one of the last grand challenges in genome assembly. Linked read technology was able to successfully resolve the two subgenomes of the recent allopolyploid, proso millet (Panicum miliaceum). Our assembly covers ~ 83% of the 1 Gb genome and consists of 30,819 scaffolds with an N50 of 912 kb. CONCLUSIONS Our analysis provides a framework for future de novo genome assemblies using linked reads, and we suggest computational strategies that if implemented have the potential to further improve linked read assemblies, particularly for repetitive genomes.
Collapse
Affiliation(s)
- Alina Ott
- Department of Agronomy, Iowa State University, Ames, IA 50011 USA
- Present address: Roche Sequencing Solutions, 500 S Rosa Road, Madison, WI 53719 USA
| | - James C. Schnable
- Department of Agriculture and Horticulture, University of Nebraska-Lincoln, Lincoln, NE 68588 USA
- Data2Bio LLC, 2079 Roy J Carver Co-Laboratory, 1111 WOI Rd, Ames, IA 50011 USA
- Dryland Genetics LLC, 2073 Roy J Carver Co-Laboratory, 1111 WOI Rd, Ames, IA 50011 USA
| | - Cheng-Ting Yeh
- Department of Agronomy, Iowa State University, Ames, IA 50011 USA
- Data2Bio LLC, 2079 Roy J Carver Co-Laboratory, 1111 WOI Rd, Ames, IA 50011 USA
| | - Linjiang Wu
- Department of Mechanical Engineering, Iowa State University, Ames, IA 50011 USA
| | - Chao Liu
- Department of Mechanical Engineering, Iowa State University, Ames, IA 50011 USA
- Present address: Department of Thermal Engineering, Tsinghua University, Beijing, 100084 China
| | - Heng-Cheng Hu
- The American Genome Center, Uniformed Services University of the Health Sciences, Bethesda, MD 20814 USA
- Collaborative Health Initiative Research Program (CHIRP), Uniformed Services University School of Medicine, Uniformed Services University of the Health Sciences, Bethesda, MD 20814 USA
- Present address: Qiagen Sciences Inc, 6951 Executive Way, Frederick, MD 21703 USA
| | - Clifton L. Dalgard
- The American Genome Center, Uniformed Services University of the Health Sciences, Bethesda, MD 20814 USA
- Collaborative Health Initiative Research Program (CHIRP), Uniformed Services University School of Medicine, Uniformed Services University of the Health Sciences, Bethesda, MD 20814 USA
- Department of Anatomy, Physiology and Genetics, Uniformed Services University School of Medicine, Uniformed Services University of the Health Sciences, Bethesda, MD 20814 USA
| | - Soumik Sarkar
- Department of Mechanical Engineering, Iowa State University, Ames, IA 50011 USA
| | - Patrick S. Schnable
- Department of Agronomy, Iowa State University, Ames, IA 50011 USA
- Data2Bio LLC, 2079 Roy J Carver Co-Laboratory, 1111 WOI Rd, Ames, IA 50011 USA
- Dryland Genetics LLC, 2073 Roy J Carver Co-Laboratory, 1111 WOI Rd, Ames, IA 50011 USA
| |
Collapse
|
82
|
Mutator-Based Transposon Display: A Genetic Tool for Evolutionary and Crop-Improvement Studies in Maize. Mol Biotechnol 2018; 60:799-809. [DOI: 10.1007/s12033-018-0118-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
|
83
|
Thind AK, Wicker T, Müller T, Ackermann PM, Steuernagel B, Wulff BBH, Spannagl M, Twardziok SO, Felder M, Lux T, Mayer KFX, Keller B, Krattinger SG. Chromosome-scale comparative sequence analysis unravels molecular mechanisms of genome dynamics between two wheat cultivars. Genome Biol 2018; 19:104. [PMID: 30115097 PMCID: PMC6097286 DOI: 10.1186/s13059-018-1477-2] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2018] [Accepted: 07/10/2018] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Recent improvements in DNA sequencing and genome scaffolding have paved the way to generate high-quality de novo assemblies of pseudomolecules representing complete chromosomes of wheat and its wild relatives. These assemblies form the basis to compare the dynamics of wheat genomes on a megabase scale. RESULTS Here, we provide a comparative sequence analysis of the 700-megabase chromosome 2D between two bread wheat genotypes-the old landrace Chinese Spring and the elite Swiss spring wheat line 'CH Campala Lr22a'. Both chromosomes were assembled into megabase-sized scaffolds. There is a high degree of sequence conservation between the two chromosomes. Analysis of large structural variations reveals four large indels of more than 100 kb. Based on the molecular signatures at the breakpoints, unequal crossing over and double-strand break repair were identified as the molecular mechanisms that caused these indels. Three of the large indels affect copy number of NLRs, a gene family involved in plant immunity. Analysis of SNP density reveals four haploblocks of 4, 8, 9 and 48 Mb with a 35-fold increased SNP density compared to the rest of the chromosome. Gene content across the two chromosomes was highly conserved. Ninety-nine percent of the genic sequences were present in both genotypes and the fraction of unique genes ranged from 0.4 to 0.7%. CONCLUSIONS This comparative analysis of two high-quality chromosome assemblies enabled a comprehensive assessment of large structural variations and gene content. The insight obtained from this analysis will form the basis of future wheat pan-genome studies.
Collapse
Affiliation(s)
- Anupriya Kaur Thind
- Department of Plant and Microbial Biology, University of Zurich, Zollikerstrasse 107, Zurich, Switzerland
| | - Thomas Wicker
- Department of Plant and Microbial Biology, University of Zurich, Zollikerstrasse 107, Zurich, Switzerland
| | - Thomas Müller
- Department of Plant and Microbial Biology, University of Zurich, Zollikerstrasse 107, Zurich, Switzerland
| | - Patrick M Ackermann
- Department of Plant and Microbial Biology, University of Zurich, Zollikerstrasse 107, Zurich, Switzerland
| | | | | | | | | | | | - Thomas Lux
- Helmholtz Zentrum Munich, Munich, Germany
| | - Klaus F X Mayer
- Helmholtz Zentrum Munich, Munich, Germany
- School of Life Sciences, Technical University Munich, Munich, Germany
- College of Science, King Saud University, Riad, Kingdom of Saudi Arabia
| | - Beat Keller
- Department of Plant and Microbial Biology, University of Zurich, Zollikerstrasse 107, Zurich, Switzerland
| | - Simon G Krattinger
- Department of Plant and Microbial Biology, University of Zurich, Zollikerstrasse 107, Zurich, Switzerland.
- King Abdullah University of Science and Technology, Thuwal, Kingdom of Saudi Arabia.
| |
Collapse
|
84
|
Torkamaneh D, Boyle B, Belzile F. Efficient genome-wide genotyping strategies and data integration in crop plants. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2018; 131:499-511. [PMID: 29352324 DOI: 10.1007/s00122-018-3056-z] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/04/2017] [Accepted: 01/12/2018] [Indexed: 05/21/2023]
Abstract
Next-generation sequencing (NGS) has revolutionized plant and animal research by providing powerful genotyping methods. This review describes and discusses the advantages, challenges and, most importantly, solutions to facilitate data processing, the handling of missing data, and cross-platform data integration. Next-generation sequencing technologies provide powerful and flexible genotyping methods to plant breeders and researchers. These methods offer a wide range of applications from genome-wide analysis to routine screening with a high level of accuracy and reproducibility. Furthermore, they provide a straightforward workflow to identify, validate, and screen genetic variants in a short time with a low cost. NGS-based genotyping methods include whole-genome re-sequencing, SNP arrays, and reduced representation sequencing, which are widely applied in crops. The main challenges facing breeders and geneticists today is how to choose an appropriate genotyping method and how to integrate genotyping data sets obtained from various sources. Here, we review and discuss the advantages and challenges of several NGS methods for genome-wide genetic marker development and genotyping in crop plants. We also discuss how imputation methods can be used to both fill in missing data in genotypic data sets and to integrate data sets obtained using different genotyping tools. It is our hope that this synthetic view of genotyping methods will help geneticists and breeders to integrate these NGS-based methods in crop plant breeding and research.
Collapse
Affiliation(s)
- Davoud Torkamaneh
- Département de Phytologie, Université Laval, Québec City, QC, Canada
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, QC, Canada
| | - Brian Boyle
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, QC, Canada
| | - François Belzile
- Département de Phytologie, Université Laval, Québec City, QC, Canada.
- Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec City, QC, Canada.
| |
Collapse
|
85
|
Darracq A, Vitte C, Nicolas S, Duarte J, Pichon JP, Mary-Huard T, Chevalier C, Bérard A, Le Paslier MC, Rogowsky P, Charcosset A, Joets J. Sequence analysis of European maize inbred line F2 provides new insights into molecular and chromosomal characteristics of presence/absence variants. BMC Genomics 2018; 19:119. [PMID: 29402214 PMCID: PMC5800051 DOI: 10.1186/s12864-018-4490-7] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2017] [Accepted: 01/22/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Maize is well known for its exceptional structural diversity, including copy number variants (CNVs) and presence/absence variants (PAVs), and there is growing evidence for the role of structural variation in maize adaptation. While PAVs have been described in this important crop species, they have been only scarcely characterized at the sequence level and the extent of presence/absence variation and relative chromosomal landscape of inbred-specific regions remain to be elucidated. RESULTS De novo genome sequencing of the French F2 maize inbred line revealed 10,044 novel genomic regions larger than 1 kb, making up 88 Mb of DNA, that are present in F2 but not in B73 (PAV). This set of maize PAV sequences allowed us to annotate PAV content and to analyze sequence breakpoints. Using PAV genotyping on a collection of 25 temperate lines, we also analyzed Linkage Disequilibrium in PAVs and flanking regions, and PAV frequencies within maize genetic groups. CONCLUSIONS We highlight the possible role of MMEJ-type double strand break repair in maize PAV formation and discover 395 new genes with transcriptional support. Pattern of linkage disequilibrium within PAVs strikingly differs from this of flanking regions and is in accordance with the intuition that PAVs may recombine less than other genomic regions. We show that most PAVs are ancient, while some are found only in European Flint material, thus pinpointing structural features that may be at the origin of adaptive traits involved in the success of this material. Characterization of such PAVs will provide useful material for further association genetic studies in European and temperate maize.
Collapse
Affiliation(s)
- Aude Darracq
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Clémentine Vitte
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Stéphane Nicolas
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | | | | | - Tristan Mary-Huard
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
- MIA, INRA, AgroParisTech, Université Paris-Saclay, Paris, France
| | - Céline Chevalier
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Aurélie Bérard
- EPGV US 1279, INRA, CEA, IG-CNG, Université Paris-Saclay, Evry, France
| | | | - Peter Rogowsky
- Laboratoire Reproduction et Développement des Plantes, Univ Lyon, ENS de Lyon, UCB Lyon 1, CNRS, INRA, Lyon, France
| | - Alain Charcosset
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Johann Joets
- Genetique Quantitative et Evolution – Le Moulon, INRA, Université Paris-Sud, CNRS, AgroParisTech, Université Paris-Saclay, Gif-sur-Yvette, France
| |
Collapse
|
86
|
High throughput SNP discovery and genotyping in hexaploid wheat. PLoS One 2018; 13:e0186329. [PMID: 29293495 PMCID: PMC5749704 DOI: 10.1371/journal.pone.0186329] [Citation(s) in RCA: 108] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2017] [Accepted: 09/13/2017] [Indexed: 12/03/2022] Open
Abstract
Because of their abundance and their amenability to high-throughput genotyping techniques, Single Nucleotide Polymorphisms (SNPs) are powerful tools for efficient genetics and genomics studies, including characterization of genetic resources, genome-wide association studies and genomic selection. In wheat, most of the previous SNP discovery initiatives targeted the coding fraction, leaving almost 98% of the wheat genome largely unexploited. Here we report on the use of whole-genome resequencing data from eight wheat lines to mine for SNPs in the genic, the repetitive and non-repetitive intergenic fractions of the wheat genome. Eventually, we identified 3.3 million SNPs, 49% being located on the B-genome, 41% on the A-genome and 10% on the D-genome. We also describe the development of the TaBW280K high-throughput genotyping array containing 280,226 SNPs. Performance of this chip was examined by genotyping a set of 96 wheat accessions representing the worldwide diversity. Sixty-nine percent of the SNPs can be efficiently scored, half of them showing a diploid-like clustering. The TaBW280K was proven to be a very efficient tool for diversity analyses, as well as for breeding as it can discriminate between closely related elite varieties. Finally, the TaBW280K array was used to genotype a population derived from a cross between Chinese Spring and Renan, leading to the construction a dense genetic map comprising 83,721 markers. The results described here will provide the wheat community with powerful tools for both basic and applied research.
Collapse
|
87
|
Dolatabadian A, Patel DA, Edwards D, Batley J. Copy number variation and disease resistance in plants. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2017; 130:2479-2490. [PMID: 29043379 DOI: 10.1007/s00122-017-2993-2] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2017] [Accepted: 09/27/2017] [Indexed: 05/06/2023]
Abstract
Plant genome diversity varies from single nucleotide polymorphisms to large-scale deletions, insertions, duplications, or re-arrangements. These re-arrangements of sequences resulting from duplication, gains or losses of DNA segments are termed copy number variations (CNVs). During the last decade, numerous studies have emphasized the importance of CNVs as a factor affecting human phenotype; in particular, CNVs have been associated with risks for several severe diseases. In plants, the exploration of the extent and role of CNVs in resistance against pathogens and pests is just beginning. Since CNVs are likely to be associated with disease resistance in plants, an understanding of the distribution of CNVs could assist in the identification of novel plant disease-resistance genes. In this paper, we review existing information about CNVs; their importance, role and function, as well as their association with disease resistance in plants.
Collapse
Affiliation(s)
- Aria Dolatabadian
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Crawley, WA, 6009, Australia
| | - Dhwani Apurva Patel
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Crawley, WA, 6009, Australia
| | - David Edwards
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Crawley, WA, 6009, Australia
| | - Jacqueline Batley
- School of Biological Sciences and Institute of Agriculture, University of Western Australia, Crawley, WA, 6009, Australia.
| |
Collapse
|
88
|
Bayer PE, Hurgobin B, Golicz AA, Chan CK, Yuan Y, Lee H, Renton M, Meng J, Li R, Long Y, Zou J, Bancroft I, Chalhoub B, King GJ, Batley J, Edwards D. Assembly and comparison of two closely related Brassica napus genomes. PLANT BIOTECHNOLOGY JOURNAL 2017; 15:1602-1610. [PMID: 28403535 PMCID: PMC5698052 DOI: 10.1111/pbi.12742] [Citation(s) in RCA: 89] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2016] [Revised: 02/06/2017] [Accepted: 04/09/2017] [Indexed: 05/18/2023]
Abstract
As an increasing number of plant genome sequences become available, it is clear that gene content varies between individuals, and the challenge arises to predict the gene content of a species. However, genome comparison is often confounded by variation in assembly and annotation. Differentiating between true gene absence and variation in assembly or annotation is essential for the accurate identification of conserved and variable genes in a species. Here, we present the de novo assembly of the B. napus cultivar Tapidor and comparison with an improved assembly of the Brassica napus cultivar Darmor-bzh. Both cultivars were annotated using the same method to allow comparison of gene content. We identified genes unique to each cultivar and differentiate these from artefacts due to variation in the assembly and annotation. We demonstrate that using a common annotation pipeline can result in different gene predictions, even for closely related cultivars, and repeat regions which collapse during assembly impact whole genome comparison. After accounting for differences in assembly and annotation, we demonstrate that the genome of Darmor-bzh contains a greater number of genes than the genome of Tapidor. Our results are the first step towards comparison of the true differences between B. napus genomes and highlight the potential sources of error in future production of a B. napus pangenome.
Collapse
Affiliation(s)
- Philipp E. Bayer
- School of Biological SciencesThe University of Western AustraliaCrawleyWAAustralia
| | - Bhavna Hurgobin
- School of Biological SciencesThe University of Western AustraliaCrawleyWAAustralia
- School of Agriculture and Food SciencesUniversity of QueenslandSt. LuciaQldAustralia
| | - Agnieszka A. Golicz
- Plant Molecular Biology and Biotechnology LaboratoryFaculty of Veterinary and Agricultural SciencesUniversity of Melbourne, ParkvilleMelbourneVic.Australia
| | | | - Yuxuan Yuan
- School of Biological SciencesThe University of Western AustraliaCrawleyWAAustralia
| | - HueyTyng Lee
- School of Biological SciencesThe University of Western AustraliaCrawleyWAAustralia
- School of Agriculture and Food SciencesUniversity of QueenslandSt. LuciaQldAustralia
| | - Michael Renton
- School of Biological SciencesThe University of Western AustraliaCrawleyWAAustralia
- School of Agriculture and EnvironmentThe University of Western AustraliaCrawleyWAAustralia
| | - Jinling Meng
- National Key Laboratory of Crop Genetic ImprovementKey Laboratory of Rapeseed Genetic ImprovementMinistry of Agriculture P. R. ChinaHuazhong Agricultural UniversityWuhanChina
| | - Ruiyuan Li
- National Key Laboratory of Crop Genetic ImprovementKey Laboratory of Rapeseed Genetic ImprovementMinistry of Agriculture P. R. ChinaHuazhong Agricultural UniversityWuhanChina
| | - Yan Long
- National Key Laboratory of Crop Genetic ImprovementKey Laboratory of Rapeseed Genetic ImprovementMinistry of Agriculture P. R. ChinaHuazhong Agricultural UniversityWuhanChina
| | - Jun Zou
- National Key Laboratory of Crop Genetic ImprovementKey Laboratory of Rapeseed Genetic ImprovementMinistry of Agriculture P. R. ChinaHuazhong Agricultural UniversityWuhanChina
| | | | - Boulos Chalhoub
- Organization and Evolution of Complex Genomes (OECG)Institut National de la Recherche agronomique (INRA)Université d'Evry Val d'Essonne (UEVE)EvryFrance
- Institute of System and Synthetic Biology, GenopoleCentre National de la Recherche ScientifiqueUniversité d'Evry Val d'EssonneUniversité Paris‐SaclayEvryFrance
| | - Graham J. King
- National Key Laboratory of Crop Genetic ImprovementKey Laboratory of Rapeseed Genetic ImprovementMinistry of Agriculture P. R. ChinaHuazhong Agricultural UniversityWuhanChina
- Southern Cross Plant ScienceSouthern Cross UniversityLismoreNSWAustralia
| | - Jacqueline Batley
- School of Biological SciencesThe University of Western AustraliaCrawleyWAAustralia
| | - David Edwards
- School of Biological SciencesThe University of Western AustraliaCrawleyWAAustralia
| |
Collapse
|
89
|
Hu X, Wang H, Li K, Wu Y, Liu Z, Huang C. Genome-wide proteomic profiling reveals the role of dominance protein expression in heterosis in immature maize ears. Sci Rep 2017; 7:16130. [PMID: 29170427 PMCID: PMC5700959 DOI: 10.1038/s41598-017-15985-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2017] [Accepted: 11/06/2017] [Indexed: 01/02/2023] Open
Abstract
Heterosis refers to the phenomenon in which hybrid progeny show superior performance relative to their parents. Early maize ear development shows strong heterosis in ear architecture traits and greatly affects grain yield. To explore the underlying molecular mechanisms, genome-wide proteomics of immature ears of maize hybrid ZD909 and its parents were analyzed using tandem mass tag (TMT) technology. A total of 9,713 proteins were identified in all three genotypes. Among them, 3,752 (38.6%) proteins were differentially expressed between ZD909 and its parents. Multiple modes of protein action were discovered in the hybrid, while dominance expression patterns accounted for 63.6% of the total differentially expressed proteins (DEPs). Protein pathway enrichment analysis revealed that high parent dominance proteins mainly participated in carbon metabolism and nitrogen assimilation processes. Our results suggested that the dominant expression of favorable alleles related to C/N metabolism in the hybrid may be essential for ZD909 ear growth and heterosis formation. Integrated analysis of proteomic and quantitative trait locus (QTL) data further support our DEP identification and provide useful information for the discovery of genes associated with ear development. Our study provides comprehensive insight into the molecular mechanisms underlying heterosis in immature maize ears from a proteomic perspective.
Collapse
Affiliation(s)
- Xiaojiao Hu
- Institute of Crop Science, Chinese Academy of Agricultural Sciences, National Engineering Laboratory for Crop Molecular Breeding, Beijing, 100081, China
| | - Hongwu Wang
- Institute of Crop Science, Chinese Academy of Agricultural Sciences, National Engineering Laboratory for Crop Molecular Breeding, Beijing, 100081, China
| | - Kun Li
- Institute of Crop Science, Chinese Academy of Agricultural Sciences, National Engineering Laboratory for Crop Molecular Breeding, Beijing, 100081, China
| | - Yujin Wu
- Institute of Crop Science, Chinese Academy of Agricultural Sciences, National Engineering Laboratory for Crop Molecular Breeding, Beijing, 100081, China
| | - Zhifang Liu
- Institute of Crop Science, Chinese Academy of Agricultural Sciences, National Engineering Laboratory for Crop Molecular Breeding, Beijing, 100081, China.
| | - Changling Huang
- Institute of Crop Science, Chinese Academy of Agricultural Sciences, National Engineering Laboratory for Crop Molecular Breeding, Beijing, 100081, China.
| |
Collapse
|
90
|
Neik TX, Barbetti MJ, Batley J. Current Status and Challenges in Identifying Disease Resistance Genes in Brassica napus. FRONTIERS IN PLANT SCIENCE 2017; 8:1788. [PMID: 29163558 PMCID: PMC5681527 DOI: 10.3389/fpls.2017.01788] [Citation(s) in RCA: 44] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2017] [Accepted: 10/02/2017] [Indexed: 05/18/2023]
Abstract
Brassica napus is an economically important crop across different continents including temperate and subtropical regions in Europe, Canada, South Asia, China and Australia. Its widespread cultivation also brings setbacks as it plays host to fungal, oomycete and chytrid pathogens that can lead to serious yield loss. For sustainable crop production, identification of resistance (R) genes in B. napus has become of critical importance. In this review, we discuss four key pathogens affecting Brassica crops: Clubroot (Plasmodiophora brassicae), Blackleg (Leptosphaeria maculans and L. biglobosa), Sclerotinia Stem Rot (Sclerotinia sclerotiorum), and Downy Mildew (Hyaloperonospora parasitica). We first review current studies covering prevalence of these pathogens on Brassica crops and highlight the R genes and QTL that have been identified from Brassica species against these pathogens. Insights into the relationships between the pathogen and its Brassica host, the unique host resistance mechanisms and how these affect resistance outcomes is also presented. We discuss challenges in identification and deployment of R genes in B. napus in relation to highly specific genetic interactions between host subpopulations and pathogen pathotypes and emphasize the need for common or shared techniques and research materials or tighter collaboration between researchers to reconcile the inconsistencies in the research outcomes. Using current genomics tools, we provide examples of how characterization and cloning of R genes in B. napus can be carried out more effectively. Lastly, we put forward strategies to breed resistant cultivars through introgressions supported by genomic approaches and suggest prospects that can be implemented in the future for a better, pathogen-resistant B. napus.
Collapse
Affiliation(s)
- Ting Xiang Neik
- School of Biological Sciences, University of Western Australia, Perth, WA, Australia
| | - Martin J. Barbetti
- School of Agriculture and Environment and Institute of Agriculture, University of Western Australia, Perth, WA, Australia
| | - Jacqueline Batley
- School of Biological Sciences, University of Western Australia, Perth, WA, Australia
| |
Collapse
|
91
|
Rasheed A, Hao Y, Xia X, Khan A, Xu Y, Varshney RK, He Z. Crop Breeding Chips and Genotyping Platforms: Progress, Challenges, and Perspectives. MOLECULAR PLANT 2017; 10:1047-1064. [PMID: 28669791 DOI: 10.1016/j.molp.2017.06.008] [Citation(s) in RCA: 219] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2017] [Revised: 05/29/2017] [Accepted: 06/19/2017] [Indexed: 05/18/2023]
Abstract
There is a rapidly rising trend in the development and application of molecular marker assays for gene mapping and discovery in field crops and trees. Thus far, more than 50 SNP arrays and 15 different types of genotyping-by-sequencing (GBS) platforms have been developed in over 25 crop species and perennial trees. However, much less effort has been made on developing ultra-high-throughput and cost-effective genotyping platforms for applied breeding programs. In this review, we discuss the scientific bottlenecks in existing SNP arrays and GBS technologies and the strategies to develop targeted platforms for crop molecular breeding. We propose that future practical breeding platforms should adopt automated genotyping technologies, either array or sequencing based, target functional polymorphisms underpinning economic traits, and provide desirable prediction accuracy for quantitative traits, with universal applications under wide genetic backgrounds in crops. The development of such platforms faces serious challenges at both the technological level due to cost ineffectiveness, and the knowledge level due to large genotype-phenotype gaps in crop plants. It is expected that such genotyping platforms will be achieved in the next ten years in major crops in consideration of (a) rapid development in gene discovery of important traits, (b) deepened understanding of quantitative traits through new analytical models and population designs, (c) integration of multi-layer -omics data leading to identification of genes and pathways responsible for important breeding traits, and (d) improvement in cost effectiveness of large-scale genotyping. Crop breeding chips and genotyping platforms will provide unprecedented opportunities to accelerate the development of cultivars with desired yield potential, quality, and enhanced adaptation to mitigate the effects of climate change.
Collapse
Affiliation(s)
- Awais Rasheed
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100081, China; International Maize and Wheat Improvement Center (CIMMYT), c/o CAAS, Beijing 100081, China
| | - Yuanfeng Hao
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100081, China
| | - Xianchun Xia
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100081, China
| | - Awais Khan
- Department of Plant Pathology and Plant-Microbe Biology, Cornell University, Geneva, NY, USA
| | - Yunbi Xu
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100081, China; International Maize and Wheat Improvement Center (CIMMYT), c/o CAAS, Beijing 100081, China
| | - Rajeev K Varshney
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru 502324, India
| | - Zhonghu He
- Institute of Crop Sciences, National Wheat Improvement Center, Chinese Academy of Agricultural Sciences (CAAS), Beijing 100081, China; International Maize and Wheat Improvement Center (CIMMYT), c/o CAAS, Beijing 100081, China.
| |
Collapse
|
92
|
Miller JR, Zhou P, Mudge J, Gurtowski J, Lee H, Ramaraj T, Walenz BP, Liu J, Stupar RM, Denny R, Song L, Singh N, Maron LG, McCouch SR, McCombie WR, Schatz MC, Tiffin P, Young ND, Silverstein KAT. Hybrid assembly with long and short reads improves discovery of gene family expansions. BMC Genomics 2017; 18:541. [PMID: 28724409 PMCID: PMC5518131 DOI: 10.1186/s12864-017-3927-8] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2016] [Accepted: 07/06/2017] [Indexed: 11/25/2022] Open
Abstract
Background Long-read and short-read sequencing technologies offer competing advantages for eukaryotic genome sequencing projects. Combinations of both may be appropriate for surveys of within-species genomic variation. Methods We developed a hybrid assembly pipeline called “Alpaca” that can operate on 20X long-read coverage plus about 50X short-insert and 50X long-insert short-read coverage. To preclude collapse of tandem repeats, Alpaca relies on base-call-corrected long reads for contig formation. Results Compared to two other assembly protocols, Alpaca demonstrated the most reference agreement and repeat capture on the rice genome. On three accessions of the model legume Medicago truncatula, Alpaca generated the most agreement to a conspecific reference and predicted tandemly repeated genes absent from the other assemblies. Conclusion Our results suggest Alpaca is a useful tool for investigating structural and copy number variation within de novo assemblies of sampled populations. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3927-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jason R Miller
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA.
| | - Peng Zhou
- Department of Plant Biology, University of Minnesota, Saint Paul, MN, USA
| | - Joann Mudge
- National Center for Genome Resources, Santa Fe, NM, USA
| | | | - Hayan Lee
- Stanford School of Medicine, Stanford, CA, USA
| | | | - Brian P Walenz
- National Human Genome Research Institute, Bethesda, MD, USA
| | - Junqi Liu
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, USA
| | - Robert M Stupar
- Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN, USA
| | - Roxanne Denny
- Department of Plant Pathology, University of Minnesota, St. Paul, MN, USA
| | - Li Song
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Namrata Singh
- School of Integrative Plant Sciences, Plant Breeding and Genetics section, Cornell University, Ithaca, NY, 14850, USA
| | - Lyza G Maron
- School of Integrative Plant Sciences, Plant Breeding and Genetics section, Cornell University, Ithaca, NY, 14850, USA
| | - Susan R McCouch
- School of Integrative Plant Sciences, Plant Breeding and Genetics section, Cornell University, Ithaca, NY, 14850, USA
| | | | - Michael C Schatz
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Peter Tiffin
- Department of Plant Biology, University of Minnesota, Saint Paul, MN, USA
| | - Nevin D Young
- Department of Plant Biology, University of Minnesota, Saint Paul, MN, USA
| | | |
Collapse
|
93
|
Jiao Y, Peluso P, Shi J, Liang T, Stitzer MC, Wang B, Campbell MS, Stein JC, Wei X, Chin CS, Guill K, Regulski M, Kumari S, Olson A, Gent J, Schneider KL, Wolfgruber TK, May MR, Springer NM, Antoniou E, McCombie WR, Presting GG, McMullen M, Ross-Ibarra J, Dawe RK, Hastie A, Rank DR, Ware D. Improved maize reference genome with single-molecule technologies. Nature 2017; 546:524-527. [PMID: 28605751 DOI: 10.1101/079004] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2016] [Accepted: 05/14/2017] [Indexed: 05/21/2023]
Abstract
Complete and accurate reference genomes and annotations provide fundamental tools for characterization of genetic and functional variation. These resources facilitate the determination of biological processes and support translation of research findings into improved and sustainable agricultural technologies. Many reference genomes for crop plants have been generated over the past decade, but these genomes are often fragmented and missing complex repeat regions. Here we report the assembly and annotation of a reference genome of maize, a genetic and agricultural model species, using single-molecule real-time sequencing and high-resolution optical mapping. Relative to the previous reference genome, our assembly features a 52-fold increase in contig length and notable improvements in the assembly of intergenic spaces and centromeres. Characterization of the repetitive portion of the genome revealed more than 130,000 intact transposable elements, allowing us to identify transposable element lineage expansions that are unique to maize. Gene annotations were updated using 111,000 full-length transcripts obtained by single-molecule real-time sequencing. In addition, comparative optical mapping of two other inbred maize lines revealed a prevalence of deletions in regions of low gene density and maize lineage-specific genes.
Collapse
Affiliation(s)
- Yinping Jiao
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Paul Peluso
- Pacific Biosciences, Menlo Park, California 94025, USA
| | - Jinghua Shi
- BioNano Genomics, San Diego, California 92121, USA
| | | | - Michelle C Stitzer
- Department of Plant Sciences and Center for Population Biology, University of California, Davis, Davis, California 95616, USA
| | - Bo Wang
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | | | - Joshua C Stein
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Xuehong Wei
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | | | - Katherine Guill
- USDA-ARS, Plant Genetics Research Unit, Columbia, Missouri 65211, USA
| | - Michael Regulski
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Sunita Kumari
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | - Andrew Olson
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | | | - Kevin L Schneider
- Department of Molecular Biosciences and Bioengineering, University of Hawaii, Honolulu, Hawaii 96822, USA
| | - Thomas K Wolfgruber
- Department of Molecular Biosciences and Bioengineering, University of Hawaii, Honolulu, Hawaii 96822, USA
| | - Michael R May
- Department of Evolution and Ecology, University of California, Davis, California 95616, USA
| | - Nathan M Springer
- Department of Plant Biology, University of Minnesota, St Paul, Minnesota 55108, USA
| | - Eric Antoniou
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
| | | | - Gernot G Presting
- Department of Molecular Biosciences and Bioengineering, University of Hawaii, Honolulu, Hawaii 96822, USA
| | - Michael McMullen
- USDA-ARS, Plant Genetics Research Unit, Columbia, Missouri 65211, USA
| | - Jeffrey Ross-Ibarra
- Department of Plant Sciences, Center for Population Biology, and Genome Center, University of California, Davis, California 95616, USA
| | - R Kelly Dawe
- University of Georgia, Athens, Georgia 30602, USA
| | - Alex Hastie
- BioNano Genomics, San Diego, California 92121, USA
| | - David R Rank
- Pacific Biosciences, Menlo Park, California 94025, USA
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA
- USDA-ARS, NEA Robert W. Holley Center for Agriculture and Health, Cornell University, Ithaca, New York 14853, USA
| |
Collapse
|
94
|
Improved maize reference genome with single-molecule technologies. Nature 2017; 546:524-527. [PMID: 28605751 PMCID: PMC7052699 DOI: 10.1038/nature22971] [Citation(s) in RCA: 701] [Impact Index Per Article: 100.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2016] [Accepted: 05/14/2017] [Indexed: 01/01/2023]
Abstract
An improved reference genome for maize, using single-molecule sequencing and high-resolution optical mapping, enables characterization of structural variation and repetitive regions, and identifies lineage expansions of transposable elements that are unique to maize. The maize genome was initially reported in 2009 but with some accuracy limitations. Doreen Ware and colleagues report a new reference genome for maize using single-molecule sequencing and high-resolution optical mapping. The technique shows improvements in the gene space including resolution of gaps and misassemblies and correction of order and orientation of genes. The authors characterize structural variation and repetitive regions, and identify transposable element lineage expansions unique to maize. Complete and accurate reference genomes and annotations provide fundamental tools for characterization of genetic and functional variation1. These resources facilitate the determination of biological processes and support translation of research findings into improved and sustainable agricultural technologies. Many reference genomes for crop plants have been generated over the past decade, but these genomes are often fragmented and missing complex repeat regions2. Here we report the assembly and annotation of a reference genome of maize, a genetic and agricultural model species, using single-molecule real-time sequencing and high-resolution optical mapping. Relative to the previous reference genome3, our assembly features a 52-fold increase in contig length and notable improvements in the assembly of intergenic spaces and centromeres. Characterization of the repetitive portion of the genome revealed more than 130,000 intact transposable elements, allowing us to identify transposable element lineage expansions that are unique to maize. Gene annotations were updated using 111,000 full-length transcripts obtained by single-molecule real-time sequencing4. In addition, comparative optical mapping of two other inbred maize lines revealed a prevalence of deletions in regions of low gene density and maize lineage-specific genes.
Collapse
|
95
|
Li G, Jain R, Chern M, Pham NT, Martin JA, Wei T, Schackwitz WS, Lipzen AM, Duong PQ, Jones KC, Jiang L, Ruan D, Bauer D, Peng Y, Barry KW, Schmutz J, Ronald PC. The Sequences of 1504 Mutants in the Model Rice Variety Kitaake Facilitate Rapid Functional Genomic Studies. THE PLANT CELL 2017; 29:1218-1231. [PMID: 28576844 PMCID: PMC5502455 DOI: 10.1105/tpc.17.00154] [Citation(s) in RCA: 92] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Revised: 05/16/2017] [Accepted: 06/01/2017] [Indexed: 05/19/2023]
Abstract
The availability of a whole-genome sequenced mutant population and the cataloging of mutations of each line at a single-nucleotide resolution facilitate functional genomic analysis. To this end, we generated and sequenced a fast-neutron-induced mutant population in the model rice cultivar Kitaake (Oryza sativa ssp japonica), which completes its life cycle in 9 weeks. We sequenced 1504 mutant lines at 45-fold coverage and identified 91,513 mutations affecting 32,307 genes, i.e., 58% of all rice genes. We detected an average of 61 mutations per line. Mutation types include single-base substitutions, deletions, insertions, inversions, translocations, and tandem duplications. We observed a high proportion of loss-of-function mutations. We identified an inversion affecting a single gene as the causative mutation for the short-grain phenotype in one mutant line. This result reveals the usefulness of the resource for efficient, cost-effective identification of genes conferring specific phenotypes. To facilitate public access to this genetic resource, we established an open access database called KitBase that provides access to sequence data and seed stocks. This population complements other available mutant collections and gene-editing technologies. This work demonstrates how inexpensive next-generation sequencing can be applied to generate a high-density catalog of mutations.
Collapse
Affiliation(s)
- Guotian Li
- Department of Plant Pathology and the Genome Center, University of California, Davis, California 95616
- Grass Genetics, Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - Rashmi Jain
- Department of Plant Pathology and the Genome Center, University of California, Davis, California 95616
- Grass Genetics, Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - Mawsheng Chern
- Department of Plant Pathology and the Genome Center, University of California, Davis, California 95616
- Grass Genetics, Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - Nikki T Pham
- Department of Plant Pathology and the Genome Center, University of California, Davis, California 95616
- Grass Genetics, Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - Joel A Martin
- U.S. Department of Energy Joint Genome Institute, Walnut Creek, California 94598
| | - Tong Wei
- Department of Plant Pathology and the Genome Center, University of California, Davis, California 95616
- Grass Genetics, Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - Wendy S Schackwitz
- U.S. Department of Energy Joint Genome Institute, Walnut Creek, California 94598
| | - Anna M Lipzen
- U.S. Department of Energy Joint Genome Institute, Walnut Creek, California 94598
| | - Phat Q Duong
- Department of Plant Pathology and the Genome Center, University of California, Davis, California 95616
| | - Kyle C Jones
- Department of Plant Pathology and the Genome Center, University of California, Davis, California 95616
- Grass Genetics, Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - Liangrong Jiang
- Department of Plant Pathology and the Genome Center, University of California, Davis, California 95616
- School of Life Sciences, Xiamen University, Xiamen 361102, China
| | - Deling Ruan
- Department of Plant Pathology and the Genome Center, University of California, Davis, California 95616
- Grass Genetics, Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| | - Diane Bauer
- U.S. Department of Energy Joint Genome Institute, Walnut Creek, California 94598
| | - Yi Peng
- U.S. Department of Energy Joint Genome Institute, Walnut Creek, California 94598
| | - Kerrie W Barry
- U.S. Department of Energy Joint Genome Institute, Walnut Creek, California 94598
| | - Jeremy Schmutz
- U.S. Department of Energy Joint Genome Institute, Walnut Creek, California 94598
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806
| | - Pamela C Ronald
- Department of Plant Pathology and the Genome Center, University of California, Davis, California 95616
- Grass Genetics, Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, California 94720
| |
Collapse
|
96
|
Yuan Y, Bayer PE, Batley J, Edwards D. Improvements in Genomic Technologies: Application to Crop Genomics. Trends Biotechnol 2017; 35:547-558. [DOI: 10.1016/j.tibtech.2017.02.009] [Citation(s) in RCA: 54] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2016] [Revised: 02/10/2017] [Accepted: 02/14/2017] [Indexed: 12/13/2022]
|
97
|
Montenegro JD, Golicz AA, Bayer PE, Hurgobin B, Lee H, Chan CKK, Visendi P, Lai K, Doležel J, Batley J, Edwards D. The pangenome of hexaploid bread wheat. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 90:1007-1013. [PMID: 28231383 DOI: 10.1111/tpj.13515] [Citation(s) in RCA: 215] [Impact Index Per Article: 30.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2016] [Accepted: 02/06/2017] [Indexed: 05/19/2023]
Abstract
There is an increasing understanding that variation in gene presence-absence plays an important role in the heritability of agronomic traits; however, there have been relatively few studies on variation in gene presence-absence in crop species. Hexaploid wheat is one of the most important food crops in the world and intensive breeding has reduced the genetic diversity of elite cultivars. Major efforts have produced draft genome assemblies for the cultivar Chinese Spring, but it is unknown how well this represents the genome diversity found in current modern elite cultivars. In this study we build an improved reference for Chinese Spring and explore gene diversity across 18 wheat cultivars. We predict a pangenome size of 140 500 ± 102 genes, a core genome of 81 070 ± 1631 genes and an average of 128 656 genes in each cultivar. Functional annotation of the variable gene set suggests that it is enriched for genes that may be associated with important agronomic traits. In addition to variation in gene presence, more than 36 million intervarietal single nucleotide polymorphisms were identified across the pangenome. This study of the wheat pangenome provides insight into genome diversity in elite wheat as a basis for genomics-based improvement of this important crop. A wheat pangenome, GBrowse, is available at http://appliedbioinformatics.com.au/cgi-bin/gb2/gbrowse/WheatPan/, and data are available to download from http://wheatgenome.info/wheat_genome_databases.php.
Collapse
Affiliation(s)
- Juan D Montenegro
- School of Agriculture and Food Sciences, University of Queensland, Brisbane, Australia
| | - Agnieszka A Golicz
- School of Agriculture and Food Sciences, University of Queensland, Brisbane, Australia
- School of Plant Biology, University of Western Australia, Crawley, WA, 6009, Australia
| | - Philipp E Bayer
- School of Plant Biology, University of Western Australia, Crawley, WA, 6009, Australia
| | - Bhavna Hurgobin
- School of Agriculture and Food Sciences, University of Queensland, Brisbane, Australia
- School of Plant Biology, University of Western Australia, Crawley, WA, 6009, Australia
| | - HueyTyng Lee
- School of Agriculture and Food Sciences, University of Queensland, Brisbane, Australia
- School of Plant Biology, University of Western Australia, Crawley, WA, 6009, Australia
| | - Chon-Kit Kenneth Chan
- School of Plant Biology, University of Western Australia, Crawley, WA, 6009, Australia
| | - Paul Visendi
- School of Agriculture and Food Sciences, University of Queensland, Brisbane, Australia
| | | | - Jaroslav Doležel
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, CZ-783 71, Olomouc, Czech Republic
| | - Jacqueline Batley
- School of Agriculture and Food Sciences, University of Queensland, Brisbane, Australia
- School of Plant Biology, University of Western Australia, Crawley, WA, 6009, Australia
- Institute of Agriculture, University of Western Australia, Crawley, WA, 6009, Australia
| | - David Edwards
- School of Agriculture and Food Sciences, University of Queensland, Brisbane, Australia
- School of Plant Biology, University of Western Australia, Crawley, WA, 6009, Australia
- Institute of Agriculture, University of Western Australia, Crawley, WA, 6009, Australia
| |
Collapse
|
98
|
Varshney RK, Saxena RK, Upadhyaya HD, Khan AW, Yu Y, Kim C, Rathore A, Kim D, Kim J, An S, Kumar V, Anuradha G, Yamini KN, Zhang W, Muniswamy S, Kim JS, Penmetsa RV, von Wettberg E, Datta SK. Whole-genome resequencing of 292 pigeonpea accessions identifies genomic regions associated with domestication and agronomic traits. Nat Genet 2017; 49:1082-1088. [DOI: 10.1038/ng.3872] [Citation(s) in RCA: 150] [Impact Index Per Article: 21.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2016] [Accepted: 04/25/2017] [Indexed: 12/19/2022]
|
99
|
Du H, Yu Y, Ma Y, Gao Q, Cao Y, Chen Z, Ma B, Qi M, Li Y, Zhao X, Wang J, Liu K, Qin P, Yang X, Zhu L, Li S, Liang C. Sequencing and de novo assembly of a near complete indica rice genome. Nat Commun 2017; 8:15324. [PMID: 28469237 PMCID: PMC5418594 DOI: 10.1038/ncomms15324] [Citation(s) in RCA: 173] [Impact Index Per Article: 24.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2016] [Accepted: 03/17/2017] [Indexed: 01/03/2023] Open
Abstract
A high-quality reference genome is critical for understanding genome structure, genetic variation and evolution of an organism. Here we report the de novo assembly of an indica rice genome Shuhui498 (R498) through the integration of single-molecule sequencing and mapping data, genetic map and fosmid sequence tags. The 390.3 Mb assembly is estimated to cover more than 99% of the R498 genome and is more continuous than the current reference genomes of japonica rice Nipponbare (MSU7) and Arabidopsis thaliana (TAIR10). We annotate high-quality protein-coding genes in R498 and identify genetic variations between R498 and Nipponbare and presence/absence variations by comparing them to 17 draft genomes in cultivated rice and its closest wild relatives. Our results demonstrate how to de novo assemble a highly contiguous and near-complete plant genome through an integrative strategy. The R498 genome will serve as a reference for the discovery of genes and structural variations in rice. High-quality reference genomes facilitate analysis of genome structure and variation. Here Du et al. create a near-complete assembly of the indica rice genome by combining single molecule sequencing with mapping data and fosmid sequences and identify genetic variants by comparison with other rice genomes.
Collapse
Affiliation(s)
- Huilong Du
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China.,College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Ying Yu
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China
| | - Yanfei Ma
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China
| | - Qiang Gao
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China
| | - Yinghao Cao
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China
| | - Zhuo Chen
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China.,College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Bin Ma
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China
| | - Ming Qi
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China
| | - Yan Li
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China
| | - Xianfeng Zhao
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China
| | - Jing Wang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China
| | - Kunfan Liu
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China
| | - Peng Qin
- Rice Research Institute, Sichuan Agricultural University, Chengdu 611130, China
| | - Xin Yang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China
| | - Lihuang Zhu
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China
| | - Shigui Li
- Rice Research Institute, Sichuan Agricultural University, Chengdu 611130, China
| | - Chengzhi Liang
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, 1 Beichen West Road No. 2, Beijing 100101, China.,College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
100
|
SNP Discovery Using a Pangenome: Has the Single Reference Approach Become Obsolete? BIOLOGY 2017; 6:biology6010021. [PMID: 28287462 PMCID: PMC5372014 DOI: 10.3390/biology6010021] [Citation(s) in RCA: 51] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/13/2017] [Revised: 03/07/2017] [Accepted: 03/08/2017] [Indexed: 12/22/2022]
Abstract
Increasing evidence suggests that a single individual is insufficient to capture the genetic diversity within a species due to gene presence absence variation. In order to understand the extent to which genomic variation occurs in a species, the construction of its pangenome is necessary. The pangenome represents the complete set of genes of a species; it is composed of core genes, which are present in all individuals, and variable genes, which are present only in some individuals. Aside from variations at the gene level, single nucleotide polymorphisms (SNPs) are also an important form of genetic variation. The advent of next-generation sequencing (NGS) coupled with the heritability of SNPs make them ideal markers for genetic analysis of human, animal, and microbial data. SNPs have also been extensively used in crop genetics for association mapping, quantitative trait loci (QTL) analysis, analysis of genetic diversity, and phylogenetic analysis. This review focuses on the use of pangenomes for SNP discovery. It highlights the advantages of using a pangenome rather than a single reference for this purpose. This review also demonstrates how extra information not captured in a single reference alone can be used to provide additional support for linking genotypic data to phenotypic data.
Collapse
|