2
|
Beier S, Himmelbach A, Colmsee C, Zhang XQ, Barrero RA, Zhang Q, Li L, Bayer M, Bolser D, Taudien S, Groth M, Felder M, Hastie A, Šimková H, Staňková H, Vrána J, Chan S, Muñoz-Amatriaín M, Ounit R, Wanamaker S, Schmutzer T, Aliyeva-Schnorr L, Grasso S, Tanskanen J, Sampath D, Heavens D, Cao S, Chapman B, Dai F, Han Y, Li H, Li X, Lin C, McCooke JK, Tan C, Wang S, Yin S, Zhou G, Poland JA, Bellgard MI, Houben A, Doležel J, Ayling S, Lonardi S, Langridge P, Muehlbauer GJ, Kersey P, Clark MD, Caccamo M, Schulman AH, Platzer M, Close TJ, Hansson M, Zhang G, Braumann I, Li C, Waugh R, Scholz U, Stein N, Mascher M. Construction of a map-based reference genome sequence for barley, Hordeum vulgare L. Sci Data 2017; 4:170044. [PMID: 28448065 PMCID: PMC5407242 DOI: 10.1038/sdata.2017.44] [Citation(s) in RCA: 105] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2016] [Accepted: 02/09/2017] [Indexed: 12/30/2022] Open
Abstract
Barley (Hordeum vulgare L.) is a cereal grass mainly used as animal fodder and raw material for the malting industry. The map-based reference genome sequence of barley cv. ‘Morex’ was constructed by the International Barley Genome Sequencing Consortium (IBSC) using hierarchical shotgun sequencing. Here, we report the experimental and computational procedures to (i) sequence and assemble more than 80,000 bacterial artificial chromosome (BAC) clones along the minimum tiling path of a genome-wide physical map, (ii) find and validate overlaps between adjacent BACs, (iii) construct 4,265 non-redundant sequence scaffolds representing clusters of overlapping BACs, and (iv) order and orient these BAC clusters along the seven barley chromosomes using positional information provided by dense genetic maps, an optical map and chromosome conformation capture sequencing (Hi-C). Integrative access to these sequence and mapping resources is provided by the barley genome explorer (BARLEX).
Collapse
Affiliation(s)
- Sebastian Beier
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466 Seeland, Germany
| | - Axel Himmelbach
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466 Seeland, Germany
| | - Christian Colmsee
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466 Seeland, Germany
| | - Xiao-Qi Zhang
- School of Veterinary and Life Sciences, Murdoch University, Murdoch, Western Australia 6150, Australia
| | - Roberto A Barrero
- Centre for Comparative Genomics, Murdoch University, Murdoch, Western Australia 6150, Australia
| | - Qisen Zhang
- Australian Export Grains Innovation Centre, South Perth, Western Australia 6151, Australia
| | - Lin Li
- Department of Agronomy and Plant Genetics, University of Minnesota, St Paul, Minnesota 55108, USA
| | - Micha Bayer
- The James Hutton Institute, Dundee DD2 5DA, UK
| | - Daniel Bolser
- European Molecular Biology Laboratory-The European Bioinformatics Institute, Hinxton CB10 1SD, UK
| | - Stefan Taudien
- Leibniz Institute on Aging-Fritz Lipmann Institute (FLI), 07745 Jena, Germany
| | - Marco Groth
- Leibniz Institute on Aging-Fritz Lipmann Institute (FLI), 07745 Jena, Germany
| | - Marius Felder
- Leibniz Institute on Aging-Fritz Lipmann Institute (FLI), 07745 Jena, Germany
| | - Alex Hastie
- BioNano Genomics Inc., San Diego, California 92121, USA
| | - Hana Šimková
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, 78371 Olomouc, Czech Republic
| | - Helena Staňková
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, 78371 Olomouc, Czech Republic
| | - Jan Vrána
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, 78371 Olomouc, Czech Republic
| | - Saki Chan
- BioNano Genomics Inc., San Diego, California 92121, USA
| | - María Muñoz-Amatriaín
- Department of Botany &Plant Sciences, University of California, Riverside, Riverside, California 92521, USA
| | - Rachid Ounit
- Department of Computer Science and Engineering, University of California, Riverside, Riverside, California 92521, USA
| | - Steve Wanamaker
- Department of Botany &Plant Sciences, University of California, Riverside, Riverside, California 92521, USA
| | - Thomas Schmutzer
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466 Seeland, Germany
| | - Lala Aliyeva-Schnorr
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466 Seeland, Germany
| | - Stefano Grasso
- Department of Agricultural and Environmental Sciences, University of Udine, 33100 Udine, Italy
| | - Jaakko Tanskanen
- Green Technology, Natural Resources Institute (Luke), Viikki Plant Science Centre, and Institute of Biotechnology, University of Helsinki, 00014 Helsinki, Finland
| | | | | | - Sujie Cao
- BGI-Shenzhen, Shenzhen 518083, China
| | - Brett Chapman
- Centre for Comparative Genomics, Murdoch University, Murdoch, Western Australia 6150, Australia
| | - Fei Dai
- College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
| | - Yong Han
- College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
| | - Hua Li
- BGI-Shenzhen, Shenzhen 518083, China
| | - Xuan Li
- BGI-Shenzhen, Shenzhen 518083, China
| | | | - John K McCooke
- Centre for Comparative Genomics, Murdoch University, Murdoch, Western Australia 6150, Australia
| | - Cong Tan
- Centre for Comparative Genomics, Murdoch University, Murdoch, Western Australia 6150, Australia
| | | | - Shuya Yin
- College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
| | - Gaofeng Zhou
- School of Veterinary and Life Sciences, Murdoch University, Murdoch, Western Australia 6150, Australia
| | - Jesse A Poland
- Kansas State University, Wheat Genetics Resource Center, Department of Plant Pathology and Department of Agronomy, Manhattan, Kansas 66506, USA
| | - Matthew I Bellgard
- Centre for Comparative Genomics, Murdoch University, Murdoch, Western Australia 6150, Australia
| | - Andreas Houben
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466 Seeland, Germany
| | - Jaroslav Doležel
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, 78371 Olomouc, Czech Republic
| | | | - Stefano Lonardi
- Department of Computer Science and Engineering, University of California, Riverside, Riverside, California 92521, USA
| | - Peter Langridge
- School of Agriculture, University of Adelaide, Urrbrae, South Australia 5064, Australia
| | - Gary J Muehlbauer
- Department of Agronomy and Plant Genetics, University of Minnesota, St Paul, Minnesota 55108, USA.,Department of Plant and Microbial Biology, University of Minnesota, St Paul, Minnesota 55108, USA
| | - Paul Kersey
- European Molecular Biology Laboratory-The European Bioinformatics Institute, Hinxton CB10 1SD, UK
| | - Matthew D Clark
- Earlham Institute, Norwich NR4 7UH, UK.,School of Environmental Sciences, University of East Anglia, Norwich NR4 7UH, UK
| | - Mario Caccamo
- Earlham Institute, Norwich NR4 7UH, UK.,National Institute of Agricultural Botany, Cambridge CB3 0LE, UK
| | - Alan H Schulman
- Green Technology, Natural Resources Institute (Luke), Viikki Plant Science Centre, and Institute of Biotechnology, University of Helsinki, 00014 Helsinki, Finland
| | - Matthias Platzer
- Leibniz Institute on Aging-Fritz Lipmann Institute (FLI), 07745 Jena, Germany
| | - Timothy J Close
- Department of Botany &Plant Sciences, University of California, Riverside, Riverside, California 92521, USA
| | - Mats Hansson
- Department of Biology, Lund University, 22362 Lund, Sweden
| | - Guoping Zhang
- College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310058, China
| | - Ilka Braumann
- Carlsberg Research Laboratory, 1799 Copenhagen, Denmark
| | - Chengdao Li
- School of Veterinary and Life Sciences, Murdoch University, Murdoch, Western Australia 6150, Australia.,Department of Agriculture and Food, Government of Western Australia, South Perth, Western Australia 6150, Australia.,Hubei Collaborative Innovation Centre for Grain Industry, Yangtze University, Jingzhou, Hubei 434025, China
| | - Robbie Waugh
- The James Hutton Institute, Dundee DD2 5DA, UK.,School of Life Sciences, University of Dundee, Dundee DD2 5DA, UK
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466 Seeland, Germany
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466 Seeland, Germany.,School of Plant Biology, University of Western Australia, Crawley 6009, Australia
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466 Seeland, Germany.,German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, 04103 Leipzig, Germany
| |
Collapse
|
3
|
Mascher M, Gundlach H, Himmelbach A, Beier S, Twardziok SO, Wicker T, Radchuk V, Dockter C, Hedley PE, Russell J, Bayer M, Ramsay L, Liu H, Haberer G, Zhang XQ, Zhang Q, Barrero RA, Li L, Taudien S, Groth M, Felder M, Hastie A, Šimková H, Staňková H, Vrána J, Chan S, Muñoz-Amatriaín M, Ounit R, Wanamaker S, Bolser D, Colmsee C, Schmutzer T, Aliyeva-Schnorr L, Grasso S, Tanskanen J, Chailyan A, Sampath D, Heavens D, Clissold L, Cao S, Chapman B, Dai F, Han Y, Li H, Li X, Lin C, McCooke JK, Tan C, Wang P, Wang S, Yin S, Zhou G, Poland JA, Bellgard MI, Borisjuk L, Houben A, Doležel J, Ayling S, Lonardi S, Kersey P, Langridge P, Muehlbauer GJ, Clark MD, Caccamo M, Schulman AH, Mayer KFX, Platzer M, Close TJ, Scholz U, Hansson M, Zhang G, Braumann I, Spannagl M, Li C, Waugh R, Stein N. A chromosome conformation capture ordered sequence of the barley genome. Nature 2017; 544:427-433. [DOI: 10.1038/nature22043] [Citation(s) in RCA: 966] [Impact Index Per Article: 138.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2016] [Accepted: 03/03/2017] [Indexed: 02/06/2023]
|
4
|
Beier S, Himmelbach A, Schmutzer T, Felder M, Taudien S, Mayer KFX, Platzer M, Stein N, Scholz U, Mascher M. Multiplex sequencing of bacterial artificial chromosomes for assembling complex plant genomes. PLANT BIOTECHNOLOGY JOURNAL 2016; 14:1511-22. [PMID: 26801048 PMCID: PMC5066668 DOI: 10.1111/pbi.12511] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2015] [Revised: 11/11/2015] [Accepted: 11/13/2015] [Indexed: 05/02/2023]
Abstract
Hierarchical shotgun sequencing remains the method of choice for assembling high-quality reference sequences of complex plant genomes. The efficient exploitation of current high-throughput technologies and powerful computational facilities for large-insert clone sequencing necessitates the sequencing and assembly of a large number of clones in parallel. We developed a multiplexed pipeline for shotgun sequencing and assembling individual bacterial artificial chromosomes (BACs) using the Illumina sequencing platform. We illustrate our approach by sequencing 668 barley BACs (Hordeum vulgare L.) in a single Illumina HiSeq 2000 lane. Using a newly designed parallelized computational pipeline, we obtained sequence assemblies of individual BACs that consist, on average, of eight sequence scaffolds and represent >98% of the genomic inserts. Our BAC assemblies are clearly superior to a whole-genome shotgun assembly regarding contiguity, completeness and the representation of the gene space. Our methods may be employed to rapidly obtain high-quality assemblies of a large number of clones to assemble map-based reference sequences of plant and animal species with complex genomes by sequencing along a minimum tiling path.
Collapse
Affiliation(s)
- Sebastian Beier
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Stadt Seeland, Germany
| | - Axel Himmelbach
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Stadt Seeland, Germany
| | - Thomas Schmutzer
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Stadt Seeland, Germany
| | - Marius Felder
- Leibniz Institute on Aging-Fritz Lipmann Institute (FLI), Jena, Germany
| | - Stefan Taudien
- Leibniz Institute on Aging-Fritz Lipmann Institute (FLI), Jena, Germany
| | - Klaus F X Mayer
- Plant Genome and System Biology (PGSB), Helmholtz Center Munich, German Research Center for Environmental Health (GmbH), Neuherberg, Germany
| | - Matthias Platzer
- Leibniz Institute on Aging-Fritz Lipmann Institute (FLI), Jena, Germany
| | - Nils Stein
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Stadt Seeland, Germany
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Stadt Seeland, Germany
| | - Martin Mascher
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Stadt Seeland, Germany
| |
Collapse
|
5
|
Visendi P, Berkman PJ, Hayashi S, Golicz AA, Bayer PE, Ruperao P, Hurgobin B, Montenegro J, Chan CKK, Staňková H, Batley J, Šimková H, Doležel J, Edwards D. An efficient approach to BAC based assembly of complex genomes. PLANT METHODS 2016; 12:2. [PMID: 26793268 PMCID: PMC4719536 DOI: 10.1186/s13007-016-0107-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2015] [Accepted: 01/08/2016] [Indexed: 05/27/2023]
Abstract
BACKGROUND There has been an exponential growth in the number of genome sequencing projects since the introduction of next generation DNA sequencing technologies. Genome projects have increasingly involved assembly of whole genome data which produces inferior assemblies compared to traditional Sanger sequencing of genomic fragments cloned into bacterial artificial chromosomes (BACs). While whole genome shotgun sequencing using next generation sequencing (NGS) is relatively fast and inexpensive, this method is extremely challenging for highly complex genomes, where polyploidy or high repeat content confounds accurate assembly, or where a highly accurate 'gold' reference is required. Several attempts have been made to improve genome sequencing approaches by incorporating NGS methods, to variable success. RESULTS We present the application of a novel BAC sequencing approach which combines indexed pools of BACs, Illumina paired read sequencing, a sequence assembler specifically designed for complex BAC assembly, and a custom bioinformatics pipeline. We demonstrate this method by sequencing and assembling BAC cloned fragments from bread wheat and sugarcane genomes. CONCLUSIONS We demonstrate that our assembly approach is accurate, robust, cost effective and scalable, with applications for complete genome sequencing in large and complex genomes.
Collapse
Affiliation(s)
- Paul Visendi
- />School of Agriculture and Food Science, University of Queensland, Brisbane, QLD 4072 Australia
- />Centre for Biotechnology and Bioinformatics, College of Biological and Physical Sciences, University of Nairobi, P. O. Box 30197, Nairobi, 00100 Kenya
| | | | - Satomi Hayashi
- />School of Agriculture and Food Science, University of Queensland, Brisbane, QLD 4072 Australia
| | - Agnieszka A. Golicz
- />School of Agriculture and Food Science, University of Queensland, Brisbane, QLD 4072 Australia
| | - Philipp E. Bayer
- />School of Agriculture and Food Science, University of Queensland, Brisbane, QLD 4072 Australia
- />School of Plant Biology, University of Western Australia, Perth, WA 6009 Australia
| | - Pradeep Ruperao
- />School of Agriculture and Food Science, University of Queensland, Brisbane, QLD 4072 Australia
- />School of Plant Biology, University of Western Australia, Perth, WA 6009 Australia
| | - Bhavna Hurgobin
- />School of Agriculture and Food Science, University of Queensland, Brisbane, QLD 4072 Australia
- />School of Plant Biology, University of Western Australia, Perth, WA 6009 Australia
| | - Juan Montenegro
- />School of Agriculture and Food Science, University of Queensland, Brisbane, QLD 4072 Australia
| | - Chon-Kit Kenneth Chan
- />School of Agriculture and Food Science, University of Queensland, Brisbane, QLD 4072 Australia
- />School of Plant Biology, University of Western Australia, Perth, WA 6009 Australia
| | - Helena Staňková
- />Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, 78371 Olomouc, Czech Republic
| | - Jacqueline Batley
- />School of Agriculture and Food Science, University of Queensland, Brisbane, QLD 4072 Australia
- />School of Plant Biology, University of Western Australia, Perth, WA 6009 Australia
| | - Hana Šimková
- />Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, 78371 Olomouc, Czech Republic
| | - Jaroslav Doležel
- />Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Šlechtitelů 31, 78371 Olomouc, Czech Republic
| | - David Edwards
- />School of Agriculture and Food Science, University of Queensland, Brisbane, QLD 4072 Australia
- />School of Plant Biology, University of Western Australia, Perth, WA 6009 Australia
| |
Collapse
|