1
|
Deblieck M, Fatiukha A, Grundman N, Merchuk-Ovnat L, Saranga Y, Krugman T, Pillen K, Serfling A, Makalowski W, Ordon F, Perovic D. GenoTypeMapper: graphical genotyping on genetic and sequence-based maps. PLANT METHODS 2020; 16:123. [PMID: 32944061 PMCID: PMC7488165 DOI: 10.1186/s13007-020-00665-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/05/2019] [Accepted: 08/26/2020] [Indexed: 06/11/2023]
Abstract
BACKGROUND The rising availability of assemblies of large genomes (e.g. bread and durum wheat, barley) and their annotations deliver the basis to graphically present genome organization of parents and progenies on a physical scale. Genetic maps are a very important tool for breeders but often represent distorted models of the actual chromosomes, e.g., in centromeric and telomeric regions. This biased picture might lead to imprecise assumptions and estimations about the size and complexity of genetic regions and the selection of suitable molecular markers for the incorporation of traits in breeding populations or near-isogenic lines (NILs). Some software packages allow the graphical illustration of genotypic data, but to the best of our knowledge, suitable software packages that allow the comparison of genotypic data on the physical and genetic scale are currently unavailable. RESULTS We developed a simple Java-based-software called GenoTypeMapper (GTM) for comparing genotypic data on genetic and physical maps and tested it for effectiveness on data of two NILs that carry QTL-regions for drought stress tolerance from wild emmer on chromosome 2BS and 7AS. Both NILs were more tolerant to drought stress than their recurrent parents but exhibited additional undesirable traits such as delayed heading time. CONCLUSIONS In this article, we illustrate that the software easily allows users to display and identify additional chromosomal introgressions in both NILs originating from the wild emmer parent. The ability to detect and diminish linkage drag can be of particular interest for pre-breeding purposes and the developed software is a well-suited tool in this respect. The software is based on a simple allele-matching algorithm between the offspring and parents of a crossing scheme. Despite this simple approach, GTM seems to be the only software that allows us to analyse, illustrate and compare genotypic data of offspring of different crossing schemes with up to four parents in two different maps. So far, up to 500 individuals with a maximum number of 50,000 markers can be examined with the software. The main limitation that hampers the performance of the software is the number of markers that are examined in parallel. Since each individual must be analysed separately, a maximum of ten individuals can currently be displayed in a single run. On a computer with an Intel five processor of the 8th generation, GTM can reliably either analyse a single individual with up to 12,000 markers or ten individuals with up to 3,600 markers in less than five seconds. Future work aims to improve the performance of the software so that more complex crossing schemes with more parents and more markers can be analysed.
Collapse
Affiliation(s)
- Mathieu Deblieck
- Institute for Resistance Research and Stress Tolerance, Julius Kühn-Institute, Erwin-Baur-Str. 27, 06484 Quedlinburg, Germany
| | - Andrii Fatiukha
- Institute of Evolution and Department of Environmental and Evolutionary Biology, University of Haifa, Abba Khoushy Ave 199, 3498838 Haifa, Israel
| | - Norbert Grundman
- Faculty of Medicine, Institute of Bioinformatics, Westfälische Wilhelms-Universität Münster, Niels-Stensen Strasse 14, 48149 Münster, Germany
| | - Lianne Merchuk-Ovnat
- Robert H. Smith Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, POB 12, 76100 Rehovot, Israel
| | - Yehoshua Saranga
- Robert H. Smith Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, POB 12, 76100 Rehovot, Israel
| | - Tamar Krugman
- Institute of Evolution and Department of Environmental and Evolutionary Biology, University of Haifa, Abba Khoushy Ave 199, 3498838 Haifa, Israel
| | - Klaus Pillen
- Institute of Agricultural and Nutritional Sciences, Department of Plant Breeding, Martin Luther University Halle-Wittenberg, Betty-Heimann-Str. 3, 06120 Halle, Germany
| | - Albrecht Serfling
- Institute for Resistance Research and Stress Tolerance, Julius Kühn-Institute, Erwin-Baur-Str. 27, 06484 Quedlinburg, Germany
| | - Wojciech Makalowski
- Faculty of Medicine, Institute of Bioinformatics, Westfälische Wilhelms-Universität Münster, Niels-Stensen Strasse 14, 48149 Münster, Germany
| | - Frank Ordon
- Institute for Resistance Research and Stress Tolerance, Julius Kühn-Institute, Erwin-Baur-Str. 27, 06484 Quedlinburg, Germany
| | - Dragan Perovic
- Institute for Resistance Research and Stress Tolerance, Julius Kühn-Institute, Erwin-Baur-Str. 27, 06484 Quedlinburg, Germany
| |
Collapse
|
2
|
Alonge M, Wang X, Benoit M, Soyk S, Pereira L, Zhang L, Suresh H, Ramakrishnan S, Maumus F, Ciren D, Levy Y, Harel TH, Shalev-Schlosser G, Amsellem Z, Razifard H, Caicedo AL, Tieman DM, Klee H, Kirsche M, Aganezov S, Ranallo-Benavidez TR, Lemmon ZH, Kim J, Robitaille G, Kramer M, Goodwin S, McCombie WR, Hutton S, Van Eck J, Gillis J, Eshed Y, Sedlazeck FJ, van der Knaap E, Schatz MC, Lippman ZB. Major Impacts of Widespread Structural Variation on Gene Expression and Crop Improvement in Tomato. Cell 2020; 182:145-161.e23. [PMID: 32553272 PMCID: PMC7354227 DOI: 10.1016/j.cell.2020.05.021] [Citation(s) in RCA: 364] [Impact Index Per Article: 91.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Revised: 04/10/2020] [Accepted: 05/12/2020] [Indexed: 12/22/2022]
Abstract
Structural variants (SVs) underlie important crop improvement and domestication traits. However, resolving the extent, diversity, and quantitative impact of SVs has been challenging. We used long-read nanopore sequencing to capture 238,490 SVs in 100 diverse tomato lines. This panSV genome, along with 14 new reference assemblies, revealed large-scale intermixing of diverse genotypes, as well as thousands of SVs intersecting genes and cis-regulatory regions. Hundreds of SV-gene pairs exhibit subtle and significant expression changes, which could broadly influence quantitative trait variation. By combining quantitative genetics with genome editing, we show how multiple SVs that changed gene dosage and expression levels modified fruit flavor, size, and production. In the last example, higher order epistasis among four SVs affecting three related transcription factors allowed introduction of an important harvesting trait in modern tomato. Our findings highlight the underexplored role of SVs in genotype-to-phenotype relationships and their widespread importance and utility in crop improvement.
Collapse
Affiliation(s)
- Michael Alonge
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Xingang Wang
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Matthias Benoit
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA; Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Sebastian Soyk
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Lara Pereira
- Center for Applied Genetic Technologies, Genetics & Genomics, University of Georgia, Athens, GA 30602, USA
| | - Lei Zhang
- Center for Applied Genetic Technologies, Genetics & Genomics, University of Georgia, Athens, GA 30602, USA
| | - Hamsini Suresh
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | | | - Florian Maumus
- URGI, INRA, Université Paris-Saclay, 78026 Versailles, France
| | - Danielle Ciren
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Yuval Levy
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Tom Hai Harel
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Gili Shalev-Schlosser
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Ziva Amsellem
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Hamid Razifard
- Institute for Applied Life Sciences, University of Massachusetts Amherst, Amherst, MA 01003, USA; Department of Biology, University of Massachusetts Amherst, Amherst, MA 01003, USA
| | - Ana L Caicedo
- Institute for Applied Life Sciences, University of Massachusetts Amherst, Amherst, MA 01003, USA; Department of Biology, University of Massachusetts Amherst, Amherst, MA 01003, USA
| | - Denise M Tieman
- Horticultural Sciences, Plant Innovation Center, University of Florida, Gainesville, FL 32611, USA
| | - Harry Klee
- Horticultural Sciences, Plant Innovation Center, University of Florida, Gainesville, FL 32611, USA
| | - Melanie Kirsche
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Sergey Aganezov
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA
| | | | - Zachary H Lemmon
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Jennifer Kim
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA; Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Gina Robitaille
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA; Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Melissa Kramer
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Sara Goodwin
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - W Richard McCombie
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Samuel Hutton
- Gulf Coast Research and Education Center, University of Florida, Wimauma, FL 33598, USA
| | - Joyce Van Eck
- Boyce Thompson Institute, Ithaca, NY 14853, USA; Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
| | - Jesse Gillis
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Yuval Eshed
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Esther van der Knaap
- Center for Applied Genetic Technologies, Genetics & Genomics, University of Georgia, Athens, GA 30602, USA; Institute of Plant Breeding, Genetics and Genomics, University of Georgia, Athens, GA 30602, USA; Department of Horticulture, University of Georgia, Athens, GA 30602, USA
| | - Michael C Schatz
- Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA; Department of Biology, Johns Hopkins University, Baltimore, MD 21218, USA.
| | - Zachary B Lippman
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA; Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA.
| |
Collapse
|
3
|
Rommel Fuentes R, Hesselink T, Nieuwenhuis R, Bakker L, Schijlen E, van Dooijeweert W, Diaz Trivino S, de Haan JR, Sanchez Perez G, Zhang X, Fransz P, de Jong H, van Dijk ADJ, de Ridder D, Peters SA. Meiotic recombination profiling of interspecific hybrid F1 tomato pollen by linked read sequencing. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2020; 102:480-492. [PMID: 31820490 DOI: 10.1111/tpj.14640] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Revised: 11/25/2019] [Accepted: 12/04/2019] [Indexed: 06/10/2023]
Abstract
Genome wide screening of pooled pollen samples from a single interspecific F1 hybrid obtained from a cross between tomato, Solanum lycopersicum and its wild relative, Solanum pimpinellifolium using linked read sequencing of the haploid nuclei, allowed profiling of the crossover (CO) and gene conversion (GC) landscape. We observed a striking overlap between cold regions of CO in the male gametes and our previously established F6 recombinant inbred lines (RILs) population. COs were overrepresented in non-coding regions in the gene promoter and 5'UTR regions of genes. Poly-A/T and AT rich motifs were found enriched in 1 kb promoter regions flanking the CO sites. Non-crossover associated allelic and ectopic GCs were detected in most chromosomes, confirming that besides CO, GC represents also a source for genetic diversity and genome plasticity in tomato. Furthermore, we identified processed break junctions pointing at the involvement of both homology directed and non-homology directed repair pathways, suggesting a recombination machinery in tomato that is more complex than currently anticipated.
Collapse
Affiliation(s)
- Roven Rommel Fuentes
- Bioinformatics Group, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Thamara Hesselink
- Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Ronald Nieuwenhuis
- Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Linda Bakker
- Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Elio Schijlen
- Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Willem van Dooijeweert
- Centre for Genetic Resources, Wageningen University and Research, Wageningen, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Sara Diaz Trivino
- Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Jorn R de Haan
- Genetwister Technologies B.V., Nieuwe Kanaal 7b, 6709 PA, Wageningen, The Netherlands
| | - Gabino Sanchez Perez
- Genetwister Technologies B.V., Nieuwe Kanaal 7b, 6709 PA, Wageningen, The Netherlands
| | - Xinyue Zhang
- Swammerdam Institute for Life Sciences, University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands
| | - Paul Fransz
- Swammerdam Institute for Life Sciences, University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands
| | - Hans de Jong
- Laboratory of Genetics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Aalt D J van Dijk
- Bioinformatics Group, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- Biometris, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Dick de Ridder
- Bioinformatics Group, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Sander A Peters
- Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| |
Collapse
|
4
|
Soyk S, Lemmon ZH, Sedlazeck FJ, Jiménez-Gómez JM, Alonge M, Hutton SF, Van Eck J, Schatz MC, Lippman ZB. Duplication of a domestication locus neutralized a cryptic variant that caused a breeding barrier in tomato. NATURE PLANTS 2019; 5:471-479. [PMID: 31061537 DOI: 10.1038/s41477-019-0422-z] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Accepted: 04/02/2019] [Indexed: 06/09/2023]
Abstract
Genome editing technologies are being widely adopted in plant breeding1. However, a looming challenge of engineering desirable genetic variation in diverse genotypes is poor predictability of phenotypic outcomes due to unforeseen interactions with pre-existing cryptic mutations2-4. In tomato, breeding with a classical MADS-box gene mutation that improves harvesting by eliminating fruit stem abscission frequently results in excessive inflorescence branching, flowering and reduced fertility due to interaction with a cryptic variant that causes partial mis-splicing in a homologous gene5-8. Here, we show that a recently evolved tandem duplication carrying the second-site variant achieves a threshold of functional transcripts to suppress branching, enabling breeders to neutralize negative epistasis on yield. By dissecting the dosage mechanisms by which this structural variant restored normal flowering and fertility, we devised strategies that use CRISPR-Cas9 genome editing to predictably improve harvesting. Our findings highlight the under-appreciated impact of epistasis in targeted trait breeding and underscore the need for a deeper characterization of cryptic variation to enable the full potential of genome editing in agriculture.
Collapse
Affiliation(s)
- Sebastian Soyk
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| | | | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - José M Jiménez-Gómez
- Institut Jean-Pierre Bourgin, INRA, AgroParisTech, CNRS, Université Paris-Saclay, Versailles, France
| | - Michael Alonge
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Samuel F Hutton
- Horticultural Sciences Department, University of Florida, Wimauma, FL, USA
| | - Joyce Van Eck
- The Boyce Thompson Institute, Ithaca, NY, USA
- Plant Breeding and Genetics Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, USA
| | - Michael C Schatz
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
- Department of Oncologye, Johns Hopkins Medicine, Baltimore, MD, USA
| | - Zachary B Lippman
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| |
Collapse
|
5
|
Demirci S, Peters SA, de Ridder D, van Dijk ADJ. DNA sequence and shape are predictive for meiotic crossovers throughout the plant kingdom. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2018; 95:686-699. [PMID: 29808512 DOI: 10.1111/tpj.13979] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2017] [Revised: 05/11/2018] [Accepted: 05/14/2018] [Indexed: 06/08/2023]
Abstract
A better understanding of genomic features influencing the location of meiotic crossovers (COs) in plant species is both of fundamental importance and of practical relevance for plant breeding. Using CO positions with sufficiently high resolution from four plant species [Arabidopsis thaliana, Solanum lycopersicum (tomato), Zea mays (maize) and Oryza sativa (rice)] we have trained machine-learning models to predict the susceptibility to CO formation. Our results show that CO occurrence within various plant genomes can be predicted by DNA sequence and shape features. Several features related to genome content and to genomic accessibility were consistently either positively or negatively related to COs in all four species. Other features were found as predictive only in specific species. Gene annotation-related features were especially predictive for maize, whereas in tomato and Arabidopsis propeller twist and helical twist (DNA shape features) and AT/TA dinucleotides were found to be the most important. In rice, high roll (another DNA shape feature) and low CA dinucleotide frequency in particular were found to be associated with CO occurrence. The accuracy of our models was sufficient for Arabidopsis and rice (area under receiver operating characteristic curve, AUROC > 0.5), and was high for tomato and maize (AUROC ≫ 0.5), demonstrating that DNA sequence and shape are predictive for meiotic COs throughout the plant kingdom.
Collapse
Affiliation(s)
- Sevgin Demirci
- Business Unit Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, the Netherlands
- Bioinformatics Group, Wageningen University and Research, Wageningen, the Netherlands
| | - Sander A Peters
- Business Unit Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, the Netherlands
| | - Dick de Ridder
- Bioinformatics Group, Wageningen University and Research, Wageningen, the Netherlands
| | - Aalt D J van Dijk
- Business Unit Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, the Netherlands
- Bioinformatics Group, Wageningen University and Research, Wageningen, the Netherlands
- Biometris, Wageningen University and Research, Wageningen, the Netherlands
| |
Collapse
|
6
|
Silva Ferreira D, Kevei Z, Kurowski T, de Noronha Fonseca ME, Mohareb F, Boiteux LS, Thompson AJ. BIFURCATE FLOWER TRUSS: a novel locus controlling inflorescence branching in tomato contains a defective MAP kinase gene. JOURNAL OF EXPERIMENTAL BOTANY 2018; 69:2581-2593. [PMID: 29509915 PMCID: PMC5920302 DOI: 10.1093/jxb/ery076] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/29/2017] [Accepted: 02/21/2018] [Indexed: 06/08/2023]
Abstract
A mutant line, bifurcate flower truss (bif), was recovered from a tomato genetics programme. Plants from the control line produced a mean of 0.16 branches per truss, whereas the value for bif plants was 4.1. This increase in branching was accompanied by a 3.3-fold increase in flower number and showed a significant interaction with exposure to low temperature during truss development. The control line and bif genomes were resequenced and the bif gene was mapped to a 2.01 Mbp interval on chromosome 12; all coding region polymorphisms in the interval were surveyed, and five candidate genes displaying altered protein sequences were detected. One of these genes, SlMAPK1, encoding a mitogen-activated protein (MAP) kinase, contained a leucine to stop codon mutation predicted to disrupt kinase function. SlMAPK1 is an excellent candidate for bif because knock-out mutations of an Arabidopsis orthologue MPK6 were reported to have increased flower number. An introgression browser was used to demonstrate that the origin of the bif genomic DNA at the BIF locus was Solanum galapagense and that the SlMAPK1 null mutant is a naturally occurring allele widespread only on the Galápagos Islands. This work strongly implicates SlMAPK1 as part of the network of genes controlling inflorescence branching in tomato.
Collapse
Affiliation(s)
| | - Zoltan Kevei
- Cranfield Soil and Agrifood Institute, Cranfield University, Cranfield, UK
| | - Tomasz Kurowski
- Cranfield Soil and Agrifood Institute, Cranfield University, Cranfield, UK
| | | | - Fady Mohareb
- Cranfield Soil and Agrifood Institute, Cranfield University, Cranfield, UK
| | - Leonardo S Boiteux
- National Center for Vegetable Crops Research, CNPH—Embrapa Hortaliças, Brasília-DF, Brazil
| | - Andrew J Thompson
- Cranfield Soil and Agrifood Institute, Cranfield University, Cranfield, UK
| |
Collapse
|
7
|
Shea DJ, Shimizu M, Nishida N, Fukai E, Abe T, Fujimoto R, Okazaki K. IntroMap: a signal analysis based method for the detection of genomic introgressions. BMC Genet 2017; 18:101. [PMID: 29202713 PMCID: PMC5716257 DOI: 10.1186/s12863-017-0568-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Accepted: 11/14/2017] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Breeding programs often rely on marker-assisted tests or variant calling of next generation sequence (NGS) data to identify regions of genomic introgression arising from the hybridization of two plant species. In this paper we present IntroMap, a bioinformatics pipeline for the screening of candidate plants through the application of signal processing techniques to NGS data, using alignment to a reference genome sequence (annotation is not required) that shares homology with the recurrent parental cultivar, and without the need for de novo assembly of the read data or variant calling. RESULTS We show the accurate identification of introgressed genomic regions using both in silico simulated genomes, and a hybridized cultivar data set using our pipeline. Additionally we show, through targeted marker-based assays, validation of the IntroMap predicted regions for the hybrid cultivar. CONCLUSIONS This approach can be used to automate the screening of large populations, reducing the time and labor required, and can improve the accuracy of the detection of introgressed regions in comparison to a marker-based approach. In contrast to other approaches that generally rely upon a variant calling step, our method achieves accurate identification of introgressed regions without variant calling, relying solely upon alignment.
Collapse
Affiliation(s)
- Daniel J Shea
- Laboratory of Plant Breeding, Graduate School of Science and Technology, Niigata University, Ikarashi-ninocho, Niigata, 950-2181, Japan
| | - Motoki Shimizu
- Iwate Biotechnology Research Center, Narita, Kitakami, 024-0003, Japan
| | - Namiko Nishida
- Graduate School of Agricultural Science, Kobe University, Rokkodai, Nada-ku, Kobe, 657-8501, Japan
| | - Eigo Fukai
- Laboratory of Plant Breeding, Graduate School of Science and Technology, Niigata University, Ikarashi-ninocho, Niigata, 950-2181, Japan
| | - Takashi Abe
- Department of Computer Science, Graduate School of Science and Technology, Niigata University, Ikarashi-ninocho, Niigata, 950-2181, Japan
| | - Ryo Fujimoto
- Graduate School of Agricultural Science, Kobe University, Rokkodai, Nada-ku, Kobe, 657-8501, Japan
| | - Keiichi Okazaki
- Laboratory of Plant Breeding, Graduate School of Science and Technology, Niigata University, Ikarashi-ninocho, Niigata, 950-2181, Japan.
| |
Collapse
|
8
|
Demirci S, van Dijk ADJ, Sanchez Perez G, Aflitos SA, de Ridder D, Peters SA. Distribution, position and genomic characteristics of crossovers in tomato recombinant inbred lines derived from an interspecific cross between Solanum lycopersicum and Solanum pimpinellifolium. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 89:554-564. [PMID: 27797425 DOI: 10.1111/tpj.13406] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2016] [Revised: 10/21/2016] [Accepted: 10/21/2016] [Indexed: 05/20/2023]
Abstract
We determined the crossover (CO) distribution, frequency and genomic sequences involved in interspecies meiotic recombination by using parent-assigned variants of 52 F6 recombinant inbred lines obtained from a cross between tomato, Solanum lycopersicum, and its wild relative, Solanum pimpinellifolium. The interspecific CO frequency was 80% lower than reported for intraspecific tomato crosses. We detected regions showing a relatively high and low CO frequency, so-called hot and cold regions. Cold regions coincide to a large extent with the heterochromatin, although we found a limited number of smaller cold regions in the euchromatin. The CO frequency was higher at the distal ends of chromosomes than in pericentromeric regions and higher in short arm euchromatin. Hot regions of CO were detected in euchromatin, and COs were more often located in non-coding regions near the 5' untranslated region of genes than expected by chance. Besides overrepresented CCN repeats, we detected poly-A/T and AT-rich motifs enriched in 1-kb promoter regions flanking the CO sites. The most abundant sequence motifs at CO sites share weak similarity to transcription factor-binding sites, such as for the C2H2 zinc finger factors class and MADS box factors, while InterPro scans detected enrichment for genes possibly involved in the repair of DNA breaks.
Collapse
Affiliation(s)
- Sevgin Demirci
- Bioinformatics Group, Wageningen University and Research, Wageningen, The Netherlands
- Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Aalt D J van Dijk
- Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- Biometris, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Gabino Sanchez Perez
- Bioinformatics Group, Wageningen University and Research, Wageningen, The Netherlands
- Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Saulo A Aflitos
- Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| | - Dick de Ridder
- Bioinformatics Group, Wageningen University and Research, Wageningen, The Netherlands
| | - Sander A Peters
- Business Unit of Bioscience, Cluster Applied Bioinformatics, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
| |
Collapse
|
9
|
Abrouk M, Balcárková B, Šimková H, Komínkova E, Martis MM, Jakobson I, Timofejeva L, Rey E, Vrána J, Kilian A, Järve K, Doležel J, Valárik M. The in silico identification and characterization of a bread wheat/Triticum militinae introgression line. PLANT BIOTECHNOLOGY JOURNAL 2017; 15:249-256. [PMID: 27510270 PMCID: PMC5259550 DOI: 10.1111/pbi.12610] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2015] [Revised: 07/21/2016] [Accepted: 08/08/2016] [Indexed: 05/23/2023]
Abstract
The capacity of the bread wheat (Triticum aestivum) genome to tolerate introgression from related genomes can be exploited for wheat improvement. A resistance to powdery mildew expressed by a derivative of the cross-bread wheat cv. Tähti × T. militinae (Tm) is known to be due to the incorporation of a Tm segment into the long arm of chromosome 4A. Here, a newly developed in silico method termed rearrangement identification and characterization (RICh) has been applied to characterize the introgression. A virtual gene order, assembled using the GenomeZipper approach, was obtained for the native copy of chromosome 4A; it incorporated 570 4A DArTseq markers to produce a zipper comprising 2132 loci. A comparison between the native and introgressed forms of the 4AL chromosome arm showed that the introgressed region is located at the distal part of the arm. The Tm segment, derived from chromosome 7G, harbours 131 homoeologs of the 357 genes present on the corresponding region of Chinese Spring 4AL. The estimated number of Tm genes transferred along with the disease resistance gene was 169. Characterizing the introgression's position, gene content and internal gene order should not only facilitate gene isolation, but may also be informative with respect to chromatin structure and behaviour studies.
Collapse
Affiliation(s)
- Michael Abrouk
- Institute of Experimental BotanyCentre of the Region Haná for Biotechnological and Agricultural ResearchOlomoucCzech Republic
| | - Barbora Balcárková
- Institute of Experimental BotanyCentre of the Region Haná for Biotechnological and Agricultural ResearchOlomoucCzech Republic
| | - Hana Šimková
- Institute of Experimental BotanyCentre of the Region Haná for Biotechnological and Agricultural ResearchOlomoucCzech Republic
| | - Eva Komínkova
- Institute of Experimental BotanyCentre of the Region Haná for Biotechnological and Agricultural ResearchOlomoucCzech Republic
| | - Mihaela M. Martis
- Munich Information Center for Protein Sequences/Institute of Bioinformatics and Systems BiologyInstitute for Bioinformatics and Systems BiologyHelmholtz Center MunichNeuherbergGermany
- Division of Cell BiologyDepartment of Clinical and Experimental Medicine, Bioinformatics Infrastructure for Life SciencesLinköping UniversityLinköpingSweden
| | - Irena Jakobson
- Department of Gene TechnologyTallinn University of TechnologyTallinnEstonia
| | | | - Elodie Rey
- Institute of Experimental BotanyCentre of the Region Haná for Biotechnological and Agricultural ResearchOlomoucCzech Republic
| | - Jan Vrána
- Institute of Experimental BotanyCentre of the Region Haná for Biotechnological and Agricultural ResearchOlomoucCzech Republic
| | | | - Kadri Järve
- Department of Gene TechnologyTallinn University of TechnologyTallinnEstonia
| | - Jaroslav Doležel
- Institute of Experimental BotanyCentre of the Region Haná for Biotechnological and Agricultural ResearchOlomoucCzech Republic
| | - Miroslav Valárik
- Institute of Experimental BotanyCentre of the Region Haná for Biotechnological and Agricultural ResearchOlomoucCzech Republic
| |
Collapse
|
10
|
Gaiero P, van de Belt J, Vilaró F, Schranz ME, Speranza P, de Jong H. Collinearity between potato (Solanum tuberosum L.) and wild relatives assessed by comparative cytogenetic mapping. Genome 2016; 60:228-240. [PMID: 28169563 DOI: 10.1139/gen-2016-0150] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A major bottleneck to introgressive hybridization is the lack of genome collinearity between the donor (alien) genome and the recipient crop genome. Structural differences between the homeologs may create unbalanced segregation of chromosomes or cause linkage drag. To assess large-scale collinearity between potato and two of its wild relatives (Solanum commersonii and Solanum chacoense), we used BAC-FISH mapping of sequences with known positions on the RH potato map. BAC probes could successfully be hybridized to the S. commersonii and S. chachoense pachytene chromosomes, confirming their correspondence with linkage groups in RH potato. Our study shows that the order of BAC signals is conserved. Distances between BAC signals were quantified and compared; some differences found suggest either small-scale rearrangements or reduction/amplification of repeats. We conclude that S. commersonii and S. chacoense are collinear with cultivated Solanum tuberosum on the whole chromosome scale, making these amenable species for efficient introgressive hybridization breeding.
Collapse
Affiliation(s)
- Paola Gaiero
- a Department of Plant Biology, Facultad de Agronomía, Universidad de la República, Garzón 780, PC 12900, Montevideo, Uruguay.,b Laboratory of Genetics, Wageningen University, Droevendaalsesteeg 1, P.O. Box 16, 6708 PB, Wageningen, the Netherlands
| | - José van de Belt
- b Laboratory of Genetics, Wageningen University, Droevendaalsesteeg 1, P.O. Box 16, 6708 PB, Wageningen, the Netherlands
| | - Francisco Vilaró
- c Horticulture Unit, National Institute for Agricultural Research, Ruta 48 km 10, Las Brujas, Uruguay
| | - M Eric Schranz
- d Biosystematics Group, Wageningen University, Wageningen, the Netherlands
| | - Pablo Speranza
- a Department of Plant Biology, Facultad de Agronomía, Universidad de la República, Garzón 780, PC 12900, Montevideo, Uruguay
| | - Hans de Jong
- b Laboratory of Genetics, Wageningen University, Droevendaalsesteeg 1, P.O. Box 16, 6708 PB, Wageningen, the Netherlands
| |
Collapse
|
11
|
Gupta P, Naithani S, Tello-Ruiz MK, Chougule K, D’Eustachio P, Fabregat A, Jiao Y, Keays M, Lee YK, Kumari S, Mulvaney J, Olson A, Preece J, Stein J, Wei S, Weiser J, Huerta L, Petryszak R, Kersey P, Stein LD, Ware D, Jaiswal P. Gramene Database: Navigating Plant Comparative Genomics Resources. CURRENT PLANT BIOLOGY 2016; 7-8:10-15. [PMID: 28713666 PMCID: PMC5509230 DOI: 10.1016/j.cpb.2016.12.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Gramene (http://www.gramene.org) is an online, open source, curated resource for plant comparative genomics and pathway analysis designed to support researchers working in plant genomics, breeding, evolutionary biology, system biology, and metabolic engineering. It exploits phylogenetic relationships to enrich the annotation of genomic data and provides tools to perform powerful comparative analyses across a wide spectrum of plant species. It consists of an integrated portal for querying, visualizing and analyzing data for 44 plant reference genomes, genetic variation data sets for 12 species, expression data for 16 species, curated rice pathways and orthology-based pathway projections for 66 plant species including various crops. Here we briefly describe the functions and uses of the Gramene database.
Collapse
Affiliation(s)
- Parul Gupta
- Department of Botany & Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Sushma Naithani
- Department of Botany & Plant Pathology, Oregon State University, Corvallis, OR, USA
| | | | | | | | - Antonio Fabregat
- European Molecular Biology Laboratory - European Bioinformatics Institute, Hinxton, UK
| | - Yinping Jiao
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Maria Keays
- European Molecular Biology Laboratory - European Bioinformatics Institute, Hinxton, UK
| | | | - Sunita Kumari
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | | | - Andrew Olson
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Justin Preece
- Department of Botany & Plant Pathology, Oregon State University, Corvallis, OR, USA
| | - Joshua Stein
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Sharon Wei
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Joel Weiser
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Laura Huerta
- European Molecular Biology Laboratory - European Bioinformatics Institute, Hinxton, UK
| | - Robert Petryszak
- European Molecular Biology Laboratory - European Bioinformatics Institute, Hinxton, UK
| | - Paul Kersey
- European Molecular Biology Laboratory - European Bioinformatics Institute, Hinxton, UK
| | | | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
- USDA ARS NEA Plant, Soil & Nutrition Laboratory Research Unit, Ithaca, NY, USA
| | - Pankaj Jaiswal
- Department of Botany & Plant Pathology, Oregon State University, Corvallis, OR, USA
- To whom correspondence should be addressed Address of the corresponding author: Department of Botany & Plant Pathology, Oregon State University, Corvallis, OR, USA, Phone: +1-541-737-8471, Fax: +1-541-737-3573,
| |
Collapse
|
12
|
van Andel TR, Meyer RS, Aflitos SA, Carney JA, Veltman MA, Copetti D, Flowers JM, Havinga RM, Maat H, Purugganan MD, Wing RA, Schranz ME. Tracing ancestor rice of Suriname Maroons back to its African origin. NATURE PLANTS 2016; 2:16149. [PMID: 27694825 DOI: 10.1038/nplants.2016.149] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Accepted: 08/30/2016] [Indexed: 05/26/2023]
Abstract
African rice (Oryza glaberrima) and African cultivation practices are said to have influenced emerging colonial plantation economies in the Americas1,2. However, the level of impact of African rice practices is difficult to establish because of limited written or botanical records2,3. Recent findings of O. glaberrima in rice fields of Suriname Maroons bear evidence of the high level of knowledge about rice among African slaves and their descendants, who consecrate it in ancestor rituals4,5. Here we establish the strong similarity, and hence likely origin, of the first extant New World landrace of O. glaberrima to landraces from the Upper Guinean forests in West Africa. We collected African rice from a Maroon market in Paramaribo, Suriname, propagated it, sequenced its genome6 and compared it with genomes of 109 accessions representing O. glaberrima diversity across West Africa. By analysing 1,649,769 single nucleotide polymorphisms (SNPs) in clustering analyses, the Suriname sample appears sister to an Ivory Coast landrace, and shows no evidence of introgression from Asian rice. Whereas the Dutch took most slaves from Ghana, Benin and Central Africa7, the diaries of slave ship captains record the purchase of food for provisions when sailing along the West African Coast8, offering one possible explanation for the patterns of genetic similarity. This study demonstrates the utility of genomics in understanding the largely unwritten histories of crop cultures of diaspora communities.
Collapse
Affiliation(s)
- Tinde R van Andel
- Biosystematics group, Wageningen University, PO Box 16, 6700 AP, Wageningen, The Netherlands
- Naturalis Biodiversity Center, PO Box 9517, 2300 RA, Leiden, The Netherlands
| | - Rachel S Meyer
- Department of Biology, Center for Genomics and Systems Biology, New York University, 12 Waverly Place, New York, New York 10003, USA
| | - Saulo A Aflitos
- Biosystematics group, Wageningen University, PO Box 16, 6700 AP, Wageningen, The Netherlands
| | - Judith A Carney
- Departmemt of Geography, University of California, Box 951524, Los Angeles, California 90095, USA
| | - Margaretha A Veltman
- Biosystematics group, Wageningen University, PO Box 16, 6700 AP, Wageningen, The Netherlands
| | - Dario Copetti
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, 1657 E. Helen Street, Tucson, Arizona 85721, USA
| | - Jonathan M Flowers
- Department of Biology, Center for Genomics and Systems Biology, New York University, 12 Waverly Place, New York, New York 10003, USA
- Center for Genomics and Systems Biology, New York University Abu Dhabi, Saadiyat Island, PO Box 129188, Abu Dhabi, United Arab Emirates
| | - Reinout M Havinga
- Hortus Botanicus Amsterdam, Plantage Middenlaan 2, 1018 DD, Amsterdam, The Netherlands
| | - Harro Maat
- Knowledge Technology and Innovation group, Social Sciences Department, Wageningen University, PO Box 8130, 6700 EW, Wageningen, The Netherlands
| | - Michael D Purugganan
- Department of Biology, Center for Genomics and Systems Biology, New York University, 12 Waverly Place, New York, New York 10003, USA
- Center for Genomics and Systems Biology, New York University Abu Dhabi, Saadiyat Island, PO Box 129188, Abu Dhabi, United Arab Emirates
| | - Rod A Wing
- Arizona Genomics Institute, School of Plant Sciences, University of Arizona, 1657 E. Helen Street, Tucson, Arizona 85721, USA
- International Rice Research Institute, Genetic Resource Center, Los Baños, Laguna, Philippines
| | - M Eric Schranz
- Biosystematics group, Wageningen University, PO Box 16, 6700 AP, Wageningen, The Netherlands
| |
Collapse
|
13
|
Fransz P, Linc G, Lee C, Aflitos SA, Lasky JR, Toomajian C, Ali H, Peters J, van Dam P, Ji X, Kuzak M, Gerats T, Schubert I, Schneeberger K, Colot V, Martienssen R, Koornneef M, Nordborg M, Juenger TE, de Jong H, Schranz ME. Molecular, genetic and evolutionary analysis of a paracentric inversion in Arabidopsis thaliana. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2016; 88:159-178. [PMID: 27436134 PMCID: PMC5113708 DOI: 10.1111/tpj.13262] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2016] [Revised: 06/29/2016] [Accepted: 07/01/2016] [Indexed: 05/02/2023]
Abstract
Chromosomal inversions can provide windows onto the cytogenetic, molecular, evolutionary and demographic histories of a species. Here we investigate a paracentric 1.17-Mb inversion on chromosome 4 of Arabidopsis thaliana with nucleotide precision of its borders. The inversion is created by Vandal transposon activity, splitting an F-box and relocating a pericentric heterochromatin segment in juxtaposition with euchromatin without affecting the epigenetic landscape. Examination of the RegMap panel and the 1001 Arabidopsis genomes revealed more than 170 inversion accessions in Europe and North America. The SNP patterns revealed historical recombinations from which we infer diverse haplotype patterns, ancient introgression events and phylogenetic relationships. We find a robust association between the inversion and fecundity under drought. We also find linkage disequilibrium between the inverted region and the early flowering Col-FRIGIDA allele. Finally, SNP analysis elucidates the origin of the inversion to South-Eastern Europe approximately 5000 years ago and the FRI-Col allele to North-West Europe, and reveals the spreading of a single haplotype to North America during the 17th to 19th century. The 'American haplotype' was identified from several European localities, potentially due to return migration.
Collapse
Affiliation(s)
- Paul Fransz
- Department of Plant Development and (Epi)GeneticsSwammerdam Institute for Life SciencesUniversity of AmsterdamAmsterdamthe Netherlands
| | - Gabriella Linc
- Department of Plant Development and (Epi)GeneticsSwammerdam Institute for Life SciencesUniversity of AmsterdamAmsterdamthe Netherlands
- Present address: Centre for Agricultural ResearchHungarian Academy of SciencesAgricultural InstituteMartonvásárHungary
| | - Cheng‐Ruei Lee
- Gregor Mendel Institute (GMI)Austrian Academy of SciencesVienna Biocenter (VBC)Dr Bohr‐Gasse 3Vienna1030Austria
| | | | - Jesse R. Lasky
- Department of BiologyPennsylvania State UniversityUniversity ParkPAUSA
| | | | - Hoda Ali
- Department of Cytogenetics and Genome AnalysisThe Leibniz Institute of Plant Genetics and Crop Plant Research (IPK)GaterslebenGermany
- Present address: Department of Genetics and CytologyNational Research CenterCairoEgypt
| | - Janny Peters
- Section Plant GeneticsInstitute for Wetland and Water Research Faculty of ScienceRadboud UniversityNijmegenthe Netherlands
| | - Peter van Dam
- Section Plant GeneticsInstitute for Wetland and Water Research Faculty of ScienceRadboud UniversityNijmegenthe Netherlands
- Present address: Department of Molecular Plant PathologyUniversity of AmsterdamAmsterdamThe Netherlands
| | - Xianwen Ji
- Laboratory of GeneticsWageningen UniversityWageningenthe Netherlands
| | - Mateusz Kuzak
- MAD, Dutch Genomics Service & Support ProviderSwammerdam Institute for Life SciencesUniversity of AmsterdamAmsterdamthe Netherlands
- Present address: Netherlands eScience CenterUniversity of AmsterdamAmsterdamThe Netherlands
| | - Tom Gerats
- Section Plant GeneticsInstitute for Wetland and Water Research Faculty of ScienceRadboud UniversityNijmegenthe Netherlands
| | - Ingo Schubert
- Department of Cytogenetics and Genome AnalysisThe Leibniz Institute of Plant Genetics and Crop Plant Research (IPK)GaterslebenGermany
| | | | - Vincent Colot
- Unité de Recherche en Génomique Végétale (URGV)INRA/CNRS/UEVE 2 Rue Gaston CrémieuxEvry Cedex91057France
- Present address: Institut de Biologie de l'Ecole Normale Supérieure (IBENS)ParisFrance
| | - Rob Martienssen
- Cold Spring Harbor LaboratoryCold Spring HarborNew YorkNY11724USA
| | - Maarten Koornneef
- Laboratory of GeneticsWageningen UniversityWageningenthe Netherlands
- Max Planck Institute for Plant Breeding ResearchKöln50829Germany
| | - Magnus Nordborg
- Gregor Mendel Institute (GMI)Austrian Academy of SciencesVienna Biocenter (VBC)Dr Bohr‐Gasse 3Vienna1030Austria
| | | | - Hans de Jong
- Laboratory of GeneticsWageningen UniversityWageningenthe Netherlands
| | | |
Collapse
|
14
|
Aflitos SA, Severing E, Sanchez-Perez G, Peters S, de Jong H, de Ridder D. Cnidaria: fast, reference-free clustering of raw and assembled genome and transcriptome NGS data. BMC Bioinformatics 2015; 16:352. [PMID: 26525298 PMCID: PMC4630969 DOI: 10.1186/s12859-015-0806-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2015] [Accepted: 10/29/2015] [Indexed: 12/05/2022] Open
Abstract
Background Identification of biological specimens is a requirement for a range of applications. Reference-free methods analyse unprocessed sequencing data without relying on prior knowledge, but generally do not scale to arbitrarily large genomes and arbitrarily large phylogenetic distances. Results We present Cnidaria, a practical tool for clustering genomic and transcriptomic data with no limitation on genome size or phylogenetic distances. We successfully simultaneously clustered 169 genomic and transcriptomic datasets from 4 kingdoms, achieving 100 % identification accuracy at supra-species level and 78 % accuracy at the species level. Conclusion CNIDARIA allows for fast, resource-efficient comparison and identification of both raw and assembled genome and transcriptome data. This can help answer both fundamental (e.g. in phylogeny, ecological diversity analysis) and practical questions (e.g. sequencing quality control, primer design). Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0806-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Saulo Alves Aflitos
- Applied Bioinformatics, Plant Research International, Wageningen, The Netherlands. .,Bioinformatics Group, Department of Plant Sciences, Wageningen University, Wageningen, The Netherlands.
| | - Edouard Severing
- Laboratory of Genetics, Wageningen University, Wageningen, The Netherlands.
| | - Gabino Sanchez-Perez
- Applied Bioinformatics, Plant Research International, Wageningen, The Netherlands. .,Bioinformatics Group, Department of Plant Sciences, Wageningen University, Wageningen, The Netherlands.
| | - Sander Peters
- Applied Bioinformatics, Plant Research International, Wageningen, The Netherlands.
| | - Hans de Jong
- Laboratory of Genetics, Wageningen University, Wageningen, The Netherlands.
| | - Dick de Ridder
- Bioinformatics Group, Department of Plant Sciences, Wageningen University, Wageningen, The Netherlands.
| |
Collapse
|