1
|
Naithani S, Deng CH, Sahu SK, Jaiswal P. Exploring Pan-Genomes: An Overview of Resources and Tools for Unraveling Structure, Function, and Evolution of Crop Genes and Genomes. Biomolecules 2023; 13:1403. [PMID: 37759803 PMCID: PMC10527062 DOI: 10.3390/biom13091403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 08/29/2023] [Accepted: 09/12/2023] [Indexed: 09/29/2023] Open
Abstract
The availability of multiple sequenced genomes from a single species made it possible to explore intra- and inter-specific genomic comparisons at higher resolution and build clade-specific pan-genomes of several crops. The pan-genomes of crops constructed from various cultivars, accessions, landraces, and wild ancestral species represent a compendium of genes and structural variations and allow researchers to search for the novel genes and alleles that were inadvertently lost in domesticated crops during the historical process of crop domestication or in the process of extensive plant breeding. Fortunately, many valuable genes and alleles associated with desirable traits like disease resistance, abiotic stress tolerance, plant architecture, and nutrition qualities exist in landraces, ancestral species, and crop wild relatives. The novel genes from the wild ancestors and landraces can be introduced back to high-yielding varieties of modern crops by implementing classical plant breeding, genomic selection, and transgenic/gene editing approaches. Thus, pan-genomic represents a great leap in plant research and offers new avenues for targeted breeding to mitigate the impact of global climate change. Here, we summarize the tools used for pan-genome assembly and annotations, web-portals hosting plant pan-genomes, etc. Furthermore, we highlight a few discoveries made in crops using the pan-genomic approach and future potential of this emerging field of study.
Collapse
Affiliation(s)
- Sushma Naithani
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA;
| | - Cecilia H. Deng
- Molecular & Digital Breeing Group, New Cultivar Innovation, The New Zealand Institute for Plant and Food Research Limited, Private Bag 92169, Auckland 1142, New Zealand;
| | - Sunil Kumar Sahu
- State Key Laboratory of Agricultural Genomics, Key Laboratory of Genomics, Ministry of Agriculture, BGI Research, Shenzhen 518083, China;
| | - Pankaj Jaiswal
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA;
| |
Collapse
|
2
|
Soriano JM, Alvaro F. Discovering consensus genomic regions in wheat for root-related traits by QTL meta-analysis. Sci Rep 2019; 9:10537. [PMID: 31332216 PMCID: PMC6646344 DOI: 10.1038/s41598-019-47038-2] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2018] [Accepted: 07/09/2019] [Indexed: 11/25/2022] Open
Abstract
Root system architecture is crucial for wheat adaptation to drought stress, but phenotyping for root traits in breeding programmes is difficult and time-consuming owing to the belowground characteristics of the system. Identifying quantitative trait loci (QTLs) and linked molecular markers and using marker-assisted selection is an efficient way to increase selection efficiency and boost genetic gains in breeding programmes. Hundreds of QTLs have been identified for different root traits in the last few years. In the current study, consensus QTL regions were identified through QTL meta-analysis. First, a consensus map comprising 7352 markers was constructed. For the meta-analysis, 754 QTLs were retrieved from the literature and 634 of them were projected onto the consensus map. Meta-analysis grouped 557 QTLs in 94 consensus QTL regions, or meta-QTLs (MQTLs), and 18 QTLs remained as singletons. The recently published genome sequence of wheat was used to search for gene models within the MQTL peaks. As a result, gene models for 68 of the 94 Root_MQTLs were found, 35 of them related to root architecture and/or drought stress response. This work will facilitate QTL cloning and pyramiding to develop new cultivars with specific root architecture for coping with environmental constraints.
Collapse
Affiliation(s)
- Jose Miguel Soriano
- Sustainable Field Crops Programme, IRTA (Institute for Food and Agricultural Research and Technology), Lleida, Spain.
| | - Fanny Alvaro
- Sustainable Field Crops Programme, IRTA (Institute for Food and Agricultural Research and Technology), Lleida, Spain
| |
Collapse
|
3
|
Ashraf GM, Ganash M, Athanasios A. Computational analysis of non-coding RNAs in Alzheimer's disease. Bioinformation 2019; 15:351-357. [PMID: 31249438 PMCID: PMC6589468 DOI: 10.6026/97320630015351] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Accepted: 04/01/2019] [Indexed: 01/09/2023] Open
Abstract
Latest studies have shown that Long Noncoding RNAs corresponds to a crucial factor in neurodegenerative diseases and next-generation therapeutic targets. A wide range of advanced computational methods for the analysis of Noncoding RNAs mainly includes the prediction of RNA and miRNA structures. The problems that concern representations of specific biological structures such as secondary structures are either characterized as NP-complete or with high complexity. Numerous algorithms and techniques related to the enumeration of sequential terms of biological structures and mainly with exponential complexity have been constructed until now. While BACE1-AS, NATRad18, 17A, and hnRNP Q lnRNAs have been found to be associated with Alzheimer's disease, in this research study the significance of the most known β-turn-forming residues between these proteins is computationally identified and discussed, as a potentially crucial factor on the regulation of folding, aggregation and other intermolecular interactions.
Collapse
Affiliation(s)
- Ghulam Md Ashraf
- King Fahd Medical Research Center, King Abdulaziz University, P.O. Box 80216, Jeddah 21589, Saudi Arabia
| | - Magdah Ganash
- Department of Biology, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Alexiou Athanasios
- Novel Global Community Educational Foundation, 7 Peterlee Place, Hebersham, NSW 2770, Australia
- AFNP Med, Austria
| |
Collapse
|
4
|
Ates D, Aldemir S, Alsaleh A, Erdogmus S, Nemli S, Kahriman A, Ozkan H, Vandenberg A, Tanyolac B. A consensus linkage map of lentil based on DArT markers from three RIL mapping populations. PLoS One 2018; 13:e0191375. [PMID: 29351563 PMCID: PMC5774769 DOI: 10.1371/journal.pone.0191375] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2017] [Accepted: 01/03/2018] [Indexed: 12/21/2022] Open
Abstract
Background Lentil (Lens culinaris ssp. culinaris Medikus) is a diploid (2n = 2x = 14), self-pollinating grain legume with a haploid genome size of about 4 Gbp and is grown throughout the world with current annual production of 4.9 million tonnes. Materials and methods A consensus map of lentil (Lens culinaris ssp. culinaris Medikus) was constructed using three different lentils recombinant inbred line (RIL) populations, including “CDC Redberry” x “ILL7502” (LR8), “ILL8006” x “CDC Milestone” (LR11) and “PI320937” x “Eston” (LR39). Results The lentil consensus map was composed of 9,793 DArT markers, covered a total of 977.47 cM with an average distance of 0.10 cM between adjacent markers and constructed 7 linkage groups representing 7 chromosomes of the lentil genome. The consensus map had no gap larger than 12.67 cM and only 5 gaps were found to be between 12.67 cM and 6.0 cM (on LG3 and LG4). The localization of the SNP markers on the lentil consensus map were in general consistent with their localization on the three individual genetic linkage maps and the lentil consensus map has longer map length, higher marker density and shorter average distance between the adjacent markers compared to the component linkage maps. Conclusion This high-density consensus map could provide insight into the lentil genome. The consensus map could also help to construct a physical map using a Bacterial Artificial Chromosome library and map based cloning studies. Sequence information of DArT may help localization of orientation scaffolds from Next Generation Sequencing data.
Collapse
Affiliation(s)
- Duygu Ates
- Department of Bioengineering, Faculty of Engineering, Ege University, Bornova, Izmir, Turkey
| | - Secil Aldemir
- Department of Bioengineering, Faculty of Engineering, Ege University, Bornova, Izmir, Turkey
| | - Ahmad Alsaleh
- Department of Field Crops, Faculty of Agriculture, Cukurova University, Adana, Turkey
| | - Semih Erdogmus
- Department of Bioengineering, Faculty of Engineering, Ege University, Bornova, Izmir, Turkey
| | - Seda Nemli
- Department of Bieoengineering and Genetics, Gumushane University, Gumushane, Turkey
| | - Abdullah Kahriman
- Department of Field Crops, Faculty of Agriculture, Harran University, Sanlı Urfa, Turkey
| | - Hakan Ozkan
- Department of Field Crops, Faculty of Agriculture, Cukurova University, Adana, Turkey
| | - Albert Vandenberg
- Crop Development Centre, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Bahattin Tanyolac
- Department of Bioengineering, Faculty of Engineering, Ege University, Bornova, Izmir, Turkey
- * E-mail:
| |
Collapse
|
5
|
Wang H, van Eeuwijk FA, Jansen J. The potential of probabilistic graphical models in linkage map construction. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2017; 130:433-444. [PMID: 27921120 PMCID: PMC5263214 DOI: 10.1007/s00122-016-2824-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/14/2016] [Accepted: 11/03/2016] [Indexed: 06/06/2023]
Abstract
Probabilistic graphical models show great potential for robust and reliable construction of linkage maps. We show how to use probabilistic graphical models to construct high-quality linkage maps in the face of data perturbations caused by genotyping errors and reciprocal translocations. It has been shown that linkage map construction can be hampered by the presence of genotyping errors and chromosomal rearrangements such as inversions and translocations. Here, we report a novel method for linkage map construction using probabilistic graphical models. The method is proven, both theoretically and practically, to be effective in filtering out markers that contain genotyping errors. In particular, it carries out marker filtering and ordering simultaneously, and is therefore superior to the standard post hoc filtering using nearest-neighbour stress. Furthermore, we demonstrate empirically that the proposed method offers a promising solution to linkage map construction in the case of a reciprocal translocation.
Collapse
Affiliation(s)
- Huange Wang
- Biometris, Wageningen University and Research Centre, P.O. Box 16, 6700 AA, Wageningen, The Netherlands.
| | - Fred A van Eeuwijk
- Biometris, Wageningen University and Research Centre, P.O. Box 16, 6700 AA, Wageningen, The Netherlands
| | - Johannes Jansen
- Biometris, Wageningen University and Research Centre, P.O. Box 16, 6700 AA, Wageningen, The Netherlands
| |
Collapse
|
6
|
Lenz RR, Dai W. Mapping X-Disease Phytoplasma Resistance in Prunus virginiana. FRONTIERS IN PLANT SCIENCE 2017; 8:2057. [PMID: 29238359 PMCID: PMC5712551 DOI: 10.3389/fpls.2017.02057] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2017] [Accepted: 11/17/2017] [Indexed: 05/10/2023]
Abstract
Phytoplasmas such as "Candidatus Phytoplasma pruni," the causal agent of X-disease of stone fruits, lack detailed biological analysis. This has limited the understanding of plant resistance mechanisms. Chokecherry (Prunus virginiana L.) is a promising model to be used for the plant-phytoplasma interaction due to its documented ability to resist X-disease infection. A consensus chokecherry genetic map "Cho" was developed with JoinMap 4.0 by joining two parental maps. The new map contains a complete set of 16 linkage groups, spanning a genetic distance of 2,172 cM with an average marker density of 3.97 cM. Three significant quantitative trait loci (QTL) associated with X-disease resistance were identified contributing to a total of 45.9% of the phenotypic variation. This updated genetic linkage map and the identified QTL will provide the framework needed to facilitate molecular genetics, genomics, breeding, and biotechnology research concerning X-disease in chokecherry and other Prunus species.
Collapse
|
7
|
Wen W, He Z, Gao F, Liu J, Jin H, Zhai S, Qu Y, Xia X. A High-Density Consensus Map of Common Wheat Integrating Four Mapping Populations Scanned by the 90K SNP Array. FRONTIERS IN PLANT SCIENCE 2017; 8:1389. [PMID: 28848588 PMCID: PMC5552701 DOI: 10.3389/fpls.2017.01389] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2017] [Accepted: 07/25/2017] [Indexed: 05/04/2023]
Abstract
A high-density consensus map is a powerful tool for gene mapping, cloning and molecular marker-assisted selection in wheat breeding. The objective of this study was to construct a high-density, single nucleotide polymorphism (SNP)-based consensus map of common wheat (Triticum aestivum L.) by integrating genetic maps from four recombinant inbred line populations. The populations were each genotyped using the wheat 90K Infinium iSelect SNP assay. A total of 29,692 SNP markers were mapped on 21 linkage groups corresponding to 21 hexaploid wheat chromosomes, covering 2,906.86 cM, with an overall marker density of 10.21 markers/cM. Compared with the previous maps based on the wheat 90K SNP chip detected 22,736 (76.6%) of the SNPs with consistent chromosomal locations, whereas 1,974 (6.7%) showed different chromosomal locations, and 4,982 (16.8%) were newly mapped. Alignment of the present consensus map and the wheat expressed sequence tags (ESTs) Chromosome Bin Map enabled assignment of 1,221 SNP markers to specific chromosome bins and 819 ESTs were integrated into the consensus map. The marker orders of the consensus map were validated based on physical positions on the wheat genome with Spearman rank correlation coefficients ranging from 0.69 (4D) to 0.97 (1A, 4B, 5B, and 6A), and were also confirmed by comparison with genetic position on the previously 40K SNP consensus map with Spearman rank correlation coefficients ranging from 0.84 (6D) to 0.99 (6A). Chromosomal rearrangements reported previously were confirmed in the present consensus map and new putative rearrangements were identified. In addition, an integrated consensus map was developed through the combination of five published maps with ours, containing 52,607 molecular markers. The consensus map described here provided a high-density SNP marker map and a reliable order of SNPs, representing a step forward in mapping and validation of chromosomal locations of SNPs on the wheat 90K array. Moreover, it can be used as a reference for quantitative trait loci (QTL) mapping to facilitate exploitation of genes and QTL in wheat breeding.
Collapse
Affiliation(s)
- Weie Wen
- College of Agronomy, Xinjiang Agricultural UniversityUrumqi, China
- National Wheat Improvement Center, Institute of Crop Science, Chinese Academy of Agricultural SciencesBeijing, China
| | - Zhonghu He
- National Wheat Improvement Center, Institute of Crop Science, Chinese Academy of Agricultural SciencesBeijing, China
- International Maize and Wheat Improvement Center (CIMMYT)Beijing, China
| | - Fengmei Gao
- Crop Breeding Institute, Heilongjiang Academy of Agricultural SciencesHarbin, China
| | - Jindong Liu
- National Wheat Improvement Center, Institute of Crop Science, Chinese Academy of Agricultural SciencesBeijing, China
| | - Hui Jin
- National Wheat Improvement Center, Institute of Crop Science, Chinese Academy of Agricultural SciencesBeijing, China
| | - Shengnan Zhai
- National Wheat Improvement Center, Institute of Crop Science, Chinese Academy of Agricultural SciencesBeijing, China
| | - Yanying Qu
- College of Agronomy, Xinjiang Agricultural UniversityUrumqi, China
- *Correspondence: Yanying Qu, Xianchun Xia,
| | - Xianchun Xia
- National Wheat Improvement Center, Institute of Crop Science, Chinese Academy of Agricultural SciencesBeijing, China
- *Correspondence: Yanying Qu, Xianchun Xia,
| |
Collapse
|
8
|
Fowler DB, N'Diaye A, Laudencia-Chingcuanco D, Pozniak CJ. Quantitative Trait Loci Associated with Phenological Development, Low-Temperature Tolerance, Grain Quality, and Agronomic Characters in Wheat (Triticum aestivum L.). PLoS One 2016; 11:e0152185. [PMID: 27019468 PMCID: PMC4809511 DOI: 10.1371/journal.pone.0152185] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2015] [Accepted: 03/10/2016] [Indexed: 12/14/2022] Open
Abstract
Plants must respond to environmental cues and schedule their development in order to react to periods of abiotic stress and commit fully to growth and reproduction under favorable conditions. This study was initiated to identify SNP markers for characters expressed from the seedling stage to plant maturity in spring and winter wheat (Triticum aestivum L.) genotypes adapted to western Canada. Three doubled haploid populations with the winter cultivar ‘Norstar’ as a common parent were developed and genotyped with a 90K Illumina iSelect SNP assay and a 2,998.9 cM consensus map with 17,541 markers constructed. High heritability’s reflected large differences among the parents and relatively low genotype by environment interactions for all characters considered. Significant QTL were detected for the 15 traits examined. However, different QTL for days to heading in controlled environments and the field provided a strong reminder that growth and development are being orchestrated by environmental cues and caution should be exercised when extrapolating conclusions from different experiments. A QTL on chromosome 6A for minimum final leaf number, which determines the rate of phenological development in the seedling stage, was closely linked to QTL for low-temperature tolerance, grain quality, and agronomic characters expressed up to the time of maturity. This suggests phenological development plays a critical role in programming subsequent outcomes for many traits. Transgressive segregation was observed for the lines in each population and QTL with additive effects were identified suggesting that genes for desirable traits could be stacked using Marker Assisted Selection. QTL were identified for characters that could be transferred between the largely isolated western Canadian spring and winter wheat gene pools demonstrating the opportunities offered by Marker Assisted Selection to act as bridges in the identification and transfer of useful genes among related genetic islands while minimizing the drag created by less desirable genes.
Collapse
Affiliation(s)
- D B Fowler
- Department of Plant Sciences and Crop Development Centre, University of Saskatchewan, Saskatoon, SK, Canada, S7N 5A8
| | - A N'Diaye
- Department of Plant Sciences and Crop Development Centre, University of Saskatchewan, Saskatoon, SK, Canada, S7N 5A8
| | - D Laudencia-Chingcuanco
- Crop Improvement and Genetics Research Unit, USDA-ARS WRRC, 800 Buchanan St. Albany, CA, United States of America, 94710
| | - C J Pozniak
- Department of Plant Sciences and Crop Development Centre, University of Saskatchewan, Saskatoon, SK, Canada, S7N 5A8
| |
Collapse
|
9
|
Fast and accurate construction of ultra-dense consensus genetic maps using evolution strategy optimization. PLoS One 2015; 10:e0122485. [PMID: 25867943 PMCID: PMC4395089 DOI: 10.1371/journal.pone.0122485] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2014] [Accepted: 02/22/2015] [Indexed: 11/19/2022] Open
Abstract
Our aim was to develop a fast and accurate algorithm for constructing consensus genetic maps for chip-based SNP genotyping data with a high proportion of shared markers between mapping populations. Chip-based genotyping of SNP markers allows producing high-density genetic maps with a relatively standardized set of marker loci for different mapping populations. The availability of a standard high-throughput mapping platform simplifies consensus analysis by ignoring unique markers at the stage of consensus mapping thereby reducing mathematical complicity of the problem and in turn analyzing bigger size mapping data using global optimization criteria instead of local ones. Our three-phase analytical scheme includes automatic selection of ~100-300 of the most informative (resolvable by recombination) markers per linkage group, building a stable skeletal marker order for each data set and its verification using jackknife re-sampling, and consensus mapping analysis based on global optimization criterion. A novel Evolution Strategy optimization algorithm with a global optimization criterion presented in this paper is able to generate high quality, ultra-dense consensus maps, with many thousands of markers per genome. This algorithm utilizes "potentially good orders" in the initial solution and in the new mutation procedures that generate trial solutions, enabling to obtain a consensus order in reasonable time. The developed algorithm, tested on a wide range of simulated data and real world data (Arabidopsis), outperformed two tested state-of-the-art algorithms by mapping accuracy and computation time.
Collapse
|
10
|
Plomion C, Chancerel E, Endelman J, Lamy JB, Mandrou E, Lesur I, Ehrenmann F, Isik F, Bink MCAM, van Heerwaarden J, Bouffier L. Genome-wide distribution of genetic diversity and linkage disequilibrium in a mass-selected population of maritime pine. BMC Genomics 2014; 15:171. [PMID: 24581176 PMCID: PMC4029062 DOI: 10.1186/1471-2164-15-171] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2013] [Accepted: 02/21/2014] [Indexed: 12/14/2022] Open
Abstract
Background The accessibility of high-throughput genotyping technologies has contributed greatly to the development of genomic resources in non-model organisms. High-density genotyping arrays have only recently been developed for some economically important species such as conifers. The potential for using genomic technologies in association mapping and breeding depends largely on the genome wide patterns of diversity and linkage disequilibrium in current breeding populations. This study aims to deepen our knowledge regarding these issues in maritime pine, the first species used for reforestation in south western Europe. Results Using a new map merging algorithm, we first established a 1,712 cM composite linkage map (comprising 1,838 SNP markers in 12 linkage groups) by bringing together three already available genetic maps. Using rigorous statistical testing based on kernel density estimation and resampling we identified cold and hot spots of recombination. In parallel, 186 unrelated trees of a mass-selected population were genotyped using a 12k-SNP array. A total of 2,600 informative SNPs allowed to describe historical recombination, genetic diversity and genetic structure of this recently domesticated breeding pool that forms the basis of much of the current and future breeding of this species. We observe very low levels of population genetic structure and find no evidence that artificial selection has caused a reduction in genetic diversity. By combining these two pieces of information, we provided the map position of 1,671 SNPs corresponding to 1,192 different loci. This made it possible to analyze the spatial pattern of genetic diversity (He) and long distance linkage disequilibrium (LD) along the chromosomes. We found no particular pattern in the empirical variogram of He across the 12 linkage groups and, as expected for an outcrossing species with large effective population size, we observed an almost complete lack of long distance LD. Conclusions These results are a stepping stone for the development of strategies for studies in population genomics, association mapping and genomic prediction in this economical and ecologically important forest tree species.
Collapse
|
11
|
Satovic Z, Avila CM, Cruz-Izquierdo S, Díaz-Ruíz R, García-Ruíz GM, Palomino C, Gutiérrez N, Vitale S, Ocaña-Moral S, Gutiérrez MV, Cubero JI, Torres AM. A reference consensus genetic map for molecular markers and economically important traits in faba bean (Vicia faba L.). BMC Genomics 2013; 14:932. [PMID: 24377374 PMCID: PMC3880837 DOI: 10.1186/1471-2164-14-932] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2013] [Accepted: 12/12/2013] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Faba bean (Vicia faba L.) is among the earliest domesticated crops from the Near East. Today this legume is a key protein feed and food worldwide and continues to serve an important role in culinary traditions throughout Middle East, Mediterranean region, China and Ethiopia. Adapted to a wide range of soil types, the main faba bean breeding objectives are to improve yield, resistance to biotic and abiotic stresses, seed quality and other agronomic traits. Genomic approaches aimed at enhancing faba bean breeding programs require high-quality genetic linkage maps to facilitate quantitative trait locus analysis and gene tagging for use in a marker-assisted selection. The objective of this study was to construct a reference consensus map in faba bean by joining the information from the most relevant maps reported so far in this crop. RESULTS A combination of two approaches, increasing the number of anchor loci in diverse mapping populations and joining the corresponding genetic maps, was used to develop a reference consensus map in faba bean. The map was constructed from three main recombinant inbreed populations derived from four parental lines, incorporates 729 markers and is based on 69 common loci. It spans 4,602 cM with a range from 323 to 1041 loci in six main linkage groups or chromosomes, and an average marker density of one locus every 6 cM. Locus order is generally well maintained between the consensus map and the individual maps. CONCLUSION We have constructed a reliable and fairly dense consensus genetic linkage map that will serve as a basis for genomic approaches in faba bean research and breeding. The core map contains a larger number of markers than any previous individual map, covers existing gaps and achieves a wider coverage of the large faba bean genome as a whole. This tool can be used as a reference resource for studies in different genetic backgrounds, and provides a framework for transferring genetic information when using different marker technologies. Combined with syntenic approaches, the consensus map will increase marker density in selected genomic regions and will be useful for future faba bean molecular breeding applications.
Collapse
Affiliation(s)
- Zlatko Satovic
- IFAPA, Centro Alameda del Obispo, Área de Mejora y Biotecnología, Avda. Menéndez Pidal s/n, Apdo. 3092, Córdoba 14080, Spain
- Present addresses: Department of Seed Science and Technology, Faculty of Agriculture, University of Zagreb, Zagreb, Croatia
| | - Carmen M Avila
- IFAPA, Centro Alameda del Obispo, Área de Mejora y Biotecnología, Avda. Menéndez Pidal s/n, Apdo. 3092, Córdoba 14080, Spain
| | - Serafin Cruz-Izquierdo
- IFAPA, Centro Alameda del Obispo, Área de Mejora y Biotecnología, Avda. Menéndez Pidal s/n, Apdo. 3092, Córdoba 14080, Spain
- Colegio de Postgraduados, Recursos Genéticos y Productividad – Genética, Campus Montecillo, Km 36.5 Carretera México-Texcoco, C.P., Texcoco, Edo. de México 56230, México
| | - Ramón Díaz-Ruíz
- IFAPA, Centro Alameda del Obispo, Área de Mejora y Biotecnología, Avda. Menéndez Pidal s/n, Apdo. 3092, Córdoba 14080, Spain
- Colegio de Postgraduados, Campus Puebla, Km 125.5 Carretera México-Puebla, C.P., Puebla, Pue 72760, México
| | - Gloria M García-Ruíz
- IFAPA, Centro Alameda del Obispo, Área de Mejora y Biotecnología, Avda. Menéndez Pidal s/n, Apdo. 3092, Córdoba 14080, Spain
| | - Carmen Palomino
- IFAPA, Centro Alameda del Obispo, Área de Mejora y Biotecnología, Avda. Menéndez Pidal s/n, Apdo. 3092, Córdoba 14080, Spain
| | - Natalia Gutiérrez
- IFAPA, Centro Alameda del Obispo, Área de Mejora y Biotecnología, Avda. Menéndez Pidal s/n, Apdo. 3092, Córdoba 14080, Spain
| | - Stefania Vitale
- IFAPA, Centro Alameda del Obispo, Área de Mejora y Biotecnología, Avda. Menéndez Pidal s/n, Apdo. 3092, Córdoba 14080, Spain
| | - Sara Ocaña-Moral
- IFAPA, Centro Alameda del Obispo, Área de Mejora y Biotecnología, Avda. Menéndez Pidal s/n, Apdo. 3092, Córdoba 14080, Spain
| | - María Victoria Gutiérrez
- IFAPA, Centro Alameda del Obispo, Área de Mejora y Biotecnología, Avda. Menéndez Pidal s/n, Apdo. 3092, Córdoba 14080, Spain
| | - José I Cubero
- Departamento de Mejora Genética, IAS-CSIC, Apdo. 4084, Córdoba 14080, Spain
| | - Ana M Torres
- IFAPA, Centro Alameda del Obispo, Área de Mejora y Biotecnología, Avda. Menéndez Pidal s/n, Apdo. 3092, Córdoba 14080, Spain
| |
Collapse
|
12
|
A Retrospective on Genomic Preprocessing for Comparative Genomics. MODELS AND ALGORITHMS FOR GENOME EVOLUTION 2013. [DOI: 10.1007/978-1-4471-5298-9_9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
|
13
|
Khan MA, Han Y, Zhao YF, Troggio M, Korban SS. A multi-population consensus genetic map reveals inconsistent marker order among maps likely attributed to structural variations in the apple genome. PLoS One 2012; 7:e47864. [PMID: 23144832 PMCID: PMC3489900 DOI: 10.1371/journal.pone.0047864] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2012] [Accepted: 09/19/2012] [Indexed: 12/26/2022] Open
Abstract
Genetic maps serve as frameworks for determining the genetic architecture of quantitative traits, assessing structure of a genome, as well as aid in pursuing association mapping and comparative genetic studies. In this study, a dense genetic map was constructed using a high-throughput 1,536 EST-derived SNP GoldenGate genotyping platform and a global consensus map established by combining the new genetic map with four existing reliable genetic maps of apple. The consensus map identified markers with both major and minor conflicts in positioning across all five maps. These major inconsistencies among marker positions were attributed either to structural variations within the apple genome, or among mapping populations, or genotyping technical errors. These also highlighted problems in assembly and anchorage of the reference draft apple genome sequence in regions with known segmental duplications. Markers common across all five apple genetic maps resulted in successful positioning of 2875 markers, consisting of 2033 SNPs and 843 SSRs as well as other specific markers, on the global consensus map. These markers were distributed across all 17 linkage groups, with an average of 169±33 marker per linkage group and with an average distance of 0.70±0.14 cM between markers. The total length of the consensus map was 1991.38 cM with an average length of 117.14±24.43 cM per linkage group. A total of 569 SNPs were mapped onto the genetic map, consisting of 140 recombinant individuals, from our recently developed apple Oligonucleotide pool assays (OPA). The new functional SNPs, along with the dense consensus genetic map, will be useful for high resolution QTL mapping of important traits in apple and for pursuing comparative genetic studies in Rosaceae.
Collapse
Affiliation(s)
- Muhammad Awais Khan
- Department of Natural Resources & Environmental Sciences, University of Illinois, Urbana, Illinois, United States of America
| | - Yuepeng Han
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Moshan, Wuhan, People's Republic of China
| | - Youfu Frank Zhao
- Department of Crop Sciences, University of Illinois, Urbana, Illinois, United States of America
| | - Michela Troggio
- Istituto Agrario San Michele all'Adige Research and Innovation Centre, Foundation Edmund Mach, Trento, Italy
| | - Schuyler S. Korban
- Department of Natural Resources & Environmental Sciences, University of Illinois, Urbana, Illinois, United States of America
| |
Collapse
|
14
|
Two-phase analysis in consensus genetic mapping. G3-GENES GENOMES GENETICS 2012; 2:537-49. [PMID: 22670224 PMCID: PMC3362937 DOI: 10.1534/g3.112.002428] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/19/2012] [Accepted: 03/01/2012] [Indexed: 01/25/2023]
Abstract
Numerous mapping projects conducted on different species have generated an abundance of mapping data. Consequently, many multilocus maps have been constructed using diverse mapping populations and marker sets for the same organism. The quality of maps varies broadly among populations, marker sets, and software used, necessitating efforts to integrate the mapping information and generate consensus maps. The problem of consensus genetic mapping (MCGM) is by far more challenging compared with genetic mapping based on a single dataset, which by itself is also cumbersome. The additional complications introduced by consensus analysis include inter-population differences in recombination rate and exchange distribution along chromosomes; variations in dominance of the employed markers; and use of different subsets of markers in different labs. Hence, it is necessary to handle arbitrary patterns of shared sets of markers and different level of mapping data quality. In this article, we introduce a two-phase approach for solving MCGM. In phase 1, for each dataset, multilocus ordering is performed combined with iterative jackknife resampling to evaluate the stability of marker orders. In this phase, the ordering problem is reduced to the well-known traveling salesperson problem (TSP). Namely, for each dataset, we look for order that gives minimum sum of recombination distances between adjacent markers. In phase 2, the optimal consensus order of shared markers is selected from the set of allowed orders and gives the minimal sum of total lengths of nonconflicting maps of the chromosome. This criterion may be used in different modifications to take into account the variation in quality of the original data (population size, marker quality, etc.). In the foregoing formulation, consensus mapping is considered as a specific version of TSP that can be referred to as “synchronized TSP.” The conflicts detected after phase 1 are resolved using either a heuristic algorithm over the entire chromosome or an exact/heuristic algorithm applied subsequently to the revealed small non-overlapping regions with conflicts separated by non-conflicting regions. The proposed approach was tested on a wide range of simulated data and real datasets from maize.
Collapse
|
15
|
Galeano CH, Fernandez AC, Franco-Herrera N, Cichy KA, McClean PE, Vanderleyden J, Blair MW. Saturation of an intra-gene pool linkage map: towards a unified consensus linkage map for fine mapping and synteny analysis in common bean. PLoS One 2011; 6:e28135. [PMID: 22174773 PMCID: PMC3234260 DOI: 10.1371/journal.pone.0028135] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2011] [Accepted: 11/01/2011] [Indexed: 11/19/2022] Open
Abstract
Map-based cloning and fine mapping to find genes of interest and marker assisted selection (MAS) requires good genetic maps with reproducible markers. In this study, we saturated the linkage map of the intra-gene pool population of common bean DOR364 × BAT477 (DB) by evaluating 2,706 molecular markers including SSR, SNP, and gene-based markers. On average the polymorphism rate was 7.7% due to the narrow genetic base between the parents. The DB linkage map consisted of 291 markers with a total map length of 1,788 cM. A consensus map was built using the core mapping populations derived from inter-gene pool crosses: DOR364 × G19833 (DG) and BAT93 × JALO EEP558 (BJ). The consensus map consisted of a total of 1,010 markers mapped, with a total map length of 2,041 cM across 11 linkage groups. On average, each linkage group on the consensus map contained 91 markers of which 83% were single copy markers. Finally, a synteny analysis was carried out using our highly saturated consensus maps compared with the soybean pseudo-chromosome assembly. A total of 772 marker sequences were compared with the soybean genome. A total of 44 syntenic blocks were identified. The linkage group Pv6 presented the most diverse pattern of synteny with seven syntenic blocks, and Pv9 showed the most consistent relations with soybean with just two syntenic blocks. Additionally, a co-linear analysis using common bean transcript map information against soybean coding sequences (CDS) revealed the relationship with 787 soybean genes. The common bean consensus map has allowed us to map a larger number of markers, to obtain a more complete coverage of the common bean genome. Our results, combined with synteny relationships provide tools to increase marker density in selected genomic regions to identify closely linked polymorphic markers for indirect selection, fine mapping or for positional cloning.
Collapse
Affiliation(s)
- Carlos H. Galeano
- Department of Microbial and Molecular Systems, Centre of Microbial and Plant Genetics, K.U. Leuven, Heverlee, Belgium
| | - Andrea C. Fernandez
- Sugarbeet and Bean Research Unit, Agricultural Research Service, United States Department of Agriculture, East Lansing, Michigan, United States of America
| | | | - Karen A. Cichy
- Sugarbeet and Bean Research Unit, Agricultural Research Service, United States Department of Agriculture, East Lansing, Michigan, United States of America
| | - Phillip E. McClean
- Genomics and Bioinformatics Program, North Dakota State University, Fargo, North Dakota, United States of America
- Department of Plant Sciences, North Dakota State University, Fargo, North Dakota, United States of America
| | - Jos Vanderleyden
- Department of Microbial and Molecular Systems, Centre of Microbial and Plant Genetics, K.U. Leuven, Heverlee, Belgium
| | - Matthew W. Blair
- International Center for Tropical Agriculture (CIAT) Bean Project, Cali, Colombia
| |
Collapse
|
16
|
Endelman JB. New algorithm improves fine structure of the barley consensus SNP map. BMC Genomics 2011; 12:407. [PMID: 21831315 PMCID: PMC3179964 DOI: 10.1186/1471-2164-12-407] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2011] [Accepted: 08/10/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The need to integrate information from multiple linkage maps is a long-standing problem in genetics. One way to visualize the complex ordinal relationships is with a directed graph, where each vertex in the graph is a bin of markers. When there are no ordering conflicts between the linkage maps, the result is a directed acyclic graph, or DAG, which can then be linearized to produce a consensus map. RESULTS New algorithms for the simplification and linearization of consensus graphs have been implemented as a package for the R computing environment called DAGGER. The simplified consensus graphs produced by DAGGER exactly capture the ordinal relationships present in a series of linkage maps. Using either linear or quadratic programming, DAGGER generates a consensus map with minimum error relative to the linkage maps while remaining ordinally consistent with them. Both linearization methods produce consensus maps that are compressed relative to the mean of the linkage maps. After rescaling, however, the consensus maps had higher accuracy (and higher marker density) than the individual linkage maps in genetic simulations. When applied to four barley linkage maps genotyped at nearly 3000 SNP markers, DAGGER produced a consensus map with improved fine structure compared to the existing barley consensus SNP map. The root-mean-squared error between the linkage maps and the DAGGER map was 0.82 cM per marker interval compared to 2.28 cM for the existing consensus map. Examination of the barley hardness locus at the 5HS telomere, for which there is a physical map, confirmed that the DAGGER output was more accurate for fine structure analysis. CONCLUSIONS The R package DAGGER is an effective, freely available resource for integrating the information from a set of consistent linkage maps.
Collapse
Affiliation(s)
- Jeffrey B Endelman
- Department of Crop and Soil Sciences, Washington State University, 16650 State Route 536, Mount Vernon, WA 98273, USA.
| |
Collapse
|
17
|
Wu Y, Close TJ, Lonardi S. Accurate construction of consensus genetic maps via integer linear programming. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011; 8:381-394. [PMID: 20479505 DOI: 10.1109/tcbb.2010.35] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
We study the problem of merging genetic maps, when the individual genetic maps are given as directed acyclic graphs. The computational problem is to build a consensus map, which is a directed graph that includes and is consistent with all (or, the vast majority of) the markers in the input maps. However, when markers in the individual maps have ordering conflicts, the resulting consensus map will contain cycles. Here, we formulate the problem of resolving cycles in the context of a parsimonious paradigm that takes into account two types of errors that may be present in the input maps, namely, local reshuffles and global displacements. The resulting combinatorial optimization problem is, in turn, expressed as an integer linear program. A fast approximation algorithm is proposed, and an additional speedup heuristic is developed. Our algorithms were implemented in a software tool named MERGEMAP which is freely available for academic use. An extensive set of experiments shows that MERGEMAP consistently outperforms JOINMAP, which is the most popular tool currently available for this task, both in terms of accuracy and running time. MERGEMAP is available for download at http://www.cs.ucr.edu/~yonghui/mgmap.html.
Collapse
Affiliation(s)
- Yonghui Wu
- Google, Inc., 1600 Amphitheatre Parkway, Mountain View, CA 94043, USA.
| | | | | |
Collapse
|
18
|
Wang J, Lydiate DJ, Parkin IAP, Falentin C, Delourme R, Carion PWC, King GJ. Integration of linkage maps for the Amphidiploid Brassica napus and comparative mapping with Arabidopsis and Brassica rapa. BMC Genomics 2011; 12:101. [PMID: 21306613 PMCID: PMC3042011 DOI: 10.1186/1471-2164-12-101] [Citation(s) in RCA: 99] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2010] [Accepted: 02/09/2011] [Indexed: 12/25/2022] Open
Abstract
BACKGROUND The large number of genetic linkage maps representing Brassica chromosomes constitute a potential platform for studying crop traits and genome evolution within Brassicaceae. However, the alignment of existing maps remains a major challenge. The integration of these genetic maps will enhance genetic resolution, and provide a means to navigate between sequence-tagged loci, and with contiguous genome sequences as these become available. RESULTS We report the first genome-wide integration of Brassica maps based on an automated pipeline which involved collation of genome-wide genotype data for sequence-tagged markers scored on three extensively used amphidiploid Brassica napus (2n = 38) populations. Representative markers were selected from consolidated maps for each population, and skeleton bin maps were generated. The skeleton maps for the three populations were then combined to generate an integrated map for each LG, comparing two different approaches, one encapsulated in JoinMap and the other in MergeMap. The BnaWAIT_01_2010a integrated genetic map was generated using JoinMap, and includes 5,162 genetic markers mapped onto 2,196 loci, with a total genetic length of 1,792 cM. The map density of one locus every 0.82 cM, corresponding to 515 Kbp, increases by at least three-fold the locus and marker density within the original maps. Within the B. napus integrated map we identified 103 conserved collinearity blocks relative to Arabidopsis, including five previously unreported blocks. The BnaWAIT_01_2010a map was used to investigate the integrity and conservation of order proposed for genome sequence scaffolds generated from the constituent A genome of Brassica rapa. CONCLUSIONS Our results provide a comprehensive genetic integration of the B. napus genome from a range of sources, which we anticipate will provide valuable information for rapeseed and Canola research.
Collapse
Affiliation(s)
- Jun Wang
- Department of Plant Sciences, Rothamsted Research, Harpenden, AL5 2JQ, UK
| | | | | | | | | | | | | |
Collapse
|
19
|
Chen X, Yang JY. Constructing consensus genetic maps in comparative analysis. J Comput Biol 2010; 17:1561-73. [PMID: 20958247 DOI: 10.1089/cmb.2009.0268] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The construction of consensus genetic maps is a very challenging problem in computational biology. Many computational approaches have been proposed on the basis of only the marker order relations provided by a given set of individual genetic maps. In this article, we propose a comparative approach to constructing consensus genetic maps for a genome, which further takes into account the order relations from a closely related genome when resolving ordering conflicts among individual genetic maps. It aims to retain as many order relations as possible from individual genetic maps while achieving the minimum rearrangement distance to the reference genome. We implement the proposed approach as an integer linear program and test it on both simulated and real biological datasets. The experimental results show that it is capable of constructing more accurate consensus genetic maps than the most recent approach called MergeMap.
Collapse
Affiliation(s)
- Xin Chen
- Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore.
| | | |
Collapse
|
20
|
Construction of a high-density composite map and comparative mapping of segregation distortion regions in barley. Mol Genet Genomics 2010; 284:319-31. [DOI: 10.1007/s00438-010-0570-3] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2010] [Accepted: 08/13/2010] [Indexed: 11/26/2022]
|
21
|
Jow H, Bhattacharjee M, Boys R, Wilkinson D. The integration of genetic maps using Bayesian inference. J Comput Biol 2010; 17:825-40. [PMID: 20583928 DOI: 10.1089/cmb.2008.0243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In the absence of a comprehensive sequence-based map of a species' genome, genetic maps constitute the next best source of genetic information. Information derived from such maps can be used, for example, in identifying the genes that form quantitative trait loci (QTLs) and for performing comparative genomics between species. Integrating information from a collection of maps will provide more accurate inferences on, for example, marker locations. We describe a method for integrating (possibly conflicting) experimentally derived genetic maps. It assumes a fully probabilistic model that describes the relationship between experimentally derived genetic maps and the integrated map. The model views experimentally derived maps for a given species' chromosome as noisy realisations of a single "true" map, where the noise consists of possible linear distortions and measurement error on the marker locations. Bayesian statistical inference methodology is then used to infer the integrated map (the "true" map) and its attendant uncertainties in the marker locations by using data from a number of experimentally determined genetic maps. The method is shown to work well on simulated data and is used to integrate linkage maps of Pig chromosome 6 and also linkage and radiation hybrid maps of Cow chromosome 1.
Collapse
Affiliation(s)
- Howsun Jow
- School of Mathematics and Statistics, Newcastle University, Newcastle upon Tyne, United Kingdom
| | | | | | | |
Collapse
|
22
|
Yu J, Kohel RJ, Smith CW. The construction of a tetraploid cotton genome wide comprehensive reference map. Genomics 2010; 95:230-40. [PMID: 20171271 DOI: 10.1016/j.ygeno.2010.02.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2009] [Revised: 01/17/2010] [Accepted: 02/03/2010] [Indexed: 10/19/2022]
Abstract
Integration of multiple genomic maps provides a higher density of markers and greater genome coverage, which not only facilitates the identification and positioning of QTLs and candidate genes, but it also provides a basic structure for the genome sequence assembly. However, the diversity in markers and populations used in individual mapping studies limits the ability to fully integrate the available data. By concentrating on marker orders rather than marker distances, published map data could be used to produce a comprehensive reference map (CRM) that includes a majority of known markers with optimally estimated order of those markers across the genome. In this study, a tetraploid cotton genome-wide CRM was constructed from 28 public cotton genetic maps. The initial CRM contained 7,424 markers and represented over 93% of the combined mapping information from the 28 individual maps. The current output is stored and displayed through CottonDB (http://www.cottondb.org), the public cotton genome database.
Collapse
Affiliation(s)
- Jing Yu
- The Department of Soil and Crop Sciences, Texas A&M University, TX, USA.
| | | | | |
Collapse
|
23
|
Close TJ, Bhat PR, Lonardi S, Wu Y, Rostoks N, Ramsay L, Druka A, Stein N, Svensson JT, Wanamaker S, Bozdag S, Roose ML, Moscou MJ, Chao S, Varshney RK, Szucs P, Sato K, Hayes PM, Matthews DE, Kleinhofs A, Muehlbauer GJ, DeYoung J, Marshall DF, Madishetty K, Fenton RD, Condamine P, Graner A, Waugh R. Development and implementation of high-throughput SNP genotyping in barley. BMC Genomics 2009. [PMID: 19961604 DOI: 10.1186/1471‐2164‐10‐582] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND High density genetic maps of plants have, nearly without exception, made use of marker datasets containing missing or questionable genotype calls derived from a variety of genic and non-genic or anonymous markers, and been presented as a single linear order of genetic loci for each linkage group. The consequences of missing or erroneous data include falsely separated markers, expansion of cM distances and incorrect marker order. These imperfections are amplified in consensus maps and problematic when fine resolution is critical including comparative genome analyses and map-based cloning. Here we provide a new paradigm, a high-density consensus genetic map of barley based only on complete and error-free datasets and genic markers, represented accurately by graphs and approximately by a best-fit linear order, and supported by a readily available SNP genotyping resource. RESULTS Approximately 22,000 SNPs were identified from barley ESTs and sequenced amplicons; 4,596 of them were tested for performance in three pilot phase Illumina GoldenGate assays. Data from three barley doubled haploid mapping populations supported the production of an initial consensus map. Over 200 germplasm selections, principally European and US breeding material, were used to estimate minor allele frequency (MAF) for each SNP. We selected 3,072 of these tested SNPs based on technical performance, map location, MAF and biological interest to fill two 1536-SNP "production" assays (BOPA1 and BOPA2), which were made available to the barley genetics community. Data were added using BOPA1 from a fourth mapping population to yield a consensus map containing 2,943 SNP loci in 975 marker bins covering a genetic distance of 1099 cM. CONCLUSION The unprecedented density of genic markers and marker bins enabled a high resolution comparison of the genomes of barley and rice. Low recombination in pericentric regions is evident from bins containing many more than the average number of markers, meaning that a large number of genes are recombinationally locked into the genetic centromeric regions of several barley chromosomes. Examination of US breeding germplasm illustrated the usefulness of BOPA1 and BOPA2 in that they provide excellent marker density and sensitivity for detection of minor alleles in this genetically narrow material.
Collapse
Affiliation(s)
- Timothy J Close
- Department of Botany & Plant Sciences, University of California (UCR), Riverside, CA, 92521, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Bertrand D, Blanchette M, El-Mabrouk N. Genetic map refinement using a comparative genomic approach. J Comput Biol 2009; 16:1475-86. [PMID: 19754272 DOI: 10.1089/cmb.2009.0094] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Following various genetic mapping techniques conducted on different segregating populations, one or more genetic maps are obtained for a given species. However, recombination analyzes and other methods for gene mapping often fail to resolve the ordering of some pairs of neighboring markers, thereby leading to sets of markers ambiguously mapped to the same position. Each individual map is thus a partial order defined on the set of markers, and can be represented as a Directed Acyclic Graph (DAG). In this article, given a phylogenetic tree with a set of DAGs labeling each leaf (species), the goal is to infer, at each leaf, a single combined DAG that is as resolved as possible, considering the complementary information provided by individual maps, and the phylogenetic information provided by the species tree. After combining the individual maps of a leaf into a single DAG, we order incomparable markers by using two successive heuristics for minimizing two distances on the species tree: the breakpoint distance, and the Kemeny distance. We apply our algorithms to the plant species represented in the Gramene database, and we evaluate the simplified maps we obtained.
Collapse
|
25
|
Muchero W, Diop NN, Bhat PR, Fenton RD, Wanamaker S, Pottorff M, Hearne S, Cisse N, Fatokun C, Ehlers JD, Roberts PA, Close TJ. A consensus genetic map of cowpea [Vigna unguiculata (L) Walp.] and synteny based on EST-derived SNPs. Proc Natl Acad Sci U S A 2009; 106:18159-64. [PMID: 19826088 PMCID: PMC2761239 DOI: 10.1073/pnas.0905886106] [Citation(s) in RCA: 123] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2009] [Indexed: 11/18/2022] Open
Abstract
Consensus genetic linkage maps provide a genomic framework for quantitative trait loci identification, map-based cloning, assessment of genetic diversity, association mapping, and applied breeding in marker-assisted selection schemes. Among "orphan crops" with limited genomic resources such as cowpea [Vigna unguiculata (L.) Walp.] (2n = 2x = 22), the use of transcript-derived SNPs in genetic maps provides opportunities for automated genotyping and estimation of genome structure based on synteny analysis. Here, we report the development and validation of a high-throughput EST-derived SNP assay for cowpea, its application in consensus map building, and determination of synteny to reference genomes. SNP mining from 183,118 ESTs sequenced from 17 cDNA libraries yielded approximately 10,000 high-confidence SNPs from which an Illumina 1,536-SNP GoldenGate genotyping array was developed and applied to 741 recombinant inbred lines from six mapping populations. Approximately 90% of the SNPs were technically successful, providing 1,375 dependable markers. Of these, 928 were incorporated into a consensus genetic map spanning 680 cM with 11 linkage groups and an average marker distance of 0.73 cM. Comparison of this cowpea genetic map to reference legumes, soybean (Glycine max) and Medicago truncatula, revealed extensive macrosynteny encompassing 85 and 82%, respectively, of the cowpea map. Regions of soybean genome duplication were evident relative to the simpler diploid cowpea. Comparison with Arabidopsis revealed extensive genomic rearrangement with some conserved microsynteny. These results support evolutionary closeness between cowpea and soybean and identify regions for synteny-based functional genomics studies in legumes.
Collapse
Affiliation(s)
| | - Ndeye N. Diop
- Botany and Plant Sciences, University of California, Riverside, CA 92521
| | - Prasanna R. Bhat
- Botany and Plant Sciences, University of California, Riverside, CA 92521
| | - Raymond D. Fenton
- Botany and Plant Sciences, University of California, Riverside, CA 92521
| | - Steve Wanamaker
- Botany and Plant Sciences, University of California, Riverside, CA 92521
| | - Marti Pottorff
- Botany and Plant Sciences, University of California, Riverside, CA 92521
| | - Sarah Hearne
- The International Institute of Tropical Agriculture, Oyo Road, PMB 5320, Ibadan, Nigeria; and
| | - Ndiaga Cisse
- Senegalese Institute of Agricultural Research, Bambey, Senegal
| | - Christian Fatokun
- The International Institute of Tropical Agriculture, Oyo Road, PMB 5320, Ibadan, Nigeria; and
| | - Jeffrey D. Ehlers
- Botany and Plant Sciences, University of California, Riverside, CA 92521
| | | | - Timothy J. Close
- Botany and Plant Sciences, University of California, Riverside, CA 92521
| |
Collapse
|
26
|
Chen X, Cui Y. An approximation algorithm for the minimum breakpoint linearization problem. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2009; 6:401-409. [PMID: 19644168 DOI: 10.1109/tcbb.2009.3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
In the recent years, there has been a growing interest in inferring the total order of genes or markers on a chromosome, since current genetic mapping efforts might only suffice to produce a partial order. Many interesting optimization problems were thus formulated in the framework of genome rearrangement. As an important one among them, the minimum breakpoint linearization (MBL) problem is to find the total order of a partially ordered genome that minimizes its breakpoint distance to a reference genome whose genes are already totally ordered. It was previously shown to be NP-hard, and the algorithms proposed so far are all heuristic. In this paper, we present an {m(2)+m/2}-approximation algorithm for the MBL problem, where m is the number of gene maps that are combined together to form a partial order of the genome under investigation.
Collapse
Affiliation(s)
- Xin Chen
- Division of Mathematical Sciences, School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore.
| | | |
Collapse
|
27
|
Isobe S, Kölliker R, Hisano H, Sasamoto S, Wada T, Klimenko I, Okumura K, Tabata S. Construction of a consensus linkage map for red clover (Trifolium pratense L.). BMC PLANT BIOLOGY 2009; 9:57. [PMID: 19442273 PMCID: PMC2695442 DOI: 10.1186/1471-2229-9-57] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2008] [Accepted: 05/14/2009] [Indexed: 05/20/2023]
Abstract
BACKGROUND Red clover (Trifolium pratense L.) is a major forage legume that has a strong self-incompatibility system and exhibits high genetic diversity within populations. For several crop species, integrated consensus linkage maps that combine information from multiple mapping populations have been developed. For red clover, three genetic linkage maps have been published, but the information in these existing maps has not been integrated. RESULTS A consensus linkage map was constructed using six mapping populations originating from eight parental accessions. Three of the six mapping populations were established for this study. The integrated red clover map was composed of 1804 loci, including 1414 microsatellite loci, 181 amplified fragment length polymorphism (AFLP) loci and 204 restriction fragment length polymorphism (RFLP) loci, in seven linkage groups. The average distance between loci and the total length of the consensus map were 0.46 cM and 836.6 cM, respectively. The locus order on the consensus map correlated highly with that of accession-specific maps. Segregation distortion was observed across linkage groups. We investigated genome-wide allele frequency in 1144 red clover individuals using 462 microsatellite loci randomly chosen from the consensus map. The average number of alleles and polymorphism information content (PIC) were 9.17 and 0.69, respectively. CONCLUSION A consensus genetic linkage map for red clover was constructed for the first time based on six mapping populations. The locus order on the consensus map was highly conserved among linkage maps and was sufficiently reliable for use as a reference for genetic analysis of random red clover germplasms.
Collapse
Affiliation(s)
- Sachiko Isobe
- Kazusa DNA Research Institute, Kazusa-Kamatari 2-6-7, Kisarazu, Chiba, 292-0818, Japan
| | - Roland Kölliker
- Agroscope Reckenholz-Tänikon Research Station ART, Reckenholzstr. 191, 8046 Zurich, Switzerland
| | - Hiroshi Hisano
- Kazusa DNA Research Institute, Kazusa-Kamatari 2-6-7, Kisarazu, Chiba, 292-0818, Japan
- Samuel Robert Noble Foundation. 2510 Sam Noble Pky. Ardmore, OK, 73401, USA
| | - Shigemi Sasamoto
- Kazusa DNA Research Institute, Kazusa-Kamatari 2-6-7, Kisarazu, Chiba, 292-0818, Japan
| | - Tshyuko Wada
- Kazusa DNA Research Institute, Kazusa-Kamatari 2-6-7, Kisarazu, Chiba, 292-0818, Japan
| | - Irina Klimenko
- All-Russian Williams Fodder Crop Research Institute, 141055 Lugovaya, Moscow Region, Russia
| | - Kenji Okumura
- National Agricultural Research Institute for Hokkaido Region, Hitsujigaoka 1, Toyohira, Sapporo, 062-8555, Japan
| | - Satoshi Tabata
- Kazusa DNA Research Institute, Kazusa-Kamatari 2-6-7, Kisarazu, Chiba, 292-0818, Japan
| |
Collapse
|
28
|
Jackson BN, Schnable PS, Aluru S. Consensus genetic maps as median orders from inconsistent sources. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2008; 5:161-171. [PMID: 18451426 DOI: 10.1109/tcbb.2007.70221] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
A genetic map is an ordering of genetic markers calculated from a population of known lineage. While traditionally a map has been generated from a single population for each species, recently researchers have created maps from multiple populations. In the face of these new data, we address the need to find a consensus map--a map that combines the information from multiple partial and possibly inconsistent input maps. We model each input map as a partial order and formulate the consensus problem as finding a median partial order. Finding the median of multiple total orders (preferences or rankings)is a well studied problem in social choice. We choose to find the median using the weighted symmetric difference distance, a more general version of both the symmetric difference distance and the Kemeny distance. Finding a median order using this distance is NP-hard. We show that for our chosen weight assignment, a median order satisfies the positive responsiveness, extended Condorcet,and unanimity criteria. Our solution involves finding the maximum acyclic subgraph of a weighted directed graph. We present a method that dynamically switches between an exact branch and bound algorithm and a heuristic algorithm, and show that for real data from closely related organisms, an exact median can often be found. We present experimental results using seven populations of the crop plant Zea mays.
Collapse
Affiliation(s)
- Benjamin N Jackson
- Department of Electrical and Computer Engineering, Iowa State University, Ames, IA 50014, USA.
| | | | | |
Collapse
|
29
|
Liao W, Collins A, Hobbs M, Khatkar MS, Luo J, Nicholas FW. A comparative location database (CompLDB): map integration within and between species. Mamm Genome 2007; 18:287-99. [PMID: 17588189 DOI: 10.1007/s00335-007-9020-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2006] [Revised: 04/02/2007] [Accepted: 04/03/2007] [Indexed: 11/28/2022]
Abstract
We have adapted the Location Database (LDB) map-integration strategy of Morton et al. [Ann Hum Genet 56:223-232] (1992) as above to create an integrated map for each of several species for which fully annotated genome sequences are not yet available (sheep, cattle, pig, wallaby), using all types of partial maps for that species, including cytogenetic, linkage, somatic-cell hybrid, and radiation hybrid maps. An integrated map provides not only predictions of the kilobase location of every locus, but also predicts locations (in cM) and cytogenetic band locations for every locus. In this way a comprehensive linkage map and a comprehensive cytogenetic map are created, including all loci, irrespective of whether they have ever been linkage mapped or physically mapped, respectively. High-resolution physical maps from annotated sequenced species have also been placed alongside the integrated maps. This has created a powerful tool for comparative genomics. The LDB map-integration strategy has been extended to make use of zoo-FISH comparative information. It has also been extended to enable the creation of a "virtual" map for each species not yet sequenced by using mapping data from fully sequenced species. All of the partial maps, together with the integrated map, for each species have been placed in a database called Comparative Location Database (CompLDB), which is available for querying, browsing, or download in tabular form at http://medvet.angis.org.au/ldb/.
Collapse
Affiliation(s)
- Webber Liao
- Reprogen, Faculty of Veterinary Science, University of Sydney, B19, New South Wales 2006, Australia
| | | | | | | | | | | |
Collapse
|
30
|
Blin G, Blais E, Hermelin D, Guillon P, Blanchette M, El-Mabrouk N. Gene Maps Linearization Using Genomic Rearrangement Distances. J Comput Biol 2007; 14:394-407. [PMID: 17572019 DOI: 10.1089/cmb.2007.a002] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
A preliminary step to most comparative genomics studies is the annotation of chromosomes as ordered sequences of genes. Different genetic mapping techniques often give rise to different maps with unequal gene content and sets of unordered neighboring genes. Only partial orders can thus be obtained from combining such maps. However, once a total order O is known for a given genome, it can be used as a reference to order genes of a closely related species characterized by a partial order P. Our goal is to find a linearization of P that is as close as possible to O, in term of a given genomic distance. We first prove NP-completeness complexity results considering the breakpoint and the common interval distances. We then focus on the breakpoint distance and give a dynamic programming algorithm whose running time is exponential for general partial orders, but polynomial when the partial order is derived from a bounded number of genetic maps. A time-efficient greedy heuristic is then given for the general case and is empirically shown to produce solutions within 10% of the optimal solution, on simulated data. Applications to the analysis of grass genomes are presented.
Collapse
Affiliation(s)
- Guillaume Blin
- IGM-LabInfo, UMR CNRS 8049, Université Paris-Est, Marne-la-Vallée, France.
| | | | | | | | | | | |
Collapse
|
31
|
Veyrieras JB, Goffinet B, Charcosset A. MetaQTL: a package of new computational methods for the meta-analysis of QTL mapping experiments. BMC Bioinformatics 2007; 8:49. [PMID: 17288608 PMCID: PMC1808479 DOI: 10.1186/1471-2105-8-49] [Citation(s) in RCA: 138] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2006] [Accepted: 02/08/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Integration of multiple results from Quantitative Trait Loci (QTL) studies is a key point to understand the genetic determinism of complex traits. Up to now many efforts have been made by public database developers to facilitate the storage, compilation and visualization of multiple QTL mapping experiment results. However, studying the congruency between these results still remains a complex task. Presently, the few computational and statistical frameworks to do so are mainly based on empirical methods (e.g. consensus genetic maps are generally built by iterative projection). RESULTS In this article, we present a new computational and statistical package, called MetaQTL, for carrying out whole-genome meta-analysis of QTL mapping experiments. Contrary to existing methods, MetaQTL offers a complete statistical process to establish a consensus model for both the marker and the QTL positions on the whole genome. First, MetaQTL implements a new statistical approach to merge multiple distinct genetic maps into a single consensus map which is optimal in terms of weighted least squares and can be used to investigate recombination rate heterogeneity between studies. Secondly, assuming that QTL can be projected on the consensus map, MetaQTL offers a new clustering approach based on a Gaussian mixture model to decide how many QTL underly the distribution of the observed QTL. CONCLUSION We demonstrate using simulations that the usual model choice criteria from mixture model literature perform relatively well in this context. As expected, simulations also show that this new clustering algorithm leads to a reduction in the length of the confidence interval of QTL location provided that across studies there are enough observed QTL for each underlying true QTL location. The usefulness of our approach is illustrated on published QTL detection results of flowering time in maize. Finally, MetaQTL is freely available at http://bioinformatics.org/mqtl.
Collapse
Affiliation(s)
- Jean-Baptiste Veyrieras
- UMR, INRA UPS-XI INAPG CNRS Génétique Végétale, Ferme du Moulon, 91190 Gif-sur-Yvette, France
| | - Bruno Goffinet
- BIA, Chemin de Borde Rouge BP27 31326, Castanet Tolosan Cedex, France
| | - Alain Charcosset
- UMR, INRA UPS-XI INAPG CNRS Génétique Végétale, Ferme du Moulon, 91190 Gif-sur-Yvette, France
| |
Collapse
|
32
|
Rossini L, Vecchietti A, Nicoloso L, Stein N, Franzago S, Salamini F, Pozzi C. Candidate genes for barley mutants involved in plant architecture: an in silico approach. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2006; 112:1073-85. [PMID: 16501940 DOI: 10.1007/s00122-006-0209-2] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2005] [Accepted: 12/28/2005] [Indexed: 05/06/2023]
Abstract
To individuate candidate genes (CGs) for a set of barley developmental mutants, a synteny approach comparing the genomes of barley and rice has been introduced. Based on map positions of mutants, sequenced RFLP markers linked to the target loci were selected. The markers were mapped in silico by BLAST searches against the rice genome sequence and chromosomal regions syntenous to barley target intervals were identified. Rice syntenous regions were defined for 15 barley chromosomal intervals hosting 23 mutant loci affecting plant height (brh1; brh2; sld4), shoot and inflorescence branching (als; brc1; cul-2, -3, -5, -15, -16; dub1; mnd6; vrs1), development of leaves (lig) and leaf-like organs (cal-b19, -C15, -d4; lks5; suKD-25; suKE-74; suKF-76; trd; trp). Annotation of 110 Mb of rice genomic sequence made it possible to screen for putative CGs which are listed together with the reasons supporting mutant-gene associations. For two loci, CGs were identified with a clear probability to represent the locus considered. These include FRIZZY PANICLE, a candidate for the brc1 barley mutant, and the rice ortholog of maize Liguleless1 (Lg1), a candidate for the barley lig locus on chromosome 2H. For this locus, the validity of the approach was supported by the PCR-amplification of a genomic fragment of the orthologous barley sequence. SNP mapping located this fragment on chromosome 2H in the region hosting the lig genetic locus.
Collapse
Affiliation(s)
- Laura Rossini
- Dipartimento di Produzione Vegetale, Università degli Studi di Milano, Via Celoria 2, 20133 Milan, Italy
| | | | | | | | | | | | | |
Collapse
|
33
|
Jackson BN, Aluru S, Schnable PS. Consensus genetic maps: a graph theoretic approach. PROCEEDINGS. IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE 2006:35-43. [PMID: 16447960 DOI: 10.1109/csb.2005.26] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
A genetic map is an ordering of genetic markers constructed from genetic linkage data for use in linkage studies and experimental design. While traditional methods have focused on constructing maps from a single population study, increasingly maps are generated for multiple lines and populations of the same organism. For example, in crop plants, where the genetic variability is high, researchers have created maps for many populations. In the face of these new data, we address the increasingly important problem of generating a consensus map - an ordering of all markers in the various population studies. In our method, each input map is treated as a partial order on a set of markers. To find the most consistent order shared between maps, we model the partial orders as directed graphs. We create an aggregate by merginging the transitive closure of the input graphs and taking the transitive reduction of the result. In this process, cycles may need to be broken to resolve inconsistencies between the inputs. The cycle breaking problem is NP-hard, but the problem size depends upon the scope of the inconsistency between the input graphs, which will be local if the input graphs are from closely related organisms. We present results of running the resulting software on maps generated from seven populations of the crop plant Zea Mays.
Collapse
Affiliation(s)
- Benjamin N Jackson
- Dept. of Electrical and Computer Engineering, Iowa State University, Ames, IA 50010, USA.
| | | | | |
Collapse
|