Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Hahn MW, Zhang SV, Moyle LC. Sequencing, assembling, and correcting draft genomes using recombinant populations. G3 (Bethesda) 2014;4:669-79. [PMID: 24531727 DOI: 10.1534/g3.114.010264] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Indexed: 11/26/2022]

Cited by Other Article(s)

Hung TH, Wu ETY, Zeltiņš P, Jansons Ā, Ullah A, Erbilgin N, Bohlmann J, Bousquet J, Birol I, Clegg SM, MacKay JJ. Long-insert sequence capture detects high copy numbers in a defence-related beta-glucosidase gene βglu-1 with large variations in white spruce but not Norway spruce. BMC Genomics 2024;25:118. [PMID: 38281030 PMCID: PMC10821269 DOI: 10.1186/s12864-024-09978-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/18/2023] [Accepted: 01/05/2024] [Indexed: 01/29/2024]

Li J, Van de Peer Y, Li Z. Inference of Ancient Polyploidy Using Transcriptome Data. Methods Mol Biol 2023;2545:47-76. [PMID: 36720807 DOI: 10.1007/978-1-0716-2561-3_3] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Indexed: 02/02/2023]

Galla SJ, Brown L, Couch-Lewis Ngāi Tahu Te Hapū O Ngāti Wheke Ngāti Waewae Y, Cubrinovska I, Eason D, Gooley RM, Hamilton JA, Heath JA, Hauser SS, Latch EK, Matocq MD, Richardson A, Wold JR, Hogg CJ, Santure AW, Steeves TE. The relevance of pedigrees in the conservation genomics era. Mol Ecol 2021;31:41-54. [PMID: 34553796 PMCID: PMC9298073 DOI: 10.1111/mec.16192] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Received: 07/05/2021] [Revised: 09/12/2021] [Accepted: 09/17/2021] [Indexed: 01/21/2023]

Affiliation(s)

Stephanie J Galla Department of Biological Sciences, Boise State University, Boise, Idaho, USA; School of Biological Sciences, University of Canterbury, Christchurch, Canterbury, New Zealand
Liz Brown New Zealand Department of Conservation, Twizel, Canterbury, New Zealand
Yvette Couch-Lewis Ngāi Tahu Te Hapū O Ngāti Wheke Ngāti Waewae Te Rūnanga o Ngāi Tahu, Te Whare o Te Waipounamu, Christchurch, Canterbury, New Zealand
Ilina Cubrinovska School of Biological Sciences, University of Canterbury, Christchurch, Canterbury, New Zealand
Daryl Eason New Zealand Department of Conservation, Invercargill, Southland, New Zealand
Rebecca M Gooley Smithsonian-Mason School of Conservation, Front Royal, Maryland, USA; Center for Species Survival, Smithsonian Conservation Biology Institute, National Zoological Park, Washington, District of Columbia, USA
Jill A Hamilton Department of Biological Sciences, North Dakota State University, Fargo, North Dakota, USA
Julie A Heath Department of Biological Sciences, Boise State University, Boise, Idaho, USA
Samantha S Hauser Department of Biological Sciences, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
Emily K Latch Department of Biological Sciences, University of Wisconsin-Milwaukee, Milwaukee, Wisconsin, USA
Marjorie D Matocq Department of Natural Resources and Environmental Science, Program in Ecology, Evolution and Conservation Biology, University of Nevada Reno, Reno, Nevada, USA
Anne Richardson The Isaac Conservation and Wildlife Trust, Christchurch, Canterbury, New Zealand
Jana R Wold School of Biological Sciences, University of Canterbury, Christchurch, Canterbury, New Zealand
Carolyn J Hogg School of Life and Environmental Sciences, University of Sydney, Sydney, NSW, Australia
Anna W Santure School of Biological Sciences, University of Auckland, Auckland, Auckland, New Zealand
Tammy E Steeves School of Biological Sciences, University of Canterbury, Christchurch, Canterbury, New Zealand

Williams AM, Itgen MW, Broz AK, Carter OG, Sloan DB. Long-read transcriptome and other genomic resources for the angiosperm Silene noctiflora. G3 (Bethesda) 2021;11:jkab189. [PMID: 34849814 PMCID: PMC8496259 DOI: 10.1093/g3journal/jkab189] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 04/23/2021] [Accepted: 05/20/2021] [Indexed: 01/04/2023]

Grassa CJ, Weiblen GD, Wenger JP, Dabney C, Poplawski SG, Timothy Motley S, Michael TP, Schwartz CJ. A new Cannabis genome assembly associates elevated cannabidiol (CBD) with hemp introgressed into marijuana. New Phytol 2021;230:1665-1679. [PMID: 33521943 PMCID: PMC8248131 DOI: 10.1111/nph.17243] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Received: 08/31/2020] [Accepted: 01/18/2021] [Indexed: 05/20/2023]

Madritsch S, Burg A, Sehr EM. Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species. BMC Bioinformatics 2021;22:146. [PMID: 33752598 PMCID: PMC7986043 DOI: 10.1186/s12859-021-04078-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 11/05/2020] [Accepted: 03/15/2021] [Indexed: 01/15/2023]

Abstract

Background

Polyploidy is very common in plants and can be seen as one of the key drivers in the domestication of crops and the establishment of important agronomic traits. It can be the main source of genomic repatterning and introduces gene duplications, affecting gene expression and alternative splicing. Since fully sequenced genomes are not yet available for many plant species, including crops, de novo transcriptome assembly is the basis for understanding molecular and functional mechanisms. However, in complex polyploid plants, de novo transcriptome assembly is challenging, leading to increased rates of fused or redundant transcripts. Since assemblers were developed mainly for diploid organisms, they may not be well suited for polyploids. Also, comparative evaluations of these tools on higher polyploid plants are extremely rare. Thus, our aim was to fill this gap and to provide a basic guideline for choosing the optimal de novo assembly strategy focusing on autotetraploids, as the scientific interest in this type of polyploidy is steadily increasing.

Results

We present a comparison of two common (SOAPdenovo-Trans, Trinity) and one recently published transcriptome assembler (TransLiG) on diploid and autotetraploid species of the genera Acer and Vaccinium using Arabidopsis thaliana as a reference. The number of assembled transcripts was up to 11 and 14 times higher with an increased number of short transcripts for Acer and Vaccinium, respectively, compared to A. thaliana. In diploid samples, Trinity and TransLiG performed similarly well, while in autotetraploids TransLiG assembled the most complete transcriptomes, with an average of 1916 assembled BUSCOs vs. 1705 BUSCOs for Trinity. Of the three assemblers, SOAPdenovo-Trans performed worst (1133 complete BUSCOs).

Conclusion

All three assembly tools produced complete assemblies when dealing with the model organism A. thaliana, independently of its ploidy level, but their performance differed markedly for non-model autotetraploids, where TransLiG and Trinity in particular produced a high number of redundant transcripts. The recently published assembler TransLiG had not previously been tested on any plant organism but showed the highest completeness and full-length transcriptomes, especially in autotetraploids. Including such species in the development and testing of new assembly tools is strongly recommended, as many important crops are polyploid.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12859-021-04078-8.

Zhao Z, Zhou Y, Wang S, Zhang X, Wang C, Li S. LDscaff: LD-based scaffolding of de novo genome assemblies. BMC Bioinformatics 2020;21:570. [PMID: 33371875 PMCID: PMC7768660 DOI: 10.1186/s12859-020-03895-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 11/17/2020] [Accepted: 11/18/2020] [Indexed: 12/11/2022]

Zhou C, Olukolu B, Gemenet DC, Wu S, Gruneberg W, Cao MD, Fei Z, Zeng ZB, George AW, Khan A, Yencho GC, Coin LJM. Assembly of whole-chromosome pseudomolecules for polyploid plant genomes using outbred mapping populations. Nat Genet 2020;52:1256-1264. [PMID: 33128049 DOI: 10.1038/s41588-020-00717-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 03/10/2017] [Accepted: 09/15/2020] [Indexed: 12/31/2022]

Fuller ZL, Mocellin VJL, Morris LA, Cantin N, Shepherd J, Sarre L, Peng J, Liao Y, Pickrell J, Andolfatto P, Matz M, Bay LK, Przeworski M. Population genetics of the coral Acropora millepora: Toward genomic prediction of bleaching. Science 2020;369:eaba4674. [PMID: 32675347 DOI: 10.1126/science.aba4674] [Citation(s) in RCA: 97] [Impact Index Per Article: 24.3] [Received: 12/05/2019] [Accepted: 06/01/2020] [Indexed: 12/11/2022]

Waterhouse RM, Aganezov S, Anselmetti Y, Lee J, Ruzzante L, Reijnders MJMF, Feron R, Bérard S, George P, Hahn MW, Howell PI, Kamali M, Koren S, Lawson D, Maslen G, Peery A, Phillippy AM, Sharakhova MV, Tannier E, Unger MF, Zhang SV, Alekseyev MA, Besansky NJ, Chauve C, Emrich SJ, Sharakhov IV. Evolutionary superscaffolding and chromosome anchoring to improve Anopheles genome assemblies. BMC Biol 2020;18:1. [PMID: 31898513 PMCID: PMC6939337 DOI: 10.1186/s12915-019-0728-3] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Received: 11/13/2019] [Accepted: 11/26/2019] [Indexed: 11/18/2022]

Abstract

Background

New sequencing technologies have lowered financial barriers to whole genome sequencing, but resulting assemblies are often fragmented and far from ‘finished’. Updating multi-scaffold drafts to chromosome-level status can be achieved through experimental mapping or re-sequencing efforts. Avoiding the costs associated with such approaches, comparative genomic analysis of gene order conservation (synteny) to predict scaffold neighbours (adjacencies) offers a potentially useful complementary method for improving draft assemblies.

Results

We evaluated and employed 3 gene synteny-based methods applied to 21 Anopheles mosquito assemblies to produce consensus sets of scaffold adjacencies. For subsets of the assemblies, we integrated these with additional supporting data to confirm and complement the synteny-based adjacencies: 6 with physical mapping data that anchor scaffolds to chromosome locations, 13 with paired-end RNA sequencing (RNAseq) data, and 3 with new assemblies based on re-scaffolding or long-read data. Our combined analyses produced 20 new superscaffolded assemblies with improved contiguities: 7 for which assignments of non-anchored scaffolds to chromosome arms span more than 75% of the assemblies, and a further 7 with chromosome anchoring including an 88% anchored Anopheles arabiensis assembly and, respectively, 73% and 84% anchored assemblies with comprehensively updated cytogenetic photomaps for Anopheles funestus and Anopheles stephensi.

Conclusions

Experimental data from probe mapping, RNAseq, or long-read technologies, where available, all contribute to successful upgrading of draft assemblies. Our evaluations show that gene synteny-based computational methods represent a valuable alternative or complementary approach. Our improved Anopheles reference assemblies highlight the utility of applying comparative genomics approaches to improve community genomic resources.

Affiliation(s)

Robert M Waterhouse Department of Ecology and Evolution, University of Lausanne, and Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland.
Sergey Aganezov Department of Computer Science, Princeton University, Princeton, NJ, 08450, USA; Department of Computer Science, Johns Hopkins University, Baltimore, MD, 21218, USA
Yoann Anselmetti ISEM, Univ Montpellier, CNRS, EPHE, IRD, Montpellier, France
Jiyoung Lee The Interdisciplinary PhD Program in Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA, 24061, USA
Livio Ruzzante Department of Ecology and Evolution, University of Lausanne, and Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland
Maarten J M F Reijnders Department of Ecology and Evolution, University of Lausanne, and Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland
Romain Feron Department of Ecology and Evolution, University of Lausanne, and Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland
Sèverine Bérard ISEM, Univ Montpellier, CNRS, EPHE, IRD, Montpellier, France
Phillip George Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA, 24061, USA
Matthew W Hahn Departments of Biology and Computer Science, Indiana University, Bloomington, IN, 47405, USA
Paul I Howell Centers for Disease Control and Prevention, Atlanta, GA, 30329, USA
Maryam Kamali Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA, 24061, USA; Department of Medical Entomology and Parasitology, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
Sergey Koren Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, USA
Daniel Lawson European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, CB10 1SD, UK
Gareth Maslen European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, CB10 1SD, UK
Ashley Peery Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA, 24061, USA
Adam M Phillippy Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, USA
Maria V Sharakhova Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA, 24061, USA; Laboratory of Ecology, Genetics and Environmental Protection, Tomsk State University, Tomsk, Russia, 634050
Eric Tannier Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, Unité Mixte de Recherche 5558 Centre National de la Recherche Scientifique, 69622, Villeurbanne, France; Institut national de recherche en informatique et en automatique, Montbonnot, 38334, Grenoble, Rhône-Alpes, France
Maria F Unger Eck Institute for Global Health and Department of Biological Sciences, University of Notre Dame, Galvin Life Sciences Building, Notre Dame, IN, 46556, USA
Simo V Zhang Departments of Biology and Computer Science, Indiana University, Bloomington, IN, 47405, USA
Max A Alekseyev Department of Mathematics and Computational Biology Institute, George Washington University, Ashburn, VA, 20147, USA
Nora J Besansky Eck Institute for Global Health and Department of Biological Sciences, University of Notre Dame, Galvin Life Sciences Building, Notre Dame, IN, 46556, USA
Cedric Chauve Department of Mathematics, Simon Fraser University, Burnaby, British Columbia, V5A 1S6, Canada
Scott J Emrich Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, TN, 37996, USA
Igor V Sharakhov The Interdisciplinary PhD Program in Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute and State University, Blacksburg, VA, 24061, USA; Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, VA, 24061, USA; Laboratory of Ecology, Genetics and Environmental Protection, Tomsk State University, Tomsk, Russia, 634050.

Flagel LE, Blackman BK, Fishman L, Monnahan PJ, Sweigart A, Kelly JK. GOOGA: A platform to synthesize mapping experiments and identify genomic structural diversity. PLoS Comput Biol 2019;15:e1006949. [PMID: 30986215 PMCID: PMC6483263 DOI: 10.1371/journal.pcbi.1006949] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Received: 06/07/2018] [Revised: 04/25/2019] [Accepted: 03/15/2019] [Indexed: 11/18/2022]

Pengelly RJ, Collins A. Linkage disequilibrium maps to guide contig ordering for genome assembly. Bioinformatics 2018;35:541-545. [DOI: 10.1093/bioinformatics/bty687] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Received: 04/11/2018] [Revised: 07/13/2018] [Accepted: 08/03/2018] [Indexed: 11/12/2022]

Sefick SA, Castronova MA, Stevison LS. genotypeR: An integrated R package for single nucleotide polymorphism genotype marker design and data analysis. Methods Ecol Evol 2018. [DOI: 10.1111/2041-210x.12965] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 11/27/2022]

Clonorchis sinensis and Clonorchiasis: The Relevance of Exploring Genetic Variation. Adv Parasitol 2018;100:155-208. [PMID: 29753338 DOI: 10.1016/bs.apar.2018.03.006] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Indexed: 12/20/2022]

Pfeifer SP. Direct estimate of the spontaneous germ line mutation rate in African green monkeys. Evolution 2017;71:2858-2870. [PMID: 29068052 DOI: 10.1111/evo.13383] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Received: 07/06/2017] [Revised: 10/03/2017] [Accepted: 10/09/2017] [Indexed: 12/30/2022]

Genetic Mapping of Millions of SNPs in Safflower (Carthamus tinctorius L.) via Whole-Genome Resequencing. G3 (Bethesda) 2016;6:2203-11. [PMID: 27226165 PMCID: PMC4938673 DOI: 10.1534/g3.115.026690] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Indexed: 12/01/2022]

Stevison LS, Woerner AE, Kidd JM, Kelley JL, Veeramah KR, McManus KF, Bustamante CD, Hammer MF, Wall JD. The Time Scale of Recombination Rate Evolution in Great Apes. Mol Biol Evol 2016;33:928-45. [PMID: 26671457 PMCID: PMC5870646 DOI: 10.1093/molbev/msv331] [Citation(s) in RCA: 61] [Impact Index Per Article: 7.6] [Indexed: 12/11/2022]

Abstract

We present three linkage-disequilibrium (LD)-based recombination maps generated using whole-genome sequence data from 10 Nigerian chimpanzees, 13 bonobos, and 15 western gorillas, collected as part of the Great Ape Genome Project (Prado-Martinez J, et al. 2013. Great ape genetic diversity and population history. Nature 499:471-475). We also identified species-specific recombination hotspots in each group using a modified LDhot framework, which greatly improves statistical power to detect hotspots at varying strengths. We show that fewer hotspots are shared among chimpanzee subspecies than within human populations, further narrowing the time scale of complete hotspot turnover. Further, using species-specific PRDM9 sequences to predict potential binding sites (PBS), we show higher predicted PRDM9 binding in recombination hotspots as compared to matched cold spot regions in multiple great ape species, including at least one chimpanzee subspecies. We found that correlations between broad-scale recombination rates decline more rapidly than nucleotide divergence between species. We also compared the skew of recombination rates at centromeres and telomeres between species and show a skew from chromosome means extending as far as 10-15 Mb from chromosome ends. Further, we examined broad-scale recombination rate changes near a translocation in gorillas and found minimal differences as compared to other great ape species perhaps because the coordinates relative to the chromosome ends were unaffected. Finally, on the basis of multiple linear regression analysis, we found that various correlates of recombination rate persist throughout the African great apes including repeats, diversity, and divergence. Our study is the first to analyze within- and between-species genome-wide recombination rate variation in several close relatives.

Martin G, Baurens FC, Droc G, Rouard M, Cenci A, Kilian A, Hastie A, Doležel J, Aury JM, Alberti A, Carreel F, D'Hont A. Improvement of the banana "Musa acuminata" reference sequence using NGS data and semi-automated bioinformatics methods. BMC Genomics 2016;17:243. [PMID: 26984673 PMCID: PMC4793746 DOI: 10.1186/s12864-016-2579-4] [Citation(s) in RCA: 79] [Impact Index Per Article: 9.9] [Received: 08/04/2015] [Accepted: 03/08/2016] [Indexed: 12/04/2022]

Abstract

Background

Recent advances in genomics indicate the functional significance of a majority of genome sequences and their long-range interactions. As a detailed examination of genome organization and function requires a very high-quality genome sequence, the objective of this study was to improve the reference genome assembly of banana (Musa acuminata).

Results

We have developed a modular bioinformatics pipeline to improve genome sequence assemblies, which can handle various types of data. The pipeline comprises several semi-automated tools. However, unlike classical automated tools that are based on global parameters, the semi-automated tools offer an expert mode in which the user can decide on suggested improvements through local compromises. The pipeline was used to improve the draft genome sequence of Musa acuminata. Genotyping by sequencing (GBS) of a segregating population and paired-end sequencing were used to detect and correct scaffold misassemblies. Long insert size paired-end reads identified scaffold junctions and fusions missed by automated assembly methods. GBS markers were used to anchor scaffolds to pseudo-molecules with a new bioinformatics approach that avoids the tedious step of marker ordering during genetic map construction. Furthermore, a genome map was constructed and used to assemble scaffolds into super scaffolds. Finally, a consensus gene annotation was projected on the new assembly from two pre-existing annotations. This approach reduced the total Musa scaffold number from 7513 to 1532 (i.e. by 80%), with an N50 that increased from 1.3 Mb (65 scaffolds) to 3.0 Mb (26 scaffolds). 89.5% of the assembly was anchored to the 11 Musa chromosomes compared to the previous 70%. Unknown sites (N) were reduced from 17.3% to 10.0%.

Conclusion

The release of the Musa acuminata reference genome version 2 provides a platform for detailed analysis of banana genome variation, function and evolution. Bioinformatics tools developed in this work can be used to improve genome sequence assemblies in other species.

Electronic supplementary material

The online version of this article (doi:10.1186/s12864-016-2579-4) contains supplementary material, which is available to authorized users.

Application of Population Sequencing (POPSEQ) for Ordering and Imputing Genotyping-by-Sequencing Markers in Hexaploid Wheat. G3 (Bethesda) 2015;5:2547-53. [PMID: 26530417 PMCID: PMC4683627 DOI: 10.1534/g3.115.020362] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Indexed: 12/01/2022]

A Male-Specific Genetic Map of the Microcrustacean Daphnia pulex Based on Single-Sperm Whole-Genome Sequencing. Genetics 2015;201:31-8. [PMID: 26116153 DOI: 10.1534/genetics.115.179028] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Received: 03/20/2015] [Accepted: 06/24/2015] [Indexed: 12/12/2022]

Fierst JL. Using linkage maps to correct and scaffold de novo genome assemblies: methods, challenges, and computational tools. Front Genet 2015;6:220. [PMID: 26150829 PMCID: PMC4473057 DOI: 10.3389/fgene.2015.00220] [Citation(s) in RCA: 98] [Impact Index Per Article: 10.9] [Received: 04/08/2015] [Accepted: 06/08/2015] [Indexed: 01/05/2023]

Chapman JA, Mascher M, Buluç A, Barry K, Georganas E, Session A, Strnadova V, Jenkins J, Sehgal S, Oliker L, Schmutz J, Yelick KA, Scholz U, Waugh R, Poland JA, Muehlbauer GJ, Stein N, Rokhsar DS. A whole-genome shotgun approach for assembling and anchoring the hexaploid bread wheat genome. Genome Biol 2015. [PMID: 25637298 DOI: 10.1186/s13059-015-0582-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Indexed: 11/10/2022]

Affiliation(s)

Jarrod A Chapman Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA.
Martin Mascher Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466, Stadt Seeland, Germany.
Aydın Buluç Computational Research Division and National Energy Research Supercomputing Center (NERSC), Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA.
Kerrie Barry Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA.
Evangelos Georganas Computational Research Division and National Energy Research Supercomputing Center (NERSC), Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Department of Electrical Engineering and Computer Science, Computer Science Division, University of California, Berkeley, CA, 94720, USA.
Adam Session Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA.
Veronika Strnadova Department of Computer Science, University of California, Santa Barbara, CA, 93106, USA.
Jerry Jenkins Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA; HudsonAlpha Institute of Biotechnology, Huntsville, AL, 35806, USA.
Sunish Sehgal Department of Plant Pathology, Kansas State University, Manhattan, KS, 65506, USA; Present address: Department of Plant Science, South Dakota State University, Brookings, SD, 57007, USA.
Leonid Oliker Computational Research Division and National Energy Research Supercomputing Center (NERSC), Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA.
Jeremy Schmutz Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA; HudsonAlpha Institute of Biotechnology, Huntsville, AL, 35806, USA.
Katherine A Yelick Computational Research Division and National Energy Research Supercomputing Center (NERSC), Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Department of Electrical Engineering and Computer Science, Computer Science Division, University of California, Berkeley, CA, 94720, USA.
Uwe Scholz Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466, Stadt Seeland, Germany.
Robbie Waugh Division of Plant Sciences, University of Dundee & The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK.
Jesse A Poland Department of Plant Pathology, Kansas State University, Manhattan, KS, 65506, USA.
Gary J Muehlbauer Departments of Agronomy and Plant Genetics, and Plant Biology, University of Minnesota, St Paul, MN, 55108, USA.
Nils Stein Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466, Stadt Seeland, Germany.
Daniel S Rokhsar Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA; Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA.

Chapman JA, Mascher M, Buluç A, Barry K, Georganas E, Session A, Strnadova V, Jenkins J, Sehgal S, Oliker L, Schmutz J, Yelick KA, Scholz U, Waugh R, Poland JA, Muehlbauer GJ, Stein N, Rokhsar DS. A whole-genome shotgun approach for assembling and anchoring the hexaploid bread wheat genome. Genome Biol 2015;16:26. [PMID: 25637298 PMCID: PMC4373400 DOI: 10.1186/s13059-015-0582-8] [Citation(s) in RCA: 162] [Impact Index Per Article: 18.0] [Received: 09/12/2014] [Accepted: 01/06/2015] [Indexed: 11/10/2022]

Affiliation(s)

Jarrod A Chapman Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA.
Martin Mascher Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466, Stadt Seeland, Germany.
Aydın Buluç Computational Research Division and National Energy Research Supercomputing Center (NERSC), Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA.
Kerrie Barry Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA.
Evangelos Georganas Computational Research Division and National Energy Research Supercomputing Center (NERSC), Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA. .,Department of Electrical Engineering and Computer Science, Computer Science Division, University of California, Berkeley, CA, 94720, USA.
Adam Session Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA.
Veronika Strnadova Department of Computer Science, University of California, Santa Barbara, CA, 93106, USA.
Jerry Jenkins Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA. .,HudsonAlpha Institute of Biotechnology, Huntsville, AL, 35806, USA.
Sunish Sehgal Department of Plant Pathology, Kansas State University, Manhattan, KS, 65506, USA. .,Present address: Department of Plant Science, South Dakota State University, Brookings, SD, 57007, USA.
Leonid Oliker Computational Research Division and National Energy Research Supercomputing Center (NERSC), Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA.
Jeremy Schmutz Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA; HudsonAlpha Institute of Biotechnology, Huntsville, AL, 35806, USA.
Katherine A Yelick Computational Research Division and National Energy Research Supercomputing Center (NERSC), Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA; Department of Electrical Engineering and Computer Science, Computer Science Division, University of California, Berkeley, CA, 94720, USA.
Uwe Scholz Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466, Stadt Seeland, Germany.
Robbie Waugh Division of Plant Sciences, University of Dundee & The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK.
Jesse A Poland Department of Plant Pathology, Kansas State University, Manhattan, KS, 65506, USA.
Gary J Muehlbauer Departments of Agronomy and Plant Genetics, and Plant Biology, University of Minnesota, St Paul, MN, 55108, USA.
Nils Stein Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, 06466, Stadt Seeland, Germany.
Daniel S Rokhsar Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA; Department of Molecular and Cell Biology, University of California, Berkeley, CA, 94720, USA.

Extensive error in the number of genes inferred from draft genome assemblies. PLoS Comput Biol 2014;10:e1003998. [PMID: 25474019 PMCID: PMC4256071 DOI: 10.1371/journal.pcbi.1003998] [Citation(s) in RCA: 172] [Impact Index Per Article: 17.2] [Received: 01/14/2014] [Accepted: 10/22/2014] [Indexed: 11/19/2022] Open

Abstract

Current sequencing methods produce large amounts of data, but genome assemblies based on these data are often woefully incomplete. These incomplete and error-filled assemblies result in many annotation errors, especially in the number of genes present in a genome. In this paper we investigate the magnitude of the problem, both in terms of total gene number and the number of copies of genes in specific families. To do this, we compare multiple draft assemblies against higher-quality versions of the same genomes, using several new assemblies of the chicken genome based on both traditional and next-generation sequencing technologies, as well as published draft assemblies of chimpanzee. We find that upwards of 40% of all gene families are inferred to have the wrong number of genes in draft assemblies, and that these incorrect assemblies both add and subtract genes. Using simulated genome assemblies of Drosophila melanogaster, we find that the major cause of increased gene numbers in draft genomes is the fragmentation of genes onto multiple individual contigs. Finally, we demonstrate the usefulness of RNA-Seq in improving the gene annotation of draft assemblies, largely by connecting genes that have been fragmented in the assembly process.

The initial publication of the genome sequence of many plants, animals, and microbes is often accompanied with great fanfare. However, these genomes are almost always first-drafts, with a lot of missing data, many gaps, and many errors in the published sequences. Compounding this problem, the genes identified in draft genome sequences are also affected by incomplete genome assemblies: the number and exact structure of predicted genes may be incorrect. Here we quantify the extent of such errors, by comparing several draft genomes against completed versions of the same sequences. Surprisingly, we find huge numbers of errors in the number of genes predicted from draft assemblies, with more than half of all genes having the wrong number of copies in the draft genomes examined. Our investigation also reveals the major causes of these errors, and further analyses using additional functional data demonstrate that many of the gene predictions can be corrected. The results presented here suggest that many inferences based on published draft genomes may be erroneous, but offer a way forward for future analyses.

Sessa EB, Banks JA, Barker MS, Der JP, Duffy AM, Graham SW, Hasebe M, Langdale J, Li FW, Marchant DB, Pryer KM, Rothfels CJ, Roux SJ, Salmi ML, Sigel EM, Soltis DE, Soltis PS, Stevenson DW, Wolf PG. Between two fern genomes. Gigascience 2014;3:15. [PMID: 25324969 PMCID: PMC4199785 DOI: 10.1186/2047-217x-3-15] [Citation(s) in RCA: 63] [Impact Index Per Article: 6.3] [Received: 08/08/2014] [Accepted: 09/18/2014] [Indexed: 11/10/2022] Open

Affiliation(s)

Emily B Sessa Department of Biology, Box 118525, University of Florida, Gainesville, FL 32611, USA; Genetics Institute, University of Florida, Box 103610, Gainesville, FL 32611, USA
Jo Ann Banks Department of Botany and Plant Pathology, Purdue University, 915 West State Street, West Lafayette, IN 47907, USA
Michael S Barker Department of Ecology & Evolutionary Biology, University of Arizona, 1041 East Lowell Street, Tucson, AZ 85721, USA
Joshua P Der Department of Biology, Penn State University, 201 Life Science Building, University Park, PA 16801, USA; Current address: Department of Biological Science, California State University, 800 N. State College Blvd., Fullerton, CA 92831, USA
Aaron M Duffy Ecology Center and Department of Biology, Utah State University, 5305 Old Main Hill, Logan, UT 84322, USA
Sean W Graham Department of Botany, University of British Columbia, 3529-6720 University Blvd., Vancouver, BC V6T 1Z4, Canada
Mitsuyasu Hasebe National Institute for Basic Biology, 38 Nishigounaka, Myo-daiji-cho, Okazaki 444-8585, Japan
Jane Langdale Department of Plant Sciences, University of Oxford, South Parks Road, Oxford OX1 3RB, UK
Fay-Wei Li Department of Biology, Duke University, Post Office Box 90338, Durham, NC 27708, USA
D Blaine Marchant Department of Biology, Box 118525, University of Florida, Gainesville, FL 32611, USA; Florida Museum of Natural History, Dickinson Hall, University of Florida, Gainesville, FL 32611, USA
Kathleen M Pryer Department of Biology, Duke University, Post Office Box 90338, Durham, NC 27708, USA
Carl J Rothfels Department of Zoology, University of British Columbia, 2329 W. Mall, Vancouver, BC V6T 1Z4, Canada; Current address: University Herbarium and Department of Integrative Biology, University of California, 1001 Valley Life Sciences Building, Berkeley, CA 94720, USA
Stanley J Roux Department of Molecular Biosciences, University of Texas, 205 W. 24th Street, Austin, TX 78712, USA
Mari L Salmi Department of Molecular Biosciences, University of Texas, 205 W. 24th Street, Austin, TX 78712, USA
Erin M Sigel Department of Biology, Duke University, Post Office Box 90338, Durham, NC 27708, USA
Douglas E Soltis Department of Biology, Box 118525, University of Florida, Gainesville, FL 32611, USA; Genetics Institute, University of Florida, Box 103610, Gainesville, FL 32611, USA; Florida Museum of Natural History, Dickinson Hall, University of Florida, Gainesville, FL 32611, USA
Pamela S Soltis Genetics Institute, University of Florida, Box 103610, Gainesville, FL 32611, USA; Florida Museum of Natural History, Dickinson Hall, University of Florida, Gainesville, FL 32611, USA
Dennis W Stevenson New York Botanical Garden, 2900 Southern Boulevard, Bronx, NY 10458, USA
Paul G Wolf Ecology Center and Department of Biology, Utah State University, 5305 Old Main Hill, Logan, UT 84322, USA

High-resolution genetic map for understanding the effect of genome-wide recombination rate on nucleotide diversity in watermelon. G3 (Bethesda) 2014;4:2219-30. [PMID: 25227227 PMCID: PMC4232547 DOI: 10.1534/g3.114.012815] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Indexed: 11/18/2022]

Pernaci M, De Mita S, Andrieux A, Pétrowski J, Halkett F, Duplessis S, Frey P. Genome-wide patterns of segregation and linkage disequilibrium: the construction of a linkage genetic map of the poplar rust fungus Melampsora larici-populina. Front Plant Sci 2014;5:454. [PMID: 25309554 PMCID: PMC4159982 DOI: 10.3389/fpls.2014.00454] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Received: 05/31/2014] [Accepted: 08/21/2014] [Indexed: 05/16/2023]

Abstract

The poplar rust fungus Melampsora larici-populina causes significant yield reduction and severe economic losses in commercial poplar plantations. After several decades of breeding for qualitative resistance and subsequent breakdown of the released resistance genes, breeders now focus on quantitative resistance, perceived to be more durable. But quantitative resistance can also be challenged by an increase of aggressiveness in the pathogen. Thus, it is of primary importance to better understand the genetic architecture of aggressiveness traits. To this end, our goal is to build a genetic linkage map for M. larici-populina in order to map quantitative trait loci related to aggressiveness. First, a large progeny of M. larici-populina was generated through selfing of the reference strain 98AG31 (whose genome sequence is available) on larch plants, the alternate host of the poplar rust fungus. The progeny's meiotic origin was validated through a segregation analysis of 115 offspring with 14 polymorphic microsatellite markers, of which 12 segregated in the expected 1:2:1 Mendelian ratio. A microsatellite-based linkage disequilibrium analysis allowed us to identify one potential linkage group comprising two scaffolds. The whole genome of a subset of 47 offspring was resequenced using the Illumina HiSeq 2000 technology at a mean sequencing depth of 6X. The reads were mapped onto the reference genome of the parental strain and 144,566 SNPs were identified across the genome. Analysis of distribution and polymorphism of the SNPs along the genome led to the identification of 2580 recombination blocks. A second linkage disequilibrium analysis, using the recombination blocks as markers, allowed us to group 81 scaffolds into 23 potential linkage groups. These preliminary results showed that a high-density linkage map could be constructed by using high-quality SNPs based on low-coverage resequencing of a larger number of M. larici-populina offspring.

Jiang Y, Xu P, Liu Z. Generation of physical map contig-specific sequences. Front Genet 2014;5:243. [PMID: 25101119 PMCID: PMC4105628 DOI: 10.3389/fgene.2014.00243] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 05/17/2014] [Accepted: 07/07/2014] [Indexed: 12/13/2022] Open

Mascher M, Stein N. Genetic anchoring of whole-genome shotgun assemblies. Front Genet 2014;5:208. [PMID: 25071835 PMCID: PMC4083584 DOI: 10.3389/fgene.2014.00208] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Received: 05/16/2014] [Accepted: 06/19/2014] [Indexed: 12/30/2022] Open