Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Total Articles: 253 (from Reference Citation Analysis)
Article PDFs: 56
Cited by > 0: 232
Searched Name: Contig Mapping/methods

Li M, Li LM. RegScaf: a regression approach to scaffolding. Bioinformatics 2022;38:2675-2682. [PMID: 35561180 PMCID: PMC9326850 DOI: 10.1093/bioinformatics/btac174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/09/2021] [Revised: 02/19/2022] [Accepted: 03/23/2022] [Indexed: 11/13/2022]

Abstract

MOTIVATION

Crucial to the correctness of a genome assembly is the accuracy of the underlying scaffolds that specify the orders and orientations of contigs together with the gap distances between contigs. Current methods construct scaffolds based on the alignments of 'linking' reads against contigs. We found that some 'optimal' alignments are mistaken due to factors such as the contig boundary effect, particularly in the presence of repeats. Occasionally, the incorrect alignments can even overwhelm the correct ones. Detecting incorrect linking information is challenging for existing methods.

RESULTS

In this study, we present a novel scaffolding method, RegScaf. It first examines the distribution of distances between contigs from read alignments using kernel density estimation. When a density shows multiple modes, orientation-supported links are grouped into clusters, each of which defines a linking distance corresponding to a mode. A linear model parameterizes contigs by their positions on the genome; each linking distance between a pair of contigs is then taken as an observation of the difference of their positions. The parameters are estimated by minimizing a global loss function, which is a version of the trimmed sum of squares. The least trimmed squares estimate has such a high breakdown value that it can automatically remove the mistaken linking distances. The results on both synthetic and real datasets demonstrate that RegScaf outperforms some popular scaffolders, especially in the accuracy of gap estimates, by substantially reducing extremely abnormal errors. Its strength in resolving repeat regions is exemplified by a real case. Its adaptability to large genomes and TGS long reads is validated as well.
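The regression idea in this abstract can be illustrated with a small sketch. It is not the RegScaf implementation: the function name `fit_positions`, the `keep_fraction` parameter, and the simple refit-and-trim loop are assumptions made here for illustration, and the kernel density clustering and orientation handling described above are omitted. Each link `(i, j, d)` is treated as an observation that contig `j` lies `d` bases after contig `i`, and the links with the largest squared residuals are trimmed before refitting.

```python
import numpy as np


def fit_positions(links, n_contigs, keep_fraction=0.75, n_iter=20):
    """Estimate contig positions from pairwise linking distances.

    links: list of (i, j, d), read as position[j] - position[i] ~ d.
    Contig 0 is pinned at position 0 so the problem is identifiable.
    Returns (positions, indices of links kept after trimming).
    """
    n_links = len(links)
    A = np.zeros((n_links, n_contigs))
    y = np.empty(n_links)
    for row, (i, j, d) in enumerate(links):
        A[row, i] = -1.0
        A[row, j] = 1.0
        y[row] = d
    A = A[:, 1:]  # drop the column of contig 0 (pinned at 0)

    # Number of links kept in the trimmed sum of squares (hypothetical choice).
    h = max(A.shape[1], int(np.ceil(keep_fraction * n_links)))

    keep = np.arange(n_links)  # first pass is ordinary least squares
    for _ in range(n_iter):
        beta, *_ = np.linalg.lstsq(A[keep], y[keep], rcond=None)
        resid2 = (A @ beta - y) ** 2
        new_keep = np.sort(np.argsort(resid2)[:h])  # h smallest residuals
        if np.array_equal(new_keep, keep):
            break
        keep = new_keep
    return np.concatenate(([0.0], beta)), keep


if __name__ == "__main__":
    # Toy data: six consistent links plus one contradictory link (the last).
    links = [(0, 1, 1000.0), (1, 2, 1500.0), (2, 3, 1500.0),
             (0, 2, 2500.0), (1, 3, 3000.0), (0, 3, 4000.0),
             (0, 1, 5000.0)]
    positions, kept = fit_positions(links, n_contigs=4)
    print(positions, kept)
```

Starting from an ordinary least squares fit is a simplification; robust regression libraries typically use many random starts, because a single start can be trapped when incorrect links dominate.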

AVAILABILITY AND IMPLEMENTATION

RegScaf is publicly available at https://github.com/lemontealala/RegScaf.git.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Nakamoto M, Uchino T, Koshimizu E, Kuchiishi Y, Sekiguchi R, Wang L, Sudo R, Endo M, Guiguen Y, Schartl M, Postlethwait JH, Sakamoto T. A Y-linked anti-Müllerian hormone type-II receptor is the sex-determining gene in ayu, Plecoglossus altivelis. PLoS Genet 2021;17:e1009705. [PMID: 34437539 PMCID: PMC8389408 DOI: 10.1371/journal.pgen.1009705] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 10/20/2020] [Accepted: 07/09/2021] [Indexed: 11/19/2022]

Abstract

Whole-genome duplication and genome compaction are thought to have played important roles in teleost fish evolution. Ayu (or sweetfish), Plecoglossus altivelis, belongs to the superorder Stomiati, order Osmeriformes. Stomiati is phylogenetically classified as sister taxa of Neoteleostei. Thus, ayu holds an important position in the fish tree of life. Although ayu is economically important for the food industry and recreational fishing in Japan, few genomic resources are available for this species. To address this problem, we produced a draft genome sequence of ayu by whole-genome shotgun sequencing and constructed linkage maps using a genotyping-by-sequencing approach. Syntenic analyses of ayu and other teleost fish provided information about chromosomal rearrangements during the divergence of Stomiati, Protacanthopterygii and Neoteleostei. The size of the ayu genome indicates that genome compaction occurred after the divergence of the family Osmeridae. Ayu has an XX/XY sex-determination system for which we identified sex-associated loci by a genome-wide association study by genotyping-by-sequencing and whole-genome resequencing using wild populations. Genome-wide association mapping using wild ayu populations revealed three sex-linked scaffolds (total, 2.03 Mb). Comparison of whole-genome resequencing mapping coverage between males and females identified male-specific regions in sex-linked scaffolds. A duplicate copy of the anti-Müllerian hormone type-II receptor gene (amhr2bY) was found within these male-specific regions, distinct from the autosomal copy of amhr2. Expression of the Y-linked amhr2 gene was male-specific in sox9b-positive somatic cells surrounding germ cells in undifferentiated gonads, whereas autosomal amhr2 transcripts were detected in somatic cells in sexually undifferentiated gonads of both genetic males and females. Loss-of-function mutation for amhr2bY induced male to female sex reversal. 
Taken together with the known role of Amh and Amhr2 in sex differentiation, these results indicate that the paralog of amhr2 on the ayu Y chromosome determines genetic sex, and the male-specific amh-amhr2 pathway is critical for testicular differentiation in ayu.

Ayu (or sweetfish), Plecoglossus altivelis, is widely distributed in East Asia. Ayu belongs to the superorder Stomiati and the order Osmeriformes. Stomiati is phylogenetically classified as the sister group of Neoteleostei, the largest clade of bony fish, which includes medaka, tuna and cod. The divergence of Protacanthopterygii (salmon and pike) and the common ancestor of Stomiati and Neoteleostei is estimated to have occurred approximately 190 million years ago. Thus, ayu holds an important position in the fish tree of life. We sequenced the ayu genome and constructed linkage maps using a genotyping-by-sequencing approach. Comparative analyses of ayu, medaka and northern pike revealed chromosomal rearrangements in the ayu lineage after the divergence of ayu and northern pike. Association mapping revealed a duplicate copy of the anti-Müllerian hormone type-II receptor gene (amhr2bY) located within a male-specific region. Y-linked amhr2 expression was male-specific in supporting cells in undifferentiated gonads, whereas autosomal amhr2 transcripts were detected in somatic cells of sexually undifferentiated gonads in both genetic males and females. Loss-of-function mutation for amhr2bY induced male-to-female sex reversal. Taken together, these results indicate that the paralog of amhr2 on the Y chromosome determines genetic sex. Our findings support the hypothesis that the male-specific amh-amhr2 pathway is critical for gonadal differentiation in ayu.

Fritz A, Bremges A, Deng ZL, Lesker TR, Götting J, Ganzenmueller T, Sczyrba A, Dilthey A, Klawonn F, McHardy AC. Haploflow: strain-resolved de novo assembly of viral genomes. Genome Biol 2021;22:212. [PMID: 34281604 PMCID: PMC8287296 DOI: 10.1186/s13059-021-02426-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 07/03/2020] [Accepted: 06/29/2021] [Indexed: 01/03/2023]

Affiliation(s)

Adrian Fritz Department of Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany German Centre for Infection Research (DZIF), Site Hannover-Braunschweig, Braunschweig, Germany
Andreas Bremges Department of Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany German Centre for Infection Research (DZIF), Site Hannover-Braunschweig, Braunschweig, Germany
Zhi-Luo Deng Department of Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany
Till Robin Lesker Department of Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany German Centre for Infection Research (DZIF), Site Hannover-Braunschweig, Braunschweig, Germany
Jasper Götting German Centre for Infection Research (DZIF), Site Hannover-Braunschweig, Braunschweig, Germany Institute of Virology, Hannover Medical School, Hannover, Germany
Tina Ganzenmueller German Centre for Infection Research (DZIF), Site Hannover-Braunschweig, Braunschweig, Germany Institute of Virology, Hannover Medical School, Hannover, Germany Institute for Medical Virology, University Hospital Tuebingen, Tuebingen, Germany
Alexander Sczyrba Department of Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany Faculty of Technology and Center for Biotechnology, Bielefeld University, Bielefeld, Germany
Alexander Dilthey Institute of Medical Microbiology and Hospital Hygiene, University Hospital, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, Bethesda, MD, 20892, USA
Frank Klawonn Department of Computer Science, Ostfalia University of Applied Sciences, Wolfenbuettel, Germany Biostatistics Group, Helmholtz Centre for Infection Research, Braunschweig, Germany
Alice Carolyn McHardy Department of Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany. German Centre for Infection Research (DZIF), Site Hannover-Braunschweig, Braunschweig, Germany.

Kronenberg ZN, Rhie A, Koren S, Concepcion GT, Peluso P, Munson KM, Porubsky D, Kuhn K, Mueller KA, Low WY, Hiendleder S, Fedrigo O, Liachko I, Hall RJ, Phillippy AM, Eichler EE, Williams JL, Smith TPL, Jarvis ED, Sullivan ST, Kingan SB. Extended haplotype-phasing of long-read de novo genome assemblies using Hi-C. Nat Commun 2021;12:1935. [PMID: 33911078 PMCID: PMC8081726 DOI: 10.1038/s41467-020-20536-y] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Received: 05/13/2020] [Accepted: 11/12/2020] [Indexed: 01/27/2023]

Affiliation(s)

Zev N Kronenberg Phase Genomics, Seattle, WA, USA. Pacific Biosciences, Menlo Park, CA, USA.
Arang Rhie Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, Bethesda, MD, USA
Sergey Koren Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, Bethesda, MD, USA
Gregory T Concepcion Pacific Biosciences, Menlo Park, CA, USA
Paul Peluso Pacific Biosciences, Menlo Park, CA, USA
Katherine M Munson Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
David Porubsky Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
Kristen Kuhn US Meat Animal Research Center, ARS USDA, Clay Center, NE, USA
Kathryn A Mueller Phase Genomics, Seattle, WA, USA
Wai Yee Low Davies Research Centre, School of Animal and Veterinary Sciences, The University of Adelaide, Roseworthy, SA, Australia
Stefan Hiendleder Davies Research Centre, School of Animal and Veterinary Sciences, The University of Adelaide, Roseworthy, SA, Australia
Olivier Fedrigo Vertebrate Genomes Laboratory, The Rockefeller University, New York, NY, USA
Ivan Liachko Phase Genomics, Seattle, WA, USA
Richard J Hall Pacific Biosciences, Menlo Park, CA, USA
Adam M Phillippy Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, Bethesda, MD, USA
Evan E Eichler Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
John L Williams Davies Research Centre, School of Animal and Veterinary Sciences, The University of Adelaide, Roseworthy, SA, Australia Dipartimento di Scienze Animali, della Nutrizione e degli Alimenti, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
Timothy P L Smith US Meat Animal Research Center, ARS USDA, Clay Center, NE, USA
Erich D Jarvis Laboratory of Neurogenetics of Language, The Rockefeller University, New York, NY, USA Howard Hughes Medical Institute, Chevy Chase, MD, USA
Shawn T Sullivan Phase Genomics, Seattle, WA, USA
Sarah B Kingan Pacific Biosciences, Menlo Park, CA, USA.

Deneke C, Brendebach H, Uelze L, Borowiak M, Malorny B, Tausch SH. Species-Specific Quality Control, Assembly and Contamination Detection in Microbial Isolate Sequences with AQUAMIS. Genes (Basel) 2021;12:644. [PMID: 33926025 PMCID: PMC8145556 DOI: 10.3390/genes12050644] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Received: 03/26/2021] [Revised: 04/23/2021] [Accepted: 04/24/2021] [Indexed: 01/13/2023]

Sarkar A, Al-Ars Z, Bertels K. QuASeR: Quantum Accelerated de novo DNA sequence reconstruction. PLoS One 2021;16:e0249850. [PMID: 33844699 PMCID: PMC8041170 DOI: 10.1371/journal.pone.0249850] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 09/24/2020] [Accepted: 03/24/2021] [Indexed: 01/10/2023]

Di Genova A, Buena-Atienza E, Ossowski S, Sagot MF. Efficient hybrid de novo assembly of human genomes with WENGAN. Nat Biotechnol 2021;39:422-430. [PMID: 33318652 PMCID: PMC8041623 DOI: 10.1038/s41587-020-00747-w] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Received: 12/04/2019] [Revised: 10/08/2020] [Accepted: 10/21/2020] [Indexed: 12/12/2022]

Collins JH, Keating KW, Jones TR, Balaji S, Marsan CB, Çomo M, Newlon ZJ, Mitchell T, Bartley B, Adler A, Roehner N, Young EM. Engineered yeast genomes accurately assembled from pure and mixed samples. Nat Commun 2021;12:1485. [PMID: 33674578 PMCID: PMC7935868 DOI: 10.1038/s41467-021-21656-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 02/21/2020] [Accepted: 02/04/2021] [Indexed: 01/31/2023]

Schwengers O, Barth P, Falgenhauer L, Hain T, Chakraborty T, Goesmann A. Platon: identification and characterization of bacterial plasmid contigs in short-read draft assemblies exploiting protein sequence-based replicon distribution scores. Microb Genom 2020;6:mgen000398. [PMID: 32579097 PMCID: PMC7660248 DOI: 10.1099/mgen.0.000398] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Received: 12/03/2019] [Accepted: 06/02/2020] [Indexed: 12/22/2022]

Abstract

Plasmids are extrachromosomal genetic elements that replicate independently of the chromosome and play a vital role in the environmental adaptation of bacteria. Due to potential mobilization or conjugation capabilities, plasmids are important genetic vehicles for antimicrobial resistance genes and virulence factors with huge and increasing clinical implications. They are therefore subject to large genomic studies within the scientific community worldwide. As a result of rapidly improving next-generation sequencing methods, the quantity of sequenced bacterial genomes is constantly increasing, in turn raising the need for specialized tools to (i) extract plasmid sequences from draft assemblies, (ii) derive their origin and distribution, and (iii) further investigate their genetic repertoire. Recently, several bioinformatic methods and tools have emerged to tackle this issue; however, a combination of high sensitivity and specificity in plasmid sequence identification is rarely achieved in a taxon-independent manner. In addition, many software tools are not appropriate for large high-throughput analyses or cannot be included in existing software pipelines due to their technical design or software implementation. In this study, we investigated differences in the replicon distributions of protein-coding genes on a large scale as a new approach to distinguish plasmid-borne from chromosome-borne contigs. We defined and computed statistical discrimination thresholds for a new metric: the replicon distribution score (RDS), which achieved an accuracy of 96.6 %. The final performance was further improved by the combination of the RDS metric with heuristics exploiting several plasmid-specific higher-level contig characterizations. We implemented this workflow in a new high-throughput taxon-independent bioinformatics software tool called Platon for the recruitment and characterization of plasmid-borne contigs from short-read draft assemblies. 
Compared to PlasFlow, Platon achieved a higher accuracy (97.5 %) and more balanced predictions (F1=82.6 %) tested on a broad range of bacterial taxa and better or equal performance against the targeted tools PlasmidFinder and PlaScope on sequenced Escherichia coli isolates. Platon is available at: http://platon.computational.bio/.
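The abstract reports both accuracy and F1, and the two can diverge sharply when plasmid contigs are a small minority of all contigs. The sketch below shows how each is computed from a confusion matrix. The function name `accuracy_and_f1` and the counts in the usage example are hypothetical and are not taken from the Platon evaluation.

```python
def accuracy_and_f1(tp, fp, tn, fn):
    """Return (accuracy, F1) for a binary plasmid/chromosome classifier.

    tp: plasmid contigs called plasmid      fp: chromosome contigs called plasmid
    tn: chromosome contigs called chromosome fn: plasmid contigs called chromosome
    """
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return accuracy, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, f1


if __name__ == "__main__":
    # Hypothetical counts: 50 plasmid contigs among 1000 contigs in total.
    acc, f1 = accuracy_and_f1(tp=40, fp=10, tn=940, fn=10)
    print(f"accuracy={acc:.3f}  F1={f1:.3f}")
```

Because true negatives dominate in such data, accuracy stays high even when a noticeable share of plasmid contigs is missed, which is why F1 is the more informative figure for judging how balanced the predictions are.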

Alonge M, Shumate A, Puiu D, Zimin AV, Salzberg SL. Chromosome-Scale Assembly of the Bread Wheat Genome Reveals Thousands of Additional Gene Copies. Genetics 2020;216:599-608. [PMID: 32796007 PMCID: PMC7536849 DOI: 10.1534/genetics.120.303501] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Received: 07/04/2020] [Accepted: 08/10/2020] [Indexed: 11/18/2022]

Maggi J, Roberts L, Koller S, Rebello G, Berger W, Ramesar R. De Novo Assembly-Based Analysis of RPGR Exon ORF15 in an Indigenous African Cohort Overcomes Limitations of a Standard Next-Generation Sequencing (NGS) Data Analysis Pipeline. Genes (Basel) 2020;11:800. [PMID: 32679846 PMCID: PMC7396994 DOI: 10.3390/genes11070800] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 06/11/2020] [Revised: 06/24/2020] [Accepted: 07/13/2020] [Indexed: 01/10/2023]

Klein J, Neilen M, van Verk M, Dutilh BE, Van den Ackerveken G. Genome reconstruction of the non-culturable spinach downy mildew Peronospora effusa by metagenome filtering. PLoS One 2020;15:e0225808. [PMID: 32396560 PMCID: PMC7217449 DOI: 10.1371/journal.pone.0225808] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 11/12/2019] [Accepted: 04/24/2020] [Indexed: 01/27/2023]

Abstract

Peronospora effusa (previously known as P. farinosa f. sp. spinaciae, and here referred to as Pfs) is an obligate biotrophic oomycete that causes downy mildew on spinach (Spinacia oleracea). To combat this destructive disease, many disease-resistant cultivars have been bred and used. However, new Pfs races rapidly break the employed resistance genes. To gain insight into the gene repertoire of Pfs and identify infection-related genes, the genome of the first reference race, Pfs1, was sequenced, assembled, and annotated. Due to the obligate biotrophic nature of this pathogen, material for DNA isolation can only be collected from infected spinach leaves, which also contain many other microorganisms. The obtained sequences can, therefore, be considered a metagenome. To filter and obtain Pfs sequences, we utilized the CAT tool to taxonomically annotate ORFs residing on long sequences of a genome pre-assembly. This study is the first to show that CAT filtering performs well on eukaryotic contigs. Based on the taxonomy, determined on multiple ORFs, contaminating long sequences and corresponding reads were removed from the metagenome. Filtered reads were re-assembled to provide a clean and improved Pfs genome sequence of 32.4 Mbp consisting of 8,635 scaffolds. Transcript sequencing of a range of infection time points aided the prediction of a total of 13,277 gene models, including 99 RxLR(-like) effector and 14 putative Crinkler genes. Comparative analysis identified common features in the predicted secretomes of different obligate biotrophic oomycetes, regardless of their phylogenetic distance. Their secretomes are generally smaller than those of hemi-biotrophic and necrotrophic oomycete species. We observe a reduction in proteins involved in cell wall degradation, in Nep1-like proteins (NLPs), proteins with PAN/apple domains, and host-translocated effectors.
The genome of Pfs1 will be instrumental in studying downy mildew virulence and for understanding the molecular adaptations by which new isolates break spinach resistance.

Sadat-Hosseini M, Bakhtiarizadeh MR, Boroomand N, Tohidfar M, Vahdati K. Combining independent de novo assemblies to optimize leaf transcriptome of Persian walnut. PLoS One 2020;15:e0232005. [PMID: 32343733 PMCID: PMC7188282 DOI: 10.1371/journal.pone.0232005] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Received: 05/27/2019] [Accepted: 04/06/2020] [Indexed: 12/22/2022]

Jayakumar V, Ishii H, Seki M, Kumita W, Inoue T, Hase S, Sato K, Okano H, Sasaki E, Sakakibara Y. An improved de novo genome assembly of the common marmoset genome yields improved contiguity and increased mapping rates of sequence data. BMC Genomics 2020;21:243. [PMID: 32241258 PMCID: PMC7114785 DOI: 10.1186/s12864-020-6657-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 03/05/2020] [Accepted: 03/09/2020] [Indexed: 12/31/2022]

Linsmith G, Rombauts S, Montanari S, Deng CH, Celton JM, Guérif P, Liu C, Lohaus R, Zurn JD, Cestaro A, Bassil NV, Bakker LV, Schijlen E, Gardiner SE, Lespinasse Y, Durel CE, Velasco R, Neale DB, Chagné D, Van de Peer Y, Troggio M, Bianco L. Pseudo-chromosome-length genome assembly of a double haploid "Bartlett" pear (Pyrus communis L.). Gigascience 2019;8:giz138. [PMID: 31816089 PMCID: PMC6901071 DOI: 10.1093/gigascience/giz138] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Received: 07/16/2019] [Revised: 10/18/2019] [Accepted: 10/30/2019] [Indexed: 11/14/2022]

Affiliation(s)

Gareth Linsmith Center for Plant Systems Biology, VIB, Technologiepark 71, 9052, Gent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Gent, Belgium Fondazione Edmund Mach, via E. Mach 1, 38010, San Michele all'Adige (TN), Italy
Stephane Rombauts Center for Plant Systems Biology, VIB, Technologiepark 71, 9052, Gent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Gent, Belgium
Sara Montanari University of California Davis, Department of Plant Sciences, One Shields Ave, Davis, CA 95616, USA
Cecilia H Deng The New Zealand Institute for Plant & Food Research Limited (PFR), Mt Albert Research Centre, 120 Mt Albert Road, Sandringham, Auckland, 1025, New Zealand
Jean-Marc Celton IRHS, INRA, Agrocampus-Ouest, Université d'Angers, SFR 4207 Quasav, 42 rue Georges Morel, F-49071 Beaucouzé, France
Philippe Guérif IRHS, INRA, Agrocampus-Ouest, Université d'Angers, SFR 4207 Quasav, 42 rue Georges Morel, F-49071 Beaucouzé, France
Chang Liu ZMBP, Allgemeine Genetik, Universität Tübingen, Auf der Morgenstelle 32, D-72076 Tübingen, Germany
Rolf Lohaus Center for Plant Systems Biology, VIB, Technologiepark 71, 9052, Gent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Gent, Belgium
Jason D Zurn USDA-ARS National Clonal Germplasm Repository, 33447 Peoria Road, Corvallis, OR 97333, USA
Alessandro Cestaro Fondazione Edmund Mach, via E. Mach 1, 38010, San Michele all'Adige (TN), Italy
Nahla V Bassil USDA-ARS National Clonal Germplasm Repository, 33447 Peoria Road, Corvallis, OR 97333, USA
Linda V Bakker Wageningen UR – Bioscience P.O. Box 16, 6700AA, Wageningen, The Netherlands
Elio Schijlen Wageningen UR – Bioscience P.O. Box 16, 6700AA, Wageningen, The Netherlands
Susan E Gardiner The New Zealand Institute for Plant & Food Research Limited (PFR), Palmerston North Research Centre, Palmerston North, New Zealand
Yves Lespinasse IRHS, INRA, Agrocampus-Ouest, Université d'Angers, SFR 4207 Quasav, 42 rue Georges Morel, F-49071 Beaucouzé, France
Charles-Eric Durel IRHS, INRA, Agrocampus-Ouest, Université d'Angers, SFR 4207 Quasav, 42 rue Georges Morel, F-49071 Beaucouzé, France
Riccardo Velasco CREA Research Centre for Viticulture and Enology, Via XXVIII Aprile 26, 31015 Conegliano (TV), Italy
David B Neale University of California Davis, Department of Plant Sciences, One Shields Ave, Davis, CA 95616, USA
David Chagné The New Zealand Institute for Plant & Food Research Limited (PFR), Palmerston North Research Centre, Palmerston North, New Zealand
Yves Van de Peer Center for Plant Systems Biology, VIB, Technologiepark 71, 9052, Gent, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, 9052 Gent, Belgium Center for Microbial Ecology and Genomics, Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Roper street, Pretoria 0028, South Africa
Michela Troggio Fondazione Edmund Mach, via E. Mach 1, 38010, San Michele all'Adige (TN), Italy
Luca Bianco Fondazione Edmund Mach, via E. Mach 1, 38010, San Michele all'Adige (TN), Italy

Souza GM, Van Sluys MA, Lembke CG, Lee H, Margarido GRA, Hotta CT, Gaiarsa JW, Diniz AL, Oliveira MDM, Ferreira SDS, Nishiyama MY, ten-Caten F, Ragagnin GT, Andrade PDM, de Souza RF, Nicastro GG, Pandya R, Kim C, Guo H, Durham AM, Carneiro MS, Zhang J, Zhang X, Zhang Q, Ming R, Schatz MC, Davidson B, Paterson AH, Heckerman D. Assembly of the 373k gene space of the polyploid sugarcane genome reveals reservoirs of functional diversity in the world's leading biomass crop. Gigascience 2019;8:giz129. [PMID: 31782791 PMCID: PMC6884061 DOI: 10.1093/gigascience/giz129] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Received: 01/25/2019] [Revised: 05/23/2019] [Accepted: 10/08/2019] [Indexed: 11/29/2022]

Affiliation(s)

Glaucia Mendes Souza Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
Marie-Anne Van Sluys Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, Rua do Matão, 277, São Paulo, SP 05508-090, Brazil
Carolina Gimiliani Lembke Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
Hayan Lee Cold Spring Harbor Laboratory, One Bungtown Road, Koch Building #1119, Cold Spring Harbor, NY 11724, United States of America Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, United States of America
Gabriel Rodrigues Alves Margarido Departamento de Genética, Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, Avenida Pádua Dias, 11, Piracicaba, SP 13418-900, Brazil
Carlos Takeshi Hotta Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
Jonas Weissmann Gaiarsa Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, Rua do Matão, 277, São Paulo, SP 05508-090, Brazil
Augusto Lima Diniz Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
Mauro de Medeiros Oliveira Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
Sávio de Siqueira Ferreira Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, Rua do Matão, 277, São Paulo, SP 05508-090, Brazil
Milton Yutaka Nishiyama Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil Laboratório Especial de Toxinologia Aplicada, Instituto Butantan, Av. Vital Brasil, 1500, São Paulo, SP 05503-900, Brazil
Felipe ten-Caten Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
Geovani Tolfo Ragagnin Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, Rua do Matão, 277, São Paulo, SP 05508-090, Brazil
Pablo de Morais Andrade Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Av. Prof. Lineu Prestes, 748, São Paulo, SP 05508-000, Brazil
Robson Francisco de Souza Departamento de Microbiologia, Instituto de Ciências Biomédicas, Universidade de São Paulo, Av. Professor Lineu Prestes, 1734, São Paulo, SP 05508-900, Brazil
Gianlucca Gonçalves Nicastro Departamento de Microbiologia, Instituto de Ciências Biomédicas, Universidade de São Paulo, Av. Professor Lineu Prestes, 1734, São Paulo, SP 05508-900, Brazil
Ravi Pandya Microsoft Research, One Microsoft Way, Redmond, WA 98052, United States of America
Changsoo Kim Plant Genome Mapping Laboratory, University of Georgia, 120 Green Street, Athens, GA 30602-7223, United States of America Department of Crop Science, Chungnam National University, 99 Daehak Ro Yuseong Gu, Daejeon, 34134, South Korea
Hui Guo Plant Genome Mapping Laboratory, University of Georgia, 120 Green Street, Athens, GA 30602-7223, United States of America
Alan Mitchell Durham Departamento de Ciências da Computação, Instituto de Matemática e Estatística, Universidade de São Paulo, Rua do Matão, 1010, São Paulo, SP 05508-090, Brazil
Monalisa Sampaio Carneiro Departamento de Biotecnologia e Produção Vegetal e Animal, Centro de Ciências Agrárias, Universidade Federal de São Carlos, Rodovia Washington Luis km 235, Araras, SP 13.565-905, Brazil
Jisen Zhang FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Shangxiadian Road, Fuzhou 350002, Fujian, China
Xingtan Zhang FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Shangxiadian Road, Fuzhou 350002, Fujian, China
Qing Zhang FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Shangxiadian Road, Fuzhou 350002, Fujian, China
Ray Ming FAFU and UIUC-SIB Joint Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Shangxiadian Road, Fuzhou 350002, Fujian, China Department of Plant Biology, University of Illinois at Urbana-Champaign, 201 W. Gregory Dr., Urbana, Illinois 61801, United States of America
Michael C Schatz Cold Spring Harbor Laboratory, One Bungtown Road, Koch Building #1119, Cold Spring Harbor, NY 11724, United States of America Departments of Computer Science and Biology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218-2608, United States of America
Bob Davidson Microsoft Research, One Microsoft Way, Redmond, WA 98052, United States of America
Andrew H Paterson Plant Genome Mapping Laboratory, University of Georgia, 120 Green Street, Athens, GA 30602-7223, United States of America
David Heckerman Microsoft Research, One Microsoft Way, Redmond, WA 98052, United States of America

Kim HS, Jeon S, Kim C, Kim YK, Cho YS, Kim J, Blazyte A, Manica A, Lee S, Bhak J. Chromosome-scale assembly comparison of the Korean Reference Genome KOREF from PromethION and PacBio with Hi-C mapping information. Gigascience 2019;8:giz125. [PMID: 31794015 PMCID: PMC6889754 DOI: 10.1093/gigascience/giz125] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Received: 06/27/2019] [Revised: 09/02/2019] [Accepted: 09/28/2019] [Indexed: 01/09/2023] Open

Affiliation(s)

Hui-Su Kim KOGIC, Ulsan National Institute of Science and Technology (UNIST), UNIST-gil 50, Eonyang-eup, Ulju-gun, Ulsan 44919, Republic of Korea
Sungwon Jeon KOGIC, Ulsan National Institute of Science and Technology (UNIST), UNIST-gil 50, Eonyang-eup, Ulju-gun, Ulsan 44919, Republic of Korea Department of Biomedical Engineering, School of Life Sciences, UNIST-gil 50, Eonyang-eup, Ulju-gun, UNIST, Ulsan 44919, Republic of Korea
Changjae Kim KOGIC, Ulsan National Institute of Science and Technology (UNIST), UNIST-gil 50, Eonyang-eup, Ulju-gun, Ulsan 44919, Republic of Korea
Yeon Kyung Kim KOGIC, Ulsan National Institute of Science and Technology (UNIST), UNIST-gil 50, Eonyang-eup, Ulju-gun, Ulsan 44919, Republic of Korea
Yun Sung Cho Clinomics Inc., UNIST-gil 50, Eonyang-eup, Ulju-gun, Ulsan 44919, Republic of Korea
Jungeun Kim Personal Genomics Institute, Genome Research Foundation, Osong saengmyong1ro, Cheongju 28160, Republic of Korea
Asta Blazyte KOGIC, Ulsan National Institute of Science and Technology (UNIST), UNIST-gil 50, Eonyang-eup, Ulju-gun, Ulsan 44919, Republic of Korea
Andrea Manica Department of Zoology, Cambridge University, Downing Street, Cambridge CB2 3EJ, UK
Semin Lee KOGIC, Ulsan National Institute of Science and Technology (UNIST), UNIST-gil 50, Eonyang-eup, Ulju-gun, Ulsan 44919, Republic of Korea Department of Biomedical Engineering, School of Life Sciences, UNIST-gil 50, Eonyang-eup, Ulju-gun, UNIST, Ulsan 44919, Republic of Korea
Jong Bhak KOGIC, Ulsan National Institute of Science and Technology (UNIST), UNIST-gil 50, Eonyang-eup, Ulju-gun, Ulsan 44919, Republic of Korea Department of Biomedical Engineering, School of Life Sciences, UNIST-gil 50, Eonyang-eup, Ulju-gun, UNIST, Ulsan 44919, Republic of Korea Clinomics Inc., UNIST-gil 50, Eonyang-eup, Ulju-gun, Ulsan 44919, Republic of Korea Personal Genomics Institute, Genome Research Foundation, Osong saengmyong1ro, Cheongju 28160, Republic of Korea

Ghosh P, Kalyanaraman A. FastEtch: A Fast Sketch-Based Assembler for Genomes. IEEE/ACM Trans Comput Biol Bioinform 2019;16:1091-1106. [PMID: 28910776 DOI: 10.1109/tcbb.2017.2737999] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 06/07/2023]

Abstract

De novo genome assembly describes the process of reconstructing an unknown genome from a large collection of short (or long) reads sequenced from the genome. A single run of a Next-Generation Sequencing (NGS) technology can produce billions of short reads, making genome assembly computationally demanding (both in terms of memory and time). One of the major computational steps in modern-day short-read assemblers involves the construction and use of a string data structure called the de Bruijn graph. In fact, a majority of short-read assemblers build the complete de Bruijn graph for the set of input reads, and subsequently traverse and prune low-quality edges, in order to generate genomic "contigs", the output of assembly. These steps of graph construction and traversal contribute to well over 90 percent of the runtime and memory. In this paper, we present a fast algorithm, FastEtch, that uses sketching to build an approximate version of the de Bruijn graph for the purpose of generating an assembly. The algorithm uses Count-Min sketch, which is a probabilistic data structure for streaming data sets. The result is an approximate de Bruijn graph that stores information pertaining only to a selected subset of nodes that are most likely to contribute to the contig generation step. In addition, edges are not stored; instead, the fraction of edges that contribute to contig generation is detected on the fly. This approximate approach is intended to significantly improve performance (both execution time and memory footprint) whilst possibly compromising on the output assembly quality. We present two main versions of the assembler: one that generates an assembly where each contig represents a contiguous genomic region from one strand of the DNA, and another that generates an assembly where the contigs can straddle either of the two strands of the DNA. For further scalability, we have implemented a multi-threaded parallel code.
Experimental results using our algorithm conducted on E. coli, Yeast, C. elegans, and Human (Chr2 and Chr2+3) genomes show that our method yields one of the best time-memory-quality trade-offs, when compared against many state-of-the-art genome assemblers.
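As an illustration of the data structure named in the abstract above, the following is a minimal sketch of a Count-Min sketch used to count k-mers approximately. It is not FastEtch's implementation: the class name `CountMinSketch`, the helper `count_kmers`, the table dimensions, and the use of salted BLAKE2b hashes are all assumptions chosen for this example.

```python
import hashlib


class CountMinSketch:
    """Approximate counter: estimates never undercount but may overcount."""

    def __init__(self, width=2048, depth=4):
        # width and depth are illustrative defaults, not values from the paper.
        self.width = width
        self.depth = depth
        self.table = [[0] * width for _ in range(depth)]

    def _index(self, item, row):
        # One hash function per row, obtained by salting BLAKE2b with the row number.
        salt = row.to_bytes(8, "little")
        digest = hashlib.blake2b(item.encode(), salt=salt).digest()
        return int.from_bytes(digest[:8], "little") % self.width

    def add(self, item):
        # Increment one counter in every row.
        for row in range(self.depth):
            self.table[row][self._index(item, row)] += 1

    def estimate(self, item):
        # The smallest counter is the one least inflated by collisions.
        return min(
            self.table[row][self._index(item, row)] for row in range(self.depth)
        )


def kmers(read, k):
    """Yield every length-k substring of a read."""
    return (read[i:i + k] for i in range(len(read) - k + 1))


def count_kmers(reads, k, width=2048, depth=4):
    """Stream all k-mers of all reads into a sketch (hypothetical helper)."""
    sketch = CountMinSketch(width, depth)
    for read in reads:
        for kmer in kmers(read, k):
            sketch.add(kmer)
    return sketch
```

Memory use is fixed by `width * depth` regardless of how many distinct k-mers are seen, which is the trade-off the abstract alludes to: a bounded footprint in exchange for counts that can be overestimated when k-mers collide.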

Hölzer M, Marz M. De novo transcriptome assembly: A comprehensive cross-species comparison of short-read RNA-Seq assemblers. Gigascience 2019;8:giz039. [PMID: 31077315 PMCID: PMC6511074 DOI: 10.1093/gigascience/giz039] [Citation(s) in RCA: 107] [Impact Index Per Article: 21.4] [Received: 08/14/2018] [Revised: 12/21/2018] [Accepted: 03/09/2019] [Indexed: 12/13/2022] Open

Kingan SB, Heaton H, Cudini J, Lambert CC, Baybayan P, Galvin BD, Durbin R, Korlach J, Lawniczak MKN. A High-Quality De novo Genome Assembly from a Single Mosquito Using PacBio Sequencing. Genes (Basel) 2019;10:E62. [PMID: 30669388 PMCID: PMC6357164 DOI: 10.3390/genes10010062] [Citation(s) in RCA: 79] [Impact Index Per Article: 15.8] [Received: 12/18/2018] [Revised: 01/14/2019] [Accepted: 01/15/2019] [Indexed: 12/15/2022] Open

Abstract

A high-quality reference genome is a fundamental resource for functional genetics, comparative genomics, and population genomics, and is increasingly important for conservation biology. PacBio Single Molecule, Real-Time (SMRT) sequencing generates long reads with uniform coverage and high consensus accuracy, making it a powerful technology for de novo genome assembly. Improvements in throughput and concomitant reductions in cost have made PacBio an attractive core technology for many large genome initiatives; however, relatively high DNA input requirements (~5 µg for standard library protocol) have placed PacBio out of reach for many projects on small organisms that have lower DNA content, or on projects with limited input DNA for other reasons. Here we present a high-quality de novo genome assembly from a single Anopheles coluzzii mosquito. A modified SMRTbell library construction protocol without DNA shearing and size selection was used to generate a SMRTbell library from just 100 ng of starting genomic DNA. The sample was run on the Sequel System with chemistry 3.0 and software v6.0, generating, on average, 25 Gb of sequence per SMRT Cell with 20 h movies, followed by diploid de novo genome assembly with FALCON-Unzip. The resulting curated assembly had high contiguity (contig N50 3.5 Mb) and completeness (more than 98% of conserved genes were present and full-length). In addition, this single-insect assembly now places 667 (>90%) of formerly unplaced genes into their appropriate chromosomal contexts in the AgamP4 PEST reference. We were also able to resolve maternal and paternal haplotypes for over 1/3 of the genome. By sequencing and assembling material from a single diploid individual, only two haplotypes were present, simplifying the assembly process compared to samples from multiple pooled individuals. The method presented here can be applied to samples with starting DNA amounts as low as 100 ng per 1 Gb genome size.
This new low-input approach puts PacBio-based assemblies in reach for small highly heterozygous organisms that comprise much of the diversity of life.

Xu GC, Xu TJ, Zhu R, Zhang Y, Li SQ, Wang HW, Li JT. LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly. Gigascience 2019. [PMID: 30576505 DOI: 10.5524/100540] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 05/13/2023] Open

Abstract

BACKGROUND

Completing a genome is an important goal of genome assembly. However, many assemblies, including reference assemblies, are unfinished and have a number of gaps. Long reads obtained from third-generation sequencing (TGS) platforms can help close these gaps and improve assembly contiguity. However, current gap-closure approaches using long reads require extensive runtime and high memory usage. Thus, a fast and memory-efficient approach using long reads is needed to obtain complete genomes.

FINDINGS

We developed LR_Gapcloser to rapidly and efficiently close the gaps in genome assembly. This tool utilizes long reads generated from TGS sequencing platforms. Tested on de novo assembled gaps, repeat-derived gaps, and real gaps, LR_Gapcloser closed a higher number of gaps faster and with a lower error rate and a much lower memory usage than two existing, state-of-the-art tools. This tool utilized raw reads to fill more gaps than when using error-corrected reads. It is applicable to gaps in the assemblies by different approaches and from large and complex genomes. After performing gap-closure using this tool, the contig N50 size of the human CHM1 genome was improved from 143 kb to 19 Mb, a 132-fold increase. We also closed the gaps in the Triticum urartu genome, a large genome rich in repeats; the contig N50 size was increased by 40%. Further, we evaluated the contiguity and correctness of six hybrid assembly strategies by combining the optimal TGS-based and next-generation sequencing-based assemblers with LR_Gapcloser. A proposed and optimal hybrid strategy generated a new human CHM1 genome assembly with marked contiguity. The contig N50 value was greater than 28 Mb, which is larger than previous non-reference assemblies of the diploid human genome.

CONCLUSIONS

LR_Gapcloser is a fast and efficient tool that can be used to close gaps and improve the contiguity of genome assemblies. A proposed hybrid assembly including this tool promises reference-grade assemblies. The software is available at http://www.fishbrowser.org/software/LR_Gapcloser/.
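The abstract above reports its gains in terms of contig N50. As a reading aid, here is a minimal sketch of the usual N50 definition: the largest length L such that contigs of length at least L together cover at least half of the total assembly length. The function name `n50` is an assumption for this example and is not taken from the LR_Gapcloser software.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths (standard definition)."""
    if not lengths:
        raise ValueError("need at least one contig length")
    total = sum(lengths)
    running = 0
    # Walk contigs from longest to shortest until half the assembly is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
```

For example, `n50([2, 2, 2, 3, 3, 4, 8, 8])` returns 8, because the two 8-unit contigs already cover 16 of the 32 total units. Closing gaps merges neighbouring contigs into longer ones, which is why N50 rises after gap closure.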

Affiliation(s)

Gui-Cai Xu Key Laboratory of Aquatic Genomics, Ministry of Agriculture and Rural Affairs, CAFS Key Laboratory of Aquatic Genomics and Beijing Key Laboratory of Fishery Biotechnology, Chinese Academy of Fishery Sciences, 150 Yongding Road, Beijing, 100141, China College of Marine Science, Zhejiang Ocean University, 1 Haida South Road, Zhoushan, 316022, China
Tian-Jun Xu College of Marine Science, Zhejiang Ocean University, 1 Haida South Road, Zhoushan, 316022, China
Rui Zhu Key Laboratory of Aquatic Genomics, Ministry of Agriculture and Rural Affairs, CAFS Key Laboratory of Aquatic Genomics and Beijing Key Laboratory of Fishery Biotechnology, Chinese Academy of Fishery Sciences, 150 Yongding Road, Beijing, 100141, China College of Fisheries and Life Science, Shanghai Ocean University, 999 Huchenghuan Road, Shanghai, 201306, China
Yan Zhang Key Laboratory of Aquatic Genomics, Ministry of Agriculture and Rural Affairs, CAFS Key Laboratory of Aquatic Genomics and Beijing Key Laboratory of Fishery Biotechnology, Chinese Academy of Fishery Sciences, 150 Yongding Road, Beijing, 100141, China
Shang-Qi Li Key Laboratory of Aquatic Genomics, Ministry of Agriculture and Rural Affairs, CAFS Key Laboratory of Aquatic Genomics and Beijing Key Laboratory of Fishery Biotechnology, Chinese Academy of Fishery Sciences, 150 Yongding Road, Beijing, 100141, China
Hong-Wei Wang Key Laboratory of Aquatic Genomics, Ministry of Agriculture and Rural Affairs, CAFS Key Laboratory of Aquatic Genomics and Beijing Key Laboratory of Fishery Biotechnology, Chinese Academy of Fishery Sciences, 150 Yongding Road, Beijing, 100141, China
Jiong-Tang Li Key Laboratory of Aquatic Genomics, Ministry of Agriculture and Rural Affairs, CAFS Key Laboratory of Aquatic Genomics and Beijing Key Laboratory of Fishery Biotechnology, Chinese Academy of Fishery Sciences, 150 Yongding Road, Beijing, 100141, China

Xu GC, Xu TJ, Zhu R, Zhang Y, Li SQ, Wang HW, Li JT. LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly. Gigascience 2019. [PMID: 30576505 DOI: 10.1093/gigascience/giy157/5256637] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 04/29/2023] Open

Kolmogorov M, Armstrong J, Raney BJ, Streeter I, Dunn M, Yang F, Odom D, Flicek P, Keane TM, Thybert D, Paten B, Pham S. Chromosome assembly of large and complex genomes using multiple references. Genome Res 2018;28:1720-1732. [PMID: 30341161 PMCID: PMC6211643 DOI: 10.1101/gr.236273.118] [Citation(s) in RCA: 62] [Impact Index Per Article: 10.3] [Received: 03/06/2018] [Accepted: 09/24/2018] [Indexed: 11/25/2022]

Affiliation(s)

Mikhail Kolmogorov Department of Computer Science and Engineering, University of California, San Diego, California 92093, USA
Joel Armstrong Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California 95064, USA
Brian J Raney Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California 95064, USA
Ian Streeter European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom
Matthew Dunn Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, United Kingdom
Fengtang Yang Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, United Kingdom
Duncan Odom Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, United Kingdom Cancer Research UK Cambridge Institute, University of Cambridge, CB2 0RE Cambridge, United Kingdom
Paul Flicek European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, United Kingdom
Thomas M Keane European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, United Kingdom School of Life Sciences, University of Nottingham, Nottingham NG7 2NR, United Kingdom
David Thybert European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom Earlham Institute, Norwich Research Park, Norwich NR4 7UG, United Kingdom
Benedict Paten Center for Biomolecular Science and Engineering, University of California, Santa Cruz, California 95064, USA
Son Pham BioTuring Incorporated, San Diego, California 92121, USA

Lee T, Kim MY, Ha J, Lee SH. Detection of large sequence insertions by a hybrid approach that combine de novo assembly and resequencing of medium-coverage genome sequences. Genome 2018;61:745-754. [PMID: 30227080 DOI: 10.1139/gen-2018-0027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 12/17/2023]

Pennisi E. New technologies boost genome quality. Science 2017;357:10-11. [PMID: 28684474 DOI: 10.1126/science.357.6346.10] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Indexed: 11/03/2022]

White DJ, Wang J, Hall RJ. Assessing the Impact of Assemblers on Virus Detection in a De Novo Metagenomic Analysis Pipeline. J Comput Biol 2017;24:874-881. [PMID: 28414526 PMCID: PMC5610382 DOI: 10.1089/cmb.2017.0008] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Indexed: 11/12/2022] Open

Besnard F, Koutsovoulos G, Dieudonné S, Blaxter M, Félix MA. Toward Universal Forward Genetics: Using a Draft Genome Sequence of the Nematode Oscheius tipulae To Identify Mutations Affecting Vulva Development. Genetics 2017;206:1747-1761. [PMID: 28630114 PMCID: PMC5560785 DOI: 10.1534/genetics.117.203521] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Received: 05/03/2017] [Accepted: 06/15/2017] [Indexed: 12/30/2022] Open

Zheng-Bradley X, Streeter I, Fairley S, Richardson D, Clarke L, Flicek P. Alignment of 1000 Genomes Project reads to reference assembly GRCh38. Gigascience 2017;6:1-8. [PMID: 28531267 PMCID: PMC5522380 DOI: 10.1093/gigascience/gix038] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Received: 02/07/2017] [Revised: 03/29/2017] [Accepted: 05/19/2017] [Indexed: 12/30/2022] Open

Baaijens JA, Aabidine AZE, Rivals E, Schönhuth A. De novo assembly of viral quasispecies using overlap graphs. Genome Res 2017;27:835-848. [PMID: 28396522 PMCID: PMC5411778 DOI: 10.1101/gr.215038.116] [Citation(s) in RCA: 74] [Impact Index Per Article: 10.6] [Received: 08/22/2016] [Accepted: 03/10/2017] [Indexed: 11/24/2022]

Zimin AV, Puiu D, Luo MC, Zhu T, Koren S, Marçais G, Yorke JA, Dvořák J, Salzberg SL. Hybrid assembly of the large and highly repetitive genome of Aegilops tauschii, a progenitor of bread wheat, with the MaSuRCA mega-reads algorithm. Genome Res 2017. [PMID: 28130360 DOI: 10.1101/gr.213405.116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/06/2023]

Clavijo BJ, Venturini L, Schudoma C, Accinelli GG, Kaithakottil G, Wright J, Borrill P, Kettleborough G, Heavens D, Chapman H, Lipscombe J, Barker T, Lu FH, McKenzie N, Raats D, Ramirez-Gonzalez RH, Coince A, Peel N, Percival-Alwyn L, Duncan O, Trösch J, Yu G, Bolser DM, Namaati G, Kerhornou A, Spannagl M, Gundlach H, Haberer G, Davey RP, Fosker C, Palma FD, Phillips AL, Millar AH, Kersey PJ, Uauy C, Krasileva KV, Swarbreck D, Bevan MW, Clark MD. An improved assembly and annotation of the allohexaploid wheat genome identifies complete families of agronomic genes and provides genomic evidence for chromosomal translocations. Genome Res 2017;27:885-896. [PMID: 28420692 PMCID: PMC5411782 DOI: 10.1101/gr.217117.116] [Citation(s) in RCA: 243] [Impact Index Per Article: 34.7] [Received: 10/13/2016] [Accepted: 03/14/2017] [Indexed: 01/16/2023]

Affiliation(s)

Bernardo J Clavijo Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Luca Venturini Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Christian Schudoma Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Gonzalo Garcia Accinelli Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Gemy Kaithakottil Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Jonathan Wright Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Philippa Borrill John Innes Centre, Norwich, NR4 7UH, United Kingdom
George Kettleborough Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Darren Heavens Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Helen Chapman Earlham Institute, Norwich, NR4 7UZ, United Kingdom
James Lipscombe Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Tom Barker Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Fu-Hao Lu John Innes Centre, Norwich, NR4 7UH, United Kingdom
Neil McKenzie John Innes Centre, Norwich, NR4 7UH, United Kingdom
Dina Raats Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Ricardo H Ramirez-Gonzalez Earlham Institute, Norwich, NR4 7UZ, United Kingdom John Innes Centre, Norwich, NR4 7UH, United Kingdom
Aurore Coince Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Ned Peel Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Lawrence Percival-Alwyn Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Owen Duncan ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley Western Australia 6009, Australia
Josua Trösch ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley Western Australia 6009, Australia
Guotai Yu John Innes Centre, Norwich, NR4 7UH, United Kingdom
Dan M Bolser EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
Guy Namaati EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
Arnaud Kerhornou EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
Manuel Spannagl Plant Genome and Systems Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany
Heidrun Gundlach Plant Genome and Systems Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany
Georg Haberer Plant Genome and Systems Biology, Helmholtz Center Munich, 85764 Neuherberg, Germany
Robert P Davey Earlham Institute, Norwich, NR4 7UZ, United Kingdom University of East Anglia, Norwich, NR4 7TJ, United Kingdom
Christine Fosker Earlham Institute, Norwich, NR4 7UZ, United Kingdom
Federica Di Palma Earlham Institute, Norwich, NR4 7UZ, United Kingdom University of East Anglia, Norwich, NR4 7TJ, United Kingdom
Andrew L Phillips Rothamsted Research, Harpenden, AL5 2JQ, United Kingdom
A Harvey Millar ARC Centre of Excellence in Plant Energy Biology, The University of Western Australia, Crawley Western Australia 6009, Australia
Paul J Kersey EMBL European Bioinformatics Institute, Hinxton, CB10 1SD, United Kingdom
Cristobal Uauy John Innes Centre, Norwich, NR4 7UH, United Kingdom
Ksenia V Krasileva Earlham Institute, Norwich, NR4 7UZ, United Kingdom University of East Anglia, Norwich, NR4 7TJ, United Kingdom The Sainsbury Laboratory, Norwich, NR4 7UH, United Kingdom
David Swarbreck Earlham Institute, Norwich, NR4 7UZ, United Kingdom University of East Anglia, Norwich, NR4 7TJ, United Kingdom
Michael W Bevan John Innes Centre, Norwich, NR4 7UH, United Kingdom
Matthew D Clark Earlham Institute, Norwich, NR4 7UZ, United Kingdom University of East Anglia, Norwich, NR4 7TJ, United Kingdom

Jackman SD, Vandervalk BP, Mohamadi H, Chu J, Yeo S, Hammond SA, Jahesh G, Khan H, Coombe L, Warren RL, Birol I. ABySS 2.0: resource-efficient assembly of large genomes using a Bloom filter. Genome Res 2017;27:768-777. [PMID: 28232478 PMCID: PMC5411771 DOI: 10.1101/gr.214346.116] [Citation(s) in RCA: 358] [Impact Index Per Article: 51.1] [Received: 08/07/2016] [Accepted: 02/14/2017] [Indexed: 01/19/2023]

Weissensteiner MH, Pang AWC, Bunikis I, Höijer I, Vinnere-Petterson O, Suh A, Wolf JBW. Combination of short-read, long-read, and optical mapping assemblies reveals large-scale tandem repeat arrays with population genetic implications. Genome Res 2017;27:697-708. [PMID: 28360231 PMCID: PMC5411765 DOI: 10.1101/gr.215095.116] [Citation(s) in RCA: 67] [Impact Index Per Article: 9.6] [Received: 08/26/2016] [Accepted: 03/10/2017] [Indexed: 12/27/2022]

Dudchenko O, Batra SS, Omer AD, Nyquist SK, Hoeger M, Durand NC, Shamim MS, Machol I, Lander ES, Aiden AP, Aiden EL. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 2017;356:92-95. [PMID: 28336562 PMCID: PMC5635820 DOI: 10.1126/science.aal3327] [Citation(s) in RCA: 1131] [Impact Index Per Article: 161.6] [Received: 11/04/2016] [Accepted: 03/13/2017] [Indexed: 01/04/2023]

Affiliation(s)

Olga Dudchenko: The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA; Center for Theoretical and Biological Physics, Rice University, Houston, TX 77030, USA
Sanjit S Batra: The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
Arina D Omer: The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
Sarah K Nyquist: The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA; Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
Marie Hoeger: The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA; Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
Neva C Durand: The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
Muhammad S Shamim: The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
Ido Machol: The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA
Eric S Lander: Broad Institute of Harvard and Massachusetts Institute of Technology (MIT), Cambridge, MA 02139, USA; Department of Biology, MIT, Cambridge, MA 02139, USA; Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA
Aviva Presser Aiden: The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Department of Bioengineering, Rice University, Houston, TX 77030, USA; Department of Pediatrics, Texas Children's Hospital, Houston, TX 77030, USA
Erez Lieberman Aiden: The Center for Genome Architecture, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Departments of Computer Science and Computational and Applied Mathematics, Rice University, Houston, TX 77030, USA; Center for Theoretical and Biological Physics, Rice University, Houston, TX 77030, USA; Broad Institute of Harvard and Massachusetts Institute of Technology (MIT), Cambridge, MA 02139, USA

Guan R, Zhao Y, Zhang H, Fan G, Liu X, Zhou W, Shi C, Wang J, Liu W, Liang X, Fu Y, Ma K, Zhao L, Zhang F, Lu Z, Lee SMY, Xu X, Wang J, Yang H, Fu C, Ge S, Chen W. Draft genome of the living fossil Ginkgo biloba. Gigascience 2016. [PMID: 27871309 DOI: 10.1186/s13742-016-0154-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 05/08/2023] Open

Affiliation(s)

Rui Guan: BGI-Shenzhen, Shenzhen, 518083, China; BGI-Qingdao, Qingdao, 266555, China; State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, 210096, China
Yunpeng Zhao: The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China; Laboratory of Systematic & Evolutionary Botany and Biodiversity, Institute of Ecology and Conservation Center for Gene Resources of Endangered Wildlife, Zhejiang University, Hangzhou, 310058, China
He Zhang: BGI-Shenzhen, Shenzhen, 518083, China; BGI-Qingdao, Qingdao, 266555, China; Stanley Ho Centre for Emerging Infectious Diseases, Faculty of Medicine, The Chinese University of Hong Kong, Shatin, Hong Kong
Guangyi Fan: BGI-Shenzhen, Shenzhen, 518083, China; BGI-Qingdao, Qingdao, 266555, China; State Key Laboratory of Quality Research in Chinese Medicine and Institute of Chinese Medical Sciences, Macao, China
Xin Liu: BGI-Shenzhen, Shenzhen, 518083, China
Wenbin Zhou: The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China; Laboratory of Systematic & Evolutionary Botany and Biodiversity, Institute of Ecology and Conservation Center for Gene Resources of Endangered Wildlife, Zhejiang University, Hangzhou, 310058, China
Chengcheng Shi: BGI-Shenzhen, Shenzhen, 518083, China
Jiahao Wang: BGI-Shenzhen, Shenzhen, 518083, China
Weiqing Liu: BGI-Wuhan, BGI-Shenzhen, Wuhan, 430074, China
Xinming Liang: BGI-Shenzhen, Shenzhen, 518083, China
Yuanyuan Fu: BGI-Shenzhen, Shenzhen, 518083, China; State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, 210096, China
Kailong Ma: BGI-Shenzhen, Shenzhen, 518083, China
Lijun Zhao: The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China; Laboratory of Systematic & Evolutionary Botany and Biodiversity, Institute of Ecology and Conservation Center for Gene Resources of Endangered Wildlife, Zhejiang University, Hangzhou, 310058, China
Fumin Zhang: State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
Zuhong Lu: State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, 210096, China
Simon Ming-Yuen Lee: State Key Laboratory of Quality Research in Chinese Medicine and Institute of Chinese Medical Sciences, Macao, China
Xun Xu: BGI-Shenzhen, Shenzhen, 518083, China
Jian Wang: BGI-Shenzhen, Shenzhen, 518083, China; James D. Watson Institute of Genome Sciences, Hangzhou, 310058, China
Huanming Yang: BGI-Shenzhen, Shenzhen, 518083, China; James D. Watson Institute of Genome Sciences, Hangzhou, 310058, China
Chengxin Fu: The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China; Laboratory of Systematic & Evolutionary Botany and Biodiversity, Institute of Ecology and Conservation Center for Gene Resources of Endangered Wildlife, Zhejiang University, Hangzhou, 310058, China
Song Ge: State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
Wenbin Chen: BGI-Shenzhen, Shenzhen, 518083, China; BGI-Qingdao, Qingdao, 266555, China

Guan R, Zhao Y, Zhang H, Fan G, Liu X, Zhou W, Shi C, Wang J, Liu W, Liang X, Fu Y, Ma K, Zhao L, Zhang F, Lu Z, Lee SMY, Xu X, Wang J, Yang H, Fu C, Ge S, Chen W. Draft genome of the living fossil Ginkgo biloba. Gigascience 2016;5:49. [PMID: 27871309 PMCID: PMC5118899 DOI: 10.1186/s13742-016-0154-1] [Citation(s) in RCA: 153] [Impact Index Per Article: 19.1] [Received: 08/17/2016] [Accepted: 11/01/2016] [Indexed: 11/10/2022] Open

Chawla V, Kumar R, Shankar R. Identifying wrong assemblies in de novo short read primary sequence assembly contigs. J Biosci 2016;41:455-74. [PMID: 27581937 DOI: 10.1007/s12038-016-9630-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/26/2022]

Tamazian G, Dobrynin P, Krasheninnikova K, Komissarov A, Koepfli KP, O’Brien SJ. Chromosomer: a reference-based genome arrangement tool for producing draft chromosome sequences. Gigascience 2016;5:38. [PMID: 27549770 PMCID: PMC4994284 DOI: 10.1186/s13742-016-0141-6] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Received: 12/08/2015] [Accepted: 07/31/2016] [Indexed: 11/17/2022] Open

Cápal P, Blavet N, Vrána J, Kubaláková M, Doležel J. Multiple displacement amplification of the DNA from single flow-sorted plant chromosome. Plant J 2015;84:838-844. [PMID: 26400218 DOI: 10.1111/tpj.13035] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Received: 08/11/2015] [Revised: 09/13/2015] [Accepted: 09/17/2015] [Indexed: 06/05/2023]

Galardini M, Mengoni A, Bazzicalupo M. Mapping contigs using CONTIGuator. Methods Mol Biol 2015;1231:163-76. [PMID: 25343865 DOI: 10.1007/978-1-4939-1720-4_11] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Indexed: 11/26/2022]

Daly GM, Leggett RM, Rowe W, Stubbs S, Wilkinson M, Ramirez-Gonzalez RH, Caccamo M, Bernal W, Heeney JL. Host Subtraction, Filtering and Assembly Validations for Novel Viral Discovery Using Next Generation Sequencing Data. PLoS One 2015;10:e0129059. [PMID: 26098299 PMCID: PMC4476701 DOI: 10.1371/journal.pone.0129059] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Received: 09/10/2014] [Accepted: 05/04/2015] [Indexed: 12/18/2022] Open

Akpinar BA, Magni F, Yuce M, Lucas SJ, Šimková H, Šafář J, Vautrin S, Bergès H, Cattonaro F, Doležel J, Budak H. The physical map of wheat chromosome 5DS revealed gene duplications and small rearrangements. BMC Genomics 2015;16:453. [PMID: 26070810 PMCID: PMC4465308 DOI: 10.1186/s12864-015-1641-y] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Received: 10/20/2014] [Accepted: 05/19/2015] [Indexed: 11/24/2022] Open

Song G, Dickins BJA, Demeter J, Engel S, Dunn B, Cherry JM. AGAPE (Automated Genome Analysis PipelinE) for pan-genome analysis of Saccharomyces cerevisiae. PLoS One 2015;10:e0120671. [PMID: 25781462 PMCID: PMC4363492 DOI: 10.1371/journal.pone.0120671] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Received: 10/20/2014] [Accepted: 01/25/2015] [Indexed: 11/24/2022] Open

Abstract

The characterization and public release of genome sequences from thousands of organisms is expanding the scope for genetic variation studies. However, understanding the phenotypic consequences of genetic variation remains a challenge in eukaryotes due to the complexity of the genotype-phenotype map. One approach to this is the intensive study of model systems for which diverse sources of information can be accumulated and integrated. Saccharomyces cerevisiae is an extensively studied model organism, with well-known protein functions and thoroughly curated phenotype data. To develop and expand the available resources linking genomic variation with function in yeast, we aim to model the pan-genome of S. cerevisiae. To initiate the yeast pan-genome, we newly sequenced or re-sequenced the genomes of 25 strains that are commonly used in the yeast research community using advanced sequencing technology at high quality. We also developed a pipeline for automated pan-genome analysis, which integrates the steps of assembly, annotation, and variation calling. To assign strain-specific functional annotations, we identified genes that were not present in the reference genome. We classified these according to their presence or absence across strains and characterized each group of genes with known functional and phenotypic features. The functional roles of novel genes not found in the reference genome and associated with strains or groups of strains appear to be consistent with anticipated adaptations in specific lineages. As more S. cerevisiae strain genomes are released, our analysis can be used to collate genome data and relate it to lineage-specific patterns of genome evolution. Our new tool set will enhance our understanding of genomic and functional evolution in S. cerevisiae, and will be available to the yeast genetics and molecular biology community.

Guo X, Yu N, Ding X, Wang J, Pan Y. DIME: a novel framework for de novo metagenomic sequence assembly. J Comput Biol 2015;22:159-77. [PMID: 25684202 PMCID: PMC4326031 DOI: 10.1089/cmb.2014.0251] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Indexed: 12/16/2022] Open

Hu Z, Cheng L, Wang H. The Illumina-solexa sequencing protocol for bacterial genomes. Methods Mol Biol 2015;1231:91-97. [PMID: 25343860 DOI: 10.1007/978-1-4939-1720-4_6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 06/04/2023]

Saski CA, Feltus FA, Parida L, Haiminen N. BAC sequencing using pooled methods. Methods Mol Biol 2015;1227:55-67. [PMID: 25239741 DOI: 10.1007/978-1-4939-1652-8_3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Indexed: 12/24/2022]

Orlandini V, Fondi M, Fani R. Methods for assembling reads and producing contigs. Methods Mol Biol 2015;1231:151-161. [PMID: 25343864 DOI: 10.1007/978-1-4939-1720-4_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/04/2023]

Burton JN, Adey A, Patwardhan RP, Qiu R, Kitzman JO, Shendure J. Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Nat Biotechnol 2013;31:1119-1125. [PMID: 24185095 DOI: 10.1038/nbt.2727] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 06/25/2013] [Accepted: 10/02/2013] [Indexed: 05/19/2023]

Burton JN, Adey A, Patwardhan RP, Qiu R, Kitzman JO, Shendure J. Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Nat Biotechnol 2013;31:1119-25. [PMID: 24185095 PMCID: PMC4117202 DOI: 10.1038/nbt.2727] [Citation(s) in RCA: 854] [Impact Index Per Article: 77.6] [Received: 06/25/2013] [Accepted: 10/02/2013] [Indexed: 12/21/2022]

Jiang Y, Ninwichian P, Liu S, Zhang J, Kucuktas H, Sun F, Kaltenboeck L, Sun L, Bao L, Liu Z. Generation of physical map contig-specific sequences useful for whole genome sequence scaffolding. PLoS One 2013;8:e78872. [PMID: 24205335 PMCID: PMC3811975 DOI: 10.1371/journal.pone.0078872] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 03/25/2013] [Accepted: 09/16/2013] [Indexed: 11/29/2022] Open

Affiliation(s)

Yanliang Jiang: The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
Parichart Ninwichian: The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
Shikai Liu: The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
Jiaren Zhang: The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
Huseyin Kucuktas: The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
Fanyue Sun: The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
Ludmilla Kaltenboeck: The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
Luyang Sun: The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
Lisui Bao: The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
Zhanjiang Liu: The Fish Molecular Genetics and Biotechnology Laboratory, Aquatic Genomics Unit, School of Fisheries, Aquaculture and Aquatic Sciences and Program of Cell and Molecular Biosciences, Auburn University, Auburn, Alabama, United States of America
