Reference Citation Analysis

For: Ou S, Liu J, Chougule KM, Fungtammasan A, Seetharam AS, Stein JC, Llaca V, Manchanda N, Gilbert AM, Wei S, Chin CS, Hufnagel DE, Pedersen S, Snodgrass SJ, Fengler K, Woodhouse M, Walenz BP, Koren S, Phillippy AM, Hannigan BT, Dawe RK, Hirsch CN, Hufford MB, Ware D. Effect of sequence depth and length in long-read assembly of the maize inbred NC358. Nat Commun 2020;11:2288. [PMID: 32385271 PMCID: PMC7211024 DOI: 10.1038/s41467-020-16037-7] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Received: 11/23/2019] [Accepted: 04/09/2020] [Indexed: 01/23/2023] [Open Access]

Cited by Other Article(s)

Joe S, Park JL, Kim J, Kim S, Park JH, Yeo MK, Lee D, Yang JO, Kim SY. Comparison of structural variant callers for massive whole-genome sequence data. BMC Genomics 2024;25:318. [PMID: 38549092 PMCID: PMC10976732 DOI: 10.1186/s12864-024-10239-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/11/2023] [Accepted: 03/18/2024] [Indexed: 04/01/2024] [Open Access]

Le MH, Morgan B, Lu MY, Moctezuma V, Burgos O, Huang JP. The genomes of Hercules beetles reveal putative adaptive loci and distinct demographic histories in pristine North American forests. Mol Ecol Resour 2024;24:e13908. [PMID: 38063363 DOI: 10.1111/1755-0998.13908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/24/2022] [Revised: 01/14/2023] [Accepted: 11/20/2023] [Indexed: 01/12/2024]

Bringloe TT, Parent GJ. Contrasting new and available reference genomes to highlight uncertainties in assemblies and areas for future improvement: an example with monodontid species. BMC Genomics 2023;24:693. [PMID: 37985969 PMCID: PMC10659057 DOI: 10.1186/s12864-023-09779-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/07/2023] [Accepted: 10/31/2023] [Indexed: 11/22/2023] [Open Access]

Abstract

BACKGROUND

Reference genomes provide a foundational framework for evolutionary investigations, ecological analysis, and conservation science, yet uncertainties in the assembly of reference genomes are difficult to assess, and by extension rarely quantified. Reference genomes for monodontid cetaceans span a wide spectrum of data types and analytical approaches, providing the context to derive broader insights related to discrepancies and regions of uncertainty in reference genome assembly. We generated three beluga (Delphinapterus leucas) and one narwhal (Monodon monoceros) reference genomes and contrasted these with published chromosomal scale assemblies for each species to quantify discrepancies associated with genome assemblies.

RESULTS

The new reference genomes achieved chromosomal scale assembly using a combination of PacBio long reads, Illumina short reads, and Hi-C scaffolding data. For beluga, we identified discrepancies in the order and orientation of contigs in 2.2-3.7% of the total genome depending on the pairwise comparison of references. In addition, unsupported higher order scaffolding was identified in published reference genomes. In contrast, we estimated 8.2% of the compared narwhal genomes featured discrepancies, with inversions being notably abundant (5.3%). Discrepancies were linked to repetitive elements in both species.

CONCLUSIONS

We provide several new reference genomes for beluga (Delphinapterus leucas), while highlighting potential avenues for improvements. In particular, additional layers of data providing information on ultra-long genomic distances are needed to resolve persistent errors in reference genome construction. The comparative analyses of monodontid reference genomes suggested that the three new reference genomes for beluga are more accurate compared to the currently published reference genome, but that the new narwhal genome is less accurate than one published. We also present a conceptual summary for improving the accuracy of reference genomes with relevance to end-user needs and how they relate to levels of assembly quality and uncertainty.

Schelkunov MI. Mabs, a suite of tools for gene-informed genome assembly. BMC Bioinformatics 2023;24:377. [PMID: 37794322 PMCID: PMC10548655 DOI: 10.1186/s12859-023-05499-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 04/04/2023] [Accepted: 09/26/2023] [Indexed: 10/06/2023] [Open Access]

Ramu P, Srivastava RK, Sanyal A, Fengler K, Cao J, Zhang Y, Nimkar M, Gerke J, Shreedharan S, Llaca V, May G, Peterson-Burch B, Lin H, King M, Das S, Bhupesh V, Mandaokar A, Maruthachalam K, Krishnamurthy P, Gandhi H, Rathore A, Gupta R, Chitikineni A, Bajaj P, Gupta SK, Satyavathi CT, Pandravada A, Varshney RK, Babu R. Improved pearl millet genomes representing the global heterotic pool offer a framework for molecular breeding applications. Commun Biol 2023;6:902. [PMID: 37667032 PMCID: PMC10477261 DOI: 10.1038/s42003-023-05258-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/17/2023] [Accepted: 08/18/2023] [Indexed: 09/06/2023] [Open Access]

Affiliation(s)

Punna Ramu Corteva Agriscience, Hyderabad, Telangana, India
Rakesh K Srivastava International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, Telangana, India
Abhijit Sanyal Corteva Agriscience, Hyderabad, Telangana, India
Kevin Fengler Corteva Agriscience, Johnston, IA, 50131, USA
Jun Cao Corteva Agriscience, Johnston, IA, 50131, USA
Yun Zhang Corteva Agriscience, Johnston, IA, 50131, USA
Mitali Nimkar Corteva Agriscience, Hyderabad, Telangana, India
Justin Gerke Corteva Agriscience, Johnston, IA, 50131, USA
Sriram Shreedharan Corteva Agriscience, Johnston, IA, 50131, USA
Victor Llaca Corteva Agriscience, Johnston, IA, 50131, USA
Gregory May Corteva Agriscience, Johnston, IA, 50131, USA
Brooke Peterson-Burch Corteva Agriscience, Johnston, IA, 50131, USA
Haining Lin Corteva Agriscience, Johnston, IA, 50131, USA; Moderna, 200 Technology Square, Cambridge, MA, 02139, USA
Matthew King Corteva Agriscience, Johnston, IA, 50131, USA; Natera Inc, San Carlos, CA, 94070, USA
Sayan Das Corteva Agriscience, Hyderabad, Telangana, India
Vaid Bhupesh Corteva Agriscience, Hyderabad, Telangana, India
Ajin Mandaokar Corteva Agriscience, Hyderabad, Telangana, India
Karunakaran Maruthachalam Corteva Agriscience, Hyderabad, Telangana, India
Pobbathi Krishnamurthy Corteva Agriscience, Hyderabad, Telangana, India
Harish Gandhi International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, Telangana, India; International Maize and Wheat Improvement Center (CIMMYT), Nairobi, Kenya
Abhishek Rathore International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, Telangana, India; International Maize and Wheat Improvement Center (CIMMYT), Hyderabad, India
Rajeev Gupta International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, Telangana, India; Cereal Crops Research Unit, Edward T. Schafer Agricultural Research Center, USDA-ARS, Fargo, ND, 58102, USA
Annapurna Chitikineni International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, Telangana, India; Centre for Crop & Food Innovation, State Agricultural Biotechnology Centre, Food Futures Institute, Murdoch University, Murdoch, WA, 6150, Australia
Prasad Bajaj International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, Telangana, India
S K Gupta International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, Telangana, India
C Tara Satyavathi Indian Council of Agricultural Research - All India Coordinated Research Project on Pearl Millet, Jodhpur, India
Anand Pandravada Corteva Agriscience, Hyderabad, Telangana, India
Rajeev K Varshney International Crops Research Institute for the Semi-Arid Tropics, Hyderabad, Telangana, India; Centre for Crop & Food Innovation, State Agricultural Biotechnology Centre, Food Futures Institute, Murdoch University, Murdoch, WA, 6150, Australia
Raman Babu Corteva Agriscience, Hyderabad, Telangana, India

Lehle JD, McCarrey JR. Accelerating the alignment processing speed of the comprehensive end-to-end whole-genome bisulfite sequencing pipeline, wg-blimp. Biol Methods Protoc 2023;8:bpad012. [PMID: 37431446 PMCID: PMC10329742 DOI: 10.1093/biomethods/bpad012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/18/2023] [Revised: 06/12/2023] [Accepted: 06/12/2023] [Indexed: 07/12/2023] [Open Access]

Mokhtar MM, Abd-Elhalim HM, El Allali A. A large-scale assessment of the quality of plant genome assemblies using the LTR assembly index. AoB Plants 2023;15:plad015. [PMID: 37197714 PMCID: PMC10184434 DOI: 10.1093/aobpla/plad015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/05/2022] [Accepted: 04/01/2023] [Indexed: 05/19/2023]

Olson ND, Wagner J, Dwarshuis N, Miga KH, Sedlazeck FJ, Salit M, Zook JM. Variant calling and benchmarking in an era of complete human genome sequences. Nat Rev Genet 2023:10.1038/s41576-023-00590-0. [PMID: 37059810 DOI: 10.1038/s41576-023-00590-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 27.0] [Accepted: 02/22/2023] [Indexed: 04/16/2023]

Hotaling S, Wilcox ER, Heckenhauer J, Stewart RJ, Frandsen PB. Highly accurate long reads are crucial for realizing the potential of biodiversity genomics. BMC Genomics 2023;24:117. [PMID: 36927511 PMCID: PMC10018877 DOI: 10.1186/s12864-023-09193-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 09/09/2022] [Accepted: 02/17/2023] [Indexed: 03/18/2023] [Open Access]

Abstract

BACKGROUND

Generating the most contiguous, accurate genome assemblies given available sequencing technologies is a long-standing challenge in genome science. With the rise of long-read sequencing, assembly challenges have shifted from merely increasing contiguity to correctly assembling complex, repetitive regions of interest, ideally in a phased manner. At present, researchers largely choose between two types of long read data: longer, but less accurate sequences, or highly accurate, but shorter reads (i.e., >Q20 or 99% accurate). To better understand how these types of long-read data as well as scale of data (i.e., mean length and sequencing depth) influence genome assembly outcomes, we compared genome assemblies for a caddisfly, Hesperophylax magnus, generated with longer, but less accurate, Oxford Nanopore (ONT) R9.4.1 and highly accurate PacBio HiFi (HiFi) data. Next, we expanded this comparison to consider the influence of highly accurate long-read sequence data on genome assemblies across 6750 plant and animal genomes. For this broader comparison, we used HiFi data as a surrogate for highly accurate long-reads broadly as we could identify when they were used from GenBank metadata.

RESULTS

HiFi reads outperformed ONT reads in all assembly metrics tested for the caddisfly data set and allowed for accurate assembly of the repetitive ~ 20 Kb H-fibroin gene. Across plants and animals, genome assemblies that incorporated HiFi reads were also more contiguous. For plants, the average HiFi assembly was 501% more contiguous (mean contig N50 = 20.5 Mb) than those generated with any other long-read data (mean contig N50 = 4.1 Mb). For animals, HiFi assemblies were 226% more contiguous (mean contig N50 = 20.9 Mb) versus other long-read assemblies (mean contig N50 = 9.3 Mb). In plants, we also found limited evidence that HiFi may offer a unique solution for overcoming genomic complexity that scales with assembly size.

CONCLUSIONS

Highly accurate long-reads generated with HiFi or analogous technologies represent a key tool for maximizing genome assembly quality for a wide swath of plants and animals. This finding is particularly important when resources only allow for one type of sequencing data to be generated. Ultimately, to realize the promise of biodiversity genomics, we call for greater uptake of highly accurate long-reads in future studies.

Shi J, Tian Z, Lai J, Huang X. Plant pan-genomics and its applications. Mol Plant 2023;16:168-186. [PMID: 36523157 DOI: 10.1016/j.molp.2022.12.009] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Received: 09/22/2022] [Revised: 12/07/2022] [Accepted: 12/12/2022] [Indexed: 06/17/2023]

Rabanal FA, Gräff M, Lanz C, Fritschi K, Llaca V, Lang M, Carbonell-Bejerano P, Henderson I, Weigel D. Pushing the limits of HiFi assemblies reveals centromere diversity between two Arabidopsis thaliana genomes. Nucleic Acids Res 2022;50:12309-12327. [PMID: 36453992 PMCID: PMC9757041 DOI: 10.1093/nar/gkac1115] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Received: 02/18/2022] [Revised: 09/13/2022] [Accepted: 11/10/2022] [Indexed: 12/05/2022] [Open Access]

Steenwyk JL, Buida TJ III, Gonçalves C, Goltz DC, Morales G, Mead ME, LaBella AL, Chavez CM, Schmitz JE, Hadjifrangiskou M, Li Y, Rokas A. BioKIT: a versatile toolkit for processing and analyzing diverse types of sequence data. Genetics 2022;221:6583183. [PMID: 35536198 PMCID: PMC9252278 DOI: 10.1093/genetics/iyac079] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 02/21/2022] [Accepted: 05/03/2022] [Indexed: 11/14/2022] [Open Access]

Affiliation(s)

Jacob L Steenwyk Department of Biological Sciences, Vanderbilt University, VU Station B #35-1634, Nashville, TN 37235, USA; Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA
Thomas J Buida III 9 City Place #312, Nashville, TN 37209, USA
Carla Gonçalves Department of Biological Sciences, Vanderbilt University, VU Station B #35-1634, Nashville, TN 37235, USA; Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA; Associate Laboratory i4HB-Institute for Health and Bioeconomy, NOVA School of Science and Technology, NOVA University Lisbon, 2819-516 Caparica, Portugal; UCIBIO-Applied Molecular Biosciences Unit, Department of Life Sciences, NOVA School of Science and Technology, NOVA University Lisbon, 2819-516 Caparica, Portugal
Dayna C Goltz 2312 Elliston Place #510, Nashville, TN 37203, USA
Grace Morales Department of Pathology, Microbiology & Immunology, Center for Personalized Microbiology, Vanderbilt University Medical Center, Nashville, TN 37232, USA
Matthew E Mead Department of Biological Sciences, Vanderbilt University, VU Station B #35-1634, Nashville, TN 37235, USA; Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA
Abigail L LaBella Department of Biological Sciences, Vanderbilt University, VU Station B #35-1634, Nashville, TN 37235, USA; Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA
Christina M Chavez Department of Biological Sciences, Vanderbilt University, VU Station B #35-1634, Nashville, TN 37235, USA; Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA
Jonathan E Schmitz Department of Pathology, Microbiology & Immunology, Center for Personalized Microbiology, Vanderbilt University Medical Center, Nashville, TN 37232, USA
Maria Hadjifrangiskou Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA; Department of Pathology, Microbiology & Immunology, Center for Personalized Microbiology, Vanderbilt University Medical Center, Nashville, TN 37232, USA
Yuanning Li Department of Biological Sciences, Vanderbilt University, VU Station B #35-1634, Nashville, TN 37235, USA
Antonis Rokas Department of Biological Sciences, Vanderbilt University, VU Station B #35-1634, Nashville, TN 37235, USA; Evolutionary Studies Initiative, Vanderbilt University, Nashville, TN 37235, USA

Zhang H, Li R, Guo Y, Zhang Y, Zhang D, Yang L. LIFE-Seq: a universal Large Integrated DNA Fragment Enrichment Sequencing strategy for deciphering the transgene integration of genetically modified organisms. Plant Biotechnol J 2022;20:964-976. [PMID: 34990051 PMCID: PMC9055813 DOI: 10.1111/pbi.13776] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 10/18/2021] [Revised: 12/18/2021] [Accepted: 12/30/2021] [Indexed: 06/14/2023]

Vargas-Chavez C, Longo Pendy NM, Nsango SE, Aguilera L, Ayala D, González J. Transposable element variants and their potential adaptive impact in urban populations of the malaria vector Anopheles coluzzii. Genome Res 2021;32:189-202. [PMID: 34965939 PMCID: PMC8744685 DOI: 10.1101/gr.275761.121] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 05/12/2021] [Accepted: 11/24/2021] [Indexed: 11/28/2022]

Wang W, Chen L, Fengler K, Bolar J, Llaca V, Wang X, Clark CB, Fleury TJ, Myrvold J, Oneal D, van Dyk MM, Hudson A, Munkvold J, Baumgarten A, Thompson J, Cai G, Crasta O, Aggarwal R, Ma J. A giant NLR gene confers broad-spectrum resistance to Phytophthora sojae in soybean. Nat Commun 2021;12:6263. [PMID: 34741017 PMCID: PMC8571336 DOI: 10.1038/s41467-021-26554-8] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Received: 07/01/2021] [Accepted: 10/06/2021] [Indexed: 11/29/2022] [Open Access]

Affiliation(s)

Weidong Wang Department of Agronomy, Purdue University, West Lafayette, IN, 47907, USA
Liyang Chen Department of Agronomy, Purdue University, West Lafayette, IN, 47907, USA
Kevin Fengler Research and Development, Corteva Agriscience™, Johnston, IA, 50131, USA
Joy Bolar Research and Development, Corteva Agriscience™, Johnston, IA, 50131, USA
Victor Llaca Research and Development, Corteva Agriscience™, Johnston, IA, 50131, USA
Xutong Wang Department of Agronomy, Purdue University, West Lafayette, IN, 47907, USA
Chancelor B Clark Department of Agronomy, Purdue University, West Lafayette, IN, 47907, USA
Tomara J Fleury Department of Botany and Plant Pathology, Purdue University, West Lafayette, IN, 47907, USA; Crop Production and Pest Control Research Unit, USDA, ARS, West Lafayette, IN, 47907, USA
Jon Myrvold Research and Development, Corteva Agriscience™, Johnston, IA, 50131, USA
David Oneal Research and Development, Corteva Agriscience™, Johnston, IA, 50131, USA
Maria Magdalena van Dyk Research and Development, Corteva Agriscience™, Johnston, IA, 50131, USA
Ashley Hudson Research and Development, Corteva Agriscience™, Johnston, IA, 50131, USA
Jesse Munkvold Research and Development, Corteva Agriscience™, Johnston, IA, 50131, USA
Andy Baumgarten Research and Development, Corteva Agriscience™, Johnston, IA, 50131, USA
Jeff Thompson Research and Development, Corteva Agriscience™, Johnston, IA, 50131, USA
Guohong Cai Department of Botany and Plant Pathology, Purdue University, West Lafayette, IN, 47907, USA; Crop Production and Pest Control Research Unit, USDA, ARS, West Lafayette, IN, 47907, USA
Oswald Crasta Research and Development, Corteva Agriscience™, Johnston, IA, 50131, USA; R&D, Equinom, Inc., Indianapolis, IN, 46268, USA
Rajat Aggarwal Research and Development, Corteva Agriscience™, Johnston, IA, 50131, USA
Jianxin Ma Department of Agronomy, Purdue University, West Lafayette, IN, 47907, USA

Bornowski N, Michel KJ, Hamilton JP, Ou S, Seetharam AS, Jenkins J, Grimwood J, Plott C, Shu S, Talag J, Kennedy M, Hundley H, Singan VR, Barry K, Daum C, Yoshinaga Y, Schmutz J, Hirsch CN, Hufford MB, de Leon N, Kaeppler SM, Buell CR. Genomic variation within the maize stiff-stalk heterotic germplasm pool. Plant Genome 2021;14:e20114. [PMID: 34275202 DOI: 10.1002/tpg2.20114] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 03/14/2021] [Accepted: 05/06/2021] [Indexed: 05/28/2023]

Affiliation(s)

Nolan Bornowski Dep. of Plant Biology, Michigan State Univ., 612 Wilson Road, East Lansing, MI, 48824, USA
Kathryn J Michel Dep. of Agronomy, Univ. of Wisconsin - Madison, 1575 Linden Drive, Madison, WI, 53706, USA
John P Hamilton Dep. of Plant Biology, Michigan State Univ., 612 Wilson Road, East Lansing, MI, 48824, USA
Shujun Ou Dep. of Ecology, Evolution, and Organismal Biology, Iowa State Univ., 2200 Osborn Drive, Ames, IA, 50011, USA
Arun S Seetharam Dep. of Ecology, Evolution, and Organismal Biology, Iowa State Univ., 2200 Osborn Drive, Ames, IA, 50011, USA
Jerry Jenkins HudsonAlpha Institute for Biotechnology, 601 Genome Way Northwest, Huntsville, AL, 35806, USA
Jane Grimwood HudsonAlpha Institute for Biotechnology, 601 Genome Way Northwest, Huntsville, AL, 35806, USA
Chris Plott HudsonAlpha Institute for Biotechnology, 601 Genome Way Northwest, Huntsville, AL, 35806, USA
Shengqiang Shu U.S. Dep. of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA
Jayson Talag Arizona Genomics Institute, School of Plant Sciences, Univ. of Arizona, 1657 E Helen Street, Tucson, AZ, 85721, USA
Megan Kennedy U.S. Dep. of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA
Hope Hundley U.S. Dep. of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA
Vasanth R Singan U.S. Dep. of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA
Kerrie Barry U.S. Dep. of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA
Chris Daum U.S. Dep. of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA
Yuko Yoshinaga U.S. Dep. of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA
Jeremy Schmutz HudsonAlpha Institute for Biotechnology, 601 Genome Way Northwest, Huntsville, AL, 35806, USA; U.S. Dep. of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA
Candice N Hirsch Dep. of Agronomy and Plant Genetics, Univ. of Minnesota, 1991 Upper Buford Circle, Saint Paul, MN, 55108, USA
Matthew B Hufford Dep. of Ecology, Evolution, and Organismal Biology, Iowa State Univ., 2200 Osborn Drive, Ames, IA, 50011, USA
Natalia de Leon Dep. of Agronomy, Univ. of Wisconsin - Madison, 1575 Linden Drive, Madison, WI, 53706, USA; Dep. of Energy, Great Lakes Bioenergy Research Center, Univ. of Wisconsin - Madison, 1575 Linden Drive, Madison, WI, 53706, USA
Shawn M Kaeppler Dep. of Agronomy, Univ. of Wisconsin - Madison, 1575 Linden Drive, Madison, WI, 53706, USA; Dep. of Energy, Great Lakes Bioenergy Research Center, Univ. of Wisconsin - Madison, 1575 Linden Drive, Madison, WI, 53706, USA; Wisconsin Crop Innovation Center, Univ. of Wisconsin - Madison, 8520 University Green, Middleton, WI, 53562, USA
C Robin Buell Dep. of Plant Biology, Michigan State Univ., 612 Wilson Road, East Lansing, MI, 48824, USA; Dep. of Energy, Great Lakes Bioenergy Research Center, Michigan State Univ., 612 Wilson Road, East Lansing, MI, 48824, USA

Signal-based optical map alignment. PLoS One 2021;16:e0253102. [PMID: 34591846 PMCID: PMC8483326 DOI: 10.1371/journal.pone.0253102] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/26/2021] [Accepted: 09/15/2021] [Indexed: 11/19/2022] [Open Access]

LeafGo: Leaf to Genome, a quick workflow to produce high-quality de novo plant genomes using long-read sequencing technology. Genome Biol 2021;22:256. [PMID: 34479618 PMCID: PMC8414726 DOI: 10.1186/s13059-021-02475-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 02/22/2021] [Accepted: 08/20/2021] [Indexed: 02/06/2023] [Open Access]

Hufford MB, Seetharam AS, Woodhouse MR, Chougule KM, Ou S, Liu J, Ricci WA, Guo T, Olson A, Qiu Y, Della Coletta R, Tittes S, Hudson AI, Marand AP, Wei S, Lu Z, Wang B, Tello-Ruiz MK, Piri RD, Wang N, Kim DW, Zeng Y, O'Connor CH, Li X, Gilbert AM, Baggs E, Krasileva KV, Portwood JL, Cannon EKS, Andorf CM, Manchanda N, Snodgrass SJ, Hufnagel DE, Jiang Q, Pedersen S, Syring ML, Kudrna DA, Llaca V, Fengler K, Schmitz RJ, Ross-Ibarra J, Yu J, Gent JI, Hirsch CN, Ware D, Dawe RK. De novo assembly, annotation, and comparative analysis of 26 diverse maize genomes. Science 2021;373:655-662. [PMID: 34353948 PMCID: PMC8733867 DOI: 10.1126/science.abg5289] [Citation(s) in RCA: 233] [Impact Index Per Article: 77.7] [Received: 01/14/2021] [Accepted: 06/24/2021] [Indexed: 12/24/2022]

Affiliation(s)

Matthew B Hufford Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
Arun S Seetharam Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA; Genome Informatics Facility, Iowa State University, Ames, IA 50011, USA
Margaret R Woodhouse USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA 50011, USA
Kapeel M Chougule Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
Shujun Ou Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
Jianing Liu Department of Genetics, University of Georgia, Athens, GA 30602, USA
William A Ricci Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
Tingting Guo Department of Agronomy, Iowa State University, Ames, IA 50011, USA
Andrew Olson Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
Yinjie Qiu Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
Rafael Della Coletta Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
Silas Tittes Center for Population Biology, University of California, Davis, CA 95616, USA; Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
Asher I Hudson Center for Population Biology, University of California, Davis, CA 95616, USA; Department of Evolution and Ecology, University of California, Davis, CA 95616, USA
Alexandre P Marand Department of Genetics, University of Georgia, Athens, GA 30602, USA
Sharon Wei Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
Zhenyuan Lu Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
Bo Wang Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
Marcela K Tello-Ruiz Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
Rebecca D Piri Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
Na Wang Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
Dong Won Kim Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
Yibing Zeng Department of Genetics, University of Georgia, Athens, GA 30602, USA
Christine H O'Connor Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA; Department of Ecology, Evolution, and Behavior, University of Minnesota, St. Paul, MN 55108, USA
Xianran Li Department of Agronomy, Iowa State University, Ames, IA 50011, USA
Amanda M Gilbert Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
Erin Baggs Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA
Ksenia V Krasileva Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, USA
John L Portwood USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA 50011, USA
Ethalinda K S Cannon USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA 50011, USA
Carson M Andorf USDA-ARS Corn Insects and Crop Genetics Research Unit, Iowa State University, Ames, IA 50011, USA
Nancy Manchanda Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
Samantha J Snodgrass Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
David E Hufnagel Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA; Virus and Prion Research Unit, National Animal Disease Center, USDA-ARS, Ames, IA, 50010, USA
Qiuhan Jiang Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
Sarah Pedersen Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
Michael L Syring Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA
David A Kudrna Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
Victor Llaca Corteva Agriscience, Johnston, IA 50131, USA
Kevin Fengler Corteva Agriscience, Johnston, IA 50131, USA
Robert J Schmitz Department of Genetics, University of Georgia, Athens, GA 30602, USA
Jeffrey Ross-Ibarra Center for Population Biology, University of California, Davis, CA 95616, USA; Department of Evolution and Ecology, University of California, Davis, CA 95616, USA; Genome Center, University of California, Davis, CA 95616, USA
Jianming Yu Department of Agronomy, Iowa State University, Ames, IA 50011, USA
Jonathan I Gent Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
Candice N Hirsch Department of Agronomy and Plant Genetics, University of Minnesota, St. Paul, MN 55108, USA
Doreen Ware USDA-ARS NAA Robert W. Holley Center for Agriculture and Health, Agricultural Research Service, Ithaca, NY 14853, USA; Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
R Kelly Dawe Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50011, USA.


Wick RR, Judd LM, Wyres KL, Holt KE. Recovery of small plasmid sequences via Oxford Nanopore sequencing. Microb Genom 2021;7:000631. [PMID: 34431763 PMCID: PMC8549360 DOI: 10.1099/mgen.0.000631] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Received: 02/21/2021] [Accepted: 06/11/2021] [Indexed: 12/13/2022] Open

Baiakhmetov E, Guyomar C, Shelest E, Nobis M, Gudkova PD. The first draft genome of feather grasses using SMRT sequencing and its implications in molecular studies of Stipa. Sci Rep 2021;11:15345. [PMID: 34321531 PMCID: PMC8319324 DOI: 10.1038/s41598-021-94068-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 11/17/2020] [Accepted: 06/24/2021] [Indexed: 11/22/2022] Open

Sutton JM, Millwood JD, Case McCormack A, Fierst JL. Optimizing experimental design for genome sequencing and assembly with Oxford Nanopore Technologies. GigaByte 2021;2021:gigabyte27. [PMID: 36824342 PMCID: PMC9650304 DOI: 10.46471/gigabyte.27] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 02/17/2021] [Accepted: 07/05/2021] [Indexed: 11/09/2022] Open

Lin G, He C, Zheng J, Koo DH, Le H, Zheng H, Tamang TM, Lin J, Liu Y, Zhao M, Hao Y, McFraland F, Wang B, Qin Y, Tang H, McCarty DR, Wei H, Cho MJ, Park S, Kaeppler H, Kaeppler SM, Liu Y, Springer N, Schnable PS, Wang G, White FF, Liu S. Chromosome-level genome assembly of a regenerable maize inbred line A188. Genome Biol 2021;22:175. [PMID: 34108023 PMCID: PMC8188678 DOI: 10.1186/s13059-021-02396-x] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Received: 11/21/2020] [Accepted: 05/28/2021] [Indexed: 01/08/2023] Open

Affiliation(s)

Guifang Lin Department of Plant Pathology, Kansas State University, 4024 Throckmorton Center, Manhattan, KS, 66506-5502, USA
Cheng He Department of Plant Pathology, Kansas State University, 4024 Throckmorton Center, Manhattan, KS, 66506-5502, USA
Jun Zheng Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
Dal-Hoe Koo Department of Plant Pathology, Kansas State University, 4024 Throckmorton Center, Manhattan, KS, 66506-5502, USA
Ha Le Department of Plant Pathology, Kansas State University, 4024 Throckmorton Center, Manhattan, KS, 66506-5502, USA
Huakun Zheng Department of Plant Pathology, Kansas State University, 4024 Throckmorton Center, Manhattan, KS, 66506-5502, USA
Tej Man Tamang Department of Horticulture and Natural Resources, Kansas State University, Manhattan, KS, 66506-5502, USA
Jinguang Lin Department of Plant Pathology, Kansas State University, 4024 Throckmorton Center, Manhattan, KS, 66506-5502, USA; Present Address, Corvallis, OR, 97330, USA
Yan Liu Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
Mingxia Zhao Department of Plant Pathology, Kansas State University, 4024 Throckmorton Center, Manhattan, KS, 66506-5502, USA
Yangfan Hao Department of Plant Pathology, Kansas State University, 4024 Throckmorton Center, Manhattan, KS, 66506-5502, USA
Frank McFraland Department of Agronomy, University of Wisconsin-Madison, Madison, WI, 53706, USA
Bo Wang Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA
Yang Qin Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
Haibao Tang Center for Genomics and Biotechnology and Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, 350002, Fujian, China
Donald R McCarty Department of Horticulture, University of Florida, Gainesville, FL, 32611-0680, USA
Hairong Wei College of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, 49931, USA
Myeong-Je Cho Innovative Genomics Institute, University of California-Berkeley, Sunnyvale, CA, 94704, USA
Sunghun Park Department of Horticulture and Natural Resources, Kansas State University, Manhattan, KS, 66506-5502, USA
Heidi Kaeppler Department of Agronomy, University of Wisconsin-Madison, Madison, WI, 53706, USA
Shawn M Kaeppler Department of Agronomy, University of Wisconsin-Madison, Madison, WI, 53706, USA
Yunjun Liu Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
Nathan Springer Department of Plant Biology, University of Minnesota, Saint Paul, MN, 55108, USA
Patrick S Schnable Department of Agronomy, Iowa State University, Ames, IA, 50011-3605, USA
Guoying Wang Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
Frank F White Department of Plant Pathology, University of Florida, Gainesville, FL, 32611-0680, USA
Sanzhen Liu Department of Plant Pathology, Kansas State University, 4024 Throckmorton Center, Manhattan, KS, 66506-5502, USA.


Yang N, Yan J. New genomic approaches for enhancing maize genetic improvement. Curr Opin Plant Biol 2021;60:101977. [PMID: 33418269 DOI: 10.1016/j.pbi.2020.11.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 10/11/2020] [Revised: 11/07/2020] [Accepted: 11/16/2020] [Indexed: 05/13/2023]

Luo J, Wei Y, Lyu M, Wu Z, Liu X, Luo H, Yan C. A comprehensive review of scaffolding methods in genome assembly. Brief Bioinform 2021;22:6149347. [PMID: 33634311 DOI: 10.1093/bib/bbab033] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Received: 09/14/2020] [Revised: 01/21/2021] [Accepted: 01/22/2021] [Indexed: 12/20/2022] Open

Genome assembly and population genomic analysis provide insights into the evolution of modern sweet corn. Nat Commun 2021;12:1227. [PMID: 33623026 PMCID: PMC7902669 DOI: 10.1038/s41467-021-21380-4] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Received: 07/10/2020] [Accepted: 01/26/2021] [Indexed: 01/31/2023] Open

Schwartz C, Lenderts B, Feigenbutz L, Barone P, Llaca V, Fengler K, Svitashev S. CRISPR-Cas9-mediated 75.5-Mb inversion in maize. Nat Plants 2020;6:1427-1431. [PMID: 33299151 DOI: 10.1038/s41477-020-00817-6] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Received: 07/24/2020] [Accepted: 11/04/2020] [Indexed: 05/11/2023]

Hufnagel DE, Hufford MB, Seetharam AS. SequelTools: a suite of tools for working with PacBio Sequel raw sequence data. BMC Bioinformatics 2020;21:429. [PMID: 33004007 PMCID: PMC7532105 DOI: 10.1186/s12859-020-03751-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Received: 12/19/2019] [Accepted: 09/11/2020] [Indexed: 12/20/2022] Open

Pham GM, Hamilton JP, Wood JC, Burke JT, Zhao H, Vaillancourt B, Ou S, Jiang J, Buell CR. Construction of a chromosome-scale long-read reference genome assembly for potato. Gigascience 2020;9:giaa100. [PMID: 32964225 PMCID: PMC7509475 DOI: 10.1093/gigascience/giaa100] [Citation(s) in RCA: 132] [Impact Index Per Article: 33.0] [Received: 06/04/2020] [Revised: 08/26/2020] [Accepted: 09/05/2020] [Indexed: 01/19/2023] Open

Abstract

BACKGROUND

Worldwide, the cultivated potato, Solanum tuberosum L., is the No. 1 vegetable crop and a critical food security crop. The genome sequence of DM1-3 516 R44, a doubled monoploid clone of S. tuberosum Group Phureja, was published in 2011 using a whole-genome shotgun sequencing approach with short-read sequence data. Current advanced sequencing technologies now permit generation of near-complete, high-quality chromosome-scale genome assemblies at minimal cost.

FINDINGS

Here, we present an updated version of the DM1-3 516 R44 genome sequence (v6.1) using Oxford Nanopore Technologies long reads coupled with proximity-by-ligation scaffolding (Hi-C), yielding a chromosome-scale assembly. The new (v6.1) assembly represents 741.6 Mb of sequence (87.8%) of the estimated 844 Mb genome, of which 741.5 Mb is non-gapped with 731.2 Mb anchored to the 12 chromosomes. Use of Oxford Nanopore Technologies full-length complementary DNA sequencing enabled annotation of 32,917 high-confidence protein-coding genes encoding 44,851 gene models that had a significantly improved representation of conserved orthologs compared with the previous annotation. The new assembly has improved contiguity with a 595-fold increase in N50 contig size, 99% reduction in the number of contigs, a 44-fold increase in N50 scaffold size, and an LTR Assembly Index score of 13.56, placing it in the category of reference genome quality. The improved assembly also permitted annotation of the centromeres via alignment to sequencing reads derived from CENH3 nucleosomes.

CONCLUSIONS

Access to advanced sequencing technologies and improved software permitted generation of a high-quality, long-read, chromosome-scale assembly and improved annotation dataset for the reference genotype of potato that will facilitate research aimed at improving agronomic traits and understanding genome evolution.


Xu M, Guo L, Gu S, Wang O, Zhang R, Peters BA, Fan G, Liu X, Xu X, Deng L, Zhang Y. TGS-GapCloser: A fast and accurate gap closer for large genomes with low coverage of error-prone long reads. Gigascience 2020;9:giaa094. [PMID: 32893860 PMCID: PMC7476103 DOI: 10.1093/gigascience/giaa094] [Citation(s) in RCA: 139] [Impact Index Per Article: 34.8] [Received: 02/04/2020] [Revised: 05/15/2020] [Accepted: 08/14/2020] [Indexed: 12/16/2022] Open

Abstract

BACKGROUND

Analyses that use genome assemblies are critically affected by the contiguity, completeness, and accuracy of those assemblies. In recent years single-molecule sequencing techniques generating long-read information have become available and enabled substantial improvement in contig length and genome completeness, especially for large genomes (>100 Mb), although bioinformatic tools for these applications are still limited.

FINDINGS

We developed a software tool to close sequence gaps in genome assemblies, TGS-GapCloser, that uses low-depth (∼10×) long single-molecule reads. The algorithm extracts reads that bridge gap regions between 2 contigs within a scaffold, error corrects only the candidate reads, and assigns the best sequence data to each gap. As a demonstration, we used TGS-GapCloser to improve the scaftig NG50 value of 3 human genome assemblies by 24-fold on average with only ∼10× coverage of Oxford Nanopore or Pacific Biosciences reads, covering with sequence data up to 94.8% gaps with 97.7% positive predictive value. These improved assemblies achieve 99.998% (Q46) single-base accuracy with final inserted sequences having 99.97% (Q35) accuracy, despite the high raw error rate of single-molecule reads, enabling high-quality downstream analyses, including up to a 31-fold increase in the scaftig NGA50 and up to 13.1% more complete BUSCO genes. Additionally, we show that even in ultra-large genome assemblies, such as the ginkgo (∼12 Gb), TGS-GapCloser can cover 71.6% of gaps with sequence data.

CONCLUSIONS

TGS-GapCloser can close gaps in large genome assemblies using raw long reads quickly and cost-effectively. The final assemblies generated by TGS-GapCloser have improved contiguity and completeness while maintaining high accuracy. The software is available at https://github.com/BGI-Qingdao/TGS-GapCloser.


Affiliation(s)

Mengyang Xu BGI-Qingdao, BGI-Shenzhen, 2 Hengyunshan Road, West Coast New Area, Qingdao, 266426, China; State Key Laboratory of Agricultural Genomics, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China; BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
Lidong Guo BGI-Qingdao, BGI-Shenzhen, 2 Hengyunshan Road, West Coast New Area, Qingdao, 266426, China; BGI Education Center, University of Chinese Academy of Sciences, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
Shengqiang Gu BGI-Qingdao, BGI-Shenzhen, 2 Hengyunshan Road, West Coast New Area, Qingdao, 266426, China; BGI Education Center, University of Chinese Academy of Sciences, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
Ou Wang BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China; MGI, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
Rui Zhang BGI-Qingdao, BGI-Shenzhen, 2 Hengyunshan Road, West Coast New Area, Qingdao, 266426, China
Brock A Peters BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China; Complete Genomics Inc., 2904 Orchard Pkwy, San Jose, CA 95134, USA
Guangyi Fan BGI-Qingdao, BGI-Shenzhen, 2 Hengyunshan Road, West Coast New Area, Qingdao, 266426, China; BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
Xin Liu BGI-Qingdao, BGI-Shenzhen, 2 Hengyunshan Road, West Coast New Area, Qingdao, 266426, China; State Key Laboratory of Agricultural Genomics, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China; BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China; China National GeneBank, BGI-Shenzhen, Jinsha Road, Dapeng New District, Shenzhen, 518120, China
Xun Xu BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China; China National GeneBank, BGI-Shenzhen, Jinsha Road, Dapeng New District, Shenzhen, 518120, China
Li Deng BGI-Qingdao, BGI-Shenzhen, 2 Hengyunshan Road, West Coast New Area, Qingdao, 266426, China; State Key Laboratory of Agricultural Genomics, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China; BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
Yongwei Zhang BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China; Complete Genomics Inc., 2904 Orchard Pkwy, San Jose, CA 95134, USA
