1
|
Yuan Y, Chung CYL, Chan TF. Advances in optical mapping for genomic research. Comput Struct Biotechnol J 2020; 18:2051-2062. [PMID: 32802277 PMCID: PMC7419273 DOI: 10.1016/j.csbj.2020.07.018] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2020] [Revised: 07/08/2020] [Accepted: 07/24/2020] [Indexed: 12/28/2022] Open
Abstract
Recent advances in optical mapping have allowed the construction of improved genome assemblies with greater contiguity. Optical mapping also enables genome comparison and identification of large-scale structural variations. Association of these large-scale genomic features with biological functions is an important goal in plant and animal breeding and in medical research. Optical mapping has also been used in microbiology and still plays an important role in strain typing and epidemiological studies. Here, we review the development of optical mapping in recent decades to illustrate its importance in genomic research. We detail its applications and algorithms to show its specific advantages. Finally, we discuss the challenges required to facilitate the optimization of optical mapping and improve its future development and application.
Collapse
Key Words
- 3D, three-dimensional
- DBG, de Bruijn graph
- DLS, direct label and strain
- DNA, deoxyribonucleic acid
- Genome assembly
- Hi-C, high-throughput chromosome conformation capture
- Mb, million base pair
- Next generation sequencing
- OLC, overlap-layout-consensus
- Optical mapping
- PCR, polymerase chain reaction
- PacBio, Pacific Biosciences
- SRS, short-read sequencing
- SV, structural variation
- Structural variation
- bp, base pair
- kb, kilobase pair
Collapse
Affiliation(s)
- Yuxuan Yuan
- School of Life Sciences, The Chinese University of Hong Kong, Hong Kong SAR, China
- State Key Laboratory for Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong SAR, China
- AoE Centre for Genomic Studies on Plant-Environment Interaction for Sustainable Agriculture and Food Security, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Claire Yik-Lok Chung
- School of Life Sciences, The Chinese University of Hong Kong, Hong Kong SAR, China
- State Key Laboratory for Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Ting-Fung Chan
- School of Life Sciences, The Chinese University of Hong Kong, Hong Kong SAR, China
- State Key Laboratory for Agrobiotechnology, The Chinese University of Hong Kong, Hong Kong SAR, China
- AoE Centre for Genomic Studies on Plant-Environment Interaction for Sustainable Agriculture and Food Security, The Chinese University of Hong Kong, Hong Kong SAR, China
| |
Collapse
|
2
|
An improved assembly and annotation of the melon (Cucumis melo L.) reference genome. Sci Rep 2018; 8:8088. [PMID: 29795526 PMCID: PMC5967340 DOI: 10.1038/s41598-018-26416-2] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2018] [Accepted: 05/09/2018] [Indexed: 12/20/2022] Open
Abstract
We report an improved assembly (v3.6.1) of the melon (Cucumis melo L.) genome and a new genome annotation (v4.0). The optical mapping approach allowed correcting the order and the orientation of 21 previous scaffolds and permitted to correctly define the gap-size extension along the 12 pseudomolecules. A new comprehensive annotation was also built in order to update the previous annotation v3.5.1, released more than six years ago. Using an integrative annotation pipeline, based on exhaustive RNA-Seq collections and ad-hoc transposable element annotation, we identified 29,980 protein-coding loci. Compared to the previous version, the v4.0 annotation improved gene models in terms of completeness of gene structure, UTR regions definition, intron-exon junctions and reduction of fragmented genes. More than 8,000 new genes were identified, one third of them being well supported by RNA-Seq data. To make all the new resources easily exploitable and completely available for the scientific community, a redesigned Melonomics genomic platform was released at http://melonomics.net. The resources produced in this work considerably increase the reliability of the melon genome assembly and resolution of the gene models paving the way for further studies in melon and related species.
Collapse
|
3
|
Mikheikin A, Olsen A, Leslie K, Russell-Pavier F, Yacoot A, Picco L, Payton O, Toor A, Chesney A, Gimzewski JK, Mishra B, Reed J. DNA nanomapping using CRISPR-Cas9 as a programmable nanoparticle. Nat Commun 2017; 8:1665. [PMID: 29162844 PMCID: PMC5698298 DOI: 10.1038/s41467-017-01891-9] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2017] [Accepted: 10/24/2017] [Indexed: 01/26/2023] Open
Abstract
Progress in whole-genome sequencing using short-read (e.g., <150 bp), next-generation sequencing technologies has reinvigorated interest in high-resolution physical mapping to fill technical gaps that are not well addressed by sequencing. Here, we report two technical advances in DNA nanotechnology and single-molecule genomics: (1) we describe a labeling technique (CRISPR-Cas9 nanoparticles) for high-speed AFM-based physical mapping of DNA and (2) the first successful demonstration of using DVD optics to image DNA molecules with high-speed AFM. As a proof of principle, we used this new “nanomapping” method to detect and map precisely BCL2–IGH translocations present in lymph node biopsies of follicular lymphoma patents. This HS-AFM “nanomapping” technique can be complementary to both sequencing and other physical mapping approaches. Physical mapping of DNA can be used to detect structural variants and for whole-genome haplotype assembly. Here, the authors use CRISPR-Cas9 and high-speed atomic force microscopy to ‘nanomap’ single molecules of DNA.
Collapse
Affiliation(s)
- Andrey Mikheikin
- Department of Physics, Virginia Commonwealth University, Richmond, 23284, VA, USA
| | - Anita Olsen
- Department of Physics, Virginia Commonwealth University, Richmond, 23284, VA, USA
| | - Kevin Leslie
- Department of Physics, Virginia Commonwealth University, Richmond, 23284, VA, USA
| | - Freddie Russell-Pavier
- National Physical Laboratory, Hampton Road, Teddington, TW11 0LW, Middlesex, UK.,Interface Analysis Centre, H. H. Wills Physics Laboratory, Tyndall Avenue, Bristol, BS8 1TL, UK
| | - Andrew Yacoot
- National Physical Laboratory, Hampton Road, Teddington, TW11 0LW, Middlesex, UK
| | - Loren Picco
- Interface Analysis Centre, H. H. Wills Physics Laboratory, Tyndall Avenue, Bristol, BS8 1TL, UK
| | - Oliver Payton
- Interface Analysis Centre, H. H. Wills Physics Laboratory, Tyndall Avenue, Bristol, BS8 1TL, UK
| | - Amir Toor
- Department of Internal Medicine, VCU School of Medicine, Richmond, 23284, VA, USA.,VCU Massey Cancer Center, Richmond, 23284, VA, USA
| | - Alden Chesney
- VCU Massey Cancer Center, Richmond, 23284, VA, USA.,Department of Pathology, VCU School of Medicine, Richmond, 23284, VA, USA
| | - James K Gimzewski
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, 90095, CA, USA
| | - Bud Mishra
- Departments of Computer Science and Mathematics, Courant Institute of Mathematical Sciences, New York University, New York, 10012, NY, USA
| | - Jason Reed
- Department of Physics, Virginia Commonwealth University, Richmond, 23284, VA, USA. .,VCU Massey Cancer Center, Richmond, 23284, VA, USA.
| |
Collapse
|
4
|
Abstract
Optical mapping (OM) has been used in microbiology for the past 20 years, initially as a technique to facilitate DNA sequence-based studies; however, with decreases in DNA sequencing costs and increases in sequence output from automated sequencing platforms, OM has grown into an important auxiliary tool for genome assembly and comparison. Currently, there are a number of new and exciting applications for OM in the field of microbiology, including investigation of disease outbreaks, identification of specific genes of clinical and/or epidemiological relevance, and the possibility of single-cell analysis when combined with cell-sorting approaches. In addition, designing lab-on-a-chip systems based on OM is now feasible and will allow the integrated and automated microbiological analysis of biological fluids. Here, we review the basic technology of OM, detail the current state of the art of the field, and look ahead to possible future developments in OM technology for microbiological applications.
Collapse
|
5
|
Yuan Y, Bayer PE, Batley J, Edwards D. Improvements in Genomic Technologies: Application to Crop Genomics. Trends Biotechnol 2017; 35:547-558. [DOI: 10.1016/j.tibtech.2017.02.009] [Citation(s) in RCA: 54] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2016] [Revised: 02/10/2017] [Accepted: 02/14/2017] [Indexed: 12/13/2022]
|
6
|
Chaney L, Sharp AR, Evans CR, Udall JA. Genome Mapping in Plant Comparative Genomics. TRENDS IN PLANT SCIENCE 2016; 21:770-780. [PMID: 27289181 DOI: 10.1016/j.tplants.2016.05.004] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/04/2016] [Revised: 04/27/2016] [Accepted: 05/12/2016] [Indexed: 05/10/2023]
Abstract
Genome mapping produces fingerprints of DNA sequences to construct a physical map of the whole genome. It provides contiguous, long-range information that complements and, in some cases, replaces sequencing data. Recent advances in genome-mapping technology will better allow researchers to detect large (>1kbp) structural variations between plant genomes. Some molecular and informatics complications need to be overcome for this novel technology to achieve its full utility. This technology will be useful for understanding phenotype responses due to DNA rearrangements and will yield insights into genome evolution, particularly in polyploids. In this review, we outline recent advances in genome-mapping technology, including the processes required for data collection and analysis, and applications in plant comparative genomics.
Collapse
Affiliation(s)
- Lindsay Chaney
- Plant and Wildlife Sciences Department, Brigham Young University, Provo, UT 84602, USA
| | - Aaron R Sharp
- Plant and Wildlife Sciences Department, Brigham Young University, Provo, UT 84602, USA
| | - Carrie R Evans
- Plant and Wildlife Sciences Department, Brigham Young University, Provo, UT 84602, USA
| | - Joshua A Udall
- Plant and Wildlife Sciences Department, Brigham Young University, Provo, UT 84602, USA.
| |
Collapse
|
7
|
Vij S, Kuhl H, Kuznetsova IS, Komissarov A, Yurchenko AA, Van Heusden P, Singh S, Thevasagayam NM, Prakki SRS, Purushothaman K, Saju JM, Jiang J, Mbandi SK, Jonas M, Hin Yan Tong A, Mwangi S, Lau D, Ngoh SY, Liew WC, Shen X, Hon LS, Drake JP, Boitano M, Hall R, Chin CS, Lachumanan R, Korlach J, Trifonov V, Kabilov M, Tupikin A, Green D, Moxon S, Garvin T, Sedlazeck FJ, Vurture GW, Gopalapillai G, Kumar Katneni V, Noble TH, Scaria V, Sivasubbu S, Jerry DR, O'Brien SJ, Schatz MC, Dalmay T, Turner SW, Lok S, Christoffels A, Orbán L. Chromosomal-Level Assembly of the Asian Seabass Genome Using Long Sequence Reads and Multi-layered Scaffolding. PLoS Genet 2016; 12:e1005954. [PMID: 27082250 PMCID: PMC4833346 DOI: 10.1371/journal.pgen.1005954] [Citation(s) in RCA: 85] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2015] [Accepted: 03/03/2016] [Indexed: 11/18/2022] Open
Abstract
We report here the ~670 Mb genome assembly of the Asian seabass (Lates calcarifer), a tropical marine teleost. We used long-read sequencing augmented by transcriptomics, optical and genetic mapping along with shared synteny from closely related fish species to derive a chromosome-level assembly with a contig N50 size over 1 Mb and scaffold N50 size over 25 Mb that span ~90% of the genome. The population structure of L. calcarifer species complex was analyzed by re-sequencing 61 individuals representing various regions across the species' native range. SNP analyses identified high levels of genetic diversity and confirmed earlier indications of a population stratification comprising three clades with signs of admixture apparent in the South-East Asian population. The quality of the Asian seabass genome assembly far exceeds that of any other fish species, and will serve as a new standard for fish genomics.
Collapse
Affiliation(s)
- Shubha Vij
- Reproductive Genomics Group, Temasek Life Sciences Laboratory, Singapore
| | - Heiner Kuhl
- Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Inna S. Kuznetsova
- Reproductive Genomics Group, Temasek Life Sciences Laboratory, Singapore
- Laboratory of Chromosome Structure and Function, Department of Cytology and Histology, Biological Faculty, Saint Petersburg State University, St. Petersburg, Russia
| | - Aleksey Komissarov
- Theodosius Dobzhansky Center for Genome Bioinformatics, Saint Petersburg State University, St. Petersburg, Russia
| | - Andrey A. Yurchenko
- Theodosius Dobzhansky Center for Genome Bioinformatics, Saint Petersburg State University, St. Petersburg, Russia
| | - Peter Van Heusden
- South African MRC Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Bellville, South Africa
| | - Siddharth Singh
- Pacific Biosciences, Menlo Park, California, United States of America
| | | | | | | | - Jolly M. Saju
- Reproductive Genomics Group, Temasek Life Sciences Laboratory, Singapore
| | - Junhui Jiang
- Reproductive Genomics Group, Temasek Life Sciences Laboratory, Singapore
| | - Stanley Kimbung Mbandi
- South African MRC Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Bellville, South Africa
| | - Mario Jonas
- South African MRC Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Bellville, South Africa
| | - Amy Hin Yan Tong
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Canada
| | - Sarah Mwangi
- South African MRC Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Bellville, South Africa
| | - Doreen Lau
- Reproductive Genomics Group, Temasek Life Sciences Laboratory, Singapore
| | - Si Yan Ngoh
- Reproductive Genomics Group, Temasek Life Sciences Laboratory, Singapore
| | - Woei Chang Liew
- Reproductive Genomics Group, Temasek Life Sciences Laboratory, Singapore
| | - Xueyan Shen
- Reproductive Genomics Group, Temasek Life Sciences Laboratory, Singapore
| | - Lawrence S. Hon
- Pacific Biosciences, Menlo Park, California, United States of America
| | - James P. Drake
- Pacific Biosciences, Menlo Park, California, United States of America
| | - Matthew Boitano
- Pacific Biosciences, Menlo Park, California, United States of America
| | - Richard Hall
- Pacific Biosciences, Menlo Park, California, United States of America
| | - Chen-Shan Chin
- Pacific Biosciences, Menlo Park, California, United States of America
| | | | - Jonas Korlach
- Pacific Biosciences, Menlo Park, California, United States of America
| | - Vladimir Trifonov
- Institute of Molecular and Cellular Biology, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russian Federation
| | - Marsel Kabilov
- Genomics Core Facility, Institute of Chemical Biology and Fundamental Medicine, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Alexey Tupikin
- Genomics Core Facility, Institute of Chemical Biology and Fundamental Medicine, Siberian Branch of the Russian Academy of Sciences, Novosibirsk, Russia
| | - Darrell Green
- Norwich Medical School, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
| | - Simon Moxon
- The Genome Analysis Centre, Norwich, United Kingdom
| | - Tyler Garvin
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, New York, United States of America
| | - Fritz J. Sedlazeck
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, New York, United States of America
- Department of Computer Science, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Gregory W. Vurture
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, New York, United States of America
| | - Gopikrishna Gopalapillai
- Nutrition, Genetics & Biotechnology Division, ICAR-Central Institute of Brackishwater Aquaculture, Tamil Nadu, India
| | - Vinaya Kumar Katneni
- Nutrition, Genetics & Biotechnology Division, ICAR-Central Institute of Brackishwater Aquaculture, Tamil Nadu, India
| | - Tansyn H. Noble
- College of Marine and Environmental Sciences and Center for Sustainable Tropical Fisheries and Aquaculture, James Cook University, Townsville, Queensland, Australia
| | - Vinod Scaria
- CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), New Delhi, India
| | - Sridhar Sivasubbu
- CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), New Delhi, India
| | - Dean R. Jerry
- College of Marine and Environmental Sciences and Center for Sustainable Tropical Fisheries and Aquaculture, James Cook University, Townsville, Queensland, Australia
| | - Stephen J. O'Brien
- Theodosius Dobzhansky Center for Genome Bioinformatics, Saint Petersburg State University, St. Petersburg, Russia
- Oceanographic Center, Nova Southeastern University Ft. Lauderdale, Ft. Lauderdale, Florida, United States of America
| | - Michael C. Schatz
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, One Bungtown Road, Cold Spring Harbor, New York, United States of America
- Department of Computer Science, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Tamás Dalmay
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
| | - Stephen W. Turner
- Pacific Biosciences, Menlo Park, California, United States of America
| | - Si Lok
- The Centre for Applied Genomics, The Hospital for Sick Children, Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada
| | - Alan Christoffels
- South African MRC Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Bellville, South Africa
| | - László Orbán
- Reproductive Genomics Group, Temasek Life Sciences Laboratory, Singapore
- Department of Animal Sciences and Animal Husbandry, Georgikon Faculty, University of Pannonia, Keszthely, Hungary
- Centre for Comparative Genomics, Murdoch University, Murdoch, Australia
| |
Collapse
|
8
|
Xiao S, Li J, Ma F, Fang L, Xu S, Chen W, Wang ZY. Rapid construction of genome map for large yellow croaker (Larimichthys crocea) by the whole-genome mapping in BioNano Genomics Irys system. BMC Genomics 2015; 16:670. [PMID: 26336087 PMCID: PMC4559010 DOI: 10.1186/s12864-015-1871-z] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2015] [Accepted: 08/21/2015] [Indexed: 12/21/2022] Open
Abstract
Background Large yellow croaker (Larimichthys crocea) is an important commercial fish in China and East-Asia. The annual product of the species from the aqua-farming industry is about 90 thousand tons. In spite of its economic importance, genetic studies of economic traits and genomic selections of the species are hindered by the lack of genomic resources. Specifically, a whole-genome physical map of large yellow croaker is still missing. The traditional BAC-based fingerprint method is extremely time- and labour-consuming. Here we report the first genome map construction using the high-throughput whole-genome mapping technique by nanochannel arrays in BioNano Genomics Irys system. Results For an optimal marker density of ~10 per 100 kb, the nicking endonuclease Nt.BspQ1 was chosen for the genome map generation. 645,305 DNA molecules with a total length of ~112 Gb were labelled and detected, covering more than 160X of the large yellow croaker genome. Employing IrysView package and signature patterns in raw DNA molecules, a whole-genome map of large yellow croaker was assembled into 686 maps with a total length of 727 Mb, which was consistent with the estimated genome size. The N50 length of the whole-genome map, including 126 maps, was up to 1.7 Mb. The excellent hybrid alignment with large yellow croaker draft genome validated the consensus genome map assembly and highlighted a promising application of whole-genome mapping on draft genome sequence super-scaffolding. The genome map data of large yellow croaker are accessible on lycgenomics.jmu.edu.cn/pm. Conclusion Using the state-of-the-art whole-genome mapping technique in Irys system, the first whole-genome map for large yellow croaker has been constructed and thus highly facilitates the ongoing genomic and evolutionary studies for the species. To our knowledge, this is the first public report on genome map construction by the whole-genome mapping for aquatic-organisms. Our study demonstrates a promising application of the whole-genome mapping on genome maps construction for other non-model organisms in a fast and reliable manner. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1871-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Shijun Xiao
- Key Laboratory of Healthy Mariculture in the East China Sea, Ministry of Agriculture; Fisheries College, Jimei University, Yindou Road, Xiamen, P.R. China
| | - Jiongtang Li
- Chinese Academy of Fishery Sciences, Yongding Road, Beijing, P.R. China
| | | | - Lujing Fang
- Key Laboratory of Healthy Mariculture in the East China Sea, Ministry of Agriculture; Fisheries College, Jimei University, Yindou Road, Xiamen, P.R. China
| | - Shuangbin Xu
- Key Laboratory of Healthy Mariculture in the East China Sea, Ministry of Agriculture; Fisheries College, Jimei University, Yindou Road, Xiamen, P.R. China
| | - Wei Chen
- Key Laboratory of Healthy Mariculture in the East China Sea, Ministry of Agriculture; Fisheries College, Jimei University, Yindou Road, Xiamen, P.R. China
| | - Zhi Yong Wang
- Key Laboratory of Healthy Mariculture in the East China Sea, Ministry of Agriculture; Fisheries College, Jimei University, Yindou Road, Xiamen, P.R. China.
| |
Collapse
|
9
|
Zhou S, Goldstein S, Place M, Bechner M, Patino D, Potamousis K, Ravindran P, Pape L, Rincon G, Hernandez-Ortiz J, Medrano JF, Schwartz DC. A clone-free, single molecule map of the domestic cow (Bos taurus) genome. BMC Genomics 2015; 16:644. [PMID: 26314885 PMCID: PMC4551733 DOI: 10.1186/s12864-015-1823-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2015] [Accepted: 08/07/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The cattle (Bos taurus) genome was originally selected for sequencing due to its economic importance and unique biology as a model organism for understanding other ruminants, or mammals. Currently, there are two cattle genome sequence assemblies (UMD3.1 and Btau4.6) from groups using dissimilar assembly algorithms, which were complemented by genetic and physical map resources. However, past comparisons between these assemblies revealed substantial differences. Consequently, such discordances have engendered ambiguities when using reference sequence data, impacting genomic studies in cattle and motivating construction of a new optical map resource--BtOM1.0--to guide comparisons and improvements to the current sequence builds. Accordingly, our comprehensive comparisons of BtOM1.0 against the UMD3.1 and Btau4.6 sequence builds tabulate large-to-immediate scale discordances requiring mediation. RESULTS The optical map, BtOM1.0, spanning the B. taurus genome (Hereford breed, L1 Dominette 01449) was assembled from an optical map dataset consisting of 2,973,315 (439 X; raw dataset size before assembly) single molecule optical maps (Rmaps; 1 Rmap = 1 restriction mapped DNA molecule) generated by the Optical Mapping System. The BamHI map spans 2,575.30 Mb and comprises 78 optical contigs assembled by a combination of iterative (using the reference sequence: UMD3.1) and de novo assembly techniques. BtOM1.0 is a high-resolution physical map featuring an average restriction fragment size of 8.91 Kb. Comparisons of BtOM1.0 vs. UMD3.1, or Btau4.6, revealed that Btau4.6 presented far more discordances (7,463) vs. UMD3.1 (4,754). Overall, we found that Btau4.6 presented almost double the number of discordances than UMD3.1 across most of the 6 categories of sequence vs. map discrepancies, which are: COMPLEX (misassembly), DELs (extraneous sequences), INSs (missing sequences), ITs (Inverted/Translocated sequences), ECs (extra restriction cuts) and MCs (missing restriction cuts). CONCLUSION Alignments of UMD3.1 and Btau4.6 to BtOM1.0 reveal discordances commensurate with previous reports, and affirm the NCBI's current designation of UMD3.1 sequence assembly as the "reference assembly" and the Btau4.6 as the "alternate assembly." The cattle genome optical map, BtOM1.0, when used as a comprehensive and largely independent guide, will greatly assist improvements to existing sequence builds, and later serve as an accurate physical scaffold for studies concerning the comparative genomics of cattle breeds.
Collapse
Affiliation(s)
- Shiguo Zhou
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, and the UW Biotechnology Center, University of Wisconsin-Madison, 425 Henry Mall, Madison, WI, 53706, USA.
| | - Steve Goldstein
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, and the UW Biotechnology Center, University of Wisconsin-Madison, 425 Henry Mall, Madison, WI, 53706, USA.
| | - Michael Place
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, and the UW Biotechnology Center, University of Wisconsin-Madison, 425 Henry Mall, Madison, WI, 53706, USA.
| | - Michael Bechner
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, and the UW Biotechnology Center, University of Wisconsin-Madison, 425 Henry Mall, Madison, WI, 53706, USA.
| | - Diego Patino
- Departamento de Materiales, Facultad de Minas, Universidad Nacional de Colombia, Sede Medellin, Calle 75 # 79A-51, Bloque M17, Medellin, Colombia, SA.
| | - Konstantinos Potamousis
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, and the UW Biotechnology Center, University of Wisconsin-Madison, 425 Henry Mall, Madison, WI, 53706, USA.
| | - Prabu Ravindran
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, and the UW Biotechnology Center, University of Wisconsin-Madison, 425 Henry Mall, Madison, WI, 53706, USA.
| | - Louise Pape
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, and the UW Biotechnology Center, University of Wisconsin-Madison, 425 Henry Mall, Madison, WI, 53706, USA.
| | - Gonzalo Rincon
- Department of Animal Science, University of California-Davis, Davis, CA, 95616, USA.
| | - Juan Hernandez-Ortiz
- Departamento de Materiales, Facultad de Minas, Universidad Nacional de Colombia, Sede Medellin, Calle 75 # 79A-51, Bloque M17, Medellin, Colombia, SA.
| | - Juan F Medrano
- Department of Animal Science, University of California-Davis, Davis, CA, 95616, USA.
| | - David C Schwartz
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, and the UW Biotechnology Center, University of Wisconsin-Madison, 425 Henry Mall, Madison, WI, 53706, USA.
| |
Collapse
|
10
|
Discovery of structural alterations in solid tumor oligodendroglioma by single molecule analysis. BMC Genomics 2013; 14:505. [PMID: 23885787 PMCID: PMC3727977 DOI: 10.1186/1471-2164-14-505] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2013] [Accepted: 07/23/2013] [Indexed: 12/31/2022] Open
Abstract
Background Solid tumors present a panoply of genomic alterations, from single base changes to the gain or loss of entire chromosomes. Although aberrations at the two extremes of this spectrum are readily defined, comprehensive discernment of the complex and disperse mutational spectrum of cancer genomes remains a significant challenge for current genome analysis platforms. In this context, high throughput, single molecule platforms like Optical Mapping offer a unique perspective. Results Using measurements from large ensembles of individual DNA molecules, we have discovered genomic structural alterations in the solid tumor oligodendroglioma. Over a thousand structural variants were identified in each tumor sample, without any prior hypotheses, and often in genomic regions deemed intractable by other technologies. These findings were then validated by comprehensive comparisons to variants reported in external and internal databases, and by selected experimental corroborations. Alterations range in size from under 5 kb to hundreds of kilobases, and comprise insertions, deletions, inversions and compound events. Candidate mutations were scored at sub-genic resolution and unambiguously reveal structural details at aberrant loci. Conclusions The Optical Mapping system provides a rich description of the complex genomes of solid tumors, including sequence level aberrations, structural alterations and copy number variants that power generation of functional hypotheses for oligodendroglioma genetics.
Collapse
|
11
|
Dorfman KD, King SB, Olson DW, Thomas JDP, Tree DR. Beyond gel electrophoresis: microfluidic separations, fluorescence burst analysis, and DNA stretching. Chem Rev 2013; 113:2584-667. [PMID: 23140825 PMCID: PMC3595390 DOI: 10.1021/cr3002142] [Citation(s) in RCA: 141] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Kevin D. Dorfman
- Department of Chemical Engineering and Materials Science, University of Minnesota — Twin Cities, 421 Washington Ave. SE, Minneapolis, MN 55455, Phone: 1-612-624-5560. Fax: 1-612-626-7246
| | - Scott B. King
- Department of Chemical Engineering and Materials Science, University of Minnesota — Twin Cities, 421 Washington Ave. SE, Minneapolis, MN 55455, Phone: 1-612-624-5560. Fax: 1-612-626-7246
| | - Daniel W. Olson
- Department of Chemical Engineering and Materials Science, University of Minnesota — Twin Cities, 421 Washington Ave. SE, Minneapolis, MN 55455, Phone: 1-612-624-5560. Fax: 1-612-626-7246
| | - Joel D. P. Thomas
- Department of Chemical Engineering and Materials Science, University of Minnesota — Twin Cities, 421 Washington Ave. SE, Minneapolis, MN 55455, Phone: 1-612-624-5560. Fax: 1-612-626-7246
| | - Douglas R. Tree
- Department of Chemical Engineering and Materials Science, University of Minnesota — Twin Cities, 421 Washington Ave. SE, Minneapolis, MN 55455, Phone: 1-612-624-5560. Fax: 1-612-626-7246
| |
Collapse
|
12
|
Integrated view of genome structure and sequence of a single DNA molecule in a nanofluidic device. Proc Natl Acad Sci U S A 2013; 110:4893-8. [PMID: 23479649 DOI: 10.1073/pnas.1214570110] [Citation(s) in RCA: 68] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We show how a bird's-eye view of genomic structure can be obtained at ∼1-kb resolution from long (∼2 Mb) DNA molecules extracted from whole chromosomes in a nanofluidic laboratory-on-a-chip. We use an improved single-molecule denaturation mapping approach to detect repetitive elements and known as well as unique structural variation. Following its mapping, a molecule of interest was rescued from the chip; amplified and localized to a chromosome by FISH; and interrogated down to 1-bp resolution with a commercial sequencer, thereby reconciling haplotype-phased chromosome substructure with sequence.
Collapse
|
13
|
Bannantine JP, Wu CW, Hsu C, Zhou S, Schwartz DC, Bayles DO, Paustian ML, Alt DP, Sreevatsan S, Kapur V, Talaat AM. Genome sequencing of ovine isolates of Mycobacterium avium subspecies paratuberculosis offers insights into host association. BMC Genomics 2012; 13:89. [PMID: 22409516 PMCID: PMC3337245 DOI: 10.1186/1471-2164-13-89] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2011] [Accepted: 03/12/2012] [Indexed: 01/09/2023] Open
Abstract
Background The genome of Mycobacterium avium subspecies paratuberculosis (MAP) is remarkably homogeneous among the genomes of bovine, human and wildlife isolates. However, previous work in our laboratories with the bovine K-10 strain has revealed substantial differences compared to sheep isolates. To systematically characterize all genomic differences that may be associated with the specific hosts, we sequenced the genomes of three U.S. sheep isolates and also obtained an optical map. Results Our analysis of one of the isolates, MAP S397, revealed a genome 4.8 Mb in size with 4,700 open reading frames (ORFs). Comparative analysis of the MAP S397 isolate showed it acquired approximately 10 large sequence regions that are shared with the human M. avium subsp. hominissuis strain 104 and lost 2 large regions that are present in the bovine strain. In addition, optical mapping defined the presence of 7 large inversions between the bovine and ovine genomes (~ 2.36 Mb). Whole-genome sequencing of 2 additional sheep strains of MAP (JTC1074 and JTC7565) further confirmed genomic homogeneity of the sheep isolates despite the presence of polymorphisms on the nucleotide level. Conclusions Comparative sequence analysis employed here provided a better understanding of the host association, evolution of members of the M. avium complex and could help in deciphering the phenotypic differences observed among sheep and cattle strains of MAP. A similar approach based on whole-genome sequencing combined with optical mapping could be employed to examine closely related pathogens. We propose an evolutionary scenario for M. avium complex strains based on these genome sequences.
Collapse
Affiliation(s)
- John P Bannantine
- National Animal Disease Center, USDA-Agricultural Research Service, Ames, Iowa, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
14
|
Abstract
Genetic manipulation of Plasmodium falciparum remains very challenging, mainly due to the parasite genome's high A/T-richness and low transfection efficiency. This chapter includes methods for generating transient and stable transfections by electroporation, allelic replacement with tagged genes, gene deletion, and the analysis of all the above.
Collapse
|
15
|
Riley MC, Kirkup BC, Johnson JD, Lesho EP, Ockenhouse CF. Rapid whole genome optical mapping of Plasmodium falciparum. Malar J 2011; 10:252. [PMID: 21871093 PMCID: PMC3173401 DOI: 10.1186/1475-2875-10-252] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2011] [Accepted: 08/26/2011] [Indexed: 11/21/2022] Open
Abstract
Background Immune evasion and drug resistance in malaria have been linked to chromosomal recombination and gene copy number variation (CNV). These events are ideally studied using comparative genomic analyses; however in malaria these analyses are not as common or thorough as in other infectious diseases, partly due to the difficulty in sequencing and assembling complete genome drafts. Recently, whole genome optical mapping has gained wide use in support of genomic sequence assembly and comparison. Here, a rapid technique for producing whole genome optical maps of Plasmodium falciparum is described and the results of mapping four genomes are presented. Methods Four laboratory strains of P. falciparum were analysed using the Argus™ optical mapping system to produce ordered restriction fragment maps of all 14 chromosomes in each genome. Plasmodium falciparum DNA was isolated directly from blood culture, visualized using the Argus™ system and assembled in a manner analogous to next generation sequence assembly into maps (AssemblyViewer™, OpGen Inc.®). Full coverage maps were generated for P. falciparum strains 3D7, FVO, D6 and C235. A reference P. falciparum in silico map was created by the digestion of the genomic sequence of P. falciparum with the restriction enzyme AflII, for comparisons to genomic optical maps. Maps were then compared using the MapSolver™ software. Results Genomic variation was observed among the mapped strains, as well as between the map of the reference strain and the map derived from the putative sequence of that same strain. Duplications, deletions, insertions, inversions and misassemblies of sizes ranging from 3,500 base pairs up to 78,000 base pairs were observed. Many genomic events occurred in areas of known repetitive sequence or high copy number genes, including var gene clusters and rifin complexes. Conclusions This technique for optical mapping of multiple malaria genomes allows for whole genome comparison of multiple strains and can assist in identifying genetic variation and sequence contig assembly. New protocols and technology allowed us to produce high quality contigs spanning four P. falciparum genomes in six weeks for less than $1,000.00 per genome. This relatively low cost and quick turnaround makes the technique valuable compared to other genomic sequencing technologies for studying genetic variation in malaria.
Collapse
Affiliation(s)
- Matthew C Riley
- Walter Reed Army Institute of Research, Division of Malaria Vaccine Development, Silver Spring, Maryland, USA.
| | | | | | | | | |
Collapse
|
16
|
Blake DP, Oakes R, Smith AL. A genetic linkage map for the apicomplexan protozoan parasite Eimeria maxima and comparison with Eimeria tenella. Int J Parasitol 2011; 41:263-70. [DOI: 10.1016/j.ijpara.2010.09.004] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2010] [Revised: 09/09/2010] [Accepted: 09/15/2010] [Indexed: 11/24/2022]
|
17
|
Teague B, Waterman MS, Goldstein S, Potamousis K, Zhou S, Reslewic S, Sarkar D, Valouev A, Churas C, Kidd JM, Kohn S, Runnheim R, Lamers C, Forrest D, Newton MA, Eichler EE, Kent-First M, Surti U, Livny M, Schwartz DC. High-resolution human genome structure by single-molecule analysis. Proc Natl Acad Sci U S A 2010; 107:10848-53. [PMID: 20534489 PMCID: PMC2890719 DOI: 10.1073/pnas.0914638107] [Citation(s) in RCA: 144] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Variation in genome structure is an important source of human genetic polymorphism: It affects a large proportion of the genome and has a variety of phenotypic consequences relevant to health and disease. In spite of this, human genome structure variation is incompletely characterized due to a lack of approaches for discovering a broad range of structural variants in a global, comprehensive fashion. We addressed this gap with Optical Mapping, a high-throughput, high-resolution single-molecule system for studying genome structure. We used Optical Mapping to create genome-wide restriction maps of a complete hydatidiform mole and three lymphoblast-derived cell lines, and we validated the approach by demonstrating a strong concordance with existing methods. We also describe thousands of new variants with sizes ranging from kb to Mb.
Collapse
Affiliation(s)
- Brian Teague
- The Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and Biotechnology Center, University of Wisconsin, 425 Henry Mall, Madison, WI 53706-1580
| | - Michael S. Waterman
- Department of Biological Sciences, University of Southern California, 1050 Childs Way, Los Angeles, CA 90089-2910
| | - Steven Goldstein
- The Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and Biotechnology Center, University of Wisconsin, 425 Henry Mall, Madison, WI 53706-1580
| | - Konstantinos Potamousis
- The Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and Biotechnology Center, University of Wisconsin, 425 Henry Mall, Madison, WI 53706-1580
| | - Shiguo Zhou
- The Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and Biotechnology Center, University of Wisconsin, 425 Henry Mall, Madison, WI 53706-1580
| | - Susan Reslewic
- The Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and Biotechnology Center, University of Wisconsin, 425 Henry Mall, Madison, WI 53706-1580
| | - Deepayan Sarkar
- Department of Statistics, University of Wisconsin, 1300 University Avenue, Madison, WI 53706-1510
| | - Anton Valouev
- Department of Biological Sciences, University of Southern California, 1050 Childs Way, Los Angeles, CA 90089-2910
| | - Christopher Churas
- The Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and Biotechnology Center, University of Wisconsin, 425 Henry Mall, Madison, WI 53706-1580
| | - Jeffrey M. Kidd
- Department of Genome Sciences, University of Washington, 1705 NE Pacific Street, Seattle, WA 98195-5065
| | - Scott Kohn
- The Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and Biotechnology Center, University of Wisconsin, 425 Henry Mall, Madison, WI 53706-1580
| | - Rodney Runnheim
- The Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and Biotechnology Center, University of Wisconsin, 425 Henry Mall, Madison, WI 53706-1580
| | - Casey Lamers
- The Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and Biotechnology Center, University of Wisconsin, 425 Henry Mall, Madison, WI 53706-1580
| | - Dan Forrest
- The Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and Biotechnology Center, University of Wisconsin, 425 Henry Mall, Madison, WI 53706-1580
| | - Michael A. Newton
- Department of Statistics, University of Wisconsin, 1300 University Avenue, Madison, WI 53706-1510
- Department of Biostatistics and Medical Informatics, University of Wisconsin, 1300 University Avenue, Madison, WI 53706-1510
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington, 1705 NE Pacific Street, Seattle, WA 98195-5065
| | - Marijo Kent-First
- Department of Animal Science, Department of Biological Sciences, Mississippi State University, 130 Harned Hall, Lee Boulevard, Mississippi State, MS 39762-9698
| | - Urvashi Surti
- Department of Pathology, University of Pittsburgh, 200 Lothrop Street, Pittsburgh, PA 15213-2536; and
| | - Miron Livny
- Department of Computer Sciences, University of Wisconsin, 1210 West Dayton Street, Madison, WI 53706-1685
| | - David C. Schwartz
- The Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics and Biotechnology Center, University of Wisconsin, 425 Henry Mall, Madison, WI 53706-1580
| |
Collapse
|
18
|
Abstract
Research into genome assembly algorithms has experienced a resurgence due to new challenges created by the development of next generation sequencing technologies. Several genome assemblers have been published in recent years specifically targeted at the new sequence data; however, the ever-changing technological landscape leads to the need for continued research. In addition, the low cost of next generation sequencing data has led to an increased use of sequencing in new settings. For example, the new field of metagenomics relies on large-scale sequencing of entire microbial communities instead of isolate genomes, leading to new computational challenges. In this article, we outline the major algorithmic approaches for genome assembly and describe recent developments in this domain.
Collapse
Affiliation(s)
- Mihai Pop
- Department of Computer Science and the Center for Bioinformatics and Computational Biology at the University of Maryland, College Park, MD 20742, USA.
| |
Collapse
|
19
|
Jiang Z, Rokhsar DS, Harland RM. Old can be new again: HAPPY whole genome sequencing, mapping and assembly. Int J Biol Sci 2009; 5:298-303. [PMID: 19381348 PMCID: PMC2669597 DOI: 10.7150/ijbs.5.298] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2009] [Accepted: 04/12/2009] [Indexed: 11/05/2022] Open
Abstract
During the last three decades, both genome mapping and sequencing methods have advanced significantly to provide a foundation for scientists to understand genome structures and functions in many species. Generally speaking, genome mapping relies on genome sequencing to provide basic materials, such as DNA probes and markers for their localizations, thus constructing the maps. On the other hand, genome sequencing often requires a high-resolution map as a skeleton for whole genome assembly. However, both genome mapping and sequencing have never come together in one pipeline. After reviewing mapping and next-generation sequencing methods, we would like to share our thoughts with the genome community on how to combine the HAPPY mapping technique with the new-generation sequencing, thus integrating two systems into one pipeline, called HAPPY pipeline. The pipeline starts with preparation of a HAPPY panel, followed by multiple displacement amplification for producing a relatively large quantity of DNA. Instead of conventional marker genotyping, the amplified panel DNA samples are subject to new-generation sequencing with barcode method, which allows us to determine the presence/absence of a sequence contig as a traditional marker in the HAPPY panel. Statistical analysis will then be performed to infer how close or how far away from each other these contigs are within a genome and order the whole genome sequence assembly as well. We believe that such a universal approach will play an important role in genome sequencing, mapping, and assembly of many species; thus advancing genome science and its applications in biomedicine and agriculture.
Collapse
Affiliation(s)
- Zhihua Jiang
- Department of Animal Sciences and Center for Reproductive Biology, Washington State University, Pullman, WA 99164-6351, USA.
| | | | | |
Collapse
|
20
|
Wu CW, Schramm TM, Zhou S, Schwartz DC, Talaat AM. Optical mapping of the Mycobacterium avium subspecies paratuberculosis genome. BMC Genomics 2009; 10:25. [PMID: 19146697 PMCID: PMC2633350 DOI: 10.1186/1471-2164-10-25] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2008] [Accepted: 01/15/2009] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND Infection of cattle with Mycobacterium avium subspecies paratuberculosis (M. ap) causes severe economic losses to the dairy industry in the USA and worldwide. In an effort to better examine diversity among M. ap strains, we used optical mapping to profile genomic variations between strains of M. ap K-10 (sequenced strain) and M. ap ATCC 19698 (type strain). RESULTS The assembled physical restriction map of M. ap ATCC 19698 showed a genome size of 4,839 kb compared to the sequenced K-10 genome of 4,830 kb. Interestingly, alignment of the optical map of the M. ap ATCC 19698 genome to the complete M. ap K-10 genome sequence revealed a 648-kb inversion around the origin of replication. However, Southern blotting, PCR amplification and sequencing analyses of the inverted region revealed that the genome of M. ap K-10 differs from the published sequence in the region starting from 4,197,080 bp to 11,150 bp, spanning the origin of replication. Additionally, two new copies of the coding sequences > 99.8% were identified, identical to the MAP0849c and MAP0850c genes located immediately downstream of the MAP3758c gene. CONCLUSION The optical map of M. ap ATCC 19698 clearly indicated the miss-assembly of the sequenced genome of M. ap K-10. Moreover, it identified 2 new genes in M. ap K-10 genome. This analysis strongly advocates for the utility of physical mapping protocols to complement genome sequencing projects.
Collapse
Affiliation(s)
- Chia-wei Wu
- The Laboratory of Bacterial Genomics, Department of Pathobiological Sciences, University of Wisconsin-Madison, WI, USA.
| | | | | | | | | |
Collapse
|
21
|
Jo K, Schramm TM, Schwartz DC. A single-molecule barcoding system using nanoslits for DNA analysis : nanocoding. Methods Mol Biol 2009; 544:29-42. [PMID: 19488691 DOI: 10.1007/978-1-59745-483-4_3] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Single DNA molecule approaches are playing an increasingly central role in the analytical genomic sciences because single molecule techniques intrinsically provide individualized measurements of selected molecules, free from the constraints of bulk techniques, which blindly average noise and mask the presence of minor analyte components. Accordingly, a principal challenge that must be addressed by all single molecule approaches aimed at genome analysis is how to immobilize and manipulate DNA molecules for measurements that foster construction of large, biologically relevant data sets. For meeting this challenge, this chapter discusses an integrated approach for microfabricated and nanofabricated devices for the manipulation of elongated DNA molecules within nanoscale geometries. Ideally, large DNA coils stretch via nanoconfinement when channel dimensions are within tens of nanometers. Importantly, stretched, often immobilized, DNA molecules spanning hundreds of kilobase pairs are required by all analytical platforms working with large genomic substrates because imaging techniques acquire sequence information from molecules that normally exist in free solution as unrevealing random coils resembling floppy balls of yarn. However, nanoscale devices fabricated with sufficiently small dimensions fostering molecular stretching make these devices impractical because of the requirement of exotic fabrication technologies, costly materials, and poor operational efficiencies. In this chapter, such problems are addressed by discussion of a new approach to DNA presentation and analysis that establishes scaleable nanoconfinement conditions through reduction of ionic strength; stiffening DNA molecules thus enabling their arraying for analysis using easily fabricated devices that can also be mass produced. This new approach to DNA nanoconfinement is complemented by the development of a novel labeling scheme for reliable marking of individual molecules with fluorochrome labels, creating molecular barcodes, which are efficiently read using fluorescence resonance energy transfer techniques for minimizing noise from unincorporated labels. As such, our integrative approach for the realization of genomic analysis through nanoconfinement, named nanocoding, was demonstrated through the barcoding and mapping of bacterial artificial chromosomal molecules, thereby providing the basis for a high-throughput platform competent for whole genome investigations.
Collapse
Affiliation(s)
- Kyubong Jo
- Department of Chemistry, University of Wisconsin, Madison, WI 53706, USA
| | | | | |
Collapse
|
22
|
Ananiev GE, Goldstein S, Runnheim R, Forrest DK, Zhou S, Potamousis K, Churas CP, Bergendahl V, Thomson JA, Schwartz DC. Optical mapping discerns genome wide DNA methylation profiles. BMC Mol Biol 2008; 9:68. [PMID: 18667073 PMCID: PMC2516518 DOI: 10.1186/1471-2199-9-68] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2008] [Accepted: 07/30/2008] [Indexed: 11/23/2022] Open
Abstract
Background Methylation of CpG dinucleotides is a fundamental mechanism of epigenetic regulation in eukaryotic genomes. Development of methods for rapid genome wide methylation profiling will greatly facilitate both hypothesis and discovery driven research in the field of epigenetics. In this regard, a single molecule approach to methylation profiling offers several unique advantages that include elimination of chemical DNA modification steps and PCR amplification. Results A single molecule approach is presented for the discernment of methylation profiles, based on optical mapping. We report results from a series of pilot studies demonstrating the capabilities of optical mapping as a platform for methylation profiling of whole genomes. Optical mapping was used to discern the methylation profile from both an engineered and wild type Escherichia coli. Furthermore, the methylation status of selected loci within the genome of human embryonic stem cells was profiled using optical mapping. Conclusion The optical mapping platform effectively detects DNA methylation patterns. Due to single molecule detection, optical mapping offers significant advantages over other technologies. This advantage stems from obviation of DNA modification steps, such as bisulfite treatment, and the ability of the platform to assay repeat dense regions within mammalian genomes inaccessible to techniques using array-hybridization technologies.
Collapse
Affiliation(s)
- Gene E Ananiev
- Department of Chemistry, Laboratory for Molecular and Computational Genomics, University of Wisconsin Biotechnology Center, University of Wisconsin-Madison, Madison, WI 53706, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Latreille P, Norton S, Goldman BS, Henkhaus J, Miller N, Barbazuk B, Bode HB, Darby C, Du Z, Forst S, Gaudriault S, Goodner B, Goodrich-Blair H, Slater S. Optical mapping as a routine tool for bacterial genome sequence finishing. BMC Genomics 2007; 8:321. [PMID: 17868451 PMCID: PMC2045679 DOI: 10.1186/1471-2164-8-321] [Citation(s) in RCA: 90] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2006] [Accepted: 09/14/2007] [Indexed: 11/25/2022] Open
Abstract
Background In sequencing the genomes of two Xenorhabdus species, we encountered a large number of sequence repeats and assembly anomalies that stalled finishing efforts. This included a stretch of about 12 Kb that is over 99.9% identical between the plasmid and chromosome of X. nematophila. Results Whole genome restriction maps of the sequenced strains were produced through optical mapping technology. These maps allowed rapid resolution of sequence assembly problems, permitted closing of the genome, and allowed correction of a large inversion in a genome assembly that we had considered finished. Conclusion Our experience suggests that routine use of optical mapping in bacterial genome sequence finishing is warranted. When combined with data produced through 454 sequencing, an optical map can rapidly and inexpensively generate an ordered and oriented set of contigs to produce a nearly complete genome sequence assembly.
Collapse
Affiliation(s)
- Phil Latreille
- Monsanto Company, 800 North Lindbergh Boulevard St. Louis, MO 63167, USA
| | - Stacie Norton
- Monsanto Company, 800 North Lindbergh Boulevard St. Louis, MO 63167, USA
| | - Barry S Goldman
- Monsanto Company, 800 North Lindbergh Boulevard St. Louis, MO 63167, USA
| | - John Henkhaus
- OpGen Technologies, Inc., 510 Charmany Drive, Suite 151, Madison, WI 53719, USA
| | - Nancy Miller
- Monsanto Company, 800 North Lindbergh Boulevard St. Louis, MO 63167, USA
| | - Brad Barbazuk
- Donald Danforth Plant Sciences Center, 975 North Warson Road St. Louis, MO 63132, USA
| | - Helge B Bode
- Institut für Pharmazeutische Biotechnologie, Universität des Saarlandes, 66123 Saarbrücken, Germany
| | - Creg Darby
- University of California, San Francisco, Department of Cell and Tissue Biology, San Francisco, CA 94143, USA
| | - Zijin Du
- Monsanto Company, 800 North Lindbergh Boulevard St. Louis, MO 63167, USA
| | - Steve Forst
- University of Wisconsin, Milwaukee, Department of Biological Sciences, Milwaukee, WI 53211, USA
| | - Sophie Gaudriault
- Institut National de la Recherche Agronomique-Université de Montpellier II, 34095 Montpellier, France
| | - Brad Goodner
- Hiram College, Department of Biology, Hiram, OH 44234, USA
| | | | - Steven Slater
- Arizona State University, The Biodesign Institute and Department of Applied Biological Sciences, 7001 E. Williams Field Road, Mesa, AZ 85212, USA
| |
Collapse
|
24
|
Zhou S, Bechner MC, Place M, Churas CP, Pape L, Leong SA, Runnheim R, Forrest DK, Goldstein S, Livny M, Schwartz DC. Validation of rice genome sequence by optical mapping. BMC Genomics 2007; 8:278. [PMID: 17697381 PMCID: PMC2048515 DOI: 10.1186/1471-2164-8-278] [Citation(s) in RCA: 103] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2007] [Accepted: 08/15/2007] [Indexed: 11/30/2022] Open
Abstract
Background Rice feeds much of the world, and possesses the simplest genome analyzed to date within the grass family, making it an economically relevant model system for other cereal crops. Although the rice genome is sequenced, validation and gap closing efforts require purely independent means for accurate finishing of sequence build data. Results To facilitate ongoing sequencing finishing and validation efforts, we have constructed a whole-genome SwaI optical restriction map of the rice genome. The physical map consists of 14 contigs, covering 12 chromosomes, with a total genome size of 382.17 Mb; this value is about 11% smaller than original estimates. 9 of the 14 optical map contigs are without gaps, covering chromosomes 1, 2, 3, 4, 5, 7, 8 10, and 12 in their entirety – including centromeres and telomeres. Alignments between optical and in silico restriction maps constructed from IRGSP (International Rice Genome Sequencing Project) and TIGR (The Institute for Genomic Research) genome sequence sources are comprehensive and informative, evidenced by map coverage across virtually all published gaps, discovery of new ones, and characterization of sequence misassemblies; all totalling ~14 Mb. Furthermore, since optical maps are ordered restriction maps, identified discordances are pinpointed on a reliable physical scaffold providing an independent resource for closure of gaps and rectification of misassemblies. Conclusion Analysis of sequence and optical mapping data effectively validates genome sequence assemblies constructed from large, repeat-rich genomes. Given this conclusion we envision new applications of such single molecule analysis that will merge advantages offered by high-resolution optical maps with inexpensive, but short sequence reads generated by emerging sequencing platforms. Lastly, map construction techniques presented here points the way to new types of comparative genome analysis that would focus on discernment of structural differences revealed by optical maps constructed from a broad range of rice subspecies and varieties.
Collapse
Affiliation(s)
- Shiguo Zhou
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, UW Biotechnology Centre, 425 Henry Mall, Madison, Wisconsin 53706, USA
- Department of Chemistry, Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Michael C Bechner
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, UW Biotechnology Centre, 425 Henry Mall, Madison, Wisconsin 53706, USA
- Department of Chemistry, Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Michael Place
- Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Chris P Churas
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, UW Biotechnology Centre, 425 Henry Mall, Madison, Wisconsin 53706, USA
- Department of Chemistry, Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Louise Pape
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, UW Biotechnology Centre, 425 Henry Mall, Madison, Wisconsin 53706, USA
- Department of Chemistry, Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Sally A Leong
- USDA-ARS, CCRU, Department of Plant Pathology, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Rod Runnheim
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, UW Biotechnology Centre, 425 Henry Mall, Madison, Wisconsin 53706, USA
- Department of Chemistry, Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Dan K Forrest
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, UW Biotechnology Centre, 425 Henry Mall, Madison, Wisconsin 53706, USA
- Department of Chemistry, Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Steve Goldstein
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, UW Biotechnology Centre, 425 Henry Mall, Madison, Wisconsin 53706, USA
- Department of Chemistry, Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Miron Livny
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - David C Schwartz
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, UW Biotechnology Centre, 425 Henry Mall, Madison, Wisconsin 53706, USA
- Department of Chemistry, Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
- Laboratory of Genetics; University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
25
|
Takebe S, Witola WH, Schimanski B, Günzl A, Ben Mamoun C. Purification of components of the translation elongation factor complex of Plasmodium falciparum by tandem affinity purification. EUKARYOTIC CELL 2007; 6:584-91. [PMID: 17307963 PMCID: PMC1865644 DOI: 10.1128/ec.00376-06] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/24/2006] [Accepted: 02/07/2007] [Indexed: 11/20/2022]
Abstract
Plasmodium falciparum is the causative agent of severe human malaria, responsible for over 2 million deaths annually. Of the 5,300 polypeptides predicted to control the parasite life cycle in mosquitoes and humans, 60% are of unknown function. A major challenge of malaria postgenomic biology is to understand how the 5,300 predicted proteins coexist and interact to perform the essential tasks that define the complex life cycle of the parasite. One approach to assign function to these proteins is by identifying their physiological partners. Here we describe the use of tandem affinity purification (TAP) and mass spectrometry for identification of native protein interactions and purification of protein complexes in P. falciparum. Transgenic parasites were generated which express the translation elongation factor PfEF-1beta harboring a C-terminal PTP tag which consists of the protein C epitope, a tobacco etch virus protease cleavage site, and two protein A domains. Purification of PfEF-1beta-PTP from crude extracts followed by mass spectrometric analysis revealed, in addition to the tagged protein itself, the presence of the native PfEF-1beta, the G-protein PfEF-1alpha, and two new proteins that we named PfEF-1gamma and PfEF-1delta based on their homology to other eukaryotic gamma and delta translation elongation factor subunits. These data, which constitute the first application of TAP for purification of a protein complex under native conditions in P. falciparum, revealed that the translation elongation complex in this organism contains at least two subunits of PfEF-1beta. The success of this approach will set the stage for a systematic analysis of protein interactions in this important human pathogen.
Collapse
Affiliation(s)
- Sachiko Takebe
- Department of Genetics and Developmental Biology, University of Connecticut Health Center, Farmington, Connecticut 06030-3301, USA
| | | | | | | | | |
Collapse
|
26
|
Xiao M, Phong A, Ha C, Chan TF, Cai D, Leung L, Wan E, Kistler AL, DeRisi JL, Selvin PR, Kwok PY. Rapid DNA mapping by fluorescent single molecule detection. Nucleic Acids Res 2006; 35:e16. [PMID: 17175538 PMCID: PMC1807959 DOI: 10.1093/nar/gkl1044] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
DNA mapping is an important analytical tool in genomic sequencing, medical diagnostics and pathogen identification. Here we report an optical DNA mapping strategy based on direct imaging of individual DNA molecules and localization of multiple sequence motifs on the molecules. Individual genomic DNA molecules were labeled with fluorescent dyes at specific sequence motifs by the action of nicking endonuclease followed by the incorporation of dye terminators with DNA polymerase. The labeled DNA molecules were then stretched into linear form on a modified glass surface and imaged using total internal reflection fluorescence (TIRF) microscopy. By determining the positions of the fluorescent labels with respect to the DNA backbone, the distribution of the sequence motif recognized by the nicking endonuclease can be established with good accuracy, in a manner similar to reading a barcode. With this approach, we constructed a specific sequence motif map of lambda-DNA. We further demonstrated the capability of this approach to rapidly type a human adenovirus and several strains of human rhinovirus.
Collapse
Affiliation(s)
- Ming Xiao
- Cardiovascular Research Institute and Center for Human Genetics, University of CaliforniaSan Francisco, CA 94115, USA
- To whom correspondence should be addressed at: 513, Parnassus Avenue, HSW-901A, San Francisco, CA 94143, USA. Tel: +1 41 551 43876; Fax: +1 41 547 62956;
| | - Angie Phong
- Cardiovascular Research Institute and Center for Human Genetics, University of CaliforniaSan Francisco, CA 94115, USA
| | - Connie Ha
- Cardiovascular Research Institute and Center for Human Genetics, University of CaliforniaSan Francisco, CA 94115, USA
| | - Ting-Fung Chan
- Cardiovascular Research Institute and Center for Human Genetics, University of CaliforniaSan Francisco, CA 94115, USA
| | - Dongmei Cai
- Cardiovascular Research Institute and Center for Human Genetics, University of CaliforniaSan Francisco, CA 94115, USA
| | - Lucinda Leung
- Cardiovascular Research Institute and Center for Human Genetics, University of CaliforniaSan Francisco, CA 94115, USA
| | - Eunice Wan
- Cardiovascular Research Institute and Center for Human Genetics, University of CaliforniaSan Francisco, CA 94115, USA
| | - Amy L. Kistler
- Department of Biochemistry and Biophysics, University of CaliforniaSan Francisco, CA 94115, USA
| | - Joseph L. DeRisi
- Department of Biochemistry and Biophysics, University of CaliforniaSan Francisco, CA 94115, USA
| | - Paul R. Selvin
- Department of Physics and Center of Biophysics, University of Illinois at Urbana-ChampaignUrbana, IL 61801, USA
| | - Pui-Yan Kwok
- Cardiovascular Research Institute and Center for Human Genetics, University of CaliforniaSan Francisco, CA 94115, USA
- Department of Dermatology, University of CaliforniaSan Francisco, CA 94115, USA
| |
Collapse
|
27
|
Wu T, Schwartz DC. Transchip: single-molecule detection of transcriptional elongation complexes. Anal Biochem 2006; 361:31-46. [PMID: 17187751 PMCID: PMC1945215 DOI: 10.1016/j.ab.2006.10.042] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2006] [Revised: 10/30/2006] [Accepted: 10/30/2006] [Indexed: 11/24/2022]
Abstract
A new single-molecule system, Transchip, was developed for analysis of transcription products at their genomic origins. The bacteriophage T7 RNA polymerase and its promoters were used in a model system, and resultant RNAs were imaged and detected at their positions along single template DNA molecules. The Transchip system has drawn from critical aspects of Optical Mapping, a single-molecule system that enables the construction of high-resolution ordered restriction maps of whole genomes from single DNA molecules. Through statistical analysis of hundreds of single-molecule template/transcript complexes, Transchip enables analysis of the locations and strength of promoters, the direction and processivity of transcription reactions, and the termination of transcription. These novel results suggest that the new system may serve as a high-throughput platform to investigate transcriptional events on a large genome-wide scale.
Collapse
Affiliation(s)
- Tian Wu
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | | |
Collapse
|
28
|
Chen Q, Savarino SJ, Venkatesan MM. Subtractive hybridization and optical mapping of the enterotoxigenic Escherichia coli H10407 chromosome: isolation of unique sequences and demonstration of significant similarity to the chromosome of E. coli K-12. MICROBIOLOGY-SGM 2006; 152:1041-1054. [PMID: 16549668 DOI: 10.1099/mic.0.28648-0] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Enterotoxigenic Escherichia coli (ETEC) is a primary cause of diarrhoea in infants in developing countries and in travellers to endemic regions. While several virulence genes have been identified on ETEC plasmids, little is known about the ETEC chromosome, although it is expected to share significant homology in backbone sequences with E. coli K-12. In the absence of genomic sequence information, the subtractive hybridization method and the more recently described optical mapping technique were carried out to determine the degree of genomic variation between virulent ETEC strain H10407 and the non-pathogenic E. coli K-12 strain MG1655. In one round of PCR-based suppression subtractive hybridization, 153 fragments representing sequences unique to strain H10407 were identified. blast searches indicated that few unique sequences showed homology to known pathogenicity island genes identified in related E. coli pathogens. A total of 65 fragments contained sequences that were either linked to hypothetical proteins or showed no homology to any known sequence in the database. The remaining sequences were either phage or prophage related or displayed homology to classifiable genes that function in various aspects of bacterial metabolism. The 153 unique sequences showed variable distribution across different ETEC strains including ETEC strain B7A, which is attenuated in virulence and lacked several H10407-specific sequences. Restriction-enzyme-based optical maps of strain H10407 were compared to in silico restriction maps of strain MG1655 and related E. coli pathogens. The 5.1 Mb ETEC chromosome was approximately 500 kb greater in length than the chromosome of E. coli K-12, collinear with it and indicated several discrete regions where insertions and/or deletions had occurred relative to the chromosome of strain MG1655. No major inversions, transpositions or gross rearrangements were observed on the ETEC chromosome. Based on comparisons with known genomic sequences and related optical-map-based restriction site similarity, the sequence of the H10407 chromosome is expected to demonstrate approximately 96 % identity with that of E. coli K-12.
Collapse
Affiliation(s)
- Qing Chen
- Department of Enteric Infections, Division of Communicable Diseases and Immunology, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| | - Stephen J Savarino
- Enteric Diseases Department, Naval Medical Research Center, Silver Spring, MD, USA
| | - Malabi M Venkatesan
- Department of Enteric Infections, Division of Communicable Diseases and Immunology, Walter Reed Army Institute of Research, Silver Spring, MD, USA
| |
Collapse
|
29
|
Coppel RL, Black CG. Parasite genomes. Int J Parasitol 2005; 35:465-79. [PMID: 15826640 DOI: 10.1016/j.ijpara.2005.01.010] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2005] [Revised: 02/24/2005] [Accepted: 02/24/2005] [Indexed: 01/01/2023]
Abstract
The availability of genome sequences and the associated transcriptome and proteome mapping projects has revolutionised research in the field of parasitology. As more parasite species are sequenced, comparative and phylogenetic comparisons are improving the quality of gene prediction and annotation. Genome sequences of parasites are also providing important data sets for understanding parasite biology and identifying new vaccine candidates and drug targets. We review some of the preliminary conclusions from examination of parasite genome sequences and discuss some of the bioinformatics approaches taken in this analysis.
Collapse
Affiliation(s)
- Ross L Coppel
- Department of Microbiology and the Victorian Bioinformatics Consortium, Monash University, Melbourne, Vic. 3800, Australia.
| | | |
Collapse
|
30
|
Reslewic S, Zhou S, Place M, Zhang Y, Briska A, Goldstein S, Churas C, Runnheim R, Forrest D, Lim A, Lapidus A, Han CS, Roberts GP, Schwartz DC. Whole-genome shotgun optical mapping of Rhodospirillum rubrum. Appl Environ Microbiol 2005; 71:5511-22. [PMID: 16151144 PMCID: PMC1214604 DOI: 10.1128/aem.71.9.5511-5522.2005] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2005] [Accepted: 04/11/2005] [Indexed: 11/20/2022] Open
Abstract
Rhodospirillum rubrum is a phototrophic purple nonsulfur bacterium known for its unique and well-studied nitrogen fixation and carbon monoxide oxidation systems and as a source of hydrogen and biodegradable plastic production. To better understand this organism and to facilitate assembly of its sequence, three whole-genome restriction endonuclease maps (XbaI, NheI, and HindIII) of R. rubrum strain ATCC 11170 were created by optical mapping. Optical mapping is a system for creating whole-genome ordered restriction endonuclease maps from randomly sheared genomic DNA molecules extracted from cells. During the sequence finishing process, all three optical maps confirmed a putative error in sequence assembly, while the HindIII map acted as a scaffold for high-resolution alignment with sequence contigs spanning the whole genome. In addition to highlighting optical mapping's role in the assembly and confirmation of genome sequence, this work underscores the unique niche in resolution occupied by the optical mapping system. With a resolution ranging from 6.5 kb (previously published) to 45 kb (reported here), optical mapping advances a "molecular cytogenetics" approach to solving problems in genomic analysis.
Collapse
Affiliation(s)
- Susan Reslewic
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, UW-Biotechnology Center, 425 Henry Mall, Madison, WI 53706, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Ramanathan A, Huff EJ, Lamers CC, Potamousis KD, Forrest DK, Schwartz DC. An integrative approach for the optical sequencing of single DNA molecules. Anal Biochem 2005; 330:227-41. [PMID: 15203328 DOI: 10.1016/j.ab.2004.03.029] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2003] [Indexed: 11/18/2022]
Abstract
A new approach for optically sequencing ensembles of single DNA molecules using DNA polymerase to mediate the consecutive incorporation of fluorochrome-labeled nucleotides into an array of large single DNA molecules is presented. The approach utilizes cycles of labeled fluorochrome addition, detection to count incorporations, and bleaching to reset the counter. These additions are imaged and analyzed to estimate the number of labeled additions and to correlate them on a per-locus basis along DNA backbones. Initial studies used precisely labeled polymerase chain reaction products to aid the development and validation of simple models of fluorochrome point spread functions within the imaging system. In complementary studies, nucleotides labeled with the fluorochrome R110 were incorporated into surface-elongated lambda DNA, and fluorescent signals corresponding to the addition of R110-dUTP were counted and assigned precise loci along DNA backbones. The labeled DNAs were then subjected to photobleaching and to a second cycle of addition of R110-labeled nucleotides-a second round of additions was correlated with the first to establish strings of addition histories among the ensemble of largely double-stranded templates. These results confirm the basic operational validity of this approach and point the way to the development of a practical system for optical sequencing.
Collapse
Affiliation(s)
- Arvind Ramanathan
- Laboratory for Molecular and Computational Genomics, Department of Chemistry, Laboratory of Genetics, University of Wisconsin-Madison, UW-Biotechnology Center, 425 Henry Mall, Madison, WI 53706, USA
| | | | | | | | | | | |
Collapse
|
32
|
Ferris MM, Yoshida TM, Marrone BL, Keller RA. Fingerprinting of single viral genomes. Anal Biochem 2005; 337:278-88. [PMID: 15691508 DOI: 10.1016/j.ab.2004.10.050] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2004] [Indexed: 11/30/2022]
Abstract
We demonstrate the use of technology developed for optical mapping to acquire DNA fingerprints from single genomes for the purpose of discrimination and identification of bacteria and viruses. Single genome fingerprinting (SGF) provides not only the size but also the order of the restriction fragments, which adds another dimension to the information that can be used for discrimination. Analysis of single organisms may eliminate the need to culture cells and thereby significantly reduce analysis time. In addition, samples containing mixtures of several organisms can be analyzed. For analysis, cells are embedded in an agarose matrix, lysed, and processed to yield intact DNA. The DNA is then deposited on a derivatized glass substrate. The elongated genome is digested with a restriction enzyme and stained with the intercalating dye YOYO-1. DNA is then quantitatively imaged with a fluorescence microscope and the fragments are sized to an accuracy >or=90% by their fluorescence intensity and contour length. Single genome fingerprints were obtained from pure samples of adenovirus, from bacteriophages lambda and T4 GT7, and from a mixture of the three viral genomes. SGF will enable the fingerprinting of uncultured and unamplified samples and allow rapid identification of microorganisms with applications in forensics, medicine, public health, and environmental microbiology.
Collapse
Affiliation(s)
- Matthew M Ferris
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| | | | | | | |
Collapse
|
33
|
Zhou S, Kile A, Bechner M, Place M, Kvikstad E, Deng W, Wei J, Severin J, Runnheim R, Churas C, Forrest D, Dimalanta ET, Lamers C, Burland V, Blattner FR, Schwartz DC. Single-molecule approach to bacterial genomic comparisons via optical mapping. J Bacteriol 2004; 186:7773-82. [PMID: 15516592 PMCID: PMC524920 DOI: 10.1128/jb.186.22.7773-7782.2004] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Modern comparative genomics has been established, in part, by the sequencing and annotation of a broad range of microbial species. To gain further insights, new sequencing efforts are now dealing with the variety of strains or isolates that gives a species definition and range; however, this number vastly outstrips our ability to sequence them. Given the availability of a large number of microbial species, new whole genome approaches must be developed to fully leverage this information at the level of strain diversity that maximize discovery. Here, we describe how optical mapping, a single-molecule system, was used to identify and annotate chromosomal alterations between bacterial strains represented by several species. Since whole-genome optical maps are ordered restriction maps, sequenced strains of Shigella flexneri serotype 2a (2457T and 301), Yersinia pestis (CO 92 and KIM), and Escherichia coli were aligned as maps to identify regions of homology and to further characterize them as possible insertions, deletions, inversions, or translocations. Importantly, an unsequenced Shigella flexneri strain (serotype Y strain AMC[328Y]) was optically mapped and aligned with two sequenced ones to reveal one novel locus implicated in serotype conversion and several other loci containing insertion sequence elements or phage-related gene insertions. Our results suggest that genomic rearrangements and chromosomal breakpoints are readily identified and annotated against a prototypic sequenced strain by using the tools of optical mapping.
Collapse
Affiliation(s)
- Shiguo Zhou
- Laboratory for Molecular and Computation Genomics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Abstract
The concept behind the first Molecular Approaches to Malaria meeting, held 1-5 February 2000 in Lorne, Australia, was ahead of its time; to convene a meeting of malaria researchers, database developers and genomics scientists, and to discuss how genomic sciences and their relevant disciplines could be applied to solve important problems in malaria research. The success of the second Molecular Approaches to Malaria meeting, held 1-5 February 2004 in the same place, together with the influence of genomics on malaria research, is testament to the vision that the organizers had at the first meeting. This review attempts to capture some of the current efforts in the post-genomics era of malaria research and highlights the approaches discussed at the Molecular Approaches to Malaria 2004 meeting.
Collapse
Affiliation(s)
- Daniel J Carucci
- Grand Challenges in Global Health Initiative, Foundation for the National Institutes of Health, 45 Center Drive (3AN-44), Bethesda, MD 20892-460, USA.
| |
Collapse
|
35
|
Abstract
Malaria research is now dominated by information flowing from the genome sequencing projects and the associated transcriptome- and proteome-mapping projects. As more species are sequenced, comparative and phylogenetic comparisons are improving the quality of gene finding, and are providing various approaches to the identification of genes important to parasite biology and the pathogenesis of disease. We are still in the early days of exploiting these data in a systematic way and the sheer volume of data presents daunting challenges. This article reviews the progress in using this genomic information and discusses opportunities for other approaches.
Collapse
Affiliation(s)
- Ross L Coppel
- Department of Microbiology, Monash University, Clayton, Victoria 3800, Australia.
| | | | | |
Collapse
|
36
|
Finishing the euchromatic sequence of the human genome. Nature 2004; 431:931-45. [PMID: 15496913 DOI: 10.1038/nature03001] [Citation(s) in RCA: 2797] [Impact Index Per Article: 139.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2004] [Accepted: 09/07/2004] [Indexed: 12/13/2022]
Abstract
The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers approximately 99% of the euchromatic genome and is accurate to an error rate of approximately 1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human genome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead.
Collapse
|
37
|
Zhou S, Kile A, Kvikstad E, Bechner M, Severin J, Forrest D, Runnheim R, Churas C, Anantharaman TS, Myler P, Vogt C, Ivens A, Stuart K, Schwartz DC. Shotgun optical mapping of the entire Leishmania major Friedlin genome. Mol Biochem Parasitol 2004; 138:97-106. [PMID: 15500921 DOI: 10.1016/j.molbiopara.2004.08.002] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2004] [Accepted: 08/02/2004] [Indexed: 11/21/2022]
Abstract
Leishmania is a group of protozoan parasites which causes a broad spectrum of diseases resulting in widespread human suffering and death, as well as economic loss from the infection of some domestic animals and wildlife. To further understand the fundamental genomic architecture of this parasite, and to accelerate the on-going sequencing project, a whole-genome XbaI restriction map was constructed using the optical mapping system. This map supplemented traditional physical maps that were generated by fingerprinting and hybridization of cosmid and P1 clone libraries. Thirty-six optical map contigs were constructed for the corresponding known 36 chromosomes of the Leishmania major Friedlin genome. The chromosome sizes ranged from 326.9 to 2821.3 kb, with a total genome size of 34.7 Mb; the average XbaI restriction fragment was 25.3 kb, and ranged from 15.7 to 77.8 kb on a per chromosomes basis. Comparison between the optical maps and the in silico maps of sequence drawn from completed, nearly finished, or large sequence contigs showed that optical maps served several useful functions within the path to create finished sequence by: guiding aspects of the sequence assembly, identifying misassemblies, detection of cosmid or PAC clones misplacements to chromosomes, and validation of sequence stemming from varying degrees of finishing. Our results also showed the potential use of optical maps as a means to detect and characterize map segmental duplication within genomes.
Collapse
Affiliation(s)
- Shiguo Zhou
- Laboratory for Molecular and Computational Genomics, UW Biotechnology Center, University of Wisconsin-Madison, 425 Henry Mall, Madison, WI 53706, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
38
|
Qiu D, Fujita K, Sakuma Y, Tanaka T, Ohashi Y, Ohshima H, Tomita M, Itaya M. Comparative analysis of physical maps of four Bacillus subtilis (natto) genomes. Appl Environ Microbiol 2004; 70:6247-56. [PMID: 15466572 PMCID: PMC522138 DOI: 10.1128/aem.70.10.6247-6256.2004] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2003] [Accepted: 06/10/2004] [Indexed: 11/20/2022] Open
Abstract
The complete SfiI and I-CeuI physical maps of four Bacillus subtilis (natto) strains, which were previously isolated as natto (fermented soybean) starters, were constructed to elucidate the genome structure. Not only the similarity in genome size and organization but also the microheterogeneity of the gene context was revealed. No large-scale genome rearrangements among the four strains were indicated by mapping of the genes, including 10 rRNA operons (rrn) and relevant genes required for natto production, to the loci corresponding to those of the B. subtilis strain Marburg 168. However, restriction fragment length polymorphism and the presence or absence of strain-specific DNA sequences, such as the prophages SP beta, skin element, and PBSX, as well as the insertion element IS4Bsu1, could be used to identify one of these strains as a Marburg type and the other three strains as natto types. The genome structure and gene heterogeneity were also consistent with the type of indigenous plasmids harbored by the strains.
Collapse
Affiliation(s)
- Dongru Qiu
- Institute for Advanced Biosciences and Bioinformatics Program, Keio University, 403-1 Nipponkoku, Daihoji, Tsuruoka, Yamagata 997-0017, Japan
| | | | | | | | | | | | | | | |
Collapse
|
39
|
Abstract
The sequencing of eukaryotic genomes has lagged behind sequencing of organisms in the other domains of life, archae and bacteria, primarily due to their greater size and complexity. With recent advances in high-throughput technologies such as robotics and improved computational resources, the number of eukaryotic genome sequencing projects has increased significantly. Among these are a number of sequencing projects of tropical pathogens of medical and veterinary importance, many of which are responsible for causing widespread morbidity and mortality in peoples of developing countries. Uncovering the complete gene complement of these organisms is proving to be of immense value in the development of novel methods of parasite control, such as antiparasitic drugs and vaccines, as well as the development of new diagnostic tools. Combining pathogen genome sequences with the host and vector genome sequences is promising to be a robust method for the identification of host-pathogen interactions. Finally, comparative sequencing of related species, especially of organisms used as model systems in the study of the disease, is beginning to realize its potential in the identification of genes, and the evolutionary forces that shape the genes, that are involved in evasion of the host immune response.
Collapse
Affiliation(s)
- Jane M Carlton
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA.
| |
Collapse
|
40
|
Mishra B, Daruwala RS, Zhou Y, Ugel N, Policriti A, Antoniotti M, Paxia S, Rejali M, Rudra A, Cherepinsky V, Silver N, Casey W, Piazza C, Simeoni M, Barbano P, Spivak M, Feng J, Gill O, Venkatesh M, Cheng F, Sun B, Ioniata I, Anantharaman T, Hubbard EJA, Pnueli A, Harel D, Chandru V, Hariharan R, Wigler M, Park F, Lin SC, Lazebnik Y, Winkler F, Cantor CR, Carbone A, Gromov M. A sense of life: computational and experimental investigations with models of biochemical and evolutionary processes. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2003; 7:253-68. [PMID: 14583115 DOI: 10.1089/153623103322452387] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
We collaborate in a research program aimed at creating a rigorous framework, experimental infrastructure, and computational environment for understanding, experimenting with, manipulating, and modifying a diverse set of fundamental biological processes at multiple scales and spatio-temporal modes. The novelty of our research is based on an approach that (i) requires coevolution of experimental science and theoretical techniques and (ii) exploits a certain universality in biology guided by a parsimonious model of evolutionary mechanisms operating at the genomic level and manifesting at the proteomic, transcriptomic, phylogenic, and other higher levels. Our current program in "systems biology" endeavors to marry large-scale biological experiments with the tools to ponder and reason about large, complex, and subtle natural systems. To achieve this ambitious goal, ideas and concepts are combined from many different fields: biological experimentation, applied mathematical modeling, computational reasoning schemes, and large-scale numerical and symbolic simulations. From a biological viewpoint, the basic issues are many: (i) understanding common and shared structural motifs among biological processes; (ii) modeling biological noise due to interactions among a small number of key molecules or loss of synchrony; (iii) explaining the robustness of these systems in spite of such noise; and (iv) cataloging multistatic behavior and adaptation exhibited by many biological processes.
Collapse
Affiliation(s)
- Bud Mishra
- Department of Computer Science and Mathematics, Courant Institute of Mathematical Sciences, New York University, New York, New York, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Zhou S, Kvikstad E, Kile A, Severin J, Forrest D, Runnheim R, Churas C, Hickman JW, Mackenzie C, Choudhary M, Donohue T, Kaplan S, Schwartz DC. Whole-genome shotgun optical mapping of Rhodobacter sphaeroides strain 2.4.1 and its use for whole-genome shotgun sequence assembly. Genome Res 2003; 13:2142-51. [PMID: 12952882 PMCID: PMC403714 DOI: 10.1101/gr.1128803] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2002] [Accepted: 06/30/2003] [Indexed: 11/24/2022]
Abstract
Rhodobacter sphaeroides 2.4.1 is a facultative photoheterotrophic bacterium with tremendous metabolic diversity, which has significantly contributed to our understanding of the molecular genetics of photosynthesis, photoheterotrophy, nitrogen fixation, hydrogen metabolism, carbon dioxide fixation, taxis, and tetrapyrrole biosynthesis. To further understand this remarkable bacterium, and to accelerate an ongoing sequencing project, two whole-genome restriction maps (EcoRI and HindIII) of R. sphaeroides strain 2.4.1 were constructed using shotgun optical mapping. The approach directly mapped genomic DNA by the random mapping of single molecules. The two maps were used to facilitate sequence assembly by providing an optical scaffold for high-resolution alignment and verification of sequence contigs. Our results show that such maps facilitated the closure of sequence gaps by the early detection of nascent sequence contigs during the course of the whole-genome shotgun sequencing process.
Collapse
Affiliation(s)
- Shiguo Zhou
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, UW Biotechnology Center, Madison, Wisconsin 53706, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
42
|
Zhou S, Deng W, Anantharaman TS, Lim A, Dimalanta ET, Wang J, Wu T, Chunhong T, Creighton R, Kile A, Kvikstad E, Bechner M, Yen G, Garic-Stankovic A, Severin J, Forrest D, Runnheim R, Churas C, Lamers C, Perna NT, Burland V, Blattner FR, Mishra B, Schwartz DC. A whole-genome shotgun optical map of Yersinia pestis strain KIM. Appl Environ Microbiol 2002; 68:6321-31. [PMID: 12450857 PMCID: PMC134435 DOI: 10.1128/aem.68.12.6321-6331.2002] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2002] [Accepted: 09/12/2002] [Indexed: 11/20/2022] Open
Abstract
Yersinia pestis is the causative agent of the bubonic, septicemic, and pneumonic plagues (also known as black death) and has been responsible for recurrent devastating pandemics throughout history. To further understand this virulent bacterium and to accelerate an ongoing sequencing project, two whole-genome restriction maps (XhoI and PvuII) of Y. pestis strain KIM were constructed using shotgun optical mapping. This approach constructs ordered restriction maps from randomly sheared individual DNA molecules directly extracted from cells. The two maps served different purposes; the XhoI map facilitated sequence assembly by providing a scaffold for high-resolution alignment, while the PvuII map verified genome sequence assembly. Our results show that such maps facilitated the closure of sequence gaps and, most importantly, provided a purely independent means for sequence validation. Given the recent advancements to the optical mapping system, increased resolution and throughput are enabling such maps to guide sequence assembly at a very early stage of a microbial sequencing project.
Collapse
Affiliation(s)
- Shiguo Zhou
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, 53706, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Hall N, Pain A, Berriman M, Churcher C, Harris B, Harris D, Mungall K, Bowman S, Atkin R, Baker S, Barron A, Brooks K, Buckee CO, Burrows C, Cherevach I, Chillingworth C, Chillingworth T, Christodoulou Z, Clark L, Clark R, Corton C, Cronin A, Davies R, Davis P, Dear P, Dearden F, Doggett J, Feltwell T, Goble A, Goodhead I, Gwilliam R, Hamlin N, Hance Z, Harper D, Hauser H, Hornsby T, Holroyd S, Horrocks P, Humphray S, Jagels K, James KD, Johnson D, Kerhornou A, Knights A, Konfortov B, Kyes S, Larke N, Lawson D, Lennard N, Line A, Maddison M, McLean J, Mooney P, Moule S, Murphy L, Oliver K, Ormond D, Price C, Quail MA, Rabbinowitsch E, Rajandream MA, Rutter S, Rutherford KM, Sanders M, Simmonds M, Seeger K, Sharp S, Smith R, Squares R, Squares S, Stevens K, Taylor K, Tivey A, Unwin L, Whitehead S, Woodward J, Sulston JE, Craig A, Newbold C, Barrell BG. Sequence of Plasmodium falciparum chromosomes 1, 3-9 and 13. Nature 2002; 419:527-31. [PMID: 12368867 DOI: 10.1038/nature01095] [Citation(s) in RCA: 128] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2002] [Accepted: 09/02/2002] [Indexed: 02/07/2023]
Abstract
Since the sequencing of the first two chromosomes of the malaria parasite, Plasmodium falciparum, there has been a concerted effort to sequence and assemble the entire genome of this organism. Here we report the sequence of chromosomes 1, 3-9 and 13 of P. falciparum clone 3D7--these chromosomes account for approximately 55% of the total genome. We describe the methods used to map, sequence and annotate these chromosomes. By comparing our assemblies with the optical map, we indicate the completeness of the resulting sequence. During annotation, we assign Gene Ontology terms to the predicted gene products, and observe clustering of some malaria-specific terms to specific chromosomes. We identify a highly conserved sequence element found in the intergenic region of internal var genes that is not associated with their telomeric counterparts.
Collapse
Affiliation(s)
- N Hall
- The Wellcome Trust Sanger Institute, The Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
44
|
Gardner MJ, Hall N, Fung E, White O, Berriman M, Hyman RW, Carlton JM, Pain A, Nelson KE, Bowman S, Paulsen IT, James K, Eisen JA, Rutherford K, Salzberg SL, Craig A, Kyes S, Chan MS, Nene V, Shallom SJ, Suh B, Peterson J, Angiuoli S, Pertea M, Allen J, Selengut J, Haft D, Mather MW, Vaidya AB, Martin DMA, Fairlamb AH, Fraunholz MJ, Roos DS, Ralph SA, McFadden GI, Cummings LM, Subramanian GM, Mungall C, Venter JC, Carucci DJ, Hoffman SL, Newbold C, Davis RW, Fraser CM, Barrell B. Genome sequence of the human malaria parasite Plasmodium falciparum. Nature 2002; 419:498-511. [PMID: 12368864 PMCID: PMC3836256 DOI: 10.1038/nature01097] [Citation(s) in RCA: 3076] [Impact Index Per Article: 139.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2002] [Accepted: 09/02/2002] [Indexed: 11/08/2022]
Abstract
The parasite Plasmodium falciparum is responsible for hundreds of millions of cases of malaria, and kills more than one million African children annually. Here we report an analysis of the genome sequence of P. falciparum clone 3D7. The 23-megabase nuclear genome consists of 14 chromosomes, encodes about 5,300 genes, and is the most (A + T)-rich genome sequenced to date. Genes involved in antigenic variation are concentrated in the subtelomeric regions of the chromosomes. Compared to the genomes of free-living eukaryotic microbes, the genome of this intracellular parasite encodes fewer enzymes and transporters, but a large proportion of genes are devoted to immune evasion and host-parasite interactions. Many nuclear-encoded proteins are targeted to the apicoplast, an organelle involved in fatty-acid and isoprenoid metabolism. The genome sequence provides the foundation for future studies of this organism, and is being exploited in the search for new drugs and vaccines to fight malaria.
Collapse
Affiliation(s)
- Malcolm J Gardner
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, Maryland 20850, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Gardner MJ, Shallom SJ, Carlton JM, Salzberg SL, Nene V, Shoaibi A, Ciecko A, Lynn J, Rizzo M, Weaver B, Jarrahi B, Brenner M, Parvizi B, Tallon L, Moazzez A, Granger D, Fujii C, Hansen C, Pederson J, Feldblyum T, Peterson J, Suh B, Angiuoli S, Pertea M, Allen J, Selengut J, White O, Cummings LM, Smith HO, Adams MD, Venter JC, Carucci DJ, Hoffman SL, Fraser CM. Sequence of Plasmodium falciparum chromosomes 2, 10, 11 and 14. Nature 2002; 419:531-4. [PMID: 12368868 DOI: 10.1038/nature01094] [Citation(s) in RCA: 134] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2002] [Accepted: 09/02/2002] [Indexed: 11/09/2022]
Abstract
The mosquito-borne malaria parasite Plasmodium falciparum kills an estimated 0.7-2.7 million people every year, primarily children in sub-Saharan Africa. Without effective interventions, a variety of factors-including the spread of parasites resistant to antimalarial drugs and the increasing insecticide resistance of mosquitoes-may cause the number of malaria cases to double over the next two decades. To stimulate basic research and facilitate the development of new drugs and vaccines, the genome of Plasmodium falciparum clone 3D7 has been sequenced using a chromosome-by-chromosome shotgun strategy. We report here the nucleotide sequences of chromosomes 10, 11 and 14, and a re-analysis of the chromosome 2 sequence. These chromosomes represent about 35% of the 23-megabase P. falciparum genome.
Collapse
Affiliation(s)
- Malcolm J Gardner
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, Maryland 20850, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
46
|
Abstract
High-resolution physical maps can be used as a scaffold for several subsequent studies, such as sequencing projects and positional cloning of disease genes and genetic elements that regulate gene expression. Here we describe a method for fast, high-resolution physical mapping on stretched DNA molecules, based on a combinatorial multi-FISH approach. Fluorescent labels are assigned to a binary code and probes are identified by a binary tag according to their labeling. To validate the approach, we have mapped eight probes covering a region of about 300 kb on human chromosome 11 with three hybridization assays. This approach enables one to determine the structural organization of a large region by means of the order of its clones, without ambiguities. The structure established in a control cell constitutes a reference for further studies, to detect rearrangements displayed by disease cells and to find differences shown by different cell types and organisms.
Collapse
Affiliation(s)
- Chiara Conti
- Laboratoire de Stabilité des Génomes, Département de Structure et Dynamique des Génomes, Institut Pasteur, 25, rue du Dr. Roux, Paris Cedex, France
| | | |
Collapse
|
47
|
Hoffman SL, Subramanian GM, Collins FH, Venter JC. Plasmodium, human and Anopheles genomics and malaria. Nature 2002; 415:702-9. [PMID: 11832959 DOI: 10.1038/415702a] [Citation(s) in RCA: 94] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The Plasmodium spp. parasites that cause malaria are transmitted to humans by Anopheles spp. mosquitoes. Scientists have now amassed a great body of knowledge about the parasite, its mosquito vector and human host. Yet this year there will be 300-500 million new malaria infections and 1-3 million deaths caused by the disease. We believe that integrated analyses of genome sequence, DNA polymorphisms, and messenger RNA and protein expression profiles will lead to greater understanding of the molecular basis of vector-human and host-parasite interactions and provide strategies to build upon these insights to develop interventions to mitigate human morbidity and mortality from malaria.
Collapse
Affiliation(s)
- Stephen L Hoffman
- Celera Genomics, 45 West Gude Drive, Rockville, Maryland 20850, USA.
| | | | | | | |
Collapse
|
48
|
Abstract
Almost 5 years ago, an international consortium of sequencing centers and funding agencies was formed to sequence the genome of the human malaria parasite Plasmodium falciparum. A novel chromosome by chromosome shotgun strategy was devised to sequence this very AT-rich genome. Two of the 14 chromosomes have been completed and the remaining chromosomes are in the final stages of gap closure. The consortium recently developed plans for the annotation and analysis of the complete genome sequence and its publication in 2002.
Collapse
Affiliation(s)
- M J Gardner
- The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA.
| |
Collapse
|
49
|
Weinel C, Tümmler B, Hilbert H, Nelson KE, Kiewitz C. General method of rapid Smith/Birnstiel mapping adds for gap closure in shotgun microbial genome sequencing projects: application to Pseudomonas putida KT2440. Nucleic Acids Res 2001; 29:E110. [PMID: 11713330 PMCID: PMC92575 DOI: 10.1093/nar/29.22.e110] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A physical mapping strategy has been developed to verify and accelerate the assembly and gap closure phase of a microbial genome shotgun-sequencing project. The protocol was worked out during the ongoing Pseudomonas putida KT2440 genome project. A macro-restriction map was constructed by linking probe hybridisation of SwaI- or I-CeuI-restricted chromosomes to serve as a backbone for the quick quality control of sequence and contig assemblies. The library of PCR-generated SwaI linking probes was derived from the sequence assembly after 3- and 6-fold genome coverage. In order to support gap closure in regions with ambiguous assemblies such as the repetitive sequence of the seven ribosomal operons, high-resolution Smith/Birnstiel maps were generated by Southern hybridisation of pulsed-field gel electrophoresis-separated rare-cutter complete/frequent-cutter partial digestions with rare-cutter fragment end probes. Overall 1.5 Mb of the 6.1 Mb P.putida KT2440 genome has been subjected to high-resolution physical mapping in order to align assemblies generated from shotgun sequencing.
Collapse
Affiliation(s)
- C Weinel
- Klinische Forschergruppe OE 6711, Medizinische Hochschule Hannover, Carl-Neuberg-Strasse 1, D-30625 Hannover, Germany.
| | | | | | | | | |
Collapse
|
50
|
Lim A, Dimalanta ET, Potamousis KD, Yen G, Apodoca J, Tao C, Lin J, Qi R, Skiadas J, Ramanathan A, Perna NT, Plunkett G, Burland V, Mau B, Hackett J, Blattner FR, Anantharaman TS, Mishra B, Schwartz DC. Shotgun optical maps of the whole Escherichia coli O157:H7 genome. Genome Res 2001; 11:1584-93. [PMID: 11544203 PMCID: PMC311123 DOI: 10.1101/gr.172101] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2000] [Accepted: 06/04/2001] [Indexed: 11/24/2022]
Abstract
We have constructed NheI and XhoI optical maps of Escherichia coli O157:H7 solely from genomic DNA molecules to provide a uniquely valuable scaffold for contig closure and sequence validation. E. coli O157:H7 is a common pathogen found in contaminated food and water. Our approach obviated the need for the analysis of clones, PCR products, and hybridizations, because maps were constructed from ensembles of single DNA molecules. Shotgun sequencing of bacterial genomes remains labor-intensive, despite advances in sequencing technology. This is partly due to manual intervention required during the last stages of finishing. The applicability of optical mapping to this problem was enhanced by advances in machine vision techniques that improved mapping throughput and created a path to full automation of mapping. Comparisons were made between maps and sequence data that characterized sequence gaps and guided nascent assemblies.
Collapse
Affiliation(s)
- A Lim
- Laboratory for Molecular and Computational Genomics, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|