1
|
Di L, Akther S, Bezrucenkovas E, Ivanova L, Sulkow B, Wu B, Mneimneh S, Gomes-Solecki M, Qiu WG. Maximum antigen diversification in a lyme bacterial population and evolutionary strategies to overcome pathogen diversity. ISME J 2022; 16:447-464. [PMID: 34413477 PMCID: PMC8376116 DOI: 10.1038/s41396-021-01089-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Revised: 08/04/2021] [Accepted: 08/09/2021] [Indexed: 12/03/2022]
Abstract
Natural populations of pathogens and their hosts are engaged in an arms race in which the pathogens diversify to escape host immunity while the hosts evolve novel immunity. This co-evolutionary process poses a fundamental challenge to the development of broadly effective vaccines and diagnostics against a diversifying pathogen. Based on surveys of natural allele frequencies and experimental immunization of mice, we show high antigenic specificities of natural variants of the outer surface protein C (OspC), a dominant antigen of a Lyme Disease-causing bacterium (Borrelia burgdorferi). To overcome the challenge of OspC antigenic diversity to clinical development of preventive measures, we implemented a number of evolution-informed strategies to broaden OspC antigenic reactivity. In particular, the centroid algorithm-a genetic algorithm to generate sequences that minimize amino-acid differences with natural variants-generated synthetic OspC analogs with the greatest promise as diagnostic and vaccine candidates against diverse Lyme pathogen strains co-existing in the Northeast United States. Mechanistically, we propose a model of maximum antigen diversification (MAD) mediated by amino-acid variations distributed across the hypervariable regions on the OspC molecule. Under the MAD hypothesis, evolutionary centroids display broad cross-reactivity by occupying the central void in the antigenic space excavated by diversifying natural variants. In contrast to vaccine designs based on concatenated epitopes, the evolutionary algorithms generate analogs of natural antigens and are automated. The novel centroid algorithm and the evolutionary antigen designs based on consensus and ancestral sequences have broad implications for combating diversifying pathogens driven by pathogen-host co-evolution.
Collapse
Affiliation(s)
- Lia Di
- Department of Biological Sciences, Hunter College, City University of New York, New York, NY, USA
| | - Saymon Akther
- Graduate Center, City University of New York, New York, NY, USA
| | - Edgaras Bezrucenkovas
- Department of Biological Sciences, Hunter College, City University of New York, New York, NY, USA
| | - Larisa Ivanova
- Department of Microbiology, Immunology and Biochemistry, University of Tennessee Health Science Center, Memphis, TN, USA
- Pediatrics Department, New York Medical College, Valhalla, NY, USA
| | - Brian Sulkow
- Department of Biological Sciences, Hunter College, City University of New York, New York, NY, USA
| | - Bing Wu
- Department of Biological Sciences, Hunter College, City University of New York, New York, NY, USA
| | - Saad Mneimneh
- Graduate Center, City University of New York, New York, NY, USA
- Department of Computer Science, Hunter College, City University of New York, New York, NY, USA
| | - Maria Gomes-Solecki
- Department of Microbiology, Immunology and Biochemistry, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Wei-Gang Qiu
- Department of Biological Sciences, Hunter College, City University of New York, New York, NY, USA.
- Graduate Center, City University of New York, New York, NY, USA.
- Department of Physiology and Biophysics & Institute for Computational Biomedicine, Weil Cornell Medical College, New York, NY, USA.
| |
Collapse
|
2
|
Schwartz I, Margos G, Casjens SR, Qiu WG, Eggers CH. Multipartite Genome of Lyme Disease Borrelia: Structure, Variation and Prophages. Curr Issues Mol Biol 2020; 42:409-454. [PMID: 33328355 DOI: 10.21775/cimb.042.409] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
All members of the Borrelia genus that have been examined harbour a linear chromosome that is about 900 kbp in length, as well as a plethora of both linear and circular plasmids in the 5-220 kbp size range. Genome sequences for 27 Lyme disease Borrelia isolates have been determined since the elucidation of the B. burgdorferi B31 genome sequence in 1997. The chromosomes, which carry the vast majority of the housekeeping genes, appear to be very constant in gene content and organization across all Lyme disease Borrelia species. The content of the plasmids, which carry most of the genes that encode the differentially expressed surface proteins that interact with the spirochete's arthropod and vertebrate hosts, is much more variable. Lyme disease Borrelia isolates carry between 7-21 different plasmids, ranging in size from 5-84 kbp. All strains analyzed to date harbor three plasmids, cp26, lp54 and lp17. The plasmids are unusual, as compared to most bacterial plasmids, in that they contain many paralogous sequences, a large number of pseudogenes, and, in some cases, essential genes. In addition, a number of the plasmids have features indicating that they are prophages. Numerous methods have been developed for Lyme disease Borrelia strain typing. These have proven valuable for clinical and epidemiological studies, as well as phylogenomic and population genetic analyses. Increasingly, these approaches have been displaced by whole genome sequencing techniques. Some correlations between genome content and pathogenicity have been deduced, and comparative whole genome analyses promise future progress in this arena.
Collapse
Affiliation(s)
- Ira Schwartz
- Department of Microbiology and Immunology, New York Medical College, Valhalla, NY USA
| | - Gabriele Margos
- National Reference Centre for Borrelia and Bavarian Health and Food Safety Authority, Oberschleissheim, Germany
| | - Sherwood R Casjens
- Division of Microbiology and Immunology, Pathology Department, University of Utah School of Medicine, Salt Lake City, UT USA
| | - Wei-Gang Qiu
- Department of Biological Sciences, Hunter College of the City University of New York, New York, NY USA
| | - Christian H Eggers
- Department of Biomedical Sciences, Quinnipiac University, Hamden, CT, USA
| |
Collapse
|
3
|
Casjens SR, Di L, Akther S, Mongodin EF, Luft BJ, Schutzer SE, Fraser CM, Qiu WG. Primordial origin and diversification of plasmids in Lyme disease agent bacteria. BMC Genomics 2018; 19:218. [PMID: 29580205 PMCID: PMC5870499 DOI: 10.1186/s12864-018-4597-x] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Accepted: 03/12/2018] [Indexed: 12/19/2022] Open
Abstract
Background With approximately one-third of their genomes consisting of linear and circular plasmids, the Lyme disease agent cluster of species has the most complex genomes among known bacteria. We report here a comparative analysis of plasmids in eleven Borreliella (also known as Borrelia burgdorferi sensu lato) species. Results We sequenced the complete genomes of two B. afzelii, two B. garinii, and individual B. spielmanii, B. bissettiae, B. valaisiana and B. finlandensis isolates. These individual isolates carry between seven and sixteen plasmids, and together harbor 99 plasmids. We report here a comparative analysis of these plasmids, along with 70 additional Borreliella plasmids available in the public sequence databases. We identify only one new putative plasmid compatibility type (the 30th) among these 169 plasmid sequences, suggesting that all or nearly all such types have now been discovered. We find that the linear plasmids in the non-B. burgdorferi species have undergone the same kinds of apparently random, chaotic rearrangements mediated by non-homologous recombination that we previously discovered in B. burgdorferi. These rearrangements occurred independently in the different species lineages, and they, along with an expanded chromosomal phylogeny reported here, allow the identification of several whole plasmid transfer events among these species. Phylogenetic analyses of the plasmid partition genes show that a majority of the plasmid compatibility types arose early, most likely before separation of the Lyme agent Borreliella and relapsing fever Borrelia clades, and this, with occasional cross species plasmid transfers, has resulted in few if any species-specific or geographic region-specific Borreliella plasmid types. Conclusions The primordial origin and persistent maintenance of the Borreliella plasmid types support their functional indispensability as well as evolutionary roles in facilitating genome diversity. The improved resolution of Borreliella plasmid phylogeny based on conserved partition-gene clusters will lead to better determination of gene orthology which is essential for prediction of biological function, and it will provide a basis for inferring detailed evolutionary mechanisms of Borreliella genomic variability including homologous gene and plasmid exchanges as well as non-homologous rearrangements. Electronic supplementary material The online version of this article (10.1186/s12864-018-4597-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sherwood R Casjens
- Division of Microbiology and Immunology, Pathology Department and Biology Department, University of Utah School of Medicine, Salt Lake City, UT, USA. .,Biology Department, University of Utah, Salt Lake City, UT, USA. .,Pathology Department, University of Utah School of Medicine, Room 2200K Emma Eccles Jones Medical Research Building, 15 North Medical Drive East, Salt Lake City, UT, 84112, USA.
| | - Lia Di
- Department of Biological Sciences and Center for Translational and Basic Research, Hunter College of the City University of New York, New York, NY, USA
| | - Saymon Akther
- Department of Biology, The Graduate Center, City University of New York, New York, NY, USA
| | - Emmanuel F Mongodin
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Benjamin J Luft
- Department of Medicine, Health Science Center, Stony Brook University, Stony Brook, NY, USA
| | - Steven E Schutzer
- Department of Medicine, New Jersey Medical School, Rutgers, The State University of New Jersey, Newark, NJ, USA
| | - Claire M Fraser
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Wei-Gang Qiu
- Department of Biology, The Graduate Center, City University of New York, New York, NY, USA. .,Department of Biological Sciences and Center for Translational and Basic Research, Hunter College of the City University of New York, New York, NY, USA. .,Department of Physiology and Biophysics & Institute for Computational Biomedicine, Weil Cornell Medical College, New York, USA.
| |
Collapse
|
4
|
Hernández Y, Bernstein R, Pagan P, Vargas L, McCaig W, Ramrattan G, Akther S, Larracuente A, Di L, Vieira FG, Qiu WG. BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines. BMC Bioinformatics 2018; 19:76. [PMID: 29499649 PMCID: PMC5833151 DOI: 10.1186/s12859-018-2074-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2017] [Accepted: 02/20/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Automated bioinformatics workflows are more robust, easier to maintain, and results more reproducible when built with command-line utilities than with custom-coded scripts. Command-line utilities further benefit by relieving bioinformatics developers to learn the use of, or to interact directly with, biological software libraries. There is however a lack of command-line utilities that leverage popular Open Source biological software toolkits such as BioPerl ( http://bioperl.org ) to make many of the well-designed, robust, and routinely used biological classes available for a wider base of end users. RESULTS Designed as standard utilities for UNIX-family operating systems, BpWrapper makes functionality of some of the most popular BioPerl modules readily accessible on the command line to novice as well as to experienced bioinformatics practitioners. The initial release of BpWrapper includes four utilities with concise command-line user interfaces, bioseq, bioaln, biotree, and biopop, specialized for manipulation of molecular sequences, sequence alignments, phylogenetic trees, and DNA polymorphisms, respectively. Over a hundred methods are currently available as command-line options and new methods are easily incorporated. Performance of BpWrapper utilities lags that of precompiled utilities while equivalent to that of other utilities based on BioPerl. BpWrapper has been tested on BioPerl Release 1.6, Perl versions 5.10.1 to 5.25.10, and operating systems including Apple macOS, Microsoft Windows, and GNU/Linux. Release code is available from the Comprehensive Perl Archive Network (CPAN) at https://metacpan.org/pod/Bio::BPWrapper . Source code is available on GitHub at https://github.com/bioperl/p5-bpwrapper . CONCLUSIONS BpWrapper improves on existing sequence utilities by following the design principles of Unix text utilities such including a concise user interface, extensive command-line options, and standard input/output for serialized operations. Further, dozens of novel methods for manipulation of sequences, alignments, and phylogenetic trees, unavailable in existing utilities (e.g., EMBOSS, Newick Utilities, and FAST), are provided. Bioinformaticians should find BpWrapper useful for rapid prototyping of workflows on the command-line without creating custom scripts for comparative genomics and other bioinformatics applications.
Collapse
Affiliation(s)
- Yözen Hernández
- Department of Biological Sciences, Hunter College, City University of New York, New York, 10065, USA.,Graduate Program in Bioinformatics, Boston University, Boston, MA, 02215, USA
| | - Rocky Bernstein
- Department of Biological Sciences, Hunter College, City University of New York, New York, 10065, USA
| | - Pedro Pagan
- Department of Biological Sciences, Hunter College, City University of New York, New York, 10065, USA
| | - Levy Vargas
- Department of Biological Sciences, Hunter College, City University of New York, New York, 10065, USA
| | - William McCaig
- Department of Biological Sciences, Hunter College, City University of New York, New York, 10065, USA
| | - Girish Ramrattan
- Department of Biological Sciences, Hunter College, City University of New York, New York, 10065, USA
| | - Saymon Akther
- Graduate Center, City University of New York, New York, 10016, USA
| | - Amanda Larracuente
- Department of Biological Sciences, Hunter College, City University of New York, New York, 10065, USA
| | - Lia Di
- Department of Biological Sciences, Hunter College, City University of New York, New York, 10065, USA
| | - Filipe G Vieira
- Centre for GeoGenetics, Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Wei-Gang Qiu
- Department of Biological Sciences, Hunter College, City University of New York, New York, 10065, USA. .,Graduate Center, City University of New York, New York, 10016, USA. .,Department of Physiology and Biophysics & Institute for Computational Biomedicine, Weil Cornell Medical College, New York, NY, 10021, USA.
| |
Collapse
|
5
|
Lapp H, Bala S, Balhoff JP, Bouck A, Goto N, Holder M, Holland R, Holloway A, Katayama T, Lewis PO, Mackey AJ, Osborne BI, Piel WH, Pond SLK, Poon AF, Qiu WG, Stajich JE, Stoltzfus A, Thierer T, Vilella AJ, Vos RA, Zmasek CM, Zwickl DJ, Vision TJ. The 2006 NESCent Phyloinformatics Hackathon: A Field Report. Evol Bioinform Online 2017. [DOI: 10.1177/117693430700300016] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
In December, 2006, a group of 26 software developers from some of the most widely used life science programming toolkits and phylogenetic software projects converged on Durham, North Carolina, for a Phyloinformatics Hackathon, an intense five-day collaborative software coding event sponsored by the National Evolutionary Synthesis Center (NESCent). The goal was to help researchers to integrate multiple phylogenetic software tools into automated workflows. Participants addressed deficiencies in interoperability between programs by implementing “glue code” and improving support for phylogenetic data exchange standards (particularly NEXUS) across the toolkits. The work was guided by use-cases compiled in advance by both developers and users, and the code was documented as it was developed. The resulting software is freely available for both users and developers through incorporation into the distributions of several widely-used open-source toolkits. We explain the motivation for the hackathon, how it was organized, and discuss some of the outcomes and lessons learned. We conclude that hackathons are an effective mode of solving problems in software interoperability and usability, and are underutilized in scientific software development.
Collapse
Affiliation(s)
- Hilmar Lapp
- National Evolutionary Synthesis Center, 2024 W. Main St., Suite A200, Durham NC 27705, U.S.A
| | - Sendu Bala
- Dunn Human Nutrition Unit, Medical Research Council, Hills Road, Cambridge CB2 0XY, United Kingdom
| | - James P. Balhoff
- National Evolutionary Synthesis Center, 2024 W. Main St., Suite A200, Durham NC 27705, U.S.A
| | - Amy Bouck
- Department of Biology, CB 3280, University of North Carolina, Chapel Hill, NC 27599
- Department of Biology, Duke University, P.O. Box 90338, Durham, NC 27708, U.S.A
| | - Naohisa Goto
- Genome Information Research Center, Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Osaka 565–0871, Japan
| | - Mark Holder
- School of Computational Science, 150-F Dirac Science Library, Florida State University, Tallahassee, Florida 32306–4120, U.S.A
| | - Richard Holland
- EMBL—European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Alisha Holloway
- Section of Evolution and Ecology, Center for Population Biology, 3347 Storer Hall, University of California, Davis, CA 95616, U.S.A
| | - Toshiaki Katayama
- Human Genome Center, Institute of Medical Science, University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108–0071, Japan
| | - Paul O. Lewis
- Department of Ecology and Evolutionary Biology, University of Connecticut, 75 North Eagleville Road, Unit 3043, Storrs, CT 06269-3043, U.S.A
| | - Aaron J. Mackey
- GlaxoSmithKline, 1250 S. Collegeville Road, Collegeville, PA 19426, U.S.A
| | | | - William H. Piel
- Peabody Museum of Natural History, Yale University, 170 Whitney Ave., New Haven CT 06511, U.S.A
| | - Sergei L. Kosakovsky Pond
- University of California, San Diego, Division of Comparative Pathology and Antiviral Research Center, 150 West Washington Street, San Diego, CA 92103
| | - Art F.Y. Poon
- University of California, San Diego, Division of Comparative Pathology and Antiviral Research Center, 150 West Washington Street, San Diego, CA 92103
| | - Wei-Gang Qiu
- Department of Biological Sciences, Hunter College, City University of New York, 695 Park Ave, New York, NY 10021, U.S.A
| | - Jason E. Stajich
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720, U.S.A
| | - Arlin Stoltzfus
- Biochemical Science Division, National Institute of Standards and Technology, 100 Bureau Drive, Mail Stop 8310, Gaithersburg, MD, 20899-8310
| | - Tobias Thierer
- Biomatters Ltd, Level 6, 220 Queen St, Auckland, New Zealand
| | - Albert J. Vilella
- EMBL—European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Rutger A. Vos
- Department of Zoology, University of British Columbia, #2370-6270 University Blvd., Vancouver, B.C. V6T 1Z4, Canada
| | | | - Derrick J. Zwickl
- National Evolutionary Synthesis Center, 2024 W. Main St., Suite A200, Durham NC 27705, U.S.A
| | - Todd J. Vision
- National Evolutionary Synthesis Center, 2024 W. Main St., Suite A200, Durham NC 27705, U.S.A
- Department of Biology, CB 3280, University of North Carolina, Chapel Hill, NC 27599
| |
Collapse
|
6
|
Qiu WG, Polotskaia A, Xiao G, Di L, Zhao Y, Hu W, Philip J, Hendrickson R, Bargonetti J. Abstract 2493: Identification of the mutant p53-PARP-MCM chromatin axis as a triple negative breast cancer replication stress target. Cancer Res 2017. [DOI: 10.1158/1538-7445.am2017-2493] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Approximately 15% of all breast cancer is triple-negative and of these about 80% are found to have mutations in the gene for the tumor suppressor p53 (TP53). Many TP53 mutations encode gain-of-function oncogenic mutant p53 (GOF mtp53) protein. We used inducible knockdown of endogenous GOF mtp53 in MDA-MB-468 cells in conjunction with stable isotope labeling with amino acids in cell culture (SILAC) and subcellular fractionation. We sequenced over 70,000 total peptides for chromatin and cytoplasmic reciprocal data sets and were able to identify 3,010 unique cytoplasmic fraction proteins and 3,403 unique chromatin fraction proteins. We found that the heterohexomeric minichromosome maintenance (MCM) complex (MCM 2-7) along with PARP are high mtp53-chromatin associated pathways. When we depleted R273H mtp53 we found a large reduction of the amount of MCM complex and PARP proteins on the chromatin. Furthermore a direct mtp53-MCM2 interaction was detected. Overexpressed mtp53, but not wild type p53, showed a protein-protein interaction with MCM2 and MCM4. We treated cells with the PARP inhibitor talazoparib and the alkylating agent temozolomide and detected synergistic activation of apoptosis only in the presence of functional MCM2-7 and mtp53. The mtp53-PARP-MCM axis has potential use as a therapeutic and diagnostic target.
Citation Format: Wei-Gang Qiu, Alla Polotskaia, Gu Xiao, Lia Di, Yuhan Zhao, Wenwei Hu, John Philip, Ronald Hendrickson, Jill Bargonetti. Identification of the mutant p53-PARP-MCM chromatin axis as a triple negative breast cancer replication stress target [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2017; 2017 Apr 1-5; Washington, DC. Philadelphia (PA): AACR; Cancer Res 2017;77(13 Suppl):Abstract nr 2493. doi:10.1158/1538-7445.AM2017-2493
Collapse
Affiliation(s)
- Wei-Gang Qiu
- 1City Univ. of New York at Hunter College and The Graduate Center, New York, NY
| | | | - Gu Xiao
- 2City Univ. of New York at Hunter College, New York, NY
| | - Lia Di
- 2City Univ. of New York at Hunter College, New York, NY
| | - Yuhan Zhao
- 3Rutgers Cancer Institute of New Jersey, Rutgers, NJ
| | - Wenwei Hu
- 3Rutgers Cancer Institute of New Jersey, Rutgers, NJ
| | - John Philip
- 4Memorial Sloan-Kettering Cancer Center, New York, NY
| | | | - Jill Bargonetti
- 1City Univ. of New York at Hunter College and The Graduate Center, New York, NY
| |
Collapse
|
7
|
Casjens SR, Gilcrease EB, Vujadinovic M, Mongodin EF, Luft BJ, Schutzer SE, Fraser CM, Qiu WG. Plasmid diversity and phylogenetic consistency in the Lyme disease agent Borrelia burgdorferi. BMC Genomics 2017; 18:165. [PMID: 28201991 PMCID: PMC5310021 DOI: 10.1186/s12864-017-3553-5] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2016] [Accepted: 02/03/2017] [Indexed: 01/13/2023] Open
Abstract
Background Bacteria from the genus Borrelia are known to harbor numerous linear and circular plasmids. We report here a comparative analysis of the nucleotide sequences of 236 plasmids present in fourteen independent isolates of the Lyme disease agent B. burgdorferi. Results We have sequenced the genomes of 14 B. burgdorferi sensu stricto isolates that carry a total of 236 plasmids. These individual isolates carry between seven and 23 plasmids. Their chromosomes, the cp26 and cp32 circular plasmids, as well as the lp54 linear plasmid, are quite evolutionarily stable; however, the remaining plasmids have undergone numerous non-homologous and often duplicative recombination events. We identify 32 different putative plasmid compatibility types among the 236 plasmids, of which 15 are (usually) circular and 17 are linear. Because of past rearrangements, any given gene, even though it might be universally present in these isolates, is often found on different linear plasmid compatibility types in different isolates. For example, the arp gene and the vls cassette region are present on plasmids of four and five different compatibility types, respectively, in different isolates. A majority of the plasmid types have more than one organizationally different subtype, and the number of such variants ranges from one to eight among the 18 linear plasmid types. In spite of this substantial organizational diversity, the plasmids are not so variable that every isolate has a novel version of every plasmid (i.e., there appears to be a limited number of extant plasmid subtypes). Conclusions Although there have been many past recombination events, both homologous and nonhomologous, among the plasmids, particular organizational variants of these plasmids correlate with particular chromosomal genotypes, suggesting that there has not been rapid horizontal transfer of whole linear plasmids among B. burgdorferi lineages. We argue that plasmid rearrangements are essentially non-revertable and are present at a frequency of only about 0.65% that of single nucleotide changes, making rearrangement-derived novel junctions (mosaic boundaries) ideal phylogenetic markers in the study of B. burgdorferi population structure and plasmid evolution and exchange. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3553-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sherwood R Casjens
- Division of Microbiology and Immunology, Pathology Department and Biology Department, University of Utah School of Medicine, Room 2200 K Emma Eccles Jones Medical Research Building, 15 North Medical Drive East, Salt Lake City, UT, 84112, USA. .,Biology Department, University of Utah, Salt Lake City, UT, USA.
| | - Eddie B Gilcrease
- Division of Microbiology and Immunology, Pathology Department and Biology Department, University of Utah School of Medicine, Room 2200 K Emma Eccles Jones Medical Research Building, 15 North Medical Drive East, Salt Lake City, UT, 84112, USA
| | - Marija Vujadinovic
- Division of Microbiology and Immunology, Pathology Department and Biology Department, University of Utah School of Medicine, Room 2200 K Emma Eccles Jones Medical Research Building, 15 North Medical Drive East, Salt Lake City, UT, 84112, USA.,Present Address: Janssen Disease and Vaccines, Pharmaceutical Companies of Johnson and Johnson, Leiden, The Netherlands
| | - Emmanuel F Mongodin
- Institute for Genome Sciences, University of Maryland BioPark, Baltimore, MD, USA
| | - Benjamin J Luft
- Department of Medicine, Health Science Center, Stony Brook University, Stony Brook, NY, USA
| | - Steven E Schutzer
- Department of Medicine, New Jersey Medical School, Rutgers, the State University of New Jersey, Newark, NJ, 07103, USA
| | - Claire M Fraser
- Institute for Genome Sciences, University of Maryland BioPark, Baltimore, MD, USA
| | - Wei-Gang Qiu
- Department of Biology, The Graduate Center, City University of New York City, New York, NY, USA.,Department of Biological Sciences and Center for Translational and Basic Research, Hunter College of the City University of New York City, New York, NY, USA
| |
Collapse
|
8
|
Qiu WG, Polotskaia A, Xiao G, Di L, Zhao Y, Hu W, Philip J, Hendrickson RC, Bargonetti J. Identification, validation, and targeting of the mutant p53-PARP-MCM chromatin axis in triple negative breast cancer. NPJ Breast Cancer 2017; 3:1. [PMID: 28232952 PMCID: PMC5319483 DOI: 10.1038/s41523-016-0001-7] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2016] [Revised: 11/17/2016] [Accepted: 11/17/2016] [Indexed: 12/18/2022] Open
Abstract
Over 80% of triple negative breast cancers express mutant p53. Mutant p53 often gains oncogenic function suggesting that triple negative breast cancers may be driven by p53 protein type. To determine the chromatin targets of this gain-of-function mutant p53 we used inducible knockdown of endogenous gain-of-function mtp53 in MDA-MB-468 cells in conjunction with stable isotope labeling with amino acids in cell culture and subcellular fractionation. We sequenced over 70,000 total peptides for each corresponding reciprocal data set and were able to identify 3010 unique cytoplasmic fraction proteins and 3403 unique chromatin fraction proteins. The present proteomics experiment corroborated our previous experiment-based results that poly ADP-ribose polymerase has a positive association with mutant p53 on the chromatin. Here, for the first time we report that the heterohexomeric minichromosome maintenance complex that participates in DNA replication initiation ranked as a high mutant p53-chromatin associated pathway. Enrichment analysis identified the minichromosome maintenance members 2-7. To validate this mutant p53- poly ADP-ribose polymerase-minichromosome maintenance functional axis, we experimentally depleted R273H mutant p53 and found a large reduction of the amount of minichromosome maintenance complex proteins on the chromatin. Furthermore a mutant p53-minichromosome maintenance 2 direct interaction was detected. Overexpressed mutant p53, but not wild type p53, showed a protein-protein interaction with minichromosome maintenance 2 and minichromosome maintenance 4. To target the mutant p53- poly ADP-ribose polymerase-minichromosome maintenance axis we treated cells with the poly ADP-ribose polymerase inhibitor talazoparib and the alkylating agent temozolomide and detected synergistic activation of apoptosis only in the presence of mutant p53. Furthermore when minichromosome maintenance 2-7 activity was inhibited the synergistic activation of apoptosis was blocked. This mutant p53- poly ADP-ribose polymerase -minichromosome maintenance axis may be useful for theranostics.
Collapse
Affiliation(s)
- Wei-Gang Qiu
- The Department of Biological Sciences Hunter College, City University of New York, Hunter College-Weill Cornell Belfer Research Building, 413 East 69th, New York, NY 10065 USA
- The Graduate Center PhD Program in Biology, City University of New York, New York, NY 10016 USA
- Department of Physiology and Biophysics, Weill Cornell Medical College of Cornell University, New York, NY 10065 USA
| | - Alla Polotskaia
- The Department of Biological Sciences Hunter College, City University of New York, Hunter College-Weill Cornell Belfer Research Building, 413 East 69th, New York, NY 10065 USA
| | - Gu Xiao
- The Department of Biological Sciences Hunter College, City University of New York, Hunter College-Weill Cornell Belfer Research Building, 413 East 69th, New York, NY 10065 USA
| | - Lia Di
- The Department of Biological Sciences Hunter College, City University of New York, Hunter College-Weill Cornell Belfer Research Building, 413 East 69th, New York, NY 10065 USA
| | - Yuhan Zhao
- Rutgers Cancer Institute of New Jersey, New Brunswick, NJ 08903 USA
| | - Wenwei Hu
- Rutgers Cancer Institute of New Jersey, New Brunswick, NJ 08903 USA
| | - John Philip
- Proteomics Core Facility, Memorial Sloan-Kettering Cancer Center, New York, NY 10065 USA
| | - Ronald C. Hendrickson
- Proteomics Core Facility, Memorial Sloan-Kettering Cancer Center, New York, NY 10065 USA
| | - Jill Bargonetti
- The Department of Biological Sciences Hunter College, City University of New York, Hunter College-Weill Cornell Belfer Research Building, 413 East 69th, New York, NY 10065 USA
- The Graduate Center PhD Programs in Biology and Biochemistry, City University of New York, New York, NY 10016 USA
- Department of Cell and Developmental Biology, Weill Cornell Medical College of Cornell University, New York, NY 10065 USA
| |
Collapse
|
9
|
Qiu WG, Polotskaia A, Xiao G, Di L, Philip J, Hendrikson RC, Bargonetti J. Abstract 3684: Proteome-wide triple negative breast cancer mutant p53 association index identifies chromatin unwinding for precision therapeutics. Cancer Res 2016. [DOI: 10.1158/1538-7445.am2016-3684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Over 80% of TNBCs express mutant p53 (mtp53) proteins. We coupled cell fractionation with stable isotope labeling with amino acids in cell culture (SILAC) and inducible knockdown of endogenous mtp53 to determine the mtp53 driven proteome in the cytoplasm and chromatin of triple negative breast cancer (TNBC) cells. Using SILAC coupled to tandem mass spectrometry (LC-MS/MS) we identified that R273H mtp53 expression in MDA-MB-468 breast cancer cells both up and down-regulated multiple proteins and metabolic pathways. We sequenced 73,154 peptide pairs that corresponded to 3010 proteins detected under reciprocal labeling conditions in the cytoplasm and 48,825 peptide pairs that corresponded to 5195 proteins in the chromatin fraction. Pathway enrichment analysis ranked the DNA unwinding pathway as the highest chromatin associated pathway regulated by mtp53. Moreover, to summarize and quantify the degree of under- or over-expression of a protein from two reciprocal experiments, a mutant p53 association index (mPAI) was defined as the log (of base 2) ratio of two readings from the two reciprocal experiments. Values of mPAI were normally distributed with a mean close to zero, consistent with the fact that levels of the majority of proteins were not affected by mtp53 knockdown. Standard deviation of mPAI is close to one and mtp53 itself shows mPAI values of greater than 2.0 (z-score > 2.0) in both cytosol and chromatin fractions, consistent with the expectation that its levels are significantly reduced by knockdown experiments. We thereby identified proteins and pathways significantly affected by mtp53 knockdowns as those with mPAI > 1.0 (indicating positive association) or mPAI < -1.0 (indicating negative association). A pathway that was positively associated in both the cytoplasm and the chromatin was the DNA unwinding pathway, which was represented by associated changes for all the minichromosome maintenance protein complex proteins 2 through 7 (MCM2-7). This hexomeric protein ring structure complex is the eukaryotic DNA helicase complex required for DNA replication and elongation of the replication fork. We validated that the MCM4 protein level is positively associated with mtp53 R273H in the TNBC cell line MDA-MB-468 and in the colon adenocarcinoma cell line HT-29. This suggests that targeting inhibition of the MCM hexomeric complex may be a good precision medicine approach for TNBCs. Further studies are in progress to determine the efficacy of targeting inhibition of MCMs 2-7 as a mechanism to block TNBC.
Citation Format: Wei-Gang Qiu, Alla Polotskaia, Gu Xiao, Lia Di, John Philip, Ronald C. Hendrikson, Jill Bargonetti. Proteome-wide triple negative breast cancer mutant p53 association index identifies chromatin unwinding for precision therapeutics. [abstract]. In: Proceedings of the 107th Annual Meeting of the American Association for Cancer Research; 2016 Apr 16-20; New Orleans, LA. Philadelphia (PA): AACR; Cancer Res 2016;76(14 Suppl):Abstract nr 3684.
Collapse
Affiliation(s)
- Wei-Gang Qiu
- 1City University of New York at Hunter College and The Graduate Center, New York, NY
| | - Alla Polotskaia
- 2City University of New York at Hunter College, New York, NY
| | - Gu Xiao
- 2City University of New York at Hunter College, New York, NY
| | - Lia Di
- 2City University of New York at Hunter College, New York, NY
| | - John Philip
- 3Memorial Sloan-Kettering Cancer Center, New York, NY
| | | | - Jill Bargonetti
- 1City University of New York at Hunter College and The Graduate Center, New York, NY
| |
Collapse
|
10
|
Heavner ME, Qiu WG, Cheng HP. Phylogenetic Co-Occurrence of ExoR, ExoS, and ChvI, Components of the RSI Bacterial Invasion Switch, Suggests a Key Adaptive Mechanism Regulating the Transition between Free-Living and Host-Invading Phases in Rhizobiales. PLoS One 2015; 10:e0135655. [PMID: 26309130 PMCID: PMC4550343 DOI: 10.1371/journal.pone.0135655] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2015] [Accepted: 07/23/2015] [Indexed: 11/18/2022] Open
Abstract
Both bacterial symbionts and pathogens rely on their host-sensing mechanisms to activate the biosynthetic pathways necessary for their invasion into host cells. The Gram-negative bacterium Sinorhizobium meliloti relies on its RSI (ExoR-ExoS-ChvI) Invasion Switch to turn on the production of succinoglycan, an exopolysaccharide required for its host invasion. Recent whole-genome sequencing efforts have uncovered putative components of RSI-like invasion switches in many other symbiotic and pathogenic bacteria. To explore the possibility of the existence of a common invasion switch, we have conducted a phylogenomic survey of orthologous ExoR, ExoS, and ChvI tripartite sets in more than ninety proteobacterial genomes. Our analyses suggest that functional orthologs of the RSI invasion switch co-exist in Rhizobiales, an order characterized by numerous invasive species, but not in the order’s close relatives. Phylogenomic analyses and reconstruction of orthologous sets of the three proteins in Alphaproteobacteria confirm Rhizobiales-specific gene synteny and congruent RSI evolutionary histories. Evolutionary analyses further revealed site-specific substitutions correlated specifically to either animal-bacteria or plant-bacteria associations. Lineage restricted conservation of any one specialized gene is in itself an indication of species adaptation. However, the orthologous phylogenetic co-occurrence of all interacting partners within this single signaling pathway strongly suggests that the development of the RSI switch was a key adaptive mechanism. The RSI invasion switch, originally found in S. meliloti, is a characteristic of the Rhizobiales, and potentially a conserved crucial activation step that may be targeted to control host invasion by pathogenic bacterial species.
Collapse
Affiliation(s)
- Mary Ellen Heavner
- Biochemistry Program, The Graduate Center, City University of New York, New York, New York, United States of America
| | - Wei-Gang Qiu
- Biological Sciences Department, Hunter College, City University of New York, New York, New York, United States of America
| | - Hai-Ping Cheng
- Biochemistry Program, The Graduate Center, City University of New York, New York, New York, United States of America
- Biological Sciences Department, Lehman College, City University of New York, Bronx, New York, United States of America
- * E-mail:
| |
Collapse
|
11
|
Polotskaia A, Xiao G, Reynoso K, Martin C, Qiu WG, Hendrickson R, Bargonetti J. Abstract 1225: Proteome-wide analysis of gain-of-function mutant p53 targets in breast cancer implicates PARP, PCNA and MCM4 as oncogenic drivers. Cancer Res 2015. [DOI: 10.1158/1538-7445.am2015-1225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Gain-of-function mutant p53 (mtp53) changes the cancer cell transcriptome. However no analysis of mtp53-associated proteome diversity has been carried out. We coupled cell fractionation with stable isotope labeling with amino acids in cell culture (SILAC) and inducible knockdown of endogenous mtp53 to determine the mtp53 driven proteome. Using SILAC followed by tandem mass spectrometry (LC-MS/MS) we identified that R273H mtp53 expression in MDA-MB-468 breast cancer cells both up and down-regulated multiple proteins and metabolic pathways. We sequenced 73,154 peptide pairs that corresponded to 3010 proteins detected under reciprocal labeling conditions. Importantly the high impact regulated targets included the previously identified transcriptionally regulated mevalonate pathway proteins but also identified two new levels of mtp53 protein regulation for non-transcriptional targets. Interestingly, mtp53 depletion profoundly influenced poly (ADP-ribose) polymerase 1 (PARP1) localization, with increased cytoplasmic and decreased chromatin-associated PARP1. An enzymatic PARP shift occurred with high mtp53 expression resulting in increased poly-ADP-ribosylated proteins in the nucleus. Mtp53 increased the level of PCNA and MCM4 protein without changing the amount of pcna or mcm4 transcript. Pathway enrichment analysis ranked the DNA replication pathway above the cholesterol biosynthesis pathway as an R273H mtp53 activated proteomic target. Knowledge of the proteome diversity driven by mtp53 suggests DNA replication and repair pathways are major targets of mtp53 and highlights consideration of combination chemotherapeutic strategies targeting cholesterol biosynthesis and PARP inhibition.
Acknowledgements. This work was supported by a grant to JB from the Breast Cancer Research Foundation.
Citation Format: Alla Polotskaia, Gu Xiao, Katherine Reynoso, Che Martin, Wei-Gang Qiu, Ronald Hendrickson, Jill Bargonetti. Proteome-wide analysis of gain-of-function mutant p53 targets in breast cancer implicates PARP, PCNA and MCM4 as oncogenic drivers. [abstract]. In: Proceedings of the 106th Annual Meeting of the American Association for Cancer Research; 2015 Apr 18-22; Philadelphia, PA. Philadelphia (PA): AACR; Cancer Res 2015;75(15 Suppl):Abstract nr 1225. doi:10.1158/1538-7445.AM2015-1225
Collapse
Affiliation(s)
- Alla Polotskaia
- 1City University of New York at Hunter College, New York, NY
| | - Gu Xiao
- 1City University of New York at Hunter College, New York, NY
| | | | - Che Martin
- 1City University of New York at Hunter College, New York, NY
| | - Wei-Gang Qiu
- 1City University of New York at Hunter College, New York, NY
| | | | - Jill Bargonetti
- 3City University of New York at Hunter College and The Graduate Center, New York, NY
| |
Collapse
|
12
|
Gorson J, Ramrattan G, Verdes A, Wright EM, Kantor Y, Rajaram Srinivasan R, Musunuri R, Packer D, Albano G, Qiu WG, Holford M. Molecular Diversity and Gene Evolution of the Venom Arsenal of Terebridae Predatory Marine Snails. Genome Biol Evol 2015; 7:1761-78. [PMID: 26025559 PMCID: PMC4494067 DOI: 10.1093/gbe/evv104] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Venom peptides from predatory organisms are a resource for investigating evolutionary processes such as adaptive radiation or diversification, and exemplify promising targets for biomedical drug development. Terebridae are an understudied lineage of conoidean snails, which also includes cone snails and turrids. Characterization of cone snail venom peptides, conotoxins, has revealed a cocktail of bioactive compounds used to investigate physiological cellular function, predator-prey interactions, and to develop novel therapeutics. However, venom diversity of other conoidean snails remains poorly understood. The present research applies a venomics approach to characterize novel terebrid venom peptides, teretoxins, from the venom gland transcriptomes of Triplostephanus anilis and Terebra subulata. Next-generation sequencing and de novo assembly identified 139 putative teretoxins that were analyzed for the presence of canonical peptide features as identified in conotoxins. To meet the challenges of de novo assembly, multiple approaches for cross validation of findings were performed to achieve reliable assemblies of venom duct transcriptomes and to obtain a robust portrait of Terebridae venom. Phylogenetic methodology was used to identify 14 teretoxin gene superfamilies for the first time, 13 of which are unique to the Terebridae. Additionally, basic local algorithm search tool homology-based searches to venom-related genes and posttranslational modification enzymes identified a convergence of certain venom proteins, such as actinoporin, commonly found in venoms. This research provides novel insights into venom evolution and recruitment in Conoidean predatory marine snails and identifies a plethora of terebrid venom peptides that can be used to investigate fundamental questions pertaining to gene evolution.
Collapse
Affiliation(s)
- Juliette Gorson
- Hunter College and The Graduate Center, City University of New York Invertebrate Zoology, Sackler Institute for Comparative Genomics, American Museum of Natural History, New York
| | - Girish Ramrattan
- Hunter College and The Graduate Center, City University of New York
| | - Aida Verdes
- Hunter College and The Graduate Center, City University of New York Invertebrate Zoology, Sackler Institute for Comparative Genomics, American Museum of Natural History, New York
| | - Elizabeth M Wright
- Hunter College and The Graduate Center, City University of New York Invertebrate Zoology, Sackler Institute for Comparative Genomics, American Museum of Natural History, New York
| | - Yuri Kantor
- A.N. Severtsov Institute of Ecology and Evolution, Russian Academy of Sciences, Moscow, Russia Visiting Professor, Muséum National d'Histoire Naturelle, Paris, France
| | | | - Raj Musunuri
- Department of Bioinformatics, New York University Polytechnic School of Engineering
| | - Daniel Packer
- Hunter College and The Graduate Center, City University of New York
| | - Gabriel Albano
- Estação de Biologia Marítima da Inhaca (EBMI), Faculdade de Ciencias, Universidade Eduardo Mondlane, Distrito Municipal KaNyaka, Maputo, Mozambique
| | - Wei-Gang Qiu
- Hunter College and The Graduate Center, City University of New York
| | - Mandë Holford
- Hunter College and The Graduate Center, City University of New York Invertebrate Zoology, Sackler Institute for Comparative Genomics, American Museum of Natural History, New York
| |
Collapse
|
13
|
Polotskaia A, Xiao G, Reynoso K, Martin C, Qiu WG, Hendrickson RC, Bargonetti J. Proteome-wide analysis of mutant p53 targets in breast cancer identifies new levels of gain-of-function that influence PARP, PCNA, and MCM4. Proc Natl Acad Sci U S A 2015; 112:E1220-9. [PMID: 25733866 PMCID: PMC4371979 DOI: 10.1073/pnas.1416318112] [Citation(s) in RCA: 66] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
The gain-of-function mutant p53 (mtp53) transcriptome has been studied, but, to date, no detailed analysis of the mtp53-associated proteome has been described. We coupled cell fractionation with stable isotope labeling with amino acids in cell culture (SILAC) and inducible knockdown of endogenous mtp53 to determine the mtp53-driven proteome. Our fractionation data highlight the underappreciated biology that missense mtp53 proteins R273H, R280K, and L194F are tightly associated with chromatin. Using SILAC coupled to tandem MS, we identified that R273H mtp53 expression in MDA-MB-468 breast cancer cells up- and down-regulated multiple proteins and metabolic pathways. Here we provide the data set obtained from sequencing 73,154 peptide pairs that then corresponded to 3,010 proteins detected under reciprocal labeling conditions. Importantly, the high impact regulated targets included the previously identified transcriptionally regulated mevalonate pathway proteins but also identified two new levels of mtp53 protein regulation for nontranscriptional targets. Interestingly, mtp53 depletion profoundly influenced poly(ADP ribose) polymerase 1 (PARP1) localization, with increased cytoplasmic and decreased chromatin-associated protein. An enzymatic PARP shift occurred with high mtp53 expression, resulting in increased poly-ADP-ribosylated proteins in the nucleus. Mtp53 increased the level of proliferating cell nuclear antigen (PCNA) and minichromosome maintenance 4 (MCM4) proteins without changing the amount of pcna and mcm4 transcripts. Pathway enrichment analysis ranked the DNA replication pathway above the cholesterol biosynthesis pathway as a R273H mtp53 activated proteomic target. Knowledge of the proteome diversity driven by mtp53 suggests that DNA replication and repair pathways are major targets of mtp53 and highlights consideration of combination chemotherapeutic strategies targeting cholesterol biosynthesis and PARP inhibition.
Collapse
Affiliation(s)
- Alla Polotskaia
- Department of Biological Sciences, Hunter College, City University of New York, New York, NY 10065; and
| | - Gu Xiao
- Department of Biological Sciences, Hunter College, City University of New York, New York, NY 10065; and
| | - Katherine Reynoso
- Department of Biological Sciences, Hunter College, City University of New York, New York, NY 10065; and
| | - Che Martin
- Department of Biological Sciences, Hunter College, City University of New York, New York, NY 10065; and
| | - Wei-Gang Qiu
- Department of Biological Sciences, Hunter College, City University of New York, New York, NY 10065; and
| | - Ronald C Hendrickson
- Proteomics Core Facility, Memorial Sloan-Kettering Cancer Center, New York, NY 10065
| | - Jill Bargonetti
- Department of Biological Sciences, Hunter College, City University of New York, New York, NY 10065; and
| |
Collapse
|
14
|
Di L, Pagan PE, Packer D, Martin CL, Akther S, Ramrattan G, Mongodin EF, Fraser CM, Schutzer SE, Luft BJ, Casjens SR, Qiu WG. BorreliaBase: a phylogeny-centered browser of Borrelia genomes. BMC Bioinformatics 2014; 15:233. [PMID: 24994456 PMCID: PMC4094996 DOI: 10.1186/1471-2105-15-233] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2014] [Accepted: 06/26/2014] [Indexed: 11/29/2022] Open
Abstract
Background The bacterial genus Borrelia (phylum Spirochaetes) consists of two groups of pathogens represented respectively by B. burgdorferi, the agent of Lyme borreliosis, and B. hermsii, the agent of tick-borne relapsing fever. The number of publicly available Borrelia genomic sequences is growing rapidly with the discovery and sequencing of Borrelia strains worldwide. There is however a lack of dedicated online databases to facilitate comparative analyses of Borrelia genomes. Description We have developed BorreliaBase, an online database for comparative browsing of Borrelia genomes. The database is currently populated with sequences from 35 genomes of eight Lyme-borreliosis (LB) group Borrelia species and 7 Relapsing-fever (RF) group Borrelia species. Distinct from genome repositories and aggregator databases, BorreliaBase serves manually curated comparative-genomic data including genome-based phylogeny, genome synteny, and sequence alignments of orthologous genes and intergenic spacers. Conclusions With a genome phylogeny at its center, BorreliaBase allows online identification of hypervariable lipoprotein genes, potential regulatory elements, and recombination footprints by providing evolution-based expectations of sequence variability at each genomic locus. The phylo-centric design of BorreliaBase (http://borreliabase.org) is a novel model for interactive browsing and comparative analysis of bacterial genomes online.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | - Wei-Gang Qiu
- Department of Biological Sciences, Hunter College, The City University of New York, 10065 New York, NY, USA.
| |
Collapse
|
15
|
Mongodin EF, Casjens SR, Bruno JF, Xu Y, Drabek EF, Riley DR, Cantarel BL, Pagan PE, Hernandez YA, Vargas LC, Dunn JJ, Schutzer SE, Fraser CM, Qiu WG, Luft BJ. Inter- and intra-specific pan-genomes of Borrelia burgdorferi sensu lato: genome stability and adaptive radiation. BMC Genomics 2013; 14:693. [PMID: 24112474 PMCID: PMC3833655 DOI: 10.1186/1471-2164-14-693] [Citation(s) in RCA: 59] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2013] [Accepted: 09/26/2013] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Lyme disease is caused by spirochete bacteria from the Borrelia burgdorferi sensu lato (B. burgdorferi s.l.) species complex. To reconstruct the evolution of B. burgdorferi s.l. and identify the genomic basis of its human virulence, we compared the genomes of 23 B. burgdorferi s.l. isolates from Europe and the United States, including B. burgdorferi sensu stricto (B. burgdorferi s.s., 14 isolates), B. afzelii (2), B. garinii (2), B. "bavariensis" (1), B. spielmanii (1), B. valaisiana (1), B. bissettii (1), and B. "finlandensis" (1). RESULTS Robust B. burgdorferi s.s. and B. burgdorferi s.l. phylogenies were obtained using genome-wide single-nucleotide polymorphisms, despite recombination. Phylogeny-based pan-genome analysis showed that the rate of gene acquisition was higher between species than within species, suggesting adaptive speciation. Strong positive natural selection drives the sequence evolution of lipoproteins, including chromosomally-encoded genes 0102 and 0404, cp26-encoded ospC and b08, and lp54-encoded dbpA, a07, a22, a33, a53, a65. Computer simulations predicted rapid adaptive radiation of genomic groups as population size increases. CONCLUSIONS Intra- and inter-specific pan-genome sizes of B. burgdorferi s.l. expand linearly with phylogenetic diversity. Yet gene-acquisition rates in B. burgdorferi s.l. are among the lowest in bacterial pathogens, resulting in high genome stability and few lineage-specific genes. Genome adaptation of B. burgdorferi s.l. is driven predominantly by copy-number and sequence variations of lipoprotein genes. New genomic groups are likely to emerge if the current trend of B. burgdorferi s.l. population expansion continues.
Collapse
Affiliation(s)
- Emmanuel F Mongodin
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Alaie A, Teller V, Qiu WG. A Bioinformatics Module for Use in an Introductory Biology Laboratory. Am Biol Teach 2012; 74:318-322. [PMID: 30327576 PMCID: PMC6186437 DOI: 10.1525/abt.2012.74.5.6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Since biomedical science has become increasingly data-intensive, acquisition of computational and quantitative skills by science students has become more important. For non-science students, an introduction to biomedical databases and their applications promotes the development of a scientifically literate population. Because typical college introductory biology laboratories do not include experiences of this type, we present a bioinformatics module that can easily be included in a 90-minute session of a biology course for both majors and non-majors. Students completing this computational, inquiry-based module observed the value of computer-assisted analysis. The module gave students an understanding of how to read files in a biological database (GenBank) and how to use a software tool (BLAST) to mine the database.
Collapse
Affiliation(s)
- Adrienne Alaie
- ADRIENE ALAIE is Assistant Professor of Biological Sciences at Hunter College of the City University of New York, 695 Park Ave., Room 818N, New York, NY 10065, where VIRGINIA TELLER is Professor and Chair of Computer Sciences and WEI-GANG QIU is Associate Professor of Biological Sciences
| | - Virginia Teller
- ADRIENE ALAIE is Assistant Professor of Biological Sciences at Hunter College of the City University of New York, 695 Park Ave., Room 818N, New York, NY 10065, where VIRGINIA TELLER is Professor and Chair of Computer Sciences and WEI-GANG QIU is Associate Professor of Biological Sciences
| | - Wei-Gang Qiu
- ADRIENE ALAIE is Assistant Professor of Biological Sciences at Hunter College of the City University of New York, 695 Park Ave., Room 818N, New York, NY 10065, where VIRGINIA TELLER is Professor and Chair of Computer Sciences and WEI-GANG QIU is Associate Professor of Biological Sciences
| |
Collapse
|
17
|
Casjens SR, Mongodin EF, Qiu WG, Luft BJ, Schutzer SE, Gilcrease EB, Huang WM, Vujadinovic M, Aron JK, Vargas LC, Freeman S, Radune D, Weidman JF, Dimitrov GI, Khouri HM, Sosa JE, Halpin RA, Dunn JJ, Fraser CM. Genome stability of Lyme disease spirochetes: comparative genomics of Borrelia burgdorferi plasmids. PLoS One 2012; 7:e33280. [PMID: 22432010 PMCID: PMC3303823 DOI: 10.1371/journal.pone.0033280] [Citation(s) in RCA: 123] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2011] [Accepted: 02/06/2012] [Indexed: 11/21/2022] Open
Abstract
Lyme disease is the most common tick-borne human illness in North America. In order to understand the molecular pathogenesis, natural diversity, population structure and epizootic spread of the North American Lyme agent, Borrelia burgdorferi sensu stricto, a much better understanding of the natural diversity of its genome will be required. Towards this end we present a comparative analysis of the nucleotide sequences of the numerous plasmids of B. burgdorferi isolates B31, N40, JD1 and 297. These strains were chosen because they include the three most commonly studied laboratory strains, and because they represent different major genetic lineages and so are informative regarding the genetic diversity and evolution of this organism. A unique feature of Borrelia genomes is that they carry a large number of linear and circular plasmids, and this work shows that strains N40, JD1, 297 and B31 carry related but non-identical sets of 16, 20, 19 and 21 plasmids, respectively, that comprise 33–40% of their genomes. We deduce that there are at least 28 plasmid compatibility types among the four strains. The B. burgdorferi ∼900 Kbp linear chromosomes are evolutionarily exceptionally stable, except for a short ≤20 Kbp plasmid-like section at the right end. A few of the plasmids, including the linear lp54 and circular cp26, are also very stable. We show here that the other plasmids, especially the linear ones, are considerably more variable. Nearly all of the linear plasmids have undergone one or more substantial inter-plasmid rearrangements since their last common ancestor. In spite of these rearrangements and differences in plasmid contents, the overall gene complement of the different isolates has remained relatively constant.
Collapse
Affiliation(s)
- Sherwood R Casjens
- Department of Pathology, University of Utah School of Medicine, Salt Lake City, Utah, United States of America.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Abstract
There is a recent emergence of interest in the genes involved in gametic recognition as drivers of reproductive isolation. The recent population genomic sequencing of two species of sexually primitive yeasts (Liti G, Carter DM, Moses AM, Warringer J, Parts L, James SA, Davey RP, Roberts IN, Burt A, Koufopanou V et al. [23 co-authors]. 2009. Population genomics of domestic and wild yeasts. Nature 458:337-341.) has provided data for systematic study of the roles these genes play in the early evolution of sex and speciation. Here, we discovered that among genes encoding cell surface proteins, the sexual adhesin genes have evolved significantly more rapidly than others, both within and between Saccharomyces cerevisiae and its closest relative S. paradoxus. This result was supported by analyses using the PAML pairwise model, a modified McDonald-Kreitman test, and the PAML branch model. Moreover, using a combination of a new statistic of neutrality, an information theory-based measure of evolutionary variability, and functional characterization of amino acid changes, we found that a higher proportion of amino acid changes are fixed in the sexual adhesins than in other proteins and a greater proportion of the fixed amino acid changes either between the two species or the two subgroups of S. paradoxus are functionally dissimilar or radically different. These results suggest that the accelerated evolution of sexual adhesin genes may facilitate speciation, or incipient speciation, and promote sexual selection in general.
Collapse
Affiliation(s)
- Xianfa Xie
- Department of Biology, Brooklyn College, City University of New York, NY, USA.
| | | | | |
Collapse
|
19
|
Wywial E, Haven J, Casjens SR, Hernandez YA, Singh S, Mongodin EF, Fraser-Liggett CM, Luft BJ, Schutzer SE, Qiu WG. Fast, adaptive evolution at a bacterial host-resistance locus: the PFam54 gene array in Borrelia burgdorferi. Gene 2009; 445:26-37. [PMID: 19505540 DOI: 10.1016/j.gene.2009.05.017] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2009] [Revised: 05/15/2009] [Accepted: 05/31/2009] [Indexed: 10/20/2022]
Abstract
Microbial pathogens have evolved sophisticated mechanisms for evasion of host innate and adaptive immunities. PFam54 is the largest paralogous gene family in the genomes of Borrelia burgdorferi, the Lyme disease bacterium. One member of PFam54, the complement-regulator acquiring surface proteins 1 (BbCrasp-1), is able to abort the alternative pathway of complement activation via binding human complement-regulator factor H (FH). The gene coding for BbCRASP-1 exists in a tandem array of PFam54 genes in the B. burgdorferi genome, a result apparently of repeated gene duplications. To help elucidate the functions of the large number of PFam54 genes, we performed phylogenomic and structural analyses of the PFam54 gene array from ten B. burgdorferi genomes. Analyses based on gene tree, genome synteny, and structural models revealed rapid adaptive evolution of this array through gene duplication, gene loss, and functional diversification. Individual PFam54 genes, however, do not show high intra-population sequence polymorphisms as genes providing evasion from adaptive immunity generally do. PFam54 members able to bind human FH are not monophyletic, suggesting that human FH affinity, however strong, is an incidental rather than main function of these PFam54 proteins. The large number of PFam54 genes existing in any single B. burgdorferi genome may target different innate-immunity proteins of a single host species or the same immune protein of a variety of host species. Genetic variability of the PFam54 gene array suggests that universally present PFam54 lineages such as BBA64, BBA65, BBA66, and BBA73 may be better candidates for the development of broad-spectrum vaccines or drugs than strain-restricted lineages such as BbCRASP-1.
Collapse
Affiliation(s)
- Ewa Wywial
- Department of Biology, The Graduate Center of the City University of New York, 365 Fifth Avenue, New York, NY 10016, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Tran D, Haven J, Qiu WG, Polle JEW. An update on carotenoid biosynthesis in algae: phylogenetic evidence for the existence of two classes of phytoene synthase. Planta 2009; 229:723-9. [PMID: 19066941 PMCID: PMC6008256 DOI: 10.1007/s00425-008-0866-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2008] [Accepted: 11/17/2008] [Indexed: 05/10/2023]
Abstract
Carotenoids play crucial roles in structure and function of the photosynthetic apparatus of bacteria, algae, and higher plants. The entry-step reaction to carotenoid biosynthesis is catalyzed by the phytoene synthase (PSY), which is structurally and functionally related in all organisms. A comparative genomic analysis regarding the PSY revealed that the green algae Ostreococcus and Micromonas possess two orthologous copies of the PSY genes, indicating an ancient gene duplication event that produced two classes of PSY in algae. However, some other green algae (Chlamydomonas reinhardtii, Chlorella vulgaris, and Volvox carteri), red algae (Cyanidioschyzon merolae), diatoms (Thalassiosira pseudonana and Phaeodactylum tricornutum), and higher plants retained only one class of the PSY gene whereas the other gene copy was lost in these species. Further, similar to the situation in higher plants recent gene duplications of PSY have occurred for example in the green alga Dunaliella salina/bardawil. As members of the PSY gene families in some higher plants are differentially regulated during development or stress, the discovery of two classes of PSY gene families in some algae suggests that carotenoid biosynthesis in these algae is differentially regulated in response to development and environmental stress as well.
Collapse
Affiliation(s)
- Duc Tran
- Department of Biology, Brooklyn College of the City University of New York, Brooklyn, NY 11210, USA
| | | | | | | |
Collapse
|
21
|
Qiu WG, Bruno JF, McCaig WD, Xu Y, Livey I, Schriefer ME, Luft BJ. Wide Distribution of a High-VirulenceBorrelia burgdorferiClone in Europe and North America. Emerg Infect Dis 2008. [DOI: 10.3201/eid/1407.070880] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Affiliation(s)
- Wei-Gang Qiu
- Hunter College of the City University of New York, New York, New York, USA
| | | | - William D. McCaig
- Hunter College of the City University of New York, New York, New York, USA
| | - Yun Xu
- Stony Brook University, Stony Brook, New York, USA
| | - Ian Livey
- Baxter Innovations GmBH, Orth/Donau, Austria
| | | | | |
Collapse
|
22
|
Qiu WG, Bruno JF, McCaig WD, Xu Y, Livey I, Schriefer ME, Luft BJ. Wide distribution of a high-virulence Borrelia burgdorferi clone in Europe and North America. Emerg Infect Dis 2008; 14:1097-104. [PMID: 18598631 PMCID: PMC2600328 DOI: 10.3201/eid1407.070880] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
The A and B clones of Borrelia burgdorferi sensu stricto, distinguished by outer surface protein C (ospC) gene sequences, are commonly associated with disseminated Lyme disease. To resolve phylogenetic relationships among isolates, we sequenced 68 isolates from Europe and North America at 1 chromosomal locus (16S-23S ribosomal RNA spacer) and 3 plasmid loci (ospC,dbpA, and BBD14). The ospC-A clone appeared to be highly prevalent on both continents, and isolates of this clone were uniform in DNA sequences, which suggests a recent trans-oceanic migration. The genetic homogeneity of ospC-A isolates was confirmed by sequences at 6 additional chromosomal housekeeping loci (gap, alr, glpA, xylB, ackA, and tgt). In contrast, the ospC-B group consists of genotypes distinct to each continent, indicating geographic isolation. We conclude that the ospC-A clone has dispersed rapidly and widely in the recent past. The spread of the ospC-A clone may have contributed, and likely continues to contribute, to the rise of Lyme disease incidence.
Collapse
Affiliation(s)
- Wei-Gang Qiu
- Hunter College of the City University of New York, New York, New York, USA
| | | | - William D. McCaig
- Hunter College of the City University of New York, New York, New York, USA
| | - Yun Xu
- Stony Brook University, Stony Brook, New York, USA
| | - Ian Livey
- Baxter Innovations GmBH, Orth/Donau, Austria
| | | | | |
Collapse
|
23
|
Coronado JE, Mneimneh S, Epstein SL, Qiu WG, Lipke PN. Conserved processes and lineage-specific proteins in fungal cell wall evolution. Eukaryot Cell 2007; 6:2269-77. [PMID: 17951517 PMCID: PMC2168262 DOI: 10.1128/ec.00044-07] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2007] [Accepted: 10/03/2007] [Indexed: 11/20/2022]
Abstract
The cell wall is a defining organelle that differentiates fungi from its sister clades in the opisthokont superkingdom. With a sensitive technique to align low-complexity protein sequences, we have identified 187 cell wall-related proteins in Saccharomyces cerevisiae and determined the presence or absence of homologs in 17 other fungal genomes. There were both conserved and lineage-specific cell wall proteins, and the degree of conservation was strongly correlated with protein function. Some functional classes were poorly conserved and lineage specific: adhesins, structural wall glycoprotein components, and unannotated open reading frames. These proteins are primarily those that are constituents of the walls themselves. On the other hand, glycosyl hydrolases and transferases, proteases, lipases, proteins in the glycosyl phosphatidyl-inositol-protein synthesis pathway, and chaperones were strongly conserved. Many of these proteins are also conserved in other eukaryotes and are associated with wall synthesis in plants. This gene conservation, along with known similarities in wall architecture, implies that the basic architecture of fungal walls is ancestral to the divergence of the ascomycetes and basidiomycetes. The contrasting lineage specificity of wall resident proteins implies diversification. Therefore, fungal cell walls consist of rapidly diversifying proteins that are assembled by the products of an ancestral and conserved set of genes.
Collapse
Affiliation(s)
- Juan E Coronado
- Department of Biological Sciences, Hunter College, City University of New York, New York, New York 10021, USA
| | | | | | | | | |
Collapse
|
24
|
Coronado JE, Epstein SL, Qiu WG, Lipke PN. Discovery of Recurrent Sequence Motifs in Saccharomyces cerevisiae Cell Wall Proteins. Match (Mulh) 2007; 58:281-299. [PMID: 19430580 PMCID: PMC2678842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
This paper describes a procedure for the discovery of recurrent substrings in amino acid sequences of proteins, and its application to fungal cell walls. The evolutionary origins of fungal cell walls are an open biological question. This question can be approached by studies of similarity among the sequences and sub-sequences of fungal wall proteins and by comparison to proteins in animals. We describe here how we have discovered building blocks, represented as recurrent sequence motifs (sub-sequences), within fungal cell wall proteins. These motifs have not been systematically identified before, because the low Shannon entropy of the cell wall sequences has hindered searches for local sequence similarities by sequence alignments. Nonetheless, our new, composition-based scoring matrices for local alignment searches now support statistically valid alignments for such low entropy sequences (Coronado et al. 2006. Euk. Cell 5: 628-637). We have now searched for similarities in a set of 171 known and putative cell wall proteins from baker's yeast, Saccharomyces cerevisiae. The aligned segments were repeatedly subdivided and catalogued to identify 217 recurrent sequence motifs of length 8 amino acids or greater. 95% of these motifs occur in more than one cell wall protein. The median length of the motifs is 22 amino acid residues, considerably shorter than protein domains. For many cell wall proteins, these motifs collectively account for more than half of their amino acids. The prevalence of these motifs supports the idea of fungal cell wall proteins as assemblies of recurrent building blocks.
Collapse
Affiliation(s)
- Juan E. Coronado
- Department of Biological Sciences and Center for Gene Structure and Function, Hunter College of City University of New York, New York, NY 10021, USA
| | - Susan L. Epstein
- Department of Computer Science, Hunter College of City University of New York, New York NY 10021, USA
| | - Wei-Gang Qiu
- Department of Biological Sciences and Center for Gene Structure and Function, Hunter College of City University of New York, New York, NY 10021, USA
| | - Peter N. Lipke
- Department of Biological Sciences and Center for Gene Structure and Function, Hunter College of City University of New York, New York, NY 10021, USA
| |
Collapse
|
25
|
Coronado JE, Attie O, Epstein SL, Qiu WG, Lipke PN. Composition-modified matrices improve identification of homologs of saccharomyces cerevisiae low-complexity glycoproteins. Eukaryot Cell 2006; 5:628-37. [PMID: 16607010 PMCID: PMC1459670 DOI: 10.1128/ec.5.4.628-637.2006] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Yeast glycoproteins are representative of low-complexity sequences, those sequences rich in a few types of amino acids. Low-complexity protein sequences comprise more than 10% of the proteome but are poorly aligned by existing methods. Under default conditions, BLAST and FASTA use the scoring matrix BLOSUM62, which is optimized for sequences with diverse amino acid compositions. Because low-complexity sequences are rich in a few amino acids, these tools tend to align the most common residues in nonhomologous positions, thereby generating anomalously high scores, deviations from the expected extreme value distribution, and small e values. This anomalous scoring prevents BLOSUM62-based BLAST and FASTA from identifying correct homologs for proteins with low-complexity sequences, including Saccharomyces cerevisiae wall proteins. We have devised and empirically tested scoring matrices that compensate for the overrepresentation of some amino acids in any query sequence in different ways. These matrices were tested for sensitivity in finding true homologs, discrimination against nonhomologous and random sequences, conformance to the extreme value distribution, and accuracy of e values. Of the tested matrices, the two best matrices (called E and gtQ) gave reliable alignments in BLAST and FASTA searches, identified a consistent set of paralogs of the yeast cell wall test set proteins, and improved the consistency of secondary structure predictions for cell wall proteins.
Collapse
Affiliation(s)
- Juan E Coronado
- Department of Biological Sciences, Hunter College, 695 Park Ave., New York, NY 10021, USA
| | | | | | | | | |
Collapse
|
26
|
Romov PA, Li F, Lipke PN, Epstein SL, Qiu WG. Comparative genomics reveals long, evolutionarily conserved, low-complexity islands in yeast proteins. J Mol Evol 2006; 63:415-25. [PMID: 16927006 DOI: 10.1007/s00239-005-0291-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2005] [Accepted: 04/27/2006] [Indexed: 01/12/2023]
Abstract
Eukaryotic proteomes abound in low-complexity sequences, including tandem repeats and regions with significantly biased amino acid compositions. We assessed the functional importance of compositionally biased sequences in the yeast proteome using an evolutionary analysis of 2838 orthologous open reading frame (ORF) families from three Saccharomyces species (S. cerevisiae, S. bayanus, and S. paradoxus). Sequence conservation was measured by the amino acid sequence variability and by the ratio of nonsynonymous-to-synonymous nucleotide substitutions (K(a)/K(s)) between pairs of orthologous ORFs. A total of 1033 ORF families contained one or more long (at least 45 residues), low-complexity islands as defined by a measure based on the Shannon information index. Low-complexity islands were generally less conserved than ORFs as a whole; on average they were 50% more variable in amino acid sequences and 50% higher in K(a)/K(s) ratios. Fast-evolving low-complexity sequences outnumbered conserved low-complexity sequences by a ratio of 10 to 1. Sequence differences between orthologous ORFs fit well to a selectively neutral Poisson model of sequence divergence. We therefore used the Poisson model to identify conserved low-complexity sequences. ORFs containing the 33 most conserved low-complexity sequences were overrepresented by those encoding nucleic acid binding proteins, cytoskeleton components, and intracellular transporters. While a few conserved low-complexity islands were known functional domains (e.g., DNA/RNA-binding domains), most were uncharacterized. We discuss how comparative genomics of closely related species can be employed further to distinguish functionally important, shorter, low-complexity sequences from the vast majority of such sequences likely maintained by neutral processes.
Collapse
Affiliation(s)
- Philip A Romov
- Department of Computer Science, Hunter College, City University of New York, New York, New York 10021, USA
| | | | | | | | | |
Collapse
|
27
|
Attie O, Bruno JF, Xu Y, Qiu D, Luft BJ, Qiu WG. Co-evolution of the outer surface protein C gene (ospC) and intraspecific lineages of Borrelia burgdorferi sensu stricto in the northeastern United States. Infect Genet Evol 2006; 7:1-12. [PMID: 16684623 DOI: 10.1016/j.meegid.2006.02.008] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2005] [Revised: 02/01/2006] [Accepted: 02/22/2006] [Indexed: 11/15/2022]
Abstract
Clinical and tick isolates of Borrelia burgdorferi sensu stricto, the bacterial agent of Lyme disease, from the northeastern United States were sequenced at 12 loci located on the main chromosome and 7 plasmids (lp54, cp26, cp9, lp17, lp25, lp28-2, and lp38). The outer surface protein C gene (ospC) showed the highest number (12) of major alleles (defined as alleles differing by 5% or more in nucleotide sequence) while other ORFs had only two to four major alleles. A non-recombining chromosomal marker, the rrs-rrlA ribosomal RNA spacer, was used to infer the intraspecific phylogeny among these B. burgdorferi isolates. We were thus able to analyze the multilocus genotypes in the context of a B. burgdorferi intraspecific phylogeny. Except for ospC, sequence variations at plasmid-borne loci showed broad inconsistency with the intraspecific phylogeny, supporting DNA exchanges mediated by plasmid transfers. The multilocus linkage frequently observed in B. burgdorferi populations is due more likely to a "founder effect" than to a lack of recombination. The exceptional phylogenetic consistency of ospC, in conjunction with its selectively maintained high intraspecific diversity, suggested a dominant role ospC plays in the initiation and maintenance of adaptive differentiation in B. burgdorferi.
Collapse
Affiliation(s)
- Oliver Attie
- Department of Biological Sciences, Hunter College of the City University of New York, 695 Park Avenue, New York, NY 10021, United States
| | | | | | | | | | | |
Collapse
|
28
|
Abstract
SUMMARY Nexplorer is a web-based program for interactive browsing and manipulation of character data in NEXUS format, well suited for use with alignments and trees representing families of homologous genes or proteins. Users may upload a sequence family dataset, or choose from one of several thousand already available. Nexplorer provides a flexible means to develop customized views that combine a tree and a data matrix or alignment, to create subsets of data, and to output data files or publication-quality graphics. AVAILABILITY Web access is from http://www.molevol.org/nexplorer
Collapse
Affiliation(s)
- Vivek Gopalan
- Center for Advanced Research in Biotechnology, 9600 Gudelsky Drive, Rockville, MD 20850, USA
| | | | | | | |
Collapse
|
29
|
Qiu WG, Schutzer SE, Bruno JF, Attie O, Xu Y, Dunn JJ, Fraser CM, Casjens SR, Luft BJ. Genetic exchange and plasmid transfers in Borrelia burgdorferi sensu stricto revealed by three-way genome comparisons and multilocus sequence typing. Proc Natl Acad Sci U S A 2004; 101:14150-5. [PMID: 15375210 PMCID: PMC521097 DOI: 10.1073/pnas.0402745101] [Citation(s) in RCA: 103] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Comparative genomics of closely related bacterial isolates is a powerful method for uncovering virulence and other important genome elements. We determined draft sequences (8-fold coverage) of the genomes of strains JD1 and N40 of Borrelia burgdorferi sensu stricto, the causative agent of Lyme disease, and we compared the predicted genes from the two genomes with those from the previously sequenced B31 genome. The three genomes are closely related and are evolutionarily approximately equidistant ( approximately 0.5% pairwise nucleotide differences on the main chromosome). We used a Poisson model of nucleotide substitution to screen for genes with elevated levels of nucleotide polymorphisms. The three-way genome comparison allowed distinction between polymorphisms introduced by mutations and those introduced by recombination using the method of phylogenetic partitioning. Tests for recombination suggested that patches of high-density nucleotide polymorphisms on the chromosome and plasmids arise by DNA exchange. The role of recombination as the main mechanism driving B. burgdorferi diversification was confirmed by multilocus sequence typing of 18 clinical isolates at 18 polymorphic loci. A strong linkage between the multilocus sequence genotypes and the major alleles of outer-surface protein C (ospC) suggested that balancing selection at ospC is a dominant force maintaining B. burgdorferi diversity in local populations. We conclude that B. burgdorferi undergoes genome-wide genetic exchange, including plasmid transfers, and previous reports of its clonality are artifacts from the use of geographically and ecological isolated samples. Frequent recombination implies a potential for rapid adaptive evolution and a possible polygenic basis of B. burgdorferi pathogenicity.
Collapse
Affiliation(s)
- Wei-Gang Qiu
- Department of Biological Sciences, Hunter College of the City University of New York, 695 Park Avenue, New York, NY 10021, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Abstract
Theories regarding the evolution of spliceosomal introns differ in the extent to which the distribution of introns reflects either a formative role in the evolution of protein-coding genes or the adventitious gain of genetic elements. Here, systematic methods are used to assess the causes of the present-day distribution of introns in 10 families of eukaryotic protein-coding genes comprising 1,868 introns in 488 distinct alignment positions. The history of intron evolution inferred using a probabilistic model that allows ancestral inheritance of introns, gain of introns, and loss of introns reveals that the vast majority of introns in these eukaryotic gene families were not inherited from the most recent common ancestral genes, but were gained subsequently. Furthermore, among inferred events of intron gain that meet strict criteria of reliability, the distribution of sites of gain with respect to reading-frame phase shows a 5:3:2 ratio of phases 0, 1 and 2, respectively, and exhibits a nucleotide preference for MAG GT (positions -3 to +2 relative to the site of gain). The nucleotide preferences of intron gain may prove to be the ultimate cause for the phase bias. The phase bias of intron gain is sufficient to account quantitatively for the well-known 5:3:2 bias in phase frequencies among extant introns, a conclusion that holds even when taxonomic heterogeneity in phase patterns is considered. Thus, intron gain accounts for the vast majority of extant introns and for the bias toward phase 0 introns that previously was interpreted as evidence for ancient formative introns.
Collapse
Affiliation(s)
- Wei-Gang Qiu
- Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, Rockville, Maryland, USA
| | | | | |
Collapse
|
31
|
Qiu WG, Dykhuizen DE, Acosta MS, Luft BJ. Geographic uniformity of the Lyme disease spirochete (Borrelia burgdorferi) and its shared history with tick vector (Ixodes scapularis) in the Northeastern United States. Genetics 2002; 160:833-49. [PMID: 11901105 PMCID: PMC1462027 DOI: 10.1093/genetics/160.3.833] [Citation(s) in RCA: 184] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Over 80% of reported cases of Lyme disease in the United States occur in coastal regions of northeastern and mid-Atlantic states. The genetic structure of the Lyme disease spirochete (Borrelia burgdorferi) and its main tick vector (Ixodes scapularis) was studied concurrently and comparatively by sampling natural populations of I. scapularis ticks along the East Coast from 1996 to 1998. Borrelia is genetically highly diverse at the outer surface protein ospC. Since Borrelia is highly clonal, the ospC alleles can be used to define clones. A newly designed reverse line blotting (RLB) assay shows that up to 10 Borrelia clones can infect a single tick. The clone frequencies in Borrelia populations are the same across the Northeast. On the other hand, I. scapularis populations show strong regional divergence (among northeastern, mid-Atlantic, and southern states) as well as local differentiation. The high genetic diversity within Borrelia populations and the disparity in the genetic structure between Borrelia and its tick vector are likely consequences of strong balancing selection on local Borrelia clones. Demographically, both Borrelia and I. scapularis populations in the Northeast show the characteristics of a species that has recently expanded from a population bottleneck. Major geological and ecological events, such as the last glacial maximum (18,000 years ago) and the modern-day expansion of tick habitats, are likely causes of the observed "founder effects" for the two organisms in the Northeast. We therefore conclude that the genetic structure of B. burgdorferi has been intimately shaped by the natural history of its main vector, the northern lineage of I. scapularis ticks.
Collapse
Affiliation(s)
- Wei-Gang Qiu
- Department of Ecology and Evolution, State University of New York, Stony Brook, New York 11794-5245, USA
| | | | | | | |
Collapse
|
32
|
Abstract
Recent breakthroughs in molecular technology, most significantly the polymerase chain reaction (PCR) and in situ hybridization, have allowed the detection of genetic variation in bacterial communities without prior cultivation. These methods often produce data in the form of the presence or absence of alleles or genotypes, however, rather than counts of alleles. Using relative allele frequencies from presence-absence data as estimates of population allele frequencies tends to underestimate the frequencies of common alleles and overestimate those of rare ones, potentially biasing the results of a test of neutrality in favor of balancing selection. In this study, a maximum-likelihood estimator (MLE) of bacterial allele frequencies designed for use with presence-absence data is derived using an explicit stochastic model of the host infection (or bacterial sampling) process. The performance of the MLE is evaluated using computer simulation and a method is presented for evaluating the fit of estimated allele frequencies to the neutral infinite alleles model (IAM). The methods are applied to estimate allele frequencies at two outer surface protein loci (ospA and ospC) of the Lyme disease spirochete, Borrelia burgdorferi, infecting local populations of deer ticks (Ixodes scapularis) and to test the fit to a neutral IAM.
Collapse
Affiliation(s)
- B Rannala
- Department of Ecology and Evolution, State University of New York, Stony Brook 11754-5245, USA.
| | | | | |
Collapse
|
33
|
Qiu WG, Bosler EM, Campbell JR, Ugine GD, Wang IN, Luft BJ, Dykhuizen DE. A population genetic study of Borrelia burgdorferi sensu stricto from eastern Long Island, New York, suggested frequency-dependent selection, gene flow and host adaptation. Hereditas 1998; 127:203-16. [PMID: 9474903 DOI: 10.1111/j.1601-5223.1997.00203.x] [Citation(s) in RCA: 51] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Eastern Long Island, New York, is one of the major foci of Lyme disease in the United States. As in almost all other parts of North America, Lyme disease in this region is caused by a single genomic species of spirochete, Borrelia burgdorferi sensu stricto. For three consecutive years, natural populations of Lyme Borrelia in this region were sampled and studied for gene flow among different locations, changes in population structure over time, and selective forces. The genetic diversity of Borrelia populations was measured at the outer surface protein A (ospA) locus using Cold Single-Stranded Conformation Polymorphism (Cold SSCP) analysis. The Borrelia populations were found to be highly polymorphic within any of thirteen local populations. Ewens-Watterson tests of neutrality revealed that the high level of genetic diversity within local Borrelia populations is maintained by balancing selection. Frequency-dependent selection for the different strains distinguished by the ospA alleles is likely the mechanism of the balancing selection. Allele frequency distributions of Borrelia populations were homogeneous across the region in any particular year, although different infection rates of local tick (Ixodes scapularis) populations suggested that the Borrelia populations were at least partially isolated. Since the allele frequency distribution changed over time, while remaining homogeneous over space, the nearly uniform allele frequency distribution across the region cannot be explained by recent geographic expansion from a single population. This uniform distribution across the region thus may be maintained by selection, or by a significant amount of migration or both. The genetic structure of B. burgdorferi sensu stricto also differed between spirochetes infecting nymphal ticks and those infecting adult ticks. Since larval and nymphal ticks have distinctly different host feeding preferences, host adaptation of spirochete populations is implied. This distinction and an animal study using chipmunks suggest that ticks infected by Borrelia as larvae may have high mortality in the wild. This study represents a genetic analysis of local populations of a bacterial species.
Collapse
Affiliation(s)
- W G Qiu
- Department of Ecology and Evolution, State University of New York at Stony Brook, USA
| | | | | | | | | | | | | |
Collapse
|