51
|
Cohen O, Pupko T. Inference and characterization of horizontally transferred gene families using stochastic mapping. Mol Biol Evol 2009; 27:703-13. [PMID: 19808865 PMCID: PMC2822287 DOI: 10.1093/molbev/msp240] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Macrogenomic events, in which genes are gained and lost, play a pivotal evolutionary role in microbial evolution. Nevertheless, probabilistic-evolutionary models describing such events and methods for their robust inference are considerably less developed than existing methodologies for analyzing site-specific sequence evolution. Here, we present a novel method for the inference of gains and losses of gene families. First, we develop probabilistic-evolutionary models describing the dynamics of gene-family content, which are more biologically realistic than previously suggested models. In our likelihood-based models, gains and losses are represented by transitions between presence and absence, given an underlying phylogeny. We employ a mixture-model approach in which we allow both the gain rate and the loss rate to vary among gene families. Second, we use these models together with the analytic implementation of stochastic mapping to infer branch-specific events. Our novel methodology allows us to infer and quantify horizontal gene transfer (HGT) events. This enables us to rank various gene families and lineages according to their propensity to undergo gains and losses. Applying our methodology to 4,873 gene families shows that: 1) the novel mixture models describe the observed variability in gene-family content among microbes significantly better than previous models; 2) The stochastic mapping approach enables accurate inference of gain and loss events based on simulations; 3) At least 34% of the gene families analyzed are inferred to have experienced HGT at least once during their evolution; and 4) Gene families that were inferred to experience HGT are both enriched and depleted with respect to specific functional categories.
Collapse
Affiliation(s)
- Ofir Cohen
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | | |
Collapse
|
52
|
Csurös M, Miklós I. Streamlining and large ancestral genomes in Archaea inferred with a phylogenetic birth-and-death model. Mol Biol Evol 2009; 26:2087-95. [PMID: 19570746 PMCID: PMC2726834 DOI: 10.1093/molbev/msp123] [Citation(s) in RCA: 106] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Homologous genes originate from a common ancestor through vertical inheritance, duplication, or horizontal gene transfer. Entire homolog families spawned by a single ancestral gene can be identified across multiple genomes based on protein sequence similarity. The sequences, however, do not always reveal conclusively the history of large families. To study the evolution of complete gene repertoires, we propose here a mathematical framework that does not rely on resolved gene family histories. We show that so-called phylogenetic profiles, formed by family sizes across multiple genomes, are sufficient to infer principal evolutionary trends. The main novelty in our approach is an efficient algorithm to compute the likelihood of a phylogenetic profile in a model of birth-and-death processes acting on a phylogeny. We examine known gene families in 28 archaeal genomes using a probabilistic model that involves lineage- and family-specific components of gene acquisition, duplication, and loss. The model enables us to consider all possible histories when inferring statistics about archaeal evolution. According to our reconstruction, most lineages are characterized by a net loss of gene families. Major increases in gene repertoire have occurred only a few times. Our reconstruction underlines the importance of persistent streamlining processes in shaping genome composition in Archaea. It also suggests that early archaeal genomes were as complex as typical modern ones, and even show signs, in the case of the methanogenic ancestor, of an extremely large gene repertoire.
Collapse
Affiliation(s)
- Miklós Csurös
- Department of Computer Science and Operations Research, University of Montréal, Montréal, Canada.
| | | |
Collapse
|
53
|
Wang X, Gowik U, Tang H, Bowers JE, Westhoff P, Paterson AH. Comparative genomic analysis of C4 photosynthetic pathway evolution in grasses. Genome Biol 2009; 10:R68. [PMID: 19549309 PMCID: PMC2718502 DOI: 10.1186/gb-2009-10-6-r68] [Citation(s) in RCA: 100] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2009] [Revised: 05/27/2009] [Accepted: 06/23/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Sorghum is the first C4 plant and the second grass with a full genome sequence available. This makes it possible to perform a whole-genome-level exploration of C4 pathway evolution by comparing key photosynthetic enzyme genes in sorghum, maize (C4) and rice (C3), and to investigate a long-standing hypothesis that a reservoir of duplicated genes is a prerequisite for the evolution of C4 photosynthesis from a C3 progenitor. RESULTS We show that both whole-genome and individual gene duplication have contributed to the evolution of C4 photosynthesis. The C4 gene isoforms show differential duplicability, with some C4 genes being recruited from whole genome duplication duplicates by multiple modes of functional innovation. The sorghum and maize carbonic anhydrase genes display a novel mode of new gene formation, with recursive tandem duplication and gene fusion accompanied by adaptive evolution to produce C4 genes with one to three functional units. Other C4 enzymes in sorghum and maize also show evidence of adaptive evolution, though differing in level and mode. Intriguingly, a phosphoenolpyruvate carboxylase gene in the C3 plant rice has also been evolving rapidly and shows evidence of adaptive evolution, although lacking key mutations that are characteristic of C4 metabolism. We also found evidence that both gene redundancy and alternative splicing may have sheltered the evolution of new function. CONCLUSIONS Gene duplication followed by functional innovation is common to evolution of most but not all C4 genes. The apparently long time-lag between the availability of duplicates for recruitment into C4 and the appearance of C4 grasses, together with the heterogeneity of origins of C4 genes, suggests that there may have been a long transition process before the establishment of C4 photosynthesis.
Collapse
Affiliation(s)
- Xiyin Wang
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA
- College of Sciences, Hebei Polytechnic University, Tangshan, Hebei 063000, China
| | - Udo Gowik
- Institut fur Entwicklungs- und Molekularbiologie der Pflanzen, Heinrich-Heine-Universitat 1, Universitatsstrasse, D-40225 Dusseldorf, Germany
| | - Haibao Tang
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA
- Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
| | - John E Bowers
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA
| | - Peter Westhoff
- Institut fur Entwicklungs- und Molekularbiologie der Pflanzen, Heinrich-Heine-Universitat 1, Universitatsstrasse, D-40225 Dusseldorf, Germany
| | - Andrew H Paterson
- Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA
- Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
54
|
Cohen O, Rubinstein ND, Stern A, Gophna U, Pupko T. A likelihood framework to analyse phyletic patterns. Philos Trans R Soc Lond B Biol Sci 2009; 363:3903-11. [PMID: 18852099 DOI: 10.1098/rstb.2008.0177] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Probabilistic evolutionary models revolutionized our capability to extract biological insights from sequence data. While these models accurately describe the stochastic processes of site-specific substitutions, single-base substitutions represent only a fraction of all the events that shape genomes. Specifically, in microbes, events in which entire genes are gained (e.g. via horizontal gene transfer) and lost play a pivotal evolutionary role. In this research, we present a novel likelihood-based evolutionary model for gene gains and losses, and use it to analyse genome-wide patterns of the presence and absence of gene families. The model assumes a Markovian stochastic process, where gains and losses are represented by the transition between presence and absence, respectively, given an underlying phylogenetic tree. To account for differences in the rates of gain and loss of different gene families, we assume among-gene family rate variability, thus allowing for more accurate description of the data. Using the Bayesian approach, we estimated an evolutionary rate for each gene family. Simulation studies demonstrated that our methodology accurately infers these rates. Our methodology was applied to analyse a large corpus of data, consisting of 4873 gene families spanning 63 species and revealed novel insights regarding the evolutionary nature of genome-wide gain and loss dynamics.
Collapse
Affiliation(s)
- Ofir Cohen
- Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel
| | | | | | | | | |
Collapse
|
55
|
Luo R, Larget B. Modeling substitution and indel processes for AFLP marker evolution and phylogenetic inference. Ann Appl Stat 2009. [DOI: 10.1214/08-aoas212] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
56
|
|
57
|
Hao W, Golding GB. Uncovering rate variation of lateral gene transfer during bacterial genome evolution. BMC Genomics 2008; 9:235. [PMID: 18492275 PMCID: PMC2426709 DOI: 10.1186/1471-2164-9-235] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2007] [Accepted: 05/20/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Large scale genome arrangement, such as whole gene insertion/deletion, plays an important role in bacterial genome evolution. Various methods have been employed to study the dynamic process of gene insertions and deletions, such as parsimony methods and maximum likelihood methods. Previous maximum likelihood studies have assumed that the rate of gene insertions/deletions is constant over different genes. This assumption is unrealistic. For instance, it has been shown that informational genes are less likely to be laterally transferred than non-informational genes. However, how much of the variation in gene transfer rates is due to the difference between informational genes and non-informational genes is unclear. In this study, a Gamma-distribution was incorporated in the likelihood estimation by considering rate variation for gene insertions/deletions between genes. This makes it possible to address whether a difference between informational genes and non-informational genes is the main contributor to rate variation of lateral gene transfers. RESULTS The results show that models incorporating rate variation fit the data better than do constant rate models in many phylogenetic groups. Even though informational genes are less likely to be laterally transferred than non-informational genes, the degree of rate variation for insertions/deletions did not change dramatically and remained high even when informational genes were excluded from the study. This suggests that the variation in rate of insertions/deletions is not due mainly to the simple difference between informational genes and non-informational genes. Among genes that are not classified as informational and among the informational genes themselves, there are still large differences in the rates that these genes are inserted and deleted. CONCLUSION While the difference in informational gene rates contributes to rate variation, it is only a small fraction of the variation present; instead, a substantial amount of rate variation for insertions/deletions remains among both informational genes and among non-informational genes.
Collapse
Affiliation(s)
- Weilong Hao
- Department of Biology, McMaster University, Hamilton, Ontario L8S 4K1, Canada .
| | | |
Collapse
|
58
|
Ridenhour BJ, Storfer AT. Geographically variable selection in Ambystoma tigrinum virus (Iridoviridae) throughout the western USA. J Evol Biol 2008; 21:1151-9. [PMID: 18444995 DOI: 10.1111/j.1420-9101.2008.01537.x] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
We investigated spatially variable selection in Ambystoma tigrinum virus (ATV) which causes frequent and geographically widespread epizootics of the tiger salamander, Ambystoma tigrinum. To test for evidence of selection, we sequenced several coding and noncoding regions from virus strains isolated from epizootics throughout western North America. Three of the sequenced regions contained homologues for genes putatively involved in host immune evasion and virulence: eIF-2alpha, caspase activation and recruitment domain (CARD) and beta-OH-steroid oxidoreductase. Selection analysis showed evidence of very strong purifying selection on eIF-2alpha, purifying selection within certain viral clades on CARD and positive selection on beta-OH-steroid oxidoreductase within certain clades. Analysis using MULTIDIVTIME and Tajima's relative rate tests indicate accelerated rates of evolution within clades associated with anthropogenic movement. These clades also demonstrate greater spatial variability in selection, suggesting a lack of local adaptation (i.e. locally adapted populations should exhibit little to no selection because of absent or reduced variation in fitness once a fitness optimum is reached). Increased transfer of non-native viral strains to naïve salamander populations, in conjunction with local maladaptation as a result of local selection pressures, may explain the spread and emergence of ATV epizootics in A. tigrinum in western North America.
Collapse
Affiliation(s)
- B J Ridenhour
- School of Biological Sciences, Washington State University, Pullman, WA, USA.
| | | |
Collapse
|
59
|
Five Drosophila genomes reveal nonneutral evolution and the signature of host specialization in the chemoreceptor superfamily. Genetics 2008; 177:1395-416. [PMID: 18039874 DOI: 10.1534/genetics.107.078683] [Citation(s) in RCA: 157] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The insect chemoreceptor superfamily comprises the olfactory receptor (Or) and gustatory receptor (Gr) multigene families. These families give insects the ability to smell and taste chemicals in the environment and are thus rich resources for linking molecular evolutionary and ecological processes. Although dramatic differences in family size among distant species and high divergence among paralogs have led to the belief that the two families evolve rapidly, a lack of evolutionary data over short time scales has frustrated efforts to identify the major forces shaping this evolution. Here, we investigate patterns of gene loss/gain, divergence, and polymorphism in the entire repertoire of approximately 130 chemoreceptor genes from five closely related species of Drosophila that share a common ancestor within the past 12 million years. We demonstrate that the overall evolution of the Or and Gr families is nonneutral. We also show that selection regimes differ both between the two families as wholes and within each family among groups of genes with varying functions, patterns of expression, and phylogenetic histories. Finally, we find that the independent evolution of host specialization in Drosophila sechellia and D. erecta is associated with a fivefold acceleration of gene loss and increased rates of amino acid evolution at receptors that remain intact. Gene loss appears to primarily affect Grs that respond to bitter compounds while elevated Ka/Ks is most pronounced in the subset of Ors that are expressed in larvae. Our results provide strong evidence that the observed phenomena result from the invasion of a novel ecological niche and present a unique synthesis of molecular evolutionary analyses with ecological data.
Collapse
|
60
|
Abstract
UNLABELLED Many fundamental questions concerning the emergence and subsequent evolution of eukaryotic exon-intron organization are still unsettled. Genome-scale comparative studies, which can shed light on crucial aspects of eukaryotic evolution, require adequate computational tools. We describe novel computational methods for studying spliceosomal intron evolution. Our goal is to give a reliable characterization of the dynamics of intron evolution. Our algorithmic innovations address the identification of orthologous introns, and the likelihood-based analysis of intron data. We discuss a compression method for the evaluation of the likelihood function, which is noteworthy for phylogenetic likelihood problems in general. We prove that after O(n l) preprocessing time, subsequent evaluations take O(n l/log l) time almost surely in the Yule-Harding random model of n-taxon phylogenies, where l is the input sequence length. We illustrate the practicality of our methods by compiling and analyzing a data set involving 18 eukaryotes, which is more than in any other study to date. The study yields the surprising result that ancestral eukaryotes were fairly intron-rich. For example, the bilaterian ancestor is estimated to have had more than 90% as many introns as vertebrates do now. AVAILABILITY The Java implementations of the algorithms are publicly available from the corresponding author's site http://www.iro.umontreal.ca/~csuros/introns/. Data are available on request.
Collapse
Affiliation(s)
- Miklós Csurös
- Department of Computer Science and Operations Research, Université de Montréal, Québec, Canada.
| | | | | |
Collapse
|
61
|
Abstract
The process of inferring phylogenetic trees from molecular sequences almost always starts with a multiple alignment of these sequences but can also be based on methods that do not involve multiple sequence alignment. Very little is known about the accuracy with which such alignment-free methods recover the correct phylogeny or about the potential for increasing their accuracy. We conducted a large-scale comparison of ten alignment-free methods, among them one new approach that does not calculate distances and a faster variant of our pattern-based approach; all distance-based alignment-free methods are freely available from http://www.bioinformatics.org.au (as Python package decaf+py). We show that most methods exhibit a higher overall reconstruction accuracy in the presence of high among-site rate variation. Under all conditions that we considered, variants of the pattern-based approach were significantly better than the other alignment-free methods. The new pattern-based variant achieved a speed-up of an order of magnitude in the distance calculation step, accompanied by a small loss of tree reconstruction accuracy. A method of Bayesian inference from k-mers did not improve on classical alignment-free (and distance-based) methods but may still offer other advantages due to its Bayesian nature. We found the optimal word length k of word-based methods to be stable across various data sets, and we provide parameter ranges for two different alphabets. The influence of these alphabets was analyzed to reveal a trade-off in reconstruction accuracy between long and short branches. We have mapped the phylogenetic accuracy for many alignment-free methods, among them several recently introduced ones, and increased our understanding of their behavior in response to biologically important parameters. In all experiments, the pattern-based approach emerged as superior, at the expense of higher resource consumption. Nonetheless, no alignment-free method that we examined recovers the correct phylogeny as accurately as does an approach based on maximum-likelihood distance estimates of multiply aligned sequences.
Collapse
Affiliation(s)
- Michael Höhl
- Australian Research Council Centre in Bioinformatics, and Institute for Molecular Bioscience, The University of QueenslandBrisbane, QLD 4072, Australia E-mail:
| | - Mark A. Ragan
- Australian Research Council Centre in Bioinformatics, and Institute for Molecular Bioscience, The University of QueenslandBrisbane, QLD 4072, Australia E-mail:
| |
Collapse
|
62
|
Hao W, Golding GB. The fate of laterally transferred genes: life in the fast lane to adaptation or death. Genome Res 2006; 16:636-43. [PMID: 16651664 PMCID: PMC1457040 DOI: 10.1101/gr.4746406] [Citation(s) in RCA: 126] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Large-scale genome arrangement plays an important role in bacterial genome evolution. A substantial number of genes can be inserted into, deleted from, or rearranged within genomes during evolution. Detecting or inferring gene insertions/deletions is of interest because such information provides insights into bacterial genome evolution and speciation. However, efficient inference of genome events is difficult because genome comparisons alone do not generally supply enough information to distinguish insertions, deletions, and other rearrangements. In this study, homologous genes from the complete genomes of 13 closely related bacteria were examined. The presence or absence of genes from each genome was cataloged, and a maximum likelihood method was used to infer insertion/deletion rates according to the phylogenetic history of the taxa. It was found that whole gene insertions/deletions in genomes occur at rates comparable to or greater than the rate of nucleotide substitution and that higher insertion/deletion rates are often inferred to be present at the tips of the phylogeny with lower rates on more ancient interior branches. Recently transferred genes are under faster and relaxed evolution compared with more ancient genes. Together, this implies that many of the lineage-specific insertions are lost quickly during evolution and that perhaps a few of the genes inserted by lateral transfer are niche specific.
Collapse
Affiliation(s)
- Weilong Hao
- Department of Biology, McMaster University, Hamilton, Ontario, Canada L8S 4K1
| | - G. Brian Golding
- Department of Biology, McMaster University, Hamilton, Ontario, Canada L8S 4K1
- Corresponding author.E-mail ; fax (905) 522-6066
| |
Collapse
|
63
|
Davidov Y, Jurkevitch E. Diversity and evolution of Bdellovibrio-and-like organisms (BALOs), reclassification of Bacteriovorax starrii as Peredibacter starrii gen. nov., comb. nov., and description of the Bacteriovorax–Peredibacter clade as Bacteriovoracaceae fam. nov. Int J Syst Evol Microbiol 2004; 54:1439-1452. [PMID: 15388693 DOI: 10.1099/ijs.0.02978-0] [Citation(s) in RCA: 122] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A phylogenetic analysis of Bdellovibrio-and-like organisms (BALOs) was performed. It was based on the characterization of 71 strains and on all consequent 16S rRNA gene sequences available in databases, including clones identified by data-mining, totalling 120 strains from very varied biotopes. Amplified rDNA restriction analysis (ARDRA) accurately reflected the diversity and phylogenetic affiliation of BALOs, thereby providing an efficient screening tool. Extensive phylogenetic analysis of the 16S rRNA gene sequences revealed great diversity within the Bdellovibrio (>14 % divergence) and Bacteriovorax (>16 %) clades, which comprised nine and eight clusters, respectively, exhibiting more than 3 % intra-cluster divergence. The clades diverged by more than 20 %. The analysis of conserved 16S rRNA secondary structures showed that Bdellovibrio contained motifs atypical of the δ-Proteobacteria, suggesting that it is ancestral to Bacteriovorax. While none of the Bdellovibrio strains were of marine origin, Bacteriovorax included separate soil/freshwater and marine-specific groups. On the basis of their extensive diversity and the large distance separating the groups, it is proposed that Bacteriovorax starrii be placed into a new genus, Peredibacter gen. nov., with Peredibacter starrii A3.12T (=ATCC 15145T=NCCB 72004T) as its type strain. Also proposed is a redefinition of the Bdellovibrio and the Bacteriovorax–Peredibacter lineages as two different families, i.e. ‘Bdellovibrionaceae’ and a new family, Bacteriovoracaceae. Also, a re-evaluation of oligonucleotides targeting BALOs is presented, and the implications of the large diversity of these organisms and of their distribution in very different environments are discussed.
Collapse
MESH Headings
- Bdellovibrio/classification
- Bdellovibrio/genetics
- Biological Evolution
- Cluster Analysis
- DNA Fingerprinting
- DNA, Bacterial/chemistry
- DNA, Bacterial/isolation & purification
- DNA, Ribosomal/analysis
- DNA, Ribosomal/chemistry
- DNA, Ribosomal/isolation & purification
- Deltaproteobacteria/classification
- Deltaproteobacteria/genetics
- Genes, Bacterial
- Genes, rRNA
- Molecular Sequence Data
- Nucleic Acid Conformation
- Phylogeny
- Polymorphism, Restriction Fragment Length
- RNA, Bacterial/genetics
- RNA, Ribosomal, 16S/genetics
- Sequence Analysis, DNA
- Soil Microbiology
- Water Microbiology
Collapse
Affiliation(s)
- Yaacov Davidov
- Department of Plant Pathology and Microbiology, Faculty of Agricultural, Food and Environmental Quality Sciences, The Hebrew University of Jerusalem, PO Box 12, Rehovot 76100, Israel
| | - Edouard Jurkevitch
- Department of Plant Pathology and Microbiology, Faculty of Agricultural, Food and Environmental Quality Sciences, The Hebrew University of Jerusalem, PO Box 12, Rehovot 76100, Israel
| |
Collapse
|
64
|
Lott TJ, Effat MM. Evidence for a more recently evolved clade within a Candida albicans North American population. MICROBIOLOGY (READING, ENGLAND) 2001; 147:1687-1692. [PMID: 11390700 DOI: 10.1099/00221287-147-6-1687] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Candida albicans is diploid and displays a primarily clonal mode of reproduction. There is, however, evidence for meiosis and the degree to which this occurs in nature is unknown. Although random mating would act to obscure clonal lineages, previous studies have demonstrated that collections of North American isolates display three major partitions with no evidence of geographic clustering. To better understand the extent of sexuality and its role in the phylogeny of the species, a reference subset of 50 isolates representing this tripartite division was analysed using 1 minisatellite, 5 microsatellites (MSs) and 15 nuclear polymorphisms (NP). A total of 87 alleles were observed for 21 loci and 12/16 informative loci exhibited a departure from Hardy-Weinberg expectations (G(2)</=0.05). We did not observe an absolute correlation between MSs and NP, although isolates with identical NP genotypes were correlated with a previously defined, predominant class (putative group I). The use of additional markers did not give increased support for the tripartite structure of the population. However, (9/19) group I isolates were found to be highly related, differing by only one or a few alleles. Designated subgroup A, the interpretation is that these isolates are related by descent and that they are of a more recent evolutionary origin, diverging from an ancestral group I clone. The reason for their relative abundance in the population is unknown; one possibility is that they may be under positive selection.
Collapse
Affiliation(s)
- Timothy J Lott
- Mycotic Diseases Branch, Division of Bacterial and Mycotic Diseases, National Center for Infectious Diseases, Centers for Disease Control and Prevention, 1600 Clifton Road, Atlanta, GA 30333, USA1
| | - Mohammad M Effat
- Mycotic Diseases Branch, Division of Bacterial and Mycotic Diseases, National Center for Infectious Diseases, Centers for Disease Control and Prevention, 1600 Clifton Road, Atlanta, GA 30333, USA1
| |
Collapse
|
65
|
Minkwitz A, Berg G. Comparison of antifungal activities and 16S ribosomal DNA sequences of clinical and environmental isolates of Stenotrophomonas maltophilia. J Clin Microbiol 2001; 39:139-45. [PMID: 11136762 PMCID: PMC87693 DOI: 10.1128/jcm.39.1.139-145.2001] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2000] [Accepted: 10/10/2000] [Indexed: 11/20/2022] Open
Abstract
In recent years, the gram-negative bacterium Stenotrophomonas maltophilia has become increasingly important in biotechnology and as a nosocomial pathogen, giving rise to a need for new information about its taxonomy and epidemiology. To determine intraspecies diversity and whether strains can be distinguished based on the sources of their isolation, 50 S. maltophilia isolates from clinical and environmental sources, including strains of biotechnological interest, were investigated. The isolates were characterized by in vitro antagonism against pathogenic fungi and the production of antifungal metabolites and enzymes. Phenotypically the strains showed variability that did not correlate significantly with their sources of isolation. Clinical strains displayed remarkable activity against the human pathogenic fungus Candida albicans. Antifungal activity against plant pathogens was more common and generally more severe from the environmental isolates, although not exclusive to them. All isolates, clinical and environmental, produced a range of antifungal metabolites including antibiotics, siderophores, and the enzymes proteases and chitinases. From 16S ribosomal DNA sequencing analysis, the isolates could be separated into three clusters, two of which consisted of isolates originating from the environment, especially rhizosphere isolates, and one of which consisted of clinical and aquatic strains. In contrast to the results of other recent investigations, these strains could be grouped based on their sources of isolation, with the exception of three rhizosphere isolates. Because there was evidence of nucleotide signature positions within the sequences that are suitable for distinguishing among the clusters, the clusters could be defined as different genomovars of S. maltophilia. Key sequences on the 16S ribosomal DNA could be used to develop a diagnostic method that differentiates these genomovars.
Collapse
Affiliation(s)
- A Minkwitz
- Department of Biosciences, Microbiology, University of Rostock, D-18055 Rostock, Germany
| | | |
Collapse
|
66
|
Wang Y, Zhang Z. Comparative sequence analyses reveal frequent occurrence of short segments containing an abnormally high number of non-random base variations in bacterial rRNA genes. MICROBIOLOGY (READING, ENGLAND) 2000; 146 ( Pt 11):2845-2854. [PMID: 11065363 DOI: 10.1099/00221287-146-11-2845] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
rRNA genes are thought unlikely to be laterally transferred, because rRNA must coevolve with a large number of cellular components to form the highly sophisticated translation apparatus and perform protein synthesis. In this paper, the authors first hypothesized that lateral gene transfer (LGT) might occur to rRNA genes via replacement of gene segments encoding individual domains of rRNA: the 'simplified complexity hypothesis'. Comparative sequence analyses of the 16S and 23S rRNA genes from a large number of actinomycete species frequently identified rRNA genes containing short segments with an abnormally high number of non-random base variations. These variations were nearly always characterized by complementing covariations of several paired bases within the stem of a hairpin. The nature of these base variations is not consistent with random mutations but satisfies well the predictions of the 'simplified complexity hypothesis'. The most parsimonious explanation for this phenomenon is the lateral transfer of rRNA gene segments between different bacterial species. This mode of LGT may create mosaic rRNA genes and occur repeatedly in different regions of a gene, gradually destroying the evolutionary history recorded in the nucleotide sequence.
Collapse
MESH Headings
- Actinomycetales/classification
- Actinomycetales/genetics
- Base Pairing
- Base Sequence
- DNA Primers/genetics
- DNA, Bacterial/genetics
- DNA, Ribosomal/genetics
- Evolution, Molecular
- Genes, Bacterial
- Models, Genetic
- Molecular Sequence Data
- Nucleic Acid Conformation
- Phylogeny
- RNA, Bacterial/chemistry
- RNA, Bacterial/genetics
- RNA, Ribosomal, 16S/chemistry
- RNA, Ribosomal, 16S/genetics
- RNA, Ribosomal, 23S/chemistry
- RNA, Ribosomal, 23S/genetics
- Sequence Analysis, RNA
- Sequence Homology, Nucleic Acid
- Species Specificity
Collapse
Affiliation(s)
- Yue Wang
- Microbial Collection and Screening Laboratory, Institute of Molecular and Cell Biology, 30 Medical Drive, Singapore 1176091
| | - Zhenshui Zhang
- Microbial Collection and Screening Laboratory, Institute of Molecular and Cell Biology, 30 Medical Drive, Singapore 1176091
| |
Collapse
|
67
|
Kuhner MK, Beerli P, Yamato J, Felsenstein J. Usefulness of single nucleotide polymorphism data for estimating population parameters. Genetics 2000; 156:439-47. [PMID: 10978306 PMCID: PMC1461258 DOI: 10.1093/genetics/156.1.439] [Citation(s) in RCA: 108] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Single nucleotide polymorphism (SNP) data can be used for parameter estimation via maximum likelihood methods as long as the way in which the SNPs were determined is known, so that an appropriate likelihood formula can be constructed. We present such likelihoods for several sampling methods. As a test of these approaches, we consider use of SNPs to estimate the parameter Theta = 4N(e)micro (the scaled product of effective population size and per-site mutation rate), which is related to the branch lengths of the reconstructed genealogy. With infinite amounts of data, ML models using SNP data are expected to produce consistent estimates of Theta. With finite amounts of data the estimates are accurate when Theta is high, but tend to be biased upward when Theta is low. If recombination is present and not allowed for in the analysis, the results are additionally biased upward, but this effect can be removed by incorporating recombination into the analysis. SNPs defined as sites that are polymorphic in the actual sample under consideration (sample SNPs) are somewhat more accurate for estimation of Theta than SNPs defined by their polymorphism in a panel chosen from the same population (panel SNPs). Misrepresenting panel SNPs as sample SNPs leads to large errors in the maximum likelihood estimate of Theta. Researchers collecting SNPs should collect and preserve information about the method of ascertainment so that the data can be accurately analyzed.
Collapse
Affiliation(s)
- M K Kuhner
- Department of Genetics, University of Washington, Seattle, Washington 98195-7360, USA.
| | | | | | | |
Collapse
|
68
|
Abstract
We derive a Markov chain to sample from the posterior distribution for a phylogenetic tree given sequence information from the corresponding set of organisms, a stochastic model for these data, and a prior distribution on the space of trees. A transformation of the tree into a canonical cophenetic matrix form suggests a simple and effective proposal distribution for selecting candidate trees close to the current tree in the chain. We illustrate the algorithm with restriction site data on 9 plant species, then extend to DNA sequences from 32 species of fish. The algorithm mixes well in both examples from random starting trees, generating reproducible estimates and credible sets for the path of evolution.
Collapse
Affiliation(s)
- B Mau
- Department of Statistics, University of Wisconsin-Madison, 53706-1685, USA.
| | | | | |
Collapse
|
69
|
Reconstruction of phylogenetic relationships. J Biosci 1999. [DOI: 10.1007/bf02941116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
70
|
|
71
|
Persing DH. Nucleic Acid-Based Discovery Techniques for Potential Xenozoonotic Pathogens. Xenotransplantation 1997. [DOI: 10.1007/978-3-642-60572-7_54] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
72
|
Gray EM. DNA Fingerprinting Reveals a Lack of Genetic Variation in Northern Populations of the Western Pond Turtle (Clemmys marmorata). CONSERVATION BIOLOGY : THE JOURNAL OF THE SOCIETY FOR CONSERVATION BIOLOGY 1995; 9:1244-1255. [PMID: 34261276 DOI: 10.1046/j.1523-1739.1995.9051234.x-i1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
I used DNA fingerprinting to provide the first analysis of the genetic composition of western pond turtle (Clemmys marmorata( populations in Washington, Oregon, and California. Populations of the western pond turtle in Washington and northern Oregon are rapidly approaching extinction. Genetic similarity within the largest northern populations, which are located inland, is high. An analysis of population substructure (Fst ) revealed significant genetic divergence between inland populations, indicating a lack of dispersal and gene flow between sites. In contrast, northern coastal sites are not genetically distinct, but there are few if any viable populations remaining in this region. Genetic variability within southern California populations is a great deal higher than in northern inland sites. Similarly, a low Fst value indicated a lack of genetic differentiation between southern sites. An inter-regional analysis of population substructure (Fst = 0.24) revealed a significant degree of genetic divergence between geographical regions throughout the range. In addition, an estimate of western pond turtle phylogeny showed a genetic break in the species between northern and southern populations. Both population subdivision and phylogenetic analyses suggest a lack of appreciable gene flow between geographical regions for a considerable period of time. Genetic analyses support traditional subdivision based solely on the morphological variation of Clemmys marmorata into two subspecies: northern Clemmys marmorata marmorata and southern Clemmys marmorata pallida. Recovery of dwindling northern populations must combine demographic and genetic considerations. A first step should be to preserve local gene pools while augmenting population numbers, with the goal of preventing the extinction of this genetically and morphologically distinct subspecies. La falta de variación genetica en poblaciones norteñas de la tortuga de agua dulce del oeste (Clemmys marmorata).
Collapse
Affiliation(s)
- Elizabeth M Gray
- Department of Zoology NJ-15, University of Washington, Seattle, WA 98195, U.S.A
| |
Collapse
|
73
|
Abstract
A formal mathematical analysis of the substitution process in nucleotide sequence evolution was done in terms of the Markov process. By using matrix algebra theory, the theoretical foundation of Barry and Hartigan's (Stat. Sci. 2:191-210, 1987) and Lanave et al.'s (J. Mol. Evol. 20:86-93, 1984) methods was provided. Extensive computer simulation was used to compare the accuracy and effectiveness of various methods for estimating the evolutionary distance between two nucleotide sequences. It was shown that the multiparameter methods of Lanave et al.'s (J. Mol. Evol. 20:86-93, 1984), Gojobori et al.'s (J. Mol. Evol. 18:414-422, 1982), and Barry and Hartigan's (Stat. Sci. 2:191-210, 1987) are preferable to others for the purpose of phylogenetic analysis when the sequences are long. However, when sequences are short and the evolutionary distance is large, Tajima and Nei's (Mol. Biol. Evol. 1:269-285, 1984) method is superior to others.
Collapse
Affiliation(s)
- A Zharkikh
- Center for Demographic and Population Genetics, University of Texas, Houston 77225
| |
Collapse
|