1
|
Zhu X, Chen C, Wang B. Phylogenetics and evolution of Trx SET genes in fully sequenced land plants. Genome 2012; 55:269-80. [PMID: 22417073 DOI: 10.1139/g2012-012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Plant Trx SET proteins are involved in H3K4 methylation and play a key role in plant floral development. Genes encoding Trx SET proteins constitute a multigene family in which the copy number varies among plant species and functional divergence appears to have occurred repeatedly. To investigate the evolutionary history of the Trx SET gene family, we made a comprehensive evolutionary analysis on this gene family from 13 major representatives of green plants. A novel clustering (here named as cpTrx clade), which included the III-1, III-2, and III-4 orthologous groups, previously resolved was identified. Our analysis showed that plant Trx proteins possessed a variety of domain organizations and gene structures among paralogs. Additional domains such as PHD, PWWP, and FYR were early integrated into primordial SET-PostSET domain organization of cpTrx clade. We suggested that the PostSET domain was lost in some members of III-4 orthologous group during the evolution of land plants. At least four classes of gene structures had been formed at the early evolutionary stage of land plants. Three intronless orphan Trx SET genes from the Physcomitrella patens (moss) were identified, and supposedly, their parental genes have been eliminated from the genome. The structural differences among evolutionary groups of plant Trx SET genes with different functions were described, contributing to the design of further experimental studies.
Collapse
Affiliation(s)
- Xinyu Zhu
- School of Life Sciences, Nantong University, Nantong, China.
| | | | | |
Collapse
|
2
|
Conant GC, Stadler PF. Solvent exposure imparts similar selective pressures across a range of yeast proteins. Mol Biol Evol 2009; 26:1155-61. [PMID: 19233963 DOI: 10.1093/molbev/msp031] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We study how an amino acid residue's solvent exposure influences its propensity for substitution by analyzing multiple alignments of 61 yeast genes for which the crystal structure is known. We find that the selective constraint on the interior residues is on average 10 times that of residues on the surface. Surprisingly, there is no correlation between the overall selective constraint observed for a protein alignment and the ratio of constraints on interior and surface residues. By modeling the selective constraint on several amino acid properties, we show that although residue volume and hydropathy are strongly conserved across most alignments, there is little variation in interior versus surface conservation for these two properties. By contrast, residue charge (isoelectric point) is less generally conserved when considering the protein as a whole but shows a strong constraint against the introduction of charged residues into the protein interior.
Collapse
Affiliation(s)
- Gavin C Conant
- Division of Animal Sciences and Informatics Institute, University of Missouri-Columbia, USA.
| | | |
Collapse
|
3
|
Fitzpatrick DA. Lines of evidence for horizontal gene transfer of a phenazine producing operon into multiple bacterial species. J Mol Evol 2009; 68:171-85. [PMID: 19189039 DOI: 10.1007/s00239-009-9198-5] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2008] [Revised: 12/06/2008] [Accepted: 01/12/2009] [Indexed: 10/21/2022]
Abstract
Phenazines are secondary metabolites with broad-spectrum antibiotic activity against bacteria, fungi, and eukaryotes. In pseudomonad species, a conserved seven-gene phenazine operon (phzABCDEFG) is required for the conversion of chorismic acid to the broad-spectrum antibiotic phenazine-1-carboxylate. Previous analyses of genes involved in phenazine production from nonpseudomonad species uncovered a high degree of sequence similarity to pseudomonad homologues. The analyses undertaken in this study wished to eluciadate the evolutionary history of genes involved in the production of phenazines. Furthermore, I wanted to determine if the phenazine operon has been transferred through horizontal gene transfer. Analyses of GC content, codon usage patterns, frequency of 3:1 dinucleotides, sequence similarities, and phylogenetic reconstructions were undertaken to map the evolutionary history of phenazine genes from multiple bacterial species. Patchy phyletic distribution, high sequence similarities, and phylogenetic evidence infer that pseudomonad, Streptomyces cinnamonensis, Pantoea agglomerans, Burkholderia cepacia, Pectobacterium atrosepticum, Brevibacterium linens, and Mycobacterium abscessus species all contain a phenazine operon which has most likely been transferred among these species through horizontal gene transfer. The acquisition of an antibiotic-associated operon is significant, as it may increase the relative fitness of the recipient species.
Collapse
Affiliation(s)
- David A Fitzpatrick
- Department of Biology, National University of Ireland, Maynooth, Co. Kildare, Ireland.
| |
Collapse
|
4
|
Jayaswal V, Robinson J, Jermiin L. Estimation of phylogeny and invariant sites under the general Markov model of nucleotide sequence evolution. Syst Biol 2007; 56:155-62. [PMID: 17454972 DOI: 10.1080/10635150701247921] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
Abstract
The models of nucleotide substitution used by most maximum likelihood-based methods assume that the evolutionary process is stationary, reversible, and homogeneous. We present an extension of the Barry and Hartigan model, which can be used to estimate parameters by maximum likelihood (ML) when the data contain invariant sites and there are violations of the assumptions of stationarity, reversibility, and homogeneity. Unlike most ML methods for estimating invariant sites, we estimate the nucleotide composition of invariant sites separately from that of variable sites. We analyze a bacterial data set where problems due to lack of stationarity and homogeneity have been previously well noted and use the parametric bootstrap to show that the data are consistent with our general Markov model. We also show that estimates of invariant sites obtained using our method are fairly accurate when applied to data simulated under the general Markov model.
Collapse
Affiliation(s)
- Vivek Jayaswal
- Sydney Bioinformatics, University of Sydney, NSW 2006, Australia
| | | | | |
Collapse
|
5
|
Fitzpatrick DA, Creevey CJ, McInerney JO. Genome phylogenies indicate a meaningful alpha-proteobacterial phylogeny and support a grouping of the mitochondria with the Rickettsiales. Mol Biol Evol 2005; 23:74-85. [PMID: 16151187 DOI: 10.1093/molbev/msj009] [Citation(s) in RCA: 115] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Placement of the mitochondrial branch on the tree of life has been problematic. Sparse sampling, the uncertainty of how lateral gene transfer might overwrite phylogenetic signals, and the uncertainty of phylogenetic inference have all contributed to the issue. Here we address this issue using a supertree approach and completed genomic sequences. We first determine that a sensible alpha-proteobacterial phylogenetic tree exists and that it can confidently be inferred using orthologous genes. We show that congruence across these orthologous gene trees is significantly better than might be expected by random chance. There is some evidence of horizontal gene transfer within the alpha-proteobacteria, but it appears to be restricted to a minority of genes ( approximately 23%) most of whom ( approximately 74%) can be categorized as operational. This means that placement of the mitochondrion should not be excessively hampered by interspecies gene transfer. We then show that there is a consistently strong signal for placement of the mitochondrion on this tree and that this placement is relatively insensitive to methodological approach or data set. A concatenated alignment was created consisting of 15 mitochondrion-encoded proteins that are unlikely to have undergone any lateral gene transfer in the timeline under consideration. This alignment infers that the sister group of the mitochondria, for the taxa that have been sampled, is the order Rickettsiales.
Collapse
Affiliation(s)
- David A Fitzpatrick
- Department of Biology, National University of Ireland, Maynooth, County Kildare, Ireland
| | | | | |
Collapse
|
6
|
Horner DS, Embley TM. Chaperonin 60 phylogeny provides further evidence for secondary loss of mitochondria among putative early-branching eukaryotes. Mol Biol Evol 2001; 18:1970-5. [PMID: 11557802 DOI: 10.1093/oxfordjournals.molbev.a003737] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
|
7
|
Conant GC, Lewis PO. Effects of Nucleotide Composition Bias on the Success of the Parsimony Criterion in Phylogenetic Inference. Mol Biol Evol 2001; 18:1024-33. [PMID: 11371591 DOI: 10.1093/oxfordjournals.molbev.a003874] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Convergence in nucleotide composition (CNC) in unrelated lineages is a factor potentially affecting the performance of most phylogeny reconstruction methods. Such convergence has deleterious effects because unrelated lineages show similarities due to similar nucleotide compositions and not shared histories. While some methods (such as the LogDet/paralinear distance measure) avoid this pitfall, the amount of convergence in nucleotide composition necessary to deceive other phylogenetic methods has never been quantified. We examined analytically the relationship between convergence in nucleotide composition and the consistency of parsimony as a phylogenetic estimator for four taxa. Our results show that rather extreme amounts of convergence are necessary before parsimony begins to prefer the incorrect tree. Ancillary observations are that (for unweighted Fitch parsimony) transition/transversion bias contributes to the impact of CNC and, for a given amount of CNC and fixed branch lengths, data sets exhibiting substantial site-to-site rate heterogeneity present fewer difficulties than data sets in which rates are homogeneous. We conclude by reexamining a data set originally used to illustrate the problems caused by CNC. Using simulations, we show that in this case the convergence in nucleotide composition alone is insufficient to cause any commonly used methods to fail, and accounting for other evolutionary factors (such as site-to-site rate heterogeneity) can give a correct inference without accounting for CNC.
Collapse
Affiliation(s)
- G C Conant
- Department of Biology, 167 Castetter Hall, University of New Mexico, Albuquerque, NM 87131-1091.
| | | |
Collapse
|
8
|
Waddell PJ, Steel MA. General time-reversible distances with unequal rates across sites: mixing gamma and inverse Gaussian distributions with invariant sites. Mol Phylogenet Evol 1997; 8:398-414. [PMID: 9417897 DOI: 10.1006/mpev.1997.0452] [Citation(s) in RCA: 138] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
A series of new results useful to the study of DNA sequences using Markov models of substitution are presented with proofs. General time-reversible distances can be extended to accommodate any fixed distribution of rates across sites by replacing the logarithmic function of a matrix with the inverse of a moment generating function. Estimators are presented assuming a gamma distribution, the inverse Gaussian distribution, or a mixture of either of these with invariant sites. Also considered are the different ways invariant sites may be removed and how these differences may affect estimated distances. Through collaboration, we implemented these distances into PAUP in 1994. The variance of these new distances is approximated via the delta method. It is also shown how to predict the divergence expected for a pair of sequences given a rate matrix and a distribution of rates across sites, allowing iterated ML estimates of distances under any reversible model. A simple test of whether a rate matrix is time reversible is also presented. These new methods are used to estimate the divergence time of humans and chimps from mtDNA sequence data. These analyses support suggestions that the human lineage has an enhanced transition rate relative to other hominoids. These studies also show that transversion distances differ substantially from the overall distances which are dominated by transitions. Transversions alone apparently suggest a very recent divergence time for humans versus chimps and/or a very old (> 16 myr) divergence time for humans versus orangutans. This work illustrates graphically ways to interpret the reliability of distance-based transformations, using the corrected transition to transversion ratio returned for pairs of sequences which are successively more diverged.
Collapse
Affiliation(s)
- P J Waddell
- School of Biological Sciences, Massey University, Palmerston North, New Zealand.
| | | |
Collapse
|
9
|
Lockhart PJ, Steel MA, W.D. Larkum A. Gene duplication and the evaluation of photosynthetic reaction center proteins. FEBS Lett 1996. [DOI: 10.1016/0014-5793(96)81381-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
10
|
Chang JT. Inconsistency of evolutionary tree topology reconstruction methods when substitution rates vary across characters. Math Biosci 1996; 134:189-215. [PMID: 8664540 DOI: 10.1016/0025-5564(95)00172-7] [Citation(s) in RCA: 114] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
A fundamental problem in reconstructing the evolutionary history of a set of species is to infer the topology of the evolutionary tree that relates those species. A statistical method for estimating such a topology from character data is called consistent if, given data from more and more characters, the method is sure to converge to the true topology. A number of popular methods are based on modeling the evolution of each character as a Markov process along the evolutionary tree. The standard models further assume that each character has in fact evolved according to the same Markov process. This homogeneity assumption is unrealistic; for example, different types of characters are known to experience substitutions at different rates. Certain distance and maximum likelihood methods for topology estimation have been shown to be consistent under the homogeneity assumption. Here we give examples showing that these methods can fail to be consistent when the homogeneity assumption is relaxed. The examples are very simple, requiring only four taxa, binary characters, and characters that evolve at two different rates.
Collapse
Affiliation(s)
- J T Chang
- Yale University Statistics Department, New Haven, Connecticut 06520-8290, USA
| |
Collapse
|
11
|
Lockhart PJ, Steel MA, Larkum AW. Gene duplication and the evolution of photosynthetic reaction center proteins. FEBS Lett 1996; 385:193-6. [PMID: 8647249 DOI: 10.1016/0014-5793(96)00383-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
We investigate the evolutionary relationships between photosynthetic reaction center proteins (D1, D2, L and M) and demonstrate that the pattern of nucleotide substitution in these is more complicated than has been assumed in previous phylogenetic analyses. We show that there are serious violations of methodological assumptions in previous published studies. We conclude that there is equal support for hypotheses indicating (i) a single gene duplication of an ancestral reaction center protein followed by diversification and (ii) two independent gene duplications giving rise to proteins in oxygenic and anoxygenic systems.
Collapse
Affiliation(s)
- P J Lockhart
- Lehrstuhl für Pflanzenökologie und Systematik, Universität Bayreuth, Germany.
| | | | | |
Collapse
|
12
|
Abstract
A new model of molecular evolution is introduced that allows for heterogeneous rates across the sequence positions. The development of this model was motivated by two issues: first, a number of studies have shown that the positions in a DNA sequence evolve at different rates, and second, it has been shown that not accounting for this heterogeneity can lead to biased estimates of evolutionary parameters. The authors generalize the Markovian model of molecular evolution to allow for heterogeneous rates and explore some of the consequences of such a model. In particular, they quantify the biases incurred by incorrectly assuming an equal-rate model and consider what can be learned about evolutionary parameters under a heterogeneous model.
Collapse
Affiliation(s)
- C Kelly
- Department of Computer Science and Statistics, University of Rhode Island, Kingston 02881, USA
| | | |
Collapse
|
13
|
Lockhart PJ, Larkum AW, Steel M, Waddell PJ, Penny D. Evolution of chlorophyll and bacteriochlorophyll: the problem of invariant sites in sequence analysis. Proc Natl Acad Sci U S A 1996; 93:1930-4. [PMID: 8700861 PMCID: PMC39885 DOI: 10.1073/pnas.93.5.1930] [Citation(s) in RCA: 150] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Competing hypotheses seek to explain the evolution of oxygenic and anoxygenic processes of photosynthesis. Since chlorophyll is less reduced and precedes bacteriochlorophyll on the modern biosynthetic pathway, it has been proposed that chlorophyll preceded bacteriochlorophyll in its evolution. However, recent analyses of nucleotide sequences that encode chlorophyll and bacteriochlorophyll biosynthetic enzymes appear to provide support for an alternative hypothesis. This is that the evolution of bacteriochlorophyll occurred earlier than the evolution of chlorophyll. Here we demonstrate that the presence of invariant sites in sequence datasets leads to inconsistency in tree building (including maximum-likelihood methods). Homologous sequences with different biological functions often share invariant sites at the same nucleotide positions. However, different constraints can also result in additional invariant sites unique to the genes, which have specific and different biological functions. Consequently, the distribution of these sites can be uneven between the different types of homologous genes. The presence of invariant sites, shared by related biosynthetic genes as well as those unique to only some of these genes, has misled the recent evolutionary analysis of oxygenic and anoxygenic photosynthetic pigments. We evaluate an alternative scheme for the evolution of chlorophyll and bacteriochlorophyll.
Collapse
Affiliation(s)
- P J Lockhart
- Molecular Genetics Unit, School of Biological Sciences, Massey University, Palmerston North New Zealand
| | | | | | | | | |
Collapse
|
14
|
Adachi J, Hasegawa M. Improved dating of the human/chimpanzee separation in the mitochondrial DNA tree: heterogeneity among amino acid sites. J Mol Evol 1995; 40:622-8. [PMID: 7643413 DOI: 10.1007/bf00160510] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
The internal branch lengths estimated by distance methods such as neighbor-joining are shown to be biased to be short when the evolutionary rate differs among sites. The variable-invariable model for site heterogeneity fits the amino acid sequence data encoded by the mitochondrial DNA from Hominoidea remarkably well. By assuming the orangutan separation to be 13 or 16 Myr old, a maximum-likelihood analysis estimates a young date of 3.6 +/- 0.6 or 4.4 +/- 0.7 Myr (+/- 1 SE) for the human/chimpanzee separation, and these estimates turn out to be robust against differences in the assumed model for amino acid substitutions. Although some uncertainties still exist in our estimates, this analysis suggests that humans separated from chimpanzees some 4-5 Myr ago.
Collapse
Affiliation(s)
- J Adachi
- Department of Statistical Science, Graduate University for Advanced Studies, Tokyo, Japan
| | | |
Collapse
|