126
|
Jordan IK, Mariño-Ramírez L, Wolf YI, Koonin EV. Conservation and coevolution in the scale-free human gene coexpression network. Mol Biol Evol 2004; 21:2058-70. [PMID: 15282333 DOI: 10.1093/molbev/msh222] [Citation(s) in RCA: 153] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
The role of natural selection in biology is well appreciated. Recently, however, a critical role for physical principles of network self-organization in biological systems has been revealed. Here, we employ a systems level view of genome-scale sequence and expression data to examine the interplay between these two sources of order, natural selection and physical self-organization, in the evolution of human gene regulation. The topology of a human gene coexpression network, derived from tissue-specific expression profiles, shows scale-free properties that imply evolutionary self-organization via preferential node attachment. Genes with numerous coexpressed partners (the hubs of the coexpression network) evolve more slowly on average than genes with fewer coexpressed partners, and genes that are coexpressed show similar rates of evolution. Thus, the strength of selective constraints on gene sequences is affected by the topology of the gene coexpression network. This connection is strong for the coding regions and 3' untranslated regions (UTRs), but the 5' UTRs appear to evolve under a different regime. Surprisingly, we found no connection between the rate of gene sequence divergence and the extent of gene expression profile divergence between human and mouse. This suggests that distinct modes of natural selection might govern sequence versus expression divergence, and we propose a model, based on rapid, adaptation-driven divergence and convergent evolution of gene expression patterns, for how natural selection could influence gene expression divergence.
Collapse
|
127
|
Jordan IK, Wolf YI, Koonin EV. Duplicated genes evolve slower than singletons despite the initial rate increase. BMC Evol Biol 2004; 4:22. [PMID: 15238160 PMCID: PMC481058 DOI: 10.1186/1471-2148-4-22] [Citation(s) in RCA: 156] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2004] [Accepted: 07/06/2004] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Gene duplication is an important mechanism that can lead to the emergence of new functions during evolution. The impact of duplication on the mode of gene evolution has been the subject of several theoretical and empirical comparative-genomic studies. It has been shown that, shortly after the duplication, genes seem to experience a considerable relaxation of purifying selection. RESULTS Here we demonstrate two opposite effects of gene duplication on evolutionary rates. Sequence comparisons between paralogs show that, in accord with previous observations, a substantial acceleration in the evolution of paralogs occurs after duplication, presumably due to relaxation of purifying selection. The effect of gene duplication on evolutionary rate was also assessed by sequence comparison between orthologs that have paralogs (duplicates) and those that do not (singletons). It is shown that, in eukaryotes, duplicates, on average, evolve significantly slower than singletons. Eukaryotic ortholog evolutionary rates for duplicates are also negatively correlated with the number of paralogs per gene and the strength of selection between paralogs. A tally of annotated gene functions shows that duplicates tend to be enriched for proteins with known functions, particularly those involved in signaling and related cellular processes; by contrast, singletons include an over-abundance of poorly characterized proteins. CONCLUSIONS These results suggest that whether or not a gene duplicate is retained by selection depends critically on the pre-existing functional utility of the protein encoded by the ancestral singleton. Duplicates of genes of a higher biological import, which are subject to strong functional constraints on the sequence, are retained relatively more often. Thus, the evolutionary trajectory of duplicated genes appears to be determined by two opposing trends, namely, the post-duplication rate acceleration and the generally slow evolutionary rate owing to the high level of functional constraints.
Collapse
MESH Headings
- Animals
- Base Composition/genetics
- DNA/genetics
- DNA, Archaeal/genetics
- DNA, Bacterial/genetics
- Evolution, Molecular
- Genes/genetics
- Genes/physiology
- Genes, Archaeal/genetics
- Genes, Archaeal/physiology
- Genes, Bacterial/genetics
- Genes, Bacterial/physiology
- Genes, Duplicate/genetics
- Genes, Duplicate/physiology
- Genes, Fungal/genetics
- Genes, Fungal/physiology
- Genes, Insect/genetics
- Genes, Insect/physiology
- Gram-Negative Bacteria/genetics
- Gram-Positive Bacteria/genetics
- Humans
- Mice
- Mutation/genetics
- Sequence Homology, Nucleic Acid
Collapse
|
128
|
Jordan IK, Bowen NJ. Computational Analysis of Transposable Element Sequences. Mob Genet Elements 2004; 260:59-71. [PMID: 15020802 DOI: 10.1385/1-59259-755-6:059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
This chapter provides a simple guide for the computational analysis of transposable element (TE) sequences. Web links are provided for a number of sequence analysis applications, and their potential use in the analysis of TE sequences is briefly described. The level of detail provided is intended to be sufficient for a naive user to begin to analyze TE sequences in silico. The emphasis is placed on the identification, retrieval and manipulation of TE sequences. Information is also provided on the evolutionary study of TE sequences including the use phylogenetics programs.
Collapse
|
129
|
Bowen NJ, Jordan IK, Epstein JA, Wood V, Levin HL. Retrotransposons and their recognition of pol II promoters: a comprehensive survey of the transposable elements from the complete genome sequence of Schizosaccharomyces pombe. Genome Res 2003; 13:1984-97. [PMID: 12952871 PMCID: PMC403668 DOI: 10.1101/gr.1191603] [Citation(s) in RCA: 117] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The complete DNA sequence of the genome of Schizosaccharomyces pombe provides the opportunity to investigate the entire complement of transposable elements (TEs), their association with specific sequences, their chromosomal distribution, and their evolution. Using homology-based sequence identification, we found that the sequenced strain of S. pombe contained only one family of full-length transposons. This family, Tf2, consisted of 13 full-length copies of a long terminal repeat (LTR) retrotransposon. We found that LTR-LTR recombination of previously existing transposons had resulted in extensive populations of solo LTRs. These included 35 solo LTRs of Tf2, as well as 139 solo LTRs from other Tf families. Phylogenetic analysis of solo Tf LTRs reveals that Tf1 and Tf2 were the most recently active elements within the genome. The solo LTRs also served as footprints for previous insertion events by the Tf retrotransposons. Analysis of 186 genomic insertion events revealed a close association with RNA polymerase II promoters. These insertions clustered in the promoter-proximal regions of genes, upstream of protein coding regions by 100 to 400 nucleotides. The association of Tf insertions with pol II promoters was very similar to the preference previously observed for Tf1 integration. We found that the recently active Tf elements were absent from centromeres and pericentromeric regions of the genome containing tandem tRNA gene clusters. In addition, our analysis revealed that chromosome III has twice the density of insertion events compared to the other two chromosomes. Finally we describe a novel repetitive sequence, wtf, which was also preferentially located on chromosome III, and was often located near solo LTRs of Tf elements.
Collapse
|
130
|
Jordan IK, Henze K, Fedorova ND, Koonin EV, Galperin MY. Phylogenomic analysis of the Giardia intestinalis transcarboxylase reveals multiple instances of domain fusion and fission in the evolution of biotin-dependent enzymes. J Mol Microbiol Biotechnol 2003; 5:172-89. [PMID: 12766347 DOI: 10.1159/000070268] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Sequencing of the gene encoding a pyruvate carboxylase-like protein from the amitochondrial eukaryote Giardia intestinalis revealed a 1,338 aa protein composed of acetyl-CoA carboxyltransferase (ACCT), pyruvate carboxyltransferase (PycB), and biotin carboxyl carrier protein (BCCP) domains, linked in a single polypeptide chain. This particular domain combination has been previously seen only in the methylmalonyl-CoA:pyruvate transcarboxylase from Propionibacterium freudenreichii, where each of these domains is encoded by an individual gene and forms a separate subunit. To get an insight into the evolutionary origin and biochemical function of the G. intestinalis enzyme, we compared its domain composition to those of other biotin-dependent enzymes and performed a phylogenetic analysis of each of its domains. The results obtained indicate that: (1) evolution of the BCCP domain included several domain fusion events, leading to the ACCT-BCCP and PycB-BCCP domain combinations; (2) fusions of the PycB and BCCP domains in pyruvate carboxylases and oxaloacetate decarboxylases occurred on several independent occasions in different prokaryotic lineages, probably due to selective pressure towards co-expression of these genes, and (3) because newly sequenced biotin-dependent enzymes are often misannotated in sequence databases, their annotation as either carboxylases, decarboxylases, or transcarboxylases has to rely on detailed analysis of their domain composition, operon organization of the corresponding genes, gene content in the particular genome, and phylogenetic analysis.
Collapse
|
131
|
Jordan IK, Rogozin IB, Glazko GV, Koonin EV. Origin of a substantial fraction of human regulatory sequences from transposable elements. Trends Genet 2003; 19:68-72. [PMID: 12547512 DOI: 10.1016/s0168-9525(02)00006-9] [Citation(s) in RCA: 409] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Transposable elements (TEs) are abundant in mammalian genomes and have potentially contributed to their hosts' evolution by providing novel regulatory or coding sequences. We surveyed different classes of regulatory region in the human genome to assess systematically the potential contribution of TEs to gene regulation. Almost 25% of the analyzed promoter regions contain TE-derived sequences, including many experimentally characterized cis-regulatory elements. Scaffold/matrix attachment regions (S/MARs) and locus control regions (LCRs) that are involved in the simultaneous regulation of multiple genes also contain numerous TE-derived sequences. Thus, TEs have probably contributed substantially to the evolution of both gene-specific and global patterns of human gene regulation.
Collapse
|
132
|
Jordan IK, Wolf YI, Koonin EV. No simple dependence between protein evolution rate and the number of protein-protein interactions: only the most prolific interactors tend to evolve slowly. BMC Evol Biol 2003; 3:1. [PMID: 12515583 PMCID: PMC140311 DOI: 10.1186/1471-2148-3-1] [Citation(s) in RCA: 165] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2002] [Accepted: 01/06/2003] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND It has been suggested that rates of protein evolution are influenced, to a great extent, by the proportion of amino acid residues that are directly involved in protein function. In agreement with this hypothesis, recent work has shown a negative correlation between evolutionary rates and the number of protein-protein interactions. However, the extent to which the number of protein-protein interactions influences evolutionary rates remains unclear. Here, we address this question at several different levels of evolutionary relatedness. RESULTS Manually curated data on the number of protein-protein interactions among Saccharomyces cerevisiae proteins was examined for possible correlation with evolutionary rates between S. cerevisiae and Schizosaccharomyces pombe orthologs. Only a very weak negative correlation between the number of interactions and evolutionary rate of a protein was observed. Furthermore, no relationship was found between a more general measure of the evolutionary conservation of S. cerevisiae proteins, based on the taxonomic distribution of their homologs, and the number of protein-protein interactions. However, when the proteins from yeast were assorted into discrete bins according to the number of interactions, it turned out that 6.5% of the proteins with the greatest number of interactions evolved, on average, significantly slower than the rest of the proteins. Comparisons were also performed using protein-protein interaction data obtained with high-throughput analysis of Helicobacter pylori proteins. No convincing relationship between the number of protein-protein interactions and evolutionary rates was detected, either for comparisons of orthologs from two completely sequenced H. pylori strains or for comparisons of H. pylori and Campylobacter jejuni orthologs, even when the proteins were classified into bins by the number of interactions. CONCLUSION The currently available comparative-genomic data do not support the hypothesis that the evolutionary rates of the majority of proteins substantially depend on the number of protein-protein interactions they are involved in. However, a small fraction of yeast proteins with the largest number of interactions (the hubs of the interaction network) tend to evolve slower than the bulk of the proteins.
Collapse
|
133
|
|
134
|
Jiang N, Jordan IK, Wessler SR. Dasheng and RIRE2. A nonautonomous long terminal repeat element and its putative autonomous partner in the rice genome. PLANT PHYSIOLOGY 2002; 130:1697-705. [PMID: 12481052 PMCID: PMC166684 DOI: 10.1104/pp.015412] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2002] [Revised: 10/04/2002] [Accepted: 10/08/2002] [Indexed: 05/18/2023]
Abstract
Dasheng is one of the highest copy number long terminal repeat elements and one of the most recent elements to amplify in the rice (Oryza sativa) genome. However, the absence of any significant coding capacity for retroviral proteins, including gag and pol, suggests that Dasheng is a nonautonomous element. Here, we have exploited the availability of 360 Mb of rice genomic sequence to identify a candidate autonomous element. RIRE2 is a previously described gypsy-like long terminal repeat retrotransposon with significant sequence similarity to Dasheng in the regions where putative cis factors for retrotransposition are thought to be located. Dasheng and RIRE2 elements have similar chromosomal distribution patterns and similar target site sequences, suggesting that they use the same transposition machinery. In addition, the presence of several RIRE2-Dasheng element chimeras in the genome is consistent with the copackaging of element mRNAs in the same virus-like particle. Finally, both families have recently amplified members, suggesting that they could have been co-expressed, a necessary prerequisite for RIRE2 to serve as the source of transposition machinery for Dasheng. Consistent with this hypothesis, transcripts from both elements were found in the same expressed sequence tag library.
Collapse
|
135
|
Bowen NJ, Jordan IK. Transposable elements and the evolution of eukaryotic complexity. Curr Issues Mol Biol 2002; 4:65-76. [PMID: 12074196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2023] Open
Abstract
Eukaryotic transposable elements are ubiquitous and widespread mobile genetic entities. These elements often make up a substantial fraction of the host genomes in which they reside. For example, approximately 1/2 of the human genome was recently shown to consist of transposable element sequences. There is a growing body of evidence that demonstrates that transposable elements have been major players in genome evolution. A sample of this evidence is reviewed here with an emphasis on the role that transposable elements may have played in driving the evolution of eukaryotic complexity. A number of specific scenarios are presented that implicate transposable elements in the evolution of the complex molecular and cellular machinery that are characteristic of the eukaryotic domain of life.
Collapse
|
136
|
Jordan IK, Rogozin IB, Wolf YI, Koonin EV. Essential genes are more evolutionarily conserved than are nonessential genes in bacteria. Genome Res 2002; 12:962-8. [PMID: 12045149 PMCID: PMC1383730 DOI: 10.1101/gr.87702] [Citation(s) in RCA: 340] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
The "knockout-rate" prediction holds that essential genes should be more evolutionarily conserved than are nonessential genes. This is because negative (purifying) selection acting on essential genes is expected to be more stringent than that for nonessential genes, which are more functionally dispensable and/or redundant. However, a recent survey of evolutionary distances between Saccharomyces cerevisiae and Caenorhabditis elegans proteins did not reveal any difference between the rates of evolution for essential and nonessential genes. An analysis of mouse and rat orthologous genes also found that essential and nonessential genes evolved at similar rates when genes thought to evolve under directional selection were excluded from the analysis. In the present study, we combine genomic sequence data with experimental knockout data to compare the rates of evolution and the levels of selection for essential versus nonessential bacterial genes. In contrast to the results obtained for eukaryotic genes, essential bacterial genes appear to be more conserved than are nonessential genes over both relatively short (microevolutionary) and longer (macroevolutionary) time scales.
Collapse
|
137
|
Jordan IK, McDonald JF. A Biologically Active Family of Human Endogenous Retroviruses Evolved from an Ancient Inactive Lineage. ACTA ACUST UNITED AC 2002. [DOI: 10.1166/gl.2002.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
138
|
Abstract
The availability of multiple complete genome sequences from the same species can facilitate attempts to systematically address basic questions in genome evolution. We refer to such efforts as "microevolutionary genomics". We report the results of comparative analyses of complete intraspecific genome (and proteome) sequences from four bacterial species--Chlamydophila pneumoniae, Escherichia coli, Helicobacter pylori and Neisseria meningitidis. Comparisons of average synonymous (K(s)) and nonsynonymous (K(a)) substitution rates were used to assess the influence of various biological factors on the rate of protein evolution. For example, E. coli experiences the most intense purifying selection of the species analyzed, and this may be due to the relatively larger population size of this species. In addition, essential genes were shown to be more evolutionarily conserved than nonessential genes in E. coli and duplicated genes have higher rates of evolution than unique genes for all species studied except C. pneumoniae. Different functional categories of genes were shown to evolve at significantly different rates emphasizing the role of category-specific functional constraints in determining evolutionary rates. Finally, functionally characterized genes tend to be conserved between strains, while uncharacterized genes are over-represented among the unique, strain-specific genes. This suggests the possibility that nonessential genes are responsible for driving the evolutionary diversification between strains.
Collapse
|
139
|
Rogozin IB, Spiridonov AN, Sorokin AV, Wolf YI, Jordan IK, Tatusov RL, Koonin EV. Purifying and directional selection in overlapping prokaryotic genes. Trends Genet 2002; 18:228-32. [PMID: 12047938 DOI: 10.1016/s0168-9525(02)02649-5] [Citation(s) in RCA: 94] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
In overlapping genes, the same DNA sequence codes for two proteins using different reading frames. Analysis of overlapping genes can help in understanding the mode of evolution of a coding region from noncoding DNA. We identified 71 pairs of convergent genes, with overlapping 3' ends longer than 15 nucleotides, that are conserved in at least two prokaryotic genomes. Among the overlap regions, we observed a statistically significant bias towards the 123:132 phase (i.e. the second codon base in one gene facing the degenerate third position in the second gene). This phase ensures the least mutual constraint on nonconservative amino acid replacements in both overlapping coding sequences. The excess of this phase is compatible with directional (positive) selection acting on the overlapping coding regions. This could be a general evolutionary mode for genes emerging from noncoding sequences, in which the protein sequence has not been subject to selection.
Collapse
|
140
|
Jordan IK, Natale DA, Koonin EV, Galperin MY. Independent evolution of heavy metal-associated domains in copper chaperones and copper-transporting atpases. J Mol Evol 2001; 53:622-33. [PMID: 11677622 DOI: 10.1007/s002390010249] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2000] [Accepted: 05/09/2001] [Indexed: 11/29/2022]
Abstract
Copper chaperones are small cytoplasmic proteins that bind intracellular copper (Cu) and deliver it to Cu-dependent enzymes such as cytochrome oxidase, superoxide dismutase, and amine oxidase. Copper chaperones are similar in sequence and structure to the Cu-binding heavy metal-associated (HMA) domains of Cu-transporting ATPases (Cu-ATPases), and the genes for copper chaperones and Cu-ATPases are often located in the same operon. Phylogenetic analysis shows that Cu chaperones and HMA domains of Cu-ATPases represent ancient and distinct lineages that have evolved largely independently since their initial separation. Copper chaperone-Cu-ATPase operons appear to have evolved independently in different prokaryotic lineages, probably due to a strong selective pressure for coexpression of these genes.
Collapse
|
141
|
Jordan IK, Bishop GR, Gonzalez DS. Sequence and structural aspects of functional diversification in class I alpha-mannosidase evolution. Bioinformatics 2001; 17:965-76. [PMID: 11673242 DOI: 10.1093/bioinformatics/17.10.965] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Class I alpha-mannosidases comprise a homologous and functionally diverse family of glycoside hydrolases. Phylogenetic analysis based on an amino acid sequence alignment of the catalytic domain of class I alpha-mannosidases reveals four well-supported phylogenetic groups within this family. These groups include a number of paralogous members generated by gene duplications that occurred as far back as the initial divergence of the crown-group of eukaryotes. Three of the four phylogenetic groups consist of enzymes that have group-specific biochemical specificity and/or sites of activity. An attempt has been made to uncover the role that natural selection played in the sequence and structural divergence between the phylogenetically and functionally distinct Endoplasmic Reticulum (ER) and Golgi apparatus groups. RESULTS Comparison of site-specific amino acid variability profiles for the ER and Golgi groups revealed statistically significant evidence for functional diversification at the sequence level and indicated a number of residues that are most likely to have played a role in the functional divergence between the two groups. The majority of these sites appear to contain residues that have been fixed within one organelle-specific group by positive selection. Somewhat surprisingly these selected residues map to the periphery of the alpha-mannosidase catalytic domain tertiary structure. Changes in these peripherally located residues would not seem to have a gross effect on protein function. Thus diversifying selection between the two groups may have acted in a gradual manner consistent with the Darwinian model of natural selection. CONTACT bishogr@millsaps.edu.
Collapse
|
142
|
Jordan IK, Makarova KS, Spouge JL, Wolf YI, Koonin EV. Lineage-specific gene expansions in bacterial and archaeal genomes. Genome Res 2001; 11:555-65. [PMID: 11282971 PMCID: PMC311027 DOI: 10.1101/gr.gr-1660r] [Citation(s) in RCA: 122] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Gene duplication is an important mechanistic antecedent to the evolution of new genes and novel biochemical functions. In an attempt to assess the contribution of gene duplication to genome evolution in archaea and bacteria, clusters of related genes that appear to have expanded subsequent to the diversification of the major prokaryotic lineages (lineage-specific expansions) were analyzed. Analysis of 21 completely sequenced prokaryotic genomes shows that lineage-specific expansions comprise a substantial fraction (approximately 5%-33%) of their coding capacities. A positive correlation exists between the fraction of the genes taken up by lineage-specific expansions and the total number of genes in a genome. Consistent with the notion that lineage-specific expansions are made up of relatively recently duplicated genes, >90% of the detected clusters consists of only two to four genes. The more common smaller clusters tend to include genes with higher pairwise similarity (as reflected by average score density) than larger clusters. Regardless of size, cluster members tend to be located more closely on bacterial chromosomes than expected by chance, which could reflect a history of tandem gene duplication. In addition to the small clusters, almost all genomes also contain rare large clusters of size > or =20. Several examples of the potential adaptive significance of these large clusters are explored. The presence or absence of clusters and their related genes was used as the basis for the construction of a similarity graph for completely sequenced prokaryotic genomes. The topology of the resulting graph seems to reflect a combined effect of common ancestry, horizontal transfer, and lineage-specific gene loss.
Collapse
|
143
|
|
144
|
Jordan IK, Makarova KS, Wolf YI, Koonin EV. Gene conversions in genes encoding outer-membrane proteins in H. pylori and C. pneumoniae. Trends Genet 2001; 17:7-10. [PMID: 11163905 DOI: 10.1016/s0168-9525(00)02151-x] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Helicobacter pylori and Chlamydia pneumoniae are both pathogenic to humans. Their genomes have recently been completed, allowing detailed study of their evolution and organization. Here we describe an evolutionary analysis of the H. pylori and C. pneumoniae genes that encode their outer-membrane proteins. By comparing complete genome sequences of two H. pylori strains and two C. pneumoniae strains, we identify multiple independent conversions among these genes. Such recombination events might provide a selective advantage for these bacterial pathogens.
Collapse
|
145
|
Jordan IK, Kondrashov FA, Rogozin IB, Tatusov RL, Wolf YI, Koonin EV. Constant relative rate of protein evolution and detection of functional diversification among bacterial, archaeal and eukaryotic proteins. Genome Biol 2001; 2:RESEARCH0053. [PMID: 11790256 PMCID: PMC64838 DOI: 10.1186/gb-2001-2-12-research0053] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2001] [Revised: 09/11/2001] [Accepted: 10/05/2001] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Detection of changes in a protein's evolutionary rate may reveal cases of change in that protein's function. We developed and implemented a simple relative rates test in an attempt to assess the rate constancy of protein evolution and to detect cases of functional diversification between orthologous proteins. The test was performed on clusters of orthologous protein sequences from complete bacterial genomes (Chlamydia trachomatis, C. muridarum and Chlamydophila pneumoniae), complete archaeal genomes (Pyrococcus horikoshii, P. abyssi and P. furiosus) and partially sequenced mammalian genomes (human, mouse and rat). RESULTS Amino-acid sequence evolution rates are significantly correlated on different branches of phylogenetic trees representing the great majority of analyzed orthologous protein sets from all three domains of life. However, approximately 1% of the proteins from each group of species deviates from this pattern and instead shows variation that is consistent with an acceleration of the rate of amino-acid substitution, which may be due to functional diversification. Most of the putative functionally diversified proteins from all three species groups are predicted to function at the periphery of the cells and mediate their interaction with the environment. CONCLUSIONS Relative rates of protein evolution are remarkably constant for the three species groups analyzed here. Deviations from this rate constancy are probably due to changes in selective constraints associated with diversification between orthologs. Functional diversification between orthologs is thought to be a relatively rare event. However, the resolution afforded by the test designed specifically for genomic-scale datasets allowed us to identify numerous cases of possible functional diversification between orthologous proteins.
Collapse
|
146
|
Jordan IK, Natale DA, Galperin MY. Copper chaperones in bacteria: association with copper-transporting ATPases. Trends Biochem Sci 2000; 25:480-1. [PMID: 11203382 DOI: 10.1016/s0968-0004(00)01662-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
147
|
Jordan IK, McDonald JF. Comparative genomics and evolutionary dynamics of Saccharomyces cerevisiae Ty elements. Genetica 2000; 107:3-13. [PMID: 10952193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
The availability of the complete genome sequence of Saccharomyces cerevisiae provides the unique opportunity to study an entire genomic complement of retrotransposons from an evolutionary perspective. There are five families of yeast retrotransposons, Ty1-Ty5. We have conducted a series of comparative sequence analyses within and among S. cerevisiae Ty families in an effort to document the evolutionary forces that have shaped element variation. Our results indicate that within families Ty elements vary little in terms of both size and sequence. Furthermore, intra-element 5'-3' long terminal repeat (LTR) sequence comparisons indicate that almost all Ty elements in the genome have recently transposed. For each family, solo LTR sequences generated by intra-element recombination far outnumber full length insertions. Taken together, these results suggest a rapid genomic turnover of S. cerevisiae Ty elements. The closely related Ty1 and Ty2 are the most numerous elements in the genome. Phylogenetic analysis of full length insertions reveals that reverse transcriptase mediated recombination between Ty1 and Ty2 elements has generated a number of hybrid Ty1/2 elements. These hybrid Ty1/2 elements have similar genomic structures with chimeric LTRs and chimeric TYB (pol) genes. Analysis of the levels of nonsynonymous (Ka) and synonymous (Ks) nucleotide variation indicates that Ty1 and Ty2 coding regions have been subject to strong negative (purifying) selection. Distribution of Ka and Ks on Ty1, Ty2 and Ty1/2 phylogenies reveals evidence of negative selection on both internal and external branches. This pattern of variation suggests that the majority of full length Ty1, Ty2 and Ty1/2 insertions represent active or recently active element lineages and is consistent with a high level of genomic turnover. The evolutionary dynamics of S. cerevisae Ty elements uncovered by our analyses are discussed with respect to selection among elements and the interaction between the elements and their host genome.
Collapse
|
148
|
Abstract
alpha-Mannosidase enzymes comprise a class of gylcoside hydrolases involved in the maturation and degradaton of glycoprotein-linked oligosaccharides. Various alpha-mannosidase enzymatic activities are encoded by an ancient and ubiquitous gene superfamily. A comparative sequence analysis was employed here to characterize the evolutionary relationships and dynamics of the alpha-mannosidase superfamily. A series of lineage-specific BLAST searches recovered the first ever recognized archaean and eubacterial alpha-mannosidase sequences, in addition to numerous eukaryotic sequences. Motif-based alignment and subsequent phylogenetic analysis of the entire superfamily revealed the presence of three well-supported monophyletic clades that represent discrete alpha-mannosidase families. The comparative method was used to evaluate the phylogenetic distribution of alpha-mannosidase functional variants within families. Results of this analysis demonstrate a pattern of functional diversification of alpha-mannosidase paralogs followed by conservation of function among orthologs. Nucleotide polymorphism among the most closely related pair of duplicated genes was analyzed to evaluate the role of natural selection in the functional diversification of alpha-mannosidase paralogs. Ratios of nonsynonymous and synonymous variation show an increase in the rate of nonsynonymous change after duplication and a relative excess of fixed nonsynonymous changes between the two groups of paralogs. These data point to a possible role for positive Darwinian selection in the evolution of alpha-mannosidase functional diversification following gene duplication.
Collapse
|
149
|
Jordan IK, Sutter BA, McClure MA. Molecular evolution of the Paramyxoviridae and Rhabdoviridae multiple-protein-encoding P gene. Mol Biol Evol 2000; 17:75-86. [PMID: 10666708 DOI: 10.1093/oxfordjournals.molbev.a026240] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Presented here is an analysis of the molecular evolutionary dynamics of the P gene among 76 representative sequences of the Paramyxoviridae and Rhabdoviridae RNA virus families. In a number of Paramyxoviridae taxa, as well as in vesicular stomatitis viruses of the Rhabdoviridae, the P gene encodes multiple proteins from a single genomic RNA sequence. These products include the phosphoprotein (P), as well as the C and V proteins. The complexity of the P gene makes it an intriguing locus to study from an evolutionary perspective. Amino acid sequence alignments of the proteins encoded at the P and N loci were used in independent phylogenetic reconstructions of the Paramyxoviridae and Rhabdoviridae families. P-gene-coding capacities were mapped onto the Paramyxoviridae phylogeny, and the most parsimonious path of multiple-coding-capacity evolution was determined. Levels of amino acid variation for Paramyxoviridae and Rhabdoviridae P-gene-encoded products were also analyzed. Proteins encoded in overlapping reading frames from the same nucleotides have different levels of amino acid variation. The nucleotide architecture that underlies the amino acid variation was determined in order to evaluate the role of selection in the evolution of the P gene overlapping reading frames. In every case, the evolution of one of the proteins encoded in the overlapping reading frames has been constrained by negative selection while the other has evolved more rapidly. The integrity of the overlapping reading frame that represents a derived state is generally maintained at the expense of the ancestral reading frame encoded by the same nucleotides. The evolution of such multicoding sequences is likely a response by RNA viruses to selective pressure to maximize genomic information content while maintaining small genome size. The ability to evolve such a complex genomic strategy is intimately related to the dynamics of the viral quasispecies, which allow enhanced exploration of the adaptive landscape.
Collapse
|
150
|
Jordan IK, Matyunina LV, McDonald JF. Evidence for the recent horizontal transfer of long terminal repeat retrotransposon. Proc Natl Acad Sci U S A 1999; 96:12621-5. [PMID: 10535972 PMCID: PMC23018 DOI: 10.1073/pnas.96.22.12621] [Citation(s) in RCA: 95] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The evolutionary dynamics existing between transposable elements (TEs) and their host genomes have been likened to an "arms race." The selfish drive of TEs to replicate, in turn, elicits the evolution of host-mediated regulatory mechanisms aimed at repressing transpositional activity. It has been postulated that horizontal (cross-species) transfer may be one effective strategy by which TEs and other selfish genes can escape host-mediated silencing mechanisms over evolutionary time; however, to date, the most definitive evidence that TEs horizontally transfer between species has been limited to class II or DNA-type elements. Evidence that the more numerous and widely distributed retroelements may also be horizontally transferred between species has been more ambiguous. In this paper, we report definitive evidence for a recent horizontal transfer of the copia long terminal repeat retrotransposon between Drosophila melanogaster and Drosophila willistoni.
Collapse
|