101
|
Kim DH, Ausubel FM. Evolutionary perspectives on innate immunity from the study of Caenorhabditis elegans. Curr Opin Immunol 2005; 17:4-10. [PMID: 15653303 DOI: 10.1016/j.coi.2004.11.007] [Citation(s) in RCA: 97] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Genetic and functional genomic approaches have begun to define the molecular determinants of pathogen resistance in Caenorhabditis elegans. Conserved signal transduction components are required for pathogen resistance, including a Toll/IL-1 receptor domain adaptor protein that functions upstream of a conserved p38 MAP kinase pathway. We suggest that this pathway is an ancestral innate immune signaling pathway present in the common ancestor of nematodes, arthropods and vertebrates, which is likely to predate the involvement of canonical Toll signaling pathways in innate immunity. We anticipate that the study of pathogen resistance in C. elegans will continue to provide evolutionary and mechanistic insights into the signal transduction and physiology of innate immunity.
Collapse
Affiliation(s)
- Dennis H Kim
- Department of Genetics, Harvard Medical School, and Department of Molecular Biology, Massachusetts General Hospital, Boston, MA 02114, USA
| | | |
Collapse
|
102
|
|
103
|
Abstract
With the completion of the human genome and the growing number of diverse genomes being sequenced, a new age of evolutionary research is currently taking shape. The myriad of technological breakthroughs in biology that are leading to the unification of broad scientific fields such as molecular biology, biochemistry, physics, mathematics, and computer science are now known as systems biology. Here, I present an overview, with an emphasis on eukaryotes, of how the postgenomics era is adopting comparative approaches that go beyond comparisons among model organisms to shape the nascent field of evolutionary systems biology.
Collapse
Affiliation(s)
- Mónica Medina
- Department of Evolutionary Genomics, Department of Energy Joint Genome Institute, Walnut Creek, CA 94598, USA.
| |
Collapse
|
104
|
Delsuc F, Brinkmann H, Philippe H. Phylogenomics and the reconstruction of the tree of life. Nat Rev Genet 2005; 6:361-75. [PMID: 15861208 DOI: 10.1038/nrg1603] [Citation(s) in RCA: 748] [Impact Index Per Article: 39.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
As more complete genomes are sequenced, phylogenetic analysis is entering a new era - that of phylogenomics. One branch of this expanding field aims to reconstruct the evolutionary history of organisms on the basis of the analysis of their genomes. Recent studies have demonstrated the power of this approach, which has the potential to provide answers to several fundamental evolutionary questions. However, challenges for the future have also been revealed. The very nature of the evolutionary history of organisms and the limitations of current phylogenetic reconstruction methods mean that part of the tree of life might prove difficult, if not impossible, to resolve with confidence.
Collapse
Affiliation(s)
- Frédéric Delsuc
- Canadian Institute for Advanced Research, Département de Biochimie, Centre Robert-Cedergren, Université de Montréal, Succursale Centre-Ville, Montréal, Québec H3C3J7, Canada
| | | | | |
Collapse
|
105
|
Dopazo H, Dopazo J. Genome-scale evidence of the nematode-arthropod clade. Genome Biol 2005; 6:R41. [PMID: 15892869 PMCID: PMC1175953 DOI: 10.1186/gb-2005-6-5-r41] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2005] [Accepted: 04/06/2005] [Indexed: 11/13/2022] Open
Abstract
The most extensive phylogenetic analysis carried out to date, including 11 complete genomes, is shown to support the Ecdysozoa hypothesis in the open-ended debate of the Coelomata-Ecdysozoa evolutionary problem. Background The issue of whether coelomates form a single clade, the Coelomata, or whether all animals that moult an exoskeleton (such as the coelomate arthropods and the pseudocoelomate nematodes) form a distinct clade, the Ecdysozoa, is the most puzzling issue in animal systematics and a major open-ended subject in evolutionary biology. Previous single-gene and genome-scale analyses designed to resolve the issue have produced contradictory results. Here we present the first genome-scale phylogenetic evidence that strongly supports the Ecdysozoa hypothesis. Results Through the most extensive phylogenetic analysis carried out to date, the complete genomes of 11 eukaryotic species have been analyzed in order to find homologous sequences derived from 18 human chromosomes. Phylogenetic analysis of datasets showing an increased adjustment to equal evolutionary rates between nematode and arthropod sequences produced a gradual change from support for Coelomata to support for Ecdysozoa. Transition between topologies occurred when fast-evolving sequences of Caenorhabditis elegans were removed. When chordate, nematode and arthropod sequences were constrained to fit equal evolutionary rates, the Ecdysozoa topology was statistically accepted whereas Coelomata was rejected. Conclusions The reliability of a monophyletic group clustering arthropods and nematodes was unequivocally accepted in datasets where traces of the long-branch attraction effect were removed. This is the first phylogenomic evidence to strongly support the 'moulting clade' hypothesis.
Collapse
Affiliation(s)
- Hernán Dopazo
- Pharmacogenomics and Comparative Genomics Unit, Bioinformatics Department, Centro de Investigación Príncipe Felipe, Autopista del Saler 16, 46013 Valencia, Spain
| | - Joaquín Dopazo
- Functional Genomics Unit, Bioinformatics Department, Centro de Investigación Príncipe Felipe, Autopista del Saler 16, 46013 Valencia, Spain
- Functional Genomics Node, INB, Centro de Investigación Príncipe Felipe, Autopista del Saler 16, 46013 Valencia, Spain
| |
Collapse
|
106
|
Zdobnov EM, von Mering C, Letunic I, Bork P. Consistency of genome-based methods in measuring Metazoan evolution. FEBS Lett 2005; 579:3355-61. [PMID: 15943981 DOI: 10.1016/j.febslet.2005.04.006] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/05/2005] [Indexed: 11/22/2022]
Abstract
Seven distinct genome-wide divergence measures were applied pairwise to the nine sequenced animal genomes of human, mouse, rat, chicken, pufferfish, fruit fly, mosquito, and two nematode worms (Caenorhabditis briggsae and Caenorhabditis elegans). Qualitatively, all of these divergence measures are found to correlate with the estimated time since speciation; however, marked deviations are observed in a few lineages. The distinct genome divergence measures also correlate well among themselves, indicating that most of the processes shaping genomes are dominated by neutral events. The deviations from the clock-like scenario in some lineages are observed consistently by several measures, implicitly confirming their reliability.
Collapse
|
107
|
Hughes AL, Ekollu V, Friedman R, Rose JR. Gene Family Content-Based Phylogeny of Prokaryotes: The Effect of Criteria for Inferring Homology. Syst Biol 2005; 54:268-76. [PMID: 16012097 DOI: 10.1080/10635150590923335] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022] Open
Abstract
A number of recent papers have suggested that gene family content can be used to resolve phylogenies, particularly in the case of prokaryotes, in which extensive horizontal gene transfer means that individual gene phylogenies may not mirror the organismal phylogeny. However, no study has yet examined how sensitive such analyses are to the criterion of homology assessment used to assemble multigene families. Using data from 99 completely sequenced prokaryotic genomes, we examined the effect of homology criteria in phylogenetic analyses wherein presence or absence of each family in the genome was used as a cladistic character. Different criteria resulted in evidence for contradictory tree topologies, sometimes with high bootstrap support. A moderately strict criterion seemed best for assembling multigene families in a biologically meaningful way, but it was not necessarily preferable for phylogenetic analysis. Instead, a very strict criterion, which broke up gene families into smaller subfamilies, seemed to have advantages for phylogenetic purposes. The poor performance of gene family content-based phylogenetic analysis in the case of prokaryotes appears to reflect high levels of homoplasy resulting not only from horizontal gene transfer but also, more importantly, from extensive parallel loss of gene families in certain bacteria genomes.
Collapse
Affiliation(s)
- Austin L Hughes
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29205, USA.
| | | | | | | |
Collapse
|
108
|
Hughes AL, Friedman R. Shedding genomic ballast: extensive parallel loss of ancestral gene families in animals. J Mol Evol 2005; 59:827-33. [PMID: 15599514 DOI: 10.1007/s00239-004-0115-7] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2004] [Accepted: 07/21/2004] [Indexed: 10/26/2022]
Abstract
Loss of ancestral gene families has played an important role in genomic specialization in animals. An examination of the pattern of gene family loss in completely sequenced animal genomes revealed that the same gene families have been lost independently in different lineages to a far greater extent than expected if gene loss occurred at random. This result implies that certain ancestral gene families-and thus the biological functions they encode-have been more expendable than others over the radiation of the animal phyla.
Collapse
Affiliation(s)
- Austin L Hughes
- Department of Biological Sciences, University of South Carolina, Coker Life Sciences Building, 700 Sumter Street, Columbia, SC 29205, USA.
| | | |
Collapse
|
109
|
Roy SW, Gilbert W. Resolution of a deep animal divergence by the pattern of intron conservation. Proc Natl Acad Sci U S A 2005; 102:4403-8. [PMID: 15769859 PMCID: PMC555513 DOI: 10.1073/pnas.0409891102] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The relationship between three biologically important groups, arthropods, nematodes, and deuterostomes, remains unresolved. It is unknown whether arthropods are more closely related to nematodes (consistent with the "ecdysozoa" hypothesis) or to deuterostomes (consistent with "coelomata"). We present a method in which we use the pattern of spliceosomal intron conservation to develop a series of inequalities that characterize each possible relationship. We find that only the ecdysozoa grouping satisfies these predictions, with P < 10(-6). Simulations show that our method, unlike some previous methods, is largely insensitive to rate variation between branches.
Collapse
Affiliation(s)
- Scott William Roy
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA.
| | | |
Collapse
|
110
|
Blair JE, Shah P, Hedges SB. Evolutionary sequence analysis of complete eukaryote genomes. BMC Bioinformatics 2005; 6:53. [PMID: 15762985 PMCID: PMC1274250 DOI: 10.1186/1471-2105-6-53] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2004] [Accepted: 03/11/2005] [Indexed: 11/29/2022] Open
Abstract
Background Gene duplication and gene loss during the evolution of eukaryotes have hindered attempts to estimate phylogenies and divergence times of species. Although current methods that identify clusters of orthologous genes in complete genomes have helped to investigate gene function and gene content, they have not been optimized for evolutionary sequence analyses requiring strict orthology and complete gene matrices. Here we adopt a relatively simple and fast genome comparison approach designed to assemble orthologs for evolutionary analysis. Our approach identifies single-copy genes representing only species divergences (panorthologs) in order to minimize potential errors caused by gene duplication. We apply this approach to complete sets of proteins from published eukaryote genomes specifically for phylogeny and time estimation. Results Despite the conservative criterion used, 753 panorthologs (proteins) were identified for evolutionary analysis with four genomes, resulting in a single alignment of 287,000 amino acids. With this data set, we estimate that the divergence between deuterostomes and arthropods took place in the Precambrian, approximately 400 million years before the first appearance of animals in the fossil record. Additional analyses were performed with seven, 12, and 15 eukaryote genomes resulting in similar divergence time estimates and phylogenies. Conclusion Our results with available eukaryote genomes agree with previous results using conventional methods of sequence data assembly from genomes. They show that large sequence data sets can be generated relatively quickly and efficiently for evolutionary analyses of complete genomes.
Collapse
Affiliation(s)
- Jaime E Blair
- NASA Astrobiology Institute and Department of Biology, The Pennsylvania State University, 208 Mueller Laboratory, University Park, Pennsylvania 16802-5301, USA
| | - Prachi Shah
- NASA Astrobiology Institute and Department of Biology, The Pennsylvania State University, 208 Mueller Laboratory, University Park, Pennsylvania 16802-5301, USA
| | - S Blair Hedges
- NASA Astrobiology Institute and Department of Biology, The Pennsylvania State University, 208 Mueller Laboratory, University Park, Pennsylvania 16802-5301, USA
| |
Collapse
|
111
|
Rokas A, Carroll SB. More genes or more taxa? The relative contribution of gene number and taxon number to phylogenetic accuracy. Mol Biol Evol 2005; 22:1337-44. [PMID: 15746014 DOI: 10.1093/molbev/msi121] [Citation(s) in RCA: 245] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The relative contribution of taxon number and gene number to accuracy in phylogenetic inference is a major issue in phylogenetics and of central importance to the choice of experimental strategies for the successful reconstruction of a broad sketch of the tree of life. Maximization of the number of taxa sampled is the strategy favored by most phylogeneticists, although its necessity remains the subject of debate. Vast increases in gene number are now possible due to advances in genomics, but large numbers of genes will be available for only modest numbers of taxa, raising the question of whether such genome-scale phylogenies will be robust to the addition of taxa. To examine the relative benefit of increasing taxon number or gene number to phylogenetic accuracy, we have developed an assay that utilizes the symmetric difference tree distance as a measure of phylogenetic accuracy. We have applied this assay to a genome-scale data matrix containing 106 genes from 14 yeast species. Our results show that increasing taxon number correlates with a slight decrease in phylogenetic accuracy. In contrast, increasing gene number has a significant positive effect on phylogenetic accuracy. Analyses of an additional taxon-rich data matrix from the same yeast clade show that taxon number does not have a significant effect on phylogenetic accuracy. The positive effect of gene number and the lack of effect of taxon number on phylogenetic accuracy are also corroborated by analyses of two data matrices from mammals and angiosperm plants, respectively. We conclude that, for typical data sets, the number of genes utilized may be a more important determinant of phylogenetic accuracy than taxon number.
Collapse
Affiliation(s)
- Antonis Rokas
- Howard Hughes Medical Institute and Laboratory of Molecular Biology, University of Wisconsin-Madison, USA
| | | |
Collapse
|
112
|
Philippe H, Lartillot N, Brinkmann H. Multigene analyses of bilaterian animals corroborate the monophyly of Ecdysozoa, Lophotrochozoa, and Protostomia. Mol Biol Evol 2005; 22:1246-53. [PMID: 15703236 DOI: 10.1093/molbev/msi111] [Citation(s) in RCA: 354] [Impact Index Per Article: 18.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Almost a decade ago, a new phylogeny of bilaterian animals was inferred from small-subunit ribosomal RNA (rRNA) that claimed the monophyly of two major groups of protostome animals: Ecdysozoa (e.g., arthropods, nematodes, onychophorans, and tardigrades) and Lophotrochozoa (e.g., annelids, molluscs, platyhelminths, brachiopods, and rotifers). However, it received little additional support. In fact, several multigene analyses strongly argued against this new phylogeny. These latter studies were based on a large amount of sequence data and therefore showed an apparently strong statistical support. Yet, they covered only a few taxa (those for which complete genomes were available), making systematic artifacts of tree reconstruction more probable. Here we expand this sparse taxonomic sampling and analyze a large data set (146 genes, 35,371 positions) from a diverse sample of animals (35 species). Our study demonstrates that the incongruences observed between rRNA and multigene analyses were indeed due to long-branch attraction artifacts, illustrating the enormous impact of systematic biases on phylogenomic studies. A refined analysis of our data set excluding the most biased genes provides strong support in favor of the new animal phylogeny and in addition suggests that urochordates are more closely related to vertebrates than are cephalochordates. These findings have important implications for the interpretation of morphological and genomic data.
Collapse
Affiliation(s)
- Hervé Philippe
- Canadian Institute for Advanced Research and Département de Biochimie, Université de Montréal, Montréal, Québec, Canada.
| | | | | |
Collapse
|
113
|
Philip GK, Creevey CJ, McInerney JO. The Opisthokonta and the Ecdysozoa May Not Be Clades: Stronger Support for the Grouping of Plant and Animal than for Animal and Fungi and Stronger Support for the Coelomata than Ecdysozoa. Mol Biol Evol 2005; 22:1175-84. [PMID: 15703245 DOI: 10.1093/molbev/msi102] [Citation(s) in RCA: 133] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
In considering the best possible solutions for answering phylogenetic questions from genomic sequences, we have chosen a strategy that we suggest is superior to others that have gone previously. We have ignored multigene families and instead have used single-gene families. This minimizes the inadvertent analysis of paralogs. We have employed strict data controls and have reasoned that if a protein is not capable of recovering the uncontroversial parts of a phylogenetic tree, then why should we use it for the more controversial parts? We have sliced and diced the data in as many ways as possible in order to uncover the signals in that data. Using this strategy, we have tested two controversial hypotheses concerning eukaryotic phylogenetic relationships: the placement of arthropoda and nematodes and the relationships of animals, plants, and fungi. We have constructed phylogenetic trees from 780 single-gene families from 10 completed genomes and amalgamated these into a single supertree. We have also carried out a total evidence analysis on the only universally distributed protein families that can accurately reconstruct the uncontroversial parts of the phylogenetic tree: a total of five families. In doing so, we ignore the majority of single-gene families that are universally distributed as they do not have the appropriate signals to recover the uncontroversial parts of the tree. We have also ignored every protein that has ever been used previously to address this issue, simply because none of them meet our strict criteria. Using these data controls, site stripping, and multiple analyses, 24 out of 26 analyses strongly support the grouping of vertebrates with arthropods (Coelomata hypothesis) and plants with animals. In the other two analyses, the data were ambivalent. The latter finding overturns an 11-year theory of Eukaryotic evolution; the first confirms what has already been said by others. In the light of this new tree, we re-analyze the evolution of intron gain and loss in the rpL14 gene and find that it is much more compatible with the hypothesis presented here than with the Opisthokonta hypothesis.
Collapse
Affiliation(s)
- Gayle K Philip
- Department of Biology, National University of Ireland Maynooth, Co. Kildare, Ireland
| | | | | |
Collapse
|
114
|
Abstract
We use the pattern of intron conservation in 684 groups of orthologs from seven fully sequenced eukaryotic genomes to provide maximum likelihood estimates of the number of introns present in the same orthologs in various eukaryotic ancestors. We find: (i) intron density in the plant-animal ancestor was high, perhaps two-thirds that of humans and three times that of Drosophila; and (ii) intron density in the ancestral bilateran was also high, equaling that of humans and four times that of Drosophila. We further find that modern introns are generally very old, with two-thirds of modern bilateran introns dating to the ancestral bilateran and two-fifths of modern plant, animal, and fungus introns dating to the plant-animal ancestor. Intron losses outnumber gains over a large range of eukaryotic lineages. These results show that early eukaryotic gene structures were very complex, and that simplification, not embellishment, has dominated subsequent evolution.
Collapse
Affiliation(s)
- Scott W Roy
- Department of Molecular and Cellular Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA.
| | | |
Collapse
|
115
|
Abstract
We studied intron loss in 684 groups of orthologous genes from seven fully sequenced eukaryotic genomes. We found that introns closer to the 3' ends of genes are preferentially lost, as predicted if introns are lost through gene conversion with a reverse transcriptase product of a spliced mRNA. Adjacent introns tend to be lost in concert, as expected if such events span multiple intron positions. Directly contrary to the expectations of some, introns that do not interrupt codons (phase zero) are more, not less, likely to be lost, an intriguing and previously unappreciated result. Adjacent introns with matching phases are not more likely to be retained, as would be expected if they enjoyed a relative selective advantage. The findings of 3' and phase zero intron loss biases are in direct contradiction to an extremely recent study of fungi intron evolution. All patterns are less pronounced in the lineage leading to Caenorhabditis elegans, suggesting that the process of intron loss may be qualitatively different in nematodes. Our results support a reverse transcriptase-mediated model of intron loss.
Collapse
Affiliation(s)
- Scott W Roy
- Department of Molecular and Cellular Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA 02138, USA.
| | | |
Collapse
|
116
|
Ogura A, Ikeo K, Gojobori T. Estimation of ancestral gene set of bilaterian animals and its implication to dynamic change of gene content in bilaterian evolution. Gene 2005; 345:65-71. [PMID: 15716111 DOI: 10.1016/j.gene.2004.11.036] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2004] [Revised: 11/17/2004] [Accepted: 11/23/2004] [Indexed: 11/17/2022]
Abstract
To understand the process of bilaterian evolution, we estimated ancestral gene sets at the split of plant-animal-fungi and the divergence of bilaterian animals and from 1,236,790 non-redundant genes. We, then, examined how the numbers of the gene clusters have changed since the split. As a result, we estimated the numbers of gene clusters in the ancestral gene sets of plant-animal-fungi and bilaterian animals to be at least 2469 and 6577, respectively. Thus, we found a 2.7-fold increase in the number of gene clusters during the period from the evolutionary split of plant-animal-fungi to the divergence of bilaterian animals. Moreover, when we compared these numbers of ancestral gene clusters with those of extant animals such as the nematode, fly, mouse and human, we found that the extant bilaterian animals have retained more than 3500 gene clusters of the ancestral gene set, and have lost more than 1600 gene clusters. It suggests that these processes of genomic diversification provided bilaterian animals with molecular basis for species diversity.
Collapse
Affiliation(s)
- Atsushi Ogura
- Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Yata 1111, Mishima, Shizuoka 411-8540, Japan
| | | | | |
Collapse
|
117
|
Gadagkar SR, Rosenberg MS, Kumar S. Inferring species phylogenies from multiple genes: Concatenated sequence tree versus consensus gene tree. JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION 2005; 304:64-74. [PMID: 15593277 DOI: 10.1002/jez.b.21026] [Citation(s) in RCA: 281] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Phylogenetic trees from multiple genes can be obtained in two fundamentally different ways. In one, gene sequences are concatenated into a super-gene alignment, which is then analyzed to generate the species tree. In the other, phylogenies are inferred separately from each gene, and a consensus of these gene phylogenies is used to represent the species tree. Here, we have compared these two approaches by means of computer simulation, using 448 parameter sets, including evolutionary rate, sequence length, base composition, and transition/transversion rate bias. In these simulations, we emphasized a worst-case scenario analysis in which 100 replicate datasets for each evolutionary parameter set (gene) were generated, and the replicate dataset that produced a tree topology showing the largest number of phylogenetic errors was selected to represent that parameter set. Both randomly selected and worst-case replicates were utilized to compare the consensus and concatenation approaches primarily using the neighbor-joining (NJ) method. We find that the concatenation approach yields more accurate trees, even when the sequences concatenated have evolved with very different substitution patterns and no attempts are made to accommodate these differences while inferring phylogenies. These results appear to hold true for parsimony and likelihood methods as well. The concatenation approach shows >95% accuracy with only 10 genes. However, this gain in accuracy is sometimes accompanied by reinforcement of certain systematic biases, resulting in spuriously high bootstrap support for incorrect partitions, whether we employ site, gene, or a combined bootstrap resampling approach. Therefore, it will be prudent to report the number of individual genes supporting an inferred clade in the concatenated sequence tree, in addition to the bootstrap support.
Collapse
|
118
|
|
119
|
Mousley A, Marks NJ, Maule AG. Neuropeptide signalling: a repository of targets for novel endectocides? Trends Parasitol 2004; 20:482-7. [PMID: 15363442 DOI: 10.1016/j.pt.2004.07.011] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
The only available parasiticides with a spectrum of action that includes a broad range of helminth and arthropod parasites are the macrocyclic lactones. Designated endectocides, these drugs have action against both endoparasitic nematodes and ectoparasitic arthropods. Unfortunately, the discovery of such drugs is exceedingly rare and there is no evidence that novel endectocidal agents will be identified and developed in the short to medium term. However, the discovery of neuropeptides with motor-modulatory activities in both arthropods and helminths, coupled with recent progress in the characterization of invertebrate neuropeptide receptors, has the potential to propel neuropeptide signalling to the forefront of efforts to develop a novel endectocide.
Collapse
Affiliation(s)
- Angela Mousley
- Parasitology Research Group, School of Biology and Biochemistry, Queen's University Belfast, 97 Lisburn Rd, Belfast, Northern Ireland, BT9 7BL, UK
| | | | | |
Collapse
|
120
|
Stuart GW, Berry MW. An SVD-based comparison of nine whole eukaryotic genomes supports a coelomate rather than ecdysozoan lineage. BMC Bioinformatics 2004; 5:204. [PMID: 15606920 PMCID: PMC544558 DOI: 10.1186/1471-2105-5-204] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2004] [Accepted: 12/17/2004] [Indexed: 11/24/2022] Open
Abstract
Background Eukaryotic whole genome sequences are accumulating at an impressive rate. Effective methods for comparing multiple whole eukaryotic genomes on a large scale are needed. Most attempted solutions involve the production of large scale alignments, and many of these require a high stringency pre-screen for putative orthologs in order to reduce the effective size of the dataset and provide a reasonably high but unknown fraction of correctly aligned homologous sites for comparison. As an alternative, highly efficient methods that do not require the pre-alignment of operationally defined orthologs are also being explored. Results A non-alignment method based on the Singular Value Decomposition (SVD) was used to compare the predicted protein complement of nine whole eukaryotic genomes ranging from yeast to man. This analysis resulted in the simultaneous identification and definition of a large number of well conserved motifs and gene families, and produced a species tree supporting one of two conflicting hypotheses of metazoan relationships. Conclusions Our SVD-based analysis of the entire protein complement of nine whole eukaryotic genomes suggests that highly conserved motifs and gene families can be identified and effectively compared in a single coherent definition space for the easy extraction of gene and species trees. While this occurs without the explicit definition of orthologs or homologous sites, the analysis can provide a basis for these definitions.
Collapse
Affiliation(s)
- Gary W Stuart
- Department of Life Sciences, Indiana State University, Terre Haute, IN 47809, USA
- Visiting Scientist, Center for Genomics and Bioinformatics, Indiana University, Bloomington, IN 47405, USA
| | - Michael W Berry
- Department of Computer Science, University of Tennessee, Knoxville TN 37996-3450, USA
| |
Collapse
|
121
|
Affiliation(s)
- Kenneth M. Halanych
- Department of Biological Sciences, Auburn University, Auburn, Alabama 36849;
| |
Collapse
|
122
|
Abstract
MOTIVATION In this paper, we shall examine the evolution of domain architectures across 62 genomes of known phylogeny including all kingdoms of life. We look in particular at the possibility of convergent evolution, with a view to determining the extent to which the architectures observed in the genomes are due to functional necessity or evolutionary descent. We used domains of known structure, because from this and other information we know their evolutionary relationships. We use a range of methods including phylogenetic grouping, sequence similarity/alignment, mutation rates and comparative genomics to approach this difficult problem from several angles. RESULTS Although we do not claim an exhaustive analysis, we conclude that between 0.4 and 4% of sequences are involved in convergent evolution of domain architectures, and expect the actual number to be close to the lower bound. We also made two incidental observations, albeit on a small sample: the events leading to convergent evolution appear to be random with no functional or structural preferences, and changes in the number of tandem repeat domains occur more readily than changes which alter the domain composition. CONCLUSION The principal conclusion is that the observed domain architectures of the sequences in the genomes are driven by evolutionary descent rather than functional necessity. CONTACT gough@supfam.org.
Collapse
Affiliation(s)
- Julian Gough
- RIKEN Genomic Sciences Centre, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama 230-0045, Japan.
| |
Collapse
|
123
|
Wallberg A, Thollesson M, Farris JS, Jondelius U. The phylogenetic position of the comb jellies (Ctenophora) and the importance of taxonomic sampling. Cladistics 2004; 20:558-578. [DOI: 10.1111/j.1096-0031.2004.00041.x] [Citation(s) in RCA: 119] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
|
124
|
Novichkov PS, Omelchenko MV, Gelfand MS, Mironov AA, Wolf YI, Koonin EV. Genome-wide molecular clock and horizontal gene transfer in bacterial evolution. J Bacteriol 2004; 186:6575-85. [PMID: 15375139 PMCID: PMC516599 DOI: 10.1128/jb.186.19.6575-6585.2004] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
We describe a simple theoretical framework for identifying orthologous sets of genes that deviate from a clock-like model of evolution. The approach used is based on comparing the evolutionary distances within a set of orthologs to a standard intergenomic distance, which was defined as the median of the distribution of the distances between all one-to-one orthologs. Under the clock-like model, the points on a plot of intergenic distances versus intergenomic distances are expected to fit a straight line. A statistical technique to identify significant deviations from the clock-like behavior is described. For several hundred analyzed orthologous sets representing three well-defined bacterial lineages, the alpha-Proteobacteria, the gamma-Proteobacteria, and the Bacillus-Clostridium group, the clock-like null hypothesis could not be rejected for approximately 70% of the sets, whereas the rest showed substantial anomalies. Subsequent detailed phylogenetic analysis of the genes with the strongest deviations indicated that over one-half of these genes probably underwent a distinct form of horizontal gene transfer, xenologous gene displacement, in which a gene is displaced by an ortholog from a different lineage. The remaining deviations from the clock-like model could be explained by lineage-specific acceleration of evolution. The results indicate that although xenologous gene displacement is a major force in bacterial evolution, a significant majority of orthologous gene sets in three major bacterial lineages evolved in accordance with the clock-like model. The approach described here allows rapid detection of deviations from this mode of evolution on the genome scale.
Collapse
Affiliation(s)
- Pavel S Novichkov
- Department of Bioengineering and Bioinformatics, Moscow State University, Russia
| | | | | | | | | | | |
Collapse
|
125
|
Krauss V, Pecyna M, Kurz K, Sass H. Phylogenetic Mapping of Intron Positions: A Case Study of Translation Initiation Factor eIF2γ. Mol Biol Evol 2004; 22:74-84. [PMID: 15356279 DOI: 10.1093/molbev/msh255] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Eukaryotic translation initiation factor 2 (eIF2) is a G protein that delivers the methionyl initiator tRNA to the small ribosomal subunit and releases it upon GTP hydrolysis after the recognition of the initiation codon. eIF2 is composed of three subunits, alpha, beta, and gamma. Subunit gamma shows the strongest conservation, and it confers both tRNA and GTP/GDP binding. Using intron positioning and protein sequence alignment, here we show that eIF2gamma is a suitable phylogenetic marker for eukaryotes. We determined or completed the sequences of 13 arthropod eIF2gamma genes. Analyzing the phylogenetic distribution of 52 different intron positions in 55 distantly related eIF2gamma genes, we identified ancient ones and shared derived introns in our data set. Obviously, intron positioning in eIF2gamma is evolutionarily conserved. However, there were episodes of complete and partial intron losses followed by intron gains. We identified 17 clusters of intron positions based on their distribution. The evolution of these clusters appears to be connected with preferred exon length and can be used to estimate the relative timing of intron gain because nearby precursor introns had to be erased from the gene before the new introns could be inserted. Moreover, we identified a putative case of intron sliding that constitutes a synapomorphic character state supporting monophyly of Coleoptera, Lepidoptera, and Diptera excluding Hymenoptera. We also performed tree reconstructions using the eIF2gamma protein sequences and intron positioning as phylogenetic information. Our results support the monophyly of Viridoplantae, Ascomycota, Homobasidiomyceta, and Apicomplexa.
Collapse
Affiliation(s)
- Veiko Krauss
- Department of Genetics, University of Leipzig, Leipzig, Germany.
| | | | | | | |
Collapse
|
126
|
Philippe H, Snell EA, Bapteste E, Lopez P, Holland PWH, Casane D. Phylogenomics of Eukaryotes: Impact of Missing Data on Large Alignments. Mol Biol Evol 2004; 21:1740-52. [PMID: 15175415 DOI: 10.1093/molbev/msh182] [Citation(s) in RCA: 313] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Resolving the relationships between Metazoa and other eukaryotic groups as well as between metazoan phyla is central to the understanding of the origin and evolution of animals. The current view is based on limited data sets, either a single gene with many species (e.g., ribosomal RNA) or many genes but with only a few species. Because a reliable phylogenetic inference simultaneously requires numerous genes and numerous species, we assembled a very large data set containing 129 orthologous proteins ( approximately 30,000 aligned amino acid positions) for 36 eukaryotic species. Included in the alignments are data from the choanoflagellate Monosiga ovata, obtained through the sequencing of about 1,000 cDNAs. We provide conclusive support for choanoflagellates as the closest relative of animals and for fungi as the second closest. The monophyly of Plantae and chromalveolates was recovered but without strong statistical support. Within animals, in contrast to the monophyly of Coelomata observed in several recent large-scale analyses, we recovered a paraphyletic Coelamata, with nematodes and platyhelminths nested within. To include a diverse sample of organisms, data from EST projects were used for several species, resulting in a large amount of missing data in our alignment (about 25%). By using different approaches, we verify that the inferred phylogeny is not sensitive to these missing data. Therefore, this large data set provides a reliable phylogenetic framework for studying eukaryotic and animal evolution and will be easily extendable when large amounts of sequence information become available from a broader taxonomic range.
Collapse
Affiliation(s)
- Hervé Philippe
- School of Animal and Microbial Sciences, The University of Reading, Reading, UK.
| | | | | | | | | | | |
Collapse
|
127
|
Iyer LM, Aravind L, Coon SL, Klein DC, Koonin EV. Evolution of cell-cell signaling in animals: did late horizontal gene transfer from bacteria have a role? Trends Genet 2004; 20:292-9. [PMID: 15219393 DOI: 10.1016/j.tig.2004.05.007] [Citation(s) in RCA: 145] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Affiliation(s)
- Lakshminarayan M Iyer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | | | |
Collapse
|
128
|
Abstract
The relationship between vertebrates and the principal model invertebrates - fruitflies and nematodes - is unclear. A fly-nematode grouping was becoming widely accepted, but recent comparisons of their genomes argue against this and link flies with the vertebrates instead.
Collapse
Affiliation(s)
- Maximilian J Telford
- Department of Biology, University College London, Darwin Building, Gower Street, London WC1E 6BT, UK
| |
Collapse
|
129
|
Bertrand S, Brunet FG, Escriva H, Parmentier G, Laudet V, Robinson-Rechavi M. Evolutionary Genomics of Nuclear Receptors: From Twenty-Five Ancestral Genes to Derived Endocrine Systems. Mol Biol Evol 2004; 21:1923-37. [PMID: 15229292 DOI: 10.1093/molbev/msh200] [Citation(s) in RCA: 258] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Bilaterian animals are notably characterized by complex endocrine systems. The receptors for many steroids, retinoids, and other hormones belong to the superfamily of nuclear receptors, which are transcription factors regulating many aspects of development and homeostasis. Despite a diversity of regulatory mechanisms and physiological roles, nuclear receptors share a common protein organization. To obtain the broad picture of bilaterian nuclear hormone receptor evolution, we have characterized the complete set of nuclear receptor genes from nine animal genome sequences and analyzed it in a phylogenetic framework. In addition, expressed sequence tags from key lineages with no available genome sequence were also searched. This allows us to date the evolutionary events that led from an ancestral nuclear receptor gene, in an early metazoan, to present day diversity. We show that there were approximately 25 nuclear receptor genes in Urbilateria, the ancestor of bilaterians, at which point the fundamental diversity of the subfamily was already established. Surprisingly, differential gene loss played an important role in the evolution of different nuclear receptor sets in bilaterian lineages. The nuclear receptor distribution was also shaped by periods of gene duplication, essentially in vertebrates, as well as a lineage-specific duplication burst in nematodes. Our results imply that the genes for major receptors such as steroid receptors or thyroid hormone receptors were present in Urbilateria.
Collapse
Affiliation(s)
- Stéphanie Bertrand
- Laboratoire de Biologie Moléculaire de la Cellule, Ecole Normale Supérieure de Lyon, Lyon, France
| | | | | | | | | | | |
Collapse
|
130
|
Takezaki N, Figueroa F, Zaleska-Rutczynska Z, Takahata N, Klein J. The phylogenetic relationship of tetrapod, coelacanth, and lungfish revealed by the sequences of forty-four nuclear genes. Mol Biol Evol 2004; 21:1512-24. [PMID: 15128875 DOI: 10.1093/molbev/msh150] [Citation(s) in RCA: 101] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The origin of tetrapods is a major outstanding issue in vertebrate phylogeny. Each of the three possible principal hypotheses (coelacanth, lungfish, or neither being the sister group of tetrapods) has found support in different sets of data. In an attempt to resolve the controversy, sequences of 44 nuclear genes encoding amino acid residues at 10,404 positions were obtained and analyzed. However, this large set of sequences did not support conclusively one of the three hypotheses. Apparently, the coelacanth, lungfish, and tetrapod lineages diverged within such a short time interval that at this level of analysis, their relationships appear to be an irresolvable trichotomy.
Collapse
Affiliation(s)
- Naoko Takezaki
- Max-Planck-Institut für Biologie, Abteilung Immungenetik, Corrensstrasse Tübingen, Germany.
| | | | | | | | | |
Collapse
|
131
|
Koonin EV, Fedorova ND, Jackson JD, Jacobs AR, Krylov DM, Makarova KS, Mazumder R, Mekhedov SL, Nikolskaya AN, Rao BS, Rogozin IB, Smirnov S, Sorokin AV, Sverdlov AV, Vasudevan S, Wolf YI, Yin JJ, Natale DA. A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes. Genome Biol 2004; 5:R7. [PMID: 14759257 PMCID: PMC395751 DOI: 10.1186/gb-2004-5-2-r7] [Citation(s) in RCA: 676] [Impact Index Per Article: 33.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2003] [Revised: 12/01/2003] [Accepted: 12/04/2003] [Indexed: 11/10/2022] Open
Abstract
We examined functional and evolutionary patterns in the recently constructed set of 5,873 clusters of predicted orthologs from seven eukaryotic genomes. The analysis reveals a conserved core of largely essential eukaryotic genes as well as major diversification and innovation associated with evolution of eukaryotic genomes. Background Sequencing the genomes of multiple, taxonomically diverse eukaryotes enables in-depth comparative-genomic analysis which is expected to help in reconstructing ancestral eukaryotic genomes and major events in eukaryotic evolution and in making functional predictions for currently uncharacterized conserved genes. Results We examined functional and evolutionary patterns in the recently constructed set of 5,873 clusters of predicted orthologs (eukaryotic orthologous groups or KOGs) from seven eukaryotic genomes: Caenorhabditis elegans, Drosophila melanogaster, Homo sapiens, Arabidopsis thaliana, Saccharomyces cerevisiae, Schizosaccharomyces pombe and Encephalitozoon cuniculi. Conservation of KOGs through the phyletic range of eukaryotes strongly correlates with their functions and with the effect of gene knockout on the organism's viability. The approximately 40% of KOGs that are represented in six or seven species are enriched in proteins responsible for housekeeping functions, particularly translation and RNA processing. These conserved KOGs are often essential for survival and might approximate the minimal set of essential eukaryotic genes. The 131 single-member, pan-eukaryotic KOGs we identified were examined in detail. For around 20 that remained uncharacterized, functions were predicted by in-depth sequence analysis and examination of genomic context. Nearly all these proteins are subunits of known or predicted multiprotein complexes, in agreement with the balance hypothesis of evolution of gene copy number. Other KOGs show a variety of phyletic patterns, which points to major contributions of lineage-specific gene loss and the 'invention' of genes new to eukaryotic evolution. Examination of the sets of KOGs lost in individual lineages reveals co-elimination of functionally connected genes. Parsimonious scenarios of eukaryotic genome evolution and gene sets for ancestral eukaryotic forms were reconstructed. The gene set of the last common ancestor of the crown group consists of 3,413 KOGs and largely includes proteins involved in genome replication and expression, and central metabolism. Only 44% of the KOGs, mostly from the reconstructed gene set of the last common ancestor of the crown group, have detectable homologs in prokaryotes; the remainder apparently evolved via duplication with divergence and invention of new genes. Conclusions The KOG analysis reveals a conserved core of largely essential eukaryotic genes as well as major diversification and innovation associated with evolution of eukaryotic genomes. The results provide quantitative support for major trends of eukaryotic evolution noticed previously at the qualitative level and a basis for detailed reconstruction of evolution of eukaryotic genomes and biology of ancestral forms.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|