Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Liberles DA, Schreiber DR, Govindarajan S, Chamberlin SG, Benner SA. The adaptive evolution database (TAED). Genome Biol 2001;2:RESEARCH0028. [PMID: 11532212 PMCID: PMC55325 DOI: 10.1186/gb-2001-2-8-research0028] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2001] [Revised: 05/21/2001] [Accepted: 06/06/2001] [Indexed: 11/24/2022] Open

For:	Liberles DA, Schreiber DR, Govindarajan S, Chamberlin SG, Benner SA. The adaptive evolution database (TAED). Genome Biol 2001;2:RESEARCH0028. [PMID: 11532212 PMCID: PMC55325 DOI: 10.1186/gb-2001-2-8-research0028] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2001] [Revised: 05/21/2001] [Accepted: 06/06/2001] [Indexed: 11/24/2022] Open

Number

Cited by Other Article(s)

Northover DE, Shank SD, Liberles DA. Characterizing lineage-specific evolution and the processes driving genomic diversification in chordates. BMC Evol Biol 2020;20:24. [PMID: 32046633 PMCID: PMC7011509 DOI: 10.1186/s12862-020-1585-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2019] [Accepted: 01/16/2020] [Indexed: 11/21/2022] Open

Abstract

Background

Understanding the origins of genome content has long been a goal of molecular evolution and comparative genomics. By examining genome evolution through the guise of lineage-specific evolution, it is possible to make inferences about the evolutionary events that have given rise to species-specific diversification. Here we characterize the evolutionary trends found in chordate species using The Adaptive Evolution Database (TAED). TAED is a database of phylogenetically indexed gene families designed to detect episodes of directional or diversifying selection across chordates. Gene families within the database have been assessed for lineage-specific estimates of dN/dS and have been reconciled to the chordate species to identify retained duplicates. Gene families have also been mapped to the functional pathways and amino acid changes which occurred on high dN/dS lineages have been mapped to protein structures.

Results

An analysis of this exhaustive database has enabled a characterization of the processes of lineage-specific diversification in chordates. A pathway level enrichment analysis of TAED determined that pathways most commonly found to have elevated rates of evolution included those involved in metabolism, immunity, and cell signaling. An analysis of protein fold presence on proteins, after normalizing for frequency in the database, found common folds such as Rossmann folds, Jelly Roll folds, and TIM barrels were overrepresented on proteins most likely to undergo directional selection. A set of gene families which experience increased numbers of duplications within short evolutionary times are associated with pathways involved in metabolism, olfactory reception, and signaling. An analysis of protein secondary structure indicated more relaxed constraint in β-sheets and stronger constraint on alpha Helices, amidst a general preference for substitutions at exposed sites. Lastly a detailed analysis of the ornithine decarboxylase gene family, a key enzyme in the pathway for polyamine synthesis, revealed lineage-specific evolution along the lineage leading to Cetacea through rapid sequence evolution in a duplicate gene with amino acid substitutions causing active site rearrangement.

Conclusion

Episodes of lineage-specific evolution are frequent throughout chordate species. Both duplication and directional selection have played large roles in the evolution of the phylum. TAED is a powerful tool for facilitating this understanding of lineage-specific evolution.

Collapse

The Adaptive Evolution Database (TAED): A New Release of a Database of Phylogenetically Indexed Gene Families from Chordates. J Mol Evol 2017;85:46-56. [PMID: 28795237 DOI: 10.1007/s00239-017-9806-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 08/03/2017] [Indexed: 12/11/2022]

Simakov O, Kawashima T. Independent evolution of genomic characters during major metazoan transitions. Dev Biol 2016;427:179-192. [PMID: 27890449 DOI: 10.1016/j.ydbio.2016.11.012] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2016] [Revised: 11/08/2016] [Accepted: 11/14/2016] [Indexed: 02/03/2023]

Extracting functional trends from whole genome duplication events using comparative genomics. Biol Proced Online 2016;18:11. [PMID: 27168732 PMCID: PMC4862183 DOI: 10.1186/s12575-016-0041-2] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2016] [Accepted: 04/24/2016] [Indexed: 01/06/2023] Open

Wang K, Ouyang H, Xie Z, Yao C, Guo N, Li M, Jiao H, Pang D. Efficient Generation of Myostatin Mutations in Pigs Using the CRISPR/Cas9 System. Sci Rep 2015;5:16623. [PMID: 26564781 PMCID: PMC4643223 DOI: 10.1038/srep16623] [Citation(s) in RCA: 110] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2015] [Accepted: 10/16/2015] [Indexed: 12/15/2022] Open

Evidence for positive selection on the leptin gene in Cetacea and Pinnipedia. PLoS One 2011;6:e26579. [PMID: 22046310 PMCID: PMC3203152 DOI: 10.1371/journal.pone.0026579] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2011] [Accepted: 09/29/2011] [Indexed: 01/21/2023] Open

Abstract

The leptin gene has received intensive attention and scientific investigation for its importance in energy homeostasis and reproductive regulation in mammals. Furthermore, study of the leptin gene is of crucial importance for public health, particularly for its role in obesity, as well as for other numerous physiological roles that it plays in mammals. In the present work, we report the identification of novel leptin genes in 4 species of Cetacea, and a comparison with 55 publicly available leptin sequences from mammalian genome assemblies and previous studies. Our study provides evidence for positive selection in the suborder Odontoceti (toothed whales) of the Cetacea and the family Phocidae (earless seals) of the Pinnipedia. We also detected positive selection in several leptin gene residues in these two lineages. To test whether leptin and its receptor evolved in a coordinated manner, we analyzed 24 leptin receptor gene (LPR) sequences from available mammalian genome assemblies and other published data. Unlike the case of leptin, our analyses did not find evidence of positive selection for LPR across the Cetacea and Pinnipedia lineages. In line with this, positively selected sites identified in the leptin genes of these two lineages were located outside of leptin receptor binding sites, which at least partially explains why co-evolution of leptin and its receptor was not observed in the present study. Our study provides interesting insights into current understanding of the evolution of mammalian leptin genes in response to selective pressures from life in an aquatic environment, and leads to a hypothesis that new tissue specificity or novel physiologic functions of leptin genes may have arisen in both odontocetes and phocids. Additional data from other species encompassing varying life histories and functional tests of the adaptive role of the amino acid changes identified in this study will help determine the factors that promote the adaptive evolution of the leptin genes in marine mammals.

Collapse

Liberles DA, Tisdell MDM, Grahnen JA. Binding constraints on the evolution of enzymes and signalling proteins: the important role of negative pleiotropy. Proc Biol Sci 2011;278:1930-5. [PMID: 21490020 DOI: 10.1098/rspb.2010.2637] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open

Kamneva OK, Liberles DA, Ward NL. Genome-wide influence of indel Substitutions on evolution of bacteria of the PVC superphylum, revealed using a novel computational method. Genome Biol Evol 2010;2:870-86. [PMID: 21048002 PMCID: PMC3000692 DOI: 10.1093/gbe/evq071] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open

Rodgers BD, Garikipati DK. Clinical, agricultural, and evolutionary biology of myostatin: a comparative review. Endocr Rev 2008;29:513-34. [PMID: 18591260 PMCID: PMC2528853 DOI: 10.1210/er.2008-0003] [Citation(s) in RCA: 160] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Liberles DA, Dittmar K. Characterizing gene family evolution. Biol Proced Online 2008;10:66-73. [PMID: 19461954 PMCID: PMC2683547 DOI: 10.1251/bpo144] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2007] [Revised: 03/17/2008] [Accepted: 04/07/2008] [Indexed: 11/23/2022] Open

Dorman KS. Identifying dramatic selection shifts in phylogenetic trees. BMC Evol Biol 2007;7 Suppl 1:S10. [PMID: 17288568 PMCID: PMC1796604 DOI: 10.1186/1471-2148-7-s1-s10] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Ardawatia H, Liberles DA. A systematic analysis of lineage-specific evolution in metabolic pathways. Gene 2007;387:67-74. [PMID: 17034962 DOI: 10.1016/j.gene.2006.08.013] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2006] [Revised: 07/30/2006] [Accepted: 08/10/2006] [Indexed: 12/29/2022]

Romanish MT, Lock WM, van de Lagemaat LN, Dunn CA, Mager DL. Repeated recruitment of LTR retrotransposons as promoters by the anti-apoptotic locus NAIP during mammalian evolution. PLoS Genet 2006;3:e10. [PMID: 17222062 PMCID: PMC1781489 DOI: 10.1371/journal.pgen.0030010] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2006] [Accepted: 12/05/2006] [Indexed: 12/19/2022] Open

Abstract

Neuronal apoptosis inhibitory protein (NAIP, also known as BIRC1) is a member of the conserved inhibitor of apoptosis protein (IAP) family. Lineage-specific rearrangements and expansions of this locus have yielded different copy numbers among primates and rodents, with human retaining a single functional copy and mouse possessing several copies, depending on the strain. Roles for this gene in disease have been documented, but little is known about transcriptional regulation of NAIP. We show here that NAIP has multiple promoters sharing no similarity between human and rodents. Moreover, we demonstrate that multiple, domesticated long terminal repeats (LTRs) of endogenous retroviral elements provide NAIP promoter function in human, mouse, and rat. In human, an LTR serves as a tissue-specific promoter, active primarily in testis. However, in rodents, our evidence indicates that an ancestral LTR common to all rodent genes is the major, constitutive promoter for these genes, and that a second LTR found in two of the mouse genes is a minor promoter. Thus, independently acquired LTRs have assumed regulatory roles for orthologous genes, a remarkable evolutionary scenario. We also demonstrate that 5′ flanking regions of IAP family genes as a group, in both human and mouse are enriched for LTR insertions compared to average genes. We propose several potential explanations for these findings, including a hypothesis that recruitment of LTRs near NAIP or other IAP genes may represent a host-cell adaptation to modulate apoptotic responses.

When retroviruses infect cells, the viral DNA inserts into the cellular genome. If this happens in gametes (egg or sperm), the viral DNA will be transmitted from parent to offspring, like all chromosomal DNA. Through evolutionary time, such infections of gametes have been so prevalent that 8%–10% of the normal human and mouse genomes are now composed of ancient viral DNA, termed endogenous retroviruses (ERVs). In human, these ERVs are mutated or “dead” but it has been shown that ERV regulatory regions can be employed by the host to help control expression of cellular genes. Here, we report on a remarkable example of this phenomenon. We demonstrate that both the human and rodent neuronal apoptosis inhibitory protein (NAIP) genes, involved in preventing cell death, use different ERV sequences to drive gene expression. Moreover, in each of the primate and rodent lineages, two separate ERVs contribute to NAIP gene expression. This repeated ERV recruitment by NAIP genes throughout evolution is very unlikely to have occurred by chance. We offer a number of potential explanations, including the intriguing possibility that it may be advantageous for anti-cell death genes like NAIP to use ERVs to control their expression. These results support the view that not all retroviral remnants in our genome are simply junk DNA.

Collapse

Pie MR, Alvares LE. Evolution of myostatin in vertebrates: Is there evidence for positive selection? Mol Phylogenet Evol 2006;41:730-4. [PMID: 16876447 DOI: 10.1016/j.ympev.2006.05.038] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2006] [Revised: 05/17/2006] [Accepted: 05/30/2006] [Indexed: 12/01/2022]

Berglund-Sonnhammer AC, Steffansson P, Betts MJ, Liberles DA. Optimal gene trees from sequences and species trees using a soft interpretation of parsimony. J Mol Evol 2006;63:240-50. [PMID: 16830091 DOI: 10.1007/s00239-005-0096-1] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2005] [Accepted: 04/15/2006] [Indexed: 10/24/2022]

Roth C, Liberles DA. A systematic search for positive selection in higher plants (Embryophytes). BMC PLANT BIOLOGY 2006;6:12. [PMID: 16784532 PMCID: PMC1540423 DOI: 10.1186/1471-2229-6-12] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2006] [Accepted: 06/19/2006] [Indexed: 05/04/2023]

Chen L, Lee C. Distinguishing HIV-1 drug resistance, accessory, and viral fitness mutations using conditional selection pressure analysis of treated versus untreated patient samples. Biol Direct 2006;1:14. [PMID: 16737543 PMCID: PMC1523337 DOI: 10.1186/1745-6150-1-14] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2006] [Accepted: 05/31/2006] [Indexed: 11/18/2022] Open

Abstract

BACKGROUND

HIV can evolve drug resistance rapidly in response to new drug treatments, often through a combination of multiple mutations 123. It would be useful to develop automated analyses of HIV sequence polymorphism that are able to predict drug resistance mutations, and to distinguish different types of functional roles among such mutations, for example, those that directly cause drug resistance, versus those that play an accessory role. Detecting functional interactions between mutations is essential for this classification. We have adapted a well-known measure of evolutionary selection pressure (Ka/Ks) and developed a conditional Ka/Ks approach to detect important interactions.

RESULTS

We have applied this analysis to four independent HIV protease sequencing datasets: 50,000 clinical samples sequenced by Specialty Laboratories, Inc.; 1800 samples from patients treated with protease inhibitors; 2600 samples from untreated patients; 400 samples from untreated African patients. We have identified 428 mutation interactions in Specialty dataset with statistical significance and we were able to distinguish primary vs. accessory mutations for many well-studied examples. Amino acid interactions identified by conditional Ka/Ks matched 80 of 92 pair wise interactions found by a completely independent study of HIV protease (p-value for this match is significant: 10-70). Furthermore, Ka/Ks selection pressure results were highly reproducible among these independent datasets, both qualitatively and quantitatively, suggesting that they are detecting real drug-resistance and viral fitness mutations in the wild HIV-1 population.

CONCLUSION

Conditional Ka/Ks analysis can detect mutation interactions and distinguish primary vs. accessory mutations in HIV-1. Ka/Ks analysis of treated vs. untreated patient data can distinguish drug-resistance vs. viral fitness mutations. Verification of these results would require longitudinal studies. The result provides a valuable resource for AIDS research and will be available for open access upon publication at http://www.bioinformatics.ucla.edu/HIV.

Collapse

Berglund AC, Wallner B, Elofsson A, Liberles DA. Tertiary windowing to detect positive diversifying selection. J Mol Evol 2005;60:499-504. [PMID: 15883884 DOI: 10.1007/s00239-004-0223-4] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2004] [Accepted: 10/20/2004] [Indexed: 12/01/2022]

Bhushan S, Ståhl A, Nilsson S, Lefebvre B, Seki M, Roth C, McWilliam D, Wright SJ, Liberles DA, Shinozaki K, Bruce BD, Boutry M, Glaser E. Catalysis, subcellular localization, expression and evolution of the targeting peptides degrading protease, AtPreP2. PLANT & CELL PHYSIOLOGY 2005;46:985-96. [PMID: 15827031 DOI: 10.1093/pcp/pci107] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Abstract

We have previously identified a zinc metalloprotease involved in the degradation of mitochondrial and chloroplast targeting peptides, the presequence protease (PreP). In the Arabidopsis thaliana genomic database, there are two genes that correspond to the protease, the zinc metalloprotease (AAL90904) and the putative zinc metalloprotease (AAG13049). We have named the corresponding proteins AtPreP1 and AtPreP2, respectively. AtPreP1 and AtPreP2 show significant differences in their targeting peptides and the proteins are predicted to be localized in different compartments. AtPreP1 was shown to degrade both mitochondrial and chloroplast targeting peptides and to be dual targeted to both organelles using an ambiguous targeting peptide. Here, we have overexpressed, purified and characterized proteolytic and targeting properties of AtPreP2. AtPreP2 exhibits different proteolytic subsite specificity from AtPreP1 when used for degradation of organellar targeting peptides and their mutants. Interestingly, AtPreP2 precursor protein was also found to be dual targeted to both mitochondria and chloroplasts in a single and dual in vitro import system. Furthermore, targeting peptide of the AtPreP2 dually targeted green fluorescent protein (GFP) to both mitochondria and chloroplasts in tobacco protoplasts and leaves using an in vivo transient expression system. The targeting of both AtPreP1 and AtPreP2 proteases to chloroplasts in A. thaliana in vivo was confirmed via a shotgun mass spectrometric analysis of highly purified chloroplasts. Reverse transcription-polymerase chain reaction (RT-PCR) analysis revealed that AtPreP1 and AtPreP2 are differentially expressed in mature A. thaliana plants. Phylogenetic evidence indicated that AtPreP1 and AtPreP2 are recent gene duplicates that may have diverged through subfunctionalization.

Collapse

Roth C, Betts MJ, Steffansson P, Saelensminde G, Liberles DA. The Adaptive Evolution Database (TAED): a phylogeny based tool for comparative genomics. Nucleic Acids Res 2005;33:D495-7. [PMID: 15608245 PMCID: PMC540044 DOI: 10.1093/nar/gki090] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Rastogi S, Liberles DA. Subfunctionalization of duplicated genes as a transition state to neofunctionalization. BMC Evol Biol 2005;5:28. [PMID: 15831095 PMCID: PMC1112588 DOI: 10.1186/1471-2148-5-28] [Citation(s) in RCA: 250] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2005] [Accepted: 04/14/2005] [Indexed: 11/10/2022] Open

Tellgren A, Berglund AC, Savolainen P, Janis CM, Liberles DA. Myostatin rapid sequence evolution in ruminants predates domestication. Mol Phylogenet Evol 2005;33:782-90. [PMID: 15522803 DOI: 10.1016/j.ympev.2004.07.004] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2004] [Revised: 05/19/2004] [Indexed: 11/29/2022]

Wong WSW, Yang Z, Goldman N, Nielsen R. Accuracy and power of statistical methods for detecting adaptive evolution in protein coding sequences and for identifying positively selected sites. Genetics 2005;168:1041-51. [PMID: 15514074 PMCID: PMC1448811 DOI: 10.1534/genetics.104.031153] [Citation(s) in RCA: 447] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Braun FN, Liberles DA. Retention of enzyme gene duplicates by subfunctionalization. Int J Biol Macromol 2004;33:19-22. [PMID: 14599579 DOI: 10.1016/s0141-8130(03)00059-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

The planetary biology of cytochrome P450 aromatases. BMC Biol 2004;2:19. [PMID: 15315709 PMCID: PMC515309 DOI: 10.1186/1741-7007-2-19] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2004] [Accepted: 08/17/2004] [Indexed: 11/24/2022] Open

Abstract

Background

Joining a model for the molecular evolution of a protein family to the paleontological and geological records (geobiology), and then to the chemical structures of substrates, products, and protein folds, is emerging as a broad strategy for generating hypotheses concerning function in a post-genomic world. This strategy expands systems biology to a planetary context, necessary for a notion of fitness to underlie (as it must) any discussion of function within a biomolecular system.

Results

Here, we report an example of such an expansion, where tools from planetary biology were used to analyze three genes from the pig Sus scrofa that encode cytochrome P450 aromatases–enzymes that convert androgens into estrogens. The evolutionary history of the vertebrate aromatase gene family was reconstructed. Transition redundant exchange silent substitution metrics were used to interpolate dates for the divergence of family members, the paleontological record was consulted to identify changes in physiology that correlated in time with the change in molecular behavior, and new aromatase sequences from peccary were obtained. Metrics that detect changing function in proteins were then applied, including K_A/K_Svalues and those that exploit structural biology. These identified specific amino acid replacements that were associated with changing substrate and product specificity during the time of presumed adaptive change. The combined analysis suggests that aromatase paralogs arose in pigs as a result of selection for Suoidea with larger litters than their ancestors, and permitted the Suoidea to survive the global climatic trauma that began in the Eocene.

Conclusions

This combination of bioinformatics analysis, molecular evolution, paleontology, cladistics, global climatology, structural biology, and organic chemistry serves as a paradigm in planetary biology. As the geological, paleontological, and genomic records improve, this approach should become widely useful to make systems biology statements about high-level function for biomolecular systems.

Collapse

Hughes T, Hyun Y, Liberles DA. Visualising very large phylogenetic trees in three dimensional hyperbolic space. BMC Bioinformatics 2004;5:48. [PMID: 15117420 PMCID: PMC419335 DOI: 10.1186/1471-2105-5-48] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2004] [Accepted: 04/29/2004] [Indexed: 11/10/2022] Open

Endo T, Ogishima S, Tanaka H. Standardized phylogenetic tree: a reference to discover functional evolution. J Mol Evol 2004;57 Suppl 1:S174-81. [PMID: 15008414 DOI: 10.1007/s00239-003-0025-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Swart EC, Hide WA, Seoighe C. FRAGS: estimation of coding sequence substitution rates from fragmentary data. BMC Bioinformatics 2004;5:8. [PMID: 15005802 PMCID: PMC344743 DOI: 10.1186/1471-2105-5-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2003] [Accepted: 01/29/2004] [Indexed: 01/06/2023] Open

Abstract

Background

Rates of substitution in protein-coding sequences can provide important insights into evolutionary processes that are of biomedical and theoretical interest. Increased availability of coding sequence data has enabled researchers to estimate more accurately the coding sequence divergence of pairs of organisms. However the use of different data sources, alignment protocols and methods to estimate substitution rates leads to widely varying estimates of key parameters that define the coding sequence divergence of orthologous genes. Although complete genome sequence data are not available for all organisms, fragmentary sequence data can provide accurate estimates of substitution rates provided that an appropriate and consistent methodology is used and that differences in the estimates obtainable from different data sources are taken into account.

Results

We have developed FRAGS, an application framework that uses existing, freely available software components to construct in-frame alignments and estimate coding substitution rates from fragmentary sequence data. Coding sequence substitution estimates for human and chimpanzee sequences, generated by FRAGS, reveal that methodological differences can give rise to significantly different estimates of important substitution parameters. The estimated substitution rates were also used to infer upper-bounds on the amount of sequencing error in the datasets that we have analysed.

Conclusion

We have developed a system that performs robust estimation of substitution rates for orthologous sequences from a pair of organisms. Our system can be used when fragmentary genomic or transcript data is available from one of the organisms and the other is a completely sequenced genome within the Ensembl database. As well as estimating substitution statistics our system enables the user to manage and query alignment and substitution data.

Collapse

Choi SS, Lahn BT. Adaptive evolution of MRG, a neuron-specific gene family implicated in nociception. Genome Res 2003;13:2252-9. [PMID: 14525927 PMCID: PMC403691 DOI: 10.1101/gr.1431603] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2003] [Accepted: 08/11/2003] [Indexed: 12/19/2022]

Benner SA, Caraco MD, Thomson JM, Gaucher EA. Planetary biology--paleontological, geological, and molecular histories of life. Science 2002;296:864-8. [PMID: 11988562 DOI: 10.1126/science.1069863] [Citation(s) in RCA: 62] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

Wagner A. Selection and gene duplication: a view from the genome. Genome Biol 2002;3:reviews1012. [PMID: 12049669 PMCID: PMC139360 DOI: 10.1186/gb-2002-3-5-reviews1012] [Citation(s) in RCA: 78] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open

Liberles DA, Wayne ML. Tracking adaptive evolutionary events in genomic sequences. Genome Biol 2002;3:REVIEWS1018. [PMID: 12093382 PMCID: PMC139374 DOI: 10.1186/gb-2002-3-6-reviews1018] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open

Liberles DA. Evaluation of methods for determination of a reconstructed history of gene sequence evolution. Mol Biol Evol 2001;18:2040-7. [PMID: 11606700 DOI: 10.1093/oxfordjournals.molbev.a003745] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open