1
|
Keeling PJ. The impact of history on our perception of evolutionary events: endosymbiosis and the origin of eukaryotic complexity. Cold Spring Harb Perspect Biol 2014; 6:a016196. [PMID: 24492708 PMCID: PMC3941238 DOI: 10.1101/cshperspect.a016196] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Evolutionary hypotheses are correctly interpreted as products of the data they set out to explain, but they are less often recognized as being heavily influenced by other factors. One of these is the history of preceding thought, and here I look back on historically important changes in our thinking about the role of endosymbiosis in the origin of eukaryotic cells. Specifically, the modern emphasis on endosymbiotic explanations for numerous eukaryotic features, including the cell itself (the so-called chimeric hypotheses), can be seen not only as resulting from the advent of molecular and genomic data, but also from the intellectual acceptance of the endosymbiotic origin of mitochondria and plastids. This transformative idea may have unduly affected how other aspects of the eukaryotic cell are explained, in effect priming us to accept endosymbiotic explanations for endogenous processes. Molecular and genomic data, which were originally harnessed to answer questions about cell evolution, now so dominate our thinking that they largely define the question, and the original questions about how eukaryotic cellular architecture evolved have been neglected. This is unfortunate because, as Roger Stanier pointed out, these cellular changes represent life's "greatest single evolutionary discontinuity," and on this basis I advocate a return to emphasizing evolutionary cell biology when thinking about the origin of eukaryotes, and suggest that endogenous explanations will prevail when we refocus on the evolution of the cell.
Collapse
Affiliation(s)
- Patrick J Keeling
- Canadian Institute for Advanced Research, Botany Department, University of British Columbia, Vancouver BC V6T 1Z4, Canada
| |
Collapse
|
2
|
Extraribosomal functions associated with the C terminus of the 37/67 kDa laminin receptor are required for maintaining cell viability. Cell Death Dis 2011; 1:e42. [PMID: 21243100 PMCID: PMC3019570 DOI: 10.1038/cddis.2010.19] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The 37/67 kDa laminin receptor (LAMR) is a multifunctional protein, acting as an extracellular receptor, localizing to the nucleus, and playing roles in rRNA processing and ribosome assembly. LAMR is important for cell viability; however, it is unclear which of its functions are essential. We developed a silent mutant LAMR construct, resistant to siRNA, to rescue the phenotypic effects of knocking down endogenous LAMR, which include inhibition of protein synthesis, cell cycle arrest, and apoptosis. In addition, we generated a C-terminal-truncated silent mutant LAMR construct structurally homologous to the Archaeoglobus fulgidus S2 ribosomal protein and missing the C-terminal 75 residues of LAMR, which displays more sequence divergence. We found that HT1080 cells stably expressing either silent mutant LAMR construct still undergo arrest in the G1 phase of the cell cycle when treated with siRNA. However, the expression of full-length silent mutant LAMR rescues cell viability, whereas the expression of the C-terminal-truncated LAMR does not. Interestingly, we also found that both silent mutant constructs restore protein translation and localize to the nucleus. Our findings indicate that the ability of LAMR to regulate viability is associated with its C-terminal 75 residues. Furthermore, this function is distinct from its role in cell proliferation, independent of its ribosomal functions, and may be regulated by a nonnuclear localization.
Collapse
|
3
|
Omar A, Jovanovic K, Da Costa Dias B, Gonsalves D, Moodley K, Caveney R, Mbazima V, Weiss SFT. Patented biological approaches for the therapeutic modulation of the 37 kDa/67 kDa laminin receptor. Expert Opin Ther Pat 2010; 21:35-53. [DOI: 10.1517/13543776.2011.539203] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
4
|
Abstract
The 67LR (67 kDa laminin receptor) is a cell-surface receptor with high affinity for its primary ligand. Its role as a laminin receptor makes it an important molecule both in cell adhesion to the basement membrane and in signalling transduction following this binding event. The protein also plays critical roles in the metastasis of tumour cells. Isolation of the protein from either normal or cancerous cells results in a product with an approx. molecular mass of 67 kDa. This protein is believed to be derived from a smaller precursor, the 37LRP (37 kDa laminin receptor precursor). However, the precise mechanism by which cytoplasmic 37LRP becomes cell-membrane-embedded 67LR is unclear. The process may involve post-translational fatty acylation of the protein combined with either homo- or hetero-dimerization, possibly with a galectin-3-epitope-containing partner. Furthermore, it has become clear that acting as a receptor for laminin is not the only function of this protein. 67LR also acts as a receptor for viruses, such as Sindbis virus and dengue virus, and is involved with internalization of the prion protein. Interestingly, unmodified 37LRP is a ribosomal component and homologues of this protein are found in all five kingdoms. In addition, it appears to be strongly associated with histones in the eukaryotic cell nucleus, although the precise role of these interactions is not clear. Here we review the current understanding of the structure and function of this molecule, as well as highlighting areas requiring further research.
Collapse
|
5
|
Tabb-Massey A, Caffrey JM, Logsden P, Taylor S, Trent JO, Ellis SR. Ribosomal proteins Rps0 and Rps21 of Saccharomyces cerevisiae have overlapping functions in the maturation of the 3' end of 18S rRNA. Nucleic Acids Res 2003; 31:6798-805. [PMID: 14627813 PMCID: PMC290267 DOI: 10.1093/nar/gkg899] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2003] [Revised: 10/13/2003] [Accepted: 10/13/2003] [Indexed: 12/22/2022] Open
Abstract
The Rps0 proteins of Saccharomyces cerevisiae are components of the 40S ribosomal subunit required for maturation of the 3' end of 18S rRNA. Drosophila and human homologs of the Rps0 proteins physically interact with Rps21 proteins, and decreased expression of both proteins in Drosophila impairs control of cellular proliferation in hematopoietic organs during larval development. Here, we characterize the yeast RPS21A/B genes and show that strains where both genes are disrupted are not viable. Relative to the wild type, cells with disrupted RPS21A or RPS21B genes exhibit a reduction in growth rate, a decrease in free 40S subunits, an increase in the amount of free 60S subunits, and a decrease in polysome size. Ribosomal RNA processing studies reveal RPS21 and RPS0 mutants have virtually identical processing defects. The pattern of processing defects observed in RPS0 and RPS21 mutants is not a general characteristic of strains with suboptimal levels of small subunit ribosomal proteins, since disruption of the RPS18A or RPS18B genes results in related but distinct processing defects. Together, these data link the Rps0 and Rps21 proteins together functionally in promoting maturation of the 3' end of 18S rRNA and formation of active 40S ribosomal subunits.
Collapse
Affiliation(s)
- Amy Tabb-Massey
- Department of Biochemistry and Molecular Biology, University of Louisville, Louisville, KY 40292, USA
| | | | | | | | | | | |
Collapse
|
6
|
|
7
|
Das R, Junker J, Greenbaum D, Gerstein MB. Global perspectives on proteins: comparing genomes in terms of folds, pathways and beyond. THE PHARMACOGENOMICS JOURNAL 2002; 1:115-25. [PMID: 11911438 DOI: 10.1038/sj.tpj.6500021] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The sequencing of complete genomes provides us with a global view of all the proteins in an organism. Proteomic analysis can be done on a purely sequence-based level, with a focus on finding homologues and grouping them into families and clusters of orthologs. However, incorporating protein structure into this analysis provides valuable simplification; it allows one to collect together very distantly related sequences, thus condensing the proteome into a minimal number of 'parts.' We describe issues related to surveying proteomes in terms of structural parts, including methods for fold assignment and formats for comparisons (eg top-10 lists and whole-genome trees), and show how biases in the databases and in sampling can affect these surveys. We illustrate our main points through a case study on the unique protein properties evident in many thermophile genomes (eg more salt bridges). Finally, we discuss metabolic pathways as an even greater simplification of genomes. In comparison to folds these allow the organization of many more genes into coherent systems, yet can nevertheless be understood in many of the same terms.
Collapse
Affiliation(s)
- R Das
- Department of Molecular Biophysics & Biochemistry, Yale University, New Haven, CT 06520, USA
| | | | | | | |
Collapse
|
8
|
Gauczynski S, Hundt C, Leucht C, Weiss S. Interaction of prion proteins with cell surface receptors, molecular chaperones, and other molecules. ADVANCES IN PROTEIN CHEMISTRY 2002; 57:229-72. [PMID: 11447692 DOI: 10.1016/s0065-3233(01)57024-2] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Affiliation(s)
- S Gauczynski
- Laboratorium für Molekulare, Biologie-Genzentrum-Institut für Biochemie der LMU München, D-81377 Munich, Germany
| | | | | | | |
Collapse
|
9
|
Abstract
Annotation, the process by which structural or functional information is inferred for genes or proteins, is crucial for obtaining value from genome sequences. We define the process of annotating a previously annotated genome sequence as 're-annotation', and examine the strengths and weaknesses of current manual and automatic genome-wide re-annotation approaches.
Collapse
Affiliation(s)
- Christos A Ouzounis
- Computational Genomics Group, The European Bioinformatics Institute, EMBL Cambridge Outstation, Cambridge CB10 1SD, UK. E-mail:
| | - Peter D Karp
- Bioinformatics Research Group, AI Center, SRI International, Menlo Park, CA 94025, USA. E-mail:
| |
Collapse
|
10
|
Galperin MY, Grishin NV. The synthetase domains of cobalamin biosynthesis amidotransferases cobB and cobQ belong to a new family of ATP-dependent amidoligases, related to dethiobiotin synthetase. Proteins 2000; 41:238-47. [PMID: 10966576 DOI: 10.1002/1097-0134(20001101)41:2<238::aid-prot80>3.0.co;2-l] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Phosphotransacetylases of Escherichia coli and several other bacteria contain an additional 350-aa N-terminal fragment that is not required for phosphotransacetylase activity. Sequence analysis of this fragment revealed that it is closely related to a family of ATP-dependent enzymes that also includes dethiobiotin synthetase and the synthetase domains of two amidotransferases involved in cobalamin biosynthesis, cobyrinic acid a,c-diamide synthase (CobB) and cobyric acid synthase (CobQ). Further database searches showed that this enzyme family is also related to the MinD family of ATPases involved in regulation of cell division in bacteria and archaea. Analysis of sequence conservation in the members of this enzyme family using the structure of dethiobiotin synthetase active site as a guide allowed us to suggest a model for the interaction of CobB and CobQ with their respective substrates. CobB and CobQ were also found to contain unusual Triad family (class I) glutamine amidotransferase domains with conserved Cys and His residues, but lacking the Glu residue of the catalytic triad. These results should help in understanding the enzymology of cobalamin biosynthesis and in resolving the role of phosphotransacetylase in regulation of the carbon flow to and from acetate.
Collapse
Affiliation(s)
- M Y Galperin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA.
| | | |
Collapse
|
11
|
Abstract
Using the sequences of all the known transcription-associated proteins from Bacteria and Eucarya (a total of 4,147), we have identified their homologous counterparts in the four complete archaeal genomes. Through extensive sequence comparisons, we establish the presence of 280 predicted transcription factors or transcription-associated proteins in the four archaeal genomes, of which 168 have homologs only in Bacteria, 51 have homologs only in Eucarya, and the remaining 61 have homologs in both phylogenetic domains. Although bacterial and eukaryotic transcription have very few factors in common, each exclusively shares a significantly greater number with the Archaea, especially the Bacteria. This last fact contrasts with the obvious close relationship between the archaeal and eukaryotic transcription mechanisms per se, and in particular, basic transcription initiation. We interpret these results to mean that the archaeal transcription system has retained more ancestral characteristics than have the transcription mechanisms in either of the other two domains.
Collapse
Affiliation(s)
- N C Kyrpides
- Department of Microbiology, University of Illinois at Urbana-Champaign, B103 Chemistry and Life Sciences, MC 110, 407 South Goodwin Avenue, Urbana, IL 61801, USA
| | | |
Collapse
|
12
|
Ménard S, Tagliabue E, Colnaghi MI. The 67 kDa laminin receptor as a prognostic factor in human cancer. Breast Cancer Res Treat 1999; 52:137-45. [PMID: 10066078 DOI: 10.1023/a:1006171403765] [Citation(s) in RCA: 121] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Different receptors for adhesion molecules, including the monomeric 67 kDa laminin receptor (67LR), are responsible for the interactions between tumor cells and components of the extracellular matrix that play an important role in tumor invasion and metastasis. Clinical data clearly demonstrate the importance of the 67LR in the progression of a wide variety of tumors, including breast, lung, ovary, and prostate carcinomas and lymphomas. Indeed, data on more than 4000 cases of different tumors from different organs studied by immunohistochemistry are all concordant with a role for the 67LR in invasiveness, metastasis, and even tumor growth. This receptor molecule appears to be unusual since the corresponding full-length gene encodes a 37 kDa precursor protein which, after acylation, dimerizes to generate the mature 67 kDa form. The primary function of the membrane receptor is to stabilize the binding of laminin to cell surface integrins, acting as an integrin-accessory molecule, although homology of the gene encoding the receptor precursor with other genes suggests additional functions. Studies conducted to define the structure, expression, and function of this laminin receptor represent a step toward developing therapeutic strategies that target this molecule. In particular, therapeutic approaches that downregulate expression of the receptor on tumor cells might lead to decreased tumor aggressiveness.
Collapse
Affiliation(s)
- S Ménard
- Division of Experimental Oncology E, Istituto Nazionale per lo Studio e la Cura dei Tumori, Milan, Italy.
| | | | | |
Collapse
|
13
|
Abstract
Eight microbial genomes are compared in terms of protein structure. Specifically, yeast, H. influenzae, M. genitalium, M. jannaschii, Synechocystis, M. pneumoniae, H. pylori, and E. coli are compared in terms of patterns of fold usage-whether a given fold occurs in a particular organism. Of the approximately 340 soluble protein folds currently in the structure databank (PDB), 240 occur in at least one of the eight genomes, and 30 are shared amongst all eight. The shared folds are depleted in allhelical structure and enriched in mixed helix-sheet structure compared to the folds in the PDB. The top-10 most common of the shared 30 are enriched in superfolds, uniting many non-homologous sequence families, and are especially similar in overall architecture-eight having helices packed onto a central sheet. They are also very different from the common folds in the PBD, highlighting databank biases. Folds can be ranked in terms of expression as well as genome duplication. In yeast the top-10 most highly expressed folds are considerably different from the most highly duplicated folds. A tree can be constructed grouping genomes in terms of their shared folds. This has a remarkably similar topology to more conventional classifications, based on very different measures of relatedness. Finally, folds of membrane proteins can be analyzed through transmembrane-helix (TM) prediction. All the genomes appear to have similar usage patterns for these folds, with the occurrence of a particular fold falling off rapidly with increasing numbers of TM-elements, according to a "Zipf-like" law. This implies there are no marked preferences for proteins with particular numbers of TM-helices (e.g. 7-TM) in microbial genomes.
Collapse
Affiliation(s)
- M Gerstein
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA.
| |
Collapse
|
14
|
Gerstein M, Hegyi H. Comparing genomes in terms of protein structure: surveys of a finite parts list. FEMS Microbiol Rev 1998; 22:277-304. [PMID: 10357579 DOI: 10.1111/j.1574-6976.1998.tb00371.x] [Citation(s) in RCA: 67] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
We give an overview of the emerging field of structural genomics, describing how genomes can be compared in terms of protein structure. As the number of genes in a genome and the total number of protein folds are both quite limited, these comparisons take the form of surveys of a finite parts list, similar in respects to demographic censuses. Fold surveys have many similarities with other whole-genome characterizations, e.g., analyses of motifs or pathways. However, structure has a number of aspects that make it particularly suitable for comparing genomes, namely the way it allows for the precise definition of a basic protein module and the fact that it has a better defined relationship to sequence similarity than does protein function. An essential requirement for a structure survey is a library of folds, which groups the known structures into 'fold families.' This library can be built up automatically using a structure comparison program, and we described how important objective statistical measures are for assessing similarities within the library and between the library and genome sequences. After building the library, one can use it to count the number of folds in genomes, expressing the results in the form of Venn diagrams and 'top-10' statistics for shared and common folds. Depending on the counting methodology employed, these statistics can reflect different aspects of the genome, such as the amount of internal duplication or gene expression. Previous analyses have shown that the common folds shared between very different microorganisms, i.e., in different kingdoms, have a remarkably similar structure, being comprised of repeated strand-helix-strand super-secondary structure units. A major difficulty with this sort of 'fold-counting' is that only a small subset of the structures in a complete genome are currently known and this subset is prone to sampling bias. One way of overcoming biases is through structure prediction, which can be applied uniformly and comprehensively to a whole genome. Various investigators have, in fact, already applied many of the existing techniques for predicting secondary structure and transmembrane (TM) helices to the recently sequenced genomes. The results have been consistent: microbial genomes have similar fractions of strands and helices even though they have significantly different amino acid composition. The fraction of membrane proteins with a given number of TM helices falls off rapidly with more TM elements, approximately according to a Zipf law. This latter finding indicates that there is no preference for the highly studied 7-TM proteins in microbial genomes. Continuously updated tables and further information pertinent to this review are available over the web at http://bioinfo.mbb.yale.edu/genome.
Collapse
Affiliation(s)
- M Gerstein
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA.
| | | |
Collapse
|
15
|
Constantinesco F, Benachenhou N, Motorin Y, Grosjean H. The tRNA(guanine-26,N2-N2) methyltransferase (Trm1) from the hyperthermophilic archaeon Pyrococcus furiosus: cloning, sequencing of the gene and its expression in Escherichia coli. Nucleic Acids Res 1998; 26:3753-61. [PMID: 9685492 PMCID: PMC147764 DOI: 10.1093/nar/26.16.3753] [Citation(s) in RCA: 24] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The structural gene pfTRM1 (GenBank accession no. AF051912), encoding tRNA(guanine-26, N 2- N 2) methyltransferase (EC 2.1.1.32) of the strictly anaerobic hyperthermophilic archaeon Pyrococcus furiosus, has been identified by sequence similarity to the TRM1 gene of Saccharomyces cerevisiae (YDR120c). The pfTRM1 gene in a 3.0 kb restriction DNA fragment of P.furiosus genomic DNA has been cloned by library screening using a PCR probe to the 5'-part of the corresponding ORF. Sequence analysis revealed an entire ORF of 1143 bp encoding a polypeptide of 381 residues (calculated molecular mass 43.3 kDa). The deduced amino acid sequence of this newly identified gene shares significant similarity with the TRM1- like genes of three other archaea (Methanococcus jannaschii, Methanobacterium thermoautotrophicum and Archaeoglobus fulgidus), one eukaryon (Caenorhabditis elegans) and one hyperthermophilic eubacterium (Aquifex aeolicus). Two short consensus motifs for S-adenosyl-l-methionine binding are detected in the sequence of pfTrm1p. Cloning of the P.furiosus TRM1 gene in an Escherichia coli expression vector allowed expression of the recombinant protein (pfTrm1p) with an apparent molecular mass of 42 kDa. A protein extract from the transformed E.coli cells shows enzymatic activity for the quantitative formation of N 2, N 2-dimethylguanosine at position 26 in a transcript of yeast tRNAPhe used as substrate. The recombinant enzyme was also shown to modify bulk E.coli tRNAs in vivo.
Collapse
MESH Headings
- Amino Acid Sequence
- Base Sequence
- Binding Sites/genetics
- Cloning, Molecular
- DNA Primers/genetics
- DNA, Archaeal/genetics
- Escherichia coli/genetics
- Gene Expression
- Genes
- Genes, Archaeal
- Guanine/chemistry
- Molecular Sequence Data
- Nucleic Acid Conformation
- Point Mutation
- Pyrococcus/enzymology
- Pyrococcus/genetics
- RNA, Transfer, Phe/chemistry
- RNA, Transfer, Phe/genetics
- RNA, Transfer, Phe/metabolism
- Recombinant Proteins/genetics
- Recombinant Proteins/metabolism
- Sequence Homology, Amino Acid
- Substrate Specificity
- tRNA Methyltransferases/genetics
- tRNA Methyltransferases/metabolism
Collapse
Affiliation(s)
- F Constantinesco
- Laboratoire d'Enzymologie et Biochimie Structurales, Centre National de la Recherche Scientifique,1 Avenue de la Terrasse, Batiment 34, F-91198 Gif-sur-Yvette, France
| | | | | | | |
Collapse
|
16
|
Prangishvili D, Klenk HP, Jakobs G, Schmiechen A, Hanselmann C, Holz I, Zillig W. Biochemical and phylogenetic characterization of the dUTPase from the archaeal virus SIRV. J Biol Chem 1998; 273:6024-9. [PMID: 9497317 DOI: 10.1074/jbc.273.11.6024] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The derived amino acid sequence from a 474-base pair open reading frame in the genome of the Sulfolobus islandicus rod-shaped virus SIRV shows striking similarity to bacterial dCTP deaminases and to dUTPases from eukaryotes, bacteria, Poxviridae, and Retroviridae. The putative gene was expressed in Escherichia coli, and dUTPase activity of the recombinant enzyme was demonstrated by hydrolysis of dUTP to dUMP. Deamination of dCTP by the enzyme was not detected. Phylogenetic analysis based on amino acid sequences of the characterized enzyme and its homologues showed that the dUTPase-encoding dut genes and the dCTP deaminase-encoding dcd genes constitute a paralogous gene family. This report is the first identification and functional characterization of an archaeal dUTPase and the first phylogeny derived for the dcd-dut gene family.
Collapse
Affiliation(s)
- D Prangishvili
- Max Planck Institute for Biochemistry, Am Klopferspitz 18a, 82152 Martinsried, Federal Republic of Germany.
| | | | | | | | | | | | | |
Collapse
|
17
|
Gerstein M. A structural census of genomes: comparing bacterial, eukaryotic, and archaeal genomes in terms of protein structure. J Mol Biol 1997; 274:562-76. [PMID: 9417935 DOI: 10.1006/jmbi.1997.1412] [Citation(s) in RCA: 124] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Representative genomes from each of the three kingdoms of life are compared in terms of protein structure, in particular, those of Haemophilus influenzae (a bacteria), Methanococcus jannaschii (an archaeon), and yeast (a eukaryote). The comparison is in the form of a census (or comprehensive accounting) of the relative occurrence of secondary and tertiary structures in the genomes, which particular emphasis on patterns of supersecondary structure. Comparison of secondary structure shows that the three genomes have nearly the same overall secondary-structure content, although they differ markedly in amino acid composition. Comparison of super-secondary structure, using a novel "frequent-words" approach, shows that yeast has a preponderance of consecutive strands (e.g. beta-beta-beta patterns), Haemophilus, consecutive helices (alpha-alpha-alpha), and Methanococcus, alternating helix-strand structures (beta-alpha-beta). Yeast also has significantly more helical membrane proteins than the other two genomes, with most of the differences concentrated in proteins containing two transmembrane segments. Comparison of tertiary structure (by sequence matching and domain-level clustering) highlights the substantial duplication in each genome (approximately 30% to 50%), with the degree of duplication following similar patterns in all three. Many sequence families are shared among the genomes, with the degree of overlap between any two genomes being roughly similar. In total, the three genomes contain 148 of the approximately 300 known protein folds. Forty-five of these 148 that are present in all three genomes are especially enriched in mixed super-secondary structures (alpha/beta). Moreover, the five most common of these 45 (the "top-5") have a remarkably similar super-secondary structure architecture, containing a central sheet of parallel strands with helices packed onto at least one face and beta-alpha-beta connections between adjacent strands. These most basic molecular parts, which, presumably, were present in the last common ancestor to the three Kingdoms, include the TIM-barrel, Rossmann, flavodoxin, thiamin-binding, and P-loop-hydrolase folds.
Collapse
Affiliation(s)
- M Gerstein
- Department of Molecular Biophysics & Biochemistry, Yale University, New Haven, CT 06520, USA
| |
Collapse
|
18
|
Rieger R, Edenhofer F, Lasmézas CI, Weiss S. The human 37-kDa laminin receptor precursor interacts with the prion protein in eukaryotic cells. Nat Med 1997; 3:1383-8. [PMID: 9396609 DOI: 10.1038/nm1297-1383] [Citation(s) in RCA: 327] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Prions are thought to consist of infectious proteins that cause transmissible spongiform encephalopathies. According to overwhelming evidence, the pathogenic prion protein PrPSc converts its host encoded isoform PrPC into insoluble aggregates of PrPSc, concomitant with pathological modifications (for review, see refs. 1-3). Although the physiological role of PrPC is poorly understood, studies with PrP knockout mice demonstrated that PrPC is required for the development of prion diseases. Using the yeast two-hybrid technology in Saccharomyces cerevisiae, we identified the 37-kDa laminin receptor precursor (LRP) as interacting with the cellular prion protein PrPC. Mapping analysis of the LRP-PrP interaction site in S. cerevisiae revealed that PrP and laminin share the same binding domain (amino acids 161 to 180) on LRP. The LRP-PrP interaction was confirmed in vivo in insect (Sf9) and mammalian cells (COS-7). The LRP level was increased in scrapie-infected murine N2a cells and in brain and spleen of scrapie-infected mice. In contrast, the LRP concentration was not significantly altered in these organs from mice infected with the bovine spongiform encephalopathic agent (BSE), which have a lower PrPSc accumulation. LRP levels, however, were dramatically increased in brain and pancreas, slightly increased in the spleen and not altered in the liver of crapie-infected hamsters. These data show that enhanced LRP concentrations are correlated with PrPSc accumulation in organs from mice and hamsters. The laminin receptor precursor, which is highly conserved among mammals and is located on the cell surface, may act as a receptor or co-receptor for the prion protein on mammalian cells.
Collapse
Affiliation(s)
- R Rieger
- Laboratorium für Molekulare Biologie-Genzentrum-Institut für Biochemie der LMU München, Munich, Germany
| | | | | | | |
Collapse
|
19
|
|
20
|
Rondon MR, Trzebiatowski JR, Escalante-Semerena JC. Biochemistry and molecular genetics of cobalamin biosynthesis. PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR BIOLOGY 1997; 56:347-84. [PMID: 9187059 DOI: 10.1016/s0079-6603(08)61010-7] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Affiliation(s)
- M R Rondon
- Department of Bacteriology, University of Wisconsin-Madison 53706-1567, USA
| | | | | |
Collapse
|
21
|
Rahman RNZA, Fujiwara S, Imanaka T. Gene cloning and sequence analysis of cobyric acid synthase and cobalamin (5′-phosphate) synthase from hyperthermophilic archaeon Pyrococcus sp. KOD1. ACTA ACUST UNITED AC 1997. [DOI: 10.1016/s0922-338x(97)87336-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
|
22
|
Koonin EV, Mushegian AR. Complete genome sequences of cellular life forms: glimpses of theoretical evolutionary genomics. Curr Opin Genet Dev 1996; 6:757-62. [PMID: 8994848 DOI: 10.1016/s0959-437x(96)80032-3] [Citation(s) in RCA: 49] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
The availability of complete genome sequences of cellular life forms creates the opportunity to explore the functional content of the genomes and evolutionary relationships between them at a new qualitative level. With the advent of these sequences, the construction of a minimal gene set sufficient for sustaining cellular life and reconstruction of the genome of the last common ancestor of bacteria, eukaryotes, and archaea become realistic, albeit challenging, research projects. A version of the minimal gene set for modern-type cellular life derived by comparative analysis of two bacterial genomes, those of Haemophilus influenzae and Mycoplasma genitalium, consists of approximately 250 genes. A comparison of the protein sequences encoded in these genes with those of the proteins encoded in the complete yeast genome suggests that the last common ancestor of all extant life might have had an RNA genome.
Collapse
Affiliation(s)
- E V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA.
| | | |
Collapse
|
23
|
Clausse N, Jackers P, Jarès P, Joris B, Sobel ME, Castronovo V. Identification of the active gene coding for the metastasis-associated 37LRP/p40 multifunctional protein. DNA Cell Biol 1996; 15:1009-23. [PMID: 8985115 DOI: 10.1089/dna.1996.15.1009] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
A 37LRP/p40 polypeptide is of major interest because it is consistently up-regulated in cancer cells in correlation with their invasive and metastatic phenotype. Furthermore, this polypeptide presents intriguing multifunctional properties because it has been characterized as the precursor of the metastasis-associated 67-kD laminin receptor (67LR) and as a cytoplasmic ribosomal-associated protein. The isolation of the 37LRP/p40 gene is a prerequisite for identifying the molecular mechanisms responsible for the constant up-regulation of the 67LR expression in cancer cells. To date, the active 37LRP/p40 gene has never been identified in any species due to the existence of multiple pseudogenes in most vertebrates genomes. In this study, we report for the first time the gene structure and potential regulatory sequences of the 37 LRP/p40 gene. The chicken genome was selected to undergo this characterization because it is the only known vertebrate that bears a single 37 LRP/p40 gene copy. The 37 LRP/p40 active gene is composed of 7 exons and 6 introns and bears features characteristic of a ribosomal protein gene. It does not bear a classical TATA box and it exhibits several transcription initiation sites as demonstrated by RNase protection assay and primer extension. Analysis of potential regulatory regions suggests that gene expression is driven not only by the 5' genomic region but also by the 5' untranslated and intron 1 sequences. On the basis of gene structure and extensive protein evolutionary study, we found that the carboxyterminal domain of the protein is a conserved lock-and-key structure/function domain that could be involved in the biosynthesis of the higher-molecular-weight 67-kD laminin receptor in vertebrates, whereas the central core of the protein would be responsible for the ribosome associated function. The first identification of the active 37LRP/p40 gene presented in this study is a critical step toward the isolation of the corresponding human gene and the understanding of the molecular mechanisms involved in the up-regulation of its expression during tumor invasion and metastasis.
Collapse
Affiliation(s)
- N Clausse
- Metastasis Research Laboratory, University of Liège, Belgium
| | | | | | | | | | | |
Collapse
|
24
|
Fujiwara S, Okuyama S, Imanaka T. The world of archaea: genome analysis, evolution and thermostable enzymes. Gene 1996; 179:165-70. [PMID: 8955643 DOI: 10.1016/s0378-1119(96)00428-3] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Pyrococcus sp. KOD1 is a newly isolated hyperthermophilic archaeon from a solfatara at a wharf on Kodakara Island, Kagoshima, Japan. A physical map of the KOD1 chromosome was constructed using pulsed-field gel electrophoresis of restriction fragments generated by AscI, PacI and PmeI. The order of the AscI fragments was deduced from Southern hybridization using the AscI, PmeI and PacI fragments as a probe. The derived physical map indicates that KOD1 possesses a circular-form genome and its size was estimated to be 2036 kb. Several cloned genes were hybridized to restriction fragments to locate their positions on the physical map. Some genes involved in the central dogma were located on the restricted segment of the genome. Novel characteristics of KOD1 enzymes are also introduced in this article.
Collapse
Affiliation(s)
- S Fujiwara
- Department of Biotechnology, Graduate School of Engineering, Osaka University, Japan
| | | | | |
Collapse
|
25
|
Kyrpides NC, Woese CR, Ouzounis CA. KOW: a novel motif linking a bacterial transcription factor with ribosomal proteins. Trends Biochem Sci 1996; 21:425-6. [PMID: 8987397 DOI: 10.1016/s0968-0004(96)30036-4] [Citation(s) in RCA: 109] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Affiliation(s)
- N C Kyrpides
- Department of Microbiology, University of Illnois, Urbana Champaign 61801, USA
| | | | | |
Collapse
|
26
|
Abstract
The phylogenetic distribution of divergently related protein families into the three domains of life (archaea, bacteria and eukaryotes) can signify the presence or absence of entire cellular processes in these domains and their ancestors. We can thus study the emergence of the major transitions during cellular evolution, and resolve some of the controversies surrounding the evolutionary status of archaea and the origins of the eukaryotic cell. In view of the ongoing projects that sequence the complete genomes of several Archaea, this work forms a testable prediction when the genome sequences become available. Using the presence of the protein families as taxonomic traits, and linking them to biochemical pathways, we are able to reason about the presence of the corresponding cellular processes in the last universal ancestor of contemporary cells. The analysis shows that metabolism was already a complex network of reactions which included amino acid, nucleotide, fatty acid, sugar and coenzyme metabolism. In addition, genetic processes such as translation are conserved and close to the original form. However, other processes such as DNA replication and repair or transcription are exceptional and seem to be associated with the structural changes that drove eukaryotes and bacteria away from their common ancestor. There are two major hypotheses in the present work: first, that archaea are probably closer to the last universal ancestor than any other extant life form, and second, that the major cellular processes were in place before the major splitting. The last universal ancestor had metabolism and translation very similar to the contemporary ones, while having an operonic genome organization and archaean-like transcription. Evidently, all cells today contain remnants of the primordial genome of the last universal ancestor.
Collapse
Affiliation(s)
- C Ouzounis
- AI Center, SRI International, Menlo Park, CA 94025-3493, USA.
| | | |
Collapse
|
27
|
Charlebois RL, Gaasterland T, Ragan MA, Doolittle WF, Sensen CW. The Sulfolobus solfataricus P2 genome project. FEBS Lett 1996; 389:88-91. [PMID: 8682213 DOI: 10.1016/0014-5793(96)00525-x] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Over 800 kbp of the 3-Mbp genome of Sulfolobus solfataricus have been sequenced to date. Our approach is to sequence subclones of mapped cosmids, followed by sequencing directly on cosmid templates with custom primers. Using a prototype automated system for genome-scale analysis, known as MAGPIE, along with other tools, we have discovered one open reading frame of at least 100 amino acids per kbp of sequence, and have been able to associate 50% of these with known genes through database searches. An examination of completely sequenced cosmids suggests a clustering of genes by function in the S. solfataricus genome.
Collapse
Affiliation(s)
- R L Charlebois
- Canadian Institute for Advanced Research, University of Ottawa, ON, Canada.
| | | | | | | | | |
Collapse
|
28
|
Koonin EV. Pseudouridine synthases: four families of enzymes containing a putative uridine-binding motif also conserved in dUTPases and dCTP deaminases. Nucleic Acids Res 1996; 24:2411-5. [PMID: 8710514 PMCID: PMC145960 DOI: 10.1093/nar/24.12.2411] [Citation(s) in RCA: 171] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Using a combination of several methods for protein sequence comparison and motif analysis, it is shown that the four recently described pseudouridine syntheses with different specificities belong to four distinct families. Three of these families share two conserved motifs that are likely to be directly involved in catalysis. One of these motifs is detected also in two other families of enzymes that specifically bind uridine, namely deoxycitidine triphosphate deaminases and deoxyuridine triphosphatases. It is proposed that this motif is an essential part of the uridine-binding site. Two of the pseudouridine syntheses, one of which modifies the anticodon arm of tRNAs and the other is predicted to modify a portion of the large ribosomal subunit RNA belonging to the peptidyltransferase center, are encoded in all extensively sequenced genomes, including the 'minimal' genome of Mycoplasma genitalium. These particular RNA modifications and the respective enzymes are likely to be essential for the functioning of any cell.
Collapse
Affiliation(s)
- E V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
29
|
Demianova M, Formosa TG, Ellis SR. Yeast proteins related to the p40/laminin receptor precursor are essential components of the 40 S ribosomal subunit. J Biol Chem 1996; 271:11383-91. [PMID: 8626693 DOI: 10.1074/jbc.271.19.11383] [Citation(s) in RCA: 50] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
We report here the isolation of two genes from the yeast, Saccharomyces cerevisiae, that encode proteins closely related to mammalian p40/laminin receptor precursors (LRPs). The yeast genes, designated YST1 and YST2, encode proteins with over 95% amino acid sequence identity with one another and over 60% identity with the human p40/laminin receptor precursor. The Yst/p40/37-LRP proteins are also more distantly related to the S2 family of ribosomal proteins. Analysis of the distribution of Yst1 tagged with the c-myc epitope revealed that the Yst proteins are components of the 40 S ribosomal subunit. Disruption of either YST1 or YST2 causes a significant reduction in growth rate, while disruption of both genes is lethal. Compared to wild type, polysome profiles in strains lacking either YST1 or YST2 show a pronounced shift from larger to smaller polysomes. This shift is accompanied by a substantial increase in free 60 S subunits and reduced levels of 40 S subunits. We conclude that the Yst proteins are required for translation and contribute to the assembly and/or stability of the 40 S ribosomal subunit.
Collapse
Affiliation(s)
- M Demianova
- Department of Biochemistry, University of Louisville, Kentucky 40292, USA
| | | | | |
Collapse
|
30
|
Ouzounis CA, Kyrpides NC. Parallel origins of the nucleosome core and eukaryotic transcription from Archaea. J Mol Evol 1996; 42:234-9. [PMID: 8919875 DOI: 10.1007/bf02198849] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Computational sequence analysis of 10 available archaean histone-like proteins has shown that this family is not only divergently related to the eukaryotic core histones H2A/B, H3, and H4, but also to the central domain of subunits A and C of the CCAAT-binding factor (CBF), a transcription factor associated with eukaryotic promoters. Despite the low sequence identity, it is unambiguously shown that the core histone fold shares a common evolutionary history. Archaean histones and the two CBF families show a remarkable variability in contrast to eukaryotic core histones. Conserved residues shared between families are identified, possibly being responsible for the functional versatility of the core histone fold. The H4 subfamily is most similar to archaean proteins and may be the progenitor of the other core histones in eukaryotes. While it is not clear whether archaean histones are more actively involved in transcription regulation, the present observations link two processes, nucleosomal packing and transcription in a unique way. Both these processes, evidently hybrid in Archaea, have originated before the ermergence of the eukaryotic cell.
Collapse
|
31
|
Baross JA, Holden JF. Overview of hyperthermophiles and their heat-shock proteins. ADVANCES IN PROTEIN CHEMISTRY 1996; 48:1-34. [PMID: 8791623 DOI: 10.1016/s0065-3233(08)60360-5] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Affiliation(s)
- J A Baross
- School of Oceanography, University of Washington, Seattle 98195, USA
| | | |
Collapse
|