1
|
Nigam D, Muthukrishnan E, Flores-López LF, Nigam M, Wamaitha MJ. Comparative Genome Analysis of Old World and New World TYLCV Reveals a Biasness toward Highly Variable Amino Acids in Coat Protein. PLANTS (BASEL, SWITZERLAND) 2023; 12:1995. [PMID: 37653912 PMCID: PMC10223811 DOI: 10.3390/plants12101995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 05/01/2023] [Accepted: 05/08/2023] [Indexed: 09/02/2023]
Abstract
Begomoviruses, belonging to the family Geminiviridae and the genus Begomovirus, are DNA viruses that are transmitted by whitefly Bemisia tabaci (Gennadius) in a circulative persistent manner. They can easily adapt to new hosts and environments due to their wide host range and global distribution. However, the factors responsible for their adaptability and coevolutionary forces are yet to be explored. Among BGVs, TYLCV exhibits the broadest range of hosts. In this study, we have identified variable and coevolving amino acid sites in the proteins of Tomato yellow leaf curl virus (TYLCV) isolates from Old World (African, Indian, Japanese, and Oceania) and New World (Central and Southern America). We focused on mutations in the coat protein (CP), as it is highly variable and interacts with both vectors and host plants. Our observations indicate that some mutations were accumulating in Old World TYLCV isolates due to positive selection, with the S149N mutation being of particular interest. This mutation is associated with TYLCV isolates that have spread in Europe and Asia and is dominant in 78% of TYLCV isolates. On the other hand, the S149T mutation is restricted to isolates from Saudi Arabia. We further explored the implications of these amino acid changes through structural modeling. The results presented in this study suggest that certain hypervariable regions in the genome of TYLCV are conserved and may be important for adapting to different host environments. These regions could contribute to the mutational robustness of the virus, allowing it to persist in different host populations.
Collapse
Affiliation(s)
- Deepti Nigam
- Institute for Genomics of Crop Abiotic Stress Tolerance, Department of Plant and Soil Science, Texas Tech University (TTU), Lubbock, TX 79409, USA
- Plant Pathology and Plant-Microbe Biology Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14850, USA
| | | | - Luis Fernando Flores-López
- Departamento de Biotecnología y Bioquímica, Centro de Investigacióny de Estudios Avanzados de IPN (CINVESTAV) Unidad Irapuato, Irapuato 368224, Mexico
| | - Manisha Nigam
- Department of Biochemistry, Hemvati Nandan Bahuguna Garhwal University, Srinagar 246174, Uttarakhand, India
| | - Mwathi Jane Wamaitha
- Kenya Agricultural and Livestock Research Organization (KALRO), Nairobi P.O. Box 14733-00800, Kenya
| |
Collapse
|
2
|
Hartman EC, Jakobson CM, Favor AH, Lobba MJ, Álvarez-Benedicto E, Francis MB, Tullman-Ercek D. Quantitative characterization of all single amino acid variants of a viral capsid-based drug delivery vehicle. Nat Commun 2018; 9:1385. [PMID: 29643335 PMCID: PMC5895741 DOI: 10.1038/s41467-018-03783-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2017] [Accepted: 03/13/2018] [Indexed: 12/22/2022] Open
Abstract
Self-assembling proteins are critical to biological systems and industrial technologies, but predicting how mutations affect self-assembly remains a significant challenge. Here, we report a technique, termed SyMAPS (Systematic Mutation and Assembled Particle Selection), that can be used to characterize the assembly competency of all single amino acid variants of a self-assembling viral structural protein. SyMAPS studies on the MS2 bacteriophage coat protein revealed a high-resolution fitness landscape that challenges some conventional assumptions of protein engineering. An additional round of selection identified a previously unknown variant (CP[T71H]) that is stable at neutral pH but less tolerant to acidic conditions than the wild-type coat protein. The capsids formed by this variant could be more amenable to disassembly in late endosomes or early lysosomes-a feature that is advantageous for delivery applications. In addition to providing a mutability blueprint for virus-like particles, SyMAPS can be readily applied to other self-assembling proteins.
Collapse
Affiliation(s)
- Emily C Hartman
- Department of Chemistry, University of California, Berkeley, CA, 94720-1460, USA
| | - Christopher M Jakobson
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, CA, 94720-1460, USA
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Andrew H Favor
- Department of Chemistry, University of California, Berkeley, CA, 94720-1460, USA
| | - Marco J Lobba
- Department of Chemistry, University of California, Berkeley, CA, 94720-1460, USA
| | | | - Matthew B Francis
- Department of Chemistry, University of California, Berkeley, CA, 94720-1460, USA.
- Materials Sciences Division, Lawrence Berkeley National Laboratories, Berkeley, CA, 94720-1460, USA.
| | - Danielle Tullman-Ercek
- Department of Chemical and Biological Engineering, Northwestern University, 2145 Sheridan Road, Technological Institute E136, Evanston, IL, 60208-3120, USA.
| |
Collapse
|
3
|
Selection maintaining protein stability at equilibrium. J Theor Biol 2016; 391:21-34. [DOI: 10.1016/j.jtbi.2015.12.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2015] [Revised: 11/29/2015] [Accepted: 12/01/2015] [Indexed: 11/24/2022]
|
4
|
Aoto S, Yura K. Case study on the evolution of hetero-oligomer interfaces based on the differences in paralogous proteins. Biophys Physicobiol 2015; 12:103-16. [PMID: 27493859 PMCID: PMC4736837 DOI: 10.2142/biophysico.12.0_103] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Accepted: 11/09/2015] [Indexed: 12/31/2022] Open
Abstract
We addressed the evolutionary trace of hetero-oligomer interfaces by comparing the structures of paralogous proteins; one of them is a monomer or homo-oligomer and the other is a hetero-oligomer. We found different trends in amino acid conservation pattern and hydrophobicity between homo-oligomer and hetero-oligomer. The degree of amino acid conservation in the interface of homo-oligomer has no obvious difference from that in the surface, whereas the degree of conservation is much higher in the interface of hetero-oligomer. The interface of homo-oligomer has a few very conserved residue positions, whereas the residue conservation in the interface of hetero-oligomer tends to be higher. In addition, the interface of hetero-oligomer has a tendency of being more hydrophobic compared with the one in homo-oligomer. We conjecture that these differences are related to the inherent symmetry in homo-oligomers that cannot exist in hetero-oligomers. Paucity of the structural data precludes statistical tests of these tendencies, yet the trend can be applied to the prediction of the interface of hetero-oligomer. We obtained putative interfaces of the subunits in CPSF (cleavage and polyadenylation specificity factor), one of the human pre-mRNA 3′-processing complexes. The locations of predicted interface residues were consistent with the known experimental data.
Collapse
Affiliation(s)
- Saki Aoto
- Graduate School of Humanities and Sciences, Ochanomizu University, Bunkyo, Tokyo 112-8610, Japan
| | - Kei Yura
- Graduate School of Humanities and Sciences, Ochanomizu University, Bunkyo, Tokyo 112-8610, Japan; Centre for Informational Biology, Ochanomizu University, Bunkyo, Tokyo 112-8610, Japan; National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
| |
Collapse
|
5
|
Hijikata A, Yura K, Ohara O, Go M. Structural and functional analyses of Barth syndrome-causing mutations and alternative splicing in the tafazzin acyltransferase domain. Meta Gene 2015; 4:92-106. [PMID: 25941633 PMCID: PMC4412953 DOI: 10.1016/j.mgene.2015.04.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Accepted: 04/03/2015] [Indexed: 10/26/2022] Open
Abstract
Tafazzin is a mitochondrial phospholipid transacylase, and its mutations cause Barth syndrome (BTHS). Human tafazzin gene produces four distinct alternatively spliced transcripts. To understand the molecular mechanisms of tafazzin deficiency, we performed an atomic resolution analysis of the influence of the BTHS mutations and of alternative splicing on the structure and function of tafazzin. From the three-dimensional (3D) homology modeling of tafazzin, we identified candidate amino acid residues that contribute to cardiolipin binding and to mitochondrial membrane associations that facilitate acyl-transfer reactions. Primate specific exon 5, which is alternatively spliced, is predicted to correspond to an intrinsically unstructured region in the protein. We proposed that this region should change the substrate-binding affinity and/or contribute to primate-specific molecular interactions. Exon 7, another alternatively spliced exon, encodes a region forming a part of the putative substrate-binding cleft, suggesting that the gene products lacking exon 7 will lose their substrate-binding ability. We demonstrate a clear localization of the BTHS mutations at residues responsible for membrane association, substrate binding, and the conformational stability of tafazzin. These findings provide new insights into the function of defective tafazzin and the pathogenesis of BTHS at the level of protein 3D structure and the evolution of alternatively spliced exons in primates.
Collapse
Affiliation(s)
- Atsushi Hijikata
- Nagahama Institute of Bio-Science and Technology, 1266 Tamura-cho, Nagahama, Shiga 526-0829, Japan
| | - Kei Yura
- Graduate School of Humanities and Sciences, Ochanomizu University, 2-1-1 Otsuka, Bunkyo-ku, Tokyo 112-8610, Japan
| | - Osamu Ohara
- Laboratory for Integrative Genomics, RIKEN Center for Integrative Medical Sciences, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan ; Department of Technology Development, Kazusa DNA Research Institute, 2-6-7 Kazusa-Kamatari, Kisarazu, Chiba 292-0818, Japan
| | - Mitiko Go
- Nagahama Institute of Bio-Science and Technology, 1266 Tamura-cho, Nagahama, Shiga 526-0829, Japan ; Research Organization of Information and Systems, 4-3-13, Toranomon, Minatoku, Tokyo 105-0001, Japan
| |
Collapse
|
6
|
Oshiro S, Honda S. Imparting albumin-binding affinity to a human protein by mimicking the contact surface of a bacterial binding protein. ACS Chem Biol 2014; 9:1052-60. [PMID: 24533528 DOI: 10.1021/cb400946m] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Attachment of a bacterial albumin-binding protein module is an attractive strategy for extending the plasma residence time of protein therapeutics. However, a protein fused with such a bacterial module could induce unfavorable immune reactions. To address this, we designed an alternative binding protein by imparting albumin-binding affinity to a human protein using molecular surface grafting. The result was a series of human-derived 6 helix-bundle proteins, one of which specifically binds to human serum albumin (HSA) with adequate affinity (KD = 100 nM). The proteins were designed by transferring key binding residues of a bacterial albumin-binding module, Finegoldia magna protein G-related albumin-binding domain (GA) module, onto the human protein scaffold. Despite 13-15 mutations, the designed proteins maintain the original secondary structure by virtue of careful grafting based on structural informatics. Competitive binding assays and thermodynamic analyses of the best binders show that the binding mode resembles that of the GA module, suggesting that the contacting surface of the GA module is mimicked well on the designed protein. These results indicate that the designed protein may act as an alternative low-risk binding module to HSA. Furthermore, molecular surface grafting in combination with structural informatics is an effective approach for avoiding deleterious mutations on a target protein and for imparting the binding function of one protein onto another.
Collapse
Affiliation(s)
- Satoshi Oshiro
- Department
of Medical Genome Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562, Japan
| | - Shinya Honda
- Department
of Medical Genome Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562, Japan
- Biomedical
Research Institute, National Institute of Advanced Industrial Science and Technology, Central 6, 1-1-1 Higashi, Tsukuba, Ibaraki 305-8566, Japan
| |
Collapse
|
7
|
Miyazawa S. Superiority of a mechanistic codon substitution model even for protein sequences in phylogenetic analysis. BMC Evol Biol 2013; 13:257. [PMID: 24256155 PMCID: PMC4225520 DOI: 10.1186/1471-2148-13-257] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2013] [Accepted: 11/14/2013] [Indexed: 11/25/2022] Open
Abstract
Background Nucleotide and amino acid substitution tendencies are characteristic of each species, organelle, and protein family. Hence, various empirical amino acid substitution rate matrices have needed to be estimated for phylogenetic analysis: JTT, WAG, and LG for nuclear proteins, mtREV for mitochondrial proteins, cpREV10 and cpREV64 for chloroplast-encoded proteins, and FLU for influenza proteins. On the other hand, in a mechanistic codon substitution model, in which each codon substitution rate is proportional to the product of a codon mutation rate and the ratio of fixation depending on the type of amino acid replacement, mutation rates and the strength of selective constraint on amino acids can be tailored to each protein family with additional 11 parameters. As a result, in the evolutionary analysis of codon sequences it outperforms codon substitution models equivalent to empirical amino acid substitution matrices. Is it superior even for amino acid sequences, among which synonymous substitutions cannot be identified? Results Nucleotide mutations are assumed to occur independently of codon positions but multiple nucleotide changes in infinitesimal time are allowed. Selective constraints on the respective types of amino acid replacements are tailored to each gene with a linear function of a given estimate of selective constraints, which were estimated by maximizing the likelihood of an empirical amino acid or codon substitution frequency matrix, each of JTT, WAG, LG, and KHG. It is shown that the mechanistic codon substitution model with the assumption of equal codon usage yields better values of Akaike and Bayesian information criteria for all three phylogenetic trees of mitochondrial, chloroplast, and influenza-A hemagglutinin proteins than the empirical amino acid substitution models with mtREV, cpREV64, and FLU, which were designed specifically for those protein families, respectively. The variation of selective constraint across sites fits the datasets significantly better than variable codon mutation rates, confirming that substitution rate variations across sites detected by amino acid substitution models are caused primarily by the variation of selective constraint against amino acid substitutions rather than the variation of codon mutation rate. Conclusions The mechanistic codon substitution model is superior to amino acid substitution models even in the evolutionary analysis of protein sequences.
Collapse
|
8
|
Terui H, Akagi K, Kawame H, Yura K. CoDP: predicting the impact of unclassified genetic variants in MSH6 by the combination of different properties of the protein. J Biomed Sci 2013; 20:25. [PMID: 23621914 PMCID: PMC3651391 DOI: 10.1186/1423-0127-20-25] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2012] [Accepted: 04/15/2013] [Indexed: 02/06/2023] Open
Abstract
Background Lynch syndrome is a hereditary cancer predisposition syndrome caused by a mutation in one of the DNA mismatch repair (MMR) genes. About 24% of the mutations identified in Lynch syndrome are missense substitutions and the frequency of missense variants in MSH6 is the highest amongst these MMR genes. Because of this high frequency, the genetic testing was not effectively used in MSH6 so far. We, therefore, developed CoDP (Combination of the Different Properties), a bioinformatics tool to predict the impact of missense variants in MSH6. Methods We integrated the prediction results of three methods, namely MAPP, PolyPhen-2 and SIFT. Two other structural properties, namely solvent accessibility and the change in the number of heavy atoms of amino acids in the MSH6 protein, were further combined explicitly. MSH6 germline missense variants classified by their associated clinical and molecular data were used to fit the parameters for the logistic regression model and to assess the prediction. The performance of CoDP was compared with those of other conventional tools, namely MAPP, SIFT, PolyPhen-2 and PON-MMR. Results A total of 294 germline missense variants were collected from the variant databases and literature. Of them, 34 variants were available for the parameter training and the prediction performance test. We integrated the prediction results of MAPP, PolyPhen-2 and SIFT, and two other structural properties, namely solvent accessibility and the change in the number of heavy atoms of amino acids in the MSH6 protein, were further combined explicitly. Variants data classified by their associated clinical and molecular data were used to fit the parameters for the logistic regression model and to assess the prediction. The values of the positive predictive value (PPV), the negative predictive value (NPV), sensitivity, specificity and accuracy of the tools were compared on the whole data set. PPV of CoDP was 93.3% (14/15), NPV was 94.7% (18/19), specificity was 94.7% (18/19), sensitivity was 93.3% (14/15) and accuracy was 94.1% (32/34). Area under the curve of CoDP was 0.954, that of MAPP for MSH6 was 0.919, of SIFT was 0.864 and of PolyPhen-2 HumVar was 0.819. The power to distinguish between pathogenic and non-pathogenic variants of these methods was tested by Wilcoxon rank sum test (p < 8.9 × 10-6 for CoDP, p < 3.3 × 10-5 for MAPP, p < 3.1 × 10-4 for SIFT and p < 1.2 × 10-3 for PolyPhen-2 HumVar), and CoDP was shown to outperform other conventional methods. Conclusion In this paper, we provide a human curated data set for MSH6 missense variants, and CoDP, the prediction tool, which achieved better accuracy for predicting the impact of missense variants in MSH6 than any other known tools. CoDP is available at http://cib.cf.ocha.ac.jp/CoDP/.
Collapse
Affiliation(s)
- Hiroko Terui
- The Graduate School of Humanities and Sciences, Ochanomizu University, 2-1-1 Otsuka, Bunkyo, Tokyo 112-8610, Japan
| | | | | | | |
Collapse
|
9
|
Ma Y, Lu M, Li JY, Qin Y, Gong XG. Evolutive and structural characterization of Nostoc commune iron-superoxide dismutase that is fit for modification. GENETICS AND MOLECULAR RESEARCH 2012; 11:3607-17. [PMID: 23096686 DOI: 10.4238/2012.october.4.8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Superoxide dismutase (SOD) has extensive clinical applications for protecting organisms from toxic oxidation. In this study, the integrated iron-superoxide dismutase gene (fe-sod) coding sequence of Nostoc commune stain CHEN was cloned from genomic DNA and compared to sods from other reported algae. These analyses of immunology and phylogenetics indicated that this Fe-SOD is considerably homologous with SODs from lower prokaryotes (Fe-SOD or Mn-SOD) but not those from higher animals (Cu/Zn-SOD). In addition, the N. commune Fe-SOD shows 67 to 93% protein sequence identity to 10 other algal Fe-SODs (or Mn-SODs) and 69 to 93% gene sequence identity. Rare nonsynonymous substitutions imply that algal SODs are being subjected to strong natural selection. Interestingly, the N. commune Fe-SOD enzyme molecule has a compact active center that is highly conserved (38.1% of residues are absolutely conserved), and 2 loose ends localized outside the molecule and inclined to mutate (only 11.5% of residues are absolutely conserved). Based on associative analyses of evolution, structure, and function, this special phenomenon is attributed to function-dependent evolution through negative natural selection. Under strong natural selection, although the mutation is random on the gene level, the exterior region is inclined to mutate on the protein level owing to more nonsynonymous substitutions in the exterior region, which demonstrates the theoretical feasibility of modifying Fe-SOD on its ends to overcome its disadvantages in clinical applications.
Collapse
Affiliation(s)
- Y Ma
- Institution of Biochemistry, College of Life Sciences, Zhejiang University, Hangzhou, P.R. China
| | | | | | | | | |
Collapse
|
10
|
Miyazawa S. Advantages of a mechanistic codon substitution model for evolutionary analysis of protein-coding sequences. PLoS One 2011; 6:e28892. [PMID: 22220197 PMCID: PMC3248416 DOI: 10.1371/journal.pone.0028892] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2011] [Accepted: 11/16/2011] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND A mechanistic codon substitution model, in which each codon substitution rate is proportional to the product of a codon mutation rate and the average fixation probability depending on the type of amino acid replacement, has advantages over nucleotide, amino acid, and empirical codon substitution models in evolutionary analysis of protein-coding sequences. It can approximate a wide range of codon substitution processes. If no selection pressure on amino acids is taken into account, it will become equivalent to a nucleotide substitution model. If mutation rates are assumed not to depend on the codon type, then it will become essentially equivalent to an amino acid substitution model. Mutation at the nucleotide level and selection at the amino acid level can be separately evaluated. RESULTS The present scheme for single nucleotide mutations is equivalent to the general time-reversible model, but multiple nucleotide changes in infinitesimal time are allowed. Selective constraints on the respective types of amino acid replacements are tailored to each gene in a linear function of a given estimate of selective constraints. Their good estimates are those calculated by maximizing the respective likelihoods of empirical amino acid or codon substitution frequency matrices. Akaike and Bayesian information criteria indicate that the present model performs far better than the other substitution models for all five phylogenetic trees of highly-divergent to highly-homologous sequences of chloroplast, mitochondrial, and nuclear genes. It is also shown that multiple nucleotide changes in infinitesimal time are significant in long branches, although they may be caused by compensatory substitutions or other mechanisms. The variation of selective constraint over sites fits the datasets significantly better than variable mutation rates, except for 10 slow-evolving nuclear genes of 10 mammals. An critical finding for phylogenetic analysis is that assuming variable mutation rates over sites lead to the overestimation of branch lengths.
Collapse
Affiliation(s)
- Sanzo Miyazawa
- Graduate School of Engineering, Gunma University, Kiryu, Gunma, Japan
- * E-mail:
| |
Collapse
|
11
|
Hijikata A, Yura K, Noguti T, Go M. Revisiting gap locations in amino acid sequence alignments and a proposal for a method to improve them by introducing solvent accessibility. Proteins 2011; 79:1868-77. [PMID: 21465562 PMCID: PMC3110861 DOI: 10.1002/prot.23011] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2010] [Revised: 01/23/2011] [Accepted: 01/28/2011] [Indexed: 12/27/2022]
Abstract
In comparative modeling, the quality of amino acid sequence alignment still constitutes a major bottleneck in the generation of high quality models of protein three-dimensional (3D) structures. Substantial efforts have been made to improve alignment quality by revising the substitution matrix, introducing multiple sequences, replacing dynamic programming with hidden Markov models, and incorporating 3D structure information. Improvements in the gap penalty have not been a major focus, however, following the development of the affine gap penalty and of the secondary structure dependent gap penalty. We revisited the correlation between protein 3D structure and gap location in a large protein 3D structure data set, and found that the frequency of gap locations approximated to an exponential function of the solvent accessibility of the inserted residues. The nonlinearity of the gap frequency as a function of accessibility corresponded well to the relationship between residue mutation pattern and residue accessibility. By introducing this relationship into the gap penalty calculation for pairwise alignment between template and target amino acid sequences, we were able to obtain a sequence alignment much closer to the structural alignment. The quality of the alignments was substantially improved on a pair of sequences with identity in the “twilight zone” between 20 and 40%. The relocation of gaps by our new method made a significant improvement in comparative modeling, exemplified here by the Bacillus subtilis yitF protein. The method was implemented in a computer program, ALAdeGAP (ALignment with Accessibility dependent GAp Penalty), which is available at http://cib.cf.ocha.ac.jp/target_protein/. Proteins 2011; © 2011 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- Atsushi Hijikata
- Division of Biological Science, Graduate School of Science, Nagoya University, Furo, Chikusa, Nagoya 464-8602, Japan
| | | | | | | |
Collapse
|
12
|
Yura K, Sulaiman S, Hatta Y, Shionyu M, Go M. RESOPS: a database for analyzing the correspondence of RNA editing sites to protein three-dimensional structures. PLANT & CELL PHYSIOLOGY 2009; 50:1865-73. [PMID: 19808808 PMCID: PMC2775959 DOI: 10.1093/pcp/pcp132] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2009] [Accepted: 09/24/2009] [Indexed: 05/21/2023]
Abstract
Transcripts from mitochondrial and chloroplast DNA of land plants often undergo cytidine to uridine conversion-type RNA editing events. RESOPS is a newly built database that specializes in displaying RNA editing sites of land plant organelles on protein three-dimensional (3D) structures to help elucidate the mechanisms of RNA editing for gene expression regulation. RESOPS contains the following information: unedited and edited cDNA sequences with notes for the target nucleotides of RNA editing, conceptual translation from the edited cDNA sequence in pseudo-UniProt format, a list of proteins under the influence of RNA editing, multiple amino acid sequence alignments of edited proteins, the location of amino acid residues coded by codons under the influence of RNA editing in protein 3D structures and the statistics of biased distributions of the edited residues with respect to protein structures. Most of the data processing procedures are automated; hence, it is easy to keep abreast of updated genome and protein 3D structural data. In the RESOPS database, we clarified that the locations of residues switched by RNA editing are significantly biased to protein structural cores. The integration of different types of data in the database also help advance the understanding of RNA editing mechanisms. RESOPS is accessible at http://cib.cf.ocha.ac.jp/RNAEDITING/.
Collapse
Affiliation(s)
- Kei Yura
- Computational Biology, Graduate School of Humanities and Sciences, Ochanomizu University, 2-1-1 Otsuka, Bunkyo, Tokyo, 112-8610 Japan.
| | | | | | | | | |
Collapse
|
13
|
|
14
|
Romo-González T, Vargas-Madrazo E. Substitution patterns in alleles of immunoglobulin V genes in humans and mice. Mol Immunol 2005; 43:731-44. [PMID: 15935479 DOI: 10.1016/j.molimm.2005.03.018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2005] [Indexed: 11/26/2022]
Abstract
Immunoglobulins (Igs) constitute a subfamily of rapidly evolving proteins. It is postulated that this characteristic is due mainly to the participation of these proteins in highly diverse functions of recognition and defense. Although this vision of rapid evolution in Igs is widely accepted, various studies have demonstrated that diverse and contradictory forces not yet completely understood converge in the evolution of these receptors. In a recent study of the substitution patterns in the alleles that form the human IGHV locus, we found that the variation in genetic and structural information does not occur homogeneously among the different genes, nor among the regions and positions conforming said locus. In view of these results and of the importance of a better understanding of the basic evolutionary process in specific receptors (such as Igs) for both immunology and molecular evolution, it is important to explore the nature of the diversification process in these proteins in detail. In this work, therefore, we analyzed the substitution patterns in all the alleles reported for loci IGKV and IGLV in humans and mice, and we compared the results with those previously observed in the human IGHV locus. We found that the process of evolutionary variation of the Igs reflect the diversity of selective pressures operating on the different loci, genes, sub-regions and positions; for example, diversification through substitution is generally centered on CDRs, but only few positions inside the CDRs were frequently substituted. In spite of this general tendency, it is possible to observe differences in the degree of diversification among loci, families and genes. These tendencies to modify only certain attributes of IGV genes seem to be in agreement with differential strategies associated with the restrictions of the molecular immune recognition mechanism. The complexity of the evolutionary patterns observed in this study leads us to think that the predispositions observed herein may also be due in part to processes of DNA dynamics.
Collapse
Affiliation(s)
- Tania Romo-González
- Instituto de Investigaciones Biológicas, Universidad Veracruzana, 2a Schubert No. 4 Indeco Animas, Xalapa, Ver., C.P. 91190, Mexico
| | | |
Collapse
|
15
|
Romo-González T, Morales-Montor J, Rodríguez-Dorantes M, Vargas-Madrazo E. Novel Substitution Polymorphisms of Human Immunoglobulin VH Genes in Mexicans. Hum Immunol 2005; 66:732-40. [PMID: 15993719 DOI: 10.1016/j.humimm.2005.03.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2005] [Revised: 02/26/2005] [Accepted: 03/01/2005] [Indexed: 11/18/2022]
Abstract
It has been proposed that the defense and recognition functions of the immune system, especially those mediated by antibodies, require a great diversity of receptors. Nonetheless, functional and structural evidence has demonstrated the presence of restrictions, both in the use of the repertoire and in the recognition of antigens. Fifty-one functional genes have been described in the IghV locus; however, there is a variety of evidences indicating that only a small fraction of the immunoglobulin genes plays a central role in determining the fundamental properties of the antibody repertoire of the immune system. On the basis of this functional and structural information, we selected four IghV genes and characterized their polymorphism in a sample of Mexican individuals. We also analyzed the implications for the recognition mechanism of the substitutions found in the sequenced alleles. We found that diversification through allelism varies from segment to segment, both in the amount of alleles encountered and in the nature and distribution of mutations in the codifying zone, which might depend on its importance for the repertoire. Such functional characteristics may be useful in the interpretation of differential gene usage in certain physiological, ontological, and/or pathological conditions.
Collapse
Affiliation(s)
- Tania Romo-González
- Departamento de Biología Sistémica, Instituto de Investigaciones Biológicas, Universidad Veracruzana, Xalapa, Veracruz, México.
| | | | | | | |
Collapse
|
16
|
Grant Mauk A. Electron transfer in genetically engineered proteins. The cytochrome c paradigm. STRUCTURE AND BONDING 2005. [DOI: 10.1007/3-540-53260-9_5] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/08/2023]
|
17
|
Romo-González T, Vargas-Madrazo E. Structural analysis of substitution patterns in alleles of human immunoglobulin VH genes. Mol Immunol 2004; 42:1085-97. [PMID: 15829298 DOI: 10.1016/j.molimm.2004.11.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2004] [Indexed: 10/26/2022]
Abstract
The diversity in repertoires of antibodies (Abs) needed in response to the antigen challenge is produced by evolutionary and somatic processes. The mechanisms operating at a somatic level have been studied in great detail. In contrast, neither the mechanisms nor the strategies of diversification at an evolutionary level have yet been understood in similar detail. Particularly, the substitution patterns in alleles of immunoglobulin genes (Igs) have not been systematically studied. Furthermore, there is a scarcity of studies which link the analysis at a genetic level of the diversification of repertoires with the structural consequences at the protein level of the changes in DNA information. For the purpose of systematically characterizing the strategies of evolutionary diversification through sequence variation at alleles, in this work, we built a database for all the alleles of the IGHV locus in humans reported until now. Based on these data, we performed diverse analyses of substitution patterns and linked these results with studies at the protein level. We found that the sequence diversification in different alleles does not operate with equal intensity for all V genes. Our studies, both of the number of substitutions and of the type of amino acid change per sub-segment of the V-REGION evidenced differences in the selective pressure to which these regions are exposed. The implications of these results for understanding the evolutionary diversification strategies, as well as for the somatic generation of antibody repertoires are discussed.
Collapse
Affiliation(s)
- Tania Romo-González
- Departamento Biología Sistémica, Instituto de Investigaciones Biológicas, Universidad Veracruzana, Morelos 9 La Pitaya, Zoncuantla, 91500 Coatepec, Xalapa, Veracruz, Mexico
| | | |
Collapse
|
18
|
Neves-Petersen MT, Petersen EI, Fojan P, Noronha M, Madsen RG, Petersen SB. Engineering the pH-optimum of a triglyceride lipase: from predictions based on electrostatic computations to experimental results. J Biotechnol 2001; 87:225-54. [PMID: 11334666 DOI: 10.1016/s0168-1656(01)00240-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
The optimisation of enzymes for particular purposes or conditions remains an important target in virtually all protein engineering endeavours. Here, we present a successful strategy for altering the pH-optimum of the triglyceride lipase cutinase from Fusarium solani pisi. The computed electrostatic pH-dependent potentials in the active site environment are correlated with the experimentally observed enzymatic activities. At pH-optimum a distinct negative potential is present in all the lipases and esterases that we studied so far. This has prompted us to propose the "The Electrostatic Catapult Model" as a model for product release after cleavage of the ester bond. The origin of the negative potential is associated with the titration status of specific residues in the vicinity of the active site cleft. In the case of cutinase, the role of Glu44 was systematically investigated by mutations into Ala and Lys. Also, the neighbouring Thr45 was mutated into Proline, with the aim of shifting the spatial location of Glu44. All the charge mutants displayed altered titration behaviour of active site electrostatic potentials. Typically, the substitution of the residue Glu44 pushes the onset of the active site negative potential towards more alkaline conditions. We, therefore, predicted more alkaline pH optima, and this was indeed the experimentally observed. Finally, it was found that the pH-dependent computed Coulombic energy displayed a strong correlation with the observed melting temperatures of native cutinase.
Collapse
Affiliation(s)
- M T Neves-Petersen
- The Biostructure and Protein Engineering Group, Department of Life Sciences, Aalborg University, Sohngaardsholmsvej 49, DK-9000 Aalborg, Denmark
| | | | | | | | | | | |
Collapse
|
19
|
Neves Petersen MT, Fojan P, Petersen SB. How do lipases and esterases work: the electrostatic contribution. J Biotechnol 2001; 85:115-47. [PMID: 11165360 DOI: 10.1016/s0168-1656(00)00360-6] [Citation(s) in RCA: 92] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
This work explores the role of one of the factors explaining lipase/esterase activity: the contribution of electrostatic interactions to lipase/esterase activity. The electrostatic potential distribution on the molecular surface of an enzyme as a function of pH determines, to a large extent, the enzyme's pH activity profile. Other important factors include the presence and distribution of polar and hydrophobic residues in the active cleft. We have mapped the electrostatic potential distribution as a function of pH on the molecular surface of nine lipases/esterases for which the 3D structure is experimentally known. A comparison of these potential maps at different pH values with the corresponding pH-activity profile, pH optimum or pH range where the activity displayed by the enzyme is maximum, has revealed a considerable correlation. A negative potential in the active site appears correlated with maximum activity towards triglycerides, which has prompted us to propose a model for product release ('The electrostatic catapult model') after cleavage of an ester bond. At the same time as the bottom of the active site cleft becomes negatively charged, other nearby regions also titrate and become negatively charged when pH becomes more alkaline, for some of the studied lipases. If such lipases also show phospholipase activity (such as guinea pig lipase-related proteins 2 chimera) we raise the hypothesis that such other titratable regions after becoming negatively charged might stabilise the positive charge present in the polar head of phospholipids, such as phosphatidylcholine, phosphatidylethanolamine and phosphatidylserine. The distribution of polar, weak polar and non-polar residues on the molecular surface of each studied lipase, in particular the active site region, was compared for all the lipases studied. The combination of graphical visualisation of the electrostatic potential maps and the polarity maps combined with knowledge about the location of key residues on the protein surface allows us to envision atomic models for lipolytic activity.
Collapse
Affiliation(s)
- M T Neves Petersen
- Biostructure and Protein Engineering Group, Institute of Life Science, Aalborg University, Sohngaardsholmsvej 57, DK-9000, Aalborg, Denmark
| | | | | |
Collapse
|
20
|
Miyazawa S, Jernigan RL. Identifying sequence-structure pairs undetected by sequence alignments. PROTEIN ENGINEERING 2000; 13:459-75. [PMID: 10906342 DOI: 10.1093/protein/13.7.459] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
We examine how effectively simple potential functions previously developed can identify compatibilities between sequences and structures of proteins for database searches. The potential function consists of pairwise contact energies, repulsive packing potentials of residues for overly dense arrangement and short-range potentials for secondary structures, all of which were estimated from statistical preferences observed in known protein structures. Each potential energy term was modified to represent compatibilities between sequences and structures for globular proteins. Pairwise contact interactions in a sequence-structure alignment are evaluated in a mean field approximation on the basis of probabilities of site pairs to be aligned. Gap penalties are assumed to be proportional to the number of contacts at each residue position, and as a result gaps will be more frequently placed on protein surfaces than in cores. In addition to minimum energy alignments, we use probability alignments made by successively aligning site pairs in order by pairwise alignment probabilities. The results show that the present energy function and alignment method can detect well both folds compatible with a given sequence and, inversely, sequences compatible with a given fold, and yield mostly similar alignments for these two types of sequence and structure pairs. Probability alignments consisting of most reliable site pairs only can yield extremely small root mean square deviations, and including less reliable pairs increases the deviations. Also, it is observed that secondary structure potentials are usefully complementary to yield improved alignments with this method. Remarkably, by this method some individual sequence-structure pairs are detected having only 5-20% sequence identity.
Collapse
Affiliation(s)
- S Miyazawa
- Faculty of Technology, Gunma University, Kiryu, Gunma 376, Japan and Room B-116, Bldg 12B, MSC 5677, Laboratory of Experimental and Computational Biology, DBS, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892-5677,USA
| | | |
Collapse
|
21
|
Yamaguchi-Kabata Y, Gojobori T. Reevaluation of amino acid variability of the human immunodeficiency virus type 1 gp120 envelope glycoprotein and prediction of new discontinuous epitopes. J Virol 2000; 74:4335-50. [PMID: 10756049 PMCID: PMC111951 DOI: 10.1128/jvi.74.9.4335-4350.2000] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
To elucidate the evolutionary mechanisms of the human immunodeficiency virus type 1 gp120 envelope glycoprotein at the single-site level, the degree of amino acid variation and the numbers of synonymous and nonsynonymous substitutions were examined in 186 nucleotide sequences for gp120 (subtype B). Analyses of amino acid variabilities showed that the level of variability was very different from site to site in both conserved (C1 to C5) and variable (V1 to V5) regions previously assigned. To examine the relative importance of positive and negative selection for each amino acid position, the numbers of synonymous and nonsynonymous substitutions that occurred at each codon position were estimated by taking phylogenetic relationships into account. Among the 414 codon positions examined, we identified 33 positions where nonsynonymous substitutions were significantly predominant. These positions where positive selection may be operating, which we call putative positive selection (PS) sites, were found not only in the variable loops but also in the conserved regions (C1 to C4). In particular, we found seven PS sites at the surface positions of the alpha-helix (positions 335 to 347 in the C3 region) in the opposite face for CD4 binding. Furthermore, two PS sites in the C2 region and four PS sites in the C4 region were detected in the same face of the protein. The PS sites found in the C2, C3, and C4 regions were separated in the amino acid sequence but close together in the three-dimensional structure. This observation suggests the existence of discontinuous epitopes in the protein's surface including this alpha-helix, although the antigenicity of this area has not been reported yet.
Collapse
Affiliation(s)
- Y Yamaguchi-Kabata
- Center for Information Biology, National Institute of Genetics, Mishima 411-8540, Japan
| | | |
Collapse
|
22
|
Abstract
Xylanases are classified into two families, numbered F/10 and G/11 according to the similarity of amino acid sequences of their catalytic domain (Henrissat, B., Bairoch, A., 1993. New families in the classification of glycosyl hydrolases based on amino acid sequence similarities. Biochem. J. 293, 781-788). Three-dimensional structure of the catalytic domain of the family F/10 xylanase was reported (White, A., Withers, S.G., Gilkes, N.R., Rose, D.R., 1994. Crystal structure of the catalytic domain of the beta-1,4-glycanase Cex from Cellulomonas fimi. Biochemistry 33, 12546-12552). The domain was decomposed into 22 modules by centripetal profiles (Go, M., Nosaka, M., 1987. Protein architecture and the origin of introns. Cold Spring Harbor Symp. Quant. Biol. 52, 915-924; Noguti, T., Sakakibara, H., Go, M., 1993. Localization of hydrogen-bonds within modules in barnase. Proteins 16, 357-363). A module is a contiguous polypeptide segment of amino acid residues having a compact conformation within a globular domain. Collected 31 intron sites of the family F/10 xylanase genes from fungus were found to be correlated to module boundaries with considerable statistical force (p values <0.001). The relationship between the intron locations and protein structures provides supporting evidence for the ancient origin of introns, because such a relationship cannot be expected by random insertion of introns into eukaryotic genes, but it rather suggests pre-existence of introns in the ancestral genes of prokaryotes and eukaryotes. A phylogenetic tree of the fungal and bacterial xylanase sequences made two clusters; one includes both the bacterial and fungal genes, but the other consists of only fungal genes. The mixed cluster of bacterial genes without introns and the fungal genes with introns further supports the ancient origin of introns. Comparison of the conserved base sequences of introns indicates that sliding of a splice site occurred in Aspergillus kawachii gene by one base from the ancestral position. Substrate-binding sites of xylanase are localized on eight modules, and introns are found at both termini of six out of these functional modules. This result suggests that introns might play a functional role in shuffling the exons encoding the substrate-binding modules.
Collapse
Affiliation(s)
- Y Sato
- Division of Biological Science, Graduate School of Science, Nagoya University, Japan
| | | | | | | |
Collapse
|
23
|
Takahashi KI, Noguti T, Hojo H, Yamauchi K, Kinoshita M, Aimoto S, Ohkubo T, Gō M. A mini-protein designed by removing a module from barnase: molecular modeling and NMR measurements of the conformation. PROTEIN ENGINEERING 1999; 12:673-80. [PMID: 10469828 DOI: 10.1093/protein/12.8.673] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
A globular domain can be decomposed into compact modules consisting of contiguous 10-30 amino acid residues. The correlation between modules and exons observed in different proteins suggests that each module was encoded by an ancestral exon and that modules were combined into globular domains by exon fusion. Barnase is a single domain RNase consisting of 110 amino acid residues and was decomposed into six modules. We designed a mini-protein by removing the second module, M2, from barnase in order to gain an insight into the structural and functional roles of the module. In the molecular modeling of the mini-protein, we evaluated thermodynamic stability and aqueous solubility together with mechanical stability of the model. We chemically synthesized a mini-barnase with (15)N-labeling at 10 residues, whose corresponding residues in barnase are all found in the region around the hydrophobic core. Circular dichroism and NMR measurements revealed that mini-barnase takes a non-random specific conformation that has a similar hydrophobic core structure to that of barnase. This result, that a module could be deleted without altering the structure of core region of barnase, supports the view that modules act as the building blocks of protein design.
Collapse
Affiliation(s)
- K i Takahashi
- Division of Biological Science, Graduate School of Science, Nagoya University, Furo-cho, Chikusa, Nagoya 464-8602, Japan
| | | | | | | | | | | | | | | |
Collapse
|
24
|
|
25
|
Petersen SB, Jonson PH, Fojan P, Petersen EI, Petersen MT, Hansen S, Ishak RJ, Hough E. Protein engineering the surface of enzymes. J Biotechnol 1998; 66:11-26. [PMID: 9866858 DOI: 10.1016/s0168-1656(98)00153-9] [Citation(s) in RCA: 47] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
The protein surface is the interface through which a protein senses the external world. Its composition of charged, polar and hydrophobic residues is crucial for the stability and activity of the protein. The charge state of seven of the twenty naturally occurring amino acids is pH dependent. A total of 95% of all titratable residues are located on the surface of soluble proteins. In evolutionary related families of proteins such residues are particularly prone to substitutions, insertions and deletions. We present here an analysis of the residue composition of 4038 proteins, selected from 125 protein families with < 25% identity between core members of each family. Whereas only 16.8% of the residues were truly buried, 40.7% were > 30% exposed on the surface and the remainder were < 30% exposed. The individual residue types show distinct differences. The data presented provides an important new approach to protein engineering of protein surfaces. Guidelines for the optimization of solvent exposure for a given residue are given. The cutinase family of enzymes has been investigated. The stability of native cutinase has been studied as a function of pH, and has been compared with the cutinase activity towards tributyrin. Whereas the onset of enzymatic activity is linked with the deprotonation of the active site HIS188, destabilization of the 3D structure as determined by differential scanning calorimetry is coupled with the loss of activity at very basic pH values. A modeling investigation of the pH dependence of the electrostatic potentials reveals that the activity range is accompanied by the development of a highly significant negative potential in the active site cleft. The 3D structures of three mutants of the Fusarium solani pisi cutinase have been solved to high resolution using X-ray diffraction analysis. Preliminary X-ray data are presented.
Collapse
Affiliation(s)
- S B Petersen
- Department of Biotechnology, University of Aalborg, Denmark.
| | | | | | | | | | | | | | | |
Collapse
|
26
|
Marcone MF, Kakuda Y, Yada RY. Salt-soluble seed globulins of dicotyledonous and monocotyledonous plants II. Structural characterization. Food Chem 1998. [DOI: 10.1016/s0308-8146(97)00159-3] [Citation(s) in RCA: 73] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
27
|
Shirai T, Go M. Adaptive amino acid replacements accompanied by domain fusion in reverse transcriptase. J Mol Evol 1997; 44 Suppl 1:S155-62. [PMID: 9071024 DOI: 10.1007/pl00000068] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Two basic processes are involved in protein evolution: One is amino acid replacement and another is reorganization of structural or functional units of proteins. Multidomain or multifunctional proteins are thought to have evolved by fusion of smaller structural units such as modules or domains. Reverse transcriptase (RT) is one of such fused proteins. The N-terminal part forms of globular domain with polymerase activity and the C-terminal part forms another globular domain with ribonuclease H activity (RNase H domain). There are single-domain enzymes which are homologous with the RNase H domain. The group of enzymes is called type I ribonuclease H (RNase HI). It is most likely that the ancestors of RNase HI and the polymerase domain were fused and became contemporary RT. At fusion, amino acid replacements presumably occurred at the interface of the domains to reinforce the interdomain interactions. Such replaced amino acid residues are conserved during evolution of the fused enzyme. We analyzed the pattern of amino acid replacement at each residue site in the free form, RNase HI group, and the integrated form, RNase H domain group. Then we compared the patterns between the two forms. Drastic fitting replacements of amino acid residues occurred at four of 29 residue sites involved in interdomain contact. Hydrophilic amino acid residues of the free form were substituted with hydrophobic or ambivalent ones in the integrated form. These substitutions aid in stabilizing the fused conformation by hydrophobic interactions at the interface of the domains. These observations imply that domain fusion could have occurred with only a relatively small number of adaptive amino acid substitutions.
Collapse
Affiliation(s)
- T Shirai
- Division of Biological Science, Graduate School of Science, Nagoya University, Japan
| | | |
Collapse
|
28
|
Affiliation(s)
- H R Guy
- Laboratory of Mathematical Biology, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892-5677, USA
| | | |
Collapse
|
29
|
Nakagawa H, Koyama Y, Kawai T, Okada T. A new clustering system for protein sequences and its application to constraints discovery in protein evolution. INTERNATIONAL JOURNAL OF PEPTIDE AND PROTEIN RESEARCH 1995; 46:440-51. [PMID: 8567189 DOI: 10.1111/j.1399-3011.1995.tb01079.x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
A conceptual clustering system, CLUSMOL/S, has been developed to classify protein sequences from a user-defined point of view. Given a grouping of amino acids as a viewpoint, the system constructs taxonomic trees of sequences based on minimum information criterion. Every tree node expresses itself as a generic consensus sequence that consists of specific consensus amino acids insertion/deletion points, and generic amino acids with a specified character. The resulting tree and generic sequences show the similarity-based relationships among sequences and their characteristics. Application to vertebrate cytochromes c yields an acceptable cladrogram only when amino acids are grouped by volume and length of sidechains. The result indicates that the steric factor is the most important constraint in the process of protein evolution.
Collapse
Affiliation(s)
- H Nakagawa
- Faculty of Science, Kwansei Gakuin University, Nishinomiya, Japan
| | | | | | | |
Collapse
|
30
|
Miyata T, Inagi R, Wada Y, Ueda Y, Iida Y, Takahashi M, Taniguchi N, Maeda K. Glycation of human beta 2-microglobulin in patients with hemodialysis-associated amyloidosis: identification of the glycated sites. Biochemistry 1994; 33:12215-21. [PMID: 7918443 DOI: 10.1021/bi00206a026] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
beta 2-Microglobulin (beta 2M) is a major component forming amyloid deposits in patients with hemodialysis-associated amyloidosis (HAA), a serious complication of long-term hemodialysis. Recently, we demonstrated that beta 2M modified with the Maillard reaction is a definite constituent of amyloid deposits in patients with HAA. Our further study demonstrated that this modified beta 2M induces not only chemotaxis of monocytes but also secretion of tumor necrosis factor-alpha, interleukin-1 beta, and interleukin-6 from macrophages, suggesting the potential link of glycation of beta 2M by the Maillard reaction to the pathogenesis of HAA. The present study was undertaken to identify the glycated site(s) of beta 2M purified from long-term hemodialysis patients as well as beta 2M incubated with glucose in vitro. Borotritide-treated beta 2M was cleaved by endoproteinase Lys-C, and peptides were isolated by reverse-phase high-performance liquid chromatography, followed by amino acid sequence analysis and fast atom bombardment mass spectrometry to identify the glycated site. The glycated sites of beta 2M formed in vivo were found to be almost the same as those of glycated beta 2M in vitro. The primary glycated site was the alpha-amino group of the amino terminal isoleucine. Other minor sites were the epsilon-amino groups of Lys-19, -41, -48, -58, -91, and -94. Computer graphics of the three-dimensional structure of beta 2M suggested that the high specificity for the glycated site at Ile-1 may be explained by its high solvent accessibility and the nearby imidazole group of His-31 as an acid-base catalyst of the Amadori rearrangement.
Collapse
Affiliation(s)
- T Miyata
- Department of Internal Medicine, Branch Hospital, Nagoya University School of Medicine, Japan
| | | | | | | | | | | | | | | |
Collapse
|
31
|
Parker JM, Hodges RS. HomologyPlot: searching for homology to a family of proteins using a database of unique conserved patterns. J Comput Aided Mol Des 1994; 8:193-210. [PMID: 8064334 DOI: 10.1007/bf00119867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
A new database of conserved amino acid residues is derived from the multiple sequence alignment of over 84 families of protein sequences that have been reported in the literature. This database contains sequences of conserved hydrophobic core patterns which are probably important for structure and function, since they are conserved for most sequences in that family. This database differs from other single-motif or signature databases reported previously, since it contains multiple patterns for each family. The new database is used to align a new sequence with the conserved regions of a family. This is analogous to reports in the literature where multiple sequence alignments are used to improve a sequence alignment. A program called HomologyPlot (suitable for IBM or compatible computers) uses this database to find homology of a new sequence to a family of protein sequences. There are several advantages to using multiple patterns. First, the program correctly identifies a new sequence as a member of a known family. Second, the search of the entire database is rapid and requires less than one minute. This is similar to performing a multiple sequence alignment of a new sequence to all of the known protein family sequences. Third, the alignment of a new sequence to family members is reliable and can reproduce the alignment of conserved regions already described in the literature. The speed and efficiency of this method is enhanced, since there is no need to score for insertions or deletions as is done in the more commonly used sequence alignment methods. In this method only the patterns are aligned. HomologyPlot also provides general information on each family, as well as a listing of patterns in a family.
Collapse
Affiliation(s)
- J M Parker
- Department of Biochemistry, University of Alberta, Edmonton, Canada
| | | |
Collapse
|
32
|
McCormack K, Lin L, Sigworth FJ. Substitution of a hydrophobic residue alters the conformational stability of Shaker K+ channels during gating and assembly. Biophys J 1993; 65:1740-8. [PMID: 8274662 PMCID: PMC1225901 DOI: 10.1016/s0006-3495(93)81202-5] [Citation(s) in RCA: 24] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
A leucine residue at position 370 (L370) in 29-4 Shaker K+ channels resides within two overlapping sequence motifs conserved among most voltage-gated channels: the S4 segment and a leucine heptad repeat. Here we investigate the effects observed upon substitution of L370 with many other uncharged amino acid residues. We find that smaller or more hydrophilic residues produce greater alterations in both activation and inactivation gating than does substitution with other large hydrophobic residues. In addition, subunits containing less conservative substitutions at position 370 are restricted in their assembly with wild-type subunits and are unlikely to form homomultimeric channel complexes. Consistent with the idea that L370 influences the tertiary structure of these channels, the results indicate that L370 undergoes specific hydrophobic interactions during the conformational transitions of gating; similar interactions may take place during the folding, insertion, or assembly of Shaker K+ channel subunits.
Collapse
Affiliation(s)
- K McCormack
- Department of Cellular and Molecular Physiology, Yale University School of Medicine, New Haven, Connecticut 06510
| | | | | |
Collapse
|
33
|
Abstract
Atomic-scale computer models were developed for how cecropin peptides may assemble in membranes to form two types of ion channels. The models are based on experimental data and physiochemical principles. Initially, cecropin peptides, in a helix-bend-helix motif, were arranged as antiparallel dimers to position conserved residues of adjacent monomers in contact. The dimers were postulated to bind to the membrane with the NH2-terminal helices sunken into the head-group layer and the COOH-terminal helices spanning the hydrophobic core. This causes a thinning of the top lipid layer of the membrane. A collection of the membrane bound dimers were then used to form the type I channel structure, with the pore formed by the transmembrane COOH-terminal helices. Type I channels were then assembled into a hexagonal lattice to explain the large number of peptides that bind to the bacterium. A concerted conformational change of a type I channel leads to the larger type II channel, in which the pore is formed by the NH2-terminal helices. By having the dimers move together, the NH2-terminal helices are inserted into the hydrophobic core without having to desolvate the charged residues. It is also shown how this could bring lipid head-groups into the pore lining.
Collapse
Affiliation(s)
- S R Durell
- Laboratory of Mathematical Biology, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892
| | | | | |
Collapse
|
34
|
Morris KN, Wool IG. Determination by systematic deletion of the amino acids essential for catalysis by ricin A chain. Proc Natl Acad Sci U S A 1992; 89:4869-73. [PMID: 1594586 PMCID: PMC49189 DOI: 10.1073/pnas.89.11.4869] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The A chain of ricin is a RNA N-glycosidase that inactivates ribosomes by depurination of a single adenosine in 28S rRNA. Of the 267 amino acids in the protein, 222 (83%) could be deleted from one or another of 74 mutants without loss of the capacity of the protein to recognize a single nucleotide from among the 7000 in ribosomes or to catalyze hydrolysis.
Collapse
Affiliation(s)
- K N Morris
- Department of Biochemistry and Molecular Biology, University of Chicago, IL 60637
| | | |
Collapse
|
35
|
Durell SR, Guy HR. Atomic scale structure and functional models of voltage-gated potassium channels. Biophys J 1992; 62:238-47; discussion 247-50. [PMID: 1600096 PMCID: PMC1260525 DOI: 10.1016/s0006-3495(92)81809-x] [Citation(s) in RCA: 277] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Recent mutagenesis experiments have confirmed our hypothesis that a segment between S5 and S6 forms the ion selective portion of voltage-gated ion channels. Based on these and other new data, we have revised previous models of the general folding pattern of voltage-gated channel proteins and have developed atomic scale models of the entire transmembrane region of the Shaker A K+ channel. In these models, the ion selective region is a beta-barrel that spans the outer half of the membrane. The inner half of the pore is larger. The voltage-dependent conformational changes of activation gating are modeled to occur by the "helical screw" mechanism, in which the four S4 segments move along and rotate about their axes. These changes are followed by a voltage-independent conformational change, in which the segments linking S4 to S5 move from blocking the intracellular entrance of the pore to forming part of the lining of the large inner portion of the pore. The NH2-terminal of the protein was modeled as an alpha-helix that plugs the intracellular half of the pore to inactivate the channel.
Collapse
Affiliation(s)
- S R Durell
- Laboratory of Mathematical Biology, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892
| | | |
Collapse
|
36
|
Ladant D, Glaser P, Ullmann A. Insertional mutagenesis of Bordetella pertussis adenylate cyclase. J Biol Chem 1992. [DOI: 10.1016/s0021-9258(18)45869-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
|
37
|
|
38
|
Herning T, Yutani K, Taniyama Y, Kikuchi M. Effects of proline mutations on the unfolding and refolding of human lysozyme: the slow refolding kinetic phase does not result from proline cis-trans isomerization. Biochemistry 1991; 30:9882-91. [PMID: 1911779 DOI: 10.1021/bi00105a011] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The unfolding and refolding kinetics of six proline mutants of the human lysozyme (h-lysozyme) were carried out and compared to that of the wild-type protein. Our results show that the slow refolding phase observed in the h-lysozyme refolding kinetics cannot be ascribed to proline isomerization reactions. The h-lysozyme contains two proline residues at positions 71 and 103, both in the trans conformation in the native state. The refolding kinetics of the P71G/P103G mutant, in which both prolines have been replaced by a glycine, were found to be similar to those of the wild-type protein. The same slow phase amplitude of about 10% was found for both proteins, and the slow phase rate constants were also identical within experimental error. Other mutants such as P103G or P71G, in which only one of the two prolines has been replaced by a glycine, and A47P with its three prolines, gave identical slow refolding phases. The X-ray structure analysis and scanning microcalorimetric study of each protein (Herning et al., unpublished experiments) have confirmed that none of the considered mutations affects significantly protein structure and that no major changes in protein stability were brought about by these mutations. Therefore, comparison of the properties of the mutant and wild-type proteins is legitimate. Interestingly, the refolding kinetics of the V110P mutant, in which a proline residue has been introduced at position 110 (N-terminus of an alpha-helix), were clearly triphasic. For this mutant an additional very slow phase with properties similar to those expected from the proline hypothesis was detected. Equilibrium denaturation studies were conducted for each protein, and the refolding pathway of h-lysozyme is partly presented. We also discuss the effect of proline mutations on the energetics of the folding pathway of the h-lysozyme in water.
Collapse
Affiliation(s)
- T Herning
- Protein Engineering Research Institute, Osaka, Japan
| | | | | | | |
Collapse
|
39
|
Tsukihara T, Fukuyama K, Mizushima M, Harioka T, Kusunoki M, Katsube Y, Hase T, Matsubara H. Structure of the [2Fe-2S] ferredoxin I from the blue-green alga Aphanothece sacrum at 2.2 A resolution. J Mol Biol 1990; 216:399-410. [PMID: 2123937 DOI: 10.1016/s0022-2836(05)80330-4] [Citation(s) in RCA: 95] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Crystals of a [2Fe-2S] ferredoxin (Fd) I with a relative molecular mass of 10,480 were obtained from the blue-green alga Aphanothece sacrum. Each asymmetric unit of the crystal contains four molecules. An electron density map calculated by the single isomorphous replacement method with the anomalous dispersion at 2.5 A resolution was refined by averaging the four molecules in the asymmetric unit. Positional and isotropic thermal parameters for the non-hydrogen atoms of the four molecules and 158 water molecules were refined to an R-factor (R = sigma[Fo-Fc[/sigma Fo) of 0.23 by the restrained least-squares method. The estimated root-mean-square (r.m.s.) error for the atomic positions is 0.3 A. The r.m.s. deviations of equivalent C alpha atoms of the asymmetric-unit molecules superposed by the least-squares method average 0.35 A. The Fd molecule has a structure like the beta-barrel in the molecule of the [2Fe-2S] Fd from Spirulina platensis. A [2Fe-2S] cluster is bonded covalently to the protein molecule by four Fe-S, in which three of the Fe-S bonds are in a loop segment from position 38 to 47. The hydrophobic core inside the beta-barrel is formed by seven conservative residues: Val15, Val18, Ile24, Leu51, Ile74, Ala79 and Ile87. The molecular surface around Tyr23, Tyr80 and the active center may interact with ferredoxin-NADP+ reductase. One of the two iron atoms of the [2Fe-2S] cluster should be more easily reduced than the other because of differences in the hydrogen-bonding scheme and the hydrophobicity around the atoms.
Collapse
Affiliation(s)
- T Tsukihara
- Faculty of Engineering, Tottori University, Japan
| | | | | | | | | | | | | | | |
Collapse
|
40
|
Armstrong S, Francis C, McIntosh M. Molecular analysis of the Escherichia coli ferric enterobactin receptor FepA. J Biol Chem 1990. [DOI: 10.1016/s0021-9258(18)77336-5] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
|
41
|
Freimuth PI, Taylor JW, Kaiser ET. Introduction of guest peptides into Escherichia coli alkaline phosphatase. Excision and purification of a dynorphin analogue from an active chimeric protein. J Biol Chem 1990. [DOI: 10.1016/s0021-9258(19)40134-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
|
42
|
Noguti T, Go N. Structural basis of hierarchical multiple substates of a protein. III: Side chain and main chain local conformations. Proteins 1989; 5:113-24. [PMID: 2748575 DOI: 10.1002/prot.340050205] [Citation(s) in RCA: 27] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
An analysis is carried out of differences in the minimum energy conformations obtained in the previous paper by energy minimization starting from conformations sampled by a Monte Carlo simulation of conformational fluctuations in the native state of a globular protein, bovine pancreatic trypsin inhibitor. Main conformational differences in each pair of energy minima are found usually localized in several side chains and in a few local main chain segments. Such side chains and local main chain segments are found to take a few distinct local conformations in the minimum energy conformations. Energy minimum conformations can thus be described in terms of combinations of these multiple local conformations.
Collapse
Affiliation(s)
- T Noguti
- Computation Center, Faculty of Engineering, Kyushu University, Fukuoka, Japan
| | | |
Collapse
|
43
|
Frömmel C. Use of the averaged mutation rate in pieces of protein sequences to predict the location of antigenic determinants. J Theor Biol 1988; 132:171-7. [PMID: 2463436 DOI: 10.1016/s0022-5193(88)80155-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Antigenic determinants in proteins show properties, which can be used to their prediction: high degree of accessibility, mobility, polarity etc. Furthermore, there is a remarkable correlation between the location of antigenic determinants and regions with high frequency of accepted mutations during the evolution. Consequently, it is possible to predict successfully the location of antigenic epitopes in proteins using the averaged mutation rate of protein sequence pieces.
Collapse
Affiliation(s)
- C Frömmel
- Institute of Biochemistry, Humboldt-University Berlin, GDR
| |
Collapse
|
44
|
Klein P, Somorjai RL. Nonlinear methods for discrimination and their application to classification of protein structures. J Theor Biol 1988; 130:461-8. [PMID: 3184960 DOI: 10.1016/s0022-5193(88)80210-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Discriminant analysis assigns objects to one of several classes on the basis of attributes which characterize the objects. The success of classification depends on the selection of discriminatory attributes and on the choice of an assignment rule. In this paper we focus on the latter and discuss ways to obtain nonlinear classification rules through maximum likelihood, canonical components and projection pursuit. We use both linear and nonlinear methods to classify proteins into three secondary structural types: alpha, beta, and mixed alpha and beta or irregular. Using simple attributes, dependent on amino acid properties, we show that the rate of incorrect classification can be decreased by more than 15% when nonlinear methods are used.
Collapse
Affiliation(s)
- P Klein
- Divison of Biological Sciences, National Research Council of Canada, Ottawa, Ontario
| | | |
Collapse
|
45
|
Abstract
In order to analyze the evolutionary behavior of the cobra venom cytotoxins, their probable tertiary structure was predicted using computer graphics. The 41 amino acid sequences known show that the major evolutionary changes have taken place in two particularly exposed areas of the molecular surface. In each area, neighboring residue positions seem to have evolved interdependently, but there is no obvious interdependence between the two areas. Indeed, the relative evolution of these two areas prompts a subdivision of the sequence set into four groups. According to the known cytotoxin circular dichroism spectra, one of these four groups could be characterized by a difference in molecular secondary structure. Since the two variable areas have functional associations, it is suggested that their evolution may be governed by a target with several similar binding sites.
Collapse
Affiliation(s)
- R Breckenridge
- Department of Pure and Applied Chemistry, University of Strathclyde, Glasgow, United Kingdom
| | | |
Collapse
|
46
|
Andersen K, Caton J. Sequence analysis of the Alcaligenes eutrophus chromosomally encoded ribulose bisphosphate carboxylase large and small subunit genes and their gene products. J Bacteriol 1987; 169:4547-58. [PMID: 2820933 PMCID: PMC213820 DOI: 10.1128/jb.169.10.4547-4558.1987] [Citation(s) in RCA: 53] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
The nucleotide sequence of the chromosomally encoded ribulose bisphosphate carboxylase/oxygenase (RuBPCase) large (rbcL) and small (rbcS) subunit genes of the hydrogen bacterium Alcaligenes eutrophus ATCC 17707 was determined. We found that the two coding regions are separated by a 47-base-pair intergenic region, and both genes are preceded by plausible ribosome-binding sites. Cotranscription of the rbcL and rbcS genes has been demonstrated previously. The rbcL and rbcS genes encode polypeptides of 487 and 135 amino acids, respectively. Both genes exhibited similar codon usage which was highly biased and different from that of other organisms. The N-terminal amino acid sequence of both subunit proteins was determined by Edman degradation. No processing of the rbcS protein was detected, while the rbcL protein underwent a posttranslational loss of formylmethionyl. The A. eutrophus rbcL and rbcS proteins exhibited 56.8 to 58.3% and 35.6 to 38.5% amino acid sequence homology, respectively, with the corresponding proteins from cyanobacteria, eucaryotic algae, and plants. The A. eutrophus and Rhodospirillum rubrum rbcL proteins were only about 32% homologous. The N- and C-terminal sequences of both the rbcL and the rbcS proteins were among the most divergent regions. Known or proposed active site residues in other rbcL proteins, including Lys, His, Arg, and Asp residues, were conserved in the A. eutrophus enzyme. The A. eutrophus rbcS protein, like those of cyanobacteria, lacks a 12-residue internal sequence that is found in plant RuBPCase. Comparison of hydropathy profiles and secondary structure predictions by the method described by Chou and Fasman (P. Y. Chou and G. D. Fasman, Adv. Enzymol. 47:45-148, 1978) revealed striking similarities between A. eutrophus RuBPCase and other hexadecameric enzymes. This suggests that folding of the polypeptide chains is similar. The observed sequence homologies were consistent with the notion that both the rbcL and rbcS genes of the chemoautotroph A. eutrophus and the thus far characterized rbc genes of photosynthetic organisms have a common origin. This suggests that both subunit genes have a very ancient origin. The role of quaternary structure as a determinant of the rate of accepted amino acid substitution was examined. It is proposed that the sequence of the dimeric R. rubrum RuBPCase may be less conserved because there are fewer structural constraints for this RuBPCase than there are for hexadecameric enzymes.
Collapse
|
47
|
Bando H, Kusuda J, Gojobori T, Maruyama T, Kawase S. Organization and nucleotide sequence of a densovirus genome imply a host-dependent evolution of the parvoviruses. J Virol 1987; 61:553-60. [PMID: 3027382 PMCID: PMC253980 DOI: 10.1128/jvi.61.2.553-560.1987] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
The genome structure of a densovirus from a silkworm was determined by sequencing more than 85% of the complete genome DNA. This is the first report of the genome organization of an insect parvovirus deduced from the DNA sequence. In the viral genome, two large open reading frames designated 1 and 2 and one smaller open reading frame designated 3 were identified. The first two open reading frames shared the same strand, while the third was found in the complementary sequence. Computer analysis suggested that open reading frame 2 may encode all four structural proteins. The genome organization and a part of the nucleotide sequence were conserved among the insect densovirus, rodent parvoviruses, and a human dependovirus. These viruses may have diverged from a common ancestor.
Collapse
|
48
|
Abstract
The question addressed in this report focuses on the autoantigenicity of self antigens, principally cytochrome c and lysozyme. Of interest is whether the immune system produces autoantibodies to its host proteins reacting randomly with all potential antigen sites or is autoreactively selective for certain determinants. Based on experimental evidence from autoantibodies against cytochromes c, Jemmerson and Margoliash [Jemmerson, R. & Margoliash, E. (1979) Nature (London) 282, 468-471] have described a striking correlation between autoreactive sequence regions and evolutionary instability. While their analysis of evolutionary variation was based on simple sequence variability plots, we present here a refined approach that takes into account the distinction between evolutionary substitutions that induce a change in the protein surface from those that do not (surface-neutral substitutions). A quantitative aspect of surface variation (surface consensus) is included in the algorithm that produces a ranked order for autoantigenic determinants. The final plot, called surface variability, indicates sequence regions having a preference for autoimmune reaction. We propose the term "autogen" to designate such protein determinants.
Collapse
|
49
|
|
50
|
Abstract
A model was constructed that predicts the electric charge of a protein and its isoelectric point from its primary and quaternary structures. By using two different patterns of mutation and purifying selection, four schemes of nucleotide substitution were simulated. In the absence of selection for a specific value of pI, proteins are expected to evolve toward a mildly basic pI. Thus, the selection for maintaining extreme values of pI must be stringent, and proteins with extreme pI's will evolve very slowly. This prediction is consistent with observations on the evolution of histones and ubiquitin. The mean charge change is expected to be about 0.005 units pI per nucleotide substitution. The amount of electrophoretically hidden variation is expected to be considerable even for large degrees of divergence at the nucleotide and amino acid levels. Electrophoretic detectability depends on the size of the protein. The longer the protein the larger the amount of variation at the amino acid level that is undetectable by isoelectric focusing. This property may be partially responsible for the imperfect correlation between molecular weight and gene diversity observed for electrophoretic data. Very basic and very acidic proteins are expected to generate less electrophoretic variability than proteins with intermediate pI's. Unequal rates of mutation between nucleotides and asymmetrical patterns of purifying selection have almost no effect on the equilibrium pI of proteins, but affect the rates of change in pI, and increase the amount of electrophoretically hidden variation in comparison to the expectations derived from random patterns of mutation and constant selection. Comparison of detectability of protein differences among four electrophoretical techniques suggests that the best performance is obtained by the sequential electrophoresis method.
Collapse
|