1
|
Chemical specificity and conformational flexibility in proteinase-inhibitor interaction: scaffolds for promiscuous binding. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2014; 116:151-7. [PMID: 25151636 DOI: 10.1016/j.pbiomolbio.2014.08.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2014] [Revised: 07/26/2014] [Accepted: 08/02/2014] [Indexed: 11/24/2022]
Abstract
One of the most important roles of proteins in cellular milieu is recognition of other biomolecules including other proteins. Protein-protein complexes are involved in many essential cellular processes. Interfaces of protein-protein complexes are traditionally known to be conserved in evolution and less flexible than other solvent interacting tertiary structural surface. But many examples are emerging where these features do not hold good. An understanding of inter-play between flexibility and sequence conservation is emerging, providing a fresh dimension to the paradigm of sequence-structure-function relationship. The functional manifestation of the inter-relation between sequence conservation and flexibility of interface is exemplified in this review using proteinase-inhibitor protein complexes.
Collapse
|
2
|
Kufareva I, Budagyan L, Raush E, Totrov M, Abagyan R. PIER: protein interface recognition for structural proteomics. Proteins 2007; 67:400-17. [PMID: 17299750 DOI: 10.1002/prot.21233] [Citation(s) in RCA: 97] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Recent advances in structural proteomics call for development of fast and reliable automatic methods for prediction of functional surfaces of proteins with known three-dimensional structure, including binding sites for known and unknown protein partners as well as oligomerization interfaces. Despite significant progress the problem is still far from being solved. Most existing methods rely, at least partially, on evolutionary information from multiple sequence alignments projected on protein surface. The common drawback of such methods is their limited applicability to the proteins with a sparse set of sequential homologs, as well as inability to detect interfaces in evolutionary variable regions. In this study, the authors developed an improved method for predicting interfaces from a single protein structure, which is based on local statistical properties of the protein surface derived at the level of atomic groups. The proposed Protein IntErface Recognition (PIER) method achieved the overall precision of 60% at the recall threshold of 50% at the residue level on a diverse benchmark of 490 homodimeric, 62 heterodimeric, and 196 transient interfaces (compared with 25% precision at 50% recall expected from random residue function assignment). For 70% of proteins in the benchmark, the binding patch residues were successfully detected with precision exceeding 50% at 50% recall. The calculation only took seconds for an average 300-residue protein. The authors demonstrated that adding the evolutionary conservation signal only marginally influenced the overall prediction performance on the benchmark; moreover, for certain classes of proteins, using this signal actually resulted in a deteriorated prediction. Thorough benchmarking using other datasets from literature showed that PIER yielded improved performance as compared with several alignment-free or alignment-dependent predictions. The accuracy, efficiency, and dependence on structure alone make PIER a suitable tool for automated high-throughput annotation of protein structures emerging from structural proteomics projects.
Collapse
Affiliation(s)
- Irina Kufareva
- Scripps Research Institute, La Jolla, California 92037, USA
| | | | | | | | | |
Collapse
|
3
|
van Gent D, Sharp P, Morgan K, Kalsheker N. Serpins: structure, function and molecular evolution. Int J Biochem Cell Biol 2003; 35:1536-47. [PMID: 12824063 DOI: 10.1016/s1357-2725(03)00134-1] [Citation(s) in RCA: 140] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The superfamily of serine proteinase inhibitors (serpins) are involved in a number of fundamental biological processes such as blood coagulation, complement activation, fibrinolysis, angiogenesis, inflammation and tumor suppression and are expressed in a cell-specific manner. The average protein size of a serpin family member is 350-400 amino acids, but gene structure varies in terms of number and size of exons and introns. Previous studies of all known serpins identified 16 clades and 10 orphan sequences. Vertebrate serpins can be conveniently classified into six sub-groups. We provide additional data that updates the phylogenetic analysis in the context of structural and functional properties of the proteins. From these, we can conclude that the functional classification of serpins relies on their protein structure and not on sequence similarity.
Collapse
Affiliation(s)
- Diana van Gent
- Division of Clinical Chemistry, Institute of Genetics, Queen's Medical Centre, University of Nottingham, NG7 2UH Nottingham, UK
| | | | | | | |
Collapse
|
4
|
Bradford JR, Westhead DR. Asymmetric mutation rates at enzyme-inhibitor interfaces: implications for the protein-protein docking problem. Protein Sci 2003; 12:2099-103. [PMID: 12931008 PMCID: PMC2324006 DOI: 10.1110/ps.0306303] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
We have carried out a thorough and systematic sequence-structure study on how the pattern of conservation at the interface differs from the noninteracting surface in seven proteases and their inhibitors. As expected, the interface of a protease could be easily distinguished from the noninteracting surface by a concentrated area of conservation. In contrast, there was less distinction to be made between the interface and the noninteracting surface of inhibitors, and in five of the seven cases, a higher proportion of the interface area was variable compared to the rest of the surface. This is likely to cause a problem for binding-site prediction methods that assume the largest cluster of highly conserved residues on the surface of a protein corresponds to the interface. We conclude that such methods would succeed when applied to our protease test cases, but complications could arise with the inhibitors. These results also impact on methods to solve the protein-protein docking problem that use conservation at the interface to provide the location of the two protein binding sites prior to application of the docking algorithm.
Collapse
Affiliation(s)
- James R Bradford
- School of Biochemistry and Molecular Biology, University of Leeds, Leeds, LS2 9JT, UK
| | | |
Collapse
|
5
|
Forsyth S, Horvath A, Coughlin P. A review and comparison of the murine alpha1-antitrypsin and alpha1-antichymotrypsin multigene clusters with the human clade A serpins. Genomics 2003; 81:336-45. [PMID: 12659817 DOI: 10.1016/s0888-7543(02)00041-1] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The major human plasma protease inhibitors, alpha(1)-antitrypsin and alpha(1)-antichymotrypsin, are each encoded by a single gene, whereas in the mouse they are represented by clusters of 5 and 14 genes, respectively. Although there is a high degree of overall sequence similarity within these groupings, the reactive-center loop (RCL) domain, which determines target protease specificity, is markedly divergent. The literature dealing with members of these mouse serine protease inhibitor (serpin) clusters has been complicated by inconsistent nomenclature. Furthermore, some investigators, unaware of the complexity of the family, have failed to distinguish between closely related genes when measuring expression levels or functional activity. We have reviewed the literature dealing with the mouse equivalents of human alpha(1)-antitrypsin and alpha(1)-antichymotrypsin and made use of the recently completed mouse genome sequence to propose a systematic nomenclature. We have also examined the extended mouse clade "a" serpin cluster at chromosome 12F1 and compared it with the syntenic region at human chromosome 14q32. In summarizing the literature and suggesting a standardized nomenclature, we aim to provide a logical structure on which future research may be based.
Collapse
Affiliation(s)
- Sharon Forsyth
- Department of Medicine, Monash University, Melbourne 3128, Australia
| | | | | |
Collapse
|
6
|
Barbour KW, Goodwin RL, Guillonneau F, Wang Y, Baumann H, Berger FG. Functional diversification during evolution of the murine alpha(1)-proteinase inhibitor family: role of the hypervariable reactive center loop. Mol Biol Evol 2002; 19:718-27. [PMID: 11961105 DOI: 10.1093/oxfordjournals.molbev.a004130] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Alpha(1)-proteinase inhibitor (alpha(1)-PI) is a member of the serpin superfamily of serine proteinase inhibitors that are involved in the regulation of a number of proteolytic processes. Alpha(1)-PI, like most serpins, functions by covalent binding to, and inhibition of, target proteinases. The interaction between alpha(1)-PI and its target is directed by the so-called reactive center loop (RCL), an approximately 20 residue domain that extends out from the body of the alpha(1)-PI polypeptide and determines the inhibitor's specificity. Mice express at least seven closely related alpha(1)-PI isoforms, encoded by a family of genes clustered at the Spi1 locus on chromosome 12. The amino acid sequence of the RCL region is hypervariable among alpha(1)-PIs, a phenomenon that has been attributed to high rates of evolution driven by positive Darwinian selection. This suggests that the various isoforms are functionally diverse. To test this notion, we have compared the proteinase specificities of individual alpha(1)-PIs from each of the two mouse species. As predicted from the positive Darwinian selection hypothesis, the various alpha(1)-PIs differ in their ability to form covalent complexes with serine proteinases, such as elastase, trypsin, chymotrypsin, and cathepsin G. In addition, they differ in their binding ability to proteinases in crude snake venoms. Importantly, the RCL region of the alpha(1)-PI polypeptide is the primary determinant of isoform-specific differences in proteinase recognition, indicating that hypervariability within this region drives the functional diversification of alpha(1)-PIs during evolution. The possible physiological benefits of alpha(1)-PI diversity are discussed.
Collapse
Affiliation(s)
- Karen W Barbour
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA
| | | | | | | | | | | |
Collapse
|
7
|
Yoshida K, Suzuki Y, Yamamoto K, Matsuura K, Watanabe M, Sinohara H. Molecular evolution in the hypervariable regions of fetuin: comparison between human and African green monkey fetuin. Biol Chem 2000; 381:773-6. [PMID: 11030436 DOI: 10.1515/bc.2000.099] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Sequences of fetuin cDNA and its deduced amino acid residues from the African green monkey cell line Vero were found to differ by 7.3% and 12.9%, respectively, from the corresponding human sequences. Most amino acid substitutions were clustered within a small segment of the third domain (D3). Calculations of nonsynonymous and synonymous nucleotide substitution rates suggest that this small segment was mutated under positive selection. cDNAs encoding alpha1-antitrypsin, beta-actin and the sequences of intron 4 of alpha1-antitrypsin gene in human liver and Vero cells were also investigated. The results substantiated the positive selection imposed on the D3 segment.
Collapse
Affiliation(s)
- K Yoshida
- Department of Biochemistry, Kinki University School of Medicine, Osaka-Sayama, Osaka, Japan
| | | | | | | | | | | |
Collapse
|
8
|
Takamatsu N, Kojima M, Taniyama M, Ohba K, Uematsu T, Segawa C, Tsutou S, Watanabe M, Kondo J, Kondo N, Shiba T. Expression of multiple alpha1-antitrypsin-like genes in hibernating species of the squirrel family. Gene X 1997; 204:127-32. [PMID: 9434174 DOI: 10.1016/s0378-1119(97)00532-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
In the chipmunk, a mammalian hibernator, a 140 kDa protein complex found in the blood, drastically decreases in concentration during hibernation. This complex contains four species of proteins, HP-20, -25, -27 and -55. In the present study, cDNA clones coding for the chipmunk HP-55 were isolated from a liver cDNA library. Sequence analysis revealed that HP-55 is produced as a precursor protein of 413 amino acids (aa), that it has a signal peptide of 24 aa, and that it contains four potential N-glycosylation sites. The deduced aa sequence shows 63% identity with that of rat alpha1-antitrypsin (alpha1-AT); however, the sequence corresponding to the reactive center P1-P1' residues was found to be Met-Leu, whereas it is Met-Ser in the rat alpha1-AT. During screening of the chipmunk liver cDNA library, four other related classes of cDNA clones were obtained, each also coding for an alpha1-AT-like protein. In spite of more than 86% overall aa sequence identity among the five chipmunk alpha1-AT-like proteins, they are highly divergent in the putative reactive center region; the putative P1-P1' sequences are Met-Leu (HP-55 or CM55-ML), Met-Met (CM55-MM), Met-Ser (CM55-MS), Ser-Ile (CM55-SI) and Ser-Thr (CM55-ST). Each of the alpha1-AT-like protein mRNAs was expressed in chipmunk liver, and the HP-55 mRNA level was greatly reduced during hibernation. Genomic Southern blot analysis and screening of a liver cDNA library from another hibernating squirrel species, the ground squirrel, also revealed expression of multiple members of the alpha1-AT gene family, whereas analysis of a cDNA library from a non-hibernating species, the tree squirrel, found only a single alpha1-AT gene.
Collapse
Affiliation(s)
- N Takamatsu
- Department of Biosciences, School of Science, Kitasato University, Kanagawa, Japan.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Rollini P, Fournier RE. A 370-kb cosmid contig of the serpin gene cluster on human chromosome 14q32.1: molecular linkage of the genes encoding alpha 1-antichymotrypsin, protein C inhibitor, kallistatin, alpha 1-antitrypsin, and corticosteroid-binding globulin. Genomics 1997; 46:409-15. [PMID: 9441744 DOI: 10.1006/geno.1997.5077] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The human genes encoding alpha 1-antitrypsin (alpha 1AT, gene symbol PI), corticosteroid-binding globulin (CBG), alpha 1-antichymotrypsin (AACT), and protein C inhibitor (PCI) are related by descent, and they all map to human chromosome 14q32.1. This serine protease inhibitor (serpin) gene cluster also contains an antitrypsin-related sequence (ATR, gene symbol PIL), but the precise molecular organization of this region has not been defined. In this report we describe the generation and characterization of an approximately 370-kb cosmid contig that includes all five serpin genes. Moreover, a newly described serpin, kallistatin (KAL, gene symbol PI4), was also mapped within the region. Gene order within this interval is cen-CBG-ATR-alpha 1 AT-KAL-PCI-AACT-tel. The genes occupy approximately 320 kb of genomic DNA, and they are organized into two discrete subclusters of three genes each that are separated by approximately 170 kb. The distal subcluster includes KAL, PCI, and AACT; it occupies approximately 63 kb of DNA, and all three genes are transcribed in a proximal-to-distal orientation. Within the subcluster, there is approximately 12 kb of intergenic DNA between KAL and PCI and approximately 19 kb between PCI and AACT. The proximal subcluster includes alpha 1AT, ATR, and CBG; it occupies approximately 90 kb of genomic DNA, with approximately 12 kb of DNA between alpha 1AT and ATR and approximately 40 kb between ATR and CBG. These genes are all transcribed in a distal-to-proximal orientation. This represents the first detailed physical map of the serpin gene cluster on 14q32.1.
Collapse
Affiliation(s)
- P Rollini
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109-1024, USA
| | | |
Collapse
|
10
|
Jiang H, Wang Y, Huang Y, Mulnix AB, Kadel J, Cole K, Kanost MR. Organization of serpin gene-1 from Manduca sexta. Evolution of a family of alternate exons encoding the reactive site loop. J Biol Chem 1996; 271:28017-23. [PMID: 8910411 DOI: 10.1074/jbc.271.45.28017] [Citation(s) in RCA: 70] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Manduca sexta serpin gene-1 encodes a family of serpins whose amino acid sequences are identical in their amino-terminal 336 residues but variable in their carboxyl-terminal 39-46 residues, which includes the reactive site loop (Jiang, H., Wang, Y., and Kanost, M. R. (1994) J. Biol. Chem. 269, 55-58). Here, we report the gene's complete nucleotide sequence and exon-intron structure. A unique characteristic of this gene is its exon 9, which is present in 12 alternate forms between exons 8 and 10. Isolation and characterization of cDNA clones containing exons 9C, 9H, and 9I, which were not found previously, indicate that all 12 alternate forms of exon 9 can be utilized to generate 12 different serpins. The splicing pathway apparently allows inclusion of only one exon 9 per molecule of mature serpin-1 mRNA. Analysis of exon-intron border sequences reveals unique features that may be involved in regulation of RNA splicing. The exon 9 region has apparently evolved through rounds of exon duplication and sequence divergence. The exons near the center of the region may have evolved recently, whereas the outermost exons are the most ancient. Exons 9G and 9H were duplicated as a pair from exons 9E and 9F, an event that may have occurred more than once in the history of this gene.
Collapse
Affiliation(s)
- H Jiang
- Department of Biochemistry, Kansas State University, Manhattan, Kansas 66506, USA.
| | | | | | | | | | | | | |
Collapse
|
11
|
Hirose S, Furukawa M, Tamechika I, Itakura M, Kato A, Suzuki Y, Kuroki J, Tachibana S. Discovery of a new type of proteinase inhibitor family whose members have an anchoring sequence. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 1996; 389:43-9. [PMID: 8860992 DOI: 10.1007/978-1-4613-0335-0_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Affiliation(s)
- S Hirose
- Department of Biological Sciences, Tokyo Institute of Technology, Yokohama, Japan
| | | | | | | | | | | | | | | |
Collapse
|
12
|
Ohta T. Synonymous and nonsynonymous substitutions in mammalian genes and the nearly neutral theory. J Mol Evol 1995; 40:56-63. [PMID: 7714912 DOI: 10.1007/bf00166595] [Citation(s) in RCA: 180] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
The nearly neutral theory of molecular evolution predicts larger generation-time effects for synonymous than for nonsynonymous substitutions. This prediction is tested using the sequences of 49 single-copy genes by calculating the average and variance of synonymous and nonsynonymous substitutions in mammalian star phylogenies (rodentia, artiodactyla, and primates). The average pattern of the 49 genes supports the prediction of the nearly neutral theory, with some notable exceptions. The nearly neutral theory also predicts that the variance of the evolutionary rate is larger than the value predicted by the completely neutral theory. This prediction is tested by examining the dispersion index (ratio of the variance to the mean), which is positively correlated with the average substitution number. After weighting by the lineage effects, this correlation almost disappears for nonsynonymous substitutions, but not quite so for synonymous substitutions. After weighting, the dispersion indices of both synonymous and nonsynonymous substitutions still exceed values expected under the simple Poisson process. The results indicate that both the systematic bias in evolutionary rate among the lineages and the episodic type of rate variation are contributing to the large variance. The former is more significant to synonymous substitutions than to nonsynonymous substitutions. Isochore evolution may be similar to synonymous substitutions. The rate and pattern found here are consistent with the nearly neutral theory, such that the relative contributions of drift and selection differ between the two types of substitutions. The results are also consistent with Gillespie's episodic selection theory.
Collapse
Affiliation(s)
- T Ohta
- National Institute of Genetics, Mishima, Japan
| |
Collapse
|