1
|
Hernández Berthet AS, Aptekmann AA, Tejero J, Sánchez IE, Noguera ME, Roman EA. Associating protein sequence positions with the modulation of quantitative phenotypes. Arch Biochem Biophys 2024; 755:109979. [PMID: 38583654 DOI: 10.1016/j.abb.2024.109979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 03/11/2024] [Accepted: 03/27/2024] [Indexed: 04/09/2024]
Abstract
Although protein sequences encode the information for folding and function, understanding their link is not an easy task. Unluckily, the prediction of how specific amino acids contribute to these features is still considerably impaired. Here, we developed a simple algorithm that finds positions in a protein sequence with potential to modulate the studied quantitative phenotypes. From a few hundred protein sequences, we perform multiple sequence alignments, obtain the per-position pairwise differences for both the sequence and the observed phenotypes, and calculate the correlation between these last two quantities. We tested our methodology with four cases: archaeal Adenylate Kinases and the organisms optimal growth temperatures, microbial rhodopsins and their maximal absorption wavelengths, mammalian myoglobins and their muscular concentration, and inhibition of HIV protease clinical isolates by two different molecules. We found from 3 to 10 positions tightly associated with those phenotypes, depending on the studied case. We showed that these correlations appear using individual positions but an improvement is achieved when the most correlated positions are jointly analyzed. Noteworthy, we performed phenotype predictions using a simple linear model that links per-position divergences and differences in the observed phenotypes. Predictions are comparable to the state-of-art methodologies which, in most of the cases, are far more complex. All of the calculations are obtained at a very low information cost since the only input needed is a multiple sequence alignment of protein sequences with their associated quantitative phenotypes. The diversity of the explored systems makes our work a valuable tool to find sequence determinants of biological activity modulation and to predict various functional features for uncharacterized members of a protein family.
Collapse
Affiliation(s)
- Ayelén S Hernández Berthet
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Intendente Güiraldes 2160 - Ciudad Universitaria, 1428EGA, C.A.B.A., Argentina.
| | - Ariel A Aptekmann
- Universidad de Buenos Aires, Consejo Nacional de Investigaciones Científicas y Técnicas. Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Buenos Aires, Argentina; Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ, 08873, USA; Institute of Marine and Coastal Sciences, Rutgers University, New Brunswick, NJ, 08901, USA.
| | - Jesús Tejero
- Heart, Lung, Blood and Vascular Medicine Institute, University of Pittsburgh, Pittsburgh, PA, 15261, USA; Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, PA, 15261, USA; Department of Bioengineering, Swanson School of Engineering, University of Pittsburgh, Pittsburgh, PA, 15260, USA; Department of Pharmacology and Chemical Biology, University of Pittsburgh, Pittsburgh, PA, 15261, USA.
| | - Ignacio E Sánchez
- Universidad de Buenos Aires, Consejo Nacional de Investigaciones Científicas y Técnicas. Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Buenos Aires, Argentina.
| | - Martín E Noguera
- Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química y Fisicoquímica Biológicas Dr. Alejandro Paladini, Junín 956, 1113AAD, C.A.B.A., Argentina; Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Roque Saenz Peña 352, B1876BXD, Bernal, Argentina.
| | - Ernesto A Roman
- Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Intendente Güiraldes 2160 - Ciudad Universitaria, 1428EGA, C.A.B.A., Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química y Fisicoquímica Biológicas Dr. Alejandro Paladini, Junín 956, 1113AAD, C.A.B.A., Argentina.
| |
Collapse
|
2
|
Identification and structural characterization of deleterious non-synonymous single nucleotide polymorphisms in the human SKP2 gene. Comput Biol Chem 2019; 79:127-136. [PMID: 30802828 DOI: 10.1016/j.compbiolchem.2019.02.003] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2018] [Revised: 01/27/2019] [Accepted: 02/13/2019] [Indexed: 12/17/2022]
Abstract
In SCF (Skp, Cullin, F-box) ubiquitin-protein ligase complexes, S-phase kinase 2 (SKP2) is one of the major players of F-box family, that is responsible for the degradation of several important cell regulators and tumor suppressor proteins. Despite of having significant evidence for the role of SKP2 on tumorgenesis, there is a lack of available data regarding the effect of non-synonymous polymorphisms. In this communication, the structural and functional consequences of non-synonymous single nucleotide polymorphisms (nsSNPs) of SKP2 have been reported by employing various computational approaches and molecular dynamics simulation. Initially, several computational tools like SIFT, PolyPhen-2, PredictSNP, I-Mutant 2.0 and ConSurf have been implicated in this study to explore the damaging SNPs. In total of 172 nsSNPs, 5 nsSNPs were identified as deleterious and 3 of them were predicted to be decreased the stability of protein. Guided from ConSurf analysis, P101L (rs761253702) and Y346C (rs755010517) were categorized as the highly conserved and functional disrupting mutations. Therefore, these mutations were subjected to three dimensional model building and molecular dynamics simulation study for the detailed structural consequences upon the mutations. The study revealed that P101L and Y346C mutations increased the flexibility and changed the structural dynamics. As both these mutations are located in the most functional regions of SKP2 protein, these computational insights might be helpful to consider these nsSNPs for wet-lab confirmatory analysis as well as in rationalizing future population based studies and structure based drug design against SKP2.
Collapse
|
3
|
Mechanical variations in proteins with large-scale motions highlight the formation of structural locks. J Struct Biol 2018; 203:195-204. [DOI: 10.1016/j.jsb.2018.05.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Revised: 05/18/2018] [Accepted: 05/22/2018] [Indexed: 12/18/2022]
|
4
|
Cossins BP, Lawson ADG, Shi J. Computational Exploration of Conformational Transitions in Protein Drug Targets. Methods Mol Biol 2018; 1762:339-365. [PMID: 29594780 DOI: 10.1007/978-1-4939-7756-7_17] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/07/2022]
Abstract
Protein drug targets vary from highly structured to completely disordered; either way dynamics governs function. Hence, understanding the dynamical aspects of how protein targets function can enable improved interventions with drug molecules. Computational approaches offer highly detailed structural models of protein dynamics which are becoming more predictive as model quality and sampling power improve. However, the most advanced and popular models still have errors owing to imperfect parameter sets and often cannot access longer timescales of many crucial biological processes. Experimental approaches offer more certainty but can struggle to detect and measure lightly populated conformations of target proteins and subtle allostery. An emerging solution is to integrate available experimental data into advanced molecular simulations. In the future, molecular simulation in combination with experimental data may be able to offer detailed models of important drug targets such that improved functional mechanisms or selectivity can be accessed.
Collapse
Affiliation(s)
- Benjamin P Cossins
- Computer-Aided Drug Design and Structural Biology, UCB Pharma, Slough, UK.
| | | | - Jiye Shi
- Computer-Aided Drug Design and Structural Biology, UCB Pharma, Slough, UK
| |
Collapse
|
5
|
Armenta-Medina D, Perez-Rueda E. Hybrid approaches for the detection of networks of critical residues involved in functional motions in protein families. BMC Bioinformatics 2015. [PMCID: PMC4340141 DOI: 10.1186/1471-2105-16-s3-a8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
6
|
Tiberti M, Invernizzi G, Papaleo E. (Dis)similarity Index To Compare Correlated Motions in Molecular Simulations. J Chem Theory Comput 2015; 11:4404-14. [PMID: 26575932 DOI: 10.1021/acs.jctc.5b00512] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Molecular dynamics (MD) simulations are widely used to complement or guide experimental studies in the characterization of protein dynamics, thanks to improvements in force-field accuracy, along with in the software and hardware to sample the conformational landscape of proteins. Among the different applications of MD simulations, the study of correlated motions is largely employed for different purposes. Several metrics have been developed to describe correlated motions in the MD ensemble, such as methods based on Pearson Correlation or Mutual Information. Cross-correlation analysis of MD trajectories is indeed appealing not only to identify residues characterized by coupled fluctuations in protein structures but also since it can be used to extrapolate motions along directions in which major conformational changes should occur, for example on longer time scales than the ones that are actually simulated. Nevertheless, most of the MD studies employ average correlation maps and mostly in a qualitative way, even when different systems or different replicates of the same system are compared. The broad application of correlation metrics in the analysis of MD simulations, especially for comparative purposes, requires a step forward toward more quantitative and accurate comparisons. We thus here employed a simple but effective index, which is based on a normalized Frobenius norm of the differences between protein correlation maps, to compare correlated motions. We applied this index for a quantitative comparison of correlated motions from MD simulations of seven proteins of different size and fold. We also employed the index to assess the robustness of correlation description when multi-replicate MD simulations of a same system are used, and we compared our index to metrics for comparison of structural ensembles such as Root Mean Square Inner Product and the Bhattacharyya Coefficient.
Collapse
Affiliation(s)
- Matteo Tiberti
- Department of Biotechnology and Biosciences, University of Milano-Bicocca , Piazza della Scienza 2, 20126 Milan, Italy
| | - Gaetano Invernizzi
- Department of Biotechnology and Biosciences, University of Milano-Bicocca , Piazza della Scienza 2, 20126 Milan, Italy
| | - Elena Papaleo
- Department of Biotechnology and Biosciences, University of Milano-Bicocca , Piazza della Scienza 2, 20126 Milan, Italy
| |
Collapse
|
7
|
Ortegon P, Poot-Hernández AC, Perez-Rueda E, Rodriguez-Vazquez K. Comparison of Metabolic Pathways in Escherichia coli by Using Genetic Algorithms. Comput Struct Biotechnol J 2015; 13:277-85. [PMID: 25973143 PMCID: PMC4423528 DOI: 10.1016/j.csbj.2015.04.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2015] [Revised: 03/31/2015] [Accepted: 04/01/2015] [Indexed: 11/21/2022] Open
Abstract
In order to understand how cellular metabolism has taken its modern form, the conservation and variations between metabolic pathways were evaluated by using a genetic algorithm (GA). The GA approach considered information on the complete metabolism of the bacterium Escherichia coli K-12, as deposited in the KEGG database, and the enzymes belonging to a particular pathway were transformed into enzymatic step sequences by using the breadth-first search algorithm. These sequences represent contiguous enzymes linked to each other, based on their catalytic activities as they are encoded in the Enzyme Commission numbers. In a posterior step, these sequences were compared using a GA in an all-against-all (pairwise comparisons) approach. Individual reactions were chosen based on their measure of fitness to act as parents of offspring, which constitute the new generation. The sequences compared were used to construct a similarity matrix (of fitness values) that was then considered to be clustered by using a k-medoids algorithm. A total of 34 clusters of conserved reactions were obtained, and their sequences were finally aligned with a multiple-sequence alignment GA optimized to align all the reaction sequences included in each group or cluster. From these comparisons, maps associated with the metabolism of similar compounds also contained similar enzymatic step sequences, reinforcing the Patchwork Model for the evolution of metabolism in E. coli K-12, an observation that can be expanded to other organisms, for which there is metabolism information. Finally, our mapping of these reactions is discussed, with illustrations from a particular case.
Collapse
Affiliation(s)
- Patricia Ortegon
- Departamento de Ingeniería de Sistemas Computacionales y Automatización, IIMAS, Universidad Nacional Autónoma de México, Mexico
| | - Augusto C. Poot-Hernández
- Departamento de Ingeniería de Sistemas Computacionales y Automatización, IIMAS, Universidad Nacional Autónoma de México, Mexico
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| | - Ernesto Perez-Rueda
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- Unidad Multidisciplinaria de Docencia e Investigación, Sisal Facultad de Ciencias, Sisal, Yucatán, UNAM, Mexico
- Correspondence to: E. Perez-Rueda, Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico.
| | - Katya Rodriguez-Vazquez
- Departamento de Ingeniería de Sistemas Computacionales y Automatización, IIMAS, Universidad Nacional Autónoma de México, Mexico
- Correspondence to: K. Rodriguez-Vazquez, Departamento de Ingeniería de Sistemas Computacionales y Automatización, IIMAS, Universidad Nacional Autónoma de México, Mexico.
| |
Collapse
|
8
|
Papaleo E, Parravicini F, Grandori R, De Gioia L, Brocca S. Structural investigation of the cold-adapted acylaminoacyl peptidase from Sporosarcina psychrophila by atomistic simulations and biophysical methods. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2014; 1844:2203-13. [DOI: 10.1016/j.bbapap.2014.09.018] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2014] [Revised: 09/19/2014] [Accepted: 09/23/2014] [Indexed: 01/07/2023]
|
9
|
|
10
|
Feher VA, Durrant JD, Van Wart AT, Amaro RE. Computational approaches to mapping allosteric pathways. Curr Opin Struct Biol 2014; 25:98-103. [PMID: 24667124 DOI: 10.1016/j.sbi.2014.02.004] [Citation(s) in RCA: 101] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2014] [Revised: 02/20/2014] [Accepted: 02/24/2014] [Indexed: 01/17/2023]
Abstract
Allosteric signaling occurs when chemical and/or physical changes at an allosteric site alter the activity of a primary orthosteric site often many Ångströms distant. A number of recently developed computational techniques, including dynamical network analysis, novel topological and molecular dynamics methods, and hybrids of these methods, are useful for elucidating allosteric signaling pathways at the atomistic level. No single method prevails as best to identify allosteric signal propagation path(s), rather each has particular strengths in characterizing signals that occur over specific timescale ranges and magnitudes of conformational fluctuation. With continued improvement in accuracy and predictive power, these computational techniques aim to become useful drug discovery tools that will allow researchers to identify allostery critical residues for subsequent pharmacological targeting.
Collapse
Affiliation(s)
- Victoria A Feher
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA, USA
| | - Jacob D Durrant
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA, USA
| | - Adam T Van Wart
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA, USA
| | - Rommie E Amaro
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
11
|
Papaleo E, Renzetti G. Coupled motions during dynamics reveal a tunnel toward the active site regulated by the N-terminal α-helix in an acylaminoacyl peptidase. J Mol Graph Model 2012; 38:226-34. [PMID: 23085164 DOI: 10.1016/j.jmgm.2012.06.014] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2012] [Revised: 06/15/2012] [Accepted: 06/26/2012] [Indexed: 10/28/2022]
Abstract
Acylaminoacyl peptidase (AAP) subfamily belongs to the prolyl oligopeptidase (POP) family of serine-proteases. There is a great interest in the definition of molecular mechanisms related to the activity and substrate recognition of these complex multi-domain enzymes. The active site relies at the interface between the C-terminal catalytic domain and the β-propeller domain, whose N-terminal region acts as a bridge to the hydrolase domain. In AAP, the N-terminal extension is characterized by a structurally conserved α1-helix, which is known to affect thermal stability and thermal dependence of the catalytic activity. In the present contribution, results from hundreds nanosecond all-atom molecular dynamics simulations, along with analyses of the networks of cross-correlated motions of a member of the AAP subfamily are discussed. The MD investigation identifies a tunnel that from the surrounding of the N-terminal α1-helix bring to the catalytic site. This cavity seems to be regulated by conformational changes of the α1-helix itself during the dynamics. The evidence here provided can be a useful guide for a better understanding of the mechanistic aspects related to AAP activity, but also for drug design purposes.
Collapse
Affiliation(s)
- Elena Papaleo
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, P.zza della Scienza 2, 20126 Milan, Italy.
| | | |
Collapse
|
12
|
Pasi M, Tiberti M, Arrigoni A, Papaleo E. xPyder: a PyMOL plugin to analyze coupled residues and their networks in protein structures. J Chem Inf Model 2012; 52:1865-74. [PMID: 22721491 DOI: 10.1021/ci300213c] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
A versatile method to directly identify and analyze short- or long-range coupled or communicating residues in a protein conformational ensemble is of extreme relevance to achieve a complete understanding of protein dynamics and structural communication routes. Here, we present xPyder, an interface between one of the most employed molecular graphics systems, PyMOL, and the analysis of dynamical cross-correlation matrices (DCCM). The approach can also be extended, in principle, to matrices including other indexes of communication propensity or intensity between protein residues, as well as the persistence of intra- or intermolecular interactions, such as those underlying protein dynamics. The xPyder plugin for PyMOL 1.4 and 1.5 is offered as Open Source software via the GPL v2 license, and it can be found, along with the installation package, the user guide, and examples, at http://linux.btbs.unimib.it/xpyder/.
Collapse
Affiliation(s)
- Marco Pasi
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, P.zza della Scienza 2, 20126 Milan, Italy
| | | | | | | |
Collapse
|
13
|
Papaleo E, Renzetti G, Tiberti M. Mechanisms of intramolecular communication in a hyperthermophilic acylaminoacyl peptidase: a molecular dynamics investigation. PLoS One 2012; 7:e35686. [PMID: 22558199 PMCID: PMC3338720 DOI: 10.1371/journal.pone.0035686] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2011] [Accepted: 03/21/2012] [Indexed: 11/25/2022] Open
Abstract
Protein dynamics and the underlying networks of intramolecular interactions and communicating residues within the three-dimensional (3D) structure are known to influence protein function and stability, as well as to modulate conformational changes and allostery. Acylaminoacyl peptidase (AAP) subfamily of enzymes belongs to a unique class of serine proteases, the prolyl oligopeptidase (POP) family, which has not been thoroughly investigated yet. POPs have a characteristic multidomain three-dimensional architecture with the active site at the interface of the C-terminal catalytic domain and a β-propeller domain, whose N-terminal region acts as a bridge to the hydrolase domain. In the present contribution, protein dynamics signatures of a hyperthermophilic acylaminoacyl peptidase (AAP) of the prolyl oligopeptidase (POP) family, as well as of a deletion variant and alanine mutants (I12A, V13A, V16A, L19A, I20A) are reported. In particular, we aimed at identifying crucial residues for long range communications to the catalytic site or promoting the conformational changes to switch from closed to open ApAAP conformations. Our investigation shows that the N-terminal α1-helix mediates structural intramolecular communication to the catalytic site, concurring to the maintenance of a proper functional architecture of the catalytic triad. Main determinants of the effects induced by α1-helix are a subset of hydrophobic residues (V16, L19 and I20). Moreover, a subset of residues characterized by relevant interaction networks or coupled motions have been identified, which are likely to modulate the conformational properties at the interdomain interface.
Collapse
Affiliation(s)
- Elena Papaleo
- Department of Biotechnology and Biosciences, University of Milano-Bicocca, Milan, Italy.
| | | | | |
Collapse
|
14
|
Delalande O, Sacquin-Mora S, Baaden M. Enzyme closure and nucleotide binding structurally lock guanylate kinase. Biophys J 2011; 101:1440-9. [PMID: 21943425 DOI: 10.1016/j.bpj.2011.07.048] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2011] [Revised: 07/15/2011] [Accepted: 07/28/2011] [Indexed: 02/02/2023] Open
Abstract
We investigate the conformational dynamics and mechanical properties of guanylate kinase (GK) using a multiscale approach combining high-resolution atomistic molecular dynamics and low-resolution Brownian dynamics simulations. The GK enzyme is subject to large conformational changes, leading from an open to a closed form, which are further influenced by the presence of nucleotides. As suggested by recent work on simple coarse-grained models of apo-GK, we primarily focus on GK's closure mechanism with the aim to establish a detailed picture of the hierarchy and chronology of structural events essential for the enzymatic reaction. We have investigated open-versus-closed, apo-versus-holo, and substrate-versus-product-loaded forms of the GK enzyme. Bound ligands significantly modulate the mechanical and dynamical properties of GK and rigidity profiles of open and closed states hint at functionally important differences. Our data emphasizes the role of magnesium, highlights a water channel permitting active site hydration, and reveals a structural lock that stabilizes the closed form of the enzyme.
Collapse
Affiliation(s)
- Olivier Delalande
- Institut de Biologie Physico-Chimique, Laboratoire de Biochimie Théorique, Centre National de la Recherche Scientifique, UPR9080, Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | | | | |
Collapse
|