1
|
Spirov AV, Myasnikova EM. Problem of Domain/Building Block Preservation in the Evolution of Biological Macromolecules and Evolutionary Computation. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:1345-1362. [PMID: 35594219 DOI: 10.1109/tcbb.2022.3175908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Structurally and functionally isolated domains in biological macromolecular evolution, both natural and artificial, are largely similar to "schemata", building blocks (BBs), in evolutionary computation (EC). The problem of preserving in subsequent evolutionary searches the already found domains / BBs is well known and quite relevant in biology as well as in EC. Both biology and EC are seeing parallel and independent development of several approaches to identifying and preserving previously identified domains / BBs. First, we notice the similarity of DNA shuffling methods in synthetic biology and multi-parent recombination algorithms in EC. Furthermore, approaches to computer identification of domains in proteins that are being developed in biology can be aligned with BB identification methods in EC. Finally, approaches to chimeric protein libraries optimization in biology can be compared to evolutionary search methods based on probabilistic models in EC. We propose to validate the prospects of mutual exchange of ideas and transfer of algorithms and approaches between evolutionary systems biology and EC in these three principal directions. A crucial aim of this transfer is the design of new advanced experimental techniques capable of solving more complex problems of in vitro evolution.
Collapse
|
2
|
Clouthier CM, Morin S, Gobeil SMC, Doucet N, Blanchet J, Nguyen E, Gagné SM, Pelletier JN. Chimeric β-lactamases: global conservation of parental function and fast time-scale dynamics with increased slow motions. PLoS One 2012; 7:e52283. [PMID: 23284969 PMCID: PMC3528772 DOI: 10.1371/journal.pone.0052283] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2012] [Accepted: 11/15/2012] [Indexed: 11/18/2022] Open
Abstract
Enzyme engineering has been facilitated by recombination of close homologues, followed by functional screening. In one such effort, chimeras of two class-A β-lactamases – TEM-1 and PSE-4 – were created according to structure-guided protein recombination and selected for their capacity to promote bacterial proliferation in the presence of ampicillin (Voigt et al., Nat. Struct. Biol. 2002 9:553). To provide a more detailed assessment of the effects of protein recombination on the structure and function of the resulting chimeric enzymes, we characterized a series of functional TEM-1/PSE-4 chimeras possessing between 17 and 92 substitutions relative to TEM-1 β-lactamase. Circular dichroism and thermal scanning fluorimetry revealed that the chimeras were generally well folded. Despite harbouring important sequence variation relative to either of the two ‘parental’ β-lactamases, the chimeric β-lactamases displayed substrate recognition spectra and reactivity similar to their most closely-related parent. To gain further insight into the changes induced by chimerization, the chimera with 17 substitutions was investigated by NMR spin relaxation. While high order was conserved on the ps-ns timescale, a hallmark of class A β-lactamases, evidence of additional slow motions on the µs-ms timescale was extracted from model-free calculations. This is consistent with the greater number of resonances that could not be assigned in this chimera relative to the parental β-lactamases, and is consistent with this well-folded and functional chimeric β-lactamase displaying increased slow time-scale motions.
Collapse
Affiliation(s)
- Christopher M. Clouthier
- PROTEO, the Québec Network for Research on Protein Structure, Function and Engineering, Université Laval, Laval, Québec, Canada
- Département de Chimie, Université de Montréal, Montréal, Québec, Canada
| | - Sébastien Morin
- PROTEO, the Québec Network for Research on Protein Structure, Function and Engineering, Université Laval, Laval, Québec, Canada
- Département de Biochimie, Microbiologie et Bioinformatique, Université Laval, Laval Québec, Canada
| | - Sophie M. C. Gobeil
- PROTEO, the Québec Network for Research on Protein Structure, Function and Engineering, Université Laval, Laval, Québec, Canada
- Département de Biochimie, Université de Montréal, Montréal, Québec, Canada
| | - Nicolas Doucet
- PROTEO, the Québec Network for Research on Protein Structure, Function and Engineering, Université Laval, Laval, Québec, Canada
- INRS–Institut Armand-Frappier, Université du Québec, Laval, Québec, Canada
| | - Jonathan Blanchet
- PROTEO, the Québec Network for Research on Protein Structure, Function and Engineering, Université Laval, Laval, Québec, Canada
- Département de Chimie, Université de Montréal, Montréal, Québec, Canada
| | - Elisabeth Nguyen
- PROTEO, the Québec Network for Research on Protein Structure, Function and Engineering, Université Laval, Laval, Québec, Canada
- Département de Chimie, Université de Montréal, Montréal, Québec, Canada
| | - Stéphane M. Gagné
- PROTEO, the Québec Network for Research on Protein Structure, Function and Engineering, Université Laval, Laval, Québec, Canada
- Département de Biochimie, Microbiologie et Bioinformatique, Université Laval, Laval Québec, Canada
| | - Joelle N. Pelletier
- PROTEO, the Québec Network for Research on Protein Structure, Function and Engineering, Université Laval, Laval, Québec, Canada
- Département de Chimie, Université de Montréal, Montréal, Québec, Canada
- Département de Biochimie, Université de Montréal, Montréal, Québec, Canada
- * E-mail:
| |
Collapse
|
3
|
Romero PA, Arnold FH. Random field model reveals structure of the protein recombinational landscape. PLoS Comput Biol 2012; 8:e1002713. [PMID: 23055915 PMCID: PMC3464211 DOI: 10.1371/journal.pcbi.1002713] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2012] [Accepted: 08/03/2012] [Indexed: 11/28/2022] Open
Abstract
We are interested in how intragenic recombination contributes to the evolution of proteins and how this mechanism complements and enhances the diversity generated by random mutation. Experiments have revealed that proteins are highly tolerant to recombination with homologous sequences (mutation by recombination is conservative); more surprisingly, they have also shown that homologous sequence fragments make largely additive contributions to biophysical properties such as stability. Here, we develop a random field model to describe the statistical features of the subset of protein space accessible by recombination, which we refer to as the recombinational landscape. This model shows quantitative agreement with experimental results compiled from eight libraries of proteins that were generated by recombining gene fragments from homologous proteins. The model reveals a recombinational landscape that is highly enriched in functional sequences, with properties dominated by a large-scale additive structure. It also quantifies the relative contributions of parent sequence identity, crossover locations, and protein fold to the tolerance of proteins to recombination. Intragenic recombination explores a unique subset of sequence space that promotes rapid molecular diversification and functional adaptation. Mutation and recombination are the primary sources of genetic variation in evolving populations. The relative benefit of these two diversification mechanisms and how they complement each other has been a long-standing question in evolutionary biology. While it is clear what types of genetic diversity these two mechanisms can create, a significant challenge is relating these sequence changes to changes in fitness. The fitness landscape, which describes this mapping from genotype to phenotype, is extraordinarily complex and defined over an incomprehensibly large space of sequences. Here, we develop a model of the landscape that relies not on the details of this mapping, but rather on the statistical relationships between sequences. By studying the expected values of landscape properties, we can gain insights into the structure of the landscape that are independent of the details of how genotype dictates phenotype. We use this random field model to understand how recombination explores a functionally enriched and diverse subset of protein sequence space.
Collapse
Affiliation(s)
| | - Frances H. Arnold
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California, United States of America
- * E-mail:
| |
Collapse
|
4
|
Morleo A, Bonomi F, Iametti S, Huang VW, Kurtz DM. Iron-nucleated folding of a metalloprotein in high urea: resolution of metal binding and protein folding events. Biochemistry 2010; 49:6627-34. [PMID: 20614892 DOI: 10.1021/bi100630t] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Addition of iron salts to chaotrope-denatured aporubredoxin (apoRd) leads to nearly quantitative recovery of its single Fe(SCys)(4) site and native protein structure without significant dilution of the chaotrope. This "high-chaotrope" approach was used to examine iron binding and protein folding events using stopped-flow UV-vis absorption and CD spectroscopies. With a 100-fold molar excess of ferrous iron over denatured apoRd maintained in 5 M urea, the folded holoFe(III)Rd structure was recovered in >90% yield with a t(1/2) of <10 ms. More modest excesses of iron also gave nearly quantitative holoRd formation in 5 M urea but with chronological resolution of iron binding and protein folding events. The results indicate structural recovery in 5 M urea consists of the minimal sequence: (1) binding of ferrous iron to the unfolded apoRd, (2) rapid formation of a near-native ferrous Fe(SCys)(4) site within a protein having no detectable secondary structure, and (3) recovery of the ferrous Fe(SCys)(4) site chiral environment nearly concomitantly with (4) recovery of the native protein secondary structure. The rate of step 2 (and, by inference, step 1) was not saturated even at a 100-fold molar excess of iron. Analogous results obtained for Cys --> Ser iron ligand variants support formation of an unfolded-Fe(SCys)(3) complex between steps 1 and 2, which we propose is the key nucleation event that pulls together distal regions of the protein chain. These results show that folding of chaotrope-denatured apoRd is iron-nucleated and driven by extraordinarily rapid formation of the Fe(SCys)(4) site from an essentially random coil apoprotein. This high-chaotrope, multispectroscopy approach could clarify folding pathways of other [M(SCys)(3)]- or [M(SCys)(4)]-containing proteins.
Collapse
Affiliation(s)
- Anna Morleo
- DISMA, University of Milan, Via G. Celoria 2, 20133 Milan, Italy
| | | | | | | | | |
Collapse
|
5
|
NMR analysis of native-state protein conformational flexibility by hydrogen exchange. Methods Mol Biol 2009; 490:285-310. [PMID: 19157088 DOI: 10.1007/978-1-59745-367-7_12] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/21/2023]
Abstract
The rate of hydrogen exchange for the most protected amides of a protein is widely used to provide an estimate of global conformational stability by analyzing the exchange kinetics in the unfolded state in terms of model peptide exchange rates. The exchange behavior of the other amides of the protein which do not exchange via a global unfolding mechanism can provide insight into the smaller-scale conformational transitions that facilitate access to solvent as required for the exchange reaction. However, since the residual tertiary structure in the exchange-competent conformation can modulate the chemistry of the exchange reaction, equilibrium values estimated from normalization with model peptide rates are open to question. To overcome this limitation, the most robust approaches utilize differential analyses as a function of experimental variables such as denaturant concentration, temperature, pH, and mutational variation. Practical aspects of these various differential analysis techniques are considered with illustrations drawn from the literature.
Collapse
|
6
|
Anderson JS, Hernández G, Lemaster DM. A billion-fold range in acidity for the solvent-exposed amides of Pyrococcus furiosus rubredoxin. Biochemistry 2008; 47:6178-88. [PMID: 18479148 DOI: 10.1021/bi800284y] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The exchange rates of the static solvent-accessible amide hydrogens of Pyrococcus furiosus rubredoxin range from near the diffusion-limited rate to a billion-fold slower for the non-hydrogen-bonded Val 38 (eubacterial numbering). Hydrogen exchange directly monitors the kinetic acidity of the peptide nitrogen. Electrostatic solvation free energies were calculated by Poisson-Boltzmann methods for the individual peptide anions that form during the hydroxide-catalyzed exchange reaction to examine how well the predicted thermodynamic acidities match the experimentally determined kinetic acidities. With the exception of the Ile 12 amide, the differential exchange rate constant for each solvent-exposed amide proton that is not hydrogen bonded to a backbone carbonyl can be predicted within a factor of 6 (10 (0.78)) root-mean-square deviation (rmsd) using the CHARMM22 electrostatic parameter set and an internal dielectric value of 3. Under equivalent conditions, the PARSE parameter set yields a larger rmsd value of 1.28 pH units, while the AMBER parm99 parameter set resulted in a considerably poorer correlation. Either increasing the internal dielectric value to 4 or reducing it to a value of 2 significantly degrades the quality of the prediction. Assigning the excess charge of the peptide anion equally between the peptide nitrogen and the carbonyl oxygen also reduces the correlation to the experimental data. These continuum electrostatic calculations were further analyzed to characterize the specific structural elements that appear to be responsible for the wide range of peptide acidities observed for these solvent-exposed amides. The striking heterogeneity in the potential at sites along the protein-solvent interface should prove germane to the ongoing challenge of quantifying the contribution that electrostatic interactions make to the catalytic acceleration achieved by enzymes.
Collapse
Affiliation(s)
- Janet S Anderson
- Department of Chemistry, Union College, Schenectady, New York 12308, USA.
| | | | | |
Collapse
|
7
|
LeMaster DM, Anderson JS, Wang L, Guo Y, Li H, Hernández G. NMR and X-ray analysis of structural additivity in metal binding site-swapped hybrids of rubredoxin. BMC STRUCTURAL BIOLOGY 2007; 7:81. [PMID: 18053245 PMCID: PMC2249605 DOI: 10.1186/1472-6807-7-81] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/08/2007] [Accepted: 12/05/2007] [Indexed: 12/03/2022]
Abstract
Background Chimeric hybrids derived from the rubredoxins of Pyrococcus furiosus (Pf) and Clostridium pasteurianum (Cp) provide a robust system for the characterization of protein conformational stability and dynamics in a differential mode. Interchange of the seven nonconserved residues of the metal binding site between the Pf and Cp rubredoxins yields a complementary pair of hybrids, for which the sum of the thermodynamic stabilities is equal to the sum for the parental proteins. Furthermore, the increase in amide hydrogen exchange rates for the hyperthermophile-derived metal binding site hybrid is faithfully mirrored by a corresponding decrease for the complementary hybrid that is derived from the less thermostable rubredoxin, indicating a degree of additivity in the conformational fluctuations that underlie these exchange reactions. Results Initial NMR studies indicated that the structures of the two complementary hybrids closely resemble "cut-and-paste" models derived from the parental Pf and Cp rubredoxins. This protein system offers a robust opportunity to characterize differences in solution structure, permitting the quantitative NMR chemical shift and NOE peak intensity data to be analyzed without recourse to the conventional conversion of experimental NOE peak intensities into distance restraints. The intensities for 1573 of the 1652 well-resolved NOE crosspeaks from the hybrid rubredoxins were statistically indistinguishable from the intensities of the corresponding parental crosspeaks, to within the baseplane noise level of these high sensitivity data sets. The differences in intensity for the remaining 79 NOE crosspeaks were directly ascribable to localized dynamical processes. Subsequent X-ray analysis of the metal binding site-swapped hybrids, to resolution limits of 0.79 Å and 1.04 Å, demonstrated that the backbone and sidechain heavy atoms in the NMR-derived structures lie within the range of structural variability exhibited among the individual molecules in the crystallographic asymmetric unit (~0.3 Å), indicating consistency with the "cut-and-paste" structuring of the hybrid rubredoxins in both crystal and solution. Conclusion Each of the significant energetic interactions in the metal binding site-swapped hybrids appears to exhibit a 1-to-1 correspondence with the interactions present in the corresponding parental rubredoxin structure, thus providing a structural basis for the observed additivity in conformational stability and dynamics. The congruence of these X-ray and NMR experimental data offers additional support for the interpretation that the conventional treatment of NOE distance restraints contributes substantially to the systematic differences that are commonly reported between NMR- and X-ray-derived protein structures.
Collapse
Affiliation(s)
- David M LeMaster
- Wadsworth Center, New York State Department of Health, School of Public Health, University at Albany - SUNY, Empire State Plaza, Albany, New York 12201, USA.
| | | | | | | | | | | |
Collapse
|
8
|
LeMaster DM, Hernández G. Residue cluster additivity of thermodynamic stability in the hydrophobic core of mesophile vs. hyperthermophile rubredoxins. Biophys Chem 2007; 125:483-9. [PMID: 17118523 DOI: 10.1016/j.bpc.2006.10.013] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2006] [Revised: 10/27/2006] [Accepted: 10/27/2006] [Indexed: 11/23/2022]
Abstract
The branched sidechain residues 24 and 33 in the hydrophobic core of rubredoxin differ between the Clostridium pasteurianum (Cp) and Pyrococcus furiosus (Pf) sequences. Their X-ray structures indicate that these two sidechains are in van der Waals contact with each other, while neither appears to significantly interact with the other nonconserved residues. The simultaneous interchange of residues 24 and 33 between the Cp and Pf rubredoxin sequences yield a complementary pair of hybrid proteins for which the sum of their thermodynamic stabilities equals that of the parental rubredoxins. The 1.2 kcal/mol change arising from this two residues interchange accounts for 21% of the differential thermodynamic stability between the mesophile and hyperthermophile proteins. The additional interchange of the sole nonconserved aromatic residue in the hydrophobic core yields a 0.78 kcal/mol deviation from thermodynamic additivity.
Collapse
Affiliation(s)
- David M LeMaster
- Wadsworth Center, New York State Department of Health, New York 12201-0509, USA
| | | |
Collapse
|