1
|
Alsulami AF. Mut-Map: Comprehensive Computational Pipeline for Structural Mapping and Analysis of Cancer-Associated Mutations. Brief Bioinform 2024; 25:bbae514. [PMID: 39413799 PMCID: PMC11483132 DOI: 10.1093/bib/bbae514] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Revised: 09/18/2024] [Accepted: 09/30/2024] [Indexed: 10/18/2024] Open
Abstract
Understanding the functional impact of genetic mutations on protein structures is essential for advancing cancer research and developing targeted therapies. The main challenge lies in accurately mapping these mutations to protein structures and analysing their effects on protein function. To address this, Mut-Map (https://genemutation.org/) is a comprehensive computational pipeline designed to integrate mutation data from the Catalogue Of Somatic Mutations In Cancer database with protein structural data from the Protein Data Bank and AlphaFold models. The pipeline begins by taking a UniProt ID and proceeds through mapping corresponding Protein Data Bank structures, renumbering residues, and assessing disorder percentages. It then overlays mutation data, categorizes mutations based on structural context, and visualizes them using advanced tools like MolStar. This approach allows for a detailed analysis of how mutations may disrupt protein function by affecting key regions such as DNA interfaces, ligand-binding sites, and dimer interactions. To validate the pipeline, a case study on the TP53 gene, a critical tumour suppressor often mutated in cancers, was conducted. The analysis highlighted the most frequent mutations occurring at the DNA-binding interface, providing insights into their potential role in cancer progression. Mut-Map offers a powerful resource for elucidating the structural implications of cancer-associated mutations, paving the way for more targeted therapeutic strategies and advancing our understanding of protein structure-function relationships.
Collapse
Affiliation(s)
- Ali F Alsulami
- Department of Biochemistry, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia
| |
Collapse
|
2
|
Komar AA, Samatova E, Rodnina MV. Translation Rates and Protein Folding. J Mol Biol 2024; 436:168384. [PMID: 38065274 DOI: 10.1016/j.jmb.2023.168384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 12/01/2023] [Accepted: 12/02/2023] [Indexed: 12/19/2023]
Abstract
The mRNA coding sequence defines not only the amino acid sequence of the protein, but also the speed at which the ribosomes move along the mRNA while making the protein. The non-uniform local kinetics - denoted as translational rhythm - is similar among mRNAs coding for related protein folds. Deviations from this conserved rhythm can result in protein misfolding. In this review we summarize the experimental evidence demonstrating how local translation rates affect cotranslational protein folding, with the focus on the synonymous codons and patches of charged residues in the nascent peptide as best-studied examples. Alterations in nascent protein conformations due to disturbed translational rhythm can persist off the ribosome, as demonstrated by the effects of synonymous codon variants of several disease-related proteins. Charged amino acid patches in nascent chains also modulate translation and cotranslational protein folding, and can abrogate translation when placed at the N-terminus of the nascent peptide. During cotranslational folding, incomplete nascent chains navigate through a unique conformational landscape in which earlier intermediate states become inaccessible as the nascent peptide grows. Precisely tuned local translation rates, as well as interactions with the ribosome, guide the folding pathway towards the native structure, whereas deviations from the natural translation rhythm may favor pathways leading to trapped misfolded states. Deciphering the 'folding code' of the mRNA will contribute to understanding the diseases caused by protein misfolding and to rational protein design.
Collapse
Affiliation(s)
- Anton A Komar
- Center for Gene Regulation in Health and Disease, Department of Biological, Geological and Environmental Sciences, Cleveland State University, 2121 Euclid Avenue, Cleveland, OH 44115, USA; Department of Biochemistry and Center for RNA Science and Therapeutics, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, USA.
| | - Ekaterina Samatova
- Max Planck Department of Physical Biochemistry, Max Planck Institute for Multidisciplinary Sciences, 37077 Goettingen, Germany
| | - Marina V Rodnina
- Max Planck Department of Physical Biochemistry, Max Planck Institute for Multidisciplinary Sciences, 37077 Goettingen, Germany.
| |
Collapse
|
3
|
Ribeiro R, Moreira JN, Goncalves J. Development of a new affinity maturation protocol for the construction of an internalizing anti-nucleolin antibody library. Sci Rep 2024; 14:10608. [PMID: 38719911 PMCID: PMC11079059 DOI: 10.1038/s41598-024-61230-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Accepted: 05/02/2024] [Indexed: 05/12/2024] Open
Abstract
Over the last decades, monoclonal antibodies have substantially improved the treatment of several conditions. The continuous search for novel therapeutic targets and improvements in antibody's structure, demands for a constant optimization of their development. In this regard, modulation of an antibody's affinity to its target has been largely explored and culminated in the discovery and optimization of a variety of molecules. It involves the creation of antibody libraries and selection against the target of interest. In this work, we aimed at developing a novel protocol to be used for the affinity maturation of an antibody previously developed by our group. An antibody library was constructed using an in vivo random mutagenesis approach that, to our knowledge, has not been used before for antibody development. Then, a cell-based phage display selection protocol was designed to allow the fast and simple screening of antibody clones capable of being internalized by target cells. Next generation sequencing coupled with computer analysis provided an extensive characterization of the created library and post-selection pool, that can be used as a guide for future antibody development. With a single selection step, an enrichment in the mutated antibody library, given by a decrease in almost 50% in sequence diversity, was achieved, and structural information useful in the study of the antibody-target interaction in the future was obtained.
Collapse
Affiliation(s)
- Rita Ribeiro
- CNC-Center for Neurosciences and Cell Biology, Center for Innovative Biomedicine and Biotechnology (CIBB), Faculty of Medicine (Polo 1), University of Coimbra, Coimbra, Portugal
- Faculty of Pharmacy, iMed.ULisboa - Research Institute for Medicines, University of Lisbon, Lisbon, Portugal
- Univ Coimbra-University of Coimbra, CIBB, Faculty of Pharmacy, Coimbra, Portugal
| | - João N Moreira
- CNC-Center for Neurosciences and Cell Biology, Center for Innovative Biomedicine and Biotechnology (CIBB), Faculty of Medicine (Polo 1), University of Coimbra, Coimbra, Portugal.
- Univ Coimbra-University of Coimbra, CIBB, Faculty of Pharmacy, Coimbra, Portugal.
| | - João Goncalves
- Faculty of Pharmacy, iMed.ULisboa - Research Institute for Medicines, University of Lisbon, Lisbon, Portugal.
| |
Collapse
|
4
|
Gudkov M, Thibaut L, Giannoulatou E. Quantifying negative selection on synonymous variants. HGG ADVANCES 2024; 5:100262. [PMID: 38192100 PMCID: PMC10835449 DOI: 10.1016/j.xhgg.2024.100262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 01/01/2024] [Accepted: 01/01/2024] [Indexed: 01/10/2024] Open
Abstract
Widespread adoption of DNA sequencing has resulted in large numbers of genetic variants, whose contribution to disease is not easily determined. Although many types of variation are known to disrupt cellular processes in predictable ways, for some categories of variants, the effects may not be directly detectable. A particular example is synonymous variants, that is, those single-nucleotide variants that create a codon substitution, such that the produced amino acid sequence is unaffected. Contrary to the original theory suggesting that synonymous variants are benign, there is a growing volume of research showing that, despite their "silent" mechanism of action, some synonymous variation may be deleterious. Here, we studied the extent of the negative selective pressure acting on different classes of synonymous variants by analyzing the relative enrichment of synonymous singleton variants in the human exomes provided by gnomAD. Using a modification of the mutability-adjusted proportion of singletons (MAPS) metric as a measure of purifying selection, we found that some classes of synonymous variants are subject to stronger negative selection than others. For instance, variants that reduce codon optimality undergo stronger selection than optimality-increasing variants. Besides, selection affects synonymous variants implicated in splice-site-loss or splice-site-gain events. To understand what drives this negative selection, we tested a number of predictors in the aim to explain the variability in the selection scores. Our findings provide insights into the effects of synonymous variants at the population level, highlighting the specifics of the role that these variants play in health and disease.
Collapse
Affiliation(s)
- Mikhail Gudkov
- Victor Chang Cardiac Research Institute, Darlinghurst, NSW 2010, Australia; St Vincent's Clinical School, UNSW Sydney, Sydney, NSW 2052, Australia
| | - Loïc Thibaut
- Victor Chang Cardiac Research Institute, Darlinghurst, NSW 2010, Australia; School of Mathematics and Statistics, UNSW Sydney, Sydney, NSW 2052, Australia
| | - Eleni Giannoulatou
- Victor Chang Cardiac Research Institute, Darlinghurst, NSW 2010, Australia; St Vincent's Clinical School, UNSW Sydney, Sydney, NSW 2052, Australia.
| |
Collapse
|
5
|
Jiang Y, Deane CM, Morris GM, O’Brien EP. It is theoretically possible to avoid misfolding into non-covalent lasso entanglements using small molecule drugs. PLoS Comput Biol 2024; 20:e1011901. [PMID: 38470915 PMCID: PMC10931463 DOI: 10.1371/journal.pcbi.1011901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Accepted: 02/08/2024] [Indexed: 03/14/2024] Open
Abstract
A novel class of protein misfolding characterized by either the formation of non-native noncovalent lasso entanglements in the misfolded structure or loss of native entanglements has been predicted to exist and found circumstantial support through biochemical assays and limited-proteolysis mass spectrometry data. Here, we examine whether it is possible to design small molecule compounds that can bind to specific folding intermediates and thereby avoid these misfolded states in computer simulations under idealized conditions (perfect drug-binding specificity, zero promiscuity, and a smooth energy landscape). Studying two proteins, type III chloramphenicol acetyltransferase (CAT-III) and D-alanyl-D-alanine ligase B (DDLB), that were previously suggested to form soluble misfolded states through a mechanism involving a failure-to-form of native entanglements, we explore two different drug design strategies using coarse-grained structure-based models. The first strategy, in which the native entanglement is stabilized by drug binding, failed to decrease misfolding because it formed an alternative entanglement at a nearby region. The second strategy, in which a small molecule was designed to bind to a non-native tertiary structure and thereby destabilize the native entanglement, succeeded in decreasing misfolding and increasing the native state population. This strategy worked because destabilizing the entanglement loop provided more time for the threading segment to position itself correctly to be wrapped by the loop to form the native entanglement. Further, we computationally identified several FDA-approved drugs with the potential to bind these intermediate states and rescue misfolding in these proteins. This study suggests it is possible for small molecule drugs to prevent protein misfolding of this type.
Collapse
Affiliation(s)
- Yang Jiang
- Department of Chemistry, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Charlotte M. Deane
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, 24-29 St Giles’ Oxford, OX1 3LB United Kingdom
| | - Garrett M. Morris
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, 24-29 St Giles’ Oxford, OX1 3LB United Kingdom
| | - Edward P. O’Brien
- Department of Chemistry, Pennsylvania State University, University Park, Pennsylvania, United States of America
- Bioinformatics and Genomics Graduate Program, The Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania, United States of America
- Institute for Computational and Data Sciences, Pennsylvania State University, University Park, Pennsylvania, United States of America
| |
Collapse
|
6
|
Love AM, Nair NU. Specific codons control cellular resources and fitness. SCIENCE ADVANCES 2024; 10:eadk3485. [PMID: 38381824 PMCID: PMC10881034 DOI: 10.1126/sciadv.adk3485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 01/18/2024] [Indexed: 02/23/2024]
Abstract
As cellular engineering progresses from simply overexpressing proteins to imparting complex phenotypes through multigene expression, judicious appropriation of cellular resources is essential. Since codon use is degenerate and biased, codons may control cellular resources at a translational level. We investigate how partitioning transfer RNA (tRNA) resources by incorporating dissimilar codon usage can drastically alter interdependence of expression level and burden on the host. By isolating the effect of individual codons' use during translation elongation while eliminating confounding factors, we show that codon choice can trans-regulate fitness of the host and expression of other heterologous or native genes. We correlate specific codon usage patterns with host fitness and derive a coding scheme for multigene expression called the Codon Health Index (CHI, χ). This empirically derived coding scheme (χ) enables the design of multigene expression systems that avoid catastrophic cellular burden and is robust across several proteins and conditions.
Collapse
Affiliation(s)
- Aaron M. Love
- Manus Bio, Waltham, MA 02453, USA
- Department of Chemical and Biological Engineering, Tufts University, Medford, MA 02155, USA
| | - Nikhil U. Nair
- Department of Chemical and Biological Engineering, Tufts University, Medford, MA 02155, USA
| |
Collapse
|
7
|
Liang J, Tang M, Chen L, Wang W, Liang X. Oxidative stress resistance prompts pyrroloquinoline quinone biosynthesis in Hyphomicrobium denitrificans H4-45. Appl Microbiol Biotechnol 2024; 108:204. [PMID: 38349428 PMCID: PMC10864529 DOI: 10.1007/s00253-024-13053-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 01/26/2024] [Accepted: 02/03/2024] [Indexed: 02/15/2024]
Abstract
Pyrroloquinoline quinone (PQQ) is a natural antioxidant with diverse applications in food and pharmaceutical industries. A lot of effort has been devoted toward the discovery of PQQ high-producing microbial species and characterization of biosynthesis, but it is still challenging to achieve a high PQQ yield. In this study, a combined strategy of random mutagenesis and adaptive laboratory evolution (ALE) with fermentation optimization was applied to improve PQQ production in Hyphomicrobium denitrificans H4-45. A mutant strain AE-9 was obtained after nearly 400 generations of UV-LiCl mutagenesis, followed by an ALE process, which was conducted with a consecutive increase of oxidative stress generated by kanamycin, sodium sulfide, and potassium tellurite. In the flask culture condition, the PQQ production in mutant strain AE-9 had an 80.4% increase, and the cell density increased by 14.9% when compared with that of the initial strain H4-45. Moreover, batch and fed-batch fermentation processes were optimized to further improve PQQ production by pH control strategy, methanol and H2O2 feed flow, and segmented fermentation process. Finally, the highest PQQ production and productivity of the mutant strain AE-9 reached 307 mg/L and 4.26 mg/L/h in a 3.7-L bioreactor, respectively. Whole genome sequencing analysis showed that genetic mutations in the ftfL gene and thiC gene might contribute to improving PQQ production by enhancing methanol consumption and cell growth in the AE-9 strain. Our study provided a systematic strategy to obtain a PQQ high-producing mutant strain and achieve high production of PQQ in fermentation. These practical methods could be applicable to improve the production of other antioxidant compounds with uncleared regulation mechanisms. KEY POINTS: • Improvement of PQQ production by UV-LiCl mutagenesis combined with adaptive laboratory evolution (ALE) and fermentation optimization. • A consecutive increase of oxidative stress could be used as the antagonistic factor for ALE to enhance PQQ production. • Mutations in the ftfL gene and thiC gene indicated that PQQ production might be increased by enhancing methanol consumption and cell growth.
Collapse
Affiliation(s)
- Jiale Liang
- School of Food Science and Biotechnology, Zhejiang Gongshang University, Hangzhou, 310018, China
| | - Mingjie Tang
- School of Food Science and Biotechnology, Zhejiang Gongshang University, Hangzhou, 310018, China
| | - Lang Chen
- School of Food Science and Biotechnology, Zhejiang Gongshang University, Hangzhou, 310018, China
| | - Wenjie Wang
- School of Food Science and Biotechnology, Zhejiang Gongshang University, Hangzhou, 310018, China.
| | - Xinle Liang
- School of Food Science and Biotechnology, Zhejiang Gongshang University, Hangzhou, 310018, China.
| |
Collapse
|
8
|
Samatova E, Komar AA, Rodnina MV. How the ribosome shapes cotranslational protein folding. Curr Opin Struct Biol 2024; 84:102740. [PMID: 38071940 DOI: 10.1016/j.sbi.2023.102740] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 11/15/2023] [Accepted: 11/16/2023] [Indexed: 02/09/2024]
Abstract
During protein synthesis, the growing nascent peptide chain moves inside the polypeptide exit tunnel of the ribosome from the peptidyl transferase center towards the exit port where it emerges into the cytoplasm. The ribosome defines the unique energy landscape of the pioneering round of protein folding. The spatial confinement and the interactions of the nascent peptide with the tunnel walls facilitate formation of secondary structures, such as α-helices. The vectorial nature of protein folding inside the tunnel favors local intra- and inter-molecular interactions, thereby inducing cotranslational folding intermediates that do not form upon protein refolding in solution. Tertiary structures start to fold in the lower part of the tunnel, where interactions with the ribosome destabilize native protein folds. The present review summarizes the recent progress in understanding the driving forces of nascent protein folding inside the tunnel and at the surface of the ribosome.
Collapse
Affiliation(s)
- Ekaterina Samatova
- Department of Physical Biochemistry, Max Planck Institute for Multidisciplinary Sciences, Goettingen 37077, Germany
| | - Anton A Komar
- Center for Gene Regulation in Health and Disease, Department of Biological, Geological and Environmental Sciences, Cleveland State University, 2121 Euclid Avenue, Cleveland, OH 44115, USA; Department of Biochemistry and Center for RNA Science and Therapeutics, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Marina V Rodnina
- Department of Physical Biochemistry, Max Planck Institute for Multidisciplinary Sciences, Goettingen 37077, Germany.
| |
Collapse
|
9
|
Yang B, Cheng Z, Luo L, Cheng K, Gan S, Shi Y, Liu C, Wang D. Comparative analysis of codon usage patterns of Plasmodium helical interspersed subtelomeric (PHIST) proteins. Front Microbiol 2023; 14:1320060. [PMID: 38156001 PMCID: PMC10752978 DOI: 10.3389/fmicb.2023.1320060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Accepted: 11/28/2023] [Indexed: 12/30/2023] Open
Abstract
Background Plasmodium falciparum is a protozoan parasite that causes the most severe form of malaria in humans worldwide, which is predominantly found in sub-Saharan Africa, where it is responsible for the majority of malaria-related deaths. Plasmodium helical interspersed subtelomeric (PHIST) proteins are a family of proteins, with a conserved PHIST domain, which are typically located at the subtelomeric regions of the Plasmodium falciparum chromosomes and play crucial roles in the interaction between the parasite and its human host, such as cytoadherence, immune evasion, and host cell remodeling. However, the specific utilization of synonymous codons by PHIST proteins in Plasmodium falciparum is still unknown. Methods Codon usage bias (CUB) refers to the unequal usage of synonymous codons during translation, resulting in over- or underrepresentation of certain nucleotide patterns. This imbalance in CUB can impact various cellular processes, including protein expression levels and genetic variation. To investigate this, the CUB of 88 PHIST protein coding sequences (CDSs) from 5 subgroups were analyzed in this study. Results The results showed that both codon base composition and relative synonymous codon usage (RSCU) analysis identified a higher occurrence of AT-ended codons (AGA and UUA) in PHIST proteins of Plasmodium falciparum. The average effective number of codons (ENC) for these PHIST proteins was 36.69, indicating a weak codon preference among them, as it was greater than 35. Additionally, the correlation analysis among codon base composition (GC1, GC2, GC3, GCs), codon adaptation index (CAI), codon bias index (CBI), frequency of optimal codons (FOP), ENC, general average hydropathicity (GRAVY), aromaticity (AROMO), length of synonymous codons (L_sym), and length of amino acids (L_aa) revealed the influence of base composition and codon usage indices on codon usage bias, with GC1 having a significant impact in this study. Furthermore, the neutrality plot analysis, PR2-bias plot analysis, and ENC-GC3 plot analysis provided additional evidence that natural selection plays a crucial role in determining codon bias in PHIST proteins. Conclusion In conclusion, this study has enhanced our understanding of the characteristics of codon usage and genetic evolution in PHIST proteins, thereby providing data foundation for further research on antimalarial drugs or vaccines.
Collapse
Affiliation(s)
- Baoling Yang
- College of Basic Medicine, Jinzhou Medical University, Jinzhou, Liaoning Province, China
| | - Ziwen Cheng
- College of Basic Medicine, Jinzhou Medical University, Jinzhou, Liaoning Province, China
| | - Like Luo
- College of Basic Medicine, Jinzhou Medical University, Jinzhou, Liaoning Province, China
| | - Kuo Cheng
- College of Basic Medicine, Jinzhou Medical University, Jinzhou, Liaoning Province, China
| | - Shengqi Gan
- College of Animal Husbandry and Veterinary Medicine, Jinzhou Medical University, Jinzhou, Liaoning Province, China
| | - Yuyi Shi
- College of Animal Husbandry and Veterinary Medicine, Jinzhou Medical University, Jinzhou, Liaoning Province, China
| | - Che Liu
- College of Animal Husbandry and Veterinary Medicine, Jinzhou Medical University, Jinzhou, Liaoning Province, China
| | - Dawei Wang
- College of Animal Husbandry and Veterinary Medicine, Jinzhou Medical University, Jinzhou, Liaoning Province, China
| |
Collapse
|
10
|
Wang D, Yang B. Analysis of codon usage bias of thioredoxin in apicomplexan protozoa. Parasit Vectors 2023; 16:431. [PMID: 37990340 PMCID: PMC10664530 DOI: 10.1186/s13071-023-06002-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 10/06/2023] [Indexed: 11/23/2023] Open
Abstract
BACKGROUND Apicomplexan protozoa are a diverse group of obligate intracellular parasites causing many diseases that affect humans and animals, such as malaria, toxoplasmosis, and cryptosporidiosis. Apicomplexan protozoa possess unique thioredoxins (Trxs) that have been shown to regulate various cellular processes including metabolic redox regulation, parasite survival, and host immune evasion. However, it is still unknown how synonymous codons are used by apicomplexan protozoa Trxs. METHODS Codon usage bias (CUB) is the unequal usage of synonymous codons during translation which leads to the over- or underrepresentation of certain nucleotide patterns. This imbalance in CUB can impact a variety of cellular processes including protein expression levels and genetic variation. This study analyzed the CUB of 32 Trx coding sequences (CDS) from 11 apicomplexan protozoa. RESULTS The results showed that both codon base composition and relative synonymous codon usage (RSCU) analysis revealed that AT-ended codons were more frequently used in Cryptosporidium spp. and Plasmodium spp., while the Eimeria spp., Babesia spp., Hammondia hammondi, Neospora caninum, and Toxoplasma gondii tended to end in G/C. The average effective number of codon (ENC) value of these apicomplexan protozoa is 46.59, which is > 35, indicating a weak codon preference among apicomplexan protozoa Trxs. Furthermore, the correlation analysis among codon base composition (GC1, GC2, GC3, GCs), codon adaptation index (CAI), codon bias index (CBI), frequency of optimal codons (FOP), ENC, general average hydropathicity (GRAVY), aromaticity (AROMO), length of synonymous codons (L_sym), and length of amino acids (L_aa) indicated the influence of base composition and codon usage indices on CUB. Additionally, the neutrality plot analysis, PR2-bias plot analysis, and ENC-GC3 plot analysis further demonstrated that natural selection plays an important role in apicomplexan protozoa Trxs codon bias. CONCLUSIONS In conclusion, this study increased the understanding of codon usage characteristics and genetic evolution of apicomplexan protozoa Trxs, which expanded new ideas for vaccine and drug research.
Collapse
Affiliation(s)
- Dawei Wang
- Jinzhou Medical University, Jinzhou, 121000, Liaoning Province, China
| | - Baoling Yang
- Jinzhou Medical University, Jinzhou, 121000, Liaoning Province, China.
| |
Collapse
|
11
|
Kouba P, Kohout P, Haddadi F, Bushuiev A, Samusevich R, Sedlar J, Damborsky J, Pluskal T, Sivic J, Mazurenko S. Machine Learning-Guided Protein Engineering. ACS Catal 2023; 13:13863-13895. [PMID: 37942269 PMCID: PMC10629210 DOI: 10.1021/acscatal.3c02743] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 09/20/2023] [Indexed: 11/10/2023]
Abstract
Recent progress in engineering highly promising biocatalysts has increasingly involved machine learning methods. These methods leverage existing experimental and simulation data to aid in the discovery and annotation of promising enzymes, as well as in suggesting beneficial mutations for improving known targets. The field of machine learning for protein engineering is gathering steam, driven by recent success stories and notable progress in other areas. It already encompasses ambitious tasks such as understanding and predicting protein structure and function, catalytic efficiency, enantioselectivity, protein dynamics, stability, solubility, aggregation, and more. Nonetheless, the field is still evolving, with many challenges to overcome and questions to address. In this Perspective, we provide an overview of ongoing trends in this domain, highlight recent case studies, and examine the current limitations of machine learning-based methods. We emphasize the crucial importance of thorough experimental validation of emerging models before their use for rational protein design. We present our opinions on the fundamental problems and outline the potential directions for future research.
Collapse
Affiliation(s)
- Petr Kouba
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
- Faculty of
Electrical Engineering, Czech Technical
University in Prague, Technicka 2, 166 27 Prague 6, Czech Republic
| | - Pavel Kohout
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Faraneh Haddadi
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Anton Bushuiev
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Raman Samusevich
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
- Institute
of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
| | - Jiri Sedlar
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Jiri Damborsky
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Tomas Pluskal
- Institute
of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
| | - Josef Sivic
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Stanislav Mazurenko
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| |
Collapse
|
12
|
Salicari L, Baiesi M, Orlandini E, Trovato A. Folding kinetics of an entangled protein. PLoS Comput Biol 2023; 19:e1011107. [PMID: 37956216 PMCID: PMC10681328 DOI: 10.1371/journal.pcbi.1011107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Revised: 11/27/2023] [Accepted: 11/02/2023] [Indexed: 11/15/2023] Open
Abstract
The possibility of the protein backbone adopting lasso-like entangled motifs has attracted increasing attention. After discovering the surprising abundance of natively entangled protein domain structures, it was shown that misfolded entangled subpopulations might become thermosensitive or escape the homeostasis network just after translation. To investigate the role of entanglement in shaping folding kinetics, we introduce a novel indicator and analyze simulations of a coarse-grained, structure-based model for two small single-domain proteins. The model recapitulates the well-known two-state folding mechanism of a non-entangled SH3 domain. However, despite its small size, a natively entangled antifreeze RD1 protein displays a rich refolding behavior, populating two distinct kinetic intermediates: a short-lived, entangled, near-unfolded state and a longer-lived, non-entangled, near-native state. The former directs refolding along a fast pathway, whereas the latter is a kinetic trap, consistently with known experimental evidence of two different characteristic times. Upon trapping, the natively entangled loop folds without being threaded by the N-terminal residues. After trapping, the native entangled structure emerges by either backtracking to the unfolded state or threading through the already formed but not yet entangled loop. Along the fast pathway, trapping does not occur because the native contacts at the closure of the lasso-like loop fold after those involved in the N-terminal thread, confirming previous predictions. Despite this, entanglement may appear already in unfolded configurations. Remarkably, a longer-lived, near-native intermediate, with non-native entanglement properties, recalls what was observed in cotranslational folding.
Collapse
Affiliation(s)
- Leonardo Salicari
- Department of Physics and Astronomy “G. Galilei”, University of Padova, Padova, Italy
- National Institute of Nuclear Physics (INFN), Padova Section, Padova, Italy
| | - Marco Baiesi
- Department of Physics and Astronomy “G. Galilei”, University of Padova, Padova, Italy
- National Institute of Nuclear Physics (INFN), Padova Section, Padova, Italy
| | - Enzo Orlandini
- Department of Physics and Astronomy “G. Galilei”, University of Padova, Padova, Italy
- National Institute of Nuclear Physics (INFN), Padova Section, Padova, Italy
| | - Antonio Trovato
- Department of Physics and Astronomy “G. Galilei”, University of Padova, Padova, Italy
- National Institute of Nuclear Physics (INFN), Padova Section, Padova, Italy
| |
Collapse
|
13
|
Nussinov R, Liu Y, Zhang W, Jang H. Protein conformational ensembles in function: roles and mechanisms. RSC Chem Biol 2023; 4:850-864. [PMID: 37920394 PMCID: PMC10619138 DOI: 10.1039/d3cb00114h] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 09/02/2023] [Indexed: 11/04/2023] Open
Abstract
The sequence-structure-function paradigm has dominated twentieth century molecular biology. The paradigm tacitly stipulated that for each sequence there exists a single, well-organized protein structure. Yet, to sustain cell life, function requires (i) that there be more than a single structure, (ii) that there be switching between the structures, and (iii) that the structures be incompletely organized. These fundamental tenets called for an updated sequence-conformational ensemble-function paradigm. The powerful energy landscape idea, which is the foundation of modernized molecular biology, imported the conformational ensemble framework from physics and chemistry. This framework embraces the recognition that proteins are dynamic and are always interconverting between conformational states with varying energies. The more stable the conformation the more populated it is. The changes in the populations of the states are required for cell life. As an example, in vivo, under physiological conditions, wild type kinases commonly populate their more stable "closed", inactive, conformations. However, there are minor populations of the "open", ligand-free states. Upon their stabilization, e.g., by high affinity interactions or mutations, their ensembles shift to occupy the active states. Here we discuss the role of conformational propensities in function. We provide multiple examples of diverse systems, including protein kinases, lipid kinases, and Ras GTPases, discuss diverse conformational mechanisms, and provide a broad outlook on protein ensembles in the cell. We propose that the number of molecules in the active state (inactive for repressors), determine protein function, and that the dynamic, relative conformational propensities, rather than the rigid structures, are the hallmark of cell life.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research Frederick MD 21702 USA
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University Tel Aviv 69978 Israel
- Cancer Innovation Laboratory, National Cancer Institute Frederick MD 21702 USA
| | - Yonglan Liu
- Cancer Innovation Laboratory, National Cancer Institute Frederick MD 21702 USA
| | - Wengang Zhang
- Cancer Innovation Laboratory, National Cancer Institute Frederick MD 21702 USA
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research Frederick MD 21702 USA
- Cancer Innovation Laboratory, National Cancer Institute Frederick MD 21702 USA
| |
Collapse
|
14
|
Romanowski SB, Lee S, Kunakom S, Paulo BS, Recchia MJJ, Liu DY, Cavanagh H, Linington RG, Eustáquio AS. Identification of the lipodepsipeptide selethramide encoded in a giant nonribosomal peptide synthetase from a Burkholderia bacterium. Proc Natl Acad Sci U S A 2023; 120:e2304668120. [PMID: 37812712 PMCID: PMC10589681 DOI: 10.1073/pnas.2304668120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 09/06/2023] [Indexed: 10/11/2023] Open
Abstract
Bacterial natural products have found many important industrial applications. Yet traditional discovery pipelines often prioritize individual natural product families despite the presence of multiple natural product biosynthetic gene clusters in each bacterial genome. Systematic characterization of talented strains is a means to expand the known natural product space. Here, we report genomics, epigenomics, and metabolomics studies of Burkholderia sp. FERM BP-3421, a soil isolate and known producer of antitumor spliceostatins. Its genome is composed of two chromosomes and two plasmids encoding at least 29 natural product families. Metabolomics studies showed that FERM BP-3421 also produces antifungal aminopyrrolnitrin and approved anticancer romidepsin. From the orphan metabolome features, we connected a lipopeptide of 1,928 Da to an 18-module nonribosomal peptide synthetase encoded as a single gene in chromosome 1. Isolation and structure elucidation led to the identification of selethramide which contains a repeating pattern of serine and leucine and is cyclized at the side chain oxygen of the one threonine residue at position 13. A (R)-3-hydroxybutyric acid moiety decorates the N-terminal serine. Initial attempts to obtain deletion mutants to probe the role of selethramide failed. After acquiring epigenome (methylome) data for FERM BP-3421, we employed a mimicry by methylation strategy that improved DNA transfer efficiency. Mutants defective in selethramide biosynthesis showed reduced surfactant activity and impaired swarming motility that could be chemically complemented with selethramide. This work unveils a lipopeptide that promotes surface motility, establishes improved DNA transfer efficiency, and sets the stage for continued natural product identification from a prolific strain.
Collapse
Affiliation(s)
- Sean B. Romanowski
- Department of Pharmaceutical Sciences, College of Pharmacy, University of Illinois at Chicago, Chicago, IL60607
| | - Sanghoon Lee
- Department of Chemistry, Simon Fraser University, Burnaby, BCV5H 1S6, Canada
| | - Sylvia Kunakom
- Department of Pharmaceutical Sciences, College of Pharmacy, University of Illinois at Chicago, Chicago, IL60607
| | - Bruno S. Paulo
- Department of Pharmaceutical Sciences, College of Pharmacy, University of Illinois at Chicago, Chicago, IL60607
| | | | - Dennis Y. Liu
- Department of Chemistry, Simon Fraser University, Burnaby, BCV5H 1S6, Canada
| | - Hannah Cavanagh
- Department of Chemistry, Simon Fraser University, Burnaby, BCV5H 1S6, Canada
| | - Roger G. Linington
- Department of Chemistry, Simon Fraser University, Burnaby, BCV5H 1S6, Canada
| | - Alessandra S. Eustáquio
- Department of Pharmaceutical Sciences, College of Pharmacy, University of Illinois at Chicago, Chicago, IL60607
| |
Collapse
|
15
|
Rudolph A, Nyerges A, Chiappino-Pepe A, Landon M, Baas-Thomas M, Church G. Strategies to identify and edit improvements in synthetic genome segments episomally. Nucleic Acids Res 2023; 51:10094-10106. [PMID: 37615546 PMCID: PMC10570025 DOI: 10.1093/nar/gkad692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 06/30/2023] [Accepted: 08/16/2023] [Indexed: 08/25/2023] Open
Abstract
Genome engineering projects often utilize bacterial artificial chromosomes (BACs) to carry multi-kilobase DNA segments at low copy number. However, all stages of whole-genome engineering have the potential to impose mutations on the synthetic genome that can reduce or eliminate the fitness of the final strain. Here, we describe improvements to a multiplex automated genome engineering (MAGE) protocol to improve recombineering frequency and multiplexability. This protocol was applied to recoding an Escherichia coli strain to replace seven codons with synonymous alternatives genome wide. Ten 44 402-47 179 bp de novo synthesized DNA segments contained in a BAC from the recoded strain were unable to complement deletion of the corresponding 33-61 wild-type genes using a single antibiotic resistance marker. Next-generation sequencing (NGS) was used to identify 1-7 non-recoding mutations in essential genes per segment, and MAGE in turn proved a useful strategy to repair these mutations on the recoded segment contained in the BAC when both the recoded and wild-type copies of the mutated genes had to exist by necessity during the repair process. Finally, two web-based tools were used to predict the impact of a subset of non-recoding missense mutations on strain fitness using protein structure and function calls.
Collapse
Affiliation(s)
- Alexandra Rudolph
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
| | - Akos Nyerges
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
| | - Anush Chiappino-Pepe
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
- Wyss Institute for Biologically Inspired Engineering, Boston, MA 02115, USA
| | - Matthieu Landon
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
| | | | - George Church
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
- Wyss Institute for Biologically Inspired Engineering, Boston, MA 02115, USA
| |
Collapse
|
16
|
Cuevas-Zuviría B, Adam ZR, Goldman AD, Kaçar B. Informatic Capabilities of Translation and Its Implications for the Origins of Life. J Mol Evol 2023; 91:567-569. [PMID: 37526692 DOI: 10.1007/s00239-023-10125-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 06/22/2023] [Indexed: 08/02/2023]
Abstract
The ability to encode and convert heritable information into molecular function is a defining feature of life as we know it. The conversion of information into molecular function is performed by the translation process, in which triplets of nucleotides in a nucleic acid polymer (mRNA) encode specific amino acids in a protein polymer that folds into a three-dimensional structure. The folded protein then performs one or more molecular activities, often as one part of a complex and coordinated physiological network. Prebiotic systems, lacking the ability to explicitly translate information between genotype and phenotype, would have depended upon either chemosynthetic pathways to generate its components-constraining its complexity and evolvability- or on the ambivalence of RNA as both carrier of information and of catalytic functions-a possibility which is still supported by a very limited set of catalytic RNAs. Thus, the emergence of translation during early evolutionary history may have allowed life to unmoor from the setting of its origin. The origin of translation machinery also represents an entirely novel and distinct threshold of behavior for which there is no abiotic counterpart-it could be the only known example of computing that emerged naturally at the chemical level. Here we describe translation machinery's decoding system as the basis of cellular translation's information-processing capabilities, and the four operation types that find parallels in computer systems engineering that this biological machinery exhibits.
Collapse
Affiliation(s)
- Bruno Cuevas-Zuviría
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA.
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid, Madrid, Spain.
| | - Zachary R Adam
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
- Department of Geosciences, University of Wisconsin-Madison, Madison, WI, USA
| | | | - Betül Kaçar
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
17
|
Hymer WC, Kraemer WJ. Resistance exercise stress: theoretical mechanisms for growth hormone processing and release from the anterior pituitary somatotroph. Eur J Appl Physiol 2023; 123:1867-1878. [PMID: 37421488 DOI: 10.1007/s00421-023-05263-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Accepted: 06/15/2023] [Indexed: 07/10/2023]
Abstract
Heavy resistance exercise (HRE) is the most effective method for inducing muscular hypertrophy and stimulating anabolic hormones, including growth hormone, into the blood. In this review, we explore possible mechanisms within the GH secretory pathway of the pituitary somatotroph, which are likely to modulate the flow of hormone synthesis and packaging as it is processed prior to exocytosis. Special emphasis is placed on the secretory granule and its possible role as a signaling hub. We also review data that summarize how HRE affects the quality and quantity of the secreted hormone. Finally, these pathway mechanisms are considered in the context of heterogeneity of the somatotroph population in the anterior pituitary.
Collapse
Affiliation(s)
- Wesley C Hymer
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, 16802, USA
| | - William J Kraemer
- Department of Human Sciences, The Ohio State University, Columbus, OH, 43802, USA.
- Department of Kinesiology, University of Connecticut, Storrs, CT, USA.
- Department of Physiology and Neurobiology, University of Connecticut, Storrs, CT, USA.
- School of Medical and Health Sciences, Edith Cowan University, Perth, Australia.
| |
Collapse
|
18
|
Davyt M, Bharti N, Ignatova Z. Effect of mRNA/tRNA mutations on translation speed: Implications for human diseases. J Biol Chem 2023; 299:105089. [PMID: 37495112 PMCID: PMC10470029 DOI: 10.1016/j.jbc.2023.105089] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Revised: 07/18/2023] [Accepted: 07/20/2023] [Indexed: 07/28/2023] Open
Abstract
Recent discoveries establish tRNAs as central regulators of mRNA translation dynamics, and therefore cotranslational folding and function of the encoded protein. The tRNA pool, whose composition and abundance change in a cell- and tissue-dependent manner, is the main factor which determines mRNA translation velocity. In this review, we discuss a group of pathogenic mutations, in the coding sequences of either protein-coding genes or in tRNA genes, that alter mRNA translation dynamics. We also summarize advances in tRNA biology that have uncovered how variations in tRNA levels on account of genetic mutations affect protein folding and function, and thereby contribute to phenotypic diversity in clinical manifestations.
Collapse
Affiliation(s)
- Marcos Davyt
- Institute of Biochemistry and Molecular Biology, University of Hamburg, Hamburg, Germany
| | - Nikhil Bharti
- Institute of Biochemistry and Molecular Biology, University of Hamburg, Hamburg, Germany
| | - Zoya Ignatova
- Institute of Biochemistry and Molecular Biology, University of Hamburg, Hamburg, Germany.
| |
Collapse
|
19
|
David L, Shpigel E, Levin I, Moshe S, Zimmerman L, Dadon-Simanowitz S, Shemer B, Levkovich SA, Larush L, Magdassi S, Belkin S. Performance upgrade of a microbial explosives' sensor strain by screening a high throughput saturation library of a transcriptional regulator. Comput Struct Biotechnol J 2023; 21:4252-4260. [PMID: 37701016 PMCID: PMC10493890 DOI: 10.1016/j.csbj.2023.08.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 08/15/2023] [Accepted: 08/21/2023] [Indexed: 09/14/2023] Open
Abstract
We present a methodology for a high-throughput screening (HTS) of transcription factor libraries, based on bacterial cells and GFP fluorescence. The method is demonstrated on the Escherichia coli LysR-type transcriptional regulator YhaJ, a key element in 2,4-dinitrotuluene (DNT) detection by bacterial explosives' sensor strains. Enhancing the performance characteristics of the YhaJ transcription factor is essential for future standoff detection of buried landmines. However, conventional directed evolution methods for modifying YhaJ are limited in scope, due to the vast sequence space and the absence of efficient screening methods to select optimal transcription factor mutants. To overcome this limitation, we have constructed a focused saturation library of ca. 6.4 × 107 yhaJ variants, and have screened over 70 % of its sequence space using fluorescence-activated cell sorting (FACS). Through this screening process, we have identified YhaJ mutants exhibiting superior fluorescence responses to DNT, which were then effectively transformed into a bioluminescence-based DNT detection system. The best modified DNT reporter strain demonstrated a 7-fold lower DNT detection threshold, a 45-fold increased signal intensity, and a 40 % shorter response time compared to the parental bioreporter. The FACS-based HTS approach presented here may hold a potential for future molecular enhancement of other sensing and catalytic bioreactions.
Collapse
Affiliation(s)
- Lidor David
- Enzymit Ltd. 3 Pinhas Sapir St., Ness Ziona 7403626, Israel
| | - Etai Shpigel
- Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel
| | - Itay Levin
- Enzymit Ltd. 3 Pinhas Sapir St., Ness Ziona 7403626, Israel
| | - Shaked Moshe
- Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel
| | - Lior Zimmerman
- Enzymit Ltd. 3 Pinhas Sapir St., Ness Ziona 7403626, Israel
| | | | - Benjamin Shemer
- Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel
| | - Shon A. Levkovich
- George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Liraz Larush
- Institute of Chemistry, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel
| | - Shlomo Magdassi
- Institute of Chemistry, The Hebrew University of Jerusalem, Jerusalem 9190401, Israel
| | | |
Collapse
|
20
|
Halder R, Nissley DA, Sitarik I, Jiang Y, Rao Y, Vu QV, Li MS, Pritchard J, O'Brien EP. How soluble misfolded proteins bypass chaperones at the molecular level. Nat Commun 2023; 14:3689. [PMID: 37344452 DOI: 10.1038/s41467-023-38962-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Accepted: 05/24/2023] [Indexed: 06/23/2023] Open
Abstract
Subpopulations of soluble, misfolded proteins can bypass chaperones within cells. The extent of this phenomenon and how it happens at the molecular level are unknown. Through a meta-analysis of the experimental literature we find that in all quantitative protein refolding studies there is always a subpopulation of soluble but misfolded protein that does not fold in the presence of one or more chaperones, and can take days or longer to do so. Thus, some misfolded subpopulations commonly bypass chaperones. Using multi-scale simulation models we observe that the misfolded structures that bypass various chaperones can do so because their structures are highly native like, leading to a situation where chaperones do not distinguish between the folded and near-native-misfolded states. More broadly, these results provide a mechanism by which long-time scale changes in protein structure and function can persist in cells because some misfolded states can bypass components of the proteostasis machinery.
Collapse
Affiliation(s)
- Ritaban Halder
- Department of Chemistry, Pennsylvania State University, University Park, PA, 16802, USA
| | - Daniel A Nissley
- Department of Chemistry, Pennsylvania State University, University Park, PA, 16802, USA
- Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK
| | - Ian Sitarik
- Department of Chemistry, Pennsylvania State University, University Park, PA, 16802, USA
| | - Yang Jiang
- Department of Chemistry, Pennsylvania State University, University Park, PA, 16802, USA
| | - Yiyun Rao
- Molecular, Cellular and Integrative Biosciences Program, The Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, 16802, USA
| | - Quyen V Vu
- Institute of Physics, Polish Academy of Sciences; Al. Lotnikow 32/46, 02-668, Warsaw, Poland
| | - Mai Suan Li
- Institute of Physics, Polish Academy of Sciences; Al. Lotnikow 32/46, 02-668, Warsaw, Poland
- Institute for Computational Sciences and Technology; Quang Trung Software City, Tan Chanh Hiep Ward, District 12, Ho Chi Minh City, Vietnam
| | - Justin Pritchard
- Department of Biomedical Engineering, Pennsylvania State University, State College, PA, 16802, USA
- Huck Institute for the Life Sciences, Pennsylvania State University, State College, PA, 16802, USA
| | - Edward P O'Brien
- Department of Chemistry, Pennsylvania State University, University Park, PA, 16802, USA.
- Bioinformatics and Genomics Graduate Program, The Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, 16802, USA.
- Institute for Computational and Data Sciences, Pennsylvania State University, University Park, PA, 16802, USA.
| |
Collapse
|
21
|
Vu Q, Nissley DA, Jiang Y, O’Brien EP, Li MS. Is Posttranslational Folding More Efficient Than Refolding from a Denatured State: A Computational Study. J Phys Chem B 2023; 127:4761-4774. [PMID: 37200608 PMCID: PMC10240488 DOI: 10.1021/acs.jpcb.3c01694] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 05/04/2023] [Indexed: 05/20/2023]
Abstract
The folding of proteins into their native conformation is a complex process that has been extensively studied over the past half-century. The ribosome, the molecular machine responsible for protein synthesis, is known to interact with nascent proteins, adding further complexity to the protein folding landscape. Consequently, it is unclear whether the folding pathways of proteins are conserved on and off the ribosome. The main question remains: to what extent does the ribosome help proteins fold? To address this question, we used coarse-grained molecular dynamics simulations to compare the mechanisms by which the proteins dihydrofolate reductase, type III chloramphenicol acetyltransferase, and d-alanine-d-alanine ligase B fold during and after vectorial synthesis on the ribosome to folding from the full-length unfolded state in bulk solution. Our results reveal that the influence of the ribosome on protein folding mechanisms varies depending on the size and complexity of the protein. Specifically, for a small protein with a simple fold, the ribosome facilitates efficient folding by helping the nascent protein avoid misfolded conformations. However, for larger and more complex proteins, the ribosome does not promote folding and may contribute to the formation of intermediate misfolded states cotranslationally. These misfolded states persist posttranslationally and do not convert to the native state during the 6 μs runtime of our coarse-grain simulations. Overall, our study highlights the complex interplay between the ribosome and protein folding and provides insight into the mechanisms of protein folding on and off the ribosome.
Collapse
Affiliation(s)
- Quyen
V. Vu
- Institute
of Physics, Polish Academy of Sciences, Al. Lotnikow 32/46, 02-668 Warsaw, Poland
| | - Daniel A. Nissley
- Department
of Statistics, University of Oxford, Oxford OX1 3LB, U.K.
| | - Yang Jiang
- Department
of Chemistry, Pennsylvania State University, University Park, Pennsylvania 16802, United States
| | - Edward P. O’Brien
- Department
of Chemistry, Pennsylvania State University, University Park, Pennsylvania 16802, United States
- Bioinformatics
and Genomics Graduate Program, The Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania 16802, United States
- Institute
for Computational and Data Sciences, Pennsylvania
State University, University Park, Pennsylvania 16802, United States
| | - Mai Suan Li
- Institute
of Physics, Polish Academy of Sciences, Al. Lotnikow 32/46, 02-668 Warsaw, Poland
- Institute
for Computational Sciences and Technology, Quang Trung Software City, Tan
Chanh Hiep Ward, District 12, Ho Chi Minh City 700000, Vietnam
| |
Collapse
|
22
|
Heeney M, Frank MH. The mRNA mobileome: challenges and opportunities for deciphering signals from the noise. THE PLANT CELL 2023; 35:1817-1833. [PMID: 36881847 DOI: 10.1093/plcell/koad063] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 02/03/2023] [Accepted: 02/06/2023] [Indexed: 05/30/2023]
Abstract
Organismal communication entails encoding a message that is sent over space or time to a recipient cell, where that message is decoded to activate a downstream response. Defining what qualifies as a functional signal is essential for understanding intercellular communication. In this review, we delve into what is known and unknown in the field of long-distance messenger RNA (mRNA) movement and draw inspiration from the field of information theory to provide a perspective on what defines a functional signaling molecule. Although numerous studies support the long-distance movement of hundreds to thousands of mRNAs through the plant vascular system, only a small handful of these transcripts have been associated with signaling functions. Deciphering whether mobile mRNAs generally serve a role in plant communication has been challenging, due to our current lack of understanding regarding the factors that influence mRNA mobility. Further insight into unsolved questions regarding the nature of mobile mRNAs could provide an understanding of the signaling potential of these macromolecules.
Collapse
Affiliation(s)
- Michelle Heeney
- Plant Biology Section, School of Integrative Plant Science, Cornell University, 14853 Ithaca, NY, USA
| | - Margaret H Frank
- Plant Biology Section, School of Integrative Plant Science, Cornell University, 14853 Ithaca, NY, USA
| |
Collapse
|
23
|
Salicari L, Trovato A. Entangled Motifs in Membrane Protein Structures. Int J Mol Sci 2023; 24:9193. [PMID: 37298146 PMCID: PMC10253074 DOI: 10.3390/ijms24119193] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 05/18/2023] [Accepted: 05/20/2023] [Indexed: 06/12/2023] Open
Abstract
Entangled motifs are found in one-third of protein domain structures, a reference set that contains mostly globular proteins. Their properties suggest a connection with co-translational folding. Here, we wish to investigate the presence and properties of entangled motifs in membrane protein structures. From existing databases, we build a non-redundant data set of membrane protein domains, annotated with the monotopic/transmembrane and peripheral/integral labels. We evaluate the presence of entangled motifs using the Gaussian entanglement indicator. We find that entangled motifs appear in one-fifth of transmembrane and one-fourth of monotopic proteins. Surprisingly, the main features of the distribution of the values of the entanglement indicator are similar to the reference case of general proteins. The distribution is conserved across different organisms. Differences with respect to the reference set emerge when considering the chirality of entangled motifs. Although the same chirality bias is found for single-winding motifs in both membrane and reference proteins, the bias is reversed, strikingly, for double-winding motifs only in the reference set. We speculate that these observations can be rationalized in terms of the constraints exerted on the nascent chain by the co-translational bio-genesis machinery, which is different for membrane and globular proteins.
Collapse
Affiliation(s)
- Leonardo Salicari
- Department of Physics and Astronomy ‘Galileo Galilei’, University of Padova, Via Marzolo 8, 35031 Padova, PD, Italy
- National Institute of Nuclear Physics (INFN), Padova Section, Via Marzolo 8, 35131 Padova, PD, Italy
| | - Antonio Trovato
- Department of Physics and Astronomy ‘Galileo Galilei’, University of Padova, Via Marzolo 8, 35031 Padova, PD, Italy
- National Institute of Nuclear Physics (INFN), Padova Section, Via Marzolo 8, 35131 Padova, PD, Italy
| |
Collapse
|
24
|
Lin BC, Katneni U, Jankowska KI, Meyer D, Kimchi-Sarfaty C. In silico methods for predicting functional synonymous variants. Genome Biol 2023; 24:126. [PMID: 37217943 PMCID: PMC10204308 DOI: 10.1186/s13059-023-02966-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 05/10/2023] [Indexed: 05/24/2023] Open
Abstract
Single nucleotide variants (SNVs) contribute to human genomic diversity. Synonymous SNVs are previously considered to be "silent," but mounting evidence has revealed that these variants can cause RNA and protein changes and are implicated in over 85 human diseases and cancers. Recent improvements in computational platforms have led to the development of numerous machine-learning tools, which can be used to advance synonymous SNV research. In this review, we discuss tools that should be used to investigate synonymous variants. We provide supportive examples from seminal studies that demonstrate how these tools have driven new discoveries of functional synonymous SNVs.
Collapse
Affiliation(s)
- Brian C Lin
- Hemostasis Branch 1, Division of Hemostasis, Office of Plasma Protein Therapeutics CMC, Office of Therapeutic Products, Center for Biologics Evaluation and Research, US FDA, Silver Spring, MD, USA
| | - Upendra Katneni
- Hemostasis Branch 1, Division of Hemostasis, Office of Plasma Protein Therapeutics CMC, Office of Therapeutic Products, Center for Biologics Evaluation and Research, US FDA, Silver Spring, MD, USA
| | - Katarzyna I Jankowska
- Hemostasis Branch 1, Division of Hemostasis, Office of Plasma Protein Therapeutics CMC, Office of Therapeutic Products, Center for Biologics Evaluation and Research, US FDA, Silver Spring, MD, USA
| | - Douglas Meyer
- Hemostasis Branch 1, Division of Hemostasis, Office of Plasma Protein Therapeutics CMC, Office of Therapeutic Products, Center for Biologics Evaluation and Research, US FDA, Silver Spring, MD, USA
| | - Chava Kimchi-Sarfaty
- Hemostasis Branch 1, Division of Hemostasis, Office of Plasma Protein Therapeutics CMC, Office of Therapeutic Products, Center for Biologics Evaluation and Research, US FDA, Silver Spring, MD, USA.
| |
Collapse
|
25
|
Whisperings from not so silent mutations. Nat Rev Microbiol 2023; 21:221. [PMID: 36781959 DOI: 10.1038/s41579-023-00864-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2023]
|
26
|
Kadjo AE, Eustáquio AS. Bacterial natural product discovery by heterologous expression. J Ind Microbiol Biotechnol 2023; 50:kuad044. [PMID: 38052428 PMCID: PMC10727000 DOI: 10.1093/jimb/kuad044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 12/04/2023] [Indexed: 12/07/2023]
Abstract
Natural products have found important applications in the pharmaceutical and agricultural sectors. In bacteria, the genes that encode the biosynthesis of natural products are often colocalized in the genome, forming biosynthetic gene clusters. It has been predicted that only 3% of natural products encoded in bacterial genomes have been discovered thus far, in part because gene clusters may be poorly expressed under laboratory conditions. Heterologous expression can help convert bioinformatics predictions into products. However, challenges remain, such as gene cluster prioritization, cloning of the complete gene cluster, high level expression, product identification, and isolation of products in practical yields. Here we reviewed the literature from the past 5 years (January 2018 to June 2023) to identify studies that discovered natural products by heterologous expression. From the 50 studies identified, we present analyses of the rationale for gene cluster prioritization, cloning methods, biosynthetic class, source taxa, and host choice. Combined, the 50 studies led to the discovery of 63 new families of natural products, supporting heterologous expression as a promising way to access novel chemistry. However, the success rate of natural product detection varied from 11% to 32% based on four large-scale studies that were part of the reviewed literature. The low success rate makes it apparent that much remains to be improved. The potential reasons for failure and points to be considered to improve the chances of success are discussed. ONE-SENTENCE SUMMARY At least 63 new families of bacterial natural products were discovered using heterologous expression in the last 5 years, supporting heterologous expression as a promising way to access novel chemistry; however, the success rate is low (11-32%) making it apparent that much remains to be improved-we discuss the potential reasons for failure and points to be considered to improve the chances of success. BioRender was used to generate the graphical abstract figure.
Collapse
Affiliation(s)
- Adjo E Kadjo
- Department of Pharmaceutical Sciences, College of Pharmacy, University of Illinois at Chicago, Chicago, IL 60607, USA
- Center for Biomolecular Sciences, College of Pharmacy, University of Illinois at Chicago, Chicago, IL 60607, USA
| | - Alessandra S Eustáquio
- Department of Pharmaceutical Sciences, College of Pharmacy, University of Illinois at Chicago, Chicago, IL 60607, USA
- Center for Biomolecular Sciences, College of Pharmacy, University of Illinois at Chicago, Chicago, IL 60607, USA
| |
Collapse
|
27
|
Do Noncoding and Coding Sites in Angiosperm Chloroplast DNA Have Different Mutation Processes? Genes (Basel) 2023; 14:genes14010148. [PMID: 36672890 PMCID: PMC9858945 DOI: 10.3390/genes14010148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 12/30/2022] [Accepted: 01/03/2023] [Indexed: 01/09/2023] Open
Abstract
Fourfold degenerate sites within coding regions and intergenic sites have both been used as estimates of neutral evolution. In chloroplast DNA, the pattern of substitution at intergenic sites is strongly dependent on the composition of the surrounding hexanucleotide composed of the three base pairs on each side, which suggests that the mutation process is highly context-dependent in this genome. This study examines the context-dependency of substitutions at fourfold degenerate sites in protein-coding regions and compares the pattern to what has been observed at intergenic sites. Overall, there is strong similarity between the two types of sites, but there are some intriguing differences. One of these is that substitutions of G and C are significantly higher at fourfold degenerate sites across a range of contexts. In fact, A → T and T → A substitutions are the only substitution types that occur at a lower rate at fourfold degenerate sites. The data are not consistent with selective constraints being responsible for the difference in substitution patterns between intergenic and fourfold degenerate sites. Rather, it is suggested that the difference may be a result of different epigenetic modifications that result in slightly different mutation patterns in coding and intergenic DNA.
Collapse
|
28
|
Engineering Ag43 Signal Peptides with Bacterial Display and Selection. Methods Protoc 2022; 6:mps6010001. [PMID: 36648950 PMCID: PMC9844295 DOI: 10.3390/mps6010001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 12/13/2022] [Accepted: 12/16/2022] [Indexed: 12/28/2022] Open
Abstract
Protein display, secretion, and export in prokaryotes are essential for utilizing microbial systems as engineered living materials, medicines, biocatalysts, and protein factories. To select for improved signal peptides for Escherichia coli protein display, we utilized error-prone polymerase chain reaction (epPCR) coupled with single-cell sorting and microplate titer to generate, select, and detect improved Ag43 signal peptides. Through just three rounds of mutagenesis and selection using green fluorescence from the 56 kDa sfGFP-beta-lactamase, we isolated clones that modestly increased surface display from 1.4- to 3-fold as detected by the microplate plate-reader and native SDS-PAGE assays. To establish that the functional protein was displayed extracellularly, we trypsinized the bacterial cells to release the surface displayed proteins for analysis. This workflow demonstrated a fast and high-throughput method leveraging epPCR and single-cell sorting to augment bacterial surface display rapidly that could be applied to other bacterial proteins.
Collapse
|