1
|
Su Z, Griffin B, Emmons S, Wu Y. Prediction of interactions between cell surface proteins by machine learning. Proteins 2024; 92:567-580. [PMID: 38050713 DOI: 10.1002/prot.26648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 11/15/2023] [Accepted: 11/20/2023] [Indexed: 12/06/2023]
Abstract
Cells detect changes in their external environments or communicate with each other through proteins on their surfaces. These cell surface proteins form a complicated network of interactions in order to fulfill their functions. The interactions between cell surface proteins are highly dynamic and, thus, challenging to detect using traditional experimental techniques. Here, we tackle this challenge using a computational framework. The primary focus of the framework is to develop new tools to identify interactions between domains in the immunoglobulin (Ig) fold, which is the most abundant domain family in cell surface proteins. These interactions could be formed between ligands and receptors from different cells or between proteins on the same cell surface. In practice, we collected all structural data on Ig domain interactions and transformed them into an interface fragment pair library. A high-dimensional profile can then be constructed from the library for a given pair of query protein sequences. Multiple machine learning models were used to read this profile so that the probability of interaction between the query proteins could be predicted. We tested our models on an experimentally derived dataset that contains 564 cell surface proteins in humans. The cross-validation results show that we can achieve higher than 70% accuracy in identifying the PPIs within this dataset. We then applied this method to a group of 46 cell surface proteins in Caenorhabditis elegans. We screened every possible interaction between these proteins. Many interactions recognized by our machine learning classifiers have been experimentally confirmed in the literature. In conclusion, our computational platform serves as a useful tool to help identify potential new interactions between cell surface proteins in addition to current state-of-the-art experimental techniques. The tool is freely accessible for use by the scientific community. Moreover, the general framework of the machine learning classification can also be extended to study the interactions of proteins in other domain superfamilies.
Collapse
Affiliation(s)
- Zhaoqian Su
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New York, USA
| | - Brian Griffin
- Department of Genetics, Albert Einstein College of Medicine, Bronx, New York, USA
| | - Scott Emmons
- Department of Genetics, Albert Einstein College of Medicine, Bronx, New York, USA
| | - Yinghao Wu
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New York, USA
| |
Collapse
|
2
|
Tawfeeq C, Song J, Khaniya U, Madej T, Wang J, Youkharibache P, Abrol R. Towards a structural and functional analysis of the immunoglobulin-fold proteome. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2024; 138:135-178. [PMID: 38220423 DOI: 10.1016/bs.apcsb.2023.11.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/16/2024]
Abstract
The immunoglobulin fold (Ig fold) domain is a super-secondary structural motif consisting of a sandwich with two layers of β-sheets that is present in many proteins with very diverse biological functions covering a wide range of physiological processes. This domain presents a modular architecture built with β strands connected by variable length loops that has a highly conserved structural core of four β-strands and quite variable β-sheet extensions in the two sandwich layers that enable both divergent and convergent evolutionary mechanisms in the known Ig fold proteome. The central role of this Ig fold's structural plasticity in the evolutionary success of antibodies in our immune system is well established. Nature has also utilized this Ig fold in all domains of life in many different physiological contexts that go way beyond the immune system. Here we will present a structural and functional overview of the utilization of the Ig fold in different biological processes and in different cellular contexts to highlight some of the innumerable ways that this structural motif can interact in multidomain proteins to enable their diversity of functions. This includes shareable specific protein structure visualizations behind those functions that serve as starting points for further explorations of the biomolecular interactions spanning the Ig fold proteome. This overview also highlights how this Ig fold is being utilized through natural adaptation, engineering, and even building from scratch for a range of biotechnological applications.
Collapse
Affiliation(s)
- Caesar Tawfeeq
- Department of Chemistry and Biochemistry, California State University Northridge, Northridge, United States
| | - James Song
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, United States
| | - Umesh Khaniya
- Cancer Data Science Laboratory, National Cancer Institute, National Institutes of Health, Bethesda, United States
| | - Thomas Madej
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, United States
| | - Jiyao Wang
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, United States
| | - Philippe Youkharibache
- Cancer Data Science Laboratory, National Cancer Institute, National Institutes of Health, Bethesda, United States.
| | - Ravinder Abrol
- Department of Chemistry and Biochemistry, California State University Northridge, Northridge, United States.
| |
Collapse
|
3
|
Young PG, Paynter JM, Wardega JK, Middleditch MJ, Payne LS, Baker EN, Squire CJ. Domain structure and cross-linking in a giant adhesin from the Mobiluncus mulieris bacterium. Acta Crystallogr D Struct Biol 2023; 79:971-979. [PMID: 37860959 PMCID: PMC10619420 DOI: 10.1107/s2059798323007507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 08/27/2023] [Indexed: 10/21/2023] Open
Abstract
Cell-surface proteins known as adhesins enable bacteria to colonize particular environments, and in Gram-positive bacteria often contain autocatalytically formed covalent intramolecular cross-links. While investigating the prevalence of such cross-links, a remarkable example was discovered in Mobiluncus mulieris, a pathogen associated with bacterial vaginosis. This organism encodes a putative adhesin of 7651 residues. Crystallography and mass spectrometry of two selected domains, and AlphaFold structure prediction of the remainder of the protein, were used to show that this adhesin belongs to the family of thioester, isopeptide and ester-bond-containing proteins (TIE proteins). It has an N-terminal domain homologous to thioester adhesion domains, followed by 51 immunoglobulin (Ig)-like domains containing ester- or isopeptide-bond cross-links. The energetic cost to the M. mulieris bacterium in retaining such a large adhesin as a single gene or protein construct suggests a critical role in pathogenicity and/or persistence.
Collapse
Affiliation(s)
- Paul G. Young
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland 1010, New Zealand
- Maurice Wilkins Centre for Molecular Biodiscovery, c/o The University of Auckland, Private Bag 92019, Auckland 1010, New Zealand
| | - Jacob M. Paynter
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland 1010, New Zealand
| | - Julia K. Wardega
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland 1010, New Zealand
| | - Martin J. Middleditch
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland 1010, New Zealand
| | - Leo S. Payne
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland 1010, New Zealand
| | - Edward N. Baker
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland 1010, New Zealand
- Maurice Wilkins Centre for Molecular Biodiscovery, c/o The University of Auckland, Private Bag 92019, Auckland 1010, New Zealand
| | - Christopher J. Squire
- School of Biological Sciences, The University of Auckland, Private Bag 92019, Auckland 1010, New Zealand
- Maurice Wilkins Centre for Molecular Biodiscovery, c/o The University of Auckland, Private Bag 92019, Auckland 1010, New Zealand
| |
Collapse
|
4
|
Campillo-Balderas JA, Lazcano A, Cottom-Salas W, Jácome R, Becerra A. Pangenomic Analysis of Nucleo-Cytoplasmic Large DNA Viruses. I: The Phylogenetic Distribution of Conserved Oxygen-Dependent Enzymes Reveals a Capture-Gene Process. J Mol Evol 2023; 91:647-668. [PMID: 37526693 PMCID: PMC10598087 DOI: 10.1007/s00239-023-10126-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 06/21/2023] [Indexed: 08/02/2023]
Abstract
The Nucleo-Cytoplasmic Large DNA Viruses (NCLDVs) infect a wide range of eukaryotic species, including amoeba, algae, fish, amphibia, arthropods, birds, and mammals. This group of viruses has linear or circular double-stranded DNA genomes whose size spans approximately one order of magnitude, from 100 to 2500 kbp. The ultimate origin of this peculiar group of viruses remains an open issue. Some have argued that NCLDVs' origin may lie in a bacteriophage ancestor that increased its genome size by subsequent recruitment of eukaryotic and bacterial genes. Others have suggested that NCLDVs families originated from cells that underwent an irreversible process of genome reduction. However, the hypothesis that a number of NCLDVs sequences have been recruited from the host genomes has been largely ignored. In the present work, we have performed pangenomic analyses of each of the seven known NCLDVs families. We show that these families' core- and shell genes have cellular homologs, supporting possible escaping-gene events as part of its evolution. Furthermore, the detection of sequences that belong to two protein families (small chain ribonucleotide reductase and Erv1/Air) and to one superfamily [2OG-Fe(II) oxygenases] that are for distribution in all NCLDVs core and shell clusters encoding for oxygen-dependent enzymes suggests that the highly conserved core these viruses originated after the Proterozoic Great Oxidation Event that transformed the terrestrial atmosphere 2.4-2.3 Ga ago.
Collapse
Affiliation(s)
- J A Campillo-Balderas
- Facultad de Ciencias, UNAM, Cd. Universitaria, Apdo. Postal 70-407, 04510, Mexico City, DF, Mexico
| | - A Lazcano
- Facultad de Ciencias, UNAM, Cd. Universitaria, Apdo. Postal 70-407, 04510, Mexico City, DF, Mexico
- El Colegio Nacional, Donceles 104, Centro Histórico, 06020, Mexico City, CP, Mexico
| | - W Cottom-Salas
- Facultad de Ciencias, UNAM, Cd. Universitaria, Apdo. Postal 70-407, 04510, Mexico City, DF, Mexico
- Escuela Nacional Preparatoria, Plantel 8 Miguel E. Schulz, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - R Jácome
- Facultad de Ciencias, UNAM, Cd. Universitaria, Apdo. Postal 70-407, 04510, Mexico City, DF, Mexico
| | - A Becerra
- Facultad de Ciencias, UNAM, Cd. Universitaria, Apdo. Postal 70-407, 04510, Mexico City, DF, Mexico.
| |
Collapse
|
5
|
Yoshida K, Suzuki S, Yuan H, Sato A, Hirata-Tsuchiya S, Saito M, Yamada S, Shiba H. Public RNA-seq data-based identification and functional analyses reveal that MXRA5 retains proliferative and migratory abilities of dental pulp stem cells. Sci Rep 2023; 13:15574. [PMID: 37730838 PMCID: PMC10511426 DOI: 10.1038/s41598-023-42684-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 09/13/2023] [Indexed: 09/22/2023] Open
Abstract
Dental pulp stem cells (DPSC) usually remain quiescent in the dental pulp tissue; however, once the dental pulp tissue is injured, DPSCs potently proliferate and migrate into the injury microenvironment and contribute to immuno-modulation and tissue repair. However, the key molecules that physiologically support the potent proliferation and migration of DPSCs have not been revealed. In this study, we searched publicly available transcriptome raw data sets, which contain comparable (i.e., equivalently cultured) DPSC and mesenchymal stem cell data. Three data sets were extracted from the Gene Expression Omnibus database and then processed and analyzed. MXRA5 was identified as the predominant DPSC-enriched gene associated with the extracellular matrix. MXRA5 is detected in human dental pulp tissues. Loss of MXRA5 drastically decreases the proliferation and migration of DSPCs, concomitantly with reduced expression of the genes associated with the cell cycle and microtubules. In addition to the known full-length isoform of MXRA5, a novel splice variant of MXRA5 was cloned in DPSCs. Recombinant MXRA5 coded by the novel splice variant potently induced the haptotaxis migration of DPSCs, which was inhibited by microtubule inhibitors. Collectively, MXRA5 is a key extracellular matrix protein in dental pulp tissue for maintaining the proliferation and migration of DPSCs.
Collapse
Affiliation(s)
- Kazuma Yoshida
- Department of Biological Endodontics, Graduate School of Biomedical and Health Sciences, Hiroshima University, Hiroshima, 734-8553, Japan
| | - Shigeki Suzuki
- Department of Biological Endodontics, Graduate School of Biomedical and Health Sciences, Hiroshima University, Hiroshima, 734-8553, Japan.
- Department of Periodontology and Endodontology, Tohoku University Graduate School of Dentistry, Sendai, 980-8575, Japan.
| | - Hang Yuan
- Department of Periodontology and Endodontology, Tohoku University Graduate School of Dentistry, Sendai, 980-8575, Japan
| | - Akiko Sato
- Department of Periodontology and Endodontology, Tohoku University Graduate School of Dentistry, Sendai, 980-8575, Japan
| | - Shizu Hirata-Tsuchiya
- Department of Biological Endodontics, Graduate School of Biomedical and Health Sciences, Hiroshima University, Hiroshima, 734-8553, Japan
| | - Masahiro Saito
- Department of Restorative Dentistry, Tohoku University Graduate School of Dentistry, Sendai, 980-8575, Japan
| | - Satoru Yamada
- Department of Periodontology and Endodontology, Tohoku University Graduate School of Dentistry, Sendai, 980-8575, Japan
| | - Hideki Shiba
- Department of Biological Endodontics, Graduate School of Biomedical and Health Sciences, Hiroshima University, Hiroshima, 734-8553, Japan
| |
Collapse
|
6
|
Su Z, Griffin B, Emmons S, Wu Y. Prediction of Interactions between Cell Surface Proteins by Machine Learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.12.557337. [PMID: 37745607 PMCID: PMC10515853 DOI: 10.1101/2023.09.12.557337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Cells detect changes of external environments or communicate with each other through proteins on their surfaces. These cell surface proteins form a complicated network of interactions in order to fulfill their functions. The interactions between cell surface proteins are highly dynamic and thus challenging to detect using traditional experimental techniques. Here we tackle this challenge by a computational framework. The primary focus of the framework is to develop new tools to identify interactions between domains in immunoglobulin (Ig) fold, which is the most abundant domain family in cell surface proteins. These interactions could be formed between ligands and receptors from different cells, or between proteins on the same cell surface. In practice, we collected all structural data of Ig domain interactions and transformed them into an interface fragment pair library. A high dimensional profile can be then constructed from the library for a given pair of query protein sequences. Multiple machine learning models were used to read this profile, so that the probability of interaction between the query proteins can be predicted. We tested our models to an experimentally derived dataset which contains 564 cell surface proteins in human. The cross-validation results show that we can achieve higher than 70% accuracy in identifying the PPIs within this dataset. We then applied this method to a group of 46 cell surface proteins in C elegans. We screened every possible interaction between these proteins. Many interactions recognized by our machine learning classifiers have been experimentally confirmed in the literatures. In conclusion, our computational platform serves a useful tool to help identifying potential new interactions between cell surface proteins in addition to current state-of-the-art experimental techniques. The tool is freely accessible for use by the scientific community. Moreover, the general framework of the machine learning classification can also be extended to study interactions of proteins in other domain superfamilies.
Collapse
Affiliation(s)
- Zhaoqian Su
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461
| | - Brian Griffin
- Department of Genetics, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461
| | - Scott Emmons
- Department of Genetics, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461
| | - Yinghao Wu
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461
| |
Collapse
|
7
|
Wang B, Razavi S, Gamazon ER. Towards mechanistic models of mutational effects: Deep Learning on Alzheimer’s Aβ peptide. Comput Struct Biotechnol J 2023; 21:2434-2445. [PMID: 37090430 PMCID: PMC10114515 DOI: 10.1016/j.csbj.2023.03.051] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2022] [Revised: 03/24/2023] [Accepted: 03/30/2023] [Indexed: 04/03/2023] Open
Abstract
Deep Mutational Scanning (DMS) has enabled multiplexed measurement of mutational effects on protein properties, including kinematics and self-organization, with unprecedented resolution. However, potential bottlenecks of DMS characterization include experimental design, data quality, and depth of mutational coverage. Here, we apply deep learning to comprehensively model the mutational effect of the Alzheimer's Disease associated peptide Aβ42 on aggregation-related biochemical traits from DMS measurements. Among tested neural network architectures, Convolutional Neural Networks and Recurrent Neural Networks are found to be the most cost-effective models with high performance even under insufficiently-sampled DMS studies. While sequence features are essential for satisfactory prediction from neural networks, geometric-structural features further enhance the prediction performance. Notably, we demonstrate how mechanistic insights into phenotype may be extracted from the neural networks themselves suitably designed. This methodological benefit is particularly relevant for biochemical systems displaying a strong coupling between structure and phenotype such as the conformation of Aβ42 aggregate and nucleation, as shown here using a Graph Convolutional Neural Network (GCN) developed from the protein atomic structure input. In addition to accurate imputation of missing values (which here ranged up to 55% of all phenotype values at key residues), the mutationally-defined nucleation phenotype generated from a GCN shows improved resolution for identifying known disease-causing mutations relative to the original DMS phenotype. Our study suggests that neural network derived sequence-phenotype mapping can be exploited not only to provide direct support for protein engineering or genome editing but also to facilitate therapeutic design with the gained perspectives from biological modeling.
Collapse
|
8
|
In Silico Approach Gives Insights into Ig-like Fold Containing Proteins in Vibrio parahaemolyticus: A Focus on the Fibrillar Adhesins. Toxins (Basel) 2022; 14:toxins14020133. [PMID: 35202160 PMCID: PMC8877628 DOI: 10.3390/toxins14020133] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 02/06/2022] [Accepted: 02/08/2022] [Indexed: 12/10/2022] Open
Abstract
Immunoglobulin-like (Ig-like) fold domains are abundant on the surface of bacteria, where they are required for cell-to-cell recognition, adhesion, biofilm formation, and conjugative transfer. Fibrillar adhesins are proteins with Ig-like fold(s) that have filamentous structures at the cell surface, being thinner and more flexible than pili. While the roles of fibrillar adhesins have been proposed in bacteria overall, their characterization in Vibrio parahaemolyticus has not been established and, therefore, understanding about fibrillar adhesins remain limited in V. parahaemolyticus. This in silico analysis can aid in the systematic identification of Ig-like-folded and fibrillar adhesin-like proteins in V. parahaemolyticus, opening new avenues for disease prevention by interfering in microbial interaction between V. parahaemolyticus and the host.
Collapse
|
9
|
Dhusia K, Wu Y. Classification of protein-protein association rates based on biophysical informatics. BMC Bioinformatics 2021; 22:408. [PMID: 34404340 PMCID: PMC8371850 DOI: 10.1186/s12859-021-04323-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Accepted: 08/10/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Proteins form various complexes to carry out their versatile functions in cells. The dynamic properties of protein complex formation are mainly characterized by the association rates which measures how fast these complexes can be formed. It was experimentally observed that the association rates span an extremely wide range with over ten orders of magnitudes. Identification of association rates within this spectrum for specific protein complexes is therefore essential for us to understand their functional roles. RESULTS To tackle this problem, we integrate physics-based coarse-grained simulations into a neural-network-based classification model to estimate the range of association rates for protein complexes in a large-scale benchmark set. The cross-validation results show that, when an optimal threshold was selected, we can reach the best performance with specificity, precision, sensitivity and overall accuracy all higher than 70%. The quality of our cross-validation data has also been testified by further statistical analysis. Additionally, given an independent testing set, we can successfully predict the group of association rates for eight protein complexes out of ten. Finally, the analysis of failed cases suggests the future implementation of conformational dynamics into simulation can further improve model. CONCLUSIONS In summary, this study demonstrated that a new modeling framework that combines biophysical simulations with bioinformatics approaches is able to identify protein-protein interactions with low association rates from those with higher association rates. This method thereby can serve as a useful addition to a collection of existing experimental approaches that measure biomolecular recognition.
Collapse
Affiliation(s)
- Kalyani Dhusia
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461, USA
| | - Yinghao Wu
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461, USA.
| |
Collapse
|
10
|
Carroll-Portillo A, Lin HC. Exploring Mucin as Adjunct to Phage Therapy. Microorganisms 2021; 9:microorganisms9030509. [PMID: 33670927 PMCID: PMC7997181 DOI: 10.3390/microorganisms9030509] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Revised: 02/23/2021] [Accepted: 02/25/2021] [Indexed: 12/21/2022] Open
Abstract
Conventional phage therapy using bacteriophages (phages) for specific targeting of pathogenic bacteria is not always useful as a therapeutic for gastrointestinal (GI) dysfunction. Complex dysbiotic GI disorders such as small intestinal bowel overgrowth (SIBO), ulcerative colitis (UC), or Crohn’s disease (CD) are even more difficult to treat as these conditions have shifts in multiple populations of bacteria within the microbiome. Such community-level structural changes in the gut microbiota may require an alternative to conventional phage therapy such as fecal virome transfer or a phage cocktail capable of targeting multiple bacterial species. Additionally, manipulation of the GI microenvironment may enhance beneficial bacteria–phage interactions during treatment. Mucin, produced along the entire length of the GI tract to protect the underlying mucosa, is a prominent contributor to the GI microenvironment and may facilitate bacteria–phage interactions in multiple ways, potentially serving as an adjunct during phage therapy. In this review, we will describe what is known about the role of mucin within the GI tract and how its facilitation of bacteria–phage interactions should be considered in any effort directed at optimizing effectiveness of a phage therapy for gastrointestinal dysbiosis.
Collapse
Affiliation(s)
- Amanda Carroll-Portillo
- Division of Gastroenterology and Hepatology, Department of Internal Medicine, University of New Mexico, Albuquerque, NM 87131, USA;
| | - Henry C. Lin
- Division of Gastroenterology and Hepatology, Department of Internal Medicine, University of New Mexico, Albuquerque, NM 87131, USA;
- Medicine Service, New Mexico VA Health Care System, Albuquerque, NM 87108, USA
- Correspondence: ; Tel.: +1-505-265-1711 (ext. 4552)
| |
Collapse
|
11
|
Wang B, Su Z, Wu Y. Characterizing the function of domain linkers in regulating the dynamics of multi-domain fusion proteins by microsecond molecular dynamics simulations and artificial intelligence. Proteins 2021; 89:884-895. [PMID: 33620752 DOI: 10.1002/prot.26066] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Revised: 01/20/2021] [Accepted: 02/20/2021] [Indexed: 11/12/2022]
Abstract
Multi-domain proteins are not only formed through natural evolution but can also be generated by recombinant DNA technology. Because many fusion proteins can enhance the selectivity of cell targeting, these artificially produced molecules, called multi-specific biologics, are promising drug candidates, especially for immunotherapy. Moreover, the rational design of domain linkers in fusion proteins is becoming an essential step toward a quantitative understanding of the dynamics in these biopharmaceutics. We developed a computational framework to characterize the impacts of peptide linkers on the dynamics of multi-specific biologics. Specifically, we first constructed a benchmark containing six types of linkers that represent various lengths and degrees of flexibility and used them to connect two natural proteins as a test system. We then projected the microsecond dynamics of these proteins generated from Anton onto a coarse-grained conformational space. We further analyzed the similarity of dynamics among different proteins in this low-dimensional space by a neural-network-based classification model. Finally, we applied hierarchical clustering to place linkers into different subgroups based on the classification results. The clustering results suggest that the length of linkers, which is used to spatially separate different functional modules, plays the most important role in regulating the dynamics of this fusion protein. Given the same number of amino acids, linker flexibility functions as a regulator of protein dynamics. In summary, we illustrated that a new computational strategy can be used to study the dynamics of multi-domain fusion proteins by a combination of long timescale molecular dynamics simulation, coarse-grained feature extraction, and artificial intelligence.
Collapse
Affiliation(s)
- Bo Wang
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New York, USA
| | - Zhaoqian Su
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New York, USA
| | - Yinghao Wu
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New York, USA
| |
Collapse
|
12
|
Cell Communications among Microorganisms, Plants, and Animals: Origin, Evolution, and Interplays. Int J Mol Sci 2020; 21:ijms21218052. [PMID: 33126770 PMCID: PMC7663094 DOI: 10.3390/ijms21218052] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Revised: 10/17/2020] [Accepted: 10/27/2020] [Indexed: 02/06/2023] Open
Abstract
Cellular communications play pivotal roles in multi-cellular species, but they do so also in uni-cellular species. Moreover, cells communicate with each other not only within the same individual, but also with cells in other individuals belonging to the same or other species. These communications occur between two unicellular species, two multicellular species, or between unicellular and multicellular species. The molecular mechanisms involved exhibit diversity and specificity, but they share common basic features, which allow common pathways of communication between different species, often phylogenetically very distant. These interactions are possible by the high degree of conservation of the basic molecular mechanisms of interaction of many ligand-receptor pairs in evolutionary remote species. These inter-species cellular communications played crucial roles during Evolution and must have been positively selected, particularly when collectively beneficial in hostile environments. It is likely that communications between cells did not arise after their emergence, but were part of the very nature of the first cells. Synchronization of populations of non-living protocells through chemical communications may have been a mandatory step towards their emergence as populations of living cells and explain the large commonality of cell communication mechanisms among microorganisms, plants, and animals.
Collapse
|