Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Bouatta N, Sorger P, AlQuraishi M. Protein structure prediction by AlphaFold2: are attention and symmetries all you need? Acta Crystallogr D Struct Biol 2021;77:982-991. [PMID: 34342271 PMCID: PMC8329862 DOI: 10.1107/s2059798321007531] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Accepted: 07/21/2021] [Indexed: 11/11/2022] Open

For:	Bouatta N, Sorger P, AlQuraishi M. Protein structure prediction by AlphaFold2: are attention and symmetries all you need? Acta Crystallogr D Struct Biol 2021;77:982-991. [PMID: 34342271 PMCID: PMC8329862 DOI: 10.1107/s2059798321007531] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Accepted: 07/21/2021] [Indexed: 11/11/2022] Open

Number

Cited by Other Article(s)

Alhumaid NK, Tawfik EA. Reliability of AlphaFold2 Models in Virtual Drug Screening: A Focus on Selected Class A GPCRs. Int J Mol Sci 2024;25:10139. [PMID: 39337622 PMCID: PMC11432040 DOI: 10.3390/ijms251810139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2024] [Revised: 09/19/2024] [Accepted: 09/19/2024] [Indexed: 09/30/2024] Open

Abstract

Protein three-dimensional (3D) structure prediction is one of the most challenging issues in the field of computational biochemistry, which has overwhelmed scientists for almost half a century. A significant breakthrough in structural biology has been established by developing the artificial intelligence (AI) system AlphaFold2 (AF2). The AF2 system provides a state-of-the-art prediction of protein structures from nearly all known protein sequences with high accuracy. This study examined the reliability of AF2 models compared to the experimental structures in drug discovery, focusing on one of the most common protein drug-targeted classes known as G protein-coupled receptors (GPCRs) class A. A total of 32 representative protein targets were selected, including experimental structures of X-ray crystallographic and Cryo-EM structures and their corresponding AF2 models. The quality of AF2 models was assessed using different structure validation tools, including the pLDDT score, RMSD value, MolProbity score, percentage of Ramachandran favored, QMEAN Z-score, and QMEANDisCo Global. The molecular docking was performed using the Genetic Optimization for Ligand Docking (GOLD) software. The AF2 models' reliability in virtual drug screening was determined by their ability to predict the ligand binding poses closest to the native binding pose by assessing the Root Mean Square Deviation (RMSD) metric and docking scoring function. The quality of the docking and scoring function was evaluated using the enrichment factor (EF). Furthermore, the capability of using AF2 models in molecular docking to identify hits with key protein-ligand interactions was analyzed. The posing power results showed that the AF2 models successfully predicted ligand binding poses (RMSD < 2 Å). However, they exhibited lower screening power, with average EF values of 2.24, 2.42, and 1.82 for X-ray, Cryo-EM, and AF2 structures, respectively. Moreover, our study revealed that molecular docking using AF2 models can identify competitive inhibitors. In conclusion, this study found that AF2 models provided docking results comparable to experimental structures, particularly for certain GPCR targets, and could potentially significantly impact drug discovery.

Collapse

Correa Marrero M, Jänes J, Baptista D, Beltrao P. Integrating Large-Scale Protein Structure Prediction into Human Genetics Research. Annu Rev Genomics Hum Genet 2024;25:123-140. [PMID: 38621234 DOI: 10.1146/annurev-genom-120622-020615] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/17/2024]

Agarwal V, McShan AC. The power and pitfalls of AlphaFold2 for structure prediction beyond rigid globular proteins. Nat Chem Biol 2024;20:950-959. [PMID: 38907110 DOI: 10.1038/s41589-024-01638-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 04/29/2024] [Indexed: 06/23/2024]

Urvas L, Chiesa L, Bret G, Jacquemard C, Kellenberger E. Benchmarking AlphaFold-Generated Structures of Chemokine-Chemokine Receptor Complexes. J Chem Inf Model 2024;64:4587-4600. [PMID: 38809680 DOI: 10.1021/acs.jcim.3c01835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2024]

Duignan TT. The Potential of Neural Network Potentials. ACS PHYSICAL CHEMISTRY AU 2024;4:232-241. [PMID: 38800721 PMCID: PMC11117678 DOI: 10.1021/acsphyschemau.4c00004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 03/04/2024] [Accepted: 03/05/2024] [Indexed: 05/29/2024]

Pun MN, Ivanov A, Bellamy Q, Montague Z, LaMont C, Bradley P, Otwinowski J, Nourmohammad A. Learning the shape of protein microenvironments with a holographic convolutional neural network. Proc Natl Acad Sci U S A 2024;121:e2300838121. [PMID: 38300863 PMCID: PMC10861886 DOI: 10.1073/pnas.2300838121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Accepted: 11/29/2023] [Indexed: 02/03/2024] Open

Versini R, Sritharan S, Aykac Fas B, Tubiana T, Aimeur SZ, Henri J, Erard M, Nüsse O, Andreani J, Baaden M, Fuchs P, Galochkina T, Chatzigoulas A, Cournia Z, Santuz H, Sacquin-Mora S, Taly A. A Perspective on the Prospective Use of AI in Protein Structure Prediction. J Chem Inf Model 2024;64:26-41. [PMID: 38124369 DOI: 10.1021/acs.jcim.3c01361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]

Abstract

AlphaFold2 (AF2) and RoseTTaFold (RF) have revolutionized structural biology, serving as highly reliable and effective methods for predicting protein structures. This article explores their impact and limitations, focusing on their integration into experimental pipelines and their application in diverse protein classes, including membrane proteins, intrinsically disordered proteins (IDPs), and oligomers. In experimental pipelines, AF2 models help X-ray crystallography in resolving the phase problem, while complementarity with mass spectrometry and NMR data enhances structure determination and protein flexibility prediction. Predicting the structure of membrane proteins remains challenging for both AF2 and RF due to difficulties in capturing conformational ensembles and interactions with the membrane. Improvements in incorporating membrane-specific features and predicting the structural effect of mutations are crucial. For intrinsically disordered proteins, AF2's confidence score (pLDDT) serves as a competitive disorder predictor, but integrative approaches including molecular dynamics (MD) simulations or hydrophobic cluster analyses are advocated for accurate dynamics representation. AF2 and RF show promising results for oligomeric models, outperforming traditional docking methods, with AlphaFold-Multimer showing improved performance. However, some caveats remain in particular for membrane proteins. Real-life examples demonstrate AF2's predictive capabilities in unknown protein structures, but models should be evaluated for their agreement with experimental data. Furthermore, AF2 models can be used complementarily with MD simulations. In this Perspective, we propose a "wish list" for improving deep-learning-based protein folding prediction models, including using experimental data as constraints and modifying models with binding partners or post-translational modifications. Additionally, a meta-tool for ranking and suggesting composite models is suggested, driving future advancements in this rapidly evolving field.

Collapse

Affiliation(s)

Raphaelle Versini Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
Sujith Sritharan Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
Burcu Aykac Fas Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
Thibault Tubiana Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
Sana Zineb Aimeur Université Paris-Saclay, CNRS, Institut de Chimie Physique, 91405 Orsay, France
Julien Henri Sorbonne Université, CNRS, Laboratoire de Biologie, Computationnelle et Quantitative UMR 7238, Institut de Biologie Paris-Seine, 4 Place Jussieu, F-75005 Paris, France
Marie Erard Université Paris-Saclay, CNRS, Institut de Chimie Physique, 91405 Orsay, France
Oliver Nüsse Université Paris-Saclay, CNRS, Institut de Chimie Physique, 91405 Orsay, France
Jessica Andreani Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
Marc Baaden Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
Patrick Fuchs Sorbonne Université, École Normale Supérieure, PSL University, CNRS, Laboratoire des Biomolécules, LBM, 75005 Paris, France Université de Paris, UFR Sciences du Vivant, 75013 Paris, France
Tatiana Galochkina Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75014 Paris, France
Alexios Chatzigoulas Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, 15784 Athens, Greece
Zoe Cournia Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, 15784 Athens, Greece
Hubert Santuz Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
Sophie Sacquin-Mora Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
Antoine Taly Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France

Collapse

Suskiewicz MJ, Munnur D, Strømland Ø, Yang JC, Easton L, Chatrin C, Zhu K, Baretić D, Goffinont S, Schuller M, Wu WF, Elkins J, Ahel D, Sanyal S, Neuhaus D, Ahel I. Updated protein domain annotation of the PARP protein family sheds new light on biological function. Nucleic Acids Res 2023;51:8217-8236. [PMID: 37326024 PMCID: PMC10450202 DOI: 10.1093/nar/gkad514] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 05/09/2023] [Accepted: 06/03/2023] [Indexed: 06/17/2023] Open

Adhav V, Saikrishnan K. The Realm of Unconventional Noncovalent Interactions in Proteins: Their Significance in Structure and Function. ACS OMEGA 2023;8:22268-22284. [PMID: 37396257 PMCID: PMC10308531 DOI: 10.1021/acsomega.3c00205] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Accepted: 05/22/2023] [Indexed: 07/04/2023]

Bhatia H, Aydin F, Carpenter TS, Lightstone FC, Bremer PT, Ingólfsson HI, Nissley DV, Streitz FH. The confluence of machine learning and multiscale simulations. Curr Opin Struct Biol 2023;80:102569. [PMID: 36966691 DOI: 10.1016/j.sbi.2023.102569] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 01/31/2023] [Accepted: 02/08/2023] [Indexed: 06/04/2023]

Veličković P. Everything is connected: Graph neural networks. Curr Opin Struct Biol 2023;79:102538. [PMID: 36764042 DOI: 10.1016/j.sbi.2023.102538] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 12/28/2022] [Accepted: 01/03/2023] [Indexed: 02/11/2023]

Bertoline LMF, Lima AN, Krieger JE, Teixeira SK. Before and after AlphaFold2: An overview of protein structure prediction. FRONTIERS IN BIOINFORMATICS 2023;3:1120370. [PMID: 36926275 PMCID: PMC10011655 DOI: 10.3389/fbinf.2023.1120370] [Citation(s) in RCA: 41] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Accepted: 02/17/2023] [Indexed: 03/08/2023] Open

Soleymani F, Paquet E, Viktor HL, Michalowski W, Spinello D. ProtInteract: A deep learning framework for predicting protein-protein interactions. Comput Struct Biotechnol J 2023;21:1324-1348. [PMID: 36817951 PMCID: PMC9929211 DOI: 10.1016/j.csbj.2023.01.028] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 01/20/2023] [Accepted: 01/20/2023] [Indexed: 01/26/2023] Open

Abstract

Proteins mainly perform their functions by interacting with other proteins. Protein-protein interactions underpin various biological activities such as metabolic cycles, signal transduction, and immune response. However, due to the sheer number of proteins, experimental methods for finding interacting and non-interacting protein pairs are time-consuming and costly. We therefore developed the ProtInteract framework to predict protein-protein interaction. ProtInteract comprises two components: first, a novel autoencoder architecture that encodes each protein's primary structure to a lower-dimensional vector while preserving its underlying sequence attributes. This leads to faster training of the second network, a deep convolutional neural network (CNN) that receives encoded proteins and predicts their interaction under three different scenarios. In each scenario, the deep CNN predicts the class of a given encoded protein pair. Each class indicates different ranges of confidence scores corresponding to the probability of whether a predicted interaction occurs or not. The proposed framework features significantly low computational complexity and relatively fast response. The contributions of this work are twofold. First, ProtInteract assimilates the protein's primary structure into a pseudo-time series. Therefore, we leverage the nature of the time series of proteins and their physicochemical properties to encode a protein's amino acid sequence into a lower-dimensional vector space. This approach enables extracting highly informative sequence attributes while reducing computational complexity. Second, the ProtInteract framework utilises this information to identify protein interactions with other proteins based on its amino acid configuration. Our results suggest that the proposed framework performs with high accuracy and efficiency in predicting protein-protein interactions.

Collapse

Boonyakida J, Khoris IM, Nasrin F, Park EY. Improvement of Modular Protein Display Efficiency in SpyTag-Implemented Norovirus-like Particles. Biomacromolecules 2023;24:308-318. [PMID: 36475654 DOI: 10.1021/acs.biomac.2c01150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Abstract

Genetic fusion and chemical conjugation are the most common approaches for displaying a foreign protein on the surface of virus-like particles (VLPs); however, these methods may negatively affect the formation and stability of VLPs. Here, we aimed to develop a modular display platform for protein decoration on norovirus-like particles (NoV-LPs) by combining the NoV-LP scaffold with the SpyTag/SpyCatcher bioconjugation system, as the NoV-LP is an attractive protein nanoparticle to carry foreign proteins for various applications. The SpyTagged-NoV-LPs were prepared by introducing SpyTag peptide into the C-terminus of the norovirus VP1 protein. To increase surface exposure of the SpyTag peptide on the NoV-LPs, two or three repeated extension linkers (EAAAK) were inserted between the SpyTag peptide and VP1 protein. Fluorescence proteins, EGFP and mCherry, were fused to SpyCatcher and employed as SpyTag conjugation partners. These VP1-SpyTag variants and SpyCatcher-fused EGFP and mCherry were separately expressed in silkworm fat bodies and purified. This study reveals that adding an extension linker did not disrupt the VLP formation; instead, it increased the particle size by 4-6 nm. The conjugation efficiency of the VP1-SpyTag variants with the extended linker improved from ∼15-35 to ∼50-63% based on the densitometric analysis, while it was up to 77% based on an optical quantification of EGFP and mCherry. Results indicate that the linker causes the SpyTag peptides to be positioned further away from the C-termini of VP1 and potentially increases the exposure of the SpyTag to the outer surface of the NoV-LPs, allowing more SpyTag/SpyCatcher complex formation on the VLP surface. Our study provides a strategy for enhancing the conjugation efficiency of NoV-LP and demonstrates the platform's utility for developing vaccines or functional nanoparticles.

Collapse

Li J, Wang H, Zhu J, Yang Q, Luan Y, Shi L, Molina-Mora JA, Zheng Y. De novo assembly of a chromosome-level reference genome of the ornamental butterfly Sericinus montelus based on nanopore sequencing and Hi-C analysis. Front Genet 2023;14:1107353. [PMID: 36968580 PMCID: PMC10030965 DOI: 10.3389/fgene.2023.1107353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Accepted: 02/27/2023] [Indexed: 03/29/2023] Open

Nallasamy V, Seshiah M. Energy Profile Bayes and Thompson Optimized Convolutional Neural Network protein structure prediction. Neural Comput Appl 2023;35:1983-2006. [PMID: 36245797 PMCID: PMC9542649 DOI: 10.1007/s00521-022-07868-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 09/21/2022] [Indexed: 01/12/2023]

Abstract

In living organisms, proteins are considered as the executants of biological functions. Owing to its pivotal role played in protein folding patterns, comprehension of protein structure is a challenging issue. Moreover, owing to numerous protein sequence exploration in protein data banks and complication of protein structures, experimental methods are found to be inadequate for protein structural class prediction. Hence, it is very much advantageous to design a reliable computational method to predict protein structural classes from protein sequences. In the recent few years there has been an elevated interest in using deep learning to assist protein structure prediction as protein structure prediction models can be utilized to screen a large number of novel sequences. In this regard, we propose a model employing Energy Profile for atom pairs in conjunction with the Legion-Class Bayes function called Energy Profile Legion-Class Bayes Protein Structure Identification model. Followed by this, we use a Thompson Optimized convolutional neural network to extract features between amino acids and then the Thompson Optimized SoftMax function is employed to extract associations between protein sequences for predicting secondary protein structure. The proposed Energy Profile Bayes and Thompson Optimized Convolutional Neural Network (EPB-OCNN) method tested distinct unique protein data and was compared to the state-of-the-art methods, the Template-Based Modeling, Protein Design using Deep Graph Neural Networks, a deep learning-based S-glutathionylation sites prediction tool called a Computational Framework, the Deep Learning and a distance-based protein structure prediction using deep learning. The results obtained when applied with the Biopython tool with respect to protein structure prediction time, protein structure prediction accuracy, specificity, recall, F-measure, and precision, respectively, are measured. The proposed EPB-OCNN method outperformed the state-of-the-art methods, thereby corroborating the objective.

Collapse

Dephospho-Coenzyme A Kinase Is an Exploitable Drug Target against Plasmodium falciparum: Identification of Selective Inhibitors by High-Throughput Screening of a Large Chemical Compound Library. Antimicrob Agents Chemother 2022;66:e0042022. [DOI: 10.1128/aac.00420-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Nussinov R, Zhang M, Liu Y, Jang H. AlphaFold, Artificial Intelligence (AI), and Allostery. J Phys Chem B 2022;126:6372-6383. [PMID: 35976160 PMCID: PMC9442638 DOI: 10.1021/acs.jpcb.2c04346] [Citation(s) in RCA: 42] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 08/03/2022] [Indexed: 02/08/2023]

Computation-Aided Design of Albumin Affibody-Inserted Antibody Fragment for the Prolonged Serum Half-Life. Pharmaceutics 2022;14:pharmaceutics14091769. [PMID: 36145517 PMCID: PMC9500697 DOI: 10.3390/pharmaceutics14091769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 08/17/2022] [Accepted: 08/23/2022] [Indexed: 11/16/2022] Open

Ma Q, Lei H, Cao Y. Intramolecular covalent bonds in Gram-positive bacterial surface proteins. Chembiochem 2022;23:e202200316. [PMID: 35801833 DOI: 10.1002/cbic.202200316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Revised: 07/07/2022] [Indexed: 11/09/2022]

Pelosi B. Developing a bioinformatics pipeline for comparative protein classification analysis. BMC Genom Data 2022;23:43. [PMID: 35668373 PMCID: PMC9172112 DOI: 10.1186/s12863-022-01045-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Accepted: 03/11/2022] [Indexed: 11/13/2022] Open

Abstract

BACKGROUND

Protein classification is a task of paramount importance in various fields of biology. Despite the great momentum of modern implementation of protein classification, machine learning techniques such as Random Forest and Neural Network could not always be used for several reasons: data collection, unbalanced classification or labelling of the data.As an alternative, I propose the use of a bioinformatics pipeline to search for and classify information from protein databases. Hence, to evaluate the efficiency and accuracy of the pipeline, I focused on the carotenoid biosynthetic genes and developed a filtering approach to retrieve orthologs clusters in two well-studied plants that belong to the Brassicaceae family: Arabidopsis thaliana and Brassica rapa Pekinensis group. The result obtained has been compared with previous studies on carotenoid biosynthetic genes in B. rapa where phylogenetic analysis was conducted.

RESULTS

The developed bioinformatics pipeline relies on commercial software and multiple databeses including the use of phylogeny, Gene Ontology terms (GOs) and Protein Families (Pfams) at a protein level. Furthermore, the phylogeny is coupled with "population analysis" to evaluate the potential orthologs. All the steps taken together give a final table of potential orthologs. The phylogenetic tree gives a result of 43 putative orthologs conserved in B. rapa Pekinensis group. Different A. thaliana proteins have more than one syntenic ortholog as also shown in a previous finding (Li et al., BMC Genomics 16(1):1-11, 2015).

CONCLUSIONS

This study demonstrates that, when the biological features of proteins of interest are not specific, I can rely on a computational approach in filtering steps for classification purposes. The comparison of the results obtained here for the carotenoid biosynthetic genes with previous research confirmed the accuracy of the developed pipeline which can therefore be applied for filtering different types of datasets.

Collapse

Nishihara A, Morimoto N, Sumiyoshi T, Yasumoto S, Kondo M, Kono T, Sakai M, Hikima JI. Inhibition of lysozyme lytic activity by Ivy derived from Photobacterium damselae subsp. piscicida. FISH & SHELLFISH IMMUNOLOGY 2022;124:280-288. [PMID: 35421575 DOI: 10.1016/j.fsi.2022.04.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 04/08/2022] [Accepted: 04/09/2022] [Indexed: 06/14/2023]

Vacuolar Protein-Sorting Receptor MoVps13 Regulates Conidiation and Pathogenicity in Rice Blast Fungus Magnaporthe oryzae. J Fungi (Basel) 2021;7:jof7121084. [PMID: 34947066 PMCID: PMC8708568 DOI: 10.3390/jof7121084] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Revised: 12/04/2021] [Accepted: 12/16/2021] [Indexed: 01/18/2023] Open

Perrakis A, Sixma TK. AI revolutions in biology: The joys and perils of AlphaFold. EMBO Rep 2021;22:e54046. [PMID: 34668287 PMCID: PMC8567224 DOI: 10.15252/embr.202154046] [Citation(s) in RCA: 82] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Accepted: 10/05/2021] [Indexed: 11/30/2022] Open

David A, Islam S, Tankhilevich E, Sternberg MJE. The AlphaFold Database of Protein Structures: A Biologist's Guide. J Mol Biol 2021;434:167336. [PMID: 34757056 PMCID: PMC8783046 DOI: 10.1016/j.jmb.2021.167336] [Citation(s) in RCA: 118] [Impact Index Per Article: 39.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 10/25/2021] [Accepted: 10/26/2021] [Indexed: 01/06/2023]

AlQuraishi M, Sorger PK. Differentiable biology: using deep learning for biophysics-based and data-driven modeling of molecular mechanisms. Nat Methods 2021;18:1169-1180. [PMID: 34608321 PMCID: PMC8793939 DOI: 10.1038/s41592-021-01283-4] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Accepted: 08/27/2021] [Indexed: 02/08/2023]

Kell DB. The Transporter-Mediated Cellular Uptake and Efflux of Pharmaceutical Drugs and Biotechnology Products: How and Why Phospholipid Bilayer Transport Is Negligible in Real Biomembranes. Molecules 2021;26:5629. [PMID: 34577099 PMCID: PMC8470029 DOI: 10.3390/molecules26185629] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 09/03/2021] [Accepted: 09/14/2021] [Indexed: 12/12/2022] Open

Abstract

Over the years, my colleagues and I have come to realise that the likelihood of pharmaceutical drugs being able to diffuse through whatever unhindered phospholipid bilayer may exist in intact biological membranes in vivo is vanishingly low. This is because (i) most real biomembranes are mostly protein, not lipid, (ii) unlike purely lipid bilayers that can form transient aqueous channels, the high concentrations of proteins serve to stop such activity, (iii) natural evolution long ago selected against transport methods that just let any undesirable products enter a cell, (iv) transporters have now been identified for all kinds of molecules (even water) that were once thought not to require them, (v) many experiments show a massive variation in the uptake of drugs between different cells, tissues, and organisms, that cannot be explained if lipid bilayer transport is significant or if efflux were the only differentiator, and (vi) many experiments that manipulate the expression level of individual transporters as an independent variable demonstrate their role in drug and nutrient uptake (including in cytotoxicity or adverse drug reactions). This makes such transporters valuable both as a means of targeting drugs (not least anti-infectives) to selected cells or tissues and also as drug targets. The same considerations apply to the exploitation of substrate uptake and product efflux transporters in biotechnology. We are also beginning to recognise that transporters are more promiscuous, and antiporter activity is much more widespread, than had been realised, and that such processes are adaptive (i.e., were selected by natural evolution). The purpose of the present review is to summarise the above, and to rehearse and update readers on recent developments. These developments lead us to retain and indeed to strengthen our contention that for transmembrane pharmaceutical drug transport "phospholipid bilayer transport is negligible".

Collapse