1
|
Opuni KFM, Ruß M, Geens R, Vocht LD, Wielendaele PV, Debuy C, Sterckx YGJ, Glocker MO. Mass spectrometry-complemented molecular modeling predicts the interaction interface for a camelid single-domain antibody targeting the Plasmodium falciparum circumsporozoite protein's C-terminal domain. Comput Struct Biotechnol J 2024; 23:3300-3314. [PMID: 39296809 PMCID: PMC11409006 DOI: 10.1016/j.csbj.2024.08.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2024] [Revised: 08/26/2024] [Accepted: 08/26/2024] [Indexed: 09/21/2024] Open
Abstract
Background Bioanalytical methods that enable rapid and high-detail characterization of binding specificities and strengths of protein complexes with low sample consumption are highly desired. The interaction between a camelid single domain antibody (sdAbCSP1) and its target antigen (PfCSP-Cext) was selected as a model system to provide proof-of-principle for the here described methodology. Research design and methods The structure of the sdAbCSP1 - PfCSP-Cext complex was modeled using AlphaFold2. The recombinantly expressed proteins, sdAbCSP1, PfCSP-Cext, and the sdAbCSP1 - PfCSP-Cext complex, were subjected to limited proteolysis and mass spectrometric peptide analysis. ITEM MS (Intact Transition Epitope Mapping Mass Spectrometry) and ITC (Isothermal Titration Calorimetry) were applied to determine stoichiometry and binding strength. Results The paratope of sdAbCSP1 mainly consists of its CDR3 (aa100-118). PfCSP-Cext's epitope is assembled from its α-helix (aa40-52) and opposing loop (aa83-90). PfCSP-Cext's GluC cleavage sites E46 and E58 were shielded by complex formation, confirming the predicted epitope. Likewise, sdAbCSP1's tryptic cleavage sites R105 and R108 were shielded by complex formation, confirming the predicted paratope. ITEM MS determined the 1:1 stoichiometry and the high complex binding strength, exemplified by the gas phase dissociation reaction enthalpy of 50.2 kJ/mol. The in-solution complex dissociation constant is 5 × 10-10 M. Conclusions Combining AlphaFold2 modeling with mass spectrometry/limited proteolysis generated a trustworthy model for the sdAbCSP1 - PfCSP-Cext complex interaction interface.
Collapse
Affiliation(s)
- Kwabena F M Opuni
- Department of Pharmaceutical Chemistry, School of Pharmacy, College of Health Science, University of Ghana, P.O. Box LG43, Legon, Ghana
| | - Manuela Ruß
- Proteome Center Rostock, University Medicine Rostock and University of Rostock, Schillingallee 69, 18057 Rostock, Germany
| | - Rob Geens
- Laboratory of Medical Biochemistry, Faculty of Pharmaceutical, Biomedical, and Veterinary Sciences, University of Antwerp, Universiteitsplein 1, Wilrijk, 2610 Antwerp, Belgium
| | - Line De Vocht
- Laboratory of Medical Biochemistry, Faculty of Pharmaceutical, Biomedical, and Veterinary Sciences, University of Antwerp, Universiteitsplein 1, Wilrijk, 2610 Antwerp, Belgium
| | - Pieter Van Wielendaele
- Laboratory of Medical Biochemistry, Faculty of Pharmaceutical, Biomedical, and Veterinary Sciences, University of Antwerp, Universiteitsplein 1, Wilrijk, 2610 Antwerp, Belgium
| | - Christophe Debuy
- Laboratory of Medical Biochemistry, Faculty of Pharmaceutical, Biomedical, and Veterinary Sciences, University of Antwerp, Universiteitsplein 1, Wilrijk, 2610 Antwerp, Belgium
| | - Yann G-J Sterckx
- Laboratory of Medical Biochemistry, Faculty of Pharmaceutical, Biomedical, and Veterinary Sciences, University of Antwerp, Universiteitsplein 1, Wilrijk, 2610 Antwerp, Belgium
| | - Michael O Glocker
- Proteome Center Rostock, University Medicine Rostock and University of Rostock, Schillingallee 69, 18057 Rostock, Germany
| |
Collapse
|
2
|
Csikász-Nagy A, Fichó E, Noto S, Reguly I. Computational tools to predict context-specific protein complexes. Curr Opin Struct Biol 2024; 88:102883. [PMID: 38986166 DOI: 10.1016/j.sbi.2024.102883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2024] [Revised: 05/21/2024] [Accepted: 06/19/2024] [Indexed: 07/12/2024]
Abstract
Interactions between thousands of proteins define cells' protein-protein interaction (PPI) network. Some of these interactions lead to the formation of protein complexes. It is challenging to identify a protein complex in a haystack of protein-protein interactions, and it is even more difficult to predict all protein complexes of the complexome. Simulations and machine learning approaches try to crack these problems by looking at the PPI network or predicted protein structures. Clustering of PPI networks led to the first protein complex predictions, while most recently, atomistic models of protein complexes and deep-learning-based structure prediction methods have also emerged. The simulation of PPI level interactions even enables the quantitative prediction of protein complexes. These methods, the required data sources, and their potential future developments are discussed in this review.
Collapse
Affiliation(s)
- Attila Csikász-Nagy
- Cytocast Hungary Kft, Budapest, Hungary; Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary.
| | | | - Santiago Noto
- Cytocast Hungary Kft, Budapest, Hungary; Escola de Matemática Aplicada, Fundação Getúlio Vargas, Rio de Janeiro, Brazil
| | - István Reguly
- Cytocast Hungary Kft, Budapest, Hungary; Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary
| |
Collapse
|
3
|
Son A, Park J, Kim W, Yoon Y, Lee S, Park Y, Kim H. Revolutionizing Molecular Design for Innovative Therapeutic Applications through Artificial Intelligence. Molecules 2024; 29:4626. [PMID: 39407556 PMCID: PMC11477718 DOI: 10.3390/molecules29194626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2024] [Revised: 09/19/2024] [Accepted: 09/27/2024] [Indexed: 10/20/2024] Open
Abstract
The field of computational protein engineering has been transformed by recent advancements in machine learning, artificial intelligence, and molecular modeling, enabling the design of proteins with unprecedented precision and functionality. Computational methods now play a crucial role in enhancing the stability, activity, and specificity of proteins for diverse applications in biotechnology and medicine. Techniques such as deep learning, reinforcement learning, and transfer learning have dramatically improved protein structure prediction, optimization of binding affinities, and enzyme design. These innovations have streamlined the process of protein engineering by allowing the rapid generation of targeted libraries, reducing experimental sampling, and enabling the rational design of proteins with tailored properties. Furthermore, the integration of computational approaches with high-throughput experimental techniques has facilitated the development of multifunctional proteins and novel therapeutics. However, challenges remain in bridging the gap between computational predictions and experimental validation and in addressing ethical concerns related to AI-driven protein design. This review provides a comprehensive overview of the current state and future directions of computational methods in protein engineering, emphasizing their transformative potential in creating next-generation biologics and advancing synthetic biology.
Collapse
Affiliation(s)
- Ahrum Son
- Department of Molecular Medicine, Scripps Research, La Jolla, CA 92037, USA;
| | - Jongham Park
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea; (J.P.); (W.K.); (Y.Y.); (S.L.); (Y.P.)
| | - Woojin Kim
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea; (J.P.); (W.K.); (Y.Y.); (S.L.); (Y.P.)
| | - Yoonki Yoon
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea; (J.P.); (W.K.); (Y.Y.); (S.L.); (Y.P.)
| | - Sangwoon Lee
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea; (J.P.); (W.K.); (Y.Y.); (S.L.); (Y.P.)
| | - Yongho Park
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea; (J.P.); (W.K.); (Y.Y.); (S.L.); (Y.P.)
| | - Hyunsoo Kim
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea; (J.P.); (W.K.); (Y.Y.); (S.L.); (Y.P.)
- Department of Convergent Bioscience and Informatics, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
- Protein AI Design Institute, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
- SCICS, Prove beyond AI, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
| |
Collapse
|
4
|
Rosignoli S, Pacelli M, Manganiello F, Paiardini A. An outlook on structural biology after AlphaFold: tools, limits and perspectives. FEBS Open Bio 2024. [PMID: 39313455 DOI: 10.1002/2211-5463.13902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 08/19/2024] [Accepted: 09/13/2024] [Indexed: 09/25/2024] Open
Abstract
AlphaFold and similar groundbreaking, AI-based tools, have revolutionized the field of structural bioinformatics, with their remarkable accuracy in ab-initio protein structure prediction. This success has catalyzed the development of new software and pipelines aimed at incorporating AlphaFold's predictions, often focusing on addressing the algorithm's remaining challenges. Here, we present the current landscape of structural bioinformatics shaped by AlphaFold, and discuss how the field is dynamically responding to this revolution, with new software, methods, and pipelines. While the excitement around AI-based tools led to their widespread application, it is essential to acknowledge that their practical success hinges on their integration into established protocols within structural bioinformatics, often neglected in the context of AI-driven advancements. Indeed, user-driven intervention is still as pivotal in the structure prediction process as in complementing state-of-the-art algorithms with functional and biological knowledge.
Collapse
Affiliation(s)
- Serena Rosignoli
- Department of Biochemical sciences "A. Rossi Fanelli", Sapienza Università di Roma, Italy
| | - Maddalena Pacelli
- Department of Biochemical sciences "A. Rossi Fanelli", Sapienza Università di Roma, Italy
| | - Francesca Manganiello
- Department of Biochemical sciences "A. Rossi Fanelli", Sapienza Università di Roma, Italy
| | - Alessandro Paiardini
- Department of Biochemical sciences "A. Rossi Fanelli", Sapienza Università di Roma, Italy
| |
Collapse
|
5
|
Rahimzadeh F, Mohammad Khanli L, Salehpoor P, Golabi F, PourBahrami S. Unveiling the evolution of policies for enhancing protein structure predictions: A comprehensive analysis. Comput Biol Med 2024; 179:108815. [PMID: 38986287 DOI: 10.1016/j.compbiomed.2024.108815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 06/09/2024] [Accepted: 06/24/2024] [Indexed: 07/12/2024]
Abstract
Predicting protein structure is both fascinating and formidable, playing a crucial role in structure-based drug discovery and unraveling diseases with elusive origins. The Critical Assessment of Protein Structure Prediction (CASP) serves as a biannual battleground where global scientists converge to untangle the intricate relationships within amino acid chains. Two primary methods, Template-Based Modeling (TBM) and Template-Free (TF) strategies, dominate protein structure prediction. The trend has shifted towards Template-Free predictions due to their broader sequence coverage with fewer templates. The predictive process can be broadly classified into contact map, binned-distance, and real-valued distance predictions, each with distinctive strengths and limitations manifested through tailored loss functions. We have also introduced revolutionary end-to-end, and all-atom diffusion-based techniques that have transformed protein structure predictions. Recent advancements in deep learning techniques have significantly improved prediction accuracy, although the effectiveness is contingent upon the quality of input features derived from natural bio-physiochemical attributes and Multiple Sequence Alignments (MSA). Hence, the generation of high-quality MSA data holds paramount importance in harnessing informative input features for enhanced prediction outcomes. Remarkable successes have been achieved in protein structure prediction accuracy, however not enough for what structural knowledge was intended to, which implies need for development in some other aspects of the predictions. In this regard, scientists have opened other frontiers for protein structural prediction. The utilization of subsampling in multiple sequence alignment (MSA) and protein language modeling appears to be particularly promising in enhancing the accuracy and efficiency of predictions, ultimately aiding in drug discovery efforts. The exploration of predicting protein complex structure also opens up exciting opportunities to deepen our knowledge of molecular interactions and design therapeutics that are more effective. In this article, we have discussed the vicissitudes that the scientists have gone through to improve prediction accuracy, and examined the effective policies in predicting from different aspects, including the construction of high quality MSA, providing informative input features, and progresses in deep learning approaches. We have also briefly touched upon transitioning from predicting single-chain protein structures to predicting protein complex structures. Our findings point towards promoting open research environments to support the objectives of protein structure prediction.
Collapse
Affiliation(s)
- Faezeh Rahimzadeh
- Faculty of Electrical and Computer Engineering, University of Tabriz, Tabriz, Iran
| | | | - Pedram Salehpoor
- Faculty of Electrical and Computer Engineering, University of Tabriz, Tabriz, Iran
| | - Faegheh Golabi
- Department of Biomedical Engineering, Faculty of Advanced Medical Sciences, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Shahin PourBahrami
- Department of Computer Engineering, Technical and Vocational University (TVU), Tehran, Iran
| |
Collapse
|
6
|
Chowdhury MAR, Haq MM, Lee JH, Jeong S. Multi-faceted regulation of CREB family transcription factors. Front Mol Neurosci 2024; 17:1408949. [PMID: 39165717 PMCID: PMC11333461 DOI: 10.3389/fnmol.2024.1408949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Accepted: 07/12/2024] [Indexed: 08/22/2024] Open
Abstract
cAMP response element-binding protein (CREB) is a ubiquitously expressed nuclear transcription factor, which can be constitutively activated regardless of external stimuli or be inducibly activated by external factors such as stressors, hormones, neurotransmitters, and growth factors. However, CREB controls diverse biological processes including cell growth, differentiation, proliferation, survival, apoptosis in a cell-type-specific manner. The diverse functions of CREB appear to be due to CREB-mediated differential gene expression that depends on cAMP response elements and multi-faceted regulation of CREB activity. Indeed, the transcriptional activity of CREB is controlled at several levels including alternative splicing, post-translational modification, dimerization, specific transcriptional co-activators, non-coding small RNAs, and epigenetic regulation. In this review, we present versatile regulatory modes of CREB family transcription factors and discuss their functional consequences.
Collapse
Affiliation(s)
- Md Arifur Rahman Chowdhury
- Department of Bioactive Material Sciences, Jeonbuk National University, Jeonju, Republic of Korea
- Department of Molecular Biology, and Research Center of Bioactive Materials, Jeonbuk National University, Jeonju, Republic of Korea
| | - Md Mazedul Haq
- Department of Bioactive Material Sciences, Jeonbuk National University, Jeonju, Republic of Korea
- Department of Molecular Biology, and Research Center of Bioactive Materials, Jeonbuk National University, Jeonju, Republic of Korea
| | - Jeong Hwan Lee
- Division of Life Sciences, Jeonbuk National University, Jeonju, Republic of Korea
| | - Sangyun Jeong
- Department of Bioactive Material Sciences, Jeonbuk National University, Jeonju, Republic of Korea
- Department of Molecular Biology, and Research Center of Bioactive Materials, Jeonbuk National University, Jeonju, Republic of Korea
| |
Collapse
|
7
|
Kim MJ, Ibrahim MM, Jablonski MM. Deepening insights into cholinergic agents for intraocular pressure reduction: systems genetics, molecular modeling, and in vivo perspectives. Front Mol Biosci 2024; 11:1423351. [PMID: 39130374 PMCID: PMC11310038 DOI: 10.3389/fmolb.2024.1423351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Accepted: 07/08/2024] [Indexed: 08/13/2024] Open
Abstract
Parasympathetic activation in the anterior eye segment regulates various physiological functions. This process, mediated by muscarinic acetylcholine receptors, also impacts intraocular pressure (IOP) through the trabecular meshwork. While FDA-approved M3 muscarinic receptor (M3R) agonists exist for IOP reduction, their systemic cholinergic adverse effects pose limitations in clinical use. Therefore, advancing our understanding of the cholinergic system in the anterior segment of the eye is crucial for developing additional IOP-reducing agents with improved safety profiles. Systems genetics analyses were utilized to explore correlations between IOP and the five major muscarinic receptor subtypes. Molecular docking and dynamics simulations were applied to human M3R homology model using a comprehensive set of human M3R ligands and 1,667 FDA-approved or investigational drugs. Lead compounds from the modeling studies were then tested for their IOP-lowering abilities in mice. Systems genetics analyses unveiled positive correlations in mRNA expressions among the five major muscarinic receptor subtypes, with a negative correlation observed only in M3R with IOP. Through modeling studies, rivastigmine and edrophonium emerged as the most optimally suited cholinergic drugs for reducing IOP via a potentially distinct mechanism from pilocarpine or physostigmine. Subsequent animal studies confirmed comparable IOP reductions among rivastigmine, edrophonium, and pilocarpine, with longer durations of action for rivastigmine and edrophonium. Mild cholinergic adverse effects were observed with pilocarpine and rivastigmine but absent with edrophonium. These findings advance ocular therapeutics, suggesting a more nuanced role of the parasympathetic system in the anterior eye segment for reducing IOP than previously thought.
Collapse
Affiliation(s)
- Minjae J. Kim
- Department of Ophthalmology, The Hamilton Eye Institute, The University of Tennessee Health Science Center, Memphis, TN, United States
| | - Mohamed M. Ibrahim
- Department of Ophthalmology, The Hamilton Eye Institute, The University of Tennessee Health Science Center, Memphis, TN, United States
- Department of Pharmaceutics, Faculty of Pharmacy, Mansoura University, Mansoura, Egypt
| | - Monica M. Jablonski
- Department of Ophthalmology, The Hamilton Eye Institute, The University of Tennessee Health Science Center, Memphis, TN, United States
- Department of Pharmaceutical Sciences, University of Tennessee Health Science Center, Memphis, TN, United States
| |
Collapse
|
8
|
Dahlström KM, Salminen TA. Apprehensions and emerging solutions in ML-based protein structure prediction. Curr Opin Struct Biol 2024; 86:102819. [PMID: 38631107 DOI: 10.1016/j.sbi.2024.102819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 03/05/2024] [Accepted: 03/31/2024] [Indexed: 04/19/2024]
Abstract
The three-dimensional structure of proteins determines their function in vital biological processes. Thus, when the structure is known, the molecular mechanism of protein function can be understood in more detail and obtained information utilized in biotechnological, diagnostics, and therapeutic applications. Over the past five years, machine learning (ML)-based modeling has pushed protein structure prediction to the next level with AlphaFold in the front line, predicting the structure for hundreds of millions of proteins. Further advances recently report promising ML-based approaches for solving remaining challenges by incorporating functionally important metals, co-factors, post-translational modifications, structural dynamics, and interdomain and multimer interactions in the structure prediction process.
Collapse
Affiliation(s)
- Käthe M Dahlström
- Structural Bioinformatics Laboratory, Biochemistry, Faculty of Science and Engineering, Åbo Akademi University, Tykistökatu 6A, 20520 Turku, Finland; InFLAMES Research Flagship Center, Åbo Akademi University, 20520 Turku, Finland
| | - Tiina A Salminen
- Structural Bioinformatics Laboratory, Biochemistry, Faculty of Science and Engineering, Åbo Akademi University, Tykistökatu 6A, 20520 Turku, Finland; InFLAMES Research Flagship Center, Åbo Akademi University, 20520 Turku, Finland.
| |
Collapse
|
9
|
Werren EA, Peirent ER, Jantti H, Guxholli A, Srivastava KR, Orenstein N, Narayanan V, Wiszniewski W, Dawidziuk M, Gawlinski P, Umair M, Khan A, Khan SN, Geneviève D, Lehalle D, van Gassen KLI, Giltay JC, Oegema R, van Jaarsveld RH, Rafiullah R, Rappold GA, Rabin R, Pappas JG, Wheeler MM, Bamshad MJ, Tsan YC, Johnson MB, Keegan CE, Srivastava A, Bielas SL. Biallelic variants in CSMD1 are implicated in a neurodevelopmental disorder with intellectual disability and variable cortical malformations. Cell Death Dis 2024; 15:379. [PMID: 38816421 PMCID: PMC11140003 DOI: 10.1038/s41419-024-06768-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 05/03/2024] [Accepted: 05/22/2024] [Indexed: 06/01/2024]
Abstract
CSMD1 (Cub and Sushi Multiple Domains 1) is a well-recognized regulator of the complement cascade, an important component of the innate immune response. CSMD1 is highly expressed in the central nervous system (CNS) where emergent functions of the complement pathway modulate neural development and synaptic activity. While a genetic risk factor for neuropsychiatric disorders, the role of CSMD1 in neurodevelopmental disorders is unclear. Through international variant sharing, we identified inherited biallelic CSMD1 variants in eight individuals from six families of diverse ancestry who present with global developmental delay, intellectual disability, microcephaly, and polymicrogyria. We modeled CSMD1 loss-of-function (LOF) pathogenesis in early-stage forebrain organoids differentiated from CSMD1 knockout human embryonic stem cells (hESCs). We show that CSMD1 is necessary for neuroepithelial cytoarchitecture and synchronous differentiation. In summary, we identified a critical role for CSMD1 in brain development and biallelic CSMD1 variants as the molecular basis of a previously undefined neurodevelopmental disorder.
Collapse
Affiliation(s)
- Elizabeth A Werren
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA
- Advanced Precision Medicine Laboratory, The Jackson Laboratory for Genomic Medicine, Farmington, CTt, 06032, USA
| | - Emily R Peirent
- Neuroscience Graduate Program, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Henna Jantti
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Alba Guxholli
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA
- Department of Pediatrics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA
| | - Kinshuk Raj Srivastava
- Medicinal and Process Chemistry Division, CSIR-Central Drug Research Institute, Lucknow, 226031, India
| | - Naama Orenstein
- Schneider Children's Medical Center of Israel, Petah Tikva, 4920235, Israel
| | - Vinodh Narayanan
- Center for Rare Childhood Disorders, Translational Genomics Research Institute, Phoenix, AZ, 85004, USA
| | - Wojciech Wiszniewski
- Department of Molecular and Medical Genetics, Oregon Health and Science University, Portland, OR, 97239, USA
| | - Mateusz Dawidziuk
- Department of Medical Genetics, Institute of Mother and Child, Warsaw, 01-211, Poland
| | - Pawel Gawlinski
- Department of Medical Genetics, Institute of Mother and Child, Warsaw, 01-211, Poland
| | - Muhammad Umair
- Medical Genomics Research Department, King Abdullah International Medical Research Center, King Saud Bin Abdulaziz University for Health Sciences, Ministry of National Guard Health Affairs, Riyadh, 11481, Saudi Arabia
- Department of Life Sciences, School of Science, University of Management and Technology, Lahore, Punjab, 54770, Pakistan
| | - Amjad Khan
- Department of Molecular and Medical Genetics, Oregon Health and Science University, Portland, OR, 97239, USA
- Department of Zoology, University of Lakki Marwat, Lakki Marwat, Khyber Pakhtunkhwa, 28420, Pakistan
| | - Shahid Niaz Khan
- Department of Zoology, Kohat University of Science and Technology, Kohat, Pakistan
| | - David Geneviève
- Montpellier University, Inserm Unit U1183, Reference Center for Rare Diseases and Developmental Anomalies, CHU, 34000, Montpellier, France
| | - Daphné Lehalle
- Sorbonne University, Department of Medical Genetics, Hospital Armand Trousseau, 75012, Paris, France
| | - K L I van Gassen
- Department of Genetics, University Medical Centre Utrecht, Utrecht University, Utrecht, 3584 EA, The Netherlands
| | - Jacques C Giltay
- Department of Genetics, University Medical Centre Utrecht, Utrecht University, Utrecht, 3584 EA, The Netherlands
| | - Renske Oegema
- Department of Genetics, University Medical Centre Utrecht, Utrecht University, Utrecht, 3584 EA, The Netherlands
| | - Richard H van Jaarsveld
- Department of Genetics, University Medical Centre Utrecht, Utrecht University, Utrecht, 3584 EA, The Netherlands
| | - Rafiullah Rafiullah
- Department of Biotechnology, Faculty of Life Sciences, BUITEMS, Quetta, 87300, Pakistan
| | - Gudrun A Rappold
- Department of Human Molecular Genetics, Institute of Human Genetics, Ruprecht-Karls-University, Heidelberg, 69120, Germany
| | - Rachel Rabin
- Department of Pediatrics, NYU Grossman School of Medicine, New York, NY, 10016, USA
| | - John G Pappas
- Department of Pediatrics, NYU Grossman School of Medicine, New York, NY, 10016, USA
| | - Marsha M Wheeler
- Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA
| | - Michael J Bamshad
- Department of Pediatrics, University of Washington, Seattle, WA, 98195, USA
- Brotman Baty Institute, Washington, 98195, USA
| | - Yao-Chang Tsan
- Division of Cardiovascular Medicine, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Matthew B Johnson
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Catherine E Keegan
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA
- Department of Pediatrics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA
| | - Anshika Srivastava
- Department of Medical Genetics, Sanjay Gandhi Postgraduate Institute of Medical Sciences, Lucknow, Uttar Pradesh, 226014, India.
| | - Stephanie L Bielas
- Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
- Department of Pediatrics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
| |
Collapse
|
10
|
Wuyun Q, Chen Y, Shen Y, Cao Y, Hu G, Cui W, Gao J, Zheng W. Recent Progress of Protein Tertiary Structure Prediction. Molecules 2024; 29:832. [PMID: 38398585 PMCID: PMC10893003 DOI: 10.3390/molecules29040832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 02/06/2024] [Accepted: 02/08/2024] [Indexed: 02/25/2024] Open
Abstract
The prediction of three-dimensional (3D) protein structure from amino acid sequences has stood as a significant challenge in computational and structural bioinformatics for decades. Recently, the widespread integration of artificial intelligence (AI) algorithms has substantially expedited advancements in protein structure prediction, yielding numerous significant milestones. In particular, the end-to-end deep learning method AlphaFold2 has facilitated the rise of structure prediction performance to new heights, regularly competitive with experimental structures in the 14th Critical Assessment of Protein Structure Prediction (CASP14). To provide a comprehensive understanding and guide future research in the field of protein structure prediction for researchers, this review describes various methodologies, assessments, and databases in protein structure prediction, including traditionally used protein structure prediction methods, such as template-based modeling (TBM) and template-free modeling (FM) approaches; recently developed deep learning-based methods, such as contact/distance-guided methods, end-to-end folding methods, and protein language model (PLM)-based methods; multi-domain protein structure prediction methods; the CASP experiments and related assessments; and the recently released AlphaFold Protein Structure Database (AlphaFold DB). We discuss their advantages, disadvantages, and application scopes, aiming to provide researchers with insights through which to understand the limitations, contexts, and effective selections of protein structure prediction methods in protein-related fields.
Collapse
Affiliation(s)
- Qiqige Wuyun
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Yihan Chen
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin 300071, China;
| | - Yifeng Shen
- Faculty of Environment and Information Studies, Keio University, Fujisawa 252-0882, Kanagawa, Japan;
| | - Yang Cao
- College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Gang Hu
- NITFID, School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin 300071, China
| | - Wei Cui
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin 300071, China;
| | - Jianzhao Gao
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin 300071, China;
| | - Wei Zheng
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
11
|
Peng CX, Liang F, Xia YH, Zhao KL, Hou MH, Zhang GJ. Recent Advances and Challenges in Protein Structure Prediction. J Chem Inf Model 2024; 64:76-95. [PMID: 38109487 DOI: 10.1021/acs.jcim.3c01324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2023]
Abstract
Artificial intelligence has made significant advances in the field of protein structure prediction in recent years. In particular, DeepMind's end-to-end model, AlphaFold2, has demonstrated the capability to predict three-dimensional structures of numerous unknown proteins with accuracy levels comparable to those of experimental methods. This breakthrough has opened up new possibilities for understanding protein structure and function as well as accelerating drug discovery and other applications in the field of biology and medicine. Despite the remarkable achievements of artificial intelligence in the field, there are still some challenges and limitations. In this Review, we discuss the recent progress and some of the challenges in protein structure prediction. These challenges include predicting multidomain protein structures, protein complex structures, multiple conformational states of proteins, and protein folding pathways. Furthermore, we highlight directions in which further improvements can be conducted.
Collapse
Affiliation(s)
- Chun-Xiang Peng
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Fang Liang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Yu-Hao Xia
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Kai-Long Zhao
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Ming-Hua Hou
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Gui-Jun Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| |
Collapse
|