1
|
Jänes J, Beltrao P. Deep learning for protein structure prediction and design-progress and applications. Mol Syst Biol 2024; 20:162-169. [PMID: 38291232 PMCID: PMC10912668 DOI: 10.1038/s44320-024-00016-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 12/21/2023] [Accepted: 01/11/2024] [Indexed: 02/01/2024] Open
Abstract
Proteins are the key molecular machines that orchestrate all biological processes of the cell. Most proteins fold into three-dimensional shapes that are critical for their function. Studying the 3D shape of proteins can inform us of the mechanisms that underlie biological processes in living cells and can have practical applications in the study of disease mutations or the discovery of novel drug treatments. Here, we review the progress made in sequence-based prediction of protein structures with a focus on applications that go beyond the prediction of single monomer structures. This includes the application of deep learning methods for the prediction of structures of protein complexes, different conformations, the evolution of protein structures and the application of these methods to protein design. These developments create new opportunities for research that will have impact across many areas of biomedical research.
Collapse
Affiliation(s)
- Jürgen Jänes
- Institute of Molecular Systems Biology, ETH Zürich, 8093, Zürich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Pedro Beltrao
- Institute of Molecular Systems Biology, ETH Zürich, 8093, Zürich, Switzerland.
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| |
Collapse
|
2
|
Nouchikian L, Fernandez-Martinez D, Renard PY, Sabot C, Duménil G, Rey M, Chamot-Rooke J. Do Not Waste Time─Ensure Success in Your Cross-Linking Mass Spectrometry Experiments before You Begin. Anal Chem 2024; 96:2506-2513. [PMID: 38294351 PMCID: PMC10867798 DOI: 10.1021/acs.analchem.3c04682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 01/09/2024] [Accepted: 01/19/2024] [Indexed: 02/01/2024]
Abstract
Cross-linking mass spectrometry (XL-MS) has become a very useful tool for studying protein complexes and interactions in living systems. It enables the investigation of many large and dynamic assemblies in their native state, providing an unbiased view of their protein interactions and restraints for integrative modeling. More researchers are turning toward trying XL-MS to probe their complexes of interest, especially in their native environments. However, due to the presence of other potentially higher abundant proteins, sufficient cross-links on a system of interest may not be reached to achieve satisfactory structural and interaction information. There are currently no rules for predicting whether XL-MS experiments are likely to work or not; in other words, if a protein complex of interest will lead to useful XL-MS data. Here, we show that a simple iBAQ (intensity-based absolute quantification) analysis performed from trypsin digest data can provide a good understanding of whether proteins of interest are abundant enough to achieve successful cross-linking data. Comparing our findings to large-scale data on diverse systems from several other groups, we show that proteins of interest should be at least in the top 20% abundance range to expect more than one cross-link found per protein. We foresee that this guideline is a good starting point for researchers who would like to use XL-MS to study their protein of interest and help ensure a successful cross-linking experiment from the beginning. Data are available via ProteomeXchange with identifier PXD045792.
Collapse
Affiliation(s)
- Lucienne Nouchikian
- Institut
Pasteur, Université Paris Cité, CNRS UAR 2024, Mass
Spectrometry for Biology Unit, Paris 75015, France
| | - David Fernandez-Martinez
- Institut
Pasteur, Université Paris Cité, INSERM UMR1225, Pathogenesis
of Vascular Infections Unit, Paris 75015, France
| | - Pierre-Yves Renard
- Univ
Rouen Normandie, INSA Rouen Normandie, CNRS, Normandie Univ, COBRA
UMR 6014, INC3M FR 3038, Rouen F-76000, France
| | - Cyrille Sabot
- Univ
Rouen Normandie, INSA Rouen Normandie, CNRS, Normandie Univ, COBRA
UMR 6014, INC3M FR 3038, Rouen F-76000, France
| | - Guillaume Duménil
- Institut
Pasteur, Université Paris Cité, INSERM UMR1225, Pathogenesis
of Vascular Infections Unit, Paris 75015, France
| | - Martial Rey
- Institut
Pasteur, Université Paris Cité, CNRS UAR 2024, Mass
Spectrometry for Biology Unit, Paris 75015, France
| | - Julia Chamot-Rooke
- Institut
Pasteur, Université Paris Cité, CNRS UAR 2024, Mass
Spectrometry for Biology Unit, Paris 75015, France
| |
Collapse
|
3
|
McCafferty CL, Papoulas O, Lee C, Bui KH, Taylor DW, Marcotte EM, Wallingford JB. An amino acid-resolution interactome for motile cilia illuminates the structure and function of ciliopathy protein complexes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.09.548259. [PMID: 37781579 PMCID: PMC10541116 DOI: 10.1101/2023.07.09.548259] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/03/2023]
Abstract
Motile cilia are ancient, evolutionarily conserved organelles whose dysfunction underlies motile ciliopathies, a broad class of human diseases. Motile cilia contain myriad different proteins that assemble into an array of distinct machines, so understanding the interactions and functional hierarchies among them presents an important challenge. Here, we defined the protein interactome of motile axonemes using cross-linking mass spectrometry (XL/MS) in Tetrahymena thermophila. From over 19,000 XLs, we identified 4,757 unique amino acid interactions among 1,143 distinct proteins, providing both macromolecular and atomic-scale insights into diverse ciliary machines, including the Intraflagellar Transport system, axonemal dynein arms, radial spokes, the 96 nm ruler, and microtubule inner proteins, among others. Guided by this dataset, we used vertebrate multiciliated cells to reveal novel functional interactions among several poorly-defined human ciliopathy proteins. The dataset therefore provides a powerful resource for studying the basic biology of an ancient organelle and the molecular etiology of human genetic disease.
Collapse
Affiliation(s)
- Caitlyn L. McCafferty
- Department of Molecular Biosciences, University of Texas, Austin, TX 78712, USA
- Biozentrum, University of Basel, 4056 Basel, Switzerland
| | - Ophelia Papoulas
- Department of Molecular Biosciences, University of Texas, Austin, TX 78712, USA
| | - Chanjae Lee
- Department of Molecular Biosciences, University of Texas, Austin, TX 78712, USA
| | - Khanh Huy Bui
- Department of Anatomy and Cell Biology, Faculty of Medicine and Health Sciences McGill University, Québec, Canada
| | - David W. Taylor
- Department of Molecular Biosciences, University of Texas, Austin, TX 78712, USA
| | - Edward M. Marcotte
- Department of Molecular Biosciences, University of Texas, Austin, TX 78712, USA
| | - John B. Wallingford
- Department of Molecular Biosciences, University of Texas, Austin, TX 78712, USA
| |
Collapse
|
4
|
DiIorio MC, Kulczyk AW. Novel Artificial Intelligence-Based Approaches for Ab Initio Structure Determination and Atomic Model Building for Cryo-Electron Microscopy. MICROMACHINES 2023; 14:1674. [PMID: 37763837 PMCID: PMC10534518 DOI: 10.3390/mi14091674] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Revised: 08/21/2023] [Accepted: 08/25/2023] [Indexed: 09/29/2023]
Abstract
Single particle cryo-electron microscopy (cryo-EM) has emerged as the prevailing method for near-atomic structure determination, shedding light on the important molecular mechanisms of biological macromolecules. However, the inherent dynamics and structural variability of biological complexes coupled with the large number of experimental images generated by a cryo-EM experiment make data processing nontrivial. In particular, ab initio reconstruction and atomic model building remain major bottlenecks that demand substantial computational resources and manual intervention. Approaches utilizing recent innovations in artificial intelligence (AI) technology, particularly deep learning, have the potential to overcome the limitations that cannot be adequately addressed by traditional image processing approaches. Here, we review newly proposed AI-based methods for ab initio volume generation, heterogeneous 3D reconstruction, and atomic model building. We highlight the advancements made by the implementation of AI methods, as well as discuss remaining limitations and areas for future development.
Collapse
Affiliation(s)
- Megan C. DiIorio
- Institute for Quantitative Biomedicine, Rutgers University, 174 Frelinghuysen Road, Piscataway, NJ 08854, USA
| | - Arkadiusz W. Kulczyk
- Institute for Quantitative Biomedicine, Rutgers University, 174 Frelinghuysen Road, Piscataway, NJ 08854, USA
- Department of Biochemistry & Microbiology, Rutgers University, 76 Lipman Drive, New Brunswick, NJ 08901, USA
| |
Collapse
|
5
|
Mondal A, Lenz S, MacCallum JL, Perez A. Hybrid computational methods combining experimental information with molecular dynamics. Curr Opin Struct Biol 2023; 81:102609. [PMID: 37224642 DOI: 10.1016/j.sbi.2023.102609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Revised: 04/12/2023] [Accepted: 04/23/2023] [Indexed: 05/26/2023]
Abstract
A goal of structural biology is to understand how macromolecules carry out their biological roles by identifying their metastable states, mechanisms of action, pathways leading to conformational changes, and the thermodynamic and kinetic relationships between those states. Integrative modeling brings structural insights into systems where traditional structure determination approaches cannot help. We focus on the synergies and challenges of integrative modeling combining experimental data with molecular dynamics simulations.
Collapse
Affiliation(s)
- Arup Mondal
- Quantum Theory Project, Department of Chemistry, University of Florida, Leigh, UK. https://twitter.com/@amondal_chem
| | - Stefan Lenz
- Department of Chemistry, University of Calgary, 2500 University Drive, Canada
| | - Justin L MacCallum
- Department of Chemistry, University of Calgary, 2500 University Drive, Canada. https://twitter.com/@jlmaccal
| | - Alberto Perez
- Quantum Theory Project, Department of Chemistry, University of Florida, Leigh, UK.
| |
Collapse
|
6
|
Habeck M. Bayesian methods in integrative structure modeling. Biol Chem 2023; 404:741-754. [PMID: 37505205 DOI: 10.1515/hsz-2023-0145] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 07/07/2023] [Indexed: 07/29/2023]
Abstract
There is a growing interest in characterizing the structure and dynamics of large biomolecular assemblies and their interactions within the cellular environment. A diverse array of experimental techniques allows us to study biomolecular systems on a variety of length and time scales. These techniques range from imaging with light, X-rays or electrons, to spectroscopic methods, cross-linking mass spectrometry and functional genomics approaches, and are complemented by AI-assisted protein structure prediction methods. A challenge is to integrate all of these data into a model of the system and its functional dynamics. This review focuses on Bayesian approaches to integrative structure modeling. We sketch the principles of Bayesian inference, highlight recent applications to integrative modeling and conclude with a discussion of current challenges and future perspectives.
Collapse
Affiliation(s)
- Michael Habeck
- Microscopic Image Analysis Group, Jena University Hospital, D-07743 Jena, Germany
- Max Planck Institute for Multidisciplinary Sciences, d-37077 Göttingen, Germany
| |
Collapse
|