1
|
Laakko T, Korkealaakso A, Yildirir BF, Batys P, Liljeström V, Hokkanen A, Nonappa, Penttilä M, Laukkanen A, Miserez A, Södergård C, Mohammadi P. Accelerated Engineering of ELP-Based Materials through Hybrid Biomimetic-De Novo Predictive Molecular Design. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2312299. [PMID: 38710202 DOI: 10.1002/adma.202312299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 03/28/2024] [Indexed: 05/08/2024]
Abstract
Efforts to engineer high-performance protein-based materials inspired by nature have mostly focused on altering naturally occurring sequences to confer the desired functionalities, whereas de novo design lags significantly behind and calls for unconventional innovative approaches. Here, using partially disordered elastin-like polypeptides (ELPs) as initial building blocks this work shows that de novo engineering of protein materials can be accelerated through hybrid biomimetic design, which this work achieves by integrating computational modeling, deep neural network, and recombinant DNA technology. This generalizable approach involves incorporating a series of de novo-designed sequences with α-helical conformation and genetically encoding them into biologically inspired intrinsically disordered repeating motifs. The new ELP variants maintain structural conformation and showed tunable supramolecular self-assembly out of thermal equilibrium with phase behavior in vitro. This work illustrates the effective translation of the predicted molecular designs in structural and functional materials. The proposed methodology can be applied to a broad range of partially disordered biomacromolecules and potentially pave the way toward the discovery of novel structural proteins.
Collapse
Affiliation(s)
- Timo Laakko
- VTT Technical Research Centre of Finland Ltd., VTT, FI-02044, Finland
| | | | - Burcu Firatligil Yildirir
- Faculty of Engineering and Natural Sciences, Tampere University, Korkeakoulunkatu 6, Tampere, FI-33720, Finland
| | - Piotr Batys
- Jerzy Haber Institute of Catalysis and Surface Chemistry, Polish Academy of Sciences, Niezapominajek 8, Krakow, PL-30239, Poland
| | - Ville Liljeström
- Department of Applied Physics, School of Science, Aalto University, Aalto, FI-00076, Finland
| | - Ari Hokkanen
- VTT Technical Research Centre of Finland Ltd., VTT, FI-02044, Finland
| | - Nonappa
- Faculty of Engineering and Natural Sciences, Tampere University, Korkeakoulunkatu 6, Tampere, FI-33720, Finland
| | - Merja Penttilä
- VTT Technical Research Centre of Finland Ltd., VTT, FI-02044, Finland
| | - Anssi Laukkanen
- VTT Technical Research Centre of Finland Ltd., VTT, FI-02044, Finland
| | - Ali Miserez
- Center for Sustainable Materials (SusMat), School of Materials Science and Engineering, Nanyang Technological University (NTU), Singapore, 637553, Singapore
- School of Biological Sciences, NTU, Singapore, 637551, Singapore
| | - Caj Södergård
- VTT Technical Research Centre of Finland Ltd., VTT, FI-02044, Finland
| | - Pezhman Mohammadi
- VTT Technical Research Centre of Finland Ltd., VTT, FI-02044, Finland
| |
Collapse
|
2
|
Ferruz N, Schmidt S, Höcker B. ProtGPT2 is a deep unsupervised language model for protein design. Nat Commun 2022; 13:4348. [PMID: 35896542 PMCID: PMC9329459 DOI: 10.1038/s41467-022-32007-7] [Citation(s) in RCA: 147] [Impact Index Per Article: 73.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 07/13/2022] [Indexed: 11/29/2022] Open
Abstract
Protein design aims to build novel proteins customized for specific purposes, thereby holding the potential to tackle many environmental and biomedical problems. Recent progress in Transformer-based architectures has enabled the implementation of language models capable of generating text with human-like capabilities. Here, motivated by this success, we describe ProtGPT2, a language model trained on the protein space that generates de novo protein sequences following the principles of natural ones. The generated proteins display natural amino acid propensities, while disorder predictions indicate that 88% of ProtGPT2-generated proteins are globular, in line with natural sequences. Sensitive sequence searches in protein databases show that ProtGPT2 sequences are distantly related to natural ones, and similarity networks further demonstrate that ProtGPT2 is sampling unexplored regions of protein space. AlphaFold prediction of ProtGPT2-sequences yields well-folded non-idealized structures with embodiments and large loops and reveals topologies not captured in current structure databases. ProtGPT2 generates sequences in a matter of seconds and is freely available.
Collapse
Affiliation(s)
- Noelia Ferruz
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany.
- Institute of Informatics and Applications, University of Girona, Girona, Spain.
| | - Steffen Schmidt
- Computational Biochemistry, University of Bayreuth, 95447, Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| |
Collapse
|