1
|
Shanker VR, Bruun TUJ, Hie BL, Kim PS. Unsupervised evolution of protein and antibody complexes with a structure-informed language model. Science 2024; 385:46-53. [PMID: 38963838 DOI: 10.1126/science.adk8946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 05/29/2024] [Indexed: 07/06/2024]
Abstract
Large language models trained on sequence information alone can learn high-level principles of protein design. However, beyond sequence, the three-dimensional structures of proteins determine their specific function, activity, and evolvability. Here, we show that a general protein language model augmented with protein structure backbone coordinates can guide evolution for diverse proteins without the need to model individual functional tasks. We also demonstrate that ESM-IF1, which was only trained on single-chain structures, can be extended to engineer protein complexes. Using this approach, we screened about 30 variants of two therapeutic clinical antibodies used to treat severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. We achieved up to 25-fold improvement in neutralization and 37-fold improvement in affinity against antibody-escaped viral variants of concern BQ.1.1 and XBB.1.5, respectively. These findings highlight the advantage of integrating structural information to identify efficient protein evolution trajectories without requiring any task-specific training data.
Collapse
Affiliation(s)
- Varun R Shanker
- Stanford Biophysics Program, Stanford University School of Medicine, Stanford, CA 94305, USA
- Stanford Medical Scientist Training Program, Stanford University School of Medicine, Stanford, CA 94305, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
| | - Theodora U J Bruun
- Stanford Medical Scientist Training Program, Stanford University School of Medicine, Stanford, CA 94305, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Brian L Hie
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Peter S Kim
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
- Chan Zuckerberg Biohub, San Francisco, CA 94158, USA
| |
Collapse
|
2
|
Shanker VR, Bruun TU, Hie BL, Kim PS. Inverse folding of protein complexes with a structure-informed language model enables unsupervised antibody evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.19.572475. [PMID: 38187780 PMCID: PMC10769282 DOI: 10.1101/2023.12.19.572475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2024]
Abstract
Large language models trained on sequence information alone are capable of learning high level principles of protein design. However, beyond sequence, the three-dimensional structures of proteins determine their specific function, activity, and evolvability. Here we show that a general protein language model augmented with protein structure backbone coordinates and trained on the inverse folding problem can guide evolution for diverse proteins without needing to explicitly model individual functional tasks. We demonstrate inverse folding to be an effective unsupervised, structure-based sequence optimization strategy that also generalizes to multimeric complexes by implicitly learning features of binding and amino acid epistasis. Using this approach, we screened ~30 variants of two therapeutic clinical antibodies used to treat SARS-CoV-2 infection and achieved up to 26-fold improvement in neutralization and 37-fold improvement in affinity against antibody-escaped viral variants-of-concern BQ.1.1 and XBB.1.5, respectively. In addition to substantial overall improvements in protein function, we find inverse folding performs with leading experimental success rates among other reported machine learning-guided directed evolution methods, without requiring any task-specific training data.
Collapse
Affiliation(s)
- Varun R. Shanker
- Stanford Biophysics Program, Stanford University School of Medicine, Stanford, CA 94305, USA
- Stanford Medical Scientist Training Program, Stanford University School of Medicine, Stanford CA 94305, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
| | - Theodora U.J. Bruun
- Stanford Medical Scientist Training Program, Stanford University School of Medicine, Stanford CA 94305, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Brian L. Hie
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Peter S. Kim
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305, USA
- Chan Zuckerberg Biohub, San Francisco, CA 94158, USA
| |
Collapse
|
3
|
Gao Q, Lu S, Wang Y, He L, Wang M, Jia R, Chen S, Zhu D, Liu M, Zhao X, Yang Q, Wu Y, Zhang S, Huang J, Mao S, Ou X, Sun D, Tian B, Cheng A. Bacterial DNA methyltransferase: A key to the epigenetic world with lessons learned from proteobacteria. Front Microbiol 2023; 14:1129437. [PMID: 37032876 PMCID: PMC10073500 DOI: 10.3389/fmicb.2023.1129437] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 02/27/2023] [Indexed: 04/11/2023] Open
Abstract
Epigenetics modulates expression levels of various important genes in both prokaryotes and eukaryotes. These epigenetic traits are heritable without any change in genetic DNA sequences. DNA methylation is a universal mechanism of epigenetic regulation in all kingdoms of life. In bacteria, DNA methylation is the main form of epigenetic regulation and plays important roles in affecting clinically relevant phenotypes, such as virulence, host colonization, sporulation, biofilm formation et al. In this review, we survey bacterial epigenomic studies and focus on the recent developments in the structure, function, and mechanism of several highly conserved bacterial DNA methylases. These methyltransferases are relatively common in bacteria and participate in the regulation of gene expression and chromosomal DNA replication and repair control. Recent advances in sequencing techniques capable of detecting methylation signals have enabled the characterization of genome-wide epigenetic regulation. With their involvement in critical cellular processes, these highly conserved DNA methyltransferases may emerge as promising targets for developing novel epigenetic inhibitors for biomedical applications.
Collapse
Affiliation(s)
- Qun Gao
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
| | - Shuwei Lu
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Yuwei Wang
- Key Laboratory of Livestock and Poultry Provenance Disease Research in Mianyang, Sichuan, China
| | - Longgui He
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Mingshu Wang
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Renyong Jia
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Shun Chen
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Dekang Zhu
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Mafeng Liu
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Xinxin Zhao
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Qiao Yang
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Ying Wu
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Shaqiu Zhang
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Juan Huang
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Sai Mao
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Xumin Ou
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Di Sun
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Bin Tian
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| | - Anchun Cheng
- Research Center of Avian Diseases, College of Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
- Key Laboratory of Animal Disease and Human Health of Sichuan Province, Chengdu, Sichuan, China
- Institute of Preventive Veterinary Medicine, Sichuan Agricultural University, Chengdu, Sichuan, China
| |
Collapse
|