1
|
Song B, Wang K, Na S, Yao J, Fattah FJ, von Itzstein MS, Yang DM, Liu J, Xue Y, Liang C, Guo Y, Raman I, Zhu C, Dowell JE, Homsi J, Rashdan S, Yang S, Gwin ME, Hsiehchen D, Gloria-McCutchen Y, Raj P, Bai X, Wang J, Conejo-Garcia J, Xie Y, Gerber DE, Huang J, Wang T. Cmai: Predicting Antigen-Antibody Interactions from Massive Sequencing Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.27.601035. [PMID: 39005456 PMCID: PMC11244862 DOI: 10.1101/2024.06.27.601035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
The interaction between antigens and antibodies (B cell receptors, BCRs) is the key step underlying the function of the humoral immune system in various biological contexts. The capability to profile the landscape of antigen-binding affinity of a vast number of BCRs will provide a powerful tool to reveal novel insights at unprecedented levels and will yield powerful tools for translational development. However, current experimental approaches for profiling antibody-antigen interactions are costly and time-consuming, and can only achieve low-to-mid throughput. On the other hand, bioinformatics tools in the field of antibody informatics mostly focus on optimization of antibodies given known binding antigens, which is a very different research question and of limited scope. In this work, we developed an innovative Artificial Intelligence tool, Cmai, to address the prediction of the binding between antibodies and antigens that can be scaled to high-throughput sequencing data. Cmai achieved an AUROC of 0.91 in our validation cohort. We devised a biomarker metric based on the output from Cmai applied to high-throughput BCR sequencing data. We found that, during immune-related adverse events (irAEs) caused by immune-checkpoint inhibitor (ICI) treatment, the humoral immunity is preferentially responsive to intracellular antigens from the organs affected by the irAEs. In contrast, extracellular antigens on malignant tumor cells are inducing B cell infiltrations, and the infiltrating B cells have a greater tendency to co-localize with tumor cells expressing these antigens. We further found that the abundance of tumor antigen-targeting antibodies is predictive of ICI treatment response. Overall, Cmai and our biomarker approach filled in a gap that is not addressed by current antibody optimization works nor works such as AlphaFold3 that predict the structures of complexes of proteins that are known to bind.
Collapse
|
2
|
Gordon GL, Raybould MIJ, Wong A, Deane CM. Prospects for the computational humanization of antibodies and nanobodies. Front Immunol 2024; 15:1399438. [PMID: 38812514 PMCID: PMC11133524 DOI: 10.3389/fimmu.2024.1399438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 05/02/2024] [Indexed: 05/31/2024] Open
Abstract
To be viable therapeutics, antibodies must be tolerated by the human immune system. Rational approaches to reduce the risk of unwanted immunogenicity involve maximizing the 'humanness' of the candidate drug. However, despite the emergence of new discovery technologies, many of which start from entirely human gene fragments, most antibody therapeutics continue to be derived from non-human sources with concomitant humanization to increase their human compatibility. Early experimental humanization strategies that focus on CDR loop grafting onto human frameworks have been critical to the dominance of this discovery route but do not consider the context of each antibody sequence, impacting their success rate. Other challenges include the simultaneous optimization of other drug-like properties alongside humanness and the humanization of fundamentally non-human modalities such as nanobodies. Significant efforts have been made to develop in silico methodologies able to address these issues, most recently incorporating machine learning techniques. Here, we outline these recent advancements in antibody and nanobody humanization, focusing on computational strategies that make use of the increasing volume of sequence and structural data available and the validation of these tools. We highlight that structural distinctions between antibodies and nanobodies make the application of antibody-focused in silico tools to nanobody humanization non-trivial. Furthermore, we discuss the effects of humanizing mutations on other essential drug-like properties such as binding affinity and developability, and methods that aim to tackle this multi-parameter optimization problem.
Collapse
Affiliation(s)
| | | | | | - Charlotte M. Deane
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
3
|
Gallo E. The rise of big data: deep sequencing-driven computational methods are transforming the landscape of synthetic antibody design. J Biomed Sci 2024; 31:29. [PMID: 38491519 PMCID: PMC10943851 DOI: 10.1186/s12929-024-01018-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 03/05/2024] [Indexed: 03/18/2024] Open
Abstract
Synthetic antibodies (Abs) represent a category of artificial proteins capable of closely emulating the functions of natural Abs. Their in vitro production eliminates the need for an immunological response, streamlining the process of Ab discovery, engineering, and development. These artificially engineered Abs offer novel approaches to antigen recognition, paratope site manipulation, and biochemical/biophysical enhancements. As a result, synthetic Abs are fundamentally reshaping conventional methods of Ab production. This mirrors the revolution observed in molecular biology and genomics as a result of deep sequencing, which allows for the swift and cost-effective sequencing of DNA and RNA molecules at scale. Within this framework, deep sequencing has enabled the exploration of whole genomes and transcriptomes, including particular gene segments of interest. Notably, the fusion of synthetic Ab discovery with advanced deep sequencing technologies is redefining the current approaches to Ab design and development. Such combination offers opportunity to exhaustively explore Ab repertoires, fast-tracking the Ab discovery process, and enhancing synthetic Ab engineering. Moreover, advanced computational algorithms have the capacity to effectively mine big data, helping to identify Ab sequence patterns/features hidden within deep sequencing Ab datasets. In this context, these methods can be utilized to predict novel sequence features thereby enabling the successful generation of de novo Ab molecules. Hence, the merging of synthetic Ab design, deep sequencing technologies, and advanced computational models heralds a new chapter in Ab discovery, broadening our comprehension of immunology and streamlining the advancement of biological therapeutics.
Collapse
Affiliation(s)
- Eugenio Gallo
- Department of Medicinal Chemistry, Avance Biologicals, 950 Dupont Street, Toronto, ON, M6H 1Z2, Canada.
- Department of Protein Engineering, RevivAb, Av. Ipiranga, 6681, Partenon, Porto Alegre, RS, 90619-900, Brazil.
| |
Collapse
|
4
|
Mock M, Langmead CJ, Grandsard P, Edavettal S, Russell A. Recent advances in generative biology for biotherapeutic discovery. Trends Pharmacol Sci 2024; 45:255-267. [PMID: 38378385 DOI: 10.1016/j.tips.2024.01.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 12/22/2023] [Accepted: 01/05/2024] [Indexed: 02/22/2024]
Abstract
Generative biology combines artificial intelligence (AI), advanced life sciences technologies, and automation to revolutionize the process of designing novel biomolecules with prescribed properties, giving drug discoverers the ability to escape the limitations of biology during the design of next-generation protein therapeutics. Significant hurdles remain, namely: (i) the inherently complex nature of drug discovery, (ii) the bewildering number of promising computational and experimental techniques that have emerged in the past several years, and (iii) the limited availability of relevant protein sequence-function data for drug-like molecules. There is a need to focus on computational methods that will be most practically effective for protein drug discovery and on building experimental platforms to generate the data most appropriate for these methods. Here, we discuss recent advances in computational and experimental life sciences that are most crucial for impacting the pace and success of protein drug discovery.
Collapse
Affiliation(s)
- Marissa Mock
- Amgen Research, Amgen Inc., One Amgen Center Drive, Thousand Oaks, CA 91320, USA
| | | | - Peter Grandsard
- Amgen Research, Amgen Inc., One Amgen Center Drive, Thousand Oaks, CA 91320, USA
| | - Suzanne Edavettal
- Amgen Research, Amgen Inc., One Amgen Center Drive, Thousand Oaks, CA 91320, USA
| | - Alan Russell
- Amgen Research, Amgen Inc., One Amgen Center Drive, Thousand Oaks, CA 91320, USA.
| |
Collapse
|
5
|
Kim DN, McNaughton AD, Kumar N. Leveraging Artificial Intelligence to Expedite Antibody Design and Enhance Antibody-Antigen Interactions. Bioengineering (Basel) 2024; 11:185. [PMID: 38391671 PMCID: PMC10886287 DOI: 10.3390/bioengineering11020185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 01/30/2024] [Accepted: 02/06/2024] [Indexed: 02/24/2024] Open
Abstract
This perspective sheds light on the transformative impact of recent computational advancements in the field of protein therapeutics, with a particular focus on the design and development of antibodies. Cutting-edge computational methods have revolutionized our understanding of protein-protein interactions (PPIs), enhancing the efficacy of protein therapeutics in preclinical and clinical settings. Central to these advancements is the application of machine learning and deep learning, which offers unprecedented insights into the intricate mechanisms of PPIs and facilitates precise control over protein functions. Despite these advancements, the complex structural nuances of antibodies pose ongoing challenges in their design and optimization. Our review provides a comprehensive exploration of the latest deep learning approaches, including language models and diffusion techniques, and their role in surmounting these challenges. We also present a critical analysis of these methods, offering insights to drive further progress in this rapidly evolving field. The paper includes practical recommendations for the application of these computational techniques, supplemented with independent benchmark studies. These studies focus on key performance metrics such as accuracy and the ease of program execution, providing a valuable resource for researchers engaged in antibody design and development. Through this detailed perspective, we aim to contribute to the advancement of antibody design, equipping researchers with the tools and knowledge to navigate the complexities of this field.
Collapse
Affiliation(s)
- Doo Nam Kim
- Pacific Northwest National Laboratory, 902 Battelle Blvd., Richland, WA 99352, USA
| | - Andrew D McNaughton
- Pacific Northwest National Laboratory, 902 Battelle Blvd., Richland, WA 99352, USA
| | - Neeraj Kumar
- Pacific Northwest National Laboratory, 902 Battelle Blvd., Richland, WA 99352, USA
| |
Collapse
|
6
|
Notin P, Rollins N, Gal Y, Sander C, Marks D. Machine learning for functional protein design. Nat Biotechnol 2024; 42:216-228. [PMID: 38361074 DOI: 10.1038/s41587-024-02127-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Accepted: 01/05/2024] [Indexed: 02/17/2024]
Abstract
Recent breakthroughs in AI coupled with the rapid accumulation of protein sequence and structure data have radically transformed computational protein design. New methods promise to escape the constraints of natural and laboratory evolution, accelerating the generation of proteins for applications in biotechnology and medicine. To make sense of the exploding diversity of machine learning approaches, we introduce a unifying framework that classifies models on the basis of their use of three core data modalities: sequences, structures and functional labels. We discuss the new capabilities and outstanding challenges for the practical design of enzymes, antibodies, vaccines, nanomachines and more. We then highlight trends shaping the future of this field, from large-scale assays to more robust benchmarks, multimodal foundation models, enhanced sampling strategies and laboratory automation.
Collapse
Affiliation(s)
- Pascal Notin
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA.
- Department of Computer Science, University of Oxford, Oxford, UK.
| | | | - Yarin Gal
- Department of Computer Science, University of Oxford, Oxford, UK
| | - Chris Sander
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Debora Marks
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA.
- Broad Institute of Harvard and MIT, Cambridge, MA, USA.
| |
Collapse
|
7
|
Irvine EB, Reddy ST. Advancing Antibody Engineering through Synthetic Evolution and Machine Learning. JOURNAL OF IMMUNOLOGY (BALTIMORE, MD. : 1950) 2024; 212:235-243. [PMID: 38166249 DOI: 10.4049/jimmunol.2300492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 10/20/2023] [Indexed: 01/04/2024]
Abstract
Abs are versatile molecules with the potential to achieve exceptional binding to target Ags, while also possessing biophysical properties suitable for therapeutic drug development. Protein display and directed evolution systems have transformed synthetic Ab discovery, engineering, and optimization, vastly expanding the number of Ab clones able to be experimentally screened for binding. Moreover, the burgeoning integration of high-throughput screening, deep sequencing, and machine learning has further augmented in vitro Ab optimization, promising to accelerate the design process and massively expand the Ab sequence space interrogated. In this Brief Review, we discuss the experimental and computational tools employed in synthetic Ab engineering and optimization. We also explore the therapeutic challenges posed by developing Abs for infectious diseases, and the prospects for leveraging machine learning-guided protein engineering to prospectively design Abs resistant to viral escape.
Collapse
Affiliation(s)
- Edward B Irvine
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| | - Sai T Reddy
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland
| |
Collapse
|
8
|
Li B, Chen H, Huang J, He B. CD47Binder: Identify CD47 Binding Peptides by Combining Next-Generation Phage Display Data and Multiple Peptide Descriptors. Interdiscip Sci 2023; 15:578-589. [PMID: 37389722 DOI: 10.1007/s12539-023-00575-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 06/08/2023] [Accepted: 06/09/2023] [Indexed: 07/01/2023]
Abstract
CD47/SIRPα pathway is a new breakthrough in the field of tumor immunity after PD-1/PD-L1. While current monoclonal antibody therapies targeting CD47/SIRPα have demonstrated some anti-tumor effectiveness, there are several inherent limitations associated with these formulations. In the paper, we developed a predictive model that combines next-generation phage display (NGPD) and traditional machine learning methods to distinguish CD47 binding peptides. First, we utilized NGPD biopanning technology to screen CD47 binding peptides. Second, ten traditional machine learning methods based on multiple peptide descriptors and three deep learning methods were used to build computational models for identifying CD47 binding peptides. Finally, we proposed an integrated model based on support vector machine. During the five-fold cross-validation, the integrated predictor demonstrated specificity, accuracy, and sensitivity of 0.755, 0.764, and 0.772, respectively. Furthermore, an online bioinformatics tool called CD47Binder has been developed for the integrated predictor. This tool is readily accessible on http://i.uestc.edu.cn/CD47Binder/cgi-bin/CD47Binder.pl .
Collapse
Affiliation(s)
- Bowen Li
- Medical College, Guizhou University, Huaxi District, Guiyang, 550025, Guizhou, China
| | - Heng Chen
- Medical College, Guizhou University, Huaxi District, Guiyang, 550025, Guizhou, China.
| | - Jian Huang
- School of Life Science and Technology, University of Electronic Science and Technology of China, No.2006, Xiyuan Ave, West Hi-Tech Zone, Chengdu, 6173001, Sichuan, China.
| | - Bifang He
- Medical College, Guizhou University, Huaxi District, Guiyang, 550025, Guizhou, China.
- State Key Laboratory of Public Big Data, Guizhou University, Huaxi District, Guiyang, 550025, Guizhou, China.
| |
Collapse
|
9
|
Matsunaga R, Ujiie K, Inagaki M, Fernández Pérez J, Yasuda Y, Mimasu S, Soga S, Tsumoto K. High-throughput analysis system of interaction kinetics for data-driven antibody design. Sci Rep 2023; 13:19417. [PMID: 37990030 PMCID: PMC10663500 DOI: 10.1038/s41598-023-46756-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 11/04/2023] [Indexed: 11/23/2023] Open
Abstract
Surface plasmon resonance (SPR) is widely used for antigen-antibody interaction kinetics analysis. However, it has not been used in the screening phase because of the low throughput of measurement and analysis. Herein, we proposed a high-throughput SPR analysis system named "BreviA" using the Brevibacillus expression system. Brevibacillus was transformed using a plasmid library containing various antibody sequences, and single colonies were cultured in 96-well plates. Sequence analysis was performed using bacterial cells, and recombinant antibodies secreted in the supernatant were immobilized on a sensor chip to analyze their interactions with antigens using high-throughput SPR. Using this system, the process from the transformation to 384 interaction analyses can be performed within a week. This system utility was tested using an interspecies specificity design of an anti-human programmed cell death protein 1 (PD-1) antibody. A plasmid library containing alanine and tyrosine mutants of all complementarity-determining region residues was generated. A high-throughput SPR analysis was performed against human and mouse PD-1, showing that the mutation in the specific region enhanced the affinity for mouse PD-1. Furthermore, deep mutational scanning of the region revealed two mutants with > 100-fold increased affinity for mouse PD-1, demonstrating the potential efficacy of antibody design using data-driven approach.
Collapse
Affiliation(s)
- Ryo Matsunaga
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, 113-8656, Japan
- Department of Chemistry and Biotechnology, School of Engineering, The University of Tokyo, Tokyo, 113-8656, Japan
| | - Kan Ujiie
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, 113-8656, Japan
| | - Mayuko Inagaki
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, 113-8656, Japan
| | - Jorge Fernández Pérez
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, 113-8656, Japan
| | - Yoshiki Yasuda
- Department of Chemistry and Biotechnology, School of Engineering, The University of Tokyo, Tokyo, 113-8656, Japan
| | - Shinya Mimasu
- Biologics Engineering, Discovery Intelligence, Astellas Pharma Inc., 21, Miyukigaoka, Tsukuba-shi, Ibaraki, 305-8585, Japan
| | - Shinji Soga
- Biologics Engineering, Discovery Intelligence, Astellas Pharma Inc., 21, Miyukigaoka, Tsukuba-shi, Ibaraki, 305-8585, Japan
| | - Kouhei Tsumoto
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, 113-8656, Japan.
- Department of Chemistry and Biotechnology, School of Engineering, The University of Tokyo, Tokyo, 113-8656, Japan.
- The Institute of Medical Science, The University of Tokyo, Tokyo, 108-8639, Japan.
| |
Collapse
|
10
|
Zhou Y, Huang Z, Li W, Wei J, Jiang Q, Yang W, Huang J. Deep learning in preclinical antibody drug discovery and development. Methods 2023; 218:57-71. [PMID: 37454742 DOI: 10.1016/j.ymeth.2023.07.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 03/20/2023] [Accepted: 07/10/2023] [Indexed: 07/18/2023] Open
Abstract
Antibody drugs have become a key part of biotherapeutics. Patients suffering from various diseases have benefited from antibody therapies. However, its development process is rather long, expensive and risky. To speed up the process, reduce cost and improve success rate, artificial intelligence, especially deep learning methods, have been widely used in all aspects of preclinical antibody drug development, from library generation to hit identification, developability screening, lead selection and optimization. In this review, we systematically summarize antibody encodings, deep learning architectures and models used in preclinical antibody drug discovery and development. We also critically discuss challenges and opportunities, problems and possible solutions, current applications and future directions of deep learning in antibody drug development.
Collapse
Affiliation(s)
- Yuwei Zhou
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Ziru Huang
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Wenzhen Li
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Jinyi Wei
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Qianhu Jiang
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Wei Yang
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Jian Huang
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China.
| |
Collapse
|
11
|
Bai G, Sun C, Guo Z, Wang Y, Zeng X, Su Y, Zhao Q, Ma B. Accelerating antibody discovery and design with artificial intelligence: Recent advances and prospects. Semin Cancer Biol 2023; 95:13-24. [PMID: 37355214 DOI: 10.1016/j.semcancer.2023.06.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 06/09/2023] [Accepted: 06/18/2023] [Indexed: 06/26/2023]
Abstract
Therapeutic antibodies are the largest class of biotherapeutics and have been successful in treating human diseases. However, the design and discovery of antibody drugs remains challenging and time-consuming. Recently, artificial intelligence technology has had an incredible impact on antibody design and discovery, resulting in significant advances in antibody discovery, optimization, and developability. This review summarizes major machine learning (ML) methods and their applications for computational predictors of antibody structure and antigen interface/interaction, as well as the evaluation of antibody developability. Additionally, this review addresses the current status of ML-based therapeutic antibodies under preclinical and clinical phases. While many challenges remain, ML may offer a new therapeutic option for the future direction of fully computational antibody design.
Collapse
Affiliation(s)
- Ganggang Bai
- Engineering Research Center of Cell & Therapeutic Antibody (MOE), School of Pharmacy, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Chuance Sun
- Engineering Research Center of Cell & Therapeutic Antibody (MOE), School of Pharmacy, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Ziang Guo
- Cancer Center, Institute of Translational Medicine, Faculty of Health Sciences, University of Macau, Taipa, Macao Special Administrative Region of China
| | - Yangjing Wang
- Engineering Research Center of Cell & Therapeutic Antibody (MOE), School of Pharmacy, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Xincheng Zeng
- Engineering Research Center of Cell & Therapeutic Antibody (MOE), School of Pharmacy, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yuhong Su
- Engineering Research Center of Cell & Therapeutic Antibody (MOE), School of Pharmacy, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Qi Zhao
- Cancer Center, Institute of Translational Medicine, Faculty of Health Sciences, University of Macau, Taipa, Macao Special Administrative Region of China; MoE Frontiers Science Center for Precision Oncology, University of Macau, Taipa, Macao Special Administrative Region of China.
| | - Buyong Ma
- Engineering Research Center of Cell & Therapeutic Antibody (MOE), School of Pharmacy, Shanghai Jiao Tong University, Shanghai 200240, China; Shanghai Digiwiser BioTechnolgy, Limited, Shanghai 201203, China.
| |
Collapse
|
12
|
Arras P, Yoo HB, Pekar L, Clarke T, Friedrich L, Schröter C, Schanz J, Tonillo J, Siegmund V, Doerner A, Krah S, Guarnera E, Zielonka S, Evers A. AI/ML combined with next-generation sequencing of VHH immune repertoires enables the rapid identification of de novo humanized and sequence-optimized single domain antibodies: a prospective case study. Front Mol Biosci 2023; 10:1249247. [PMID: 37842638 PMCID: PMC10575757 DOI: 10.3389/fmolb.2023.1249247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Accepted: 08/31/2023] [Indexed: 10/17/2023] Open
Abstract
Introduction: In this study, we demonstrate the feasibility of yeast surface display (YSD) and nextgeneration sequencing (NGS) in combination with artificial intelligence and machine learning methods (AI/ML) for the identification of de novo humanized single domain antibodies (sdAbs) with favorable early developability profiles. Methods: The display library was derived from a novel approach, in which VHH-based CDR3 regions obtained from a llama (Lama glama), immunized against NKp46, were grafted onto a humanized VHH backbone library that was diversified in CDR1 and CDR2. Following NGS analysis of sequence pools from two rounds of fluorescence-activated cell sorting we focused on four sequence clusters based on NGS frequency and enrichment analysis as well as in silico developability assessment. For each cluster, long short-term memory (LSTM) based deep generative models were trained and used for the in silico sampling of new sequences. Sequences were subjected to sequence- and structure-based in silico developability assessment to select a set of less than 10 sequences per cluster for production. Results: As demonstrated by binding kinetics and early developability assessment, this procedure represents a general strategy for the rapid and efficient design of potent and automatically humanized sdAb hits from screening selections with favorable early developability profiles.
Collapse
Affiliation(s)
- Paul Arras
- Antibody Discovery and Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
- Institute for Organic Chemistry and Biochemistry, Technical University of Darmstadt, Darmstadt, Germany
| | - Han Byul Yoo
- Antibody Discovery and Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
| | - Lukas Pekar
- Antibody Discovery and Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
| | - Thomas Clarke
- Bioinformatics, EMD Serono, Billerica, MA, United States
| | - Lukas Friedrich
- Computational Chemistry and Biologics, Merck Healthcare KGaA, Darmstadt, Germany
| | | | - Jennifer Schanz
- ADCs & Targeted NBE Therapeutics, Merck KGaA, Darmstadt, Germany
| | - Jason Tonillo
- ADCs & Targeted NBE Therapeutics, Merck KGaA, Darmstadt, Germany
| | - Vanessa Siegmund
- Early Protein Supply and Characterization, Merck Healthcare KGaA, Darmstadt, Germany
| | - Achim Doerner
- Antibody Discovery and Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
| | - Simon Krah
- Antibody Discovery and Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
| | - Enrico Guarnera
- Antibody Discovery and Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
| | - Stefan Zielonka
- Antibody Discovery and Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
- Institute for Organic Chemistry and Biochemistry, Technical University of Darmstadt, Darmstadt, Germany
| | - Andreas Evers
- Antibody Discovery and Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
| |
Collapse
|
13
|
Williams AH, Zhan CG. Staying Ahead of the Game: How SARS-CoV-2 has Accelerated the Application of Machine Learning in Pandemic Management. BioDrugs 2023; 37:649-674. [PMID: 37464099 DOI: 10.1007/s40259-023-00611-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/28/2023] [Indexed: 07/20/2023]
Abstract
In recent years, machine learning (ML) techniques have garnered considerable interest for their potential use in accelerating the rate of drug discovery. With the emergence of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic, the utilization of ML has become even more crucial in the search for effective antiviral medications. The pandemic has presented the scientific community with a unique challenge, and the rapid identification of potential treatments has become an urgent priority. Researchers have been able to accelerate the process of identifying drug candidates, repurposing existing drugs, and designing new compounds with desirable properties using machine learning in drug discovery. To train predictive models, ML techniques in drug discovery rely on the analysis of large datasets, including both experimental and clinical data. These models can be used to predict the biological activities, potential side effects, and interactions with specific target proteins of drug candidates. This strategy has proven to be an effective method for identifying potential coronavirus disease 2019 (COVID-19) and other disease treatments. This paper offers a thorough analysis of the various ML techniques implemented to combat COVID-19, including supervised and unsupervised learning, deep learning, and natural language processing. The paper discusses the impact of these techniques on pandemic drug development, including the identification of potential treatments, the understanding of the disease mechanism, and the creation of effective and safe therapeutics. The lessons learned can be applied to future outbreaks and drug discovery initiatives.
Collapse
Affiliation(s)
- Alexander H Williams
- Molecular Modeling and Biopharmaceutical Center, University of Kentucky, 789 South Limestone Street, Lexington, KY, 40536, USA
- Department of Pharmaceutical Sciences, College of Pharmacy, University of Kentucky, 789 South Limestone Street, Lexington, KY, 40536, USA
- GSK Upper Providence, 1250 S. Collegeville Road, Collegeville, PA, 19426, USA
| | - Chang-Guo Zhan
- Molecular Modeling and Biopharmaceutical Center, University of Kentucky, 789 South Limestone Street, Lexington, KY, 40536, USA.
- Department of Pharmaceutical Sciences, College of Pharmacy, University of Kentucky, 789 South Limestone Street, Lexington, KY, 40536, USA.
| |
Collapse
|
14
|
Li J, Kang G, Wang J, Yuan H, Wu Y, Meng S, Wang P, Zhang M, Wang Y, Feng Y, Huang H, de Marco A. Affinity maturation of antibody fragments: A review encompassing the development from random approaches to computational rational optimization. Int J Biol Macromol 2023; 247:125733. [PMID: 37423452 DOI: 10.1016/j.ijbiomac.2023.125733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 07/04/2023] [Accepted: 07/06/2023] [Indexed: 07/11/2023]
Abstract
Routinely screened antibody fragments usually require further in vitro maturation to achieve the desired biophysical properties. Blind in vitro strategies can produce improved ligands by introducing random mutations into the original sequences and selecting the resulting clones under more and more stringent conditions. Rational approaches exploit an alternative perspective that aims first at identifying the specific residues potentially involved in the control of biophysical mechanisms, such as affinity or stability, and then to evaluate what mutations could improve those characteristics. The understanding of the antigen-antibody interactions is instrumental to develop this process the reliability of which, consequently, strongly depends on the quality and completeness of the structural information. Recently, methods based on deep learning approaches critically improved the speed and accuracy of model building and are promising tools for accelerating the docking step. Here, we review the features of the available bioinformatic instruments and analyze the reports illustrating the result obtained with their application to optimize antibody fragments, and nanobodies in particular. Finally, the emerging trends and open questions are summarized.
Collapse
Affiliation(s)
- Jiaqi Li
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China
| | - Guangbo Kang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China
| | - Jiewen Wang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China
| | - Haibin Yuan
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China
| | - Yili Wu
- Zhejiang Provincial Clinical Research Center for Mental Disorders, School of Mental Health and the Affiliated Kangning Hospital, Institute of Aging, Key Laboratory of Alzheimer's Disease of Zhejiang Province, Wenzhou Medical University, Oujiang Laboratory, Wenzhou, Zhejiang 325035, China
| | - Shuxian Meng
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China
| | - Ping Wang
- New Technology R&D Department, Tianjin Modern Innovative TCM Technology Company Limited, Tianjin 300392, China
| | - Miao Zhang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; China Resources Biopharmaceutical Company Limited, Beijing 100029, China
| | - Yuli Wang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Tianjin Pharmaceutical Da Ren Tang Group Corporation Limited, Traditional Chinese Pharmacy Research Institute, Tianjin Key Laboratory of Quality Control in Chinese Medicine, Tianjin 300457, China; State Key Laboratory of Drug Delivery Technology and Pharmacokinetics, Tianjin Institute of Pharmaceutical Research, Tianjin 300193, China
| | - Yuanhang Feng
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China
| | - He Huang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China.
| | - Ario de Marco
- Laboratory for Environmental and Life Sciences, University of Nova Gorica, Nova Gorica, Slovenia.
| |
Collapse
|
15
|
Parkinson J, Wang W. Linear-Scaling Kernels for Protein Sequences and Small Molecules Outperform Deep Learning While Providing Uncertainty Quantitation and Improved Interpretability. J Chem Inf Model 2023; 63:4589-4601. [PMID: 37498239 DOI: 10.1021/acs.jcim.3c00601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/28/2023]
Abstract
Gaussian process (GP) is a Bayesian model which provides several advantages for regression tasks in machine learning such as reliable quantitation of uncertainty and improved interpretability. Their adoption has been precluded by their excessive computational cost and by the difficulty in adapting them for analyzing sequences (e.g., amino acid sequences) and graphs (e.g., small molecules). In this study, we introduce a group of random feature-approximated kernels for sequences and graphs that exhibit linear scaling with both the size of the training set and the size of the sequences or graphs. We incorporate these new kernels into our new Python library for GP regression, xGPR, and develop an efficient and scalable algorithm for fitting GPs equipped with these kernels to large datasets. We compare the performance of xGPR on 17 different benchmarks with both standard and state-of-the-art deep learning models and find that GP regression achieves highly competitive accuracy for these tasks while providing with well-calibrated uncertainty quantitation and improved interpretability. Finally, in a simple experiment, we illustrate how xGPR may be used as part of an active learning strategy to engineer a protein with a desired property in an automated way without human intervention.
Collapse
|
16
|
Bauer J, Rajagopal N, Gupta P, Gupta P, Nixon AE, Kumar S. How can we discover developable antibody-based biotherapeutics? Front Mol Biosci 2023; 10:1221626. [PMID: 37609373 PMCID: PMC10441133 DOI: 10.3389/fmolb.2023.1221626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 07/10/2023] [Indexed: 08/24/2023] Open
Abstract
Antibody-based biotherapeutics have emerged as a successful class of pharmaceuticals despite significant challenges and risks to their discovery and development. This review discusses the most frequently encountered hurdles in the research and development (R&D) of antibody-based biotherapeutics and proposes a conceptual framework called biopharmaceutical informatics. Our vision advocates for the syncretic use of computation and experimentation at every stage of biologic drug discovery, considering developability (manufacturability, safety, efficacy, and pharmacology) of potential drug candidates from the earliest stages of the drug discovery phase. The computational advances in recent years allow for more precise formulation of disease concepts, rapid identification, and validation of targets suitable for therapeutic intervention and discovery of potential biotherapeutics that can agonize or antagonize them. Furthermore, computational methods for de novo and epitope-specific antibody design are increasingly being developed, opening novel computationally driven opportunities for biologic drug discovery. Here, we review the opportunities and limitations of emerging computational approaches for optimizing antigens to generate robust immune responses, in silico generation of antibody sequences, discovery of potential antibody binders through virtual screening, assessment of hits, identification of lead drug candidates and their affinity maturation, and optimization for developability. The adoption of biopharmaceutical informatics across all aspects of drug discovery and development cycles should help bring affordable and effective biotherapeutics to patients more quickly.
Collapse
Affiliation(s)
- Joschka Bauer
- Early Stage Pharmaceutical Development Biologicals, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach/Riss, Germany
- In Silico Team, Boehringer Ingelheim, Hannover, Germany
| | - Nandhini Rajagopal
- In Silico Team, Boehringer Ingelheim, Hannover, Germany
- Biotherapeutics Discovery, Boehringer Ingelheim Pharmaceuticals Inc., Ridgefield, CT, United States
| | - Priyanka Gupta
- In Silico Team, Boehringer Ingelheim, Hannover, Germany
- Biotherapeutics Discovery, Boehringer Ingelheim Pharmaceuticals Inc., Ridgefield, CT, United States
| | - Pankaj Gupta
- In Silico Team, Boehringer Ingelheim, Hannover, Germany
- Biotherapeutics Discovery, Boehringer Ingelheim Pharmaceuticals Inc., Ridgefield, CT, United States
| | - Andrew E. Nixon
- Biotherapeutics Discovery, Boehringer Ingelheim Pharmaceuticals Inc., Ridgefield, CT, United States
| | - Sandeep Kumar
- In Silico Team, Boehringer Ingelheim, Hannover, Germany
- Biotherapeutics Discovery, Boehringer Ingelheim Pharmaceuticals Inc., Ridgefield, CT, United States
| |
Collapse
|
17
|
Valeri JA, Soenksen LR, Collins KM, Ramesh P, Cai G, Powers R, Angenent-Mari NM, Camacho DM, Wong F, Lu TK, Collins JJ. BioAutoMATED: An end-to-end automated machine learning tool for explanation and design of biological sequences. Cell Syst 2023; 14:525-542.e9. [PMID: 37348466 PMCID: PMC10700034 DOI: 10.1016/j.cels.2023.05.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Revised: 02/17/2023] [Accepted: 05/22/2023] [Indexed: 06/24/2023]
Abstract
The design choices underlying machine-learning (ML) models present important barriers to entry for many biologists who aim to incorporate ML in their research. Automated machine-learning (AutoML) algorithms can address many challenges that come with applying ML to the life sciences. However, these algorithms are rarely used in systems and synthetic biology studies because they typically do not explicitly handle biological sequences (e.g., nucleotide, amino acid, or glycan sequences) and cannot be easily compared with other AutoML algorithms. Here, we present BioAutoMATED, an AutoML platform for biological sequence analysis that integrates multiple AutoML methods into a unified framework. Users are automatically provided with relevant techniques for analyzing, interpreting, and designing biological sequences. BioAutoMATED predicts gene regulation, peptide-drug interactions, and glycan annotation, and designs optimized synthetic biology components, revealing salient sequence characteristics. By automating sequence modeling, BioAutoMATED allows life scientists to incorporate ML more readily into their work.
Collapse
Affiliation(s)
- Jacqueline A Valeri
- Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Luis R Soenksen
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Department of Mechanical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA
| | - Katherine M Collins
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Department of Engineering, University of Cambridge, Trumpington St, Cambridge CB2 1PZ, UK
| | - Pradeep Ramesh
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA
| | - George Cai
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA
| | - Rani Powers
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Pluto Biosciences, Golden, CO 80402, USA
| | - Nicolaas M Angenent-Mari
- Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA
| | - Diogo M Camacho
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA
| | - Felix Wong
- Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Timothy K Lu
- Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Synthetic Biology Group, Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - James J Collins
- Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Harvard-MIT Program in Health Sciences and Technology, Cambridge, MA 02139, USA; Abdul Latif Jameel Clinic for Machine Learning in Health, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| |
Collapse
|
18
|
Li L, Gupta E, Spaeth J, Shing L, Jaimes R, Engelhart E, Lopez R, Caceres RS, Bepler T, Walsh ME. Machine learning optimization of candidate antibody yields highly diverse sub-nanomolar affinity antibody libraries. Nat Commun 2023; 14:3454. [PMID: 37308471 DOI: 10.1038/s41467-023-39022-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Accepted: 05/23/2023] [Indexed: 06/14/2023] Open
Abstract
Therapeutic antibodies are an important and rapidly growing drug modality. However, the design and discovery of early-stage antibody therapeutics remain a time and cost-intensive endeavor. Here we present an end-to-end Bayesian, language model-based method for designing large and diverse libraries of high-affinity single-chain variable fragments (scFvs) that are then empirically measured. In a head-to-head comparison with a directed evolution approach, we show that the best scFv generated from our method represents a 28.7-fold improvement in binding over the best scFv from the directed evolution. Additionally, 99% of designed scFvs in our most successful library are improvements over the initial candidate scFv. By comparing a library's predicted success to actual measurements, we demonstrate our method's ability to explore tradeoffs between library success and diversity. Results of our work highlight the significant impact machine learning models can have on scFv development. We expect our method to be broadly applicable and provide value to other protein engineering tasks.
Collapse
Affiliation(s)
- Lin Li
- Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA, USA.
| | - Esther Gupta
- Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA, USA
| | - John Spaeth
- Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA, USA
| | - Leslie Shing
- Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA, USA
| | - Rafael Jaimes
- Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA, USA
| | | | | | - Rajmonda S Caceres
- Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA, USA
| | - Tristan Bepler
- Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, MA, USA
- Simons Electron Microscopy Center, New York Structural Biology Center, New York, NY, USA
| | - Matthew E Walsh
- Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA, USA
- Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| |
Collapse
|
19
|
Tamaru Y, Nakanishi S, Tanaka K, Umetsu M, Nakazawa H, Sugiyama A, Ito T, Shimokawa N, Takagi M. Recent research advances on non-linear phenomena in various biosystems. J Biosci Bioeng 2023:S1389-1723(23)00107-X. [PMID: 37246137 DOI: 10.1016/j.jbiosc.2023.03.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 03/03/2023] [Accepted: 03/22/2023] [Indexed: 05/30/2023]
Abstract
All biological phenomena can be classified as open, dissipative and non-linear. Moreover, the most typical phenomena are associated with non-linearity, dissipation and openness in biological systems. In this review article, four research topics on non-linear biosystems are described to show the examples from various biological systems. First, membrane dynamics of a lipid bilayer for the cell membrane is described. Since the cell membrane separates the inside of the cell from the outside, self-organizing systems that form spatial patterns on membranes often depend on non-linear dynamics. Second, various data banks based on recent genomics analysis supply the data including vast functional proteins from many organisms and their variable species. Since the proteins existing in nature are only a very small part of the space represented by amino acid sequence, success of mutagenesis-based molecular evolution approach crucially depends on preparing a library with high enrichment of functional proteins. Third, photosynthetic organisms depend on ambient light, the regular and irregular changes of which have a significant impact on photosynthetic processes. The light-driven process proceeds through many redox couples in the cyanobacteria constituting chain of redox reactions. Forth topics focuses on a vertebrate model, the zebrafish, which can help to understand, predict and control the chaos of complex biological systems. In particular, during early developmental stages, developmental differentiation occurs dynamically from a fertilized egg to divided and mature cells. These exciting fields of complexity, chaos, and non-linear science have experienced impressive growth in recent decades. Finally, future directions for non-liner biosystems are presented.
Collapse
Affiliation(s)
- Yutaka Tamaru
- Department of Life Sciences, Graduate School of Bioresources, Mie University, 1577 Kurimamachiya, Tsu, Mie 514-8507, Japan.
| | - Shuji Nakanishi
- Research Center for Solar Energy Chemistry, Graduate School of Engineering Science, Osaka University, 1-3 Machikaneyama, Toyonaka, Osaka 560-8531, Japan
| | - Kenya Tanaka
- Research Center for Solar Energy Chemistry, Graduate School of Engineering Science, Osaka University, 1-3 Machikaneyama, Toyonaka, Osaka 560-8531, Japan
| | - Mitsuo Umetsu
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aramakiazaaoba, Aoba, Sendai, Miyagi 980-8579, Japan
| | - Hikaru Nakazawa
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aramakiazaaoba, Aoba, Sendai, Miyagi 980-8579, Japan
| | - Aruto Sugiyama
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aramakiazaaoba, Aoba, Sendai, Miyagi 980-8579, Japan
| | - Tomoyuki Ito
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aramakiazaaoba, Aoba, Sendai, Miyagi 980-8579, Japan
| | - Naofumi Shimokawa
- School of Materials Science, Japan Advanced Institute of Science and Technology, 1-1 Asahidai, Nomi, Ishikawa 923-1292, Japan
| | - Masahiro Takagi
- School of Materials Science, Japan Advanced Institute of Science and Technology, 1-1 Asahidai, Nomi, Ishikawa 923-1292, Japan
| |
Collapse
|
20
|
Hederman AP, Ackerman ME. Leveraging deep learning to improve vaccine design. Trends Immunol 2023; 44:333-344. [PMID: 37003949 PMCID: PMC10485910 DOI: 10.1016/j.it.2023.03.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 03/05/2023] [Accepted: 03/05/2023] [Indexed: 04/03/2023]
Abstract
Deep learning has led to incredible breakthroughs in areas of research, from self-driving vehicles to solutions, to formal mathematical proofs. In the biomedical sciences, however, the revolutionary results seen in other fields are only now beginning to be realized. Vaccine research and development efforts represent an application with high public health significance. Protein structure prediction, immune repertoire analysis, and phylogenetics are three principal areas in which deep learning is poised to provide key advances. Here, we opine on some of the current challenges with deep learning and how they are being addressed. Despite the nascent stage of deep learning applications in immunological studies, there is ample opportunity to utilize this new technology to address the most challenging and burdensome infectious diseases confronting global populations.
Collapse
Affiliation(s)
| | - Margaret E Ackerman
- Thayer School of Engineering, Dartmouth College, Hanover, NH, USA; Department of Microbiology and Immunology, Geisel School of Medicine, Hanover, NH, USA.
| |
Collapse
|
21
|
Chen Z, Wang X, Chen X, Huang J, Wang C, Wang J, Wang Z. Accelerating therapeutic protein design with computational approaches toward the clinical stage. Comput Struct Biotechnol J 2023; 21:2909-2926. [PMID: 38213894 PMCID: PMC10781723 DOI: 10.1016/j.csbj.2023.04.027] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 04/11/2023] [Accepted: 04/27/2023] [Indexed: 01/13/2024] Open
Abstract
Therapeutic protein, represented by antibodies, is of increasing interest in human medicine. However, clinical translation of therapeutic protein is still largely hindered by different aspects of developability, including affinity and selectivity, stability and aggregation prevention, solubility and viscosity reduction, and deimmunization. Conventional optimization of the developability with widely used methods, like display technologies and library screening approaches, is a time and cost-intensive endeavor, and the efficiency in finding suitable solutions is still not enough to meet clinical needs. In recent years, the accelerated advancement of computational methodologies has ushered in a transformative era in the field of therapeutic protein design. Owing to their remarkable capabilities in feature extraction and modeling, the integration of cutting-edge computational strategies with conventional techniques presents a promising avenue to accelerate the progression of therapeutic protein design and optimization toward clinical implementation. Here, we compared the differences between therapeutic protein and small molecules in developability and provided an overview of the computational approaches applicable to the design or optimization of therapeutic protein in several developability issues.
Collapse
Affiliation(s)
- Zhidong Chen
- Department of Pathology, The Eighth Affiliated Hospital, Sun Yat-sen University, Shenzhen 518033, China
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Xinpei Wang
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Xu Chen
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Juyang Huang
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Chenglin Wang
- Shenzhen Qiyu Biotechnology Co., Ltd, Shenzhen 518107, China
| | - Junqing Wang
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Zhe Wang
- Department of Pathology, The Eighth Affiliated Hospital, Sun Yat-sen University, Shenzhen 518033, China
| |
Collapse
|
22
|
Li T, Li Y, Zhu X, He Y, Wu Y, Ying T, Xie Z. Artificial intelligence in cancer immunotherapy: Applications in neoantigen recognition, antibody design and immunotherapy response prediction. Semin Cancer Biol 2023; 91:50-69. [PMID: 36870459 DOI: 10.1016/j.semcancer.2023.02.007] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Revised: 02/13/2023] [Accepted: 02/28/2023] [Indexed: 03/06/2023]
Abstract
Cancer immunotherapy is a method of controlling and eliminating tumors by reactivating the body's cancer-immunity cycle and restoring its antitumor immune response. The increased availability of data, combined with advancements in high-performance computing and innovative artificial intelligence (AI) technology, has resulted in a rise in the use of AI in oncology research. State-of-the-art AI models for functional classification and prediction in immunotherapy research are increasingly used to support laboratory-based experiments. This review offers a glimpse of the current AI applications in immunotherapy, including neoantigen recognition, antibody design, and prediction of immunotherapy response. Advancing in this direction will result in more robust predictive models for developing better targets, drugs, and treatments, and these advancements will eventually make their way into the clinical setting, pushing AI forward in the field of precision oncology.
Collapse
Affiliation(s)
- Tong Li
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Yupeng Li
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Xiaoyi Zhu
- MOE/NHC Key Laboratory of Medical Molecular Virology, Shanghai Institute of Infectious Disease and Biosecurity, School of Basic Medical Sciences, Shanghai Medical College, Fudan University, Shanghai, China; Shanghai Engineering Research Center for Synthetic Immunology, Shanghai, China
| | - Yao He
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Yanling Wu
- MOE/NHC Key Laboratory of Medical Molecular Virology, Shanghai Institute of Infectious Disease and Biosecurity, School of Basic Medical Sciences, Shanghai Medical College, Fudan University, Shanghai, China; Shanghai Engineering Research Center for Synthetic Immunology, Shanghai, China
| | - Tianlei Ying
- MOE/NHC Key Laboratory of Medical Molecular Virology, Shanghai Institute of Infectious Disease and Biosecurity, School of Basic Medical Sciences, Shanghai Medical College, Fudan University, Shanghai, China; Shanghai Engineering Research Center for Synthetic Immunology, Shanghai, China.
| | - Zhi Xie
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China; Center for Precision Medicine, Sun Yat-sen University, Guangzhou, China.
| |
Collapse
|
23
|
Computational and artificial intelligence-based methods for antibody development. Trends Pharmacol Sci 2023; 44:175-189. [PMID: 36669976 DOI: 10.1016/j.tips.2022.12.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Revised: 12/21/2022] [Accepted: 12/22/2022] [Indexed: 01/19/2023]
Abstract
Due to their high target specificity and binding affinity, therapeutic antibodies are currently the largest class of biotherapeutics. The traditional largely empirical antibody development process is, while mature and robust, cumbersome and has significant limitations. Substantial recent advances in computational and artificial intelligence (AI) technologies are now starting to overcome many of these limitations and are increasingly integrated into development pipelines. Here, we provide an overview of AI methods relevant for antibody development, including databases, computational predictors of antibody properties and structure, and computational antibody design methods with an emphasis on machine learning (ML) models, and the design of complementarity-determining region (CDR) loops, antibody structural components critical for binding.
Collapse
|
24
|
Parkinson J, Hard R, Wang W. The RESP AI model accelerates the identification of tight-binding antibodies. Nat Commun 2023; 14:454. [PMID: 36709319 PMCID: PMC9884274 DOI: 10.1038/s41467-023-36028-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2022] [Accepted: 01/13/2023] [Indexed: 01/30/2023] Open
Abstract
High-affinity antibodies are often identified through directed evolution, which may require many iterations of mutagenesis and selection to find an optimal candidate. Deep learning techniques hold the potential to accelerate this process but the existing methods cannot provide the confidence interval or uncertainty needed to assess the reliability of the predictions. Here we present a pipeline called RESP for efficient identification of high affinity antibodies. We develop a learned representation trained on over 3 million human B-cell receptor sequences to encode antibody sequences. We then develop a variational Bayesian neural network to perform ordinal regression on a set of the directed evolution sequences binned by off-rate and quantify their likelihood to be tight binders against an antigen. Importantly, this model can assess sequences not present in the directed evolution library and thus greatly expand the search space to uncover the best sequences for experimental evaluation. We demonstrate the power of this pipeline by achieving a 17-fold improvement in the KD of the PD-L1 antibody Atezolizumab and this success illustrates the potential of RESP in facilitating general antibody development.
Collapse
Affiliation(s)
- Jonathan Parkinson
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA, 92093-0359, USA
| | - Ryan Hard
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA, 92093-0359, USA
| | - Wei Wang
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA, 92093-0359, USA. .,Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA, 92093-0359, USA.
| |
Collapse
|
25
|
Svilenov HL, Arosio P, Menzen T, Tessier P, Sormanni P. Approaches to expand the conventional toolbox for discovery and selection of antibodies with drug-like physicochemical properties. MAbs 2023; 15:2164459. [PMID: 36629855 PMCID: PMC9839375 DOI: 10.1080/19420862.2022.2164459] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 12/22/2022] [Accepted: 12/29/2022] [Indexed: 01/12/2023] Open
Abstract
Antibody drugs should exhibit not only high-binding affinity for their target antigens but also favorable physicochemical drug-like properties. Such drug-like biophysical properties are essential for the successful development of antibody drug products. The traditional approaches used in antibody drug development require significant experimentation to produce, optimize, and characterize many candidates. Therefore, it is attractive to integrate new methods that can optimize the process of selecting antibodies with both desired target-binding and drug-like biophysical properties. Here, we summarize a selection of techniques that can complement the conventional toolbox used to de-risk antibody drug development. These techniques can be integrated at different stages of the antibody development process to reduce the frequency of physicochemical liabilities in antibody libraries during initial discovery and to co-optimize multiple antibody features during early-stage antibody engineering and affinity maturation. Moreover, we highlight biophysical and computational approaches that can be used to predict physical degradation pathways relevant for long-term storage and in-use stability to reduce the need for extensive experimentation.
Collapse
Affiliation(s)
- Hristo L. Svilenov
- Laboratory of General Biochemistry and Physical Pharmacy, Faculty of Pharmaceutical Sciences, Ghent University, Gent, Belgium
| | - Paolo Arosio
- Department of Chemistry and Applied Biosciences, ETH Zürich, Zürich, Switzerland
| | - Tim Menzen
- Coriolis Pharma Research GmbH, Martinsried, 82152, Germany
| | - Peter Tessier
- Department of Pharmaceutical Sciences, University of Michigan, Ann Arbor, MI, USA
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI, USA
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI, USA
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI, USA
| | - Pietro Sormanni
- Centre for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK
| |
Collapse
|
26
|
Arras P, Yoo HB, Pekar L, Schröter C, Clarke T, Krah S, Klewinghaus D, Siegmund V, Evers A, Zielonka S. A library approach for the de novo high-throughput isolation of humanized VHH domains with favorable developability properties following camelid immunization. MAbs 2023; 15:2261149. [PMID: 37766540 PMCID: PMC10540653 DOI: 10.1080/19420862.2023.2261149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Accepted: 09/15/2023] [Indexed: 09/29/2023] Open
Abstract
In this study, we generated a novel library approach for high throughput de novo identification of humanized single-domain antibodies following camelid immunization. To achieve this, VHH-derived complementarity-determining regions-3 (CDR3s) obtained from an immunized llama (Lama glama) were grafted onto humanized VHH backbones comprising moderately sequence-diversified CDR1 and CDR2 regions similar to natural immunized and naïve antibody repertoires. Importantly, these CDRs were tailored toward favorable in silico developability properties, by considering human-likeness as well as excluding potential sequence liabilities and predicted immunogenic motifs. Target-specific humanized single-domain antibodies (sdAbs) were readily obtained by yeast surface display. We demonstrate that, by exploiting this approach, high affinity sdAbs with an optimized in silico developability profile can be generated. These sdAbs display favorable biophysical, biochemical, and functional attributes and do not require any further sequence optimization. This approach is generally applicable to any antigen upon camelid immunization and has the potential to significantly accelerate candidate selection and reduce risks and attrition rates in sdAb development.
Collapse
Affiliation(s)
- Paul Arras
- Antibody Discovery & Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
- Institute for Organic Chemistry and Biochemistry, Technical University of Darmstadt, Darmstadt, Germany
| | - Han Byul Yoo
- Antibody Discovery & Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
- Early Protein Supply & Characterization, Merck Healthcare KGaA, Darmstadt, Germany
| | - Lukas Pekar
- Antibody Discovery & Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
| | | | | | - Simon Krah
- Antibody Discovery & Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
| | - Daniel Klewinghaus
- Early Protein Supply & Characterization, Merck Healthcare KGaA, Darmstadt, Germany
| | - Vanessa Siegmund
- Early Protein Supply & Characterization, Merck Healthcare KGaA, Darmstadt, Germany
| | - Andreas Evers
- Antibody Discovery & Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
| | - Stefan Zielonka
- Antibody Discovery & Protein Engineering, Merck Healthcare KGaA, Darmstadt, Germany
- Institute for Organic Chemistry and Biochemistry, Technical University of Darmstadt, Darmstadt, Germany
| |
Collapse
|
27
|
Ito T, Nguyen TD, Saito Y, Kurumida Y, Nakazawa H, Kawada S, Nishi H, Tsuda K, Kameda T, Umetsu M. Selection of target-binding proteins from the information of weakly enriched phage display libraries by deep sequencing and machine learning. MAbs 2023; 15:2168470. [PMID: 36683172 PMCID: PMC9872955 DOI: 10.1080/19420862.2023.2168470] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Despite the advances in surface-display systems for directed evolution, variants with high affinity are not always enriched due to undesirable biases that increase target-unrelated variants during biopanning. Here, our goal was to design a library containing improved variants from the information of the "weakly enriched" library where functional variants were weakly enriched. Deep sequencing for the previous biopanning result, where no functional antibody mimetics were experimentally identified, revealed that weak enrichment was partly due to undesirable biases during phage infection and amplification steps. The clustering analysis of the deep sequencing data from appropriate steps revealed no distinct sequence patterns, but a Bayesian machine learning model trained with the selected deep sequencing data supplied nine clusters with distinct sequence patterns. Phage libraries were designed on the basis of the sequence patterns identified, and four improved variants with target-specific affinity (EC50 = 80-277 nM) were identified by biopanning. The selection and use of deep sequencing data without undesirable bias enabled us to extract the information on prospective variants. In summary, the use of appropriate deep sequencing data and machine learning with the sequence data has the possibility of finding sequence space where functional variants are enriched.
Collapse
Affiliation(s)
- Tomoyuki Ito
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, Sendai, Japan
| | - Thuy Duong Nguyen
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan
| | - Yutaka Saito
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan,AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), Tokyo, Japan,Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan,Center for Advanced Intelligence Project, RIKEN, Tokyo, Japan
| | - Yoichi Kurumida
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan
| | - Hikaru Nakazawa
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, Sendai, Japan
| | - Sakiya Kawada
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, Sendai, Japan
| | - Hafumi Nishi
- Department of Applied Information Sciences, Graduate School of Information Sciences, Tohoku University, Sendai, Japan,Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan,Faculty of Core Research, Ochanomizu University, Tokyo, Japan
| | - Koji Tsuda
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Tokyo, Japan,Center for Advanced Intelligence Project, RIKEN, Tokyo, Japan,Research and Services Division of Materials Data and Integrated Systems, National Institute for Materials Science, Tsukuba, Japan
| | - Tomoshi Kameda
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan,Center for Advanced Intelligence Project, RIKEN, Tokyo, Japan,CONTACT Tomoshi Kameda Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan
| | - Mitsuo Umetsu
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, Sendai, Japan,Center for Advanced Intelligence Project, RIKEN, Tokyo, Japan,Mitsuo Umetsu Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, Sendai, Japan
| |
Collapse
|
28
|
Gaa R, Mayer HM, Noack D, Kumari K, Guenther R, Tsai SP, Ji Q, Doerner A. Mammalian display to secretion switchable libraries for antibody preselection and high throughput functional screening. MAbs 2023; 15:2251190. [PMID: 37646089 PMCID: PMC10469430 DOI: 10.1080/19420862.2023.2251190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 08/10/2023] [Accepted: 08/20/2023] [Indexed: 09/01/2023] Open
Abstract
Recently, there has been a co-evolution of mammalian libraries and diverse microfluidic approaches for therapeutic antibody hit discovery. Mammalian libraries enable the preservation of full immune repertoires, produce hit candidates in final format and facilitate broad combinatorial bispecific antibody screening, while several available microfluidic methodologies offer opportunities for rapid high-content screens. Here, we report proof-of-concept studies exploring the potential of combining microfluidic technologies with mammalian libraries for antibody discovery. First, antibody secretion, target co-expression and integration of appropriate reporter cell lines enabled the selection of in-trans acting agonistic bispecific antibodies. Second, a functional screen for internalization was established and comparison of autocrine versus co-encapsulation setups highlighted the advantages of an autocrine one cell approach. Third, synchronization of antibody-secreting cells prior to microfluidic screens reduced assay variability. Furthermore, a display to secretion switchable system was developed and applied for pre-enrichment of antibody clones with high manufacturability in conjunction with subsequent screening for functional properties. These case studies demonstrate the system's feasibility and may serve as basis for further development of integrated workflows combining manufacturability sorting and functional screens for the identification of optimal therapeutic antibody candidates.
Collapse
Affiliation(s)
- Ramona Gaa
- NBE Technologies, Merck KGaA, Darmstadt, Germany
| | | | | | - Kavita Kumari
- Discovery Biology, Syngene International, Bangalore, India
| | | | | | - Qingyong Ji
- NBE Technologies, EMD Serono, Billerica, MA, USA
| | | |
Collapse
|
29
|
Robert PA, Akbar R, Frank R, Pavlović M, Widrich M, Snapkov I, Slabodkin A, Chernigovskaya M, Scheffer L, Smorodina E, Rawat P, Mehta BB, Vu MH, Mathisen IF, Prósz A, Abram K, Olar A, Miho E, Haug DTT, Lund-Johansen F, Hochreiter S, Haff IH, Klambauer G, Sandve GK, Greiff V. Unconstrained generation of synthetic antibody-antigen structures to guide machine learning methodology for antibody specificity prediction. NATURE COMPUTATIONAL SCIENCE 2022; 2:845-865. [PMID: 38177393 DOI: 10.1038/s43588-022-00372-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Accepted: 11/09/2022] [Indexed: 01/06/2024]
Abstract
Machine learning (ML) is a key technology for accurate prediction of antibody-antigen binding. Two orthogonal problems hinder the application of ML to antibody-specificity prediction and the benchmarking thereof: the lack of a unified ML formalization of immunological antibody-specificity prediction problems and the unavailability of large-scale synthetic datasets to benchmark real-world relevant ML methods and dataset design. Here we developed the Absolut! software suite that enables parameter-based unconstrained generation of synthetic lattice-based three-dimensional antibody-antigen-binding structures with ground-truth access to conformational paratope, epitope and affinity. We formalized common immunological antibody-specificity prediction problems as ML tasks and confirmed that for both sequence- and structure-based tasks, accuracy-based rankings of ML methods trained on experimental data hold for ML methods trained on Absolut!-generated data. The Absolut! framework has the potential to enable real-world relevant development and benchmarking of ML strategies for biotherapeutics design.
Collapse
Affiliation(s)
- Philippe A Robert
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway.
| | - Rahmad Akbar
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Robert Frank
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | | | - Michael Widrich
- ELLIS Unit Linz and LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria
| | - Igor Snapkov
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Andrei Slabodkin
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Maria Chernigovskaya
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | | | - Eva Smorodina
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Puneet Rawat
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Brij Bhushan Mehta
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Mai Ha Vu
- Department of Linguistics and Scandinavian Studies, University of Oslo, Oslo, Norway
| | | | - Aurél Prósz
- Danish Cancer Society Research Center, Translational Cancer Genomics, Copenhagen, Denmark
| | - Krzysztof Abram
- The Novo Nordisk Foundation Center for Biosustainability, Autoflow, DTU Biosustain and IT University of Copenhagen, Copenhagen, Denmark
| | - Alex Olar
- Department of Complex Systems in Physics, Eötvös Loránd University, Budapest, Hungary
| | - Enkelejda Miho
- Institute of Medical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz, Switzerland
- aiNET GmbH, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | | | | | - Sepp Hochreiter
- ELLIS Unit Linz and LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria
- Institute of Advanced Research in Artificial Intelligence (IARAI), Vienna, Austria
| | | | - Günter Klambauer
- ELLIS Unit Linz and LIT AI Lab, Institute for Machine Learning, Johannes Kepler University Linz, Linz, Austria
| | | | - Victor Greiff
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway.
| |
Collapse
|
30
|
Engelhart E, Emerson R, Shing L, Lennartz C, Guion D, Kelley M, Lin C, Lopez R, Younger D, Walsh ME. A dataset comprised of binding interactions for 104,972 antibodies against a SARS-CoV-2 peptide. Sci Data 2022; 9:653. [PMID: 36289234 PMCID: PMC9606274 DOI: 10.1038/s41597-022-01779-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 10/14/2022] [Indexed: 11/21/2022] Open
Abstract
The dataset presented here contains quantitative binding scores of scFv-format antibodies against a SARS-CoV-2 target peptide collected via an AlphaSeq assay that can be used in the development and benchmarking of machine learning models. Starting from three seed sequences identified from a phage display campaign using a human naïve library, four sets of 29,900 antibodies were designed in silico by creating all k = 1 mutations and random k = 2 and k = 3 mutations throughout the complementary-determining regions (CDRs). Of the 119,600 designs, 104,972 were successfully built in to the AlphaSeq library and target binding was subsequently measured with 71,384 designs resulting in a predicted affinity value for at least one of the triplicate measurements. Data include antibodies with predicted affinity measurements ranging from 37 pM to 22 mM. To our knowledge, this dataset is the largest, publicly available dataset that contains antibody sequences, antigen sequence and quantitative measurements of binding scores and provides an opportunity to serve as a benchmark to evaluate antibody-specific representation models for machine learning. Measurement(s) | Antibody Binding | Technology Type(s) | AlphaSeq | Factor Type(s) | Antibody sequence | Sample Characteristic - Organism | Homo sapiens |
Collapse
Affiliation(s)
| | | | - Leslie Shing
- grid.504876.80000 0001 0684 1626Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA USA
| | - Chelsea Lennartz
- grid.504876.80000 0001 0684 1626Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA USA
| | | | | | | | | | | | - Matthew E. Walsh
- grid.504876.80000 0001 0684 1626Massachusetts Institute of Technology Lincoln Laboratory, Lexington, MA USA ,grid.21107.350000 0001 2171 9311Present Address: Department of Environmental Health and Engineering, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD USA
| |
Collapse
|
31
|
Abstract
An increasingly high-impact application of machine learning in scientific discovery is its use in the design of novel objects with desired properties, such as the design of proteins, small molecules, and materials. Although a variety of algorithms have been developed for this purpose, it remains unclear when practitioners can trust the predictions made by learned models for designed objects, since design algorithms induce a distinctive shift between the training and test data distributions. We propose a method that provides confidence sets for designed objects, which we show satisfy finite-sample guarantees of statistical validity, for any design algorithm involving any learned regression model. Our work enables more trustworthy use of machine learning for design. Many applications of machine-learning methods involve an iterative protocol in which data are collected, a model is trained, and then outputs of that model are used to choose what data to consider next. For example, a data-driven approach for designing proteins is to train a regression model to predict the fitness of protein sequences and then use it to propose new sequences believed to exhibit greater fitness than observed in the training data. Since validating designed sequences in the wet laboratory is typically costly, it is important to quantify the uncertainty in the model’s predictions. This is challenging because of a characteristic type of distribution shift between the training and test data that arises in the design setting—one in which the training and test data are statistically dependent, as the latter is chosen based on the former. Consequently, the model’s error on the test data—that is, the designed sequences—has an unknown and possibly complex relationship with its error on the training data. We introduce a method to construct confidence sets for predictions in such settings, which account for the dependence between the training and test data. The confidence sets we construct have finite-sample guarantees that hold for any regression model, even when it is used to choose the test-time input distribution. As a motivating use case, we use real datasets to demonstrate how our method quantifies uncertainty for the predicted fitness of designed proteins and can therefore be used to select design algorithms that achieve acceptable tradeoffs between high predicted fitness and low predictive uncertainty.
Collapse
|
32
|
Rube HT, Rastogi C, Feng S, Kribelbauer JF, Li A, Becerra B, Melo LAN, Do BV, Li X, Adam HH, Shah NH, Mann RS, Bussemaker HJ. Prediction of protein-ligand binding affinity from sequencing data with interpretable machine learning. Nat Biotechnol 2022; 40:1520-1527. [PMID: 35606422 PMCID: PMC9546773 DOI: 10.1038/s41587-022-01307-0] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2021] [Accepted: 04/04/2022] [Indexed: 01/02/2023]
Abstract
Protein-ligand interactions are increasingly profiled at high throughput using affinity selection and massively parallel sequencing. However, these assays do not provide the biophysical parameters that most rigorously quantify molecular interactions. Here we describe a flexible machine learning method, called ProBound, that accurately defines sequence recognition in terms of equilibrium binding constants or kinetic rates. This is achieved using a multi-layered maximum-likelihood framework that models both the molecular interactions and the data generation process. We show that ProBound quantifies transcription factor (TF) behavior with models that predict binding affinity over a range exceeding that of previous resources; captures the impact of DNA modifications and conformational flexibility of multi-TF complexes; and infers specificity directly from in vivo data such as ChIP-seq without peak calling. When coupled with an assay called KD-seq, it determines the absolute affinity of protein-ligand interactions. We also apply ProBound to profile the kinetics of kinase-substrate interactions. ProBound opens new avenues for decoding biological networks and rationally engineering protein-ligand interactions.
Collapse
Affiliation(s)
- H Tomas Rube
- Department of Bioengineering, University of California, Merced, Merced, CA, USA
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Chaitanya Rastogi
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Siqian Feng
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA
| | | | - Allyson Li
- Department of Chemistry, Columbia University, New York, NY, USA
| | - Basheer Becerra
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Lucas A N Melo
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Bach Viet Do
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Xiaoting Li
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Hammaad H Adam
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Neel H Shah
- Department of Chemistry, Columbia University, New York, NY, USA
| | - Richard S Mann
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, USA
- Department of Systems Biology, Columbia University, New York, NY, USA
| | - Harmen J Bussemaker
- Department of Biological Sciences, Columbia University, New York, NY, USA.
- Department of Systems Biology, Columbia University, New York, NY, USA.
| |
Collapse
|
33
|
Castro E, Godavarthi A, Rubinfien J, Givechian K, Bhaskar D, Krishnaswamy S. Transformer-based protein generation with regularized latent space optimization. NAT MACH INTELL 2022. [DOI: 10.1038/s42256-022-00532-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
34
|
Designing antibodies as therapeutics. Cell 2022; 185:2789-2805. [PMID: 35868279 DOI: 10.1016/j.cell.2022.05.029] [Citation(s) in RCA: 55] [Impact Index Per Article: 27.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 05/18/2022] [Accepted: 05/31/2022] [Indexed: 12/25/2022]
Abstract
Antibody therapeutics are a large and rapidly expanding drug class providing major health benefits. We provide a snapshot of current antibody therapeutics including their formats, common targets, therapeutic areas, and routes of administration. Our focus is on selected emerging directions in antibody design where progress may provide a broad benefit. These topics include enhancing antibodies for cancer, antibody delivery to organs such as the brain, gastrointestinal tract, and lungs, plus antibody developability challenges including immunogenicity risk assessment and mitigation and subcutaneous delivery. Machine learning has the potential, albeit as yet largely unrealized, for a transformative future impact on antibody discovery and engineering.
Collapse
|
35
|
Gao B, Han J, Reddy ST. Learning what not to select for in antibody drug discovery. CELL REPORTS METHODS 2022; 2:100258. [PMID: 35880020 PMCID: PMC9308151 DOI: 10.1016/j.crmeth.2022.100258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Identifying antibodies with high affinity and target specificity is crucial for drug discovery and development; however, filtering out antibody candidates with nonspecific or polyspecific binding profiles is also important. In this issue of Cell Reports Methods, Saksena et al. report a computational counterselection method combining deep sequencing and machine learning for identifying nonspecific antibody candidates and demonstrate that it has advantages over more established molecular counterselection methods.
Collapse
Affiliation(s)
- Beichen Gao
- Department of Biosystems Science and Engineering, ETH Zurich, Basel 4058, Switzerland
| | - Jiami Han
- Department of Biosystems Science and Engineering, ETH Zurich, Basel 4058, Switzerland
| | - Sai T. Reddy
- Department of Biosystems Science and Engineering, ETH Zurich, Basel 4058, Switzerland
| |
Collapse
|
36
|
Saksena SD, Liu G, Banholzer C, Horny G, Ewert S, Gifford DK. Computational counterselection identifies nonspecific therapeutic biologic candidates. CELL REPORTS METHODS 2022; 2:100254. [PMID: 35880012 PMCID: PMC9308162 DOI: 10.1016/j.crmeth.2022.100254] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 04/22/2022] [Accepted: 06/17/2022] [Indexed: 01/13/2023]
Abstract
Effective biologics require high specificity and limited off-target binding, but these properties are not guaranteed by current affinity-selection-based discovery methods. Molecular counterselection against off targets is a technique for identifying nonspecific sequences but is experimentally costly and can fail to eliminate a large fraction of nonspecific sequences. Here, we introduce computational counterselection, a framework for removing nonspecific sequences from pools of candidate biologics using machine learning models. We demonstrate the method using sequencing data from single-target affinity selection of antibodies, bypassing combinatorial experiments. We show that computational counterselection outperforms molecular counterselection by performing cross-target selection and individual binding assays to determine the performance of each method at retaining on-target, specific antibodies and identifying and eliminating off-target, nonspecific antibodies. Further, we show that one can identify generally polyspecific antibody sequences using a general model trained on affinity data from unrelated targets with potential affinity for a broad range of sequences.
Collapse
Affiliation(s)
- Sachit Dinesh Saksena
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Computational and Systems Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Ge Liu
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | | | - Geraldine Horny
- Novartis Institute of BioMedical Research (NIBR), Basel, Switzerland
| | - Stefan Ewert
- Novartis Institute of BioMedical Research (NIBR), Basel, Switzerland
| | - David K Gifford
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| |
Collapse
|
37
|
He B, Li B, Chen X, Zhang Q, Lu C, Yang S, Long J, Ning L, Chen H, Huang J. PDL1Binder: Identifying programmed cell death ligand 1 binding peptides by incorporating next-generation phage display data and different peptide descriptors. Front Microbiol 2022; 13:928774. [PMID: 35910615 PMCID: PMC9335124 DOI: 10.3389/fmicb.2022.928774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 06/27/2022] [Indexed: 11/13/2022] Open
Abstract
Monoclonal antibody drugs targeting the PD-1/PD-L1 pathway have showed efficacy in the treatment of cancer patients, however, they have many intrinsic limitations and inevitable drawbacks. Peptide inhibitors as alternatives might compensate for the drawbacks of current PD-1/PD-L1 interaction blockers. Identifying PD-L1 binding peptides by random peptide library screening is a time-consuming and labor-intensive process. Machine learning-based computational models enable rapid discovery of peptide candidates targeting the PD-1/PD-L1 pathway. In this study, we first employed next-generation phage display (NGPD) biopanning to isolate PD-L1 binding peptides. Different peptide descriptors and feature selection methods as well as diverse machine learning methods were then incorporated to implement predictive models of PD-L1 binding. Finally, we proposed PDL1Binder, an ensemble computational model for efficiently obtaining PD-L1 binding peptides. Our results suggest that predictive models of PD-L1 binding can be learned from deep sequencing data and provide a new path to discover PD-L1 binding peptides. A web server was implemented for PDL1Binder, which is freely available at http://i.uestc.edu.cn/pdl1binder/cgi-bin/PDL1Binder.pl.
Collapse
Affiliation(s)
- Bifang He
- Medical College, Guizhou University, Guiyang, China
| | - Bowen Li
- Medical College, Guizhou University, Guiyang, China
| | - Xue Chen
- Medical College, Guizhou University, Guiyang, China
| | | | - Chunying Lu
- Medical College, Guizhou University, Guiyang, China
| | | | - Jinjin Long
- Medical College, Guizhou University, Guiyang, China
| | - Lin Ning
- School of Healthcare Technology, Chengdu Neusoft University, Chengdu, China
| | - Heng Chen
- Medical College, Guizhou University, Guiyang, China
- *Correspondence: Heng Chen,
| | - Jian Huang
- School of Life Sciences and Technology, University of Electronic Science and Technology of China, Chengdu, China
- Jian Huang,
| |
Collapse
|
38
|
Wilman W, Wróbel S, Bielska W, Deszynski P, Dudzic P, Jaszczyszyn I, Kaniewski J, Młokosiewicz J, Rouyan A, Satława T, Kumar S, Greiff V, Krawczyk K. Machine-designed biotherapeutics: opportunities, feasibility and advantages of deep learning in computational antibody discovery. Brief Bioinform 2022; 23:6643456. [PMID: 35830864 PMCID: PMC9294429 DOI: 10.1093/bib/bbac267] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 05/09/2022] [Accepted: 06/07/2022] [Indexed: 11/13/2022] Open
Abstract
Antibodies are versatile molecular binders with an established and growing role as therapeutics. Computational approaches to developing and designing these molecules are being increasingly used to complement traditional lab-based processes. Nowadays, in silico methods fill multiple elements of the discovery stage, such as characterizing antibody–antigen interactions and identifying developability liabilities. Recently, computational methods tackling such problems have begun to follow machine learning paradigms, in many cases deep learning specifically. This paradigm shift offers improvements in established areas such as structure or binding prediction and opens up new possibilities such as language-based modeling of antibody repertoires or machine-learning-based generation of novel sequences. In this review, we critically examine the recent developments in (deep) machine learning approaches to therapeutic antibody design with implications for fully computational antibody design.
Collapse
|
39
|
Makowski EK, Kinnunen PC, Huang J, Wu L, Smith MD, Wang T, Desai AA, Streu CN, Zhang Y, Zupancic JM, Schardt JS, Linderman JJ, Tessier PM. Co-optimization of therapeutic antibody affinity and specificity using machine learning models that generalize to novel mutational space. Nat Commun 2022; 13:3788. [PMID: 35778381 PMCID: PMC9249733 DOI: 10.1038/s41467-022-31457-3] [Citation(s) in RCA: 43] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 06/20/2022] [Indexed: 11/08/2022] Open
Abstract
Therapeutic antibody development requires selection and engineering of molecules with high affinity and other drug-like biophysical properties. Co-optimization of multiple antibody properties remains a difficult and time-consuming process that impedes drug development. Here we evaluate the use of machine learning to simplify antibody co-optimization for a clinical-stage antibody (emibetuzumab) that displays high levels of both on-target (antigen) and off-target (non-specific) binding. We mutate sites in the antibody complementarity-determining regions, sort the antibody libraries for high and low levels of affinity and non-specific binding, and deep sequence the enriched libraries. Interestingly, machine learning models trained on datasets with binary labels enable predictions of continuous metrics that are strongly correlated with antibody affinity and non-specific binding. These models illustrate strong tradeoffs between these two properties, as increases in affinity along the co-optimal (Pareto) frontier require progressive reductions in specificity. Notably, models trained with deep learning features enable prediction of novel antibody mutations that co-optimize affinity and specificity beyond what is possible for the original antibody library. These findings demonstrate the power of machine learning models to greatly expand the exploration of novel antibody sequence space and accelerate the development of highly potent, drug-like antibodies.
Collapse
Affiliation(s)
- Emily K Makowski
- Department of Pharmaceutical Sciences, University of Michigan, Ann Arbor, MI, 48109, USA
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Patrick C Kinnunen
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Jie Huang
- Department of Pharmaceutical Sciences, University of Michigan, Ann Arbor, MI, 48109, USA
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Lina Wu
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI, 48109, USA
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Matthew D Smith
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI, 48109, USA
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Tiexin Wang
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI, 48109, USA
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Alec A Desai
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI, 48109, USA
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Craig N Streu
- Department of Pharmaceutical Sciences, University of Michigan, Ann Arbor, MI, 48109, USA
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI, 48109, USA
- Department of Chemistry, Albion College, Albion, MI, 49224, USA
| | - Yulei Zhang
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI, 48109, USA
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Jennifer M Zupancic
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI, 48109, USA
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI, 48109, USA
| | - John S Schardt
- Department of Pharmaceutical Sciences, University of Michigan, Ann Arbor, MI, 48109, USA
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI, 48109, USA
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Jennifer J Linderman
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI, 48109, USA
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Peter M Tessier
- Department of Pharmaceutical Sciences, University of Michigan, Ann Arbor, MI, 48109, USA.
- Biointerfaces Institute, University of Michigan, Ann Arbor, MI, 48109, USA.
- Department of Chemical Engineering, University of Michigan, Ann Arbor, MI, 48109, USA.
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, MI, 48109, USA.
| |
Collapse
|
40
|
Dai Z, Saksena SD, Horny G, Banholzer C, Ewert S, Gifford DK. Ultra-high-diversity factorizable libraries for efficient therapeutic discovery. Genome Res 2022; 32:gr.276593.122. [PMID: 35738900 PMCID: PMC9528983 DOI: 10.1101/gr.276593.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2022] [Accepted: 06/22/2022] [Indexed: 01/13/2023]
Abstract
The successful discovery of novel biological therapeutics by selection requires highly diverse libraries of candidate sequences that contain a high proportion of desirable candidates. Here we propose the use of computationally designed factorizable libraries made of concatenated segment libraries as a method of creating large libraries that meet an objective function at low cost. We show that factorizable libraries can be designed efficiently by representing objective functions that describe sequence optimality as an inner product of feature vectors, which we use to design an optimization method we call stochastically annealed product spaces (SAPS). We then use this approach to design diverse and efficient libraries of antibody CDR-H3 sequences with various optimized characteristics.
Collapse
Affiliation(s)
- Zheng Dai
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Sachit D Saksena
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Geraldine Horny
- Novartis Institutes for BioMedical Research (NIBR), CH-4056 Basel, Switzerland
| | - Christine Banholzer
- Novartis Institutes for BioMedical Research (NIBR), CH-4056 Basel, Switzerland
| | - Stefan Ewert
- Novartis Institutes for BioMedical Research (NIBR), CH-4056 Basel, Switzerland
| | - David K Gifford
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| |
Collapse
|
41
|
A Novel Synthetic Antibody Library with Complementarity-Determining Region Diversities Designed for an Improved Amplification Profile. Int J Mol Sci 2022; 23:ijms23116255. [PMID: 35682935 PMCID: PMC9181208 DOI: 10.3390/ijms23116255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Revised: 05/26/2022] [Accepted: 05/31/2022] [Indexed: 12/04/2022] Open
Abstract
Antibody discovery by phage display consists of two phases, i.e., the binding phase and the amplification phase. Ideally, the selection process is dominated by the former, and all the retrieved clones are amplified equally during the latter. In reality, the amplification efficiency of antibody fragments varies widely among different sequences and, after a few rounds of phage display panning, the output repertoire often includes rapidly amplified sequences with low or no binding activity, significantly diminishing the efficiency of antibody isolation. In this work, a novel synthetic single-chain variable fragment (scFv) library with complementarity-determining region (CDR) diversities aimed at improved amplification efficiency was designed and constructed. A previously reported synthetic scFv library with low, non-combinatorial CDR diversities was panned against protein A superantigen, and the library repertoires before and after the panning were analyzed by next generation sequencing. The enrichment or depletion patterns of CDR sequences after panning served as the basis for the design of the new library. Especially for CDR-H3 with a higher and more random diversity, a machine learning method was applied to predict potential fast-amplified sequences among a simulated sequence repertoire. In a direct comparison with the previous generation library, the new library performed better against a panel of antigens in terms of the number of binders isolated, the number of unique sequences, and/or the speed of binder enrichment. Our results suggest that the amplification-centric design of sequence diversity is a valid strategy for the construction of highly functional phage display antibody libraries.
Collapse
|
42
|
Zambrano N, Froechlich G, Lazarevic D, Passariello M, Nicosia A, De Lorenzo C, Morelli MJ, Sasso E. High-Throughput Monoclonal Antibody Discovery from Phage Libraries: Challenging the Current Preclinical Pipeline to Keep the Pace with the Increasing mAb Demand. Cancers (Basel) 2022; 14:cancers14051325. [PMID: 35267633 PMCID: PMC8909429 DOI: 10.3390/cancers14051325] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 02/25/2022] [Accepted: 03/02/2022] [Indexed: 12/13/2022] Open
Abstract
Simple Summary Monoclonal antibodies are increasingly used for a broad range of diseases. Rising demand must face with time time-consuming and laborious processes to isolate novel monoclonal antibodies. Next-generation sequencing coupled to phage display provides timely and sustainable high throughput selection strategy to rapidly access novel target. Here, we describe the current NGS-guided strategies to identify potential binders from enriched sub-libraires by applying a user-friendly informatic pipeline to identify and discard false positive clones. Rescue step and strategies to boost mAb yield are also discussed to improve the limiting selection and screening steps. Abstract Monoclonal antibodies are among the most powerful therapeutics in modern medicine. Since the approval of the first therapeutic antibody in 1986, monoclonal antibodies keep holding great expectations for application in a range of clinical indications, highlighting the need to provide timely and sustainable access to powerful screening options. However, their application in the past has been limited by time-consuming and expensive steps of discovery and production. The screening of antibody repertoires is a laborious step; however, the implementation of next-generation sequencing-guided screening of single-chain antibody fragments has now largely overcome this issue. This review provides a detailed overview of the current strategies for the identification of monoclonal antibodies from phage display-based libraries. We also discuss the challenges and the possible solutions to improve the limiting selection and screening steps, in order to keep pace with the increasing demand for monoclonal antibodies.
Collapse
Affiliation(s)
- Nicola Zambrano
- Dipartimento di Medicina Molecolare e Biotecnologie Mediche, Università Degli Studi di Napoli Federico II, Via Pansini 5, 80131 Napoli, Italy; (G.F.); (M.P.); (A.N.); (C.D.L.)
- CEINGE—Biotecnologie Avanzate s.c. a.r.l., Via Gaetano Salvatore 486, 80145 Naples, Italy
- Correspondence: (N.Z.); (E.S.)
| | - Guendalina Froechlich
- Dipartimento di Medicina Molecolare e Biotecnologie Mediche, Università Degli Studi di Napoli Federico II, Via Pansini 5, 80131 Napoli, Italy; (G.F.); (M.P.); (A.N.); (C.D.L.)
- CEINGE—Biotecnologie Avanzate s.c. a.r.l., Via Gaetano Salvatore 486, 80145 Naples, Italy
| | - Dejan Lazarevic
- Center for Omics Sciences Ospedale San Raffaele, Via Olgettina 58, 20132 Milano, Italy; (D.L.); (M.J.M.)
| | - Margherita Passariello
- Dipartimento di Medicina Molecolare e Biotecnologie Mediche, Università Degli Studi di Napoli Federico II, Via Pansini 5, 80131 Napoli, Italy; (G.F.); (M.P.); (A.N.); (C.D.L.)
- CEINGE—Biotecnologie Avanzate s.c. a.r.l., Via Gaetano Salvatore 486, 80145 Naples, Italy
| | - Alfredo Nicosia
- Dipartimento di Medicina Molecolare e Biotecnologie Mediche, Università Degli Studi di Napoli Federico II, Via Pansini 5, 80131 Napoli, Italy; (G.F.); (M.P.); (A.N.); (C.D.L.)
- CEINGE—Biotecnologie Avanzate s.c. a.r.l., Via Gaetano Salvatore 486, 80145 Naples, Italy
| | - Claudia De Lorenzo
- Dipartimento di Medicina Molecolare e Biotecnologie Mediche, Università Degli Studi di Napoli Federico II, Via Pansini 5, 80131 Napoli, Italy; (G.F.); (M.P.); (A.N.); (C.D.L.)
- CEINGE—Biotecnologie Avanzate s.c. a.r.l., Via Gaetano Salvatore 486, 80145 Naples, Italy
| | - Marco J. Morelli
- Center for Omics Sciences Ospedale San Raffaele, Via Olgettina 58, 20132 Milano, Italy; (D.L.); (M.J.M.)
| | - Emanuele Sasso
- Dipartimento di Medicina Molecolare e Biotecnologie Mediche, Università Degli Studi di Napoli Federico II, Via Pansini 5, 80131 Napoli, Italy; (G.F.); (M.P.); (A.N.); (C.D.L.)
- CEINGE—Biotecnologie Avanzate s.c. a.r.l., Via Gaetano Salvatore 486, 80145 Naples, Italy
- Correspondence: (N.Z.); (E.S.)
| |
Collapse
|
43
|
Khetan R, Curtis R, Deane CM, Hadsund JT, Kar U, Krawczyk K, Kuroda D, Robinson SA, Sormanni P, Tsumoto K, Warwicker J, Martin ACR. Current advances in biopharmaceutical informatics: guidelines, impact and challenges in the computational developability assessment of antibody therapeutics. MAbs 2022; 14:2020082. [PMID: 35104168 PMCID: PMC8812776 DOI: 10.1080/19420862.2021.2020082] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Therapeutic monoclonal antibodies and their derivatives are key components of clinical pipelines in the global biopharmaceutical industry. The availability of large datasets of antibody sequences, structures, and biophysical properties is increasingly enabling the development of predictive models and computational tools for the "developability assessment" of antibody drug candidates. Here, we provide an overview of the antibody informatics tools applicable to the prediction of developability issues such as stability, aggregation, immunogenicity, and chemical degradation. We further evaluate the opportunities and challenges of using biopharmaceutical informatics for drug discovery and optimization. Finally, we discuss the potential of developability guidelines based on in silico metrics that can be used for the assessment of antibody stability and manufacturability.
Collapse
Affiliation(s)
- Rahul Khetan
- Manchester Institute of Biotechnology, University of Manchester, Manchester, UK
| | - Robin Curtis
- Manchester Institute of Biotechnology, University of Manchester, Manchester, UK
| | | | | | - Uddipan Kar
- Department of Biological Engineering, Massachusetts Institute of Technology (MIT), Cambridge, MA, USA
| | | | - Daisuke Kuroda
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, Japan.,Medical Device Development and Regulation Research Center, School of Engineering, The University of Tokyo, Tokyo, Japan.,Department of Chemistry and Biotechnology, School of Engineering, The University of Tokyo, Tokyo, Japan
| | | | - Pietro Sormanni
- Chemistry of Health, Yusuf Hamied Department of Chemistry, University of Cambridge
| | - Kouhei Tsumoto
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, Japan.,Medical Device Development and Regulation Research Center, School of Engineering, The University of Tokyo, Tokyo, Japan.,Department of Chemistry and Biotechnology, School of Engineering, The University of Tokyo, Tokyo, Japan.,The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Jim Warwicker
- Manchester Institute of Biotechnology, University of Manchester, Manchester, UK
| | - Andrew C R Martin
- Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London, UK
| |
Collapse
|
44
|
Philpott D, Gomis S, Wang H, Atwal R, Kelil A, Sack T, Morningstar B, Burnie C, Sargent EH, Angers S, Sidhu S, Kelley SO. Rapid On-Cell Selection of High-Performance Human Antibodies. ACS CENTRAL SCIENCE 2022; 8:102-109. [PMID: 35106377 PMCID: PMC8796304 DOI: 10.1021/acscentsci.1c01205] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Indexed: 06/14/2023]
Abstract
Phage display is a critical tool for developing antibodies. However, existing approaches require many time-consuming rounds of biopanning and screening of potential candidates due to a high rate of failure during validation. Herein, we present a rapid on-cell phage display platform which recapitulates the complex in vivo binding environment to produce high-performance human antibodies in a short amount of time. Selection is performed in a highly stringent heterogeneous mixture of cells to quickly remove nonspecific binders. A microfluidic platform then separates antigen-presenting cells with high throughput and specificity. An unsupervised machine learning algorithm analyzes sequences of phage from all pools to identify the structural trends that contribute to affinity and proposes ideal candidates for validation. In a proof-of-concept screen against human Frizzled-7, a key ligand in the Wnt signaling pathway, antibodies with picomolar affinity were discovered in two rounds of selection that outperformed current gold-standard reagents. This approach, termed μCellect, is low cost, high throughput, and compatible with a wide variety of cell types, enabling widespread adoption for antibody development.
Collapse
Affiliation(s)
- David
N. Philpott
- Edward
S. Rogers Sr. Department of Electrical & Computer Engineering, University of Toronto, Toronto, Ontario M5S 3G8, Canada
| | - Surath Gomis
- Edward
S. Rogers Sr. Department of Electrical & Computer Engineering, University of Toronto, Toronto, Ontario M5S 3G8, Canada
| | - Hansen Wang
- Department
of Pharmaceutical Sciences, University of
Toronto, Toronto, Ontario M5S 3M2, Canada
| | - Randy Atwal
- Department
of Pharmaceutical Sciences, University of
Toronto, Toronto, Ontario M5S 3M2, Canada
| | - Abdellali Kelil
- Donnelly
Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Tanja Sack
- Department
of Pharmaceutical Sciences, University of
Toronto, Toronto, Ontario M5S 3M2, Canada
| | - Brandon Morningstar
- Department
of Pharmaceutical Sciences, University of
Toronto, Toronto, Ontario M5S 3M2, Canada
| | - Chris Burnie
- Department
of Pharmaceutical Sciences, University of
Toronto, Toronto, Ontario M5S 3M2, Canada
| | - Edward H. Sargent
- Edward
S. Rogers Sr. Department of Electrical & Computer Engineering, University of Toronto, Toronto, Ontario M5S 3G8, Canada
| | - Stephane Angers
- Department
of Pharmaceutical Sciences, University of
Toronto, Toronto, Ontario M5S 3M2, Canada
| | - Sachdev Sidhu
- Donnelly
Centre, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Shana O. Kelley
- Department
of Pharmaceutical Sciences, University of
Toronto, Toronto, Ontario M5S 3M2, Canada
| |
Collapse
|
45
|
Akbar R, Bashour H, Rawat P, Robert PA, Smorodina E, Cotet TS, Flem-Karlsen K, Frank R, Mehta BB, Vu MH, Zengin T, Gutierrez-Marcos J, Lund-Johansen F, Andersen JT, Greiff V. Progress and challenges for the machine learning-based design of fit-for-purpose monoclonal antibodies. MAbs 2022; 14:2008790. [PMID: 35293269 PMCID: PMC8928824 DOI: 10.1080/19420862.2021.2008790] [Citation(s) in RCA: 29] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2021] [Revised: 11/04/2021] [Accepted: 11/17/2021] [Indexed: 12/15/2022] Open
Abstract
Although the therapeutic efficacy and commercial success of monoclonal antibodies (mAbs) are tremendous, the design and discovery of new candidates remain a time and cost-intensive endeavor. In this regard, progress in the generation of data describing antigen binding and developability, computational methodology, and artificial intelligence may pave the way for a new era of in silico on-demand immunotherapeutics design and discovery. Here, we argue that the main necessary machine learning (ML) components for an in silico mAb sequence generator are: understanding of the rules of mAb-antigen binding, capacity to modularly combine mAb design parameters, and algorithms for unconstrained parameter-driven in silico mAb sequence synthesis. We review the current progress toward the realization of these necessary components and discuss the challenges that must be overcome to allow the on-demand ML-based discovery and design of fit-for-purpose mAb therapeutic candidates.
Collapse
Affiliation(s)
- Rahmad Akbar
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Habib Bashour
- School of Life Sciences, University of Warwick, Coventry, UK
| | - Puneet Rawat
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, India
| | - Philippe A. Robert
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Eva Smorodina
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Russia
| | | | - Karine Flem-Karlsen
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
- Institute of Clinical Medicine, Department of Pharmacology, University of Oslo and Oslo University Hospital, Norway
| | - Robert Frank
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Brij Bhushan Mehta
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| | - Mai Ha Vu
- Department of Linguistics and Scandinavian Studies, University of Oslo, Norway
| | - Talip Zengin
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
- Department of Bioinformatics, Mugla Sitki Kocman University, Turkey
| | | | | | - Jan Terje Andersen
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
- Institute of Clinical Medicine, Department of Pharmacology, University of Oslo and Oslo University Hospital, Norway
| | - Victor Greiff
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway
| |
Collapse
|
46
|
Abstract
The antibody drug field has continually sought improvements to methods for candidate discovery and engineering. Historically, most such methods have been laboratory-based, but informatics methods have recently started to make an impact. Deep learning, a subfield of machine learning, is rapidly gaining prominence in the biomedical research. Recent advances in microfluidics technologies and next-generation sequencing have not only revolutionized therapeutic antibody discovery, but also contributed to a vast amount of antibody repertoire sequencing data, providing opportunities for deep learning-based applications. Previously, we used microfluidics, yeast display, and deep sequencing to generate a panel of binder and non-binder antibody sequences to the cancer immunotherapy targets PD-1 and CTLA-4. Here we encoded the antibody light and heavy chain complementarity-determining regions (CDR3s) into antibody images, then built and trained convolutional neural network models to classify binders and non-binders. To improve model interpretability, we performed in silico mutagenesis to identify CDR3 residues that were important for binder classification. We further built generative deep learning models using generative adversarial network models to produce synthetic antibodies against PD-1 and CTLA-4. Our models generated variable length CDR3 sequences that resemble real sequences. Overall, our study demonstrates that deep learning methods can be leveraged to mine and learn patterns in antibody sequences, offering insights into antibody engineering, optimization, and discovery.
Collapse
Affiliation(s)
- Yoong Wearn Lim
- GigaGen Inc. (A Grifols Company), South San Francisco, CA, USA
| | - Adam S Adler
- GigaGen Inc. (A Grifols Company), South San Francisco, CA, USA
| | - David S Johnson
- GigaGen Inc. (A Grifols Company), South San Francisco, CA, USA
| |
Collapse
|
47
|
Moura-Sampaio J, Faustino AF, Boeuf R, Antunes MA, Ewert S, Batista AP. Reconstruction of full antibody sequences in NGS datasets and accurate VL:VH coupling by cluster coordinate matching of non-overlapping reads. Comput Struct Biotechnol J 2022; 20:2723-2727. [PMID: 35832623 PMCID: PMC9168528 DOI: 10.1016/j.csbj.2022.05.054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 05/27/2022] [Accepted: 05/27/2022] [Indexed: 11/27/2022] Open
Abstract
Next-generation sequencing (NGS) is an indispensable tool in antibody discovery projects. However, the limits on NGS read length make it difficult to reconstruct full antibody sequences from the sequencing runs, especially if the six CDRs are randomized. To overcome that, we took advantage of Illumina’s cluster mapping capabilities to pair non-overlapping reads and reconstruct full Fab sequences with accurate VL:VH pairings. The method relies on in silico cluster coordinate information, and not on extensive in vitro manipulation, making the protocol easily deployable and less prone to PCR-derived errors. This work maintains the throughput necessary for antibody discovery campaigns, and a high degree of fidelity, which potentiates not only phage-display and synthetic library-based discovery methods, but also the NGS-driven analysis of naïve and immune libraries.
Collapse
|
48
|
Adaptive machine learning for protein engineering. Curr Opin Struct Biol 2021; 72:145-152. [PMID: 34896756 DOI: 10.1016/j.sbi.2021.11.002] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 09/14/2021] [Accepted: 11/08/2021] [Indexed: 11/22/2022]
Abstract
Machine-learning models that learn from data to predict how protein sequence encodes function are emerging as a useful protein engineering tool. However, when using these models to suggest new protein designs, one must deal with the vast combinatorial complexity of protein sequences. Here, we review how to use a sequence-to-function machine-learning surrogate model to select sequences for experimental measurement. First, we discuss how to select sequences through a single round of machine-learning optimization. Then, we discuss sequential optimization, where the goal is to discover optimized sequences and improve the model across multiple rounds of training, optimization, and experimental measurement.
Collapse
|
49
|
Hanning KR, Minot M, Warrender AK, Kelton W, Reddy ST. Deep mutational scanning for therapeutic antibody engineering. Trends Pharmacol Sci 2021; 43:123-135. [PMID: 34895944 DOI: 10.1016/j.tips.2021.11.010] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 11/02/2021] [Accepted: 11/10/2021] [Indexed: 12/24/2022]
Abstract
The biophysical and functional properties of monoclonal antibody (mAb) drug candidates are often improved by protein engineering methods to increase the probability of clinical efficacy. One emerging method is deep mutational scanning (DMS) which combines the power of exhaustive protein mutagenesis and functional screening with deep sequencing and bioinformatics. The application of DMS has yielded significant improvements to the affinity, specificity, and stability of several preclinical antibodies alongside novel applications such as introducing multi-specific binding properties. DMS has also been applied directly on target antigens to precisely map antibody-binding epitopes and notably to profile the mutational escape potential of viral targets (e.g., SARS-CoV-2 variants). Finally, DMS combined with machine learning is enabling advances in the computational screening and engineering of therapeutic antibodies.
Collapse
Affiliation(s)
- Kyrin R Hanning
- Te Huataki Waiora School of Health, University of Waikato, Hamilton 3240, New Zealand
| | - Mason Minot
- Department of Biosystems Science and Engineering, Eidgenössische Technische Hochschule (ETH) Zurich, Basel 4058, Switzerland
| | - Annmaree K Warrender
- Te Huataki Waiora School of Health, University of Waikato, Hamilton 3240, New Zealand
| | - William Kelton
- Te Huataki Waiora School of Health, University of Waikato, Hamilton 3240, New Zealand.
| | - Sai T Reddy
- Department of Biosystems Science and Engineering, Eidgenössische Technische Hochschule (ETH) Zurich, Basel 4058, Switzerland.
| |
Collapse
|
50
|
Saito Y, Oikawa M, Sato T, Nakazawa H, Ito T, Kameda T, Tsuda K, Umetsu M. Machine-Learning-Guided Library Design Cycle for Directed Evolution of Enzymes: The Effects of Training Data Composition on Sequence Space Exploration. ACS Catal 2021. [DOI: 10.1021/acscatal.1c03753] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Affiliation(s)
- Yutaka Saito
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-ku, Tokyo 135-0064, Japan
- AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), 3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555, Japan
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan
- Center for Advanced Intelligence Project, RIKEN, 1-4-1 Nihombashi, Chuo-ku, Tokyo 103-0027, Japan
| | - Misaki Oikawa
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aoba, Aramaki, Aoba-ku, Sendai 980-8579, Japan
| | - Takumi Sato
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aoba, Aramaki, Aoba-ku, Sendai 980-8579, Japan
| | - Hikaru Nakazawa
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aoba, Aramaki, Aoba-ku, Sendai 980-8579, Japan
| | - Tomoyuki Ito
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aoba, Aramaki, Aoba-ku, Sendai 980-8579, Japan
| | - Tomoshi Kameda
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-ku, Tokyo 135-0064, Japan
- Center for Advanced Intelligence Project, RIKEN, 1-4-1 Nihombashi, Chuo-ku, Tokyo 103-0027, Japan
| | - Koji Tsuda
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan
- Center for Advanced Intelligence Project, RIKEN, 1-4-1 Nihombashi, Chuo-ku, Tokyo 103-0027, Japan
- Research and Services Division of Materials Data and Integrated System, National Institute for Materials Science, 1-2-1 Sengen, Tsukuba, Ibaraki 305-0047, Japan
| | - Mitsuo Umetsu
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, 6-6-11 Aoba, Aramaki, Aoba-ku, Sendai 980-8579, Japan
- Center for Advanced Intelligence Project, RIKEN, 1-4-1 Nihombashi, Chuo-ku, Tokyo 103-0027, Japan
| |
Collapse
|