1
|
Barton J, Gaspariunas A, Galson JD, Leem J. Building Representation Learning Models for Antibody Comprehension. Cold Spring Harb Perspect Biol 2024; 16:a041462. [PMID: 38012013 PMCID: PMC10910360 DOI: 10.1101/cshperspect.a041462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Antibodies are versatile proteins with both the capacity to bind a broad range of targets and a proven track record as some of the most successful therapeutics. However, the development of novel antibody therapeutics is a lengthy and costly process. It is challenging to predict the functional and biophysical properties of antibodies from their amino acid sequence alone, requiring numerous experiments for full characterization. Machine learning, specifically deep representation learning, has emerged as a family of methods that can complement wet lab approaches and accelerate the overall discovery and engineering process. Here, we review advances in antibody sequence representation learning, and how this has improved antibody structure prediction and facilitated antibody optimization. We discuss challenges in the development and implementation of such models, such as the lack of publicly available, well-curated antibody function data and highlight opportunities for improvement. These and future advances in machine learning for antibody sequences have the potential to increase the success rate in developing new therapeutics, resulting in broader access to transformative medicines and improved patient outcomes.
Collapse
Affiliation(s)
- Justin Barton
- Alchemab Therapeutics Ltd, London N1C 4AX, United Kingdom
| | | | - Jacob D Galson
- Alchemab Therapeutics Ltd, London N1C 4AX, United Kingdom
| | - Jinwoo Leem
- Alchemab Therapeutics Ltd, London N1C 4AX, United Kingdom
| |
Collapse
|
2
|
Dudzic P, Chomicz D, Kończak J, Satława T, Janusz B, Wrobel S, Gawłowski T, Jaszczyszyn I, Bielska W, Demharter S, Spreafico R, Schulte L, Martin K, Comeau SR, Krawczyk K. Large-scale data mining of four billion human antibody variable regions reveals convergence between therapeutic and natural antibodies that constrains search space for biologics drug discovery. MAbs 2024; 16:2361928. [PMID: 38844871 PMCID: PMC11164219 DOI: 10.1080/19420862.2024.2361928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 05/27/2024] [Indexed: 06/12/2024] Open
Abstract
The naïve human antibody repertoire has theoretical access to an estimated > 1015 antibodies. Identifying subsets of this prohibitively large space where therapeutically relevant antibodies may be found is useful for development of these agents. It was previously demonstrated that, despite the immense sequence space, different individuals can produce the same antibodies. It was also shown that therapeutic antibodies, which typically follow seemingly unnatural development processes, can arise independently naturally. To check for biases in how the sequence space is explored, we data mined public repositories to identify 220 bioprojects with a combined seven billion reads. Of these, we created a subset of human bioprojects that we make available as the AbNGS database (https://naturalantibody.com/ngs/). AbNGS contains 135 bioprojects with four billion productive human heavy variable region sequences and 385 million unique complementarity-determining region (CDR)-H3s. We find that 270,000 (0.07% of 385 million) unique CDR-H3s are highly public in that they occur in at least five of 135 bioprojects. Of 700 unique therapeutic CDR-H3, a total of 6% has direct matches in the small set of 270,000. This observation extends to a match between CDR-H3 and V-gene call as well. Thus, the subspace of shared ('public') CDR-H3s shows utility for serving as a starting point for therapeutic antibody design.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | - Lukas Schulte
- Global Computational Biology & Digital Sciences, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riß, Germany
| | - Kyle Martin
- Biotherapeutics Discovery, Boehringer Ingelheim, Ridgefield, CT, USA
| | - Stephen R. Comeau
- Biotherapeutics Discovery, Boehringer Ingelheim, Ridgefield, CT, USA
| | | |
Collapse
|
3
|
Briney B. AntiRef: reference clusters of human antibody sequences. BIOINFORMATICS ADVANCES 2023; 3:vbad109. [PMID: 37886711 PMCID: PMC10598580 DOI: 10.1093/bioadv/vbad109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Revised: 08/02/2023] [Accepted: 08/21/2023] [Indexed: 10/28/2023]
Abstract
Motivation Genetic biases in the human antibody repertoire result in publicly available antibody sequence datasets that contain many duplicate or highly similar sequences. Available datasets are further skewed by the predominance of studies focused on specific disease states, primarily cancer, autoimmunity, and a small number of infectious diseases that includes HIV, influenza, and SARS-CoV-2. These biases and redundancies are a barrier to rapid similarity searches and reduce the efficiency with which these datasets can be used to train statistical or machine-learning models. Identity-based clustering provides a solution; however, the extremely large size of available antibody sequence datasets makes such clustering operations computationally intensive and potentially out of reach for many scientists and researchers who would benefit from such data. Results Antibody Reference Clusters (AntiRef), which is modeled after UniRef, provides clustered datasets of filtered human antibody sequences. Due to the modular nature of recombined antibody genes, the clustering thresholds used by UniRef for general protein sequences are suboptimal for antibody clustering. Starting with an input dataset of ∼451M full-length, productive human antibody sequences, AntiRef provides reference datasets clustered at a range of antibody-optimized identity thresholds. AntiRef90 is one-third the size of the input dataset and less than half the size of the non-redundant AntiRef100. Availability and implementation AntiRef datasets are available on Zenodo (zenodo.org/record/7474336). All code used to generate AntiRef is available on GitHub (github.com/briney/antiref). The AntiRef versioning scheme (current version: v2022.12.14) refers to the date on which sequences were retrieved from OAS.
Collapse
Affiliation(s)
- Bryan Briney
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, United States
- Center for Viral Systems Biology, The Scripps Research Institute, La Jolla, CA 92037, United States
- The Multiomics Vaccine Evaluation Consortium, The Scripps Research Institute, La Jolla, CA 92037, United States
- Scripps Center for HIV/AIDS Vaccine Development, The Scripps Research Institute, La Jolla, CA 92037, United States
- San Diego Center for AIDS Research, The Scripps Research Institute, La Jolla, CA 92037, United States
| |
Collapse
|
4
|
Olsen TH, Abanades B, Moal IH, Deane CM. KA-Search, a method for rapid and exhaustive sequence identity search of known antibodies. Sci Rep 2023; 13:11612. [PMID: 37463925 DOI: 10.1038/s41598-023-38108-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Accepted: 07/03/2023] [Indexed: 07/20/2023] Open
Abstract
Antibodies with similar amino acid sequences, especially across their complementarity-determining regions, often share properties. Finding that an antibody of interest has a similar sequence to naturally expressed antibodies in healthy or diseased repertoires is a powerful approach for the prediction of antibody properties, such as immunogenicity or antigen specificity. However, as the number of available antibody sequences is now in the billions and continuing to grow, repertoire mining for similar sequences has become increasingly computationally expensive. Existing approaches are limited by either being low-throughput, non-exhaustive, not antibody specific, or only searching against entire chain sequences. Therefore, there is a need for a specialized tool, optimized for a rapid and exhaustive search of any antibody region against all known antibodies, to better utilize the full breadth of available repertoire sequences. We introduce Known Antibody Search (KA-Search), a tool that allows for the rapid search of billions of antibody variable domains by amino acid sequence identity across either the variable domain, the complementarity-determining regions, or a user defined antibody region. We show KA-Search in operation on the [Formula: see text]2.4 billion antibody sequences available in the OAS database. KA-Search can be used to find the most similar sequences from OAS within 30 minutes and a representative subset of 10 million sequences in less than 9 seconds. We give examples of how KA-Search can be used to obtain new insights about an antibody of interest. KA-Search is freely available at https://github.com/oxpig/kasearch .
Collapse
Affiliation(s)
- Tobias H Olsen
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK
| | - Brennan Abanades
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK
| | - Iain H Moal
- GSK Medicines Research Centre, GlaxoSmithKline plc, Stevenage, SG1 2NY, UK
| | - Charlotte M Deane
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK.
- Exscientia plc, Oxford, OX4 4GE, UK.
| |
Collapse
|
5
|
Wilman W, Wróbel S, Bielska W, Deszynski P, Dudzic P, Jaszczyszyn I, Kaniewski J, Młokosiewicz J, Rouyan A, Satława T, Kumar S, Greiff V, Krawczyk K. Machine-designed biotherapeutics: opportunities, feasibility and advantages of deep learning in computational antibody discovery. Brief Bioinform 2022; 23:6643456. [PMID: 35830864 PMCID: PMC9294429 DOI: 10.1093/bib/bbac267] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 05/09/2022] [Accepted: 06/07/2022] [Indexed: 11/13/2022] Open
Abstract
Antibodies are versatile molecular binders with an established and growing role as therapeutics. Computational approaches to developing and designing these molecules are being increasingly used to complement traditional lab-based processes. Nowadays, in silico methods fill multiple elements of the discovery stage, such as characterizing antibody–antigen interactions and identifying developability liabilities. Recently, computational methods tackling such problems have begun to follow machine learning paradigms, in many cases deep learning specifically. This paradigm shift offers improvements in established areas such as structure or binding prediction and opens up new possibilities such as language-based modeling of antibody repertoires or machine-learning-based generation of novel sequences. In this review, we critically examine the recent developments in (deep) machine learning approaches to therapeutic antibody design with implications for fully computational antibody design.
Collapse
|
6
|
Hummer AM, Abanades B, Deane CM. Advances in computational structure-based antibody design. Curr Opin Struct Biol 2022; 74:102379. [PMID: 35490649 DOI: 10.1016/j.sbi.2022.102379] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 02/28/2022] [Accepted: 03/17/2022] [Indexed: 12/12/2022]
Abstract
Antibodies are currently the most important class of biotherapeutics and are used to treat numerous diseases. Recent advances in computational methods are ushering in a new era of antibody design, driven in part by accurate structure prediction. Previously, structure-based antibody design has been limited to a relatively small number of cases where accurate structures or models of both the target antigen and antibody were available. As we move towards a time where it is possible to accurately model most antibodies and antigens, and to reliably predict their binding site, there is vast potential for true computational antibody design. In this review, we describe the latest methods that promise to launch a paradigm shift towards entirely in silico structure-based antibody design.
Collapse
Affiliation(s)
- Alissa M Hummer
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford OX1 3LB, UK. https://twitter.com/@AlissaHummer
| | - Brennan Abanades
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford OX1 3LB, UK. https://twitter.com/@brennanaba
| | - Charlotte M Deane
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford OX1 3LB, UK.
| |
Collapse
|
7
|
Brooks BD, Beland A, Aguero G, Taylor N, Towne FD. Moving beyond Titers. Vaccines (Basel) 2022; 10:vaccines10050683. [PMID: 35632439 PMCID: PMC9144832 DOI: 10.3390/vaccines10050683] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 04/18/2022] [Accepted: 04/20/2022] [Indexed: 01/27/2023] Open
Abstract
Vaccination to prevent and even eliminate disease is amongst the greatest achievements of modern medicine. Opportunities remain in vaccine development to improve protection across the whole population. A next step in vaccine development is the detailed molecular characterization of individual humoral immune responses against a pathogen, especially the rapidly evolving pathogens. New technologies such as sequencing the immune repertoire in response to disease, immunogenomics/vaccinomics, particularly the individual HLA variants, and high-throughput epitope characterization offer new insights into disease protection. Here, we highlight the emerging technologies that could be used to identify variation within the human population, facilitate vaccine discovery, improve vaccine safety and efficacy, and identify mechanisms of generating immunological memory. In today’s vaccine-hesitant climate, these techniques used individually or especially together have the potential to improve vaccine effectiveness and safety and thus vaccine uptake rates. We highlight the importance of using these techniques in combination to understand the humoral immune response as a whole after vaccination to move beyond neutralizing titers as the standard for immunogenicity and vaccine efficacy, especially in clinical trials.
Collapse
Affiliation(s)
- Benjamin D. Brooks
- Department of Biomedical Sciences, Rocky Vista University, Ivins, UT 84738, USA
- Inovan Inc., Fargo, ND 58103, USA
- Correspondence: ; Tel.: +1-(435)-222-1304
| | - Alexander Beland
- College of Osteopathic Medicine, Rocky Vista University, Parker, CO 80112, USA; (A.B.); (G.A.); (N.T.); (F.D.T.)
| | - Gabriel Aguero
- College of Osteopathic Medicine, Rocky Vista University, Parker, CO 80112, USA; (A.B.); (G.A.); (N.T.); (F.D.T.)
| | - Nicholas Taylor
- College of Osteopathic Medicine, Rocky Vista University, Parker, CO 80112, USA; (A.B.); (G.A.); (N.T.); (F.D.T.)
| | - Francina D. Towne
- College of Osteopathic Medicine, Rocky Vista University, Parker, CO 80112, USA; (A.B.); (G.A.); (N.T.); (F.D.T.)
| |
Collapse
|
8
|
Młokosiewicz J, Deszyński P, Wilman W, Jaszczyszyn I, Ganesan R, Kovaltsuk A, Leem J, Galson J, Krawczyk K. AbDiver-A tool to explore the natural antibody landscape to aid therapeutic design. Bioinformatics 2022; 38:2628-2630. [PMID: 35274671 PMCID: PMC9048670 DOI: 10.1093/bioinformatics/btac151] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Revised: 02/05/2022] [Accepted: 03/10/2022] [Indexed: 11/13/2022] Open
Abstract
Motivation Rational design of therapeutic antibodies can be improved by harnessing the natural sequence diversity of these molecules. Our understanding of the diversity of antibodies has recently been greatly facilitated through the deposition of hundreds of millions of human antibody sequences in next-generation sequencing (NGS) repositories. Contrasting a query therapeutic antibody sequence to naturally observed diversity in similar antibody sequences from NGS can provide a mutational roadmap for antibody engineers designing biotherapeutics. Because of the sheer scale of the antibody NGS datasets, performing queries across them is computationally challenging. Results To facilitate harnessing antibody NGS data, we developed AbDiver (http://naturalantibody.com/abdiver), a free portal allowing users to compare their query sequences to those observed in the natural repertoires. AbDiver offers three antibody-specific use-cases: (i) compare a query antibody to positional variability statistics precomputed from multiple independent studies, (ii) retrieve close full variable sequence matches to a query antibody and (iii) retrieve CDR3 or clonotype matches to a query antibody. We applied our system to a set of 742 therapeutic antibodies, demonstrating that for each use-case our system can retrieve relevant results for most sequences. AbDiver facilitates the navigation of vast antibody mutation space for the purpose of rational therapeutic antibody design. Availability and implementation AbDiver is freely accessible at http://naturalantibody.com/abdiver. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
|
9
|
Gupta P, Makowski EK, Kumar S, Zhang Y, Scheer JM, Tessier PM. Antibodies with Weakly Basic Isoelectric Points Minimize Trade-offs between Formulation and Physiological Colloidal Properties. Mol Pharm 2022; 19:775-787. [PMID: 35108018 PMCID: PMC9350878 DOI: 10.1021/acs.molpharmaceut.1c00373] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The widespread interest in antibody therapeutics has led to much focus on identifying antibody candidates with favorable developability properties. In particular, there is broad interest in identifying antibody candidates with highly repulsive self-interactions in standard formulations (e.g., low ionic strength buffers at pH 5-6) for high solubility and low viscosity. Likewise, there is also broad interest in identifying antibody candidates with low levels of non-specific interactions in physiological solution conditions (PBS, pH 7.4) to promote favorable pharmacokinetic properties. To what extent antibodies that possess both highly repulsive self-interactions in standard formulations and weak non-specific interactions in physiological solution conditions can be systematically identified remains unclear and is a potential impediment to successful therapeutic drug development. Here, we evaluate these two properties for 42 IgG1 variants based on the variable fragments (Fvs) from four clinical-stage antibodies and complementarity-determining regions from 10 clinical-stage antibodies. Interestingly, we find that antibodies with the strongest repulsive self-interactions in a standard formulation (pH 6 and 10 mM histidine) display the strongest non-specific interactions in physiological solution conditions. Conversely, antibodies with the weakest non-specific interactions under physiological conditions display the least repulsive self-interactions in standard formulations. This behavior can be largely explained by the antibody isoelectric point, as highly basic antibodies that are highly positively charged under standard formulation conditions (pH 5-6) promote repulsive self-interactions that mediate high colloidal stability but also mediate strong non-specific interactions with negatively charged biomolecules at physiological pH and vice versa for antibodies with negatively charged Fv regions. Therefore, IgG1s with weakly basic isoelectric points between 8 and 8.5 and Fv isoelectric points between 7.5 and 9 typically display the best combinations of strong repulsive self-interactions and weak non-specific interactions. We expect that these findings will improve the identification and engineering of antibody candidates with drug-like biophysical properties.
Collapse
Affiliation(s)
- Priyanka Gupta
- Biochemistry and Biophysics Department, Rensselaer Polytechnic Institute, Troy, New York 12180, United States.,Biotherapeutics Molecule Discovery Department, Boehringer Ingelheim Pharmaceuticals Inc., Ridgefield, Connecticut 06877, United States
| | - Emily K Makowski
- Department of Pharmaceutical Sciences, Biointerfaces Institute, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Sandeep Kumar
- Biotherapeutics Molecule Discovery Department, Boehringer Ingelheim Pharmaceuticals Inc., Ridgefield, Connecticut 06877, United States
| | - Yulei Zhang
- Department of Chemical Engineering, Biointerfaces Institute, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Justin M Scheer
- Biotherapeutics Molecule Discovery Department, Boehringer Ingelheim Pharmaceuticals Inc., Ridgefield, Connecticut 06877, United States.,Janssen R&D, South San Francisco, California 94080, United States
| | - Peter M Tessier
- Biochemistry and Biophysics Department, Rensselaer Polytechnic Institute, Troy, New York 12180, United States.,Department of Pharmaceutical Sciences, Biointerfaces Institute, University of Michigan, Ann Arbor, Michigan 48109, United States.,Department of Chemical Engineering, Biointerfaces Institute, University of Michigan, Ann Arbor, Michigan 48109, United States.,Department of Biomedical Engineering, Biointerfaces Institute, University of Michigan, Ann Arbor, Michigan 48109, United States
| |
Collapse
|
10
|
Prihoda D, Maamary J, Waight A, Juan V, Fayadat-Dilman L, Svozil D, Bitton DA. BioPhi: A platform for antibody design, humanization, and humanness evaluation based on natural antibody repertoires and deep learning. MAbs 2022; 14:2020203. [PMID: 35133949 PMCID: PMC8837241 DOI: 10.1080/19420862.2021.2020203] [Citation(s) in RCA: 47] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Despite recent advances in transgenic animal models and display technologies, humanization of mouse sequences remains one of the main routes for therapeutic antibody development. Traditionally, humanization is manual, laborious, and requires expert knowledge. Although automation efforts are advancing, existing methods are either demonstrated on a small scale or are entirely proprietary. To predict the immunogenicity risk, the human-likeness of sequences can be evaluated using existing humanness scores, but these lack diversity, granularity or interpretability. Meanwhile, immune repertoire sequencing has generated rich antibody libraries such as the Observed Antibody Space (OAS) that offer augmented diversity not yet exploited for antibody engineering. Here we present BioPhi, an open-source platform featuring novel methods for humanization (Sapiens) and humanness evaluation (OASis). Sapiens is a deep learning humanization method trained on the OAS using language modeling. Based on an in silico humanization benchmark of 177 antibodies, Sapiens produced sequences at scale while achieving results comparable to that of human experts. OASis is a granular, interpretable and diverse humanness score based on 9-mer peptide search in the OAS. OASis separated human and non-human sequences with high accuracy, and correlated with clinical immunogenicity. BioPhi thus offers an antibody design interface with automated methods that capture the richness of natural antibody repertoires to produce therapeutics with desired properties and accelerate antibody discovery campaigns. The BioPhi platform is accessible at https://biophi.dichlab.org and https://github.com/Merck/BioPhi.
Collapse
Affiliation(s)
- David Prihoda
- Department of Informatics and Chemistry, University of Chemistry and Technology, Prague, Czech Republic.,R&D Informatics Solutions, MSD Czech Republic S.r.o, Prague, Czech Republic
| | - Jad Maamary
- Predictive and Clinical Immunogenicity, Merck & Co., Inc, Kenilworth, New Jersey, USA
| | - Andrew Waight
- Discovery Biologics, Protein Sciences, MRL, Merck & Co., Inc, South San Francisco, CA, USA
| | - Veronica Juan
- Discovery Biologics, Protein Sciences, MRL, Merck & Co., Inc, South San Francisco, CA, USA
| | | | - Daniel Svozil
- Department of Informatics and Chemistry, University of Chemistry and Technology, Prague, Czech Republic.,CZ-OPENSCREEN: National Infrastructure for Chemical Biology, Institute of Molecular Genetics of the ASCR, Prague, Czech Republic
| | - Danny A Bitton
- R&D Informatics Solutions, MSD Czech Republic S.r.o, Prague, Czech Republic
| |
Collapse
|
11
|
Khetan R, Curtis R, Deane CM, Hadsund JT, Kar U, Krawczyk K, Kuroda D, Robinson SA, Sormanni P, Tsumoto K, Warwicker J, Martin ACR. Current advances in biopharmaceutical informatics: guidelines, impact and challenges in the computational developability assessment of antibody therapeutics. MAbs 2022; 14:2020082. [PMID: 35104168 PMCID: PMC8812776 DOI: 10.1080/19420862.2021.2020082] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Therapeutic monoclonal antibodies and their derivatives are key components of clinical pipelines in the global biopharmaceutical industry. The availability of large datasets of antibody sequences, structures, and biophysical properties is increasingly enabling the development of predictive models and computational tools for the "developability assessment" of antibody drug candidates. Here, we provide an overview of the antibody informatics tools applicable to the prediction of developability issues such as stability, aggregation, immunogenicity, and chemical degradation. We further evaluate the opportunities and challenges of using biopharmaceutical informatics for drug discovery and optimization. Finally, we discuss the potential of developability guidelines based on in silico metrics that can be used for the assessment of antibody stability and manufacturability.
Collapse
Affiliation(s)
- Rahul Khetan
- Manchester Institute of Biotechnology, University of Manchester, Manchester, UK
| | - Robin Curtis
- Manchester Institute of Biotechnology, University of Manchester, Manchester, UK
| | | | | | - Uddipan Kar
- Department of Biological Engineering, Massachusetts Institute of Technology (MIT), Cambridge, MA, USA
| | | | - Daisuke Kuroda
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, Japan.,Medical Device Development and Regulation Research Center, School of Engineering, The University of Tokyo, Tokyo, Japan.,Department of Chemistry and Biotechnology, School of Engineering, The University of Tokyo, Tokyo, Japan
| | | | - Pietro Sormanni
- Chemistry of Health, Yusuf Hamied Department of Chemistry, University of Cambridge
| | - Kouhei Tsumoto
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, Japan.,Medical Device Development and Regulation Research Center, School of Engineering, The University of Tokyo, Tokyo, Japan.,Department of Chemistry and Biotechnology, School of Engineering, The University of Tokyo, Tokyo, Japan.,The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Jim Warwicker
- Manchester Institute of Biotechnology, University of Manchester, Manchester, UK
| | - Andrew C R Martin
- Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London, UK
| |
Collapse
|
12
|
Negron C, Fang J, McPherson MJ, Stine WB, McCluskey AJ. Separating clinical antibodies from repertoire antibodies, a path to in silico developability assessment. MAbs 2022; 14:2080628. [PMID: 35771588 PMCID: PMC9255221 DOI: 10.1080/19420862.2022.2080628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Approaches for antibody discovery have seen substantial improvement and success in recent years. Yet, advancing antibodies into the clinic remains difficult because therapeutic developability concerns are challenging to predict. We developed a computational model to simplify antibody developability assessment and enable accelerated early-stage screening. To this end, we quantified the ability of hundreds of sequence- and structure-based descriptors to differentiate clinical antibodies that have undergone rigorous screening and characterization for drug-like properties from antibodies in the human repertoire that are not natively paired. This analysis identified 144 descriptors capable of distinguishing clinical from repertoire antibodies. Five descriptors were selected and combined based on performance and orthogonality into a single model referred to as the Therapeutic Antibody Developability Analysis (TA-DA). On a hold-out test set, this tool separated clinical antibodies from repertoire antibodies with an AUC = 0.8, demonstrating the ability to identify developability attributes unique to clinical antibodies. Based on our results, the TA-DA score may serve as an approach for selecting lead antibodies for further development. Abbreviations: Affinity-Capture Self-Interaction Nanoparticle Spectroscopy (AC-SINS), Area Under the Curve (AUC), Complementary-Determining Region (CDR), Clinical-Stage Therapeutics (CST), Framework (FR), Monoclonal Antibodies (mAbs), Observed Antibody Space (OAS), Receiver Operating Characteristic (ROC), Size-Exclusion Chromatography (SEC), Structural Aggregation Propensity (SAP), Therapeutic Antibody Developability Analysis (TA-DA), Therapeutic Antibody Profiler (TAP), Therapeutic Structural Antibody Database (Thera-SAbDab), Variable Heavy (VH), Variable Light (VL).
Collapse
Affiliation(s)
| | - Joyce Fang
- AbbVie Bioresearch Center, Worcester, MA, USA
| | | | | | | |
Collapse
|
13
|
Wang Q, Zeng H, Zhu Y, Wang M, Zhang Y, Yang X, Tang H, Li H, Chen Y, Ma C, Lan C, Liu B, Yang W, Yu X, Zhang Z. Dual UMIs and Dual Barcodes With Minimal PCR Amplification Removes Artifacts and Acquires Accurate Antibody Repertoire. Front Immunol 2021; 12:778298. [PMID: 35003093 PMCID: PMC8727365 DOI: 10.3389/fimmu.2021.778298] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 11/25/2021] [Indexed: 12/03/2022] Open
Abstract
Antibody repertoire sequencing (Rep-seq) has been widely used to reveal repertoire dynamics and to interrogate antibodies of interest at single nucleotide-level resolution. However, polymerase chain reaction (PCR) amplification introduces extensive artifacts including chimeras and nucleotide errors, leading to false discovery of antibodies and incorrect assessment of somatic hypermutations (SHMs) which subsequently mislead downstream investigations. Here, a novel approach named DUMPArts, which improves the accuracy of antibody repertoires by labeling each sample with dual barcodes and each molecule with dual unique molecular identifiers (UMIs) via minimal PCR amplification to remove artifacts, is developed. Tested by ultra-deep Rep-seq data, DUMPArts removed inter-sample chimeras, which cause artifactual shared clones and constitute approximately 15% of reads in the library, as well as intra-sample chimeras with erroneous SHMs and constituting approximately 20% of the reads, and corrected base errors and amplification biases by consensus building. The removal of these artifacts will provide an accurate assessment of antibody repertoires and benefit related studies, especially mAb discovery and antibody-guided vaccine design.
Collapse
Affiliation(s)
- Qilong Wang
- Center for Precision Medicine, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
- Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
| | - Huikun Zeng
- Center for Precision Medicine, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
- Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
| | - Yan Zhu
- Center for Precision Medicine, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
- Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
| | - Minhui Wang
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou, China
| | - Yanfang Zhang
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou, China
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
| | - Xiujia Yang
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou, China
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
| | - Haipei Tang
- Center for Precision Medicine, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
- Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
| | - Hongliang Li
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
| | - Yuan Chen
- Center for Precision Medicine, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
- Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
| | - Cuiyu Ma
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
| | - Chunhong Lan
- Center for Precision Medicine, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
- Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou, China
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
| | - Bin Liu
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China
| | - Wei Yang
- Department of Pathology, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
- *Correspondence: Wei Yang, ; Xueqing Yu, ; Zhenhai Zhang, ;
| | - Xueqing Yu
- Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
- Division of Nephrology, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
- *Correspondence: Wei Yang, ; Xueqing Yu, ; Zhenhai Zhang, ;
| | - Zhenhai Zhang
- Center for Precision Medicine, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
- Guangdong-Hong Kong Joint Laboratory on Immunological and Genetic Kidney Diseases, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
- State Key Laboratory of Organ Failure Research, National Clinical Research Center for Kidney Disease, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou, China
- Department of Bioinformatics, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China
- Key Laboratory of Mental Health of the Ministry of Education, Guangdong-Hong Kong-Macao Greater Bay Area Center for Brain Science and Brain-Inspired Intelligence, Southern Medical University, Guangzhou, China
- *Correspondence: Wei Yang, ; Xueqing Yu, ; Zhenhai Zhang, ;
| |
Collapse
|
14
|
Feng J, Fan S, Sun Y, Ren H, Guan H, Wang J. Comprehensive B-Cell Immune Repertoire Analysis of Anti-NMDAR Encephalitis and Anti-LGI1 Encephalitis. Front Immunol 2021; 12:717598. [PMID: 34691026 PMCID: PMC8529218 DOI: 10.3389/fimmu.2021.717598] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 09/09/2021] [Indexed: 11/13/2022] Open
Abstract
Anti-N-methyl-D-aspartate receptor encephalitis (anti-NMDARE) and anti-leucine-rich glioma-inactivated 1 encephalitis (anti-LGI1E) are the two most common types of antibody-mediated autoimmune encephalitis. We performed a comprehensive analysis of the B-cell immune repertoire in patients with anti-NMDARE (n = 7) and anti-LGI1E (n = 10) and healthy controls (n = 4). The results revealed the presence of many common clones between patients with these two types of autoimmune encephalitis, which were mostly class-switched. Additionally, many differences were found among the anti-NMDARE, anti-LGI1E, and healthy control groups, including the diversity of the B-cell immune repertoire and gene usage preference. These findings suggest that the same adaptive immune responses occur in patients with anti-NMDARE and anti-LGI1E, which deserves further exploration.
Collapse
Affiliation(s)
- Jingjing Feng
- CAS Key Laboratory of Mental Health, Institute of Psychology, Chinese Academy of Sciences, Beijing, China.,Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
| | - Siyuan Fan
- Department of Neurology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Yinwei Sun
- CAS Key Laboratory of Mental Health, Institute of Psychology, Chinese Academy of Sciences, Beijing, China.,Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
| | - Haitao Ren
- Department of Neurology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Hongzhi Guan
- Department of Neurology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Jing Wang
- CAS Key Laboratory of Mental Health, Institute of Psychology, Chinese Academy of Sciences, Beijing, China.,Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
15
|
Intrinsic physicochemical profile of marketed antibody-based biotherapeutics. Proc Natl Acad Sci U S A 2021; 118:2020577118. [PMID: 34504010 PMCID: PMC8449350 DOI: 10.1073/pnas.2020577118] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/03/2021] [Indexed: 01/28/2023] Open
Abstract
Successful biologic drug discovery and development involves finding functional as well as developable candidates. Once a candidate has been demonstrated to be functional, the next step is to determine whether it can be translated into a drug product. This requires that the candidate can withstand stresses encountered during manufacturing, shipping, and storage. Additionally, it must be safe, efficacious, and possess good pharmacology. In silico analyses of the variable regions of 77 marketed antibody-based biotherapeutics have revealed five nonredundant physicochemical descriptors. Distributions of these descriptors, observed for marketed biotherapeutics, can help prioritize a drug candidate for experimental testing at early discovery stages, guide engineering efforts to further optimize it, and help increase the productivity of biologic drug discovery and development. Feeding biopharma pipelines with biotherapeutic candidates that possess desirable developability profiles can help improve the productivity of biologic drug discovery and development. Here, we have derived an in silico profile by analyzing computed physicochemical descriptors for the variable regions (Fv) found in 77 marketed antibody-based biotherapeutics. Fv regions of these biotherapeutics demonstrate significant diversities in their germlines, complementarity determining region loop lengths, hydrophobicity, and charge distributions. Furthermore, an analysis of 24 physicochemical descriptors, calculated using homology-based molecular models, has yielded five nonredundant descriptors whose distributions represent stability, isoelectric point, and molecular surface characteristics of their Fv regions. Fv regions of candidates from our internal discovery campaigns, human next-generation sequencing repertoires, and those in clinical-stages (CST) were assessed for similarity with the physicochemical profile derived here. The Fv regions in 33% of CST antibodies show physicochemical properties that are dissimilar to currently marketed biotherapeutics. In comparison, physicochemical characteristics of ∼29% of the Fv regions in human antibodies and ∼27% of our internal hits deviated significantly from those of marketed biotherapeutics. The early availability of this information can help guide hit selection, lead identification, and optimization of biotherapeutic candidates. Insights from this work can also help support portfolio risk assessment, in-licensing, and biopharma collaborations.
Collapse
|
16
|
Schmitz S, Soto C, Crowe JE, Meiler J. Human-likeness of antibody biologics determined by back-translation and comparison with large antibody variable gene repertoires. MAbs 2021; 12:1758291. [PMID: 32397786 PMCID: PMC8648325 DOI: 10.1080/19420862.2020.1758291] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
The antibody (Ab) germline gene rearrangement of variable (V), diversity (D), and joining (J) gene segments, as well as somatic hypermutation, give rise to the human Ab variable gene sequence repertoire. It is common to characterize single nucleotide frequencies of the variable region by alignment to species-specific wildtype germline genes. The increasing application of next-generation sequencing to immune repertoire studies has led to the compilation of increasing large adaptive immunome receptor repertoire datasets. We have developed a method that maps the sequence of a target Ab onto an immunome dataset of 326 million human Ab sequences. For this purpose, we created a position- and gene-specific scoring matrix (PGSSM) and its corresponding antibody similarity score. We characterized our PGSSM score and found that it strongly correlated with the phylogenetic distance of 181,355 Ab sequences from GenBank across 20 species. The most likely human nucleotide back-translation was obtained given only PGSSMs and the amino acid sequence of an Ab achieving a nucleotide sequence recovery of 95.9% and 97.2% for human heavy and light chains, respectively. In conclusion, the scoring of our back-translation is a valuable estimate for the similarity of an Ab sequence to the natural human repertoire. As expected, Ab therapeutic molecules developed from a human source showed a higher similarity to the repertoire than engineered Abs. Thus, the PGSSM metric introduced here can be used to engineer human-like Ab therapeutics.
Collapse
Affiliation(s)
- Samuel Schmitz
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Cinque Soto
- Department of Pediatrics, Vanderbilt University Medical Center, Nashville, TN, USA.,The Vaccine Center, Vanderbilt University Medical Center, Nashville, TN, USA
| | - James E Crowe
- Department of Pediatrics, Vanderbilt University Medical Center, Nashville, TN, USA.,The Vaccine Center, Vanderbilt University Medical Center, Nashville, TN, USA.,Department of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA.,Institute for Drug Development, Leipzig University Medical School, Leipzig, SAC, Germany
| |
Collapse
|
17
|
Soto C, Bombardi RG, Kozhevnikov M, Sinkovits RS, Chen EC, Branchizio A, Kose N, Day SB, Pilkinton M, Gujral M, Mallal S, Crowe JE. High Frequency of Shared Clonotypes in Human T Cell Receptor Repertoires. Cell Rep 2021; 32:107882. [PMID: 32668251 PMCID: PMC7433715 DOI: 10.1016/j.celrep.2020.107882] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 04/18/2020] [Accepted: 06/16/2020] [Indexed: 01/30/2023] Open
Abstract
The collection of T cell receptors (TCRs) generated by somatic recombination is large but unknown. We generate large TCR repertoire datasets as a resource to facilitate detailed studies of the role of TCR clonotypes and repertoires in health and disease. We estimate the size of individual human recombined and expressed TCRs by sequence analysis and determine the extent of sharing between individual repertoires. Our experiments reveal that each blood sample contains between 5 million and 21 million TCR clonotypes. Three individuals share 8% of TCRβ- or 11% of TCRα-chain clonotypes. Sorting by T cell phenotypes in four individuals shows that 5% of naive CD4+ and 3.5% of naive CD8+ subsets share their TCRβ clonotypes, whereas memory CD4+ and CD8+ subsets share 2.3% and 0.4% of their clonotypes, respectively. We identify the sequences of these shared TCR clonotypes that are of interest for studies of human T cell biology. Soto et al. examine the extent to which five healthy adults share their T cell receptor (TCR) repertoire. Using sequencing and bioinformatics, they show a high prevalence of shared clonotypes even considering different T cell phenotypes. Possible functions for some clonotypes are inferred based on homology with TCRs in GenBank.
Collapse
Affiliation(s)
- Cinque Soto
- The Vanderbilt Vaccine Center, Vanderbilt University Medical Center, Nashville, TN 37232, USA; Department of Pediatrics, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Robin G Bombardi
- The Vanderbilt Vaccine Center, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Morgan Kozhevnikov
- The Vanderbilt Vaccine Center, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Robert S Sinkovits
- San Diego Supercomputer Center, University of California, San Diego, San Diego, CA 92093, USA
| | - Elaine C Chen
- Department of Pathology, Microbiology and Immunology, Vanderbilt University Medical Center, Nashville, TN 37212, USA
| | - Andre Branchizio
- The Vanderbilt Vaccine Center, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Nurgun Kose
- The Vanderbilt Vaccine Center, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Samuel B Day
- The Vanderbilt Vaccine Center, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Mark Pilkinton
- Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Madhusudan Gujral
- San Diego Supercomputer Center, University of California, San Diego, San Diego, CA 92093, USA
| | - Simon Mallal
- Department of Pathology, Microbiology and Immunology, Vanderbilt University Medical Center, Nashville, TN 37212, USA; Department of Medicine, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - James E Crowe
- The Vanderbilt Vaccine Center, Vanderbilt University Medical Center, Nashville, TN 37232, USA; Department of Pediatrics, Vanderbilt University Medical Center, Nashville, TN 37232, USA; Department of Pathology, Microbiology and Immunology, Vanderbilt University Medical Center, Nashville, TN 37212, USA.
| |
Collapse
|
18
|
Abstract
The patent literature should reflect the past 30 years of engineering efforts directed toward developing monoclonal antibody therapeutics. Such information is potentially valuable for rational antibody design. Patents, however, are designed not to convey scientific knowledge, but to provide legal protection. It is not obvious whether antibody information from patent documents, such as antibody sequences, is useful in conveying engineering know-how, rather than as a legal reference only. To assess the utility of patent data for therapeutic antibody engineering, we quantified the amount of antibody sequences in patents destined for medicinal purposes and how well they reflect the primary sequences of therapeutic antibodies in clinical use. We identified 16,526 patent families covering major jurisdictions (e.g., US Patent and Trademark Office (USPTO) and World Intellectual Property Organization) that contained antibody sequences. These families held 245,109 unique antibody chains (135,397 heavy chains and 109,712 light chains) that we compiled in our Patented Antibody Database (PAD, http://naturalantibody.com/pad). We find that antibodies make up a non-trivial proportion of all patent amino acid sequence depositions (e.g., 11% of USPTO Full Text database). Our analysis of the 16,526 families demonstrates that the volume of patent documents with antibody sequences is growing, with the majority of documents classified as containing antibodies for medicinal purposes. We further studied the 245,109 antibody chains from patent literature to reveal that they very well reflect the primary sequences of antibody therapeutics in clinical use. This suggests that the patent literature could serve as a reference for previous engineering efforts to improve rational antibody design.
Collapse
Affiliation(s)
- Konrad Krawczyk
- Research and Development, Natural Antibody, Hamburg, Germany
| | - Andrew Buchanan
- Department of Bio and Health Informatics, R&D, AstraZeneca, Cambridge, UK
| | | |
Collapse
|
19
|
Norman RA, Ambrosetti F, Bonvin AMJJ, Colwell LJ, Kelm S, Kumar S, Krawczyk K. Computational approaches to therapeutic antibody design: established methods and emerging trends. Brief Bioinform 2020; 21:1549-1567. [PMID: 31626279 PMCID: PMC7947987 DOI: 10.1093/bib/bbz095] [Citation(s) in RCA: 106] [Impact Index Per Article: 26.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2019] [Revised: 06/07/2019] [Accepted: 07/05/2019] [Indexed: 12/31/2022] Open
Abstract
Antibodies are proteins that recognize the molecular surfaces of potentially noxious molecules to mount an adaptive immune response or, in the case of autoimmune diseases, molecules that are part of healthy cells and tissues. Due to their binding versatility, antibodies are currently the largest class of biotherapeutics, with five monoclonal antibodies ranked in the top 10 blockbuster drugs. Computational advances in protein modelling and design can have a tangible impact on antibody-based therapeutic development. Antibody-specific computational protocols currently benefit from an increasing volume of data provided by next generation sequencing and application to related drug modalities based on traditional antibodies, such as nanobodies. Here we present a structured overview of available databases, methods and emerging trends in computational antibody analysis and contextualize them towards the engineering of candidate antibody therapeutics.
Collapse
|
20
|
Prabakaran P, Glanville J, Ippolito GC. Editorial: Next-Generation Sequencing of Human Antibody Repertoires for Exploring B-cell Landscape, Antibody Discovery and Vaccine Development. Front Immunol 2020; 11:1344. [PMID: 32714328 PMCID: PMC7344256 DOI: 10.3389/fimmu.2020.01344] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 05/27/2020] [Indexed: 12/27/2022] Open
Affiliation(s)
| | | | - Gregory C Ippolito
- Department of Molecular Biosciences, University of Texas at Austin, Austin, TX, United States
| |
Collapse
|
21
|
Marks C, Deane CM. How repertoire data are changing antibody science. J Biol Chem 2020; 295:9823-9837. [PMID: 32409582 DOI: 10.1074/jbc.rev120.010181] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 04/28/2020] [Indexed: 12/13/2022] Open
Abstract
Antibodies are vital proteins of the immune system that recognize potentially harmful molecules and initiate their removal. Mammals can efficiently create vast numbers of antibodies with different sequences capable of binding to any antigen with high affinity and specificity. Because they can be developed to bind to many disease agents, antibodies can be used as therapeutics. In an organism, after antigen exposure, antibodies specific to that antigen are enriched through clonal selection, expansion, and somatic hypermutation. The antibodies present in an organism therefore report on its immune status, describe its innate ability to deal with harmful substances, and reveal how it has previously responded. Next-generation sequencing technologies are being increasingly used to query the antibody, or B-cell receptor (BCR), sequence repertoire, and the amount of BCR data in public repositories is growing. The Observed Antibody Space database, for example, currently contains over a billion sequences from 68 different studies. Repertoires are available that represent both the naive state (i.e. antigen-inexperienced) and that after immunization. This wealth of data has created opportunities to learn more about our immune system. In this review, we discuss the many ways in which BCR repertoire data have been or could be exploited. We highlight its utility for providing insights into how the naive immune repertoire is generated and how it responds to antigens. We also consider how structural information can be used to enhance these data and may lead to more accurate depictions of the sequence space and to applications in the discovery of new therapeutics.
Collapse
Affiliation(s)
- Claire Marks
- Department of Statistics, University of Oxford, Oxford, United Kingdom
| | - Charlotte M Deane
- Department of Statistics, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
22
|
Adaptive immune receptor repertoires, an overview of this exciting field. Immunol Lett 2020; 221:49-55. [PMID: 32113899 DOI: 10.1016/j.imlet.2020.02.013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Revised: 02/19/2020] [Accepted: 02/26/2020] [Indexed: 12/30/2022]
Abstract
The adaptive immune response in jawed vertebrates relies on the huge diversity and specificity of the B cell and T cell antigen receptors, the immunoglobulins (IG) or antibodies and the T cell receptors (TR), respectively. The high level of diversity has represented a barrier to a comprehensive analysis of the adaptive immune response before the emergence of high-throughput sequencing (HTS) technologies. The size and complexity of HTS data requires the generation of novel computational and analytical approaches, which are transforming how the adaptive immune responses are deciphered to understand the clonal dynamics and properties of antigen-specific B and T cells in response to different kind of antigens. This exciting and rapidly evolving field is not only impacting human and clinical immunology but also comparative immunology. We are now closer to understanding the evolution of adaptive immune response in jawed vertebrates. This review provides an overview about classical and current strategies developed to assess the IG/TR diversity and their applications in basic and clinical immunology.
Collapse
|