1
|
Spisak N, Athènes G, Dupic T, Mora T, Walczak AM. Combining mutation and recombination statistics to infer clonal families in antibody repertoires. eLife 2024; 13:e86181. [PMID: 39120133 PMCID: PMC11441979 DOI: 10.7554/elife.86181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Accepted: 07/22/2024] [Indexed: 08/10/2024] Open
Abstract
B-cell repertoires are characterized by a diverse set of receptors of distinct specificities generated through two processes of somatic diversification: V(D)J recombination and somatic hypermutations. B-cell clonal families stem from the same V(D)J recombination event, but differ in their hypermutations. Clonal families identification is key to understanding B-cell repertoire function, evolution, and dynamics. We present HILARy (high-precision inference of lineages in antibody repertoires), an efficient, fast, and precise method to identify clonal families from single- or paired-chain repertoire sequencing datasets. HILARy combines probabilistic models that capture the receptor generation and selection statistics with adapted clustering methods to achieve consistently high inference accuracy. It automatically leverages the phylogenetic signal of shared mutations in difficult repertoire subsets. Exploiting the high sensitivity of the method, we find the statistics of evolutionary properties such as the site frequency spectrum and dN/dS ratio do not depend on the junction length. We also identify a broad range of selection pressures spanning two orders of magnitude.
Collapse
Affiliation(s)
- Natanael Spisak
- Laboratoire de physique de l’École normale supérieure, CNRS, PSL University, Sorbonne Université and Université de ParisParisFrance
| | - Gabriel Athènes
- Laboratoire de physique de l’École normale supérieure, CNRS, PSL University, Sorbonne Université and Université de ParisParisFrance
- Saber Bio SAS, Institut du Cerveau, iPEPS The Healthtech HubParisFrance
| | - Thomas Dupic
- Department of Organismic and Evolutionary Biology, Harvard UniversityCambridgeUnited States
| | - Thierry Mora
- Laboratoire de physique de l’École normale supérieure, CNRS, PSL University, Sorbonne Université and Université de ParisParisFrance
| | - Aleksandra M Walczak
- Laboratoire de physique de l’École normale supérieure, CNRS, PSL University, Sorbonne Université and Université de ParisParisFrance
| |
Collapse
|
2
|
Ferretti F, Kardar M. Universal characterization of epitope immunodominance from a multiscale model of clonal competition in germinal centers. Phys Rev E 2024; 109:064409. [PMID: 39020898 DOI: 10.1103/physreve.109.064409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 04/02/2024] [Indexed: 07/20/2024]
Abstract
We introduce a multiscale model for affinity maturation, which aims to capture the intraclonal, interclonal, and epitope-specific organization of the B-cell population in a germinal center. We describe the evolution of the B-cell population via a quasispecies dynamics, with species corresponding to unique B-cell receptors (BCRs), where the desired multiscale structure is reflected on the mutational connectivity of the accessible BCR space, and on the statistical properties of its fitness landscape. Within this mathematical framework, we study the competition among classes of BCRs targeting different antigen epitopes, and we construct an effective immunogenic space where epitope immunodominance relations can be universally characterized. We finally study how varying the relative composition of a mixture of antigens with variable and conserved domains allows for a parametric exploration of this space, and we identify general principles for the rational design of two-antigen cocktails.
Collapse
Affiliation(s)
- Federica Ferretti
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Mehran Kardar
- Department of Physics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| |
Collapse
|
3
|
Tambe A, MacCarthy T, Pavri R. Interpretable deep learning reveals the role of an E-box motif in suppressing somatic hypermutation of AGCT motifs within human immunoglobulin variable regions. Front Immunol 2024; 15:1407470. [PMID: 38863710 PMCID: PMC11165027 DOI: 10.3389/fimmu.2024.1407470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2024] [Accepted: 05/08/2024] [Indexed: 06/13/2024] Open
Abstract
Introduction Somatic hypermutation (SHM) of immunoglobulin variable (V) regions by activation induced deaminase (AID) is essential for robust, long-term humoral immunity against pathogen and vaccine antigens. AID mutates cytosines preferentially within WRCH motifs (where W=A or T, R=A or G and H=A, C or T). However, it has been consistently observed that the mutability of WRCH motifs varies substantially, with large variations in mutation frequency even between multiple occurrences of the same motif within a single V region. This has led to the notion that the immediate sequence context of WRCH motifs contributes to mutability. Recent studies have highlighted the potential role of local DNA sequence features in promoting mutagenesis of AGCT, a commonly mutated WRCH motif. Intriguingly, AGCT motifs closer to 5' ends of V regions, within the framework 1 (FW1) sub-region1, mutate less frequently, suggesting an SHM-suppressing sequence context. Methods Here, we systematically examined the basis of AGCT positional biases in human SHM datasets with DeepSHM, a machine-learning model designed to predict SHM patterns. This was combined with integrated gradients, an interpretability method, to interrogate the basis of DeepSHM predictions. Results DeepSHM predicted the observed positional differences in mutation frequencies at AGCT motifs with high accuracy. For the conserved, lowly mutating AGCT motifs in FW1, integrated gradients predicted a large negative contribution of 5'C and 3'G flanking residues, suggesting that a CAGCTG context in this location was suppressive for SHM. CAGCTG is the recognition motif for E-box transcription factors, including E2A, which has been implicated in SHM. Indeed, we found a strong, inverse relationship between E-box motif fidelity and mutation frequency. Moreover, E2A was found to associate with the V region locale in two human B cell lines. Finally, analysis of human SHM datasets revealed that naturally occurring mutations in the 3'G flanking residues, which effectively ablate the E-box motif, were associated with a significantly increased rate of AGCT mutation. Discussion Our results suggest an antagonistic relationship between mutation frequency and the binding of E-box factors like E2A at specific AGCT motif contexts and, therefore, highlight a new, suppressive mechanism regulating local SHM patterns in human V regions.
Collapse
Affiliation(s)
- Abhik Tambe
- Department of Biochemistry and Cell Biology, Stony Brook University, Stony Brook, NY, United States
| | - Thomas MacCarthy
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY, United States
| | - Rushad Pavri
- Research Institute of Molecular Pathology (IMP), Vienna, Austria
- Peter Gorer Department of Immunobiology, School of Immunology & Microbial Sciences, King’s College London, London, United Kingdom
| |
Collapse
|
4
|
Xu Z, Peng Q, Liu W, Demongeot J, Wei D. Antibody Dynamics Simulation-A Mathematical Exploration of Clonal Deletion and Somatic Hypermutation. Biomedicines 2023; 11:2048. [PMID: 37509687 PMCID: PMC10377040 DOI: 10.3390/biomedicines11072048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 07/14/2023] [Accepted: 07/19/2023] [Indexed: 07/30/2023] Open
Abstract
We have employed mathematical modeling techniques to construct a comprehensive framework for elucidating the intricate response mechanisms of the immune system, facilitating a deeper understanding of B-cell clonal deletion and somatic hypermutation. Our improved model introduces innovative mechanisms that shed light on positive and negative selection processes during T-cell and B-cell development. Notably, clonal deletion is attributed to the attenuated immune stimulation exerted by self-antigens with high binding affinities, rendering them less effective in eliciting subsequent B-cell maturation and differentiation. Secondly, our refined model places particular emphasis on the crucial role played by somatic hypermutation in modulating the immune system's functionality. Through extensive investigation, we have determined that somatic hypermutation not only expedites the production of highly specific antibodies pivotal in combating microbial infections but also serves as a regulatory mechanism to dampen autoimmunity and enhance self-tolerance within the organism. Lastly, our model advances the understanding of the implications of antibody in vivo evolution in the overall process of organismal aging. With the progression of time, the age-associated amplification of autoimmune activity becomes apparent. While somatic hypermutation effectively delays this process, mitigating the levels of autoimmune response, it falls short of reversing this trajectory entirely. In conclusion, our advanced mathematical model offers a comprehensive and scholarly approach to comprehend the intricacies of the immune system. By encompassing novel mechanisms for selection, emphasizing the functional role of somatic hypermutation, and illuminating the consequences of in vivo antibody evolution, our model expands the current understanding of immune responses and their implications in aging.
Collapse
Affiliation(s)
- Zhaobin Xu
- Department of Life Science, Dezhou University, Dezhou 253023, China
| | - Qingzhi Peng
- Department of Life Science, Dezhou University, Dezhou 253023, China
| | - Weidong Liu
- Department of Physical Education, Dezhou University, Dezhou 253023, China
| | - Jacques Demongeot
- Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical, Faculty of Medicine, University Grenoble Alpes (UGA), 38700 La Tronche, France
| | - Dongqing Wei
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200030, China
| |
Collapse
|
5
|
Russell ML, Simon N, Bradley P, Matsen FA. Statistical inference reveals the role of length, GC content, and local sequence in V(D)J nucleotide trimming. eLife 2023; 12:e85145. [PMID: 37227256 PMCID: PMC10212571 DOI: 10.7554/elife.85145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Accepted: 04/11/2023] [Indexed: 05/26/2023] Open
Abstract
To appropriately defend against a wide array of pathogens, humans somatically generate highly diverse repertoires of B cell and T cell receptors (BCRs and TCRs) through a random process called V(D)J recombination. Receptor diversity is achieved during this process through both the combinatorial assembly of V(D)J-genes and the junctional deletion and insertion of nucleotides. While the Artemis protein is often regarded as the main nuclease involved in V(D)J recombination, the exact mechanism of nucleotide trimming is not understood. Using a previously published TCRβ repertoire sequencing data set, we have designed a flexible probabilistic model of nucleotide trimming that allows us to explore various mechanistically interpretable sequence-level features. We show that local sequence context, length, and GC nucleotide content in both directions of the wider sequence, together, can most accurately predict the trimming probabilities of a given V-gene sequence. Because GC nucleotide content is predictive of sequence-breathing, this model provides quantitative statistical evidence regarding the extent to which double-stranded DNA may need to be able to breathe for trimming to occur. We also see evidence of a sequence motif that appears to get preferentially trimmed, independent of GC-content-related effects. Further, we find that the inferred coefficients from this model provide accurate prediction for V- and J-gene sequences from other adaptive immune receptor loci. These results refine our understanding of how the Artemis nuclease may function to trim nucleotides during V(D)J recombination and provide another step toward understanding how V(D)J recombination generates diverse receptors and supports a powerful, unique immune response in healthy humans.
Collapse
Affiliation(s)
- Magdalena L Russell
- Computational Biology Program, Fred Hutchinson Cancer CenterSeattleUnited States
- Molecular and Cellular Biology Program, University of WashingtonSeattleUnited States
| | - Noah Simon
- Department of Biostatistics, University of WashingtonSeattleUnited States
| | - Philip Bradley
- Computational Biology Program, Fred Hutchinson Cancer CenterSeattleUnited States
- Institute for Protein Design, Department of Biochemistry, University of WashingtonSeattleUnited States
| | - Frederick A Matsen
- Computational Biology Program, Fred Hutchinson Cancer CenterSeattleUnited States
- Department of Genome Sciences, University of WashingtonSeattleUnited States
- Department of Statistics, University of WashingtonSeattleUnited States
- Howard Hughes Medical InstituteSeattleUnited States
| |
Collapse
|
6
|
Moulana A, Dupic T, Phillips AM, Desai MM. Genotype-phenotype landscapes for immune-pathogen coevolution. Trends Immunol 2023; 44:384-396. [PMID: 37024340 PMCID: PMC10147585 DOI: 10.1016/j.it.2023.03.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 03/08/2023] [Accepted: 03/09/2023] [Indexed: 04/07/2023]
Abstract
Our immune systems constantly coevolve with the pathogens that challenge them, as pathogens adapt to evade our defense responses, with our immune repertoires shifting in turn. These coevolutionary dynamics take place across a vast and high-dimensional landscape of potential pathogen and immune receptor sequence variants. Mapping the relationship between these genotypes and the phenotypes that determine immune-pathogen interactions is crucial for understanding, predicting, and controlling disease. Here, we review recent developments applying high-throughput methods to create large libraries of immune receptor and pathogen protein sequence variants and measure relevant phenotypes. We describe several approaches that probe different regions of the high-dimensional sequence space and comment on how combinations of these methods may offer novel insight into immune-pathogen coevolution.
Collapse
Affiliation(s)
- Alief Moulana
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - Thomas Dupic
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - Angela M Phillips
- Department of Microbiology and Immunology, University of California at San Francisco, San Francisco, CA 94143, USA
| | - Michael M Desai
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA; Department of Physics, Harvard University, Cambridge, MA 02138, USA; NSF-Simons Center for Mathematical and Statistical Analysis of Biology, Harvard University, Cambridge, MA 02138, USA; Quantitative Biology Initiative, Harvard University, Cambridge, MA 02138, USA.
| |
Collapse
|
7
|
Safra M, Tamari Z, Polak P, Shiber S, Matan M, Karameh H, Helviz Y, Levy-Barda A, Yahalom V, Peretz A, Ben-Chetrit E, Brenner B, Tuller T, Gal-Tanamy M, Yaari G. Altered somatic hypermutation patterns in COVID-19 patients classifies disease severity. Front Immunol 2023; 14:1031914. [PMID: 37153628 PMCID: PMC10154551 DOI: 10.3389/fimmu.2023.1031914] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Accepted: 03/22/2023] [Indexed: 05/10/2023] Open
Abstract
Introduction The success of the human body in fighting SARS-CoV2 infection relies on lymphocytes and their antigen receptors. Identifying and characterizing clinically relevant receptors is of utmost importance. Methods We report here the application of a machine learning approach, utilizing B cell receptor repertoire sequencing data from severely and mildly infected individuals with SARS-CoV2 compared with uninfected controls. Results In contrast to previous studies, our approach successfully stratifies non-infected from infected individuals, as well as disease level of severity. The features that drive this classification are based on somatic hypermutation patterns, and point to alterations in the somatic hypermutation process in COVID-19 patients. Discussion These features may be used to build and adapt therapeutic strategies to COVID-19, in particular to quantitatively assess potential diagnostic and therapeutic antibodies. These results constitute a proof of concept for future epidemiological challenges.
Collapse
Affiliation(s)
- Modi Safra
- Bio-engineering, Faculty of Engineering, Bar Ilan University, Ramat Gan, Israel
- Bar Ilan Institute of Nanotechnologies and Advanced Materials, Bar Ilan University, Ramat Gan, Israel
| | - Zvi Tamari
- Bio-engineering, Faculty of Engineering, Bar Ilan University, Ramat Gan, Israel
- Bar Ilan Institute of Nanotechnologies and Advanced Materials, Bar Ilan University, Ramat Gan, Israel
| | - Pazit Polak
- Bio-engineering, Faculty of Engineering, Bar Ilan University, Ramat Gan, Israel
- Bar Ilan Institute of Nanotechnologies and Advanced Materials, Bar Ilan University, Ramat Gan, Israel
| | - Shachaf Shiber
- Emergency Department, Rabin Medical Center-Belinson Campus, Petah Tikva, Israel
- Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Moshe Matan
- Clinical Microbiology Laboratory, Baruch Padeh Medical Center, Poriya, Israel
| | - Hani Karameh
- Jesselson Integrated Heart Center, Shaare Zedek Medical Center, Hebrew University School of Medicine, Jerusalem, Israel
| | - Yigal Helviz
- Intensive Care Unit, Shaare Zedek Medical Center, Hebrew University School of Medicine, Jerusalem, Israel
| | - Adva Levy-Barda
- Biobank, Department of Pathology, Rabin Medical Center-Belinson Campus, Petah Tikva, Israel
| | - Vered Yahalom
- Blood Services and Apheresis Institute, Rabin Medical Center, Petah Tikva, Israel
| | - Avi Peretz
- Clinical Microbiology Laboratory, Baruch Padeh Medical Center, Poriya, Israel
- The Azrieli Faculty of Medicine, Bar-Ilan University, Safed, Israel
| | - Eli Ben-Chetrit
- Infectious Diseases Unit, Shaare Zedek Medical Center, Hebrew University School of Medicine, Jerusalem, Israel
| | - Baruch Brenner
- Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
- Institute of Oncology, Rabin Medical Center-Belinson Campus, Petah Tikva, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering and The Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, Israel
| | | | - Gur Yaari
- Bio-engineering, Faculty of Engineering, Bar Ilan University, Ramat Gan, Israel
- Bar Ilan Institute of Nanotechnologies and Advanced Materials, Bar Ilan University, Ramat Gan, Israel
| |
Collapse
|
8
|
Pennell M, Rodriguez OL, Watson CT, Greiff V. The evolutionary and functional significance of germline immunoglobulin gene variation. Trends Immunol 2023; 44:7-21. [PMID: 36470826 DOI: 10.1016/j.it.2022.11.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Accepted: 11/07/2022] [Indexed: 12/04/2022]
Abstract
The recombination between immunoglobulin (IG) gene segments determines an individual's naïve antibody repertoire and, consequently, (auto)antigen recognition. Emerging evidence suggests that mammalian IG germline variation impacts humoral immune responses associated with vaccination, infection, and autoimmunity - from the molecular level of epitope specificity, up to profound changes in the architecture of antibody repertoires. These links between IG germline variants and immunophenotype raise the question on the evolutionary causes and consequences of diversity within IG loci. We discuss why the extreme diversity in IG loci remains a mystery, why resolving this is important for the design of more effective vaccines and therapeutics, and how recent evidence from multiple lines of inquiry may help us do so.
Collapse
Affiliation(s)
- Matt Pennell
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Department of Biological Sciences, University of Southern California, Los Angeles, CA, USA.
| | - Oscar L Rodriguez
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, USA
| | - Corey T Watson
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, USA
| | - Victor Greiff
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo, Norway.
| |
Collapse
|
9
|
Lupo C, Spisak N, Walczak AM, Mora T. Learning the statistics and landscape of somatic mutation-induced insertions and deletions in antibodies. PLoS Comput Biol 2022; 18:e1010167. [PMID: 35653375 PMCID: PMC9197026 DOI: 10.1371/journal.pcbi.1010167] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 06/14/2022] [Accepted: 05/05/2022] [Indexed: 11/25/2022] Open
Abstract
Affinity maturation is crucial for improving the binding affinity of antibodies to antigens. This process is mainly driven by point substitutions caused by somatic hypermutations of the immunoglobulin gene. It also includes deletions and insertions of genomic material known as indels. While the landscape of point substitutions has been extensively studied, a detailed statistical description of indels is still lacking. Here we present a probabilistic inference tool to learn the statistics of indels from repertoire sequencing data, which overcomes the pitfalls and biases of standard annotation methods. The model includes antibody-specific maturation ages to account for variable mutational loads in the repertoire. After validation on synthetic data, we applied our tool to a large dataset of human immunoglobulin heavy chains. The inferred model allows us to identify universal statistical features of indels in heavy chains. We report distinct insertion and deletion hotspots, and show that the distribution of lengths of indels follows a geometric distribution, which puts constraints on future mechanistic models of the hypermutation process. Affinity maturation of B cell receptors is an important mechanism by which our body designs neutralizing antibodies to defend us against pathogens, including broadly neutralizing antibodies, which target a wide range of variants of the same pathogen. Such antibodies often contain key insertions and deletions to the germline gene, or “indels”, which are caused by somatic hypermutations. However, the mechanism, frequency and role of these indels are still elusive. We designed a computational method based on a probabilistic framework to infer the characteristics of this mutational process from high-throughput antibody sequencing experiments. Applied to human data, our approach provides a comprehensive quantitative description of insertions and deletions, opening avenues for better understanding the process of affinity maturation and the design of vaccines for eliciting a broad antibody response.
Collapse
Affiliation(s)
- Cosimo Lupo
- Laboratoire de physique de l’École normale supérieure, CNRS, PSL University, Sorbonne Université, and Université de Paris, Paris, France
| | - Natanael Spisak
- Laboratoire de physique de l’École normale supérieure, CNRS, PSL University, Sorbonne Université, and Université de Paris, Paris, France
| | - Aleksandra M. Walczak
- Laboratoire de physique de l’École normale supérieure, CNRS, PSL University, Sorbonne Université, and Université de Paris, Paris, France
- * E-mail: (AMW); (TM)
| | - Thierry Mora
- Laboratoire de physique de l’École normale supérieure, CNRS, PSL University, Sorbonne Université, and Université de Paris, Paris, France
- * E-mail: (AMW); (TM)
| |
Collapse
|
10
|
Affinity maturation for an optimal balance between long-term immune coverage and short-term resource constraints. Proc Natl Acad Sci U S A 2022; 119:2113512119. [PMID: 35177475 PMCID: PMC8872716 DOI: 10.1073/pnas.2113512119] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/07/2022] [Indexed: 12/15/2022] Open
Abstract
Humoral immunity relies on the mutation and selection of B cells to better recognize pathogens. This affinity maturation process produces cells with diverse recognition capabilities. Examining optimal immune strategies that maximize the long-term immune coverage at a minimal metabolic cost, we show when the immune system should mount a de novo response rather than rely on existing memory cells. Our theory recapitulates known modes of the B cell response, predicts the empirical form of the distribution of clone sizes, and rationalizes as a trade-off between metabolic and immune costs the antigenic imprinting effects that limit the efficacy of vaccines (original antigenic sin). Our predictions provide a framework to interpret experimental results that could be used to inform vaccination strategies. In order to target threatening pathogens, the adaptive immune system performs a continuous reorganization of its lymphocyte repertoire. Following an immune challenge, the B cell repertoire can evolve cells of increased specificity for the encountered strain. This process of affinity maturation generates a memory pool whose diversity and size remain difficult to predict. We assume that the immune system follows a strategy that maximizes the long-term immune coverage and minimizes the short-term metabolic costs associated with affinity maturation. This strategy is defined as an optimal decision process on a finite dimensional phenotypic space, where a preexisting population of cells is sequentially challenged with a neutrally evolving strain. We show that the low specificity and high diversity of memory B cells—a key experimental result—can be explained as a strategy to protect against pathogens that evolve fast enough to escape highly potent but narrow memory. This plasticity of the repertoire drives the emergence of distinct regimes for the size and diversity of the memory pool, depending on the density of de novo responding cells and on the mutation rate of the strain. The model predicts power-law distributions of clonotype sizes observed in data and rationalizes antigenic imprinting as a strategy to minimize metabolic costs while keeping good immune protection against future strains.
Collapse
|
11
|
Tang C, Krantsevich A, MacCarthy T. Deep learning model of somatic hypermutation reveals importance of sequence context beyond hotspot targeting. iScience 2022; 25:103668. [PMID: 35036866 PMCID: PMC8749460 DOI: 10.1016/j.isci.2021.103668] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Revised: 11/08/2021] [Accepted: 12/16/2021] [Indexed: 11/23/2022] Open
Abstract
B cells undergo somatic hypermutation (SHM) of the Immunoglobulin (Ig) variable region to generate high-affinity antibodies. SHM relies on the activity of activation-induced deaminase (AID), which mutates C>U preferentially targeting WRC (W=A/T, R=A/G) hotspots. Downstream mutations at WA Polymerase η hotspots contribute further mutations. Computational models of SHM can describe the probability of mutations essential for vaccine responses. Previous studies using short subsequences (k-mers) failed to explain divergent mutability for the same k-mer. We developed the DeepSHM (Deep learning on SHM) model using k-mers of size 5-21, improving accuracy over previous models. Interpretation of DeepSHM identified an extended WWRCT motif with particularly high mutability. Increased mutability was further associated with lower surrounding G content. Our model also discovered a conserved AGYCTGGGGG (Y=C/T) motif within FW1 of IGHV3 family genes with unusually high T>G substitution rates. Thus, a wider sequence context increases predictive power and identifies features that drive mutational targeting.
Collapse
Affiliation(s)
- Catherine Tang
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794, USA
| | - Artem Krantsevich
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794, USA
| | - Thomas MacCarthy
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794, USA
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794, USA
| |
Collapse
|
12
|
Omer A, Peres A, Rodriguez OL, Watson CT, Lees W, Polak P, Collins AM, Yaari G. T cell receptor beta germline variability is revealed by inference from repertoire data. Genome Med 2022; 14:2. [PMID: 34991709 PMCID: PMC8740489 DOI: 10.1186/s13073-021-01008-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Accepted: 12/08/2021] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND T and B cell receptor (TCR, BCR) repertoires constitute the foundation of adaptive immunity. Adaptive immune receptor repertoire sequencing (AIRR-seq) is a common approach to study immune system dynamics. Understanding the genetic factors influencing the composition and dynamics of these repertoires is of major scientific and clinical importance. The chromosomal loci encoding for the variable regions of TCRs and BCRs are challenging to decipher due to repetitive elements and undocumented structural variants. METHODS To confront this challenge, AIRR-seq-based methods have recently been developed for B cells, enabling genotype and haplotype inference and discovery of undocumented alleles. However, this approach relies on complete coverage of the receptors' variable regions, whereas most T cell studies sequence a small fraction of that region. Here, we adapted a B cell pipeline for undocumented alleles, genotype, and haplotype inference for full and partial AIRR-seq TCR data sets. The pipeline also deals with gene assignment ambiguities, which is especially important in the analysis of data sets of partial sequences. RESULTS From the full and partial AIRR-seq TCR data sets, we identified 39 undocumented polymorphisms in T cell receptor Beta V (TRBV) and 31 undocumented 5 ' UTR sequences. A subset of these inferences was also observed using independent genomic approaches. We found that a single nucleotide polymorphism differentiating between the two documented T cell receptor Beta D2 (TRBD2) alleles is strongly associated with dramatic changes in the expressed repertoire. CONCLUSIONS We reveal a rich picture of germline variability and demonstrate how a single nucleotide polymorphism dramatically affects the composition of the whole repertoire. Our findings provide a basis for annotation of TCR repertoires for future basic and clinical studies.
Collapse
Affiliation(s)
- Aviv Omer
- Faculty of Engineering, Bar Ilan University, Ramat Gan, 5290002, Israel
- Bar Ilan institute of Nanotechnology and Advanced Materials, Bar Ilan University, Ramat Gan, 5290002, Israel
| | - Ayelet Peres
- Faculty of Engineering, Bar Ilan University, Ramat Gan, 5290002, Israel
- Bar Ilan institute of Nanotechnology and Advanced Materials, Bar Ilan University, Ramat Gan, 5290002, Israel
| | - Oscar L Rodriguez
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, USA
| | - Corey T Watson
- Department of Biochemistry and Molecular Genetics, University of Louisville School of Medicine, Louisville, KY, USA
| | - William Lees
- Institute of Structural and Molecular Biology, Birkbeck College, University of London, London, UK
| | - Pazit Polak
- Faculty of Engineering, Bar Ilan University, Ramat Gan, 5290002, Israel
- Bar Ilan institute of Nanotechnology and Advanced Materials, Bar Ilan University, Ramat Gan, 5290002, Israel
| | - Andrew M Collins
- School of Biotechnology and Biomedical Sciences, University of New South Wales, Sydney, Australia
| | - Gur Yaari
- Faculty of Engineering, Bar Ilan University, Ramat Gan, 5290002, Israel.
- Bar Ilan institute of Nanotechnology and Advanced Materials, Bar Ilan University, Ramat Gan, 5290002, Israel.
| |
Collapse
|
13
|
Phillips AM, Lawrence KR, Moulana A, Dupic T, Chang J, Johnson MS, Cvijovic I, Mora T, Walczak AM, Desai MM. Binding affinity landscapes constrain the evolution of broadly neutralizing anti-influenza antibodies. eLife 2021; 10:71393. [PMID: 34491198 PMCID: PMC8476123 DOI: 10.7554/elife.71393] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Accepted: 09/05/2021] [Indexed: 12/12/2022] Open
Abstract
Over the past two decades, several broadly neutralizing antibodies (bnAbs) that confer protection against diverse influenza strains have been isolated. Structural and biochemical characterization of these bnAbs has provided molecular insight into how they bind distinct antigens. However, our understanding of the evolutionary pathways leading to bnAbs, and thus how best to elicit them, remains limited. Here, we measure equilibrium dissociation constants of combinatorially complete mutational libraries for two naturally isolated influenza bnAbs (CR9114, 16 heavy-chain mutations; CR6261, 11 heavy-chain mutations), reconstructing all possible evolutionary intermediates back to the unmutated germline sequences. We find that these two libraries exhibit strikingly different patterns of breadth: while many variants of CR6261 display moderate affinity to diverse antigens, those of CR9114 display appreciable affinity only in specific, nested combinations. By examining the extensive pairwise and higher order epistasis between mutations, we find key sites with strong synergistic interactions that are highly similar across antigens for CR6261 and different for CR9114. Together, these features of the binding affinity landscapes strongly favor sequential acquisition of affinity to diverse antigens for CR9114, while the acquisition of breadth to more similar antigens for CR6261 is less constrained. These results, if generalizable to other bnAbs, may explain the molecular basis for the widespread observation that sequential exposure favors greater breadth, and such mechanistic insight will be essential for predicting and eliciting broadly protective immune responses.
Collapse
Affiliation(s)
- Angela M Phillips
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, United States
| | - Katherine R Lawrence
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, United States.,NSF-Simons Center for Mathematical and Statistical Analysis of Biology, Harvard University, Cambridge, United States.,Quantitative Biology Initiative, Harvard University, Cambridge, United States.,Department of Physics, Massachusetts Institute of Technology, Cambridge, United States
| | - Alief Moulana
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, United States
| | - Thomas Dupic
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, United States
| | - Jeffrey Chang
- Department of Physics, Harvard University, Cambridge, United States
| | - Milo S Johnson
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, United States
| | - Ivana Cvijovic
- Department of Applied Physics, Stanford University, Stanford, United States
| | - Thierry Mora
- Laboratoire de physique de ÍÉcole Normale Supérieure, CNRS, PSL University, Sorbonne Université, and Université de Paris, Paris, France
| | - Aleksandra M Walczak
- Laboratoire de physique de ÍÉcole Normale Supérieure, CNRS, PSL University, Sorbonne Université, and Université de Paris, Paris, France
| | - Michael M Desai
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, United States.,NSF-Simons Center for Mathematical and Statistical Analysis of Biology, Harvard University, Cambridge, United States.,Quantitative Biology Initiative, Harvard University, Cambridge, United States.,Department of Physics, Harvard University, Cambridge, United States
| |
Collapse
|
14
|
Yu G, Wu Y, Duan Z, Tang C, Xing H, Scharff MD, MacCarthy T. A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes. PLoS Comput Biol 2021; 17:e1009323. [PMID: 34491985 PMCID: PMC8462741 DOI: 10.1371/journal.pcbi.1009323] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Revised: 09/24/2021] [Accepted: 08/04/2021] [Indexed: 11/19/2022] Open
Abstract
The B cells in our body generate protective antibodies by introducing somatic hypermutations (SHM) into the variable region of immunoglobulin genes (IgVs). The mutations are generated by activation induced deaminase (AID) that converts cytosine to uracil in single stranded DNA (ssDNA) generated during transcription. Attempts have been made to correlate SHM with ssDNA using bisulfite to chemically convert cytosines that are accessible in the intact chromatin of mutating B cells. These studies have been complicated by using different definitions of "bisulfite accessible regions" (BARs). Recently, deep-sequencing has provided much larger datasets of such regions but computational methods are needed to enable this analysis. Here we leveraged the deep-sequencing approach with unique molecular identifiers and developed a novel Hidden Markov Model based Bayesian Segmentation algorithm to characterize the ssDNA regions in the IGHV4-34 gene of the human Ramos B cell line. Combining hierarchical clustering and our new Bayesian model, we identified recurrent BARs in certain subregions of both top and bottom strands of this gene. Using this new system, the average size of BARs is about 15 bp. We also identified potential G-quadruplex DNA structures in this gene and found that the BARs co-locate with G-quadruplex structures in the opposite strand. Using various correlation analyses, there is not a direct site-to-site relationship between the bisulfite accessible ssDNA and all sites of SHM but most of the highly AID mutated sites are within 15 bp of a BAR. In summary, we developed a novel platform to study single stranded DNA in chromatin at a base pair resolution that reveals potential relationships among BARs, SHM and G-quadruplexes. This platform could be applied to genome wide studies in the future.
Collapse
Affiliation(s)
- Guojun Yu
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, New York, United States of America
| | - Yingru Wu
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, United States of America
| | - Zhi Duan
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, New York, United States of America
| | - Catherine Tang
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, United States of America
| | - Haipeng Xing
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, United States of America
| | - Matthew D. Scharff
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, New York, United States of America
| | - Thomas MacCarthy
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, United States of America
| |
Collapse
|