1
Ali MA, Caetano-Anollés G. AlphaFold2 Reveals Structural Patterns of Seasonal Haplotype Diversification in SARS-CoV-2 Spike Protein Variants. BIOLOGY 2024; 13:134. [PMID: 38534404 DOI: 10.3390/biology13030134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/10/2024] [Revised: 02/07/2024] [Accepted: 02/16/2024] [Indexed: 03/28/2024]
Abstract
The slow experimental acquisition of high-quality atomic structures of the rapidly changing proteins of the COVID-19 virus challenges vaccine and therapeutic drug development efforts. Fortunately, deep learning tools such as AlphaFold2 can quickly generate reliable models of atomic structure at experimental resolution. Current modeling studies have focused solely on definitions of mutant constellations of Variants of Concern (VOCs), leaving out the impact of haplotypes on protein structure. Here, we conduct a thorough comparative structural analysis of S-proteins belonging to major VOCs and corresponding latitude-delimited haplotypes that affect viral seasonal behavior. Our approach identified molecular regions of importance as well as patterns of structural recruitment. The S1 subunit hosted the majority of structural changes, especially those involving the N-terminal domain (NTD) and the receptor-binding domain (RBD). In particular, structural changes in the NTD were much greater than just translations in three-dimensional space, altering the sub-structures to greater extents. We also revealed a notable pattern of structural recruitment with the early VOCs Alpha and Delta behaving antagonistically by suppressing regions of structural change introduced by their corresponding haplotypes, and the current VOC Omicron behaving synergistically by amplifying or collecting structural change. Remarkably, haplotypes altering the galectin-like structure of the NTD were major contributors to seasonal behavior, supporting its putative environmental-sensing role. Our results provide an extensive view of the evolutionary landscape of the S-protein across the COVID-19 pandemic. This view will help predict important regions of structural change in future variants and haplotypes for more efficient vaccine and drug development.
Affiliation(s)
- Muhammad Asif Ali
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
2
Tomaszewski T, Ali MA, Caetano-Anollés K, Caetano-Anollés G. Seasonal effects decouple SARS-CoV-2 haplotypes worldwide. F1000Res 2023; 12:267. [PMID: 37069849 PMCID: PMC10105261 DOI: 10.12688/f1000research.131522.1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Accepted: 03/01/2023] [Indexed: 03/18/2023]
Abstract
Background: Variants of concern (VOCs) have been replacing each other during the still rampant COVID-19 pandemic. As a result, SARS-CoV-2 populations have evolved increasingly intricate constellations of mutations that often enhance transmissibility, disease severity, and other epidemiological characteristics. The origin and evolution of these constellations remain puzzling. Methods: Here we study the evolution of VOCs at the proteome level by analyzing about 12 million genomic sequences retrieved from GISAID on July 23, 2022. A total of 183,276 mutations were identified and filtered with a relevancy heuristic. The prevalence of haplotypes and free-standing mutations was then tracked monthly in various latitude corridors of the world. Results: A chronology of 22 haplotypes defined three phases driven by protein flexibility-rigidity, environmental sensing, and immune escape. A network of haplotypes illustrated the recruitment and coalescence of mutations into major VOC constellations and seasonal effects of decoupling and loss. Protein interaction networks mediated by haplotypes predicted communications impacting the structure and function of proteins, showing the increasingly central role of molecular interactions involving the spike (S), nucleocapsid (N), and membrane (M) proteins. Haplotype markers either affected fusogenic regions while spreading along the sequence of the S-protein or clustered around binding domains. Modeling of protein structure with AlphaFold2 showed that VOC Omicron and one of its haplotypes were major contributors to the distortion of the M-protein endodomain, which behaves as a receptor of other structural proteins during virion assembly. Remarkably, VOC constellations acted cooperatively to balance the more extreme effects of individual haplotypes. Conclusions: Our study uncovers seasonal patterns of emergence and diversification occurring amid a highly dynamic evolutionary landscape of bursts and waves. The mapping of genetically linked mutations to structures that sense environmental change with powerful ab initio modeling tools demonstrates the potential of deep learning for COVID-19 predictive intelligence and therapeutic intervention.
Affiliation(s)
- Tre Tomaszewski
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, 61801, USA
- Muhammad Asif Ali
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, 61801, USA
- Gustavo Caetano-Anollés
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, 61801, USA
- C. R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois, 61801, USA
3
Tomaszewski T, Gurtler V, Caetano-Anollés K, Caetano-Anollés G. The emergence of SARS-CoV-2 variants of concern in Australia by haplotype coalescence reveals a continental link to COVID-19 seasonality. METHODS IN MICROBIOLOGY 2022; 50:233-268. [PMID: 38013929 PMCID: PMC9110064 DOI: 10.1016/bs.mim.2022.03.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/26/2022]
Abstract
SARS-CoV-2 continues to evolve, even after implementation of public-wide vaccination, as can be observed by an increasing number of mutations over time. Compared to responses by the United States and European countries, the disease mitigation strategies employed by the Australian government have been swift and effective. This provides a unique opportunity to study the emergence of variants of concern (VOCs) at many latitude levels in a country that has been able to control infection for the majority of the pandemic. In the present study, we explored the occurrence and accumulation of major mutations typical of VOCs in different regions of Australia and the effects that latitude has on the establishment of VOC-induced disease. We also studied the constellation of mutations characteristic of VOCs to determine if the mutation sets acted as haplotypes. Our goal was to explore processes behind the emergence of VOCs as the viral disease progresses towards becoming endemic. Most reported COVID-19 cases were in the largest cities located within a 30°S to 50°S latitude corridor previously identified to be associated with seasonal behavior. Accumulation plots of individual amino acid variants of major VOCs showed that the first major haplotypes reported worldwide were also present in Australia. A classification of accumulation plots revealed the existence of 18 additional haplotypes associated with VOCs alpha, delta and omicron. Core mutant constellations for these VOCs and curve overlaps for variants in each set of haplotypes demonstrated significant decoupling patterns, suggesting processes of emergence. Finally, construction of a "haplotype network" that describes the viral population landscape of Australia throughout the COVID-19 pandemic revealed significant and unanticipated seasonal patterns of emergence and diversification. These results provide a unique window into our evolutionary understanding of a human pathogen of great significance. They may guide future research into mitigation and prediction strategies for future VOCs.
Affiliation(s)
- Tre Tomaszewski
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL, United States
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL, United States
4
Tang J, Cai W, Yan Z, Zhang K, Zhou Z, Zhao J, Lin S. Interactive effects of acidification and copper exposure on the reproduction and metabolism of coral endosymbiont Cladocopium goreaui. MARINE POLLUTION BULLETIN 2022; 177:113508. [PMID: 35259567 DOI: 10.1016/j.marpolbul.2022.113508] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 10/27/2021] [Revised: 02/13/2022] [Accepted: 02/25/2022] [Indexed: 06/14/2023]
Abstract
Ocean acidification resulting from increased CO2 and pollution from land-sourced toxicants such as copper have been linked to coral cover declines in coastal reef ecosystems. The impacts of ocean acidification and copper pollution on corals have been intensively investigated, whereas research on their effects on the coral endosymbiont Symbiodiniaceae is limited. In this study, reproduction, photosynthetic parameters, nutrient accumulation and metabolome of the Symbiodiniaceae Cladocopium goreaui were investigated after a weeklong treatment with acute CO2-induced acidification and copper ion. Acidification promoted algal reproduction through increased nutrient assimilation and upregulated citrate cycle and biomolecular biosynthesis pathways, while copper exposure repressed algal reproduction through toxic effects. The combined acidification and copper exposure caused the same decline in algal reproduction as copper exposure alone, but with upregulation of the pentose phosphate pathway and downregulation of aromatic amino acid biosynthesis. These results suggest that copper pollution could override the positive effects of acidification on symbiodiniacean reproduction.
Affiliation(s)
- Jia Tang
- Muping Coastal Environment Research Station, Yantai Institute of Coastal Zone Research, Chinese Academy of Sciences, Yantai, Shandong, China
- Wenqi Cai
- State Key Laboratory of Marine Resource Utilization in South China Sea, Hainan Aquaculture Breeding Engineering Research Center, Hainan University, Haikou, Hainan, China
- Zhicong Yan
- State Key Laboratory of Marine Resource Utilization in South China Sea, Hainan Aquaculture Breeding Engineering Research Center, Hainan University, Haikou, Hainan, China
- Kaidian Zhang
- State Key Laboratory of Marine Resource Utilization in South China Sea, Hainan Aquaculture Breeding Engineering Research Center, Hainan University, Haikou, Hainan, China
- Zhi Zhou
- State Key Laboratory of Marine Resource Utilization in South China Sea, Hainan Aquaculture Breeding Engineering Research Center, Hainan University, Haikou, Hainan, China
- Jianmin Zhao
- Muping Coastal Environment Research Station, Yantai Institute of Coastal Zone Research, Chinese Academy of Sciences, Yantai, Shandong, China
- Senjie Lin
- Department of Marine Sciences, University of Connecticut, Groton, CT, USA
5
Dimos B, Emery M, Beavers K, MacKnight N, Brandt M, Demuth J, Mydlarz L. Adaptive Variation in Homolog Number Within Transcript Families Promotes Expression Divergence in Reef-Building Coral. Mol Ecol 2022; 31:2594-2610. [PMID: 35229964 DOI: 10.1111/mec.16414] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 05/05/2021] [Revised: 02/10/2022] [Accepted: 02/22/2022] [Indexed: 11/30/2022]
Abstract
Gene expression, especially in multi-species experiments, is used to gain insight into the genetic basis of how organisms adapt and respond to changing environments. However, evolutionary processes that can influence gene expression patterns between species, such as the presence of paralogs arising from gene duplication events, are rarely accounted for. Paralogous transcripts can alter the transcriptional output of a gene, and thus exclusion of these transcripts can obscure important biological differences between species. To address this issue, we investigated how differences in transcript family size are associated with divergent gene expression patterns in five species of Caribbean reef-building corals. We demonstrate that transcript families that are rapidly evolving in terms of size have increased levels of expression divergence. Additionally, these rapidly evolving transcript families are enriched for multiple biological processes, with genes involved in the coral innate immune system demonstrating pronounced variation in homolog number between species. Overall, this investigation demonstrates the importance of incorporating paralogous transcripts when comparing gene expression across species, as they influence both transcriptional output and the number of transcripts within biological processes. As this investigation was based on transcriptome assemblies, additional insights into the relationship between gene duplications and expression patterns will likely emerge once more genome assemblies are available for study.
Affiliation(s)
- Bradford Dimos
- Department of Biology, University of Texas at Arlington, Arlington, TX, 76019, USA
- Madison Emery
- Department of Biology, University of Texas at Arlington, Arlington, TX, 76019, USA
- Kelsey Beavers
- Department of Biology, University of Texas at Arlington, Arlington, TX, 76019, USA
- Nicholas MacKnight
- Department of Biology, University of Texas at Arlington, Arlington, TX, 76019, USA
- Marilyn Brandt
- Center for Marine and Environmental Studies, University of the Virgin Islands, St. Thomas, US Virgin Islands, 00802, USA
- Jeffery Demuth
- Department of Biology, University of Texas at Arlington, Arlington, TX, 76019, USA
- Laura Mydlarz
- Department of Biology, University of Texas at Arlington, Arlington, TX, 76019, USA
6
Caetano-Anollés K, Hernandez N, Mughal F, Tomaszewski T, Caetano-Anollés G. The seasonal behaviour of COVID-19 and its galectin-like culprit of the viral spike. METHODS IN MICROBIOLOGY 2021; 50:27-81. [PMID: 38620818 PMCID: PMC8590929 DOI: 10.1016/bs.mim.2021.10.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 12/28/2022]
Abstract
Seasonal behaviour is an attribute of many viral diseases. Like other 'winter' RNA viruses, infections caused by the causative agent of COVID-19, SARS-CoV-2, appear to exhibit significant seasonal changes. Here we discuss the seasonal behaviour of COVID-19, emerging viral phenotypes, viral evolution, and how the mutational landscape of the virus affects the seasonal attributes of the disease. We propose that the multiple seasonal drivers behind infectious disease spread (and the spread of COVID-19 specifically) are in 'trade-off' relationships and can be better described within a framework of a 'triangle of viral persistence' modulated by the environment, physiology, and behaviour. This 'trade-off' exists as one trait cannot increase without a decrease in another. We also propose that molecular components of the virus can act as sensors of environment and physiology, and could represent molecular culprits of seasonality. We searched for flexible protein structures capable of being modulated by the environment and identified a galectin-like fold within the N-terminal domain of the spike protein of SARS-CoV-2 as a potential candidate. Tracking the prevalence of mutations in this structure resulted in the identification of a hemisphere-dependent seasonal pattern driven by mutational bursts. We propose that the galectin-like structure is a frequent target of mutations because it helps the virus evade or modulate the physiological responses of the host to further its spread and survival. The flexible regions of the N-terminal domain should now become a focus for mitigation through vaccines and therapeutics and for prediction and informed public health decision making.
Affiliation(s)
- Nicolas Hernandez
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL, United States
- Fizza Mughal
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL, United States
- Tre Tomaszewski
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL, United States
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL, United States