1
|
Aplakidou E, Vergoulidis N, Chasapi M, Venetsianou NK, Kokoli M, Panagiotopoulou E, Iliopoulos I, Karatzas E, Pafilis E, Georgakopoulos-Soares I, Kyrpides NC, Pavlopoulos GA, Baltoumas FA. Visualizing metagenomic and metatranscriptomic data: A comprehensive review. Comput Struct Biotechnol J 2024; 23:2011-2033. [PMID: 38765606 PMCID: PMC11101950 DOI: 10.1016/j.csbj.2024.04.060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Revised: 04/25/2024] [Accepted: 04/25/2024] [Indexed: 05/22/2024] Open
Abstract
The fields of Metagenomics and Metatranscriptomics involve the examination of complete nucleotide sequences, gene identification, and analysis of potential biological functions within diverse organisms or environmental samples. Despite the vast opportunities for discovery in metagenomics, the sheer volume and complexity of sequence data often present challenges in processing analysis and visualization. This article highlights the critical role of advanced visualization tools in enabling effective exploration, querying, and analysis of these complex datasets. Emphasizing the importance of accessibility, the article categorizes various visualizers based on their intended applications and highlights their utility in empowering bioinformaticians and non-bioinformaticians to interpret and derive insights from meta-omics data effectively.
Collapse
Affiliation(s)
- Eleni Aplakidou
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Department of Informatics and Telecommunications, Data Science and Information Technologies program, University of Athens, 15784 Athens, Greece
| | - Nikolaos Vergoulidis
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| | - Maria Chasapi
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Department of Informatics and Telecommunications, Data Science and Information Technologies program, University of Athens, 15784 Athens, Greece
| | - Nefeli K. Venetsianou
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| | - Maria Kokoli
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| | - Eleni Panagiotopoulou
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Department of Informatics and Telecommunications, Data Science and Information Technologies program, University of Athens, 15784 Athens, Greece
| | - Ioannis Iliopoulos
- Department of Basic Sciences, School of Medicine, University of Crete, 71003 Heraklion, Greece
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Evangelos Pafilis
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Heraklion, Greece
| | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Nikos C. Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Center of New Biotechnologies & Precision Medicine, Department of Medicine, School of Health Sciences, National and Kapodistrian University of Athens, Greece
- Hellenic Army Academy, 16673 Vari, Greece
| | - Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| |
Collapse
|
2
|
Butterfield ER, Obado SO, Scutts SR, Zhang W, Chait BT, Rout MP, Field MC. A lineage-specific protein network at the trypanosome nuclear envelope. Nucleus 2024; 15:2310452. [PMID: 38605598 PMCID: PMC11018031 DOI: 10.1080/19491034.2024.2310452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 01/18/2024] [Indexed: 04/13/2024] Open
Abstract
The nuclear envelope (NE) separates translation and transcription and is the location of multiple functions, including chromatin organization and nucleocytoplasmic transport. The molecular basis for many of these functions have diverged between eukaryotic lineages. Trypanosoma brucei, a member of the early branching eukaryotic lineage Discoba, highlights many of these, including a distinct lamina and kinetochore composition. Here, we describe a cohort of proteins interacting with both the lamina and NPC, which we term lamina-associated proteins (LAPs). LAPs represent a diverse group of proteins, including two candidate NPC-anchoring pore membrane proteins (POMs) with architecture conserved with S. cerevisiae and H. sapiens, and additional peripheral components of the NPC. While many of the LAPs are Kinetoplastid specific, we also identified broadly conserved proteins, indicating an amalgam of divergence and conservation within the trypanosome NE proteome, highlighting the diversity of nuclear biology across the eukaryotes, increasing our understanding of eukaryotic and NPC evolution.
Collapse
Affiliation(s)
| | - Samson O. Obado
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY, USA
| | - Simon R. Scutts
- Department of Pathology, University of Cambridge, Cambridge, UK
| | - Wenzhu Zhang
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY, USA
| | - Brian T. Chait
- Laboratory of Mass Spectrometry and Gaseous Ion Chemistry, The Rockefeller University, New York, NY, USA
| | - Michael P. Rout
- Laboratory of Cellular and Structural Biology, The Rockefeller University, New York, NY, USA
| | - Mark C. Field
- School of Life Sciences, University of Dundee, Dundee, UK
- Biology Centre, Czech Academy of Sciences, Institute of Parasitology, České Budějovice, Czech Republic
| |
Collapse
|
3
|
Concha-Eloko R, Stock M, De Baets B, Briers Y, Sanjuan R, Domingo-Calap P, Boeckaerts D. DepoScope: Accurate phage depolymerase annotation and domain delineation using large language models. PLoS Comput Biol 2024; 20:e1011831. [PMID: 39102416 DOI: 10.1371/journal.pcbi.1011831] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 07/20/2024] [Indexed: 08/07/2024] Open
Abstract
Bacteriophages (phages) are viruses that infect bacteria. Many of them produce specific enzymes called depolymerases to break down external polysaccharide structures. Accurate annotation and domain identification of these depolymerases are challenging due to their inherent sequence diversity. Hence, we present DepoScope, a machine learning tool that combines a fine-tuned ESM-2 model with a convolutional neural network to identify depolymerase sequences and their enzymatic domains precisely. To accomplish this, we curated a dataset from the INPHARED phage genome database, created a polysaccharide-degrading domain database, and applied sequential filters to construct a high-quality dataset, which is subsequently used to train DepoScope. Our work is the first approach that combines sequence-level predictions with amino-acid-level predictions for accurate depolymerase detection and functional domain identification. In that way, we believe that DepoScope can greatly enhance our understanding of phage-host interactions at the level of depolymerases.
Collapse
Affiliation(s)
- Robby Concha-Eloko
- Institute for Integrative Systems Biology (I2SysBio), Universitat de Valencia-CSIC, Paterna, Spain
| | - Michiel Stock
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Bernard De Baets
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Yves Briers
- Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, Ghent, Belgium
| | - Rafael Sanjuan
- Institute for Integrative Systems Biology (I2SysBio), Universitat de Valencia-CSIC, Paterna, Spain
| | - Pilar Domingo-Calap
- Institute for Integrative Systems Biology (I2SysBio), Universitat de Valencia-CSIC, Paterna, Spain
| | - Dimitri Boeckaerts
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
- Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, Ghent, Belgium
| |
Collapse
|
4
|
Hamm JN, Liao Y, von Kügelgen A, Dombrowski N, Landers E, Brownlee C, Johansson EMV, Whan RM, Baker MAB, Baum B, Bharat TAM, Duggin IG, Spang A, Cavicchioli R. The parasitic lifestyle of an archaeal symbiont. Nat Commun 2024; 15:6449. [PMID: 39085207 PMCID: PMC11291902 DOI: 10.1038/s41467-024-49962-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Accepted: 06/25/2024] [Indexed: 08/02/2024] Open
Abstract
DPANN archaea are a diverse group of microorganisms characterised by small cells and reduced genomes. To date, all cultivated DPANN archaea are ectosymbionts that require direct cell contact with an archaeal host species for growth and survival. However, these interactions and their impact on the host species are poorly understood. Here, we show that a DPANN archaeon (Candidatus Nanohaloarchaeum antarcticus) engages in parasitic interactions with its host (Halorubrum lacusprofundi) that result in host cell lysis. During these interactions, the nanohaloarchaeon appears to enter, or be engulfed by, the host cell. Our results provide experimental evidence for a predatory-like lifestyle of an archaeon, suggesting that at least some DPANN archaea may have roles in controlling host populations and their ecology.
Collapse
Affiliation(s)
- Joshua N Hamm
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, NSW, 2052, Australia.
- Department of Marine Microbiology and Biogeochemistry, Royal Netherlands Institute for Sea Research, Den Hoorn, The Netherlands, 1797 SZ.
| | - Yan Liao
- Australian Institute for Microbiology and Infection, University of Technology Sydney, Ultimo, NSW, 2007, Australia
| | - Andriko von Kügelgen
- Structural Studies Division, MRC Laboratory of Molecular Biology, Cambridge, CB2 0QH, UK
- Sir William Dunn School of Pathology, University of Oxford, Oxford, OX1 3RE, UK
| | - Nina Dombrowski
- Department of Marine Microbiology and Biogeochemistry, Royal Netherlands Institute for Sea Research, Den Hoorn, The Netherlands, 1797 SZ
| | - Evan Landers
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, NSW, 2052, Australia
| | - Christopher Brownlee
- Biological Resources Imaging Laboratory, Mark Wainwright Analytical Centre, University of New South Wales, Sydney, NSW, 2052, Australia
- Fluorescence Analysis Facility, Molecular Horizons, University of Wollongong, Keiraville, NSW, 2522, Australia
| | - Emma M V Johansson
- Biological Resources Imaging Laboratory, Mark Wainwright Analytical Centre, University of New South Wales, Sydney, NSW, 2052, Australia
| | - Renee M Whan
- Katharina Gaus Light Microscopy Facility, Mark Wainwright Analytical Centre, The University of New South Wales, Sydney, NSW, 2052, Australia
| | - Matthew A B Baker
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, NSW, 2052, Australia
| | - Buzz Baum
- Cell Biology Division, MRC Laboratory of Molecular Biology, Cambridge, CB2 0QH, UK
| | - Tanmay A M Bharat
- Structural Studies Division, MRC Laboratory of Molecular Biology, Cambridge, CB2 0QH, UK
- Sir William Dunn School of Pathology, University of Oxford, Oxford, OX1 3RE, UK
| | - Iain G Duggin
- Australian Institute for Microbiology and Infection, University of Technology Sydney, Ultimo, NSW, 2007, Australia
| | - Anja Spang
- Department of Marine Microbiology and Biogeochemistry, Royal Netherlands Institute for Sea Research, Den Hoorn, The Netherlands, 1797 SZ
- Department of Evolutionary & Population Biology, Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, Amsterdam, The Netherlands
| | - Ricardo Cavicchioli
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, NSW, 2052, Australia
| |
Collapse
|
5
|
Sürmeli Y, Durmuş N, Şanlı-Mohamed G. Exploring the Structural Insights of Thermostable Geobacillus esterases by Computational Characterization. ACS OMEGA 2024; 9:32931-32941. [PMID: 39100300 PMCID: PMC11292637 DOI: 10.1021/acsomega.4c03818] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/21/2024] [Revised: 07/10/2024] [Accepted: 07/15/2024] [Indexed: 08/06/2024]
Abstract
This study conducted an in silico analysis of two biochemically characterized thermostable esterases, Est2 and Est3, from Geobacillus strains. To achieve this, the amino acid sequences of Est2 and Est3 were examined to assess their biophysicochemical properties, evolutionary connections, and sequence similarities. Three-dimensional models were constructed and validated through diverse bioinformatics tools. Molecular dynamics (MD) simulation was employed on a pNP-C2 ligand to explore interactions between enzymes and ligand. Biophysicochemical property analysis indicated that aliphatic indices and theoretical T m values of enzymes were between 82-83 and 55-65 °C, respectively. Molecular phylogeny placed Est2 and Est3 within Family XIII, alongside other Geobacillus esterases. DeepMSA2 revealed that Est2, Est3, and homologous sequences shared 12 conserved residues in their core domain (L39, D50, G53, G55, S57, G92, S94, G96, P108, P184, D193, and H223). BANΔIT analysis indicated that Est2 and Est3 had a significantly more rigid cap domain compared to Est30. Salt bridge analysis revealed that E150-R136, E124-K165, E137-R141, and E154-K157 salt bridges made Est2 and Est3 more stable compared to Est30. MD simulation indicated that Est3 exhibited greater fluctuations in the N-terminal region including conserved F25, cap domain, and C-terminal region, notably including H223, suggesting that these regions might influence esterase catalysis. The common residues in the ligand-binding sites of Est2-Est3 were determined as F25 and L167. The analysis of root mean square fluctuation (RMSF) revealed that region 1, encompassing F25 within the β2-α1 loop of Est3, exhibited higher fluctuations compared to those of Est2. Overall, this study might provide valuable insights for future investigations aimed at improving esterase thermostability and catalytic efficiency, critical industrial traits, through targeted amino acid modifications within the N-terminal region, cap domain, and C-terminal region using rational protein engineering techniques.
Collapse
Affiliation(s)
- Yusuf Sürmeli
- Department
of Agricultural Biotechnology, Tekirdağ
Namık Kemal University, 59030 Tekirdağ, Turkey
| | - Naciye Durmuş
- Department
of Molecular Biology and Genetics, İstanbul
Technical University, 34485 İstanbul, Turkey
| | | |
Collapse
|
6
|
Anderson T, Wheeler TJ. An FPGA-based hardware accelerator supporting sensitive sequence homology filtering with profile hidden Markov models. BMC Bioinformatics 2024; 25:247. [PMID: 39075359 PMCID: PMC11285124 DOI: 10.1186/s12859-024-05879-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 07/23/2024] [Indexed: 07/31/2024] Open
Abstract
BACKGROUND Sequence alignment lies at the heart of genome sequence annotation. While the BLAST suite of alignment tools has long held an important role in alignment-based sequence database search, greater sensitivity is achieved through the use of profile hidden Markov models (pHMMs). Here, we describe an FPGA hardware accelerator, called HAVAC, that targets a key bottleneck step (SSV) in the analysis pipeline of the popular pHMM alignment tool, HMMER. RESULTS The HAVAC kernel calculates the SSV matrix at 1739 GCUPS on a ∼ $3000 Xilinx Alveo U50 FPGA accelerator card, ∼ 227× faster than the optimized SSV implementation in nhmmer. Accounting for PCI-e data transfer data processing, HAVAC is 65× faster than nhmmer's SSV with one thread and 35× faster than nhmmer with four threads, and uses ∼ 31% the energy of a traditional high end Intel CPU. CONCLUSIONS HAVAC demonstrates the potential offered by FPGA hardware accelerators to produce dramatic speed gains in sequence annotation and related bioinformatics applications. Because these computations are performed on a co-processor, the host CPU remains free to simultaneously compute other aspects of the analysis pipeline.
Collapse
Affiliation(s)
- Tim Anderson
- Department of Computer Science, University of Montana, Missoula, MT, USA
| | - Travis J Wheeler
- R. Ken Coit College of Pharmacy, University of Arizona, Tucson, AZ, USA.
| |
Collapse
|
7
|
Liu H, Lee G, Sang H. Exploring SDHI fungicide resistance in Botrytis cinerea through genetic transformation system and AlphaFold model-based molecular docking. PEST MANAGEMENT SCIENCE 2024. [PMID: 39054739 DOI: 10.1002/ps.8328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 06/28/2024] [Accepted: 07/09/2024] [Indexed: 07/27/2024]
Abstract
BACKGROUND Gray mold caused by Botrytis cinerea is one of the most serious diseases affecting strawberry. Succinate dehydrogenase inhibitor (SDHI) fungicides have been used for more than a decade to control the disease. Monitoring resistance and improving in-depth understanding of resistance mechanisms are essential for the control of B. cinerea. RESULTS In this study, resistance monitoring of a SDHI fungicide boscalid was conducted on B. cinerea isolated from strawberries in Korea during 2020 and 2021, with resistance rates of 76.92% and 72.25%, respectively. In resistant strains, mutations P225F/H and H272R were found in SdhB, with P225F representing the dominant mutation type. Simultaneous mutations G85A, I93V, M158V, and V168I in SdhC were detected in 54.84% of sensitive strains. Sensitivity profiles of different Sdh genotypes of B. cinerea strains to six SDHIs were determined in vitro and in vivo. In addition, the mutation(s) were genetically validated through in situ SdhB (SdhC) expression. Docking assays between SDHIs and AlphaFold model-based Sdh complexes revealed generally consistent patterns with their in vitro phenotypes. CONCLUSION Resistance of B. cinerea to SDHI fungicide on strawberry was systematically investigated in this study. Deciphering of SDHI resistance through the genetic transformation system and AlphaFold model-based molecular docking will provide valuable insights into other target site-based fungicide resistance in fungal pathogens. © 2024 The Author(s). Pest Management Science published by John Wiley & Sons Ltd on behalf of Society of Chemical Industry.
Collapse
Affiliation(s)
- Haifeng Liu
- Department of Integrative Food, Bioscience and Biotechnology, Chonnam National University, Gwangju, Republic of Korea
| | - Gahee Lee
- Department of Integrative Food, Bioscience and Biotechnology, Chonnam National University, Gwangju, Republic of Korea
| | - Hyunkyu Sang
- Department of Integrative Food, Bioscience and Biotechnology, Chonnam National University, Gwangju, Republic of Korea
- Kumho Life Science Laboratory, Chonnam National University, Gwangju, Republic of Korea
- Institute of Synthetic Biology for Carbon Neutralization, Chonnam National University, Gwangju, Republic of Korea
| |
Collapse
|
8
|
Bałdysz S, Nawrot R, Barylski J. "Tear down that wall"-a critical evaluation of bioinformatic resources available for lysin researchers. Appl Environ Microbiol 2024; 90:e0236123. [PMID: 38842338 PMCID: PMC11267937 DOI: 10.1128/aem.02361-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2024] Open
Abstract
Lytic enzymes, or lysins for short, break down peptidoglycan and interrupt the continuity of the cell wall, which, in turn, causes osmotic lysis of the bacterium. Their ability to destroy bacteria from within makes them promising antimicrobial agents that can be used as alternatives or supplements to antibiotics. In this paper, we briefly summarize basic terms and concepts used to describe lysin sequences and delineate major lysin groups. More importantly, we describe the domain repertoire found in lysins and critically review bioinformatic tools or databases which are used in studies of these enzymes (with particular emphasis on the repositories of Hidden Markov models). Finally, we present a novel comprehensive, meticulously curated set of lysin-related family and domain models, sort them into clusters that reflect major families, and demonstrate that the selected models can be used to efficiently search for new lysins.
Collapse
Affiliation(s)
- Sophia Bałdysz
- Department of Molecular Virology, Institute of Experimental Biology, Adam Mickiewicz University, Poznań, Poland
| | - Robert Nawrot
- Department of Molecular Virology, Institute of Experimental Biology, Adam Mickiewicz University, Poznań, Poland
| | - Jakub Barylski
- Department of Molecular Virology, Institute of Experimental Biology, Adam Mickiewicz University, Poznań, Poland
| |
Collapse
|
9
|
Fenster JA, Azzinaro PA, Dinhobl M, Borca MV, Spinard E, Gladue DP. African Swine Fever Virus Protein-Protein Interaction Prediction. Viruses 2024; 16:1170. [PMID: 39066332 PMCID: PMC11281715 DOI: 10.3390/v16071170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Revised: 07/05/2024] [Accepted: 07/12/2024] [Indexed: 07/28/2024] Open
Abstract
The African swine fever virus (ASFV) is an often deadly disease in swine and poses a threat to swine livestock and swine producers. With its complex genome containing more than 150 coding regions, developing effective vaccines for this virus remains a challenge due to a lack of basic knowledge about viral protein function and protein-protein interactions between viral proteins and between viral and host proteins. In this work, we identified ASFV-ASFV protein-protein interactions (PPIs) using artificial intelligence-powered protein structure prediction tools. We benchmarked our PPI identification workflow on the Vaccinia virus, a widely studied nucleocytoplasmic large DNA virus, and found that it could identify gold-standard PPIs that have been validated in vitro in a genome-wide computational screening. We applied this workflow to more than 18,000 pairwise combinations of ASFV proteins and were able to identify seventeen novel PPIs, many of which have corroborating experimental or bioinformatic evidence for their protein-protein interactions, further validating their relevance. Two protein-protein interactions, I267L and I8L, I267L__I8L, and B175L and DP79L, B175L__DP79L, are novel PPIs involving viral proteins known to modulate host immune response.
Collapse
Affiliation(s)
- Jacob A. Fenster
- Oak Ridge Institute for Science and Education (ORISE), Oak Ridge, TN 37830, USA;
- Plum Island Animal Disease Center, Foreign Animal Disease Research Unit, Agricultural Research Service, U.S. Department of Agriculture, Orient, NY 11957, USA; (P.A.A.); (M.D.); (E.S.)
- National Bio and Agro-Defense Facility, Foreign Animal Disease Research Unit, Agricultural Research Service, U.S. Department of Agriculture, Manhattan, KS 66502, USA
| | - Paul A. Azzinaro
- Plum Island Animal Disease Center, Foreign Animal Disease Research Unit, Agricultural Research Service, U.S. Department of Agriculture, Orient, NY 11957, USA; (P.A.A.); (M.D.); (E.S.)
- National Bio and Agro-Defense Facility, Foreign Animal Disease Research Unit, Agricultural Research Service, U.S. Department of Agriculture, Manhattan, KS 66502, USA
| | - Mark Dinhobl
- Plum Island Animal Disease Center, Foreign Animal Disease Research Unit, Agricultural Research Service, U.S. Department of Agriculture, Orient, NY 11957, USA; (P.A.A.); (M.D.); (E.S.)
- National Bio and Agro-Defense Facility, Foreign Animal Disease Research Unit, Agricultural Research Service, U.S. Department of Agriculture, Manhattan, KS 66502, USA
| | - Manuel V. Borca
- Plum Island Animal Disease Center, Foreign Animal Disease Research Unit, Agricultural Research Service, U.S. Department of Agriculture, Orient, NY 11957, USA; (P.A.A.); (M.D.); (E.S.)
- National Bio and Agro-Defense Facility, Foreign Animal Disease Research Unit, Agricultural Research Service, U.S. Department of Agriculture, Manhattan, KS 66502, USA
| | - Edward Spinard
- Plum Island Animal Disease Center, Foreign Animal Disease Research Unit, Agricultural Research Service, U.S. Department of Agriculture, Orient, NY 11957, USA; (P.A.A.); (M.D.); (E.S.)
- National Bio and Agro-Defense Facility, Foreign Animal Disease Research Unit, Agricultural Research Service, U.S. Department of Agriculture, Manhattan, KS 66502, USA
| | - Douglas P. Gladue
- Plum Island Animal Disease Center, Foreign Animal Disease Research Unit, Agricultural Research Service, U.S. Department of Agriculture, Orient, NY 11957, USA; (P.A.A.); (M.D.); (E.S.)
- National Bio and Agro-Defense Facility, Foreign Animal Disease Research Unit, Agricultural Research Service, U.S. Department of Agriculture, Manhattan, KS 66502, USA
| |
Collapse
|
10
|
Cuturello F, Celoria M, Ansuini A, Cazzaniga A. Enhancing predictions of protein stability changes induced by single mutations using MSA-based Language Models. Bioinformatics 2024; 40:btae447. [PMID: 39012369 PMCID: PMC11269464 DOI: 10.1093/bioinformatics/btae447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Revised: 06/19/2024] [Accepted: 07/10/2024] [Indexed: 07/17/2024] Open
Abstract
MOTIVATION Protein Language Models offer a new perspective for addressing challenges in structural biology, while relying solely on sequence information. Recent studies have investigated their effectiveness in forecasting shifts in thermodynamic stability caused by single amino acid mutations, a task known for its complexity due to the sparse availability of data, constrained by experimental limitations. To tackle this problem, we introduce two key novelties: leveraging a Protein Language Model that incorporates Multiple Sequence Alignments to capture evolutionary information, and using a recently released mega-scale dataset with rigorous data pre-processing to mitigate overfitting. RESULTS We ensure comprehensive comparisons by fine-tuning various pre-trained models, taking advantage of analyses such as ablation studies and baselines evaluation. Our methodology introduces a stringent policy to reduce the widespread issue of data leakage, rigorously removing sequences from the training set when they exhibit significant similarity with the test set. The MSA Transformer emerges as the most accurate among the models under investigation, given its capability to leverage co-evolution signals encoded in aligned homologous sequences. Moreover, the optimized MSA Transformer outperforms existing methods and exhibits enhanced generalization power, leading to a notable improvement in predicting changes in protein stability resulting from point mutations. AVAILABILITY AND IMPLEMENTATION Code and data at https://github.com/RitAreaSciencePark/PLM4Muts. SUPPLEMENTARY INFORMATION Supplementary Information is available at Bioinformatics online.
Collapse
Affiliation(s)
- Francesca Cuturello
- Research and Technology Institute, , AREA Science Park, Trieste 34149, Italy
| | - Marco Celoria
- Research and Technology Institute, , AREA Science Park, Trieste 34149, Italy
- HPC Department, , CINECA National Supercomputing Center, Bologna 40033, Italy
| | - Alessio Ansuini
- Research and Technology Institute, , AREA Science Park, Trieste 34149, Italy
| | - Alberto Cazzaniga
- Research and Technology Institute, , AREA Science Park, Trieste 34149, Italy
| |
Collapse
|
11
|
Pretzler M, Rompel A. Mushroom Tyrosinase: Six Isoenzymes Catalyzing Distinct Reactions. Chembiochem 2024; 25:e202400050. [PMID: 38386893 DOI: 10.1002/cbic.202400050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 02/20/2024] [Accepted: 02/21/2024] [Indexed: 02/24/2024]
Abstract
"Mushroom tyrosinase" from the common button mushroom is the most frequently used source of tyrosinase activity, both for basic and applied research. Here, the complete tyrosinase family from Agaricus bisporus var. bisporus (abPPO1-6) was cloned from mRNA and expressed heterologously using a single protocol. All six isoenzymes accept a wide range of phenolic and catecholic substrates, but display pronounced differences in their specificity and enzymatic reaction rate. AbPPO3 ignores γ-l-glutaminyl-4-hydroxybenzene (GHB), a natural phenol present in mM concentrations in A. bisporus, while AbPPO4 processes 100 μM GHB at 4-times the rate of the catechol l-DOPA. All six AbPPOs are biochemically distinct enzymes fit for different roles in the fungal life cycle, which challenges the traditional concept of isoenzymes as catalyzing the same physiological reaction and varying only in secondary properties. Transferring this approach to other enzymes and organisms will greatly stimulate both the study of the in vivo function(s) of enzymes and the application of these highly efficient catalysts.
Collapse
Affiliation(s)
- Matthias Pretzler
- Universität Wien, Fakultät für Chemie, Institut für Biophysikalische Chemie, Josef-Holaubek-Platz 2, 1090, Wien, Austria
| | - Annette Rompel
- Universität Wien, Fakultät für Chemie, Institut für Biophysikalische Chemie, Josef-Holaubek-Platz 2, 1090, Wien, Austria
| |
Collapse
|
12
|
Barth ZK, Hicklin I, Thézé J, Takatsuka J, Nakai M, Herniou EA, Brown AM, Aylward FO. Genomic analysis of hyperparasitic viruses associated with entomopoxviruses. Virus Evol 2024; 10:veae051. [PMID: 39100687 PMCID: PMC11296320 DOI: 10.1093/ve/veae051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 05/17/2024] [Accepted: 07/11/2024] [Indexed: 08/06/2024] Open
Abstract
Polinton-like viruses (PLVs) are a diverse group of small integrative dsDNA viruses that infect diverse eukaryotic hosts. Many PLVs are hypothesized to parasitize viruses in the phylum Nucleocytoviricota for their own propagation and spread. Here, we analyze the genomes of novel PLVs associated with the occlusion bodies of entomopoxvirus (EPV) infections of two separate lepidopteran hosts. The presence of these elements within EPV occlusion bodies suggests that they are the first known hyperparasites of poxviruses. We find that these PLVs belong to two distinct lineages that are highly diverged from known PLVs. These PLVs possess mosaic genomes, and some essential genes share homology with mobile genes within EPVs. Based on this homology and observed PLV mosaicism, we propose a mechanism to explain the turnover of PLV replication and integration genes.
Collapse
Affiliation(s)
- Zachary K Barth
- Department of Biological Sciences, Virginia Tech, 926 West Campus Drive, Blacksburg, VA 24061, USA
| | - Ian Hicklin
- Department of Biochemistry, Virginia Tech, Blacksburg, VA 24061, USA
| | - Julien Thézé
- Université Clermont Auvergne, INRAE, VetAgro Sup, UMR EPIA, Saint-Genès-Champanelle, France
| | - Jun Takatsuka
- Forestry and Forest Products Research Institute, Matsunosato, Tsukuba, Ibaraki 305-8687, Japan
| | - Madoka Nakai
- Institute of Agriculture, Tokyo University of Agriculture and Technology, Saiwai, Fuchu, Tokyo 183-8509, Japan
| | - Elisabeth A Herniou
- Institut de Recherche sur la Biologie de l’Insecte, UMR7261 CNRS-Université de Tours, 20 Avenue Monge, Parc de Grandmont, Tours 37200, France
| | - Anne M Brown
- Department of Biochemistry, Virginia Tech, Blacksburg, VA 24061, USA
- Center for Emerging, Zoonotic, and Arthropod-Borne Pathogens, Virginia Tech, 1981 Kraft Dr, Blacksburg, VA 24061, USA
- Research and Informatics, University Libraries, Virginia Tech, Blacksburg, VA 24061, USA
| | - Frank O Aylward
- Department of Biological Sciences, Virginia Tech, 926 West Campus Drive, Blacksburg, VA 24061, USA
- Center for Emerging, Zoonotic, and Arthropod-Borne Pathogens, Virginia Tech, 1981 Kraft Dr, Blacksburg, VA 24061, USA
| |
Collapse
|
13
|
Kollmar M, Welz T, Ravi A, Kaufmann T, Alzahofi N, Hatje K, Alghamdi A, Kim J, Briggs DA, Samol-Wolf A, Pylypenko O, Hume AN, Burkhardt P, Faix J, Kerkhoff E. Actomyosin organelle functions of SPIRE actin nucleators precede animal evolution. Commun Biol 2024; 7:832. [PMID: 38977899 PMCID: PMC11231147 DOI: 10.1038/s42003-024-06458-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 06/14/2024] [Indexed: 07/10/2024] Open
Abstract
An important question in cell biology is how cytoskeletal proteins evolved and drove the development of novel structures and functions. Here we address the origin of SPIRE actin nucleators. Mammalian SPIREs work with RAB GTPases, formin (FMN)-subgroup actin assembly proteins and class-5 myosin (MYO5) motors to transport organelles along actin filaments towards the cell membrane. However, the origin and extent of functional conservation of SPIRE among species is unknown. Our sequence searches show that SPIRE exist throughout holozoans (animals and their closest single-celled relatives), but not other eukaryotes. SPIRE from unicellular holozoans (choanoflagellate), interacts with RAB, FMN and MYO5 proteins, nucleates actin filaments and complements mammalian SPIRE function in organelle transport. Meanwhile SPIRE and MYO5 proteins colocalise to organelles in Salpingoeca rosetta choanoflagellates. Based on these observations we propose that SPIRE originated in unicellular ancestors of animals providing an actin-myosin driven exocytic transport mechanism that may have contributed to the evolution of complex multicellular animals.
Collapse
Affiliation(s)
- Martin Kollmar
- Group Systems Biology of Motor Proteins, Department of NMR-based Structural Biology, Max-Planck-Institute for Biophysical Chemistry, Göttingen, Germany.
| | - Tobias Welz
- Molecular Cell Biology Laboratory, Department of Neurology, University Hospital Regensburg, Regensburg, Germany
| | - Aishwarya Ravi
- Michael Sars Centre, University of Bergen, Bergen, Norway
| | - Thomas Kaufmann
- Institute for Biophysical Chemistry, Hannover Medical School, Hannover, Germany
| | - Noura Alzahofi
- School of Life Sciences, University of Nottingham, Nottingham, UK
- Biology Department, College of Science, Taibah University, Medina, Kingdom of Saudi Arabia
| | - Klas Hatje
- Group Systems Biology of Motor Proteins, Department of NMR-based Structural Biology, Max-Planck-Institute for Biophysical Chemistry, Göttingen, Germany
- Roche Pharmaceutical Research and Early Development, Pharmaceutical Sciences, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Basel, Switzerland
| | - Asmahan Alghamdi
- School of Life Sciences, University of Nottingham, Nottingham, UK
- Department of Biology, College of Sciences, Princess Nourah bint Abdulrahman University, Riyadh, Kingdom of Saudi Arabia
| | - Jiyu Kim
- Molecular Cell Biology Laboratory, Department of Neurology, University Hospital Regensburg, Regensburg, Germany
- Department of Anatomy, University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Deborah A Briggs
- School of Life Sciences, University of Nottingham, Nottingham, UK
| | - Annette Samol-Wolf
- Molecular Cell Biology Laboratory, Department of Neurology, University Hospital Regensburg, Regensburg, Germany
| | - Olena Pylypenko
- Dynamics of Intra-Cellular Organization, Institute Curie, PSL Research University, CNRS UMR144, Paris, France
| | - Alistair N Hume
- School of Life Sciences, University of Nottingham, Nottingham, UK
| | | | - Jan Faix
- Institute for Biophysical Chemistry, Hannover Medical School, Hannover, Germany
| | - Eugen Kerkhoff
- Molecular Cell Biology Laboratory, Department of Neurology, University Hospital Regensburg, Regensburg, Germany.
| |
Collapse
|
14
|
Heydarian Z, Harrington M, Hegedus DD. Defects in Glabrous 3 (GL3) functionality underlie the absence of trichomes in Brassica napus. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024. [PMID: 38967095 DOI: 10.1111/tpj.16878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Revised: 05/29/2024] [Accepted: 05/30/2024] [Indexed: 07/06/2024]
Abstract
Previously, expression of the Arabidopsis thaliana GLABRA3 (GL3) induced trichome formation in Brassica napus. GL3 orthologues were examined from glabrous (B. oleracea), semi-glabrous (B. napus), moderately hirsute (B. rapa), and very hirsute (B. villosa) Brassica species. Ectopic expression of BnGL3, BrGL3 alleles, or BvGL3 induced trichome formation in glabrous B. napus with the effect on trichome number commensurate with density in the original accessions. Chimeric GL3 proteins in which the B. napus amino terminal region, which interacts with MYB proteins, or the middle region, which interacts with the WD40 protein TTG1, was exchanged with corresponding regions from A. thaliana were as stimulatory to trichome production as AtGL3. Exchange of the carboxy-terminal region containing a bHLH domain and an ACT domain did not alter the trichome stimulatory activity, although modeling of the ACT domain identified differences that could affect GL3 dimerization. B. napus A- and C-genomes orthologues differed in their abilities to form homo- and heterodimers. Modeling of the amino-terminal region revealed a conserved domain that may represent the MYB factor binding pocket. This region interacted with the MYB factors GL1, CPC, and TRY, as well as with JAZ8, which is involved in jasmonic acid-mediated regulation of MYC-like transcription factors. Protein interaction studies indicated that GL1 interaction with GL3 from B. napus and A. thaliana may underlie the difference in their respective abilities to induce trichome formation.
Collapse
Affiliation(s)
- Zohreh Heydarian
- Agriculture and Agri-Food Canada, 107 Science Place, Saskatoon, Saskatchewan, S7N 0X2, Canada
- Department of Biotechnology, School of Agriculture, University of Shiraz, Bajgah, Shiraz, Fars, Iran
| | - Myrtle Harrington
- Agriculture and Agri-Food Canada, 107 Science Place, Saskatoon, Saskatchewan, S7N 0X2, Canada
| | - Dwayne D Hegedus
- Agriculture and Agri-Food Canada, 107 Science Place, Saskatoon, Saskatchewan, S7N 0X2, Canada
- Department of Food and Bioproduct Sciences, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| |
Collapse
|
15
|
Shomar H, Georjon H, Feng Y, Olympio B, Guillaume M, Tesson F, Cury J, Wu F, Bernheim A. Viperin immunity evolved across the tree of life through serial innovations on a conserved scaffold. Nat Ecol Evol 2024:10.1038/s41559-024-02463-z. [PMID: 38965412 DOI: 10.1038/s41559-024-02463-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 06/05/2024] [Indexed: 07/06/2024]
Abstract
Evolutionary arms races between cells and viruses drive the rapid diversification of antiviral genes in diverse life forms. Recent discoveries have revealed the existence of immune genes that are shared between prokaryotes and eukaryotes and show molecular and mechanistic similarities in their response to viruses. However, the evolutionary dynamics underlying the conservation and adaptation of these antiviral genes remain mostly unexplored. Here, we show that viperins constitute a highly conserved family of immune genes across diverse prokaryotes and eukaryotes and identify mechanisms by which they diversified in eukaryotes. Our findings indicate that viperins are enriched in Asgard archaea and widely distributed in all major eukaryotic clades, suggesting their presence in the last eukaryotic common ancestor and their acquisition in eukaryotes from an archaeal lineage. We show that viperins maintain their immune function by producing antiviral nucleotide analogues and demonstrate that eukaryotic viperins diversified through serial innovations on the viperin gene, such as the emergence and selection of substrate specificity towards pyrimidine nucleotides, and through partnerships with genes maintained through genetic linkage, notably with nucleotide kinases. These findings unveil biochemical and genomic transitions underlying the adaptation of immune genes shared by prokaryotes and eukaryotes. Our study paves the way for further understanding of the conservation of immunity across domains of life.
Collapse
Affiliation(s)
- Helena Shomar
- Institut Pasteur, Université Paris Cité, INSERM U1284, Molecular Diversity of Microbes Lab, Paris, France
| | - Héloïse Georjon
- Institut Pasteur, Université Paris Cité, INSERM U1284, Molecular Diversity of Microbes Lab, Paris, France
- Generare Bioscience, Paris, France
| | - Yanlei Feng
- School of Life Sciences, College of Science, Eastern Institute of Technology, Ningbo, China
- Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou, China
| | - Bismarck Olympio
- Institut Pasteur, Université Paris Cité, INSERM U1284, Molecular Diversity of Microbes Lab, Paris, France
| | - Marie Guillaume
- Institut Pasteur, Université Paris Cité, INSERM U1284, Molecular Diversity of Microbes Lab, Paris, France
| | - Florian Tesson
- Institut Pasteur, Université Paris Cité, INSERM U1284, Molecular Diversity of Microbes Lab, Paris, France
| | - Jean Cury
- Institut Pasteur, Université Paris Cité, INSERM U1284, Molecular Diversity of Microbes Lab, Paris, France
| | - Fabai Wu
- School of Life Sciences, College of Science, Eastern Institute of Technology, Ningbo, China.
| | - Aude Bernheim
- Institut Pasteur, Université Paris Cité, INSERM U1284, Molecular Diversity of Microbes Lab, Paris, France.
| |
Collapse
|
16
|
Huang YH, Sun YF, Li H, Li HS, Pang H. PhyloAln: A Convenient Reference-Based Tool to Align Sequences and High-Throughput Reads for Phylogeny and Evolution in the Omic Era. Mol Biol Evol 2024; 41:msae150. [PMID: 39041199 PMCID: PMC11287380 DOI: 10.1093/molbev/msae150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 05/15/2024] [Accepted: 07/16/2024] [Indexed: 07/24/2024] Open
Abstract
The current trend in phylogenetic and evolutionary analyses predominantly relies on omic data. However, prior to core analyses, traditional methods typically involve intricate and time-consuming procedures, including assembly from high-throughput reads, decontamination, gene prediction, homology search, orthology assignment, multiple sequence alignment, and matrix trimming. Such processes significantly impede the efficiency of research when dealing with extensive data sets. In this study, we develop PhyloAln, a convenient reference-based tool capable of directly aligning high-throughput reads or complete sequences with existing alignments as a reference for phylogenetic and evolutionary analyses. Through testing with simulated data sets of species spanning the tree of life, PhyloAln demonstrates consistently robust performance compared with other reference-based tools across different data types, sequencing technologies, coverages, and species, with percent completeness and identity at least 50 percentage points higher in the alignments. Additionally, we validate the efficacy of PhyloAln in removing a minimum of 90% foreign and 70% cross-contamination issues, which are prevalent in sequencing data but often overlooked by other tools. Moreover, we showcase the broad applicability of PhyloAln by generating alignments (completeness mostly larger than 80%, identity larger than 90%) and reconstructing robust phylogenies using real data sets of transcriptomes of ladybird beetles, plastid genes of peppers, or ultraconserved elements of turtles. With these advantages, PhyloAln is expected to facilitate phylogenetic and evolutionary analyses in the omic era. The tool is accessible at https://github.com/huangyh45/PhyloAln.
Collapse
Affiliation(s)
- Yu-Hao Huang
- State Key Laboratory of Biocontrol, School of Ecology, Sun Yat-sen University, Shenzhen 518107, China
| | - Yi-Fei Sun
- State Key Laboratory of Biocontrol, School of Ecology, Sun Yat-sen University, Shenzhen 518107, China
| | - Hao Li
- State Key Laboratory of Biocontrol, School of Ecology, Sun Yat-sen University, Shenzhen 518107, China
| | - Hao-Sen Li
- State Key Laboratory of Biocontrol, School of Ecology, Sun Yat-sen University, Shenzhen 518107, China
| | - Hong Pang
- State Key Laboratory of Biocontrol, School of Ecology, Sun Yat-sen University, Shenzhen 518107, China
| |
Collapse
|
17
|
Norn C, Oliveira F, André I. Improved prediction of site-rates from structure with averaging across homologs. Protein Sci 2024; 33:e5086. [PMID: 38923241 PMCID: PMC11196898 DOI: 10.1002/pro.5086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 05/12/2024] [Accepted: 06/04/2024] [Indexed: 06/28/2024]
Abstract
Variation in mutation rates at sites in proteins can largely be understood by the constraint that proteins must fold into stable structures. Models that calculate site-specific rates based on protein structure and a thermodynamic stability model have shown a significant but modest ability to predict empirical site-specific rates calculated from sequence. Models that use detailed atomistic models of protein energetics do not outperform simpler approaches using packing density. We demonstrate that a fundamental reason for this is that empirical site-specific rates are the result of the average effect of many different microenvironments in a phylogeny. By analyzing the results of evolutionary dynamics simulations, we show how averaging site-specific rates across many extant protein structures can lead to correct recovery of site-rate prediction. This result is also demonstrated in natural protein sequences and experimental structures. Using predicted structures, we demonstrate that atomistic models can improve upon contact density metrics in predicting site-specific rates from a structure. The results give fundamental insights into the factors governing the distribution of site-specific rates in protein families.
Collapse
Affiliation(s)
- Christoffer Norn
- Department of Biochemistry and Structural BiologyLund UniversityLundSweden
- Bioinnovation Institute FoundationKøbenhavnDenmark
| | - Fábio Oliveira
- Department of Biochemistry and Structural BiologyLund UniversityLundSweden
| | - Ingemar André
- Department of Biochemistry and Structural BiologyLund UniversityLundSweden
| |
Collapse
|
18
|
Tütüncü HE, Durmuş N, Sürmeli Y. Unraveling the potential of uninvestigated thermoalkaliphilic lipases by molecular docking and molecular dynamic simulation: an in silico characterization study. 3 Biotech 2024; 14:179. [PMID: 38882640 PMCID: PMC11176153 DOI: 10.1007/s13205-024-04023-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 05/29/2024] [Indexed: 06/18/2024] Open
Abstract
Thermoalkaliphilic lipase enzymes are mostly favored for use in the detergent industry. While there has been considerable research on Geobacillus lipases, a significant portion of these enzymes remains unexplored or undocumented in the scientific literature. This work performed in silico phylogeny, sequence alignment, structural and enzyme-substrate interaction analyses of the five thermoalkaliphilic lipases belonging to different Geobacillus species (Geobacillus stearothermophilus lipase = GsLip, Geobacillus sp. B4113_201601 lipase = Gb4Lip, Geobacillus kaustophilus HTA426 lipase = GkLip, Geobacillus sp. SP22 lipase = GspLip, Geobacillus sp. NTU 03 lipase = GntLip). For this purpose, unreviewed enzyme sequences of five Geobacillus thermoalkaliphilic lipases were analyzed at sequence and phylogeny levels. 3D homology enzyme models were built, validated, and investigated by different bioinformatics tools. The ligand interactions screening using seven para-nitrophenyl (pNP) esters and enzyme-ligand interactions were analyzed on Gb4Lip:pNP-C12 and BTL2:pNP-C12 by MD simulation. Biophysicochemical characteristic analysis showed that Gb4Lip had a theoretical T m value of above 65 ºC, and a higher aliphatic index indicating greater thermal stability. Sequence alignment showed a hydrophilic threonine in the α6 helix of Gb4Lip, indicating high enzymatic activity. A normalized temperature factor B (B'-factor) analysis showed that the lid domains of five lipases significantly possessed lower B'-factor values, compared to G. thermocatenulatus lipase 2 (BTL2), indicating that they had higher rigidity. Molecular docking results indicated that the five lipases had the highest binding affinity toward pNP-C12. The RMSF investigation revealed that the thermostability of Gb4Lip is influenced by specific molecular elements: D202-S203 within the αB region of the lid domain, and E274-Q275 within the b3 strand, as well as W278 in the b3-b4 loop, and H282 in the b4 strand of the Ca2+-binding region. MD simulation analysis showed that catalytic residue S114 and at least one oxyanion hole residue (F17 and/or Q114) in Gb4Lip frequently formed hydrogen bonds with the pNP-C12 ligand at 343 K and 348 K throughout the simulation process, indicating that Gb4Lip might catalyze relatively long-chain ligand pNP-C12 with high performance. In conclusion, Gb4Lip might be a more suitable candidate as the detergent additive. In addition, this investigation can offer valuable perspectives on Family I.5 lipases such as Gb4Lip for future exploration in the field of protein engineering. Supplementary Information The online version contains supplementary material available at 10.1007/s13205-024-04023-5.
Collapse
Affiliation(s)
- Havva Esra Tütüncü
- Department of Nutrition and Dietetics, Malatya Turgut Özal University, 44210 Malatya, Turkey
| | - Naciye Durmuş
- Department of Molecular Biology and Genetics, İstanbul Technical University, 34485 Istanbul, Turkey
| | - Yusuf Sürmeli
- Department of Agricultural Biotechnology, Tekirdağ Namık Kemal University, 59030 Tekirdağ, Turkey
| |
Collapse
|
19
|
Bryant P, Noé F. Improved protein complex prediction with AlphaFold-multimer by denoising the MSA profile. PLoS Comput Biol 2024; 20:e1012253. [PMID: 39052676 DOI: 10.1371/journal.pcbi.1012253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 08/06/2024] [Accepted: 06/14/2024] [Indexed: 07/27/2024] Open
Abstract
Structure prediction of protein complexes has improved significantly with AlphaFold2 and AlphaFold-multimer (AFM), but only 60% of dimers are accurately predicted. Here, we learn a bias to the MSA representation that improves the predictions by performing gradient descent through the AFM network. We demonstrate the performance on seven difficult targets from CASP15 and increase the average MMscore to 0.76 compared to 0.63 with AFM. We evaluate the procedure on 487 protein complexes where AFM fails and obtain an increased success rate (MMscore>0.75) of 33% on these difficult targets. Our protocol, AFProfile, provides a way to direct predictions towards a defined target function guided by the MSA. We expect gradient descent over the MSA to be useful for different tasks.
Collapse
Affiliation(s)
- Patrick Bryant
- Department of Mathematics and Informatics, Freie Universität Berlin, Germany
- The Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm, Sweden
- Science for Life Laboratory, Solna, Sweden
| | - Frank Noé
- Department of Mathematics and Informatics, Freie Universität Berlin, Germany
- Microsoft Research AI4Science, Berlin, Germany
| |
Collapse
|
20
|
Sorgenfrei FA, Sloan JJ, Weissensteiner F, Zechner M, Mehner NA, Ellinghaus TL, Schachtschabel D, Seemayer S, Kroutil W. Solvent concentration at 50% protein unfolding may reform enzyme stability ranking and process window identification. Nat Commun 2024; 15:5420. [PMID: 38926341 PMCID: PMC11208486 DOI: 10.1038/s41467-024-49774-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 06/19/2024] [Indexed: 06/28/2024] Open
Abstract
As water miscible organic co-solvents are often required for enzyme reactions to improve e.g., the solubility of the substrate in the aqueous medium, an enzyme is required which displays high stability in the presence of this co-solvent. Consequently, it is of utmost importance to identify the most suitable enzyme or the appropriate reaction conditions. Until now, the melting temperature is used in general as a measure for stability of enzymes. The experiments here show, that the melting temperature does not correlate to the activity observed in the presence of the solvent. As an alternative parameter, the concentration of the co-solvent at the point of 50% protein unfolding at a specific temperature T in shortc U 50 T is introduced. Analyzing a set of ene reductases,c U 50 T is shown to indicate the concentration of the co-solvent where also the activity of the enzyme drops fastest. Comparing possible rankings of enzymes according to melting temperature andc U 50 T reveals a clearly diverging outcome also depending on the specific solvent used. Additionally, plots ofc U 50 versus temperature enable a fast identification of possible reaction windows to deduce tolerated solvent concentrations and temperature.
Collapse
Affiliation(s)
- Frieda A Sorgenfrei
- Austrian Centre of Industrial Biotechnology c/o University of Graz, Heinrichstrasse 28, 8010, Graz, Austria
| | - Jeremy J Sloan
- BASF SE, Carl-Bosch-Strasse 38, 67056, Ludwigshafen, Germany
| | - Florian Weissensteiner
- Austrian Centre of Industrial Biotechnology c/o University of Graz, Heinrichstrasse 28, 8010, Graz, Austria
- Department of Chemistry, University of Graz, NAWI Graz, Heinrichstrasse 28, 8010, Graz, Austria
| | - Marco Zechner
- Austrian Centre of Industrial Biotechnology c/o University of Graz, Heinrichstrasse 28, 8010, Graz, Austria
| | - Niklas A Mehner
- BASF SE, Carl-Bosch-Strasse 38, 67056, Ludwigshafen, Germany
| | | | | | - Stefan Seemayer
- BASF SE, Carl-Bosch-Strasse 38, 67056, Ludwigshafen, Germany.
| | - Wolfgang Kroutil
- Austrian Centre of Industrial Biotechnology c/o University of Graz, Heinrichstrasse 28, 8010, Graz, Austria.
- Department of Chemistry, University of Graz, NAWI Graz, Heinrichstrasse 28, 8010, Graz, Austria.
- BioTechMed Graz, 8010, Graz, Austria.
- Field of Excellence BioHealth, University of Graz, 8010, Graz, Austria.
| |
Collapse
|
21
|
Elisée E, Ducrot L, Méheust R, Bastard K, Fossey-Jouenne A, Grogan G, Pelletier E, Petit JL, Stam M, de Berardinis V, Zaparucha A, Vallenet D, Vergne-Vaxelaire C. A refined picture of the native amine dehydrogenase family revealed by extensive biodiversity screening. Nat Commun 2024; 15:4933. [PMID: 38858403 PMCID: PMC11164908 DOI: 10.1038/s41467-024-49009-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 05/20/2024] [Indexed: 06/12/2024] Open
Abstract
Native amine dehydrogenases offer sustainable access to chiral amines, so the search for scaffolds capable of converting more diverse carbonyl compounds is required to reach the full potential of this alternative to conventional synthetic reductive aminations. Here we report a multidisciplinary strategy combining bioinformatics, chemoinformatics and biocatalysis to extensively screen billions of sequences in silico and to efficiently find native amine dehydrogenases features using computational approaches. In this way, we achieve a comprehensive overview of the initial native amine dehydrogenase family, extending it from 2,011 to 17,959 sequences, and identify native amine dehydrogenases with non-reported substrate spectra, including hindered carbonyls and ethyl ketones, and accepting methylamine and cyclopropylamine as amine donor. We also present preliminary model-based structural information to inform the design of potential (R)-selective amine dehydrogenases, as native amine dehydrogenases are mostly (S)-selective. This integrated strategy paves the way for expanding the resource of other enzyme families and in highlighting enzymes with original features.
Collapse
Affiliation(s)
- Eddy Elisée
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Laurine Ducrot
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Raphaël Méheust
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Karine Bastard
- School of Pharmacy, Faculty of Medicine and Health, University of Sydney, Sydney, NSW, 2006, Australia
| | - Aurélie Fossey-Jouenne
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Gideon Grogan
- York Structural Biology Laboratory, Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK
| | - Eric Pelletier
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Jean-Louis Petit
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Mark Stam
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Véronique de Berardinis
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - Anne Zaparucha
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France
| | - David Vallenet
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France.
| | - Carine Vergne-Vaxelaire
- Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Univ Evry, Université Paris-Saclay, 91057, Evry, France.
| |
Collapse
|
22
|
Rehman S, Antonovic AK, McIntire IE, Zheng H, Cleaver L, Baczynska M, Adams CO, Portlock T, Richardson K, Shaw R, Oregioni A, Mastroianni G, Whittaker SBM, Kelly G, Lorenz CD, Fornili A, Cianciotto NP, Garnett JA. The Legionella collagen-like protein employs a distinct binding mechanism for the recognition of host glycosaminoglycans. Nat Commun 2024; 15:4912. [PMID: 38851738 PMCID: PMC11162425 DOI: 10.1038/s41467-024-49255-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2023] [Accepted: 05/30/2024] [Indexed: 06/10/2024] Open
Abstract
Bacterial adhesion is a fundamental process which enables colonisation of niche environments and is key for infection. However, in Legionella pneumophila, the causative agent of Legionnaires' disease, these processes are not well understood. The Legionella collagen-like protein (Lcl) is an extracellular peripheral membrane protein that recognises sulphated glycosaminoglycans on the surface of eukaryotic cells, but also stimulates bacterial aggregation in response to divalent cations. Here we report the crystal structure of the Lcl C-terminal domain (Lcl-CTD) and present a model for intact Lcl. Our data reveal that Lcl-CTD forms an unusual trimer arrangement with a positively charged external surface and negatively charged solvent exposed internal cavity. Through molecular dynamics simulations, we show how the glycosaminoglycan chondroitin-4-sulphate associates with the Lcl-CTD surface via distinct binding modes. Our findings show that Lcl homologs are present across both the Pseudomonadota and Fibrobacterota-Chlorobiota-Bacteroidota phyla and suggest that Lcl may represent a versatile carbohydrate-binding mechanism.
Collapse
Affiliation(s)
- Saima Rehman
- Centre for Host-Microbiome Interactions, Faculty of Dental, Oral & Craniofacial Sciences, King's College London, London, UK
| | - Anna Katarina Antonovic
- Department of Chemistry, School of Physical and Chemical Sciences, Queen Mary University of London, London, UK
| | - Ian E McIntire
- Department of Microbiology and Immunology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Huaixin Zheng
- Department of Microbiology and Immunology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Leanne Cleaver
- Centre for Host-Microbiome Interactions, Faculty of Dental, Oral & Craniofacial Sciences, King's College London, London, UK
| | - Maria Baczynska
- Centre for Host-Microbiome Interactions, Faculty of Dental, Oral & Craniofacial Sciences, King's College London, London, UK
- Biological Physics & Soft Matter Research Group, Department of Physics, King's College London, London, UK
| | - Carlton O Adams
- Department of Microbiology and Immunology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Theo Portlock
- Centre for Host-Microbiome Interactions, Faculty of Dental, Oral & Craniofacial Sciences, King's College London, London, UK
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, UK
| | - Katherine Richardson
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, UK
| | - Rosie Shaw
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, UK
| | - Alain Oregioni
- The Medical Research Council Biomedical NMR Centre, The Francis Crick Institute, 1 Midland Road, London, NW1 1AT, UK
| | - Giulia Mastroianni
- Department of Chemistry, School of Physical and Chemical Sciences, Queen Mary University of London, London, UK
| | - Sara B-M Whittaker
- School of Cancer Sciences, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Geoff Kelly
- The Medical Research Council Biomedical NMR Centre, The Francis Crick Institute, 1 Midland Road, London, NW1 1AT, UK
| | - Christian D Lorenz
- Biological Physics & Soft Matter Research Group, Department of Physics, King's College London, London, UK
| | - Arianna Fornili
- Department of Chemistry, School of Physical and Chemical Sciences, Queen Mary University of London, London, UK.
| | - Nicholas P Cianciotto
- Department of Microbiology and Immunology, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.
| | - James A Garnett
- Centre for Host-Microbiome Interactions, Faculty of Dental, Oral & Craniofacial Sciences, King's College London, London, UK.
| |
Collapse
|
23
|
Ding X, Chen X, Sullivan EE, Shay TF, Gradinaru V. Fast, accurate ranking of engineered proteins by target-binding propensity using structure modeling. Mol Ther 2024; 32:1687-1700. [PMID: 38582966 PMCID: PMC11184338 DOI: 10.1016/j.ymthe.2024.04.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 02/08/2024] [Accepted: 04/03/2024] [Indexed: 04/08/2024] Open
Abstract
Deep-learning-based methods for protein structure prediction have achieved unprecedented accuracy, yet their utility in the engineering of protein-based binders remains constrained due to a gap between the ability to predict the structures of candidate proteins and the ability toprioritize proteins by their potential to bind to a target. To bridge this gap, we introduce Automated Pairwise Peptide-Receptor Analysis for Screening Engineered proteins (APPRAISE), a method for predicting the target-binding propensity of engineered proteins. After generating structural models of engineered proteins competing for binding to a target using an established structure prediction tool such as AlphaFold-Multimer or ESMFold, APPRAISE performs a rapid (under 1 CPU second per model) scoring analysis that takes into account biophysical and geometrical constraints. As proof-of-concept cases, we demonstrate that APPRAISE can accurately classify receptor-dependent vs. receptor-independent adeno-associated viral vectors and diverse classes of engineered proteins such as miniproteins targeting the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike, nanobodies targeting a G-protein-coupled receptor, and peptides that specifically bind to transferrin receptor or programmed death-ligand 1 (PD-L1). APPRAISE is accessible through a web-based notebook interface using Google Colaboratory (https://tiny.cc/APPRAISE). With its accuracy, interpretability, and generalizability, APPRAISE promises to expand the utility of protein structure prediction and accelerate protein engineering for biomedical applications.
Collapse
Affiliation(s)
- Xiaozhe Ding
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California, Boulevard, Pasadena, CA 91125, USA.
| | - Xinhong Chen
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California, Boulevard, Pasadena, CA 91125, USA
| | - Erin E Sullivan
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California, Boulevard, Pasadena, CA 91125, USA
| | - Timothy F Shay
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California, Boulevard, Pasadena, CA 91125, USA
| | - Viviana Gradinaru
- Division of Biology and Biological Engineering, California Institute of Technology, 1200 E California, Boulevard, Pasadena, CA 91125, USA.
| |
Collapse
|
24
|
von Kügelgen A, Cassidy CK, van Dorst S, Pagani LL, Batters C, Ford Z, Löwe J, Alva V, Stansfeld PJ, Bharat TAM. Membraneless channels sieve cations in ammonia-oxidizing marine archaea. Nature 2024; 630:230-236. [PMID: 38811725 PMCID: PMC11153153 DOI: 10.1038/s41586-024-07462-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 04/24/2024] [Indexed: 05/31/2024]
Abstract
Nitrosopumilus maritimus is an ammonia-oxidizing archaeon that is crucial to the global nitrogen cycle1,2. A critical step for nitrogen oxidation is the entrapment of ammonium ions from a dilute marine environment at the cell surface and their subsequent channelling to the cell membrane of N. maritimus. Here we elucidate the structure of the molecular machinery responsible for this process, comprising the surface layer (S-layer), using electron cryotomography and subtomogram averaging from cells. We supplemented our in situ structure of the ammonium-binding S-layer array with a single-particle electron cryomicroscopy structure, revealing detailed features of this immunoglobulin-rich and glycan-decorated S-layer. Biochemical analyses showed strong ammonium binding by the cell surface, which was lost after S-layer disassembly. Sensitive bioinformatic analyses identified similar S-layers in many ammonia-oxidizing archaea, with conserved sequence and structural characteristics. Moreover, molecular simulations and structure determination of ammonium-enriched specimens enabled us to examine the cation-binding properties of the S-layer, revealing how it concentrates ammonium ions on its cell-facing side, effectively acting as a multichannel sieve on the cell membrane. This in situ structural study illuminates the biogeochemically essential process of ammonium binding and channelling, common to many marine microorganisms that are fundamental to the nitrogen cycle.
Collapse
Affiliation(s)
- Andriko von Kügelgen
- Structural Studies Division, MRC Laboratory of Molecular Biology, Cambridge, UK
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
| | - C Keith Cassidy
- Department of Physics and Astronomy, University of Missouri-Columbia, Columbia, MO, USA
| | - Sofie van Dorst
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
| | - Lennart L Pagani
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
| | - Christopher Batters
- Protein and Nucleic Acid Chemistry Division, MRC Laboratory of Molecular Biology, Cambridge, UK
| | - Zephyr Ford
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
| | - Jan Löwe
- Structural Studies Division, MRC Laboratory of Molecular Biology, Cambridge, UK
| | - Vikram Alva
- Department of Protein Evolution, Max Planck Institute for Biology Tübingen, Tübingen, Germany
| | - Phillip J Stansfeld
- School of Life Sciences and Department of Chemistry, University of Warwick, Coventry, UK
| | - Tanmay A M Bharat
- Structural Studies Division, MRC Laboratory of Molecular Biology, Cambridge, UK.
| |
Collapse
|
25
|
Hamamsy T, Morton JT, Blackwell R, Berenberg D, Carriero N, Gligorijevic V, Strauss CEM, Leman JK, Cho K, Bonneau R. Protein remote homology detection and structural alignment using deep learning. Nat Biotechnol 2024; 42:975-985. [PMID: 37679542 PMCID: PMC11180608 DOI: 10.1038/s41587-023-01917-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Accepted: 07/26/2023] [Indexed: 09/09/2023]
Abstract
Exploiting sequence-structure-function relationships in biotechnology requires improved methods for aligning proteins that have low sequence similarity to previously annotated proteins. We develop two deep learning methods to address this gap, TM-Vec and DeepBLAST. TM-Vec allows searching for structure-structure similarities in large sequence databases. It is trained to accurately predict TM-scores as a metric of structural similarity directly from sequence pairs without the need for intermediate computation or solution of structures. Once structurally similar proteins have been identified, DeepBLAST can structurally align proteins using only sequence information by identifying structurally homologous regions between proteins. It outperforms traditional sequence alignment methods and performs similarly to structure-based alignment methods. We show the merits of TM-Vec and DeepBLAST on a variety of datasets, including better identification of remotely homologous proteins compared with state-of-the-art sequence alignment and structure prediction methods.
Collapse
Grants
- R35GM122515 National Science Foundation (NSF)
- IOS-1546218 National Science Foundation (NSF)
- R35 GM122515 NIGMS NIH HHS
- R01 DK103358 NIDDK NIH HHS
- CBET- 1728858 National Science Foundation (NSF)
- R01 AI130945 NIAID NIH HHS
- This research was supported by NIH R01DK103358, the Simons Foundation, NSF- IOS-1546218, R35GM122515, NSF CBET- 1728858, NIH R01AI130945, to T.H. This research was supported by the intramural research program of the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) to J.T.M. This research was supported by the Flatiron Institute as part of the Simons Foundation to Robert Blackwell, J.K.L., and N.C. This research was supported by Los Alamos National Lab to C.S. This research was supported by the Samsung Advanced Institute of Technology (Next Generation Deep Learning: from pattern recognition to AI), Samsung Research (Improving Deep Learning using Latent Structure), and NSF Award 1922658 to K.C.
- Simons Foundation
- U.S. Department of Health & Human Services | NIH | Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD)
Collapse
Affiliation(s)
- Tymor Hamamsy
- Center for Data Science, New York University, New York, NY, USA
| | - James T Morton
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
- Biostatistics and Bioinformatics Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD, USA
| | - Robert Blackwell
- Scientific Computing Core, Flatiron Institute, Simons Foundation, New York, NY, USA
| | - Daniel Berenberg
- Department of Computer Science, Courant Institute of Mathematical Sciences, New York University, New York, NY, USA
- Prescient Design, New York, NY, USA
| | - Nicholas Carriero
- Scientific Computing Core, Flatiron Institute, Simons Foundation, New York, NY, USA
| | | | | | - Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
| | - Kyunghyun Cho
- Center for Data Science, New York University, New York, NY, USA.
- Department of Computer Science, Courant Institute of Mathematical Sciences, New York University, New York, NY, USA.
- Prescient Design, New York, NY, USA.
- CIFAR, Toronto, Ontario, Canada.
| | - Richard Bonneau
- Center for Data Science, New York University, New York, NY, USA.
- Department of Computer Science, Courant Institute of Mathematical Sciences, New York University, New York, NY, USA.
- Prescient Design, New York, NY, USA.
- Department of Biology, New York University, New York, NY, USA.
| |
Collapse
|
26
|
Xia Y, Pan X, Shen HB. A comprehensive survey on protein-ligand binding site prediction. Curr Opin Struct Biol 2024; 86:102793. [PMID: 38447285 DOI: 10.1016/j.sbi.2024.102793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 02/18/2024] [Accepted: 02/18/2024] [Indexed: 03/08/2024]
Abstract
Protein-ligand binding site prediction is critical for protein function annotation and drug discovery. Biological experiments are time-consuming and require significant equipment, materials, and labor resources. Developing accurate and efficient computational methods for protein-ligand interaction prediction is essential. Here, we summarize the key challenges associated with ligand binding site (LBS) prediction and introduce recently published methods from their input features, computational algorithms, and ligand types. Furthermore, we investigate the specificity of allosteric site identification as a particular LBS type. Finally, we discuss the prospective directions for machine learning-based LBS prediction in the near future.
Collapse
Affiliation(s)
- Ying Xia
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China
| | - Xiaoyong Pan
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China.
| | - Hong-Bin Shen
- Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China.
| |
Collapse
|
27
|
Gaschignard G, Millet M, Bruley A, Benzerara K, Dezi M, Skouri-Panet F, Duprat E, Callebaut I. AlphaFold2-guided description of CoBaHMA, a novel family of bacterial domains within the heavy-metal-associated superfamily. Proteins 2024; 92:776-794. [PMID: 38258321 DOI: 10.1002/prot.26668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 12/22/2023] [Accepted: 01/01/2024] [Indexed: 01/24/2024]
Abstract
Three-dimensional (3D) structure information, now available at the proteome scale, may facilitate the detection of remote evolutionary relationships in protein superfamilies. Here, we illustrate this with the identification of a novel family of protein domains related to the ferredoxin-like superfold, by combining (i) transitive sequence similarity searches, (ii) clustering approaches, and (iii) the use of AlphaFold2 3D structure models. Domains of this family were initially identified in relation with the intracellular biomineralization of calcium carbonates by Cyanobacteria. They are part of the large heavy-metal-associated (HMA) superfamily, departing from the latter by specific sequence and structural features. In particular, most of them share conserved basic amino acids (hence their name CoBaHMA for Conserved Basic residues HMA), forming a positively charged surface, which is likely to interact with anionic partners. CoBaHMA domains are found in diverse modular organizations in bacteria, existing in the form of monodomain proteins or as part of larger proteins, some of which are membrane proteins involved in transport or lipid metabolism. This suggests that the CoBaHMA domains may exert a regulatory function, involving interactions with anionic lipids. This hypothesis might have a particular resonance in the context of the compartmentalization observed for cyanobacterial intracellular calcium carbonates.
Collapse
Affiliation(s)
- Geoffroy Gaschignard
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Maxime Millet
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Apolline Bruley
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Karim Benzerara
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Manuela Dezi
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Feriel Skouri-Panet
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Elodie Duprat
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Isabelle Callebaut
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| |
Collapse
|
28
|
Medvedev KE, Zhang J, Schaeffer RD, Kinch LN, Cong Q, Grishin NV. Structure classification of the proteins from Salmonella enterica pangenome revealed novel potential pathogenicity islands. Sci Rep 2024; 14:12260. [PMID: 38806511 PMCID: PMC11133325 DOI: 10.1038/s41598-024-60991-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Accepted: 04/30/2024] [Indexed: 05/30/2024] Open
Abstract
Salmonella enterica is a pathogenic bacterium known for causing severe typhoid fever in humans, making it important to study due to its potential health risks and significant impact on public health. This study provides evolutionary classification of proteins from Salmonella enterica pangenome. We classified 17,238 domains from 13,147 proteins from 79,758 Salmonella enterica strains and studied in detail domains of 272 proteins from 14 characterized Salmonella pathogenicity islands (SPIs). Among SPIs-related proteins, 90 proteins function in the secretion machinery. 41% domains of SPI proteins have no previous sequence annotation. By comparing clinical and environmental isolates, we identified 3682 proteins that are overrepresented in clinical group that we consider as potentially pathogenic. Among domains of potentially pathogenic proteins only 50% domains were annotated by sequence methods previously. Moreover, 36% (1330 out of 3682) of potentially pathogenic proteins cannot be classified into Evolutionary Classification of Protein Domains database (ECOD). Among classified domains of potentially pathogenic proteins the most populated homology groups include helix-turn-helix (HTH), Immunoglobulin-related, and P-loop domains-related. Functional analysis revealed overrepresentation of these protein in biological processes related to viral entry into host cell, antibiotic biosynthesis, DNA metabolism and conformation change, and underrepresentation in translational processes. Analysis of the potentially pathogenic proteins indicates that they form 119 clusters or novel potential pathogenicity islands (NPPIs) within the Salmonella genome, suggesting their potential contribution to the bacterium's virulence. One of the NPPIs revealed significant overrepresentation of potentially pathogenic proteins. Overall, our analysis revealed that identified potentially pathogenic proteins are poorly studied.
Collapse
Affiliation(s)
- Kirill E Medvedev
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA.
| | - Jing Zhang
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| | - R Dustin Schaeffer
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| | - Lisa N Kinch
- Department of Molecular Biology, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| | - Qian Cong
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| | - Nick V Grishin
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| |
Collapse
|
29
|
Bryant P, Kelkar A, Guljas A, Clementi C, Noé F. Structure prediction of protein-ligand complexes from sequence information with Umol. Nat Commun 2024; 15:4536. [PMID: 38806453 PMCID: PMC11133481 DOI: 10.1038/s41467-024-48837-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 05/15/2024] [Indexed: 05/30/2024] Open
Abstract
Protein-ligand docking is an established tool in drug discovery and development to narrow down potential therapeutics for experimental testing. However, a high-quality protein structure is required and often the protein is treated as fully or partially rigid. Here we develop an AI system that can predict the fully flexible all-atom structure of protein-ligand complexes directly from sequence information. We find that classical docking methods are still superior, but depend upon having crystal structures of the target protein. In addition to predicting flexible all-atom structures, predicted confidence metrics (plDDT) can be used to select accurate predictions as well as to distinguish between strong and weak binders. The advances presented here suggest that the goal of AI-based drug discovery is one step closer, but there is still a way to go to grasp the complexity of protein-ligand interactions fully. Umol is available at: https://github.com/patrickbryant1/Umol .
Collapse
Affiliation(s)
- Patrick Bryant
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany.
- The Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Svante Arrhenius väg 20C, 114 18, Stockholm, Sweden.
- Science for Life Laboratory, 172 21, Solna, Sweden.
| | - Atharva Kelkar
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany
| | - Andrea Guljas
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany
| | - Cecilia Clementi
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany
| | - Frank Noé
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany
- Department of Physics, Freie Universität Berlin, Arnimallee 12, 14195, Berlin, Germany
- Microsoft Research AI4Science, Karl-Liebknecht Str. 32, 10178, Berlin, Germany
| |
Collapse
|
30
|
Gao Y, Zhong Z, Zhang D, Zhang J, Li YX. Exploring the roles of ribosomal peptides in prokaryote-phage interactions through deep learning-enabled metagenome mining. MICROBIOME 2024; 12:94. [PMID: 38790030 PMCID: PMC11118758 DOI: 10.1186/s40168-024-01807-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/11/2023] [Accepted: 04/04/2024] [Indexed: 05/26/2024]
Abstract
BACKGROUND Microbial secondary metabolites play a crucial role in the intricate interactions within the natural environment. Among these metabolites, ribosomally synthesized and post-translationally modified peptides (RiPPs) are becoming a promising source of therapeutic agents due to their structural diversity and functional versatility. However, their biosynthetic capacity and ecological functions remain largely underexplored. RESULTS Here, we aim to explore the biosynthetic profile of RiPPs and their potential roles in the interactions between microbes and viruses in the ocean, which encompasses a vast diversity of unique biomes that are rich in interactions and remains chemically underexplored. We first developed TrRiPP to identify RiPPs from ocean metagenomes, a deep learning method that detects RiPP precursors in a hallmark gene-independent manner to overcome the limitations of classic methods in processing highly fragmented metagenomic data. Applying this method to metagenomes from the global ocean microbiome, we uncover a diverse array of previously uncharacterized putative RiPP families with great novelty and diversity. Through correlation analysis based on metatranscriptomic data, we observed a high prevalence of antiphage defense-related and phage-related protein families that were co-expressed with RiPP families. Based on this putative association between RiPPs and phage infection, we constructed an Ocean Virus Database (OVD) and established a RiPP-involving host-phage interaction network through host prediction and co-expression analysis, revealing complex connectivities linking RiPP-encoding prokaryotes, RiPP families, viral protein families, and phages. These findings highlight the potential of RiPP families involved in prokaryote-phage interactions and coevolution, providing insights into their ecological functions in the ocean microbiome. CONCLUSIONS This study provides a systematic investigation of the biosynthetic potential of RiPPs from the ocean microbiome at a global scale, shedding light on the essential insights into the ecological functions of RiPPs in prokaryote-phage interactions through the integration of deep learning approaches, metatranscriptomic data, and host-phage connectivity. This study serves as a valuable example of exploring the ecological functions of bacterial secondary metabolites, particularly their associations with unexplored microbial interactions. Video Abstract.
Collapse
Affiliation(s)
- Ying Gao
- CYM305, Department of Chemistry and The Swire Institute of Marine Science, The University of Hong Kong, Pokfulam Road, Hong Kong Special Administrative Region, 999077, China
| | - Zheng Zhong
- CYM305, Department of Chemistry and The Swire Institute of Marine Science, The University of Hong Kong, Pokfulam Road, Hong Kong Special Administrative Region, 999077, China
| | - Dengwei Zhang
- CYM305, Department of Chemistry and The Swire Institute of Marine Science, The University of Hong Kong, Pokfulam Road, Hong Kong Special Administrative Region, 999077, China
| | - Jian Zhang
- CYM305, Department of Chemistry and The Swire Institute of Marine Science, The University of Hong Kong, Pokfulam Road, Hong Kong Special Administrative Region, 999077, China
| | - Yong-Xin Li
- CYM305, Department of Chemistry and The Swire Institute of Marine Science, The University of Hong Kong, Pokfulam Road, Hong Kong Special Administrative Region, 999077, China.
| |
Collapse
|
31
|
Yu R, Huang Z, Lam TYC, Sun Y. Utilizing profile hidden Markov model databases for discovering viruses from metagenomic data: a comprehensive review. Brief Bioinform 2024; 25:bbae292. [PMID: 39003531 PMCID: PMC11246558 DOI: 10.1093/bib/bbae292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2024] [Revised: 05/19/2024] [Accepted: 06/04/2024] [Indexed: 07/15/2024] Open
Abstract
Profile hidden Markov models (pHMMs) are able to achieve high sensitivity in remote homology search, making them popular choices for detecting novel or highly diverged viruses in metagenomic data. However, many existing pHMM databases have different design focuses, making it difficult for users to decide the proper one to use. In this review, we provide a thorough evaluation and comparison for multiple commonly used profile HMM databases for viral sequence discovery in metagenomic data. We characterized the databases by comparing their sizes, their taxonomic coverage, and the properties of their models using quantitative metrics. Subsequently, we assessed their performance in virus identification across multiple application scenarios, utilizing both simulated and real metagenomic data. We aim to offer researchers a thorough and critical assessment of the strengths and limitations of different databases. Furthermore, based on the experimental results obtained from the simulated and real metagenomic data, we provided practical suggestions for users to optimize their use of pHMM databases, thus enhancing the quality and reliability of their findings in the field of viral metagenomics.
Collapse
Affiliation(s)
- Runzhou Yu
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China
| | - Ziyi Huang
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China
| | - Theo Y C Lam
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China
| |
Collapse
|
32
|
Bai H, Lewitus E, Li Y, Thomas PV, Zemil M, Merbah M, Peterson CE, Thuraisamy T, Rees PA, Hajduczki A, Dussupt V, Slike B, Mendez-Rivera L, Schmid A, Kavusak E, Rao M, Smith G, Frey J, Sims A, Wieczorek L, Polonis V, Krebs SJ, Ake JA, Vasan S, Bolton DL, Joyce MG, Townsley S, Rolland M. Contemporary HIV-1 consensus Env with AI-assisted redesigned hypervariable loops promote antibody binding. Nat Commun 2024; 15:3924. [PMID: 38724518 PMCID: PMC11082178 DOI: 10.1038/s41467-024-48139-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 04/19/2024] [Indexed: 05/12/2024] Open
Abstract
An effective HIV-1 vaccine must elicit broadly neutralizing antibodies (bnAbs) against highly diverse Envelope glycoproteins (Env). Since Env with the longest hypervariable (HV) loops is more resistant to the cognate bnAbs than Env with shorter HV loops, we redesigned hypervariable loops for updated Env consensus sequences of subtypes B and C and CRF01_AE. Using modeling with AlphaFold2, we reduced the length of V1, V2, and V5 HV loops while maintaining the integrity of the Env structure and glycan shield, and modified the V4 HV loop. Spacers are designed to limit strain-specific targeting. All updated Env are infectious as pseudoviruses. Preliminary structural characterization suggests that the modified HV loops have a limited impact on Env's conformation. Binding assays show improved binding to modified subtype B and CRF01_AE Env but not to subtype C Env. Neutralization assays show increases in sensitivity to bnAbs, although not always consistently across clades. Strikingly, the HV loop modification renders the resistant CRF01_AE Env sensitive to 10-1074 despite the absence of a glycan at N332.
Collapse
Affiliation(s)
- Hongjun Bai
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Eric Lewitus
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Yifan Li
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Paul V Thomas
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
- Emerging Infectious Disease Branch, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Michelle Zemil
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Mélanie Merbah
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Caroline E Peterson
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
- Emerging Infectious Disease Branch, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Thujitha Thuraisamy
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Phyllis A Rees
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
- Emerging Infectious Disease Branch, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Agnes Hajduczki
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
- Emerging Infectious Disease Branch, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Vincent Dussupt
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Bonnie Slike
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Letzibeth Mendez-Rivera
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Annika Schmid
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Erin Kavusak
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Mekhala Rao
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Gabriel Smith
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Jessica Frey
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Alicea Sims
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Lindsay Wieczorek
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Victoria Polonis
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Shelly J Krebs
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Julie A Ake
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Sandhya Vasan
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Diane L Bolton
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - M Gordon Joyce
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
- Emerging Infectious Disease Branch, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
| | - Samantha Townsley
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA
| | - Morgane Rolland
- U.S. Military HIV Research Program, Walter Reed Army Institute of Research, Silver Spring, MD, 20910, USA.
- Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, MD, 20817, USA.
| |
Collapse
|
33
|
Tominaga K, Ozaki S, Sato S, Katayama T, Nishimura Y, Omae K, Iwasaki W. Frequent nonhomologous replacement of replicative helicase loaders by viruses in Vibrionaceae. Proc Natl Acad Sci U S A 2024; 121:e2317954121. [PMID: 38683976 PMCID: PMC11087808 DOI: 10.1073/pnas.2317954121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 03/14/2024] [Indexed: 05/02/2024] Open
Abstract
Several microbial genomes lack textbook-defined essential genes. If an essential gene is absent from a genome, then an evolutionarily independent gene of unknown function complements its function. Here, we identified frequent nonhomologous replacement of an essential component of DNA replication initiation, a replicative helicase loader gene, in Vibrionaceae. Our analysis of Vibrionaceae genomes revealed two genes with unknown function, named vdhL1 and vdhL2, that were substantially enriched in genomes without the known helicase-loader genes. These genes showed no sequence similarities to genes with known function but encoded proteins structurally similar with a viral helicase loader. Analyses of genomic syntenies and coevolution with helicase genes suggested that vdhL1/2 encodes a helicase loader. The in vitro assay showed that Vibrio harveyi VdhL1 and Vibrio ezurae VdhL2 promote the helicase activity of DnaB. Furthermore, molecular phylogenetics suggested that vdhL1/2 were derived from phages and replaced an intrinsic helicase loader gene of Vibrionaceae over 20 times. This high replacement frequency implies the host's advantage in acquiring a viral helicase loader gene.
Collapse
Affiliation(s)
- Kento Tominaga
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba277-0882, Japan
| | - Shogo Ozaki
- Department of Molecular Biology, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka812-8582, Japan
| | - Shohei Sato
- Department of Molecular Biology, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka812-8582, Japan
| | - Tsutomu Katayama
- Department of Molecular Biology, Graduate School of Pharmaceutical Sciences, Kyushu University, Fukuoka812-8582, Japan
| | - Yuki Nishimura
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba277-0882, Japan
| | - Kimiho Omae
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba277-0882, Japan
| | - Wataru Iwasaki
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba277-0882, Japan
- Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo113-0032, Japan
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba277-0882, Japan
- Atmosphere and Ocean Research Institute, The University of Tokyo, Chiba277-8564, Japan
- Institute for Quantitative Biosciences, The University of Tokyo, Tokyo113-0032, Japan
- Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Tokyo113-8657, Japan
| |
Collapse
|
34
|
Wallner ES, Mair A, Handler D, McWhite C, Xu SL, Dolan L, Bergmann DC. Spatially resolved proteomics of the Arabidopsis stomatal lineage identifies polarity complexes for cell divisions and stomatal pores. Dev Cell 2024; 59:1096-1109.e5. [PMID: 38518768 DOI: 10.1016/j.devcel.2024.03.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 02/23/2024] [Accepted: 03/05/2024] [Indexed: 03/24/2024]
Abstract
Cell polarity is used to guide asymmetric divisions and create morphologically diverse cells. We find that two oppositely oriented cortical polarity domains present during the asymmetric divisions in the Arabidopsis stomatal lineage are reconfigured into polar domains marking ventral (pore-forming) and outward-facing domains of maturing stomatal guard cells. Proteins that define these opposing polarity domains were used as baits in miniTurboID-based proximity labeling. Among differentially enriched proteins, we find kinases, putative microtubule-interacting proteins, and polar SOSEKIs with their effector ANGUSTIFOLIA. Using AI-facilitated protein structure prediction models, we identify potential protein-protein interaction interfaces among them. Functional and localization analyses of the polarity protein OPL2 and its putative interaction partners suggest a positive interaction with mitotic microtubules and a role in cytokinesis. This combination of proteomics and structural modeling with live-cell imaging provides insights into how polarity is rewired in different cell types and cell-cycle stages.
Collapse
Affiliation(s)
- Eva-Sophie Wallner
- Department of Biology, Stanford University, Stanford, CA 94305-5020, USA; Gregor Mendel Institute, Dr. Bohr-Gasse 3, 1030 Wien, Austria; Howard Hughes Medical Institute, Stanford, CA 94305, USA.
| | - Andrea Mair
- Howard Hughes Medical Institute, Stanford, CA 94305, USA
| | | | - Claire McWhite
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Shou-Ling Xu
- Carnegie Institution for Science, Stanford, CA 94305, USA; Carnegie Mass Spectrometry Facility, Carnegie Institution for Science, Stanford, CA 94305, USA
| | - Liam Dolan
- Gregor Mendel Institute, Dr. Bohr-Gasse 3, 1030 Wien, Austria
| | - Dominique C Bergmann
- Department of Biology, Stanford University, Stanford, CA 94305-5020, USA; Howard Hughes Medical Institute, Stanford, CA 94305, USA.
| |
Collapse
|
35
|
Dubey A, Baxter M, Hendargo KJ, Medrano-Soto A, Saier MH. The Pentameric Ligand-Gated Ion Channel Family: A New Member of the Voltage Gated Ion Channel Superfamily? Int J Mol Sci 2024; 25:5005. [PMID: 38732224 PMCID: PMC11084639 DOI: 10.3390/ijms25095005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2024] [Revised: 04/28/2024] [Accepted: 04/29/2024] [Indexed: 05/13/2024] Open
Abstract
In this report we present seven lines of bioinformatic evidence supporting the conclusion that the Pentameric Ligand-gated Ion Channel (pLIC) Family is a member of the Voltage-gated Ion Channel (VIC) Superfamily. In our approach, we used the Transporter Classification Database (TCDB) as a reference and applied a series of bioinformatic methods to search for similarities between the pLIC family and members of the VIC superfamily. These include: (1) sequence similarity, (2) compatibility of topology and hydropathy profiles, (3) shared domains, (4) conserved motifs, (5) similarity of Hidden Markov Model profiles between families, (6) common 3D structural folds, and (7) clustering analysis of all families. Furthermore, sequence and structural comparisons as well as the identification of a 3-TMS repeat unit in the VIC superfamily suggests that the sixth transmembrane segment evolved into a re-entrant loop. This evidence suggests that the voltage-sensor domain and the channel domain have a common origin. The classification of the pLIC family within the VIC superfamily sheds light onto the topological origins of this family and its evolution, which will facilitate experimental verification and further research into this superfamily by the scientific community.
Collapse
Affiliation(s)
| | | | | | - Arturo Medrano-Soto
- Department of Molecular Biology, School of Biological Sciences, University of California San Diego, La Jolla, CA 92093-0116, USA; (A.D.); (M.B.); (K.J.H.)
| | - Milton H. Saier
- Department of Molecular Biology, School of Biological Sciences, University of California San Diego, La Jolla, CA 92093-0116, USA; (A.D.); (M.B.); (K.J.H.)
| |
Collapse
|
36
|
Lee S, Kim G, Karin EL, Mirdita M, Park S, Chikhi R, Babaian A, Kryshtafovych A, Steinegger M. Petabase-Scale Homology Search for Structure Prediction. Cold Spring Harb Perspect Biol 2024; 16:a041465. [PMID: 38316555 PMCID: PMC11065157 DOI: 10.1101/cshperspect.a041465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2024]
Abstract
The recent CASP15 competition highlighted the critical role of multiple sequence alignments (MSAs) in protein structure prediction, as demonstrated by the success of the top AlphaFold2-based prediction methods. To push the boundaries of MSA utilization, we conducted a petabase-scale search of the Sequence Read Archive (SRA), resulting in gigabytes of aligned homologs for CASP15 targets. These were merged with default MSAs produced by ColabFold-search and provided to ColabFold-predict. By using SRA data, we achieved highly accurate predictions (GDT_TS > 70) for 66% of the non-easy targets, whereas using ColabFold-search default MSAs scored highly in only 52%. Next, we tested the effect of deep homology search and ColabFold's advanced features, such as more recycles, on prediction accuracy. While SRA homologs were most significant for improving ColabFold's CASP15 ranking from 11th to 3rd place, other strategies contributed too. We analyze these in the context of existing strategies to improve prediction.
Collapse
Affiliation(s)
- Sewon Lee
- School of Biological Sciences, Seoul National University, Gwanak-gu, Seoul 08826, South Korea
| | - Gyuri Kim
- School of Biological Sciences, Seoul National University, Gwanak-gu, Seoul 08826, South Korea
| | | | - Milot Mirdita
- School of Biological Sciences, Seoul National University, Gwanak-gu, Seoul 08826, South Korea
| | - Sukhwan Park
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, South Korea
| | - Rayan Chikhi
- Institut Pasteur, Université Paris Cité, G5 Sequence Bioinformatics, 75015 Paris, France
| | - Artem Babaian
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | | | - Martin Steinegger
- School of Biological Sciences, Seoul National University, Gwanak-gu, Seoul 08826, South Korea
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul 08826, South Korea
- Artificial Intelligence Institute, Seoul National University, Seoul 08826, South Korea
- Institute of Molecular Biology and Genetics, Seoul National University, Seoul 08826, South Korea
| |
Collapse
|
37
|
Wienhausen G, Moraru C, Bruns S, Tran DQ, Sultana S, Wilkes H, Dlugosch L, Azam F, Simon M. Ligand cross-feeding resolves bacterial vitamin B 12 auxotrophies. Nature 2024; 629:886-892. [PMID: 38720071 DOI: 10.1038/s41586-024-07396-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Accepted: 04/08/2024] [Indexed: 05/24/2024]
Abstract
Cobalamin (vitamin B12, herein referred to as B12) is an essential cofactor for most marine prokaryotes and eukaryotes1,2. Synthesized by a limited number of prokaryotes, its scarcity affects microbial interactions and community dynamics2-4. Here we show that two bacterial B12 auxotrophs can salvage different B12 building blocks and cooperate to synthesize B12. A Colwellia sp. synthesizes and releases the activated lower ligand α-ribazole, which is used by another B12 auxotroph, a Roseovarius sp., to produce the corrin ring and synthesize B12. Release of B12 by Roseovarius sp. happens only in co-culture with Colwellia sp. and only coincidently with the induction of a prophage encoded in Roseovarius sp. Subsequent growth of Colwellia sp. in these conditions may be due to the provision of B12 by lysed cells of Roseovarius sp. Further evidence is required to support a causative role for prophage induction in the release of B12. These complex microbial interactions of ligand cross-feeding and joint B12 biosynthesis seem to be widespread in marine pelagic ecosystems. In the western and northern tropical Atlantic Ocean, bacteria predicted to be capable of salvaging cobinamide and synthesizing only the activated lower ligand outnumber B12 producers. These findings add new players to our understanding of B12 supply to auxotrophic microorganisms in the ocean and possibly in other ecosystems.
Collapse
Affiliation(s)
- Gerrit Wienhausen
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany.
- Scripps Institution of Oceanography, Marine Biology Research Division, University of California San Diego, La Jolla, CA, USA.
| | - Cristina Moraru
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
- Environmental Metagenomics, Research Center One Health Ruhr of the University Alliance Ruhr, Faculty of Chemistry, University of Duisburg-Essen, Essen, Germany
| | - Stefan Bruns
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
| | - Den Quoc Tran
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
| | - Sabiha Sultana
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
| | - Heinz Wilkes
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
| | - Leon Dlugosch
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
| | - Farooq Azam
- Scripps Institution of Oceanography, Marine Biology Research Division, University of California San Diego, La Jolla, CA, USA
| | - Meinhard Simon
- Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany.
- Helmholtz Institute for Functional Marine Biodiversity at the University of Oldenburg (HIFMB), Oldenburg, Germany.
| |
Collapse
|
38
|
Moreno-Manuel AI, Macías Á, Cruz FM, Gutiérrez LK, Martínez F, González-Guerra A, Martínez Carrascoso I, Bermúdez-Jimenez FJ, Sánchez-Pérez P, Vera-Pedrosa ML, Ruiz-Robles JM, Bernal JA, Jalife J. The Kir2.1E299V mutation increases atrial fibrillation vulnerability while protecting the ventricles against arrhythmias in a mouse model of short QT syndrome type 3. Cardiovasc Res 2024; 120:490-505. [PMID: 38261726 PMCID: PMC11060485 DOI: 10.1093/cvr/cvae019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 10/24/2023] [Accepted: 12/12/2023] [Indexed: 01/25/2024] Open
Abstract
AIMS Short QT syndrome type 3 (SQTS3) is a rare arrhythmogenic disease caused by gain-of-function mutations in KCNJ2, the gene coding the inward rectifier potassium channel Kir2.1. We used a multidisciplinary approach and investigated arrhythmogenic mechanisms in an in-vivo model of de-novo mutation Kir2.1E299V identified in a patient presenting an extremely abbreviated QT interval and paroxysmal atrial fibrillation. METHODS AND RESULTS We used intravenous adeno-associated virus-mediated gene transfer to generate mouse models, and confirmed cardiac-specific expression of Kir2.1WT or Kir2.1E299V. On ECG, the Kir2.1E299V mouse recapitulated the QT interval shortening and the atrial-specific arrhythmia of the patient. The PR interval was also significantly shorter in Kir2.1E299V mice. Patch-clamping showed extremely abbreviated action potentials in both atrial and ventricular Kir2.1E299V cardiomyocytes due to a lack of inward-going rectification and increased IK1 at voltages positive to -80 mV. Relative to Kir2.1WT, atrial Kir2.1E299V cardiomyocytes had a significantly reduced slope conductance at voltages negative to -80 mV. After confirming a higher proportion of heterotetrameric Kir2.x channels containing Kir2.2 subunits in the atria, in-silico 3D simulations predicted an atrial-specific impairment of polyamine block and reduced pore diameter in the Kir2.1E299V-Kir2.2WT channel. In ventricular cardiomyocytes, the mutation increased excitability by shifting INa activation and inactivation in the hyperpolarizing direction, which protected the ventricle against arrhythmia. Moreover, Purkinje myocytes from Kir2.1E299V mice manifested substantially higher INa density than Kir2.1WT, explaining the abbreviation in the PR interval. CONCLUSION The first in-vivo mouse model of cardiac-specific SQTS3 recapitulates the electrophysiological phenotype of a patient with the Kir2.1E299V mutation. Kir2.1E299V eliminates rectification in both cardiac chambers but protects against ventricular arrhythmias by increasing excitability in both Purkinje-fiber network and ventricles. Consequently, the predominant arrhythmias are supraventricular likely due to the lack of inward rectification and atrial-specific reduced pore diameter of the Kir2.1E299V-Kir2.2WT heterotetramer.
Collapse
MESH Headings
- Animals
- Humans
- Mice
- Action Potentials
- Arrhythmias, Cardiac/genetics
- Arrhythmias, Cardiac/physiopathology
- Arrhythmias, Cardiac/metabolism
- Atrial Fibrillation/genetics
- Atrial Fibrillation/physiopathology
- Atrial Fibrillation/metabolism
- Disease Models, Animal
- Genetic Predisposition to Disease
- Heart Rate/genetics
- Heart Ventricles/metabolism
- Heart Ventricles/physiopathology
- Mice, Inbred C57BL
- Mice, Transgenic
- Mutation
- Myocytes, Cardiac/metabolism
- Myocytes, Cardiac/pathology
- Phenotype
- Potassium Channels, Inwardly Rectifying/genetics
- Potassium Channels, Inwardly Rectifying/metabolism
Collapse
Affiliation(s)
- Ana I Moreno-Manuel
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Álvaro Macías
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Francisco M Cruz
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Lilian K Gutiérrez
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Fernando Martínez
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
- CIBER de Enfermedades Cardiovasculares (CIBERCV), Madrid, Spain
| | - Andrés González-Guerra
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Isabel Martínez Carrascoso
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Francisco José Bermúdez-Jimenez
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
- Department of Cardiology, Hospital Universitario Virgen de las Nieves, 18014 Granada, Spain
| | - Patricia Sánchez-Pérez
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | | | - Juan Manuel Ruiz-Robles
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
| | - Juan A Bernal
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
- CIBER de Enfermedades Cardiovasculares (CIBERCV), Madrid, Spain
| | - José Jalife
- Centro Nacional de Investigaciones Cardiovasculares (CNIC), Melchor Fernández Almagro 3, 28029 Madrid, Spain
- CIBER de Enfermedades Cardiovasculares (CIBERCV), Madrid, Spain
- Departments of Internal Medicine and Molecular and Integrative Physiology, Center for Arrhythmia Research, University of Michigan, Ann Arbor, MI 4810, USA
| |
Collapse
|
39
|
Kshirsagar M, Meller A, Humphreys I, Sledzieski S, Xu Y, Dodhia R, Horvitz E, Berger B, Bowman G, Ferres JL, Baker D, Baek M. Rapid and accurate prediction of protein homo-oligomer symmetry with Seq2Symm. RESEARCH SQUARE 2024:rs.3.rs-4215086. [PMID: 38746169 PMCID: PMC11092833 DOI: 10.21203/rs.3.rs-4215086/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
The majority of proteins must form higher-order assemblies to perform their biological functions. Despite the importance of protein quaternary structure, there are few machine learning models that can accurately and rapidly predict the symmetry of assemblies involving multiple copies of the same protein chain. Here, we address this gap by training several classes of protein foundation models, including ESM-MSA, ESM2, and RoseTTAFold2, to predict homo-oligomer symmetry. Our best model named Seq2Symm, which utilizes ESM2, outperforms existing template-based and deep learning methods. It achieves an average PR-AUC of 0.48 and 0.44 across homo-oligomer symmetries on two different held-out test sets compared to 0.32 and 0.23 for the template-based method. Because Seq2Symm can rapidly predict homo-oligomer symmetries using a single sequence as input (~ 80,000 proteins/hour), we have applied it to 5 entire proteomes and ~ 3.5 million unlabeled protein sequences to identify patterns in protein assembly complexity across biological kingdoms and species.
Collapse
Affiliation(s)
| | | | | | | | - Yixi Xu
- Microsoft AI for Good research lab
| | | | | | | | | | | | | | | |
Collapse
|
40
|
Krishna R, Wang J, Ahern W, Sturmfels P, Venkatesh P, Kalvet I, Lee GR, Morey-Burrows FS, Anishchenko I, Humphreys IR, McHugh R, Vafeados D, Li X, Sutherland GA, Hitchcock A, Hunter CN, Kang A, Brackenbrough E, Bera AK, Baek M, DiMaio F, Baker D. Generalized biomolecular modeling and design with RoseTTAFold All-Atom. Science 2024; 384:eadl2528. [PMID: 38452047 DOI: 10.1126/science.adl2528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 02/27/2024] [Indexed: 03/09/2024]
Abstract
Deep-learning methods have revolutionized protein structure prediction and design but are presently limited to protein-only systems. We describe RoseTTAFold All-Atom (RFAA), which combines a residue-based representation of amino acids and DNA bases with an atomic representation of all other groups to model assemblies that contain proteins, nucleic acids, small molecules, metals, and covalent modifications, given their sequences and chemical structures. By fine-tuning on denoising tasks, we developed RFdiffusion All-Atom (RFdiffusionAA), which builds protein structures around small molecules. Starting from random distributions of amino acid residues surrounding target small molecules, we designed and experimentally validated, through crystallography and binding measurements, proteins that bind the cardiac disease therapeutic digoxigenin, the enzymatic cofactor heme, and the light-harvesting molecule bilin.
Collapse
Affiliation(s)
- Rohith Krishna
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Jue Wang
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Woody Ahern
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA 98105, USA
| | - Pascal Sturmfels
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA 98105, USA
| | - Preetham Venkatesh
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Graduate Program in Biological Physics, Structure and Design, University of Washington, Seattle, WA 98105, USA
| | - Indrek Kalvet
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98105, USA
| | - Gyu Rie Lee
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98105, USA
| | | | - Ivan Anishchenko
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Ian R Humphreys
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Ryan McHugh
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Graduate Program in Biological Physics, Structure and Design, University of Washington, Seattle, WA 98105, USA
| | - Dionne Vafeados
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Xinting Li
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | | | - Andrew Hitchcock
- School of Biosciences, University of Sheffield, Sheffield S10 2TN, UK
| | - C Neil Hunter
- School of Biosciences, University of Sheffield, Sheffield S10 2TN, UK
| | - Alex Kang
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Evans Brackenbrough
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Asim K Bera
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - Minkyung Baek
- School of Biological Sciences, Seoul National University, Seoul 08826, Republic of Korea
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA 98105, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98105, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98105, USA
| |
Collapse
|
41
|
Ackmann J, Brüge A, Gotina L, Lim S, Jahreis K, Vollbrecht AL, Kim YK, Pae AN, Labus J, Ponimaskin E. Structural determinants for activation of the Tau kinase CDK5 by the serotonin receptor 5-HT7R. Cell Commun Signal 2024; 22:233. [PMID: 38641599 PMCID: PMC11031989 DOI: 10.1186/s12964-024-01612-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 04/11/2024] [Indexed: 04/21/2024] Open
Abstract
BACKGROUND Multiple neurodegenerative diseases are induced by the formation and deposition of protein aggregates. In particular, the microtubule-associated protein Tau leads to the development of so-called tauopathies characterized by the aggregation of hyperphosphorylated Tau within neurons. We recently showed that the constitutive activity of the serotonin receptor 7 (5-HT7R) is required for Tau hyperphosphorylation and aggregation through activation of the cyclin-dependent kinase 5 (CDK5). We also demonstrated physical interaction between 5-HT7R and CDK5 at the plasma membrane suggesting that the 5-HT7R/CDK5 complex is an integral part of the signaling network involved in Tau-mediated pathology. METHODS Using biochemical, microscopic, molecular biological, computational and AI-based approaches, we investigated structural requirements for the formation of 5-HT7R/CDK5 complex. RESULTS We demonstrated that 5-HT7R domains responsible for coupling to Gs proteins are not involved in receptor interaction with CDK5. We also created a structural model of the 5-HT7R/CDK5 complex and refined the interaction interface. The model predicted two conserved phenylalanine residues, F278 and F281, within the third intracellular loop of 5-HT7R to be potentially important for complex formation. While site-directed mutagenesis of these residues did not influence Gs protein-mediated receptor signaling, replacement of both phenylalanines by alanine residues significantly reduced 5-HT7R/CDK5 interaction and receptor-mediated CDK5 activation, leading to reduced Tau hyperphosphorylation and aggregation. Molecular dynamics simulations of 5-HT7R/CDK5 complex for wild-type and receptor mutants confirmed binding interface stability of the initial model. CONCLUSIONS Our results provide a structural basis for the development of novel drugs targeting the 5-HT7R/CDK5 interaction interface for the selective treatment of Tau-related disorders, including frontotemporal dementia and Alzheimer's disease.
Collapse
Affiliation(s)
- Jana Ackmann
- Department of Cellular Neurophysiology, Institute for Neurophysiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Alina Brüge
- Department of Cellular Neurophysiology, Institute for Neurophysiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Lizaveta Gotina
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul, Republic of Korea
- Division of Bio-Medical Science & Technology, KIST School, Korea University of Science and Technology (UST), Daejeon, Republic of Korea
| | - Sungsu Lim
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul, Republic of Korea
| | - Kathrin Jahreis
- Department of Cellular Neurophysiology, Institute for Neurophysiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Anna-Lena Vollbrecht
- Department of Cellular Neurophysiology, Institute for Neurophysiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Yun Kyung Kim
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul, Republic of Korea
- Division of Bio-Medical Science & Technology, KIST School, Korea University of Science and Technology (UST), Daejeon, Republic of Korea
| | - Ae Nim Pae
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul, Republic of Korea
- Division of Bio-Medical Science & Technology, KIST School, Korea University of Science and Technology (UST), Daejeon, Republic of Korea
| | - Josephine Labus
- Department of Cellular Neurophysiology, Institute for Neurophysiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany
| | - Evgeni Ponimaskin
- Department of Cellular Neurophysiology, Institute for Neurophysiology, Hannover Medical School, Carl-Neuberg-Str. 1, 30625, Hannover, Germany.
| |
Collapse
|
42
|
Wu LY, Wijesekara Y, Piedade GJ, Pappas N, Brussaard CPD, Dutilh BE. Benchmarking bioinformatic virus identification tools using real-world metagenomic data across biomes. Genome Biol 2024; 25:97. [PMID: 38622738 PMCID: PMC11020464 DOI: 10.1186/s13059-024-03236-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 04/01/2024] [Indexed: 04/17/2024] Open
Abstract
BACKGROUND As most viruses remain uncultivated, metagenomics is currently the main method for virus discovery. Detecting viruses in metagenomic data is not trivial. In the past few years, many bioinformatic virus identification tools have been developed for this task, making it challenging to choose the right tools, parameters, and cutoffs. As all these tools measure different biological signals, and use different algorithms and training and reference databases, it is imperative to conduct an independent benchmarking to give users objective guidance. RESULTS We compare the performance of nine state-of-the-art virus identification tools in thirteen modes on eight paired viral and microbial datasets from three distinct biomes, including a new complex dataset from Antarctic coastal waters. The tools have highly variable true positive rates (0-97%) and false positive rates (0-30%). PPR-Meta best distinguishes viral from microbial contigs, followed by DeepVirFinder, VirSorter2, and VIBRANT. Different tools identify different subsets of the benchmarking data and all tools, except for Sourmash, find unique viral contigs. Performance of tools improved with adjusted parameter cutoffs, indicating that adjustment of parameter cutoffs before usage should be considered. CONCLUSIONS Together, our independent benchmarking facilitates selecting choices of bioinformatic virus identification tools and gives suggestions for parameter adjustments to viromics researchers.
Collapse
Affiliation(s)
- Ling-Yi Wu
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
| | - Yasas Wijesekara
- Institute of Bioinformatics, University Medicine Greifswald, Felix Hausdorff Str. 8, 17475, Greifswald, Germany
| | - Gonçalo J Piedade
- Department Marine Microbiology and Biogeochemistry, NIOZ Royal Netherlands Institute for Sea Research, Den Burg, PO Box 59, Texel, 1790 AB, The Netherlands
- Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands
| | - Nikolaos Pappas
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands
| | - Corina P D Brussaard
- Department Marine Microbiology and Biogeochemistry, NIOZ Royal Netherlands Institute for Sea Research, Den Burg, PO Box 59, Texel, 1790 AB, The Netherlands
- Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands
| | - Bas E Dutilh
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, Padualaan 8, Utrecht, 3584 CH, The Netherlands.
- Institute of Biodiversity, Faculty of Biological Sciences, Cluster of Excellence Balance of the Microverse, Friedrich Schiller University Jena, 07743, Jena, Germany.
| |
Collapse
|
43
|
Gucwa K, Wons E, Wisniewska A, Jakalski M, Dubiak Z, Kozlowski LP, Mruk I. Lethal perturbation of an Escherichia coli regulatory network is triggered by a restriction-modification system's regulator and can be mitigated by excision of the cryptic prophage Rac. Nucleic Acids Res 2024; 52:2942-2960. [PMID: 38153127 PMCID: PMC11014345 DOI: 10.1093/nar/gkad1234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 12/08/2023] [Accepted: 12/13/2023] [Indexed: 12/29/2023] Open
Abstract
Bacterial gene regulatory networks orchestrate responses to environmental challenges. Horizontal gene transfer can bring in genes with regulatory potential, such as new transcription factors (TFs), and this can disrupt existing networks. Serious regulatory perturbations may even result in cell death. Here, we show the impact on Escherichia coli of importing a promiscuous TF that has adventitious transcriptional effects within the cryptic Rac prophage. A cascade of regulatory network perturbations occurred on a global level. The TF, a C regulatory protein, normally controls a Type II restriction-modification system, but in E. coli K-12 interferes with expression of the RacR repressor gene, resulting in de-repression of the normally-silent Rac ydaT gene. YdaT is a prophage-encoded TF with pleiotropic effects on E. coli physiology. In turn, YdaT alters expression of a variety of bacterial regulons normally controlled by the RcsA TF, resulting in deficient lipopolysaccharide biosynthesis and cell division. At the same time, insufficient RacR repressor results in Rac DNA excision, halting Rac gene expression due to loss of the replication-defective Rac prophage. Overall, Rac induction appears to counteract the lethal toxicity of YdaT. We show here that E. coli rewires its regulatory network, so as to minimize the adverse regulatory effects of the imported C TF. This complex set of interactions may reflect the ability of bacteria to protect themselves by having robust mechanisms to maintain their regulatory networks, and/or suggest that regulatory C proteins from mobile operons are under selection to manipulate their host's regulatory networks for their own benefit.
Collapse
Affiliation(s)
- Katarzyna Gucwa
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| | - Ewa Wons
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| | - Aleksandra Wisniewska
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| | - Marcin Jakalski
- 3P-Medicine Laboratory, Medical University of Gdansk, Debinki 7, 80-211 Gdansk, Poland
| | - Zuzanna Dubiak
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| | - Lukasz Pawel Kozlowski
- Institute of Informatics, Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Banacha 2, 02-097 Warsaw, Poland
| | - Iwona Mruk
- Department of Microbiology, Faculty of Biology, University of Gdansk, Wita Stwosza 59, Gdansk 80-308, Poland
| |
Collapse
|
44
|
Si Y, Yan C. Protein language model-embedded geometric graphs power inter-protein contact prediction. eLife 2024; 12:RP92184. [PMID: 38564241 PMCID: PMC10987090 DOI: 10.7554/elife.92184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/04/2024] Open
Abstract
Accurate prediction of contacting residue pairs between interacting proteins is very useful for structural characterization of protein-protein interactions. Although significant improvement has been made in inter-protein contact prediction recently, there is still a large room for improving the prediction accuracy. Here we present a new deep learning method referred to as PLMGraph-Inter for inter-protein contact prediction. Specifically, we employ rotationally and translationally invariant geometric graphs obtained from structures of interacting proteins to integrate multiple protein language models, which are successively transformed by graph encoders formed by geometric vector perceptrons and residual networks formed by dimensional hybrid residual blocks to predict inter-protein contacts. Extensive evaluation on multiple test sets illustrates that PLMGraph-Inter outperforms five top inter-protein contact prediction methods, including DeepHomo, GLINTER, CDPred, DeepHomo2, and DRN-1D2D_Inter, by large margins. In addition, we also show that the prediction of PLMGraph-Inter can complement the result of AlphaFold-Multimer. Finally, we show leveraging the contacts predicted by PLMGraph-Inter as constraints for protein-protein docking can dramatically improve its performance for protein complex structure prediction.
Collapse
Affiliation(s)
- Yunda Si
- School of Physics, Huazhong University of Science and TechnologyWuhanChina
| | - Chengfei Yan
- School of Physics, Huazhong University of Science and TechnologyWuhanChina
| |
Collapse
|
45
|
Wu JS, Liu Y, Ge F, Yu DJ. Prediction of protein-ATP binding residues using multi-view feature learning via contextual-based co-attention network. Comput Biol Med 2024; 172:108227. [PMID: 38460308 DOI: 10.1016/j.compbiomed.2024.108227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 01/17/2024] [Accepted: 02/25/2024] [Indexed: 03/11/2024]
Abstract
Accurately predicting protein-ATP binding residues is critical for protein function annotation and drug discovery. Computational methods dedicated to the prediction of binding residues based on protein sequence information have exhibited notable advancements in predictive accuracy. Nevertheless, these methods continue to grapple with several formidable challenges, including limited means of extracting more discriminative features and inadequate algorithms for integrating protein and residue information. To address the problems, we propose ATP-Deep, a novel protein-ATP binding residues predictor. ATP-Deep harnesses the capabilities of unsupervised pre-trained language models and incorporates domain-specific evolutionary context information from homologous sequences. It further refines the embedding at the residue level through integration with corresponding protein-level information and employs a contextual-based co-attention mechanism to adeptly fuse multiple sources of features. The performance evaluation results on the benchmark datasets reveal that ATP-Deep achieves an AUC of 0.954 and 0.951, respectively, surpassing the performance of the state-of-the-art model. These findings underscore the effectiveness of assimilating protein-level information and deploying a contextual-based co-attention mechanism grounded in context to bolster the prediction performance of protein-ATP binding residues.
Collapse
Affiliation(s)
- Jia-Shun Wu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing, 210094, China
| | - Yan Liu
- School of Information Engineering, Yangzhou University, 196 West Huayang, Yangzhou, 225100, China
| | - Fang Ge
- State Key Laboratory of Organic Electronics and Information Displays & Institute of Advanced Materials (IAM), Nanjing University of Posts & Telecommunications, 9 Wenyuan Road, Nanjing 210023, China
| | - Dong-Jun Yu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing, 210094, China.
| |
Collapse
|
46
|
Shallangwa GA, Mahmud AW, Uzairu A, Ibrahim MT. 2,4-disubstituted 6-fluoroquinolines as potent antiplasmodial agents: QSAR, homology modeling, molecular docking and ADMET studies. J Taibah Univ Med Sci 2024; 19:233-247. [PMID: 38179257 PMCID: PMC10762476 DOI: 10.1016/j.jtumed.2023.11.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 09/29/2023] [Accepted: 11/09/2023] [Indexed: 01/06/2024] Open
Abstract
Objective This work was designed to study 2,4-disubstituted 6-fluoroquinolines as antiplasmodial agents by using in silico techniques, to aid in the design of novel analogs with high potency against malaria and high inhibition of Plasmodium falciparum translation elongation factor 2 (PfeEF2), a novel drug target. Methods Quantitative structure-activity relationships (QSAR) of 2,4-disubstituted 6-fluoroquinolines were studied with the genetic function approximation technique in Material Studio software. The 3D structure of PfeEF2 was modeled in the SWISS-MODEL workspace through homology modeling. A molecular docking study of the modeled PfeEF2 and 2,4-disubstituted 6-fluoroquinolines was conducted with Autodock Vina in Pyrx software. Furthermore, the in silico pharmacokinetic properties of selected compounds were investigated. Results A robust, reliable and predictive QSAR model was developed that related the chemical structures of 2,4-disubstituted 6-fluoroquinolines to their antiplasmodium activities. The model had an internal squared correlation coefficient R2 of 0.921, adjusted squared correlation coefficient R2adj of 0.878, leave-one-out cross-validation coefficient Q2cv of 0.801 and predictive squared correlation coefficient R2pred of 0.901. The antiplasmodium activity of 6-fluoroquinolines was found to depend on the n5Ring, GGI9, TDB7u, TDB8u and RDF75i physicochemical properties: n5Ring, TDB8u and RDF75i were positively associated, whereas GGI9 and TDB7u were negatively associated, with the antiplasmodium activity of the compounds. Stable complexes formed between the compounds and modeled PfeEF2, with binding affinity ranging from -8.200 to -10.700 kcal/mol. Compounds 5, 11, 16, 22 and 24 had better binding affinities than quinoline-4-carboxamide (DDD107498), as well as good pharmacokinetic properties, and therefore may be better inhibitors of this novel target. Conclusion QSAR and docking studies provided insight into designing novel 2,4-disubstituted 6-fluoroquinolines with high antiplasmodial activity and good structural properties for inhibiting a novel antimalarial drug target.
Collapse
Affiliation(s)
| | - Aliyu W. Mahmud
- Department of Applied Chemistry, Kaduna Polytechnic, P.M.B 2021, Kaduna, Nigeria
| | - Adamu Uzairu
- Chemistry Department, Ahmadu Bello University, Zaria, Nigeria
| | | |
Collapse
|
47
|
Liu W, Wang Z, You R, Xie C, Wei H, Xiong Y, Yang J, Zhu S. PLMSearch: Protein language model powers accurate and fast sequence search for remote homology. Nat Commun 2024; 15:2775. [PMID: 38555371 PMCID: PMC10981738 DOI: 10.1038/s41467-024-46808-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2023] [Accepted: 03/08/2024] [Indexed: 04/02/2024] Open
Abstract
Homologous protein search is one of the most commonly used methods for protein annotation and analysis. Compared to structure search, detecting distant evolutionary relationships from sequences alone remains challenging. Here we propose PLMSearch (Protein Language Model), a homologous protein search method with only sequences as input. PLMSearch uses deep representations from a pre-trained protein language model and trains the similarity prediction model with a large number of real structure similarity. This enables PLMSearch to capture the remote homology information concealed behind the sequences. Extensive experimental results show that PLMSearch can search millions of query-target protein pairs in seconds like MMseqs2 while increasing the sensitivity by more than threefold, and is comparable to state-of-the-art structure search methods. In particular, unlike traditional sequence search methods, PLMSearch can recall most remote homology pairs with dissimilar sequences but similar structures. PLMSearch is freely available at https://dmiip.sjtu.edu.cn/PLMSearch .
Collapse
Affiliation(s)
- Wei Liu
- Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, 200433, Shanghai, China
| | - Ziye Wang
- Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, 200433, Shanghai, China
| | - Ronghui You
- Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, 200433, Shanghai, China
| | - Chenghan Xie
- School of Mathematical Sciences, Fudan University, 200433, Shanghai, China
| | - Hong Wei
- School of Mathematical Sciences, Nankai University, 300071, Tianjin, China
| | - Yi Xiong
- Department of Bioinformatics and Biostatistics, Shanghai Jiao Tong University, 200240, Shanghai, China
| | - Jianyi Yang
- Ministry of Education Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Science, Shandong University, 266237, Qingdao, China.
| | - Shanfeng Zhu
- Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, 200433, Shanghai, China.
- Shanghai Qi Zhi Institute, Shanghai, China.
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence (Fudan University), Ministry of Education, Shanghai, China.
- Shanghai Key Lab of Intelligent Information Processing and Shanghai Institute of Artificial Intelligence Algorithm, Fudan University, Shanghai, China.
- Zhangjiang Fudan International Innovation Center, Shanghai, China.
| |
Collapse
|
48
|
Bibik P, Alibai S, Pandini A, Dantu SC. PyCoM: a python library for large-scale analysis of residue-residue coevolution data. Bioinformatics 2024; 40:btae166. [PMID: 38532297 PMCID: PMC11009027 DOI: 10.1093/bioinformatics/btae166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 02/02/2024] [Accepted: 03/25/2024] [Indexed: 03/28/2024] Open
Abstract
MOTIVATION Computational methods to detect correlated amino acid positions in proteins have become a valuable tool to predict intra- and inter-residue protein contacts, protein structures, and effects of mutation on protein stability and function. While there are many tools and webservers to compute coevolution scoring matrices, there is no central repository of alignments and coevolution matrices for large-scale studies and pattern detection leveraging on biological and structural annotations already available in UniProt. RESULTS We present a Python library, PyCoM, which enables users to query and analyze coevolution matrices and sequence alignments of 457 622 proteins, selected from UniProtKB/Swiss-Prot database (length ≤ 500 residues), from a precompiled coevolution matrix database (PyCoMdb). PyCoM facilitates the development of statistical analyses of residue coevolution patterns using filters on biological and structural annotations from UniProtKB/Swiss-Prot, with simple access to PyCoMdb for both novice and advanced users, supporting Jupyter Notebooks, Python scripts, and a web API access. The resource is open source and will help in generating data-driven computational models and methods to study and understand protein structures, stability, function, and design. AVAILABILITY AND IMPLEMENTATION PyCoM code is freely available from https://github.com/scdantu/pycom and PyCoMdb and the Jupyter Notebook tutorials are freely available from https://pycom.brunel.ac.uk.
Collapse
Affiliation(s)
- Philipp Bibik
- Department of Computer Science, Brunel University London, Uxbridge UB8 3PH, United Kingdom
| | - Sabriyeh Alibai
- Department of Computer Science, Brunel University London, Uxbridge UB8 3PH, United Kingdom
| | - Alessandro Pandini
- Department of Computer Science, Brunel University London, Uxbridge UB8 3PH, United Kingdom
| | - Sarath Chandra Dantu
- Department of Computer Science, Brunel University London, Uxbridge UB8 3PH, United Kingdom
| |
Collapse
|
49
|
Luthringer R, Raphalen M, Guerra C, Colin S, Martinho C, Zheng M, Hoshino M, Badis Y, Lipinska AP, Haas FB, Barrera-Redondo J, Alva V, Coelho SM. Repeated co-option of HMG-box genes for sex determination in brown algae and animals. Science 2024; 383:eadk5466. [PMID: 38513029 DOI: 10.1126/science.adk5466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 01/31/2024] [Indexed: 03/23/2024]
Abstract
In many eukaryotes, genetic sex determination is not governed by XX/XY or ZW/ZZ systems but by a specialized region on the poorly studied U (female) or V (male) sex chromosomes. Previous studies have hinted at the existence of a dominant male-sex factor on the V chromosome in brown algae, a group of multicellular eukaryotes distantly related to animals and plants. The nature of this factor has remained elusive. Here, we demonstrate that an HMG-box gene acts as the male-determining factor in brown algae, mirroring the role HMG-box genes play in sex determination in animals. Over a billion-year evolutionary timeline, these lineages have independently co-opted the HMG box for male determination, representing a paradigm for evolution's ability to recurrently use the same genetic "toolkit" to accomplish similar tasks.
Collapse
Affiliation(s)
- Rémy Luthringer
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Morgane Raphalen
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Carla Guerra
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Sébastien Colin
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Claudia Martinho
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Min Zheng
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Masakazu Hoshino
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
- Research Center for Inland Seas, Kobe University, Kobe 658-0022, Japan
| | - Yacine Badis
- Roscoff Biological Station, CNRS-Sorbonne University, Place Georges Teissier, 29680 Roscoff, France
| | - Agnieszka P Lipinska
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Fabian B Haas
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Josué Barrera-Redondo
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Vikram Alva
- Department of Protein Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| | - Susana M Coelho
- Department of Algal Development and Evolution, Max Planck Institute for Biology Tübingen, 72076 Tübingen, Germany
| |
Collapse
|
50
|
Billman ZP, Kovacs SB, Wei B, Kang K, Cissé OH, Miao EA. Caspase-1 activates gasdermin A in non-mammals. eLife 2024; 12:RP92362. [PMID: 38497531 PMCID: PMC10948149 DOI: 10.7554/elife.92362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2024] Open
Abstract
Gasdermins oligomerize to form pores in the cell membrane, causing regulated lytic cell death called pyroptosis. Mammals encode five gasdermins that can trigger pyroptosis: GSDMA, B, C, D, and E. Caspase and granzyme proteases cleave the linker regions of and activate GSDMB, C, D, and E, but no endogenous activation pathways are yet known for GSDMA. Here, we perform a comprehensive evolutionary analysis of the gasdermin family. A gene duplication of GSDMA in the common ancestor of caecilian amphibians, reptiles, and birds gave rise to GSDMA-D in mammals. Uniquely in our tree, amphibian, reptile, and bird GSDMA group in a separate clade than mammal GSDMA. Remarkably, GSDMA in numerous bird species contain caspase-1 cleavage sites like YVAD or FASD in the linker. We show that GSDMA from birds, amphibians, and reptiles are all cleaved by caspase-1. Thus, GSDMA was originally cleaved by the host-encoded protease caspase-1. In mammals the caspase-1 cleavage site in GSDMA is disrupted; instead, a new protein, GSDMD, is the target of caspase-1. Mammal caspase-1 uses exosite interactions with the GSDMD C-terminal domain to confer the specificity of this interaction, whereas we show that bird caspase-1 uses a stereotypical tetrapeptide sequence to confer specificity for bird GSDMA. Our results reveal an evolutionarily stable association between caspase-1 and the gasdermin family, albeit a shifting one. Caspase-1 repeatedly changes its target gasdermin over evolutionary time at speciation junctures, initially cleaving GSDME in fish, then GSDMA in amphibians/reptiles/birds, and finally GSDMD in mammals.
Collapse
Affiliation(s)
- Zachary Paul Billman
- Department of Integrative Immunobiology; Molecular Genetics and Microbiology; Pathology; and Cell Biology, Duke University School of MedicineDurhamUnited States
- Department of Microbiology and Immunology, University of North Carolina at Chapel HillChapel HillUnited States
| | - Stephen Bela Kovacs
- Department of Integrative Immunobiology; Molecular Genetics and Microbiology; Pathology; and Cell Biology, Duke University School of MedicineDurhamUnited States
- Department of Microbiology and Immunology, University of North Carolina at Chapel HillChapel HillUnited States
| | - Bo Wei
- Department of Integrative Immunobiology; Molecular Genetics and Microbiology; Pathology; and Cell Biology, Duke University School of MedicineDurhamUnited States
| | - Kidong Kang
- Department of Integrative Immunobiology; Molecular Genetics and Microbiology; Pathology; and Cell Biology, Duke University School of MedicineDurhamUnited States
| | - Ousmane H Cissé
- Critical Care Medicine Department, National Institutes of Health Clinical CenterBethesdaUnited States
| | - Edward A Miao
- Department of Integrative Immunobiology; Molecular Genetics and Microbiology; Pathology; and Cell Biology, Duke University School of MedicineDurhamUnited States
| |
Collapse
|