1
|
Bedree JK, Bourgeois J, Balani P, Cen L, Hendrickson EL, Kerns KA, Camilli A, McLean JS, Shi W, He X. Identifying essential genes in Schaalia odontolytica using a highly-saturated transposon library. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.17.604004. [PMID: 39071323 PMCID: PMC11275721 DOI: 10.1101/2024.07.17.604004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
The unique epibiotic-parasitic relationship between Nanosynbacter lyticus type strain TM7x, a member of the newly identified Candidate Phyla Radiation, now referred to as Patescibacteria , and its basibiont, Schaalia odontolytica strain XH001 (formerly Actinomyces odontolyticus) , require more powerful genetic tools for deeper understanding of the genetic underpinnings that mediate their obligate relationship. Previous studies have mainly characterized the genomic landscape of XH001 during or post TM7x infection through comparative genomic or transcriptomic analyses followed by phenotypic analysis. Comprehensive genetic dissection of the pair is currently cumbersome due to the lack of robust genetic tools in TM7x. However, basic genetic tools are available for XH001 and this study expands the current genetic toolset by developing high-throughput transposon insertion sequencing (Tn-seq). Tn-seq was employed to screen for essential genes in XH001 under laboratory conditions. A highly saturated Tn-seq library was generated with nearly 660,000 unique insertion mutations, averaging one insertion every 2-3 nucleotides. 203 genes, 10.5% of the XH001 genome, were identified as putatively essential.
Collapse
|
2
|
Hasan A, Alonazi WB, Ibrahim M, Bin L. Immunoinformatics and Reverse Vaccinology Approach for the Identification of Potential Vaccine Candidates against Vandammella animalimors. Microorganisms 2024; 12:1270. [PMID: 39065039 PMCID: PMC11278545 DOI: 10.3390/microorganisms12071270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 06/17/2024] [Accepted: 06/19/2024] [Indexed: 07/28/2024] Open
Abstract
Vandammella animalimorsus is a Gram-negative and non-motile bacterium typically transmitted to humans through direct contact with the saliva of infected animals, primarily through biting, scratches, or licks on fractured skin. The absence of a confirmed post-exposure treatment of V. animalimorsus bacterium highlights the imperative for developing an effective vaccine. We intended to determine potential vaccine candidates and paradigm a chimeric vaccine against V. animalimorsus by accessible public data analysis of the strain by utilizing reverse vaccinology. By subtractive genomics, five outer membranes were prioritized as potential vaccine candidates out of 2590 proteins. Based on the instability index and transmembrane helices, a multidrug transporter protein with locus ID A0A2A2AHJ4 was designated as a potential candidate for vaccine construct. Sixteen immunodominant epitopes were retrieved by utilizing the Immune Epitope Database. The epitope encodes the strong binding affinity, nonallergenic properties, non-toxicity, high antigenicity scores, and high solubility revealing the more appropriate vaccine construct. By utilizing appropriate linkers and adjuvants alongside a suitable adjuvant molecule, the epitopes were integrated into a chimeric vaccine to enhance immunogenicity, successfully eliciting both adaptive and innate immune responses. Moreover, the promising physicochemical features, the binding confirmation of the vaccine to the major innate immune receptor TLR-4, and molecular dynamics simulations of the designed vaccine have revealed the promising potential of the selected candidate. The integration of computational methods and omics data has demonstrated significant advantages in discovering novel vaccine targets and mitigating vaccine failure rates during clinical trials in recent years.
Collapse
Affiliation(s)
- Ahmad Hasan
- State Key Laboratory of Rice Biology and Breeding, Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Biotechnology, Zhejiang University, Hangzhou 310058, China; (A.H.); (M.I.)
| | - Wadi B. Alonazi
- Health Administration Department, College of Business Administration, King Saud University, Riyadh 11421, Saudi Arabia;
| | - Muhammad Ibrahim
- State Key Laboratory of Rice Biology and Breeding, Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Biotechnology, Zhejiang University, Hangzhou 310058, China; (A.H.); (M.I.)
| | - Li Bin
- State Key Laboratory of Rice Biology and Breeding, Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Biotechnology, Zhejiang University, Hangzhou 310058, China; (A.H.); (M.I.)
| |
Collapse
|
3
|
Santos AS, Costa VAF, Freitas VAQ, Dos Anjos LRB, de Almeida Santos ES, Arantes TD, Costa CR, de Sene Amâncio Zara AL, do Rosário Rodrigues Silva M, Neves BJ. Drug to genome to drug: a computational large-scale chemogenomics screening for novel drug candidates against sporotrichosis. Braz J Microbiol 2024:10.1007/s42770-024-01406-x. [PMID: 38888692 DOI: 10.1007/s42770-024-01406-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Accepted: 05/28/2024] [Indexed: 06/20/2024] Open
Abstract
Sporotrichosis is recognized as the predominant subcutaneous mycosis in South America, attributed to pathogenic species within the Sporothrix genus. Notably, in Brazil, Sporothrix brasiliensis emerges as the principal species, exhibiting significant sapronotic, zoonotic and enzootic epidemic potential. Consequently, the discovery of novel therapeutic agents for the treatment of sporotrichosis is imperative. The present study is dedicated to the repositioning of pharmaceuticals for sporotrichosis therapy. To achieve this goal, we designed a pipeline with the following steps: (a) compilation and preparation of Sporothrix genome data; (b) identification of orthologous proteins among the species; (c) identification of homologous proteins in publicly available drug-target databases; (d) selection of Sporothrix essential targets using validated genes from Saccharomyces cerevisiae; (e) molecular modeling studies; and (f) experimental validation of selected candidates. Based on this approach, we were able to prioritize eight drugs for in vitro experimental validation. Among the evaluated compounds, everolimus and bifonazole demonstrated minimum inhibitory concentration (MIC) values of 0.5 µg/mL and 4.0 µg/mL, respectively. Subsequently, molecular docking studies suggest that bifonazole and everolimus may target specific proteins within S. brasiliensis- namely, sterol 14-α-demethylase and serine/threonine-protein kinase TOR, respectively. These findings shed light on the potential binding affinities and binding modes of bifonazole and everolimus with their probable targets, providing a preliminary understanding of the antifungal mechanism of action of these compounds. In conclusion, our research advances the understanding of the therapeutic potential of bifonazole and everolimus, supporting their further investigation as antifungal agents for sporotrichosis in prospective hit-to-lead and preclinical investigations.
Collapse
Affiliation(s)
- Andressa Santana Santos
- Institute of Tropical Pathology and Public Health, Federal University of Goiás, Goiânia, Goiás, Brazil
- Laboratory of Cheminformatics, Faculty of Pharmacy, Federal University of Goiás, Goiânia, Goiás, Brazil
| | | | | | - Laura Raniere Borges Dos Anjos
- Institute of Tropical Pathology and Public Health, Federal University of Goiás, Goiânia, Goiás, Brazil
- Laboratory of Cheminformatics, Faculty of Pharmacy, Federal University of Goiás, Goiânia, Goiás, Brazil
| | | | - Thales Domingos Arantes
- Institute of Tropical Pathology and Public Health, Federal University of Goiás, Goiânia, Goiás, Brazil
| | - Carolina Rodrigues Costa
- Institute of Tropical Pathology and Public Health, Federal University of Goiás, Goiânia, Goiás, Brazil
| | - Ana Laura de Sene Amâncio Zara
- Postgraduate Program in Health Technology Assistance and Assessment (PPG-AAS), Faculty of Pharmacy, Federal University of Goiás, Goiânia, Goiás, Brazil
| | | | - Bruno Junior Neves
- Laboratory of Cheminformatics, Faculty of Pharmacy, Federal University of Goiás, Goiânia, Goiás, Brazil.
| |
Collapse
|
4
|
Lu P, Tian J. ACDMBI: A deep learning model based on community division and multi-source biological information fusion predicts essential proteins. Comput Biol Chem 2024; 112:108115. [PMID: 38865861 DOI: 10.1016/j.compbiolchem.2024.108115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2024] [Revised: 05/15/2024] [Accepted: 05/28/2024] [Indexed: 06/14/2024]
Abstract
Accurately identifying essential proteins is vital for drug research and disease diagnosis. Traditional centrality methods and machine learning approaches often face challenges in accurately discerning essential proteins, primarily relying on information derived from protein-protein interaction (PPI) networks. Despite attempts by some researchers to integrate biological data and PPI networks for predicting essential proteins, designing effective integration methods remains a challenge. In response to these challenges, this paper presents the ACDMBI model, specifically designed to overcome the aforementioned issues. ACDMBI is comprised of two key modules: feature extraction and classification. In terms of capturing relevant information, we draw insights from three distinct data sources. Initially, structural features of proteins are extracted from the PPI network through community division. Subsequently, these features are further optimized using Graph Convolutional Networks (GCN) and Graph Attention Networks (GAT). Moving forward, protein features are extracted from gene expression data utilizing Bidirectional Long Short-Term Memory networks (BiLSTM) and a multi-head self-attention mechanism. Finally, protein features are derived by mapping subcellular localization data to a one-dimensional vector and processing it through fully connected layers. In the classification phase, we integrate features extracted from three different data sources, crafting a multi-layer deep neural network (DNN) for protein classification prediction. Experimental results on brewing yeast data showcase the ACDMBI model's superior performance, with AUC reaching 0.9533 and AUPR reaching 0.9153. Ablation experiments further reveal that the effective integration of features from diverse biological information significantly boosts the model's performance.
Collapse
Affiliation(s)
- Pengli Lu
- School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China.
| | - Jialong Tian
- School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China.
| |
Collapse
|
5
|
Rebhi S, Basharat Z, Wei CR, Lebbal S, Najjaa H, Sadfi-Zouaoui N, Messaoudi A. Core proteome mediated subtractive approach for the identification of potential therapeutic drug target against the honeybee pathogen Paenibacillus larvae. PeerJ 2024; 12:e17292. [PMID: 38818453 PMCID: PMC11138523 DOI: 10.7717/peerj.17292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 04/02/2024] [Indexed: 06/01/2024] Open
Abstract
Background & Objectives American foulbrood (AFB), caused by the highly virulent, spore-forming bacterium Paenibacillus larvae, poses a significant threat to honey bee brood. The widespread use of antibiotics not only fails to effectively combat the disease but also raises concerns regarding honey safety. The current computational study was attempted to identify a novel therapeutic drug target against P. larvae, a causative agent of American foulbrood disease in honey bee. Methods We investigated effective novel drug targets through a comprehensive in silico pan-proteome and hierarchal subtractive sequence analysis. In total, 14 strains of P. larvae genomes were used to identify core genes. Subsequently, the core proteome was systematically narrowed down to a single protein predicted as the potential drug target. Alphafold software was then employed to predict the 3D structure of the potential drug target. Structural docking was carried out between a library of phytochemicals derived from traditional Chinese flora (n > 36,000) and the potential receptor using Autodock tool 1.5.6. Finally, molecular dynamics (MD) simulation study was conducted using GROMACS to assess the stability of the best-docked ligand. Results Proteome mining led to the identification of Ketoacyl-ACP synthase III as a highly promising therapeutic target, making it a prime candidate for inhibitor screening. The subsequent virtual screening and MD simulation analyses further affirmed the selection of ZINC95910054 as a potent inhibitor, with the lowest binding energy. This finding presents significant promise in the battle against P. larvae. Conclusions Computer aided drug design provides a novel approach for managing American foulbrood in honey bee populations, potentially mitigating its detrimental effects on both bee colonies and the honey industry.
Collapse
Affiliation(s)
- Sawsen Rebhi
- Université de Tunis-El Manar, Laboratoire de Mycologie, Pathologies et Biomarqueurs, Département de Biologie, Tunis, Tunisia
| | | | - Calvin R. Wei
- Department of Research and Development, Shing Huei Group, Taipei, Taiwan
| | - Salim Lebbal
- University of Khenchela, Department of Agricultural Sciences, Faculty of Nature and Life Sciences, Khenchela, Algeria
| | - Hanen Najjaa
- University of Gabes, Laboratory of Pastoral Ecosystem and Valorization of Spontaneous Plants and Associated Microorganisms, Institute of Arid Lands of Medenine, Medenine, Tunisia
| | - Najla Sadfi-Zouaoui
- Université de Tunis-El Manar, Laboratoire de Mycologie, Pathologies et Biomarqueurs, Département de Biologie, Tunis, Tunisia
| | - Abdelmonaem Messaoudi
- Université de Tunis-El Manar, Laboratoire de Mycologie, Pathologies et Biomarqueurs, Département de Biologie, Tunis, Tunisia
- Jendouba University, Higher Institute of Biotechnology of Beja, Beja, Tunisia
| |
Collapse
|
6
|
Pan L, Wang H, Yang B, Li W. A protein network refinement method based on module discovery and biological information. BMC Bioinformatics 2024; 25:157. [PMID: 38643108 PMCID: PMC11031909 DOI: 10.1186/s12859-024-05772-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Accepted: 04/10/2024] [Indexed: 04/22/2024] Open
Abstract
BACKGROUND The identification of essential proteins can help in understanding the minimum requirements for cell survival and development to discover drug targets and prevent disease. Nowadays, node ranking methods are a common way to identify essential proteins, but the poor data quality of the underlying PIN has somewhat hindered the identification accuracy of essential proteins for these methods in the PIN. Therefore, researchers constructed refinement networks by considering certain biological properties of interacting protein pairs to improve the performance of node ranking methods in the PIN. Studies show that proteins in a complex are more likely to be essential than proteins not present in the complex. However, the modularity is usually ignored for the refinement methods of the PINs. METHODS Based on this, we proposed a network refinement method based on module discovery and biological information. The idea is, first, to extract the maximal connected subgraph in the PIN, and to divide it into different modules by using Fast-unfolding algorithm; then, to detect critical modules according to the orthologous information, subcellular localization information and topology information within each module; finally, to construct a more refined network (CM-PIN) by using the identified critical modules. RESULTS To evaluate the effectiveness of the proposed method, we used 12 typical node ranking methods (LAC, DC, DMNC, NC, TP, LID, CC, BC, PR, LR, PeC, WDC) to compare the overall performance of the CM-PIN with those on the S-PIN, D-PIN and RD-PIN. The experimental results showed that the CM-PIN was optimal in terms of the identification number of essential proteins, precision-recall curve, Jackknifing method and other criteria, and can help to identify essential proteins more accurately.
Collapse
Affiliation(s)
- Li Pan
- Hunan Institute of Science and Technology, Yueyang, 414006, China
- Hunan Engineering Research Center of Multimodal Health Sensing and Intelligent Analysis, Yueyang, 414006, China
| | - Haoyue Wang
- Hunan Institute of Science and Technology, Yueyang, 414006, China.
| | - Bo Yang
- Hunan Institute of Science and Technology, Yueyang, 414006, China
- Hunan Engineering Research Center of Multimodal Health Sensing and Intelligent Analysis, Yueyang, 414006, China
| | - Wenbin Li
- Hunan Institute of Science and Technology, Yueyang, 414006, China.
| |
Collapse
|
7
|
Masse M, Hutchinson RB, Morgan CE, Allaman HJ, Guan H, Yu EW, Cavagnero S. Mapping Protein-Protein Interactions at Birth: Single-Particle Cryo-EM Analysis of a Ribosome-Nascent Globin Complex. ACS CENTRAL SCIENCE 2024; 10:385-401. [PMID: 38435509 PMCID: PMC10906257 DOI: 10.1021/acscentsci.3c00777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 12/29/2023] [Accepted: 01/03/2024] [Indexed: 03/05/2024]
Abstract
Interactions between ribosome-bound nascent chains (RNCs) and ribosomal components are critical to elucidate the mechanism of cotranslational protein folding. Nascent protein-ribosome contacts within the ribosomal exit tunnel were previously assessed mostly in the presence of C-terminal stalling sequences, yet little is known about contacts taking place in the absence of these strongly interacting motifs. Further, there is nearly no information about ribosomal proteins (r-proteins) interacting with nascent chains within the outer surface of the ribosome. Here, we combine chemical cross-linking, single-particle cryo-EM, and fluorescence anisotropy decays to determine the structural features of ribosome-bound apomyoglobin (apoMb). Within the ribosomal exit tunnel core, interactions are similar to those identified in previous reports. However, once the RNC enters the tunnel vestibule, it becomes more dynamic and interacts with ribosomal RNA (rRNA) and the L23 r-protein. Remarkably, on the outer surface of the ribosome, RNCs interact mainly with a highly conserved nonpolar patch of the L23 r-protein. RNCs also comprise a compact and dynamic N-terminal region lacking contact with the ribosome. In all, apoMb traverses the ribosome and interacts with it via its C-terminal region, while N-terminal residues sample conformational space and form a compact subdomain before the entire nascent protein sequence departs from the ribosome.
Collapse
Affiliation(s)
- Meranda
M. Masse
- Department
of Chemistry, University of Wisconsin−Madison, Madison, Wisconsin 53706, United States
| | - Rachel B. Hutchinson
- Department
of Chemistry, University of Wisconsin−Madison, Madison, Wisconsin 53706, United States
| | - Christopher E. Morgan
- Department
of Pharmacology, Case Western Reserve University, Cleveland, Ohio 44106, United States
| | - Heather J. Allaman
- Department
of Chemistry, University of Wisconsin−Madison, Madison, Wisconsin 53706, United States
| | - Hongqing Guan
- Department
of Chemistry, University of Wisconsin−Madison, Madison, Wisconsin 53706, United States
| | - Edward W. Yu
- Department
of Pharmacology, Case Western Reserve University, Cleveland, Ohio 44106, United States
| | - Silvia Cavagnero
- Department
of Chemistry, University of Wisconsin−Madison, Madison, Wisconsin 53706, United States
| |
Collapse
|
8
|
Aziz S, Waqas M, Naz HF, Halim SA, Jan A, Muhsinah AB, Khan A, Al-Harrasi A. Identification of novel compounds and repurposing of FDA drugs for 1-deoxy-D-xylulose 5-phosphate reductoisomerase enzyme of Plasmodium falciparum to combat malaria resistance. Int J Biol Macromol 2024; 257:128672. [PMID: 38092105 DOI: 10.1016/j.ijbiomac.2023.128672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 11/28/2023] [Accepted: 12/06/2023] [Indexed: 01/27/2024]
Abstract
The rise of Plasmodium falciparum resistance to Artemisinin-based combination therapies (ACTs) is a significant concern in the fight against malaria. This situation calls for the search for novel anti-malarial candidates. 1-deoxy-D-xylulose 5-phosphate reductoisomerase (IspC) is a potential target involved in various cellular processes in P. falciparum (Pf). We screened ∼0.69 billion novel compounds from the ZINC20 library and repurposed ∼1400 FDA drugs using computational drug discovery methods against PfIspC. Following our computational pipeline, we found five novel ZINC20 compounds (Z-2, Z-3, Z-10, Z-13, and Z-14) and three FDA drugs (Aliskiren, Ceftolozane, and Ombitasvir) that showed striking docking energy (ranging from -8.405 to -10.834 kcal/mol), and strong interactions with key binding site residues (Ser269, Ser270, Ser306, Asn311, Lys312, and Met360) of PfIspC. The novel anti-malarial compounds also exhibited favorable pharmacokinetics and physicochemical properties. Furthermore, through molecular dynamics simulation, we observed the stable dynamics of PfIspC-inhibitor complexes and the influence of inhibitor binding on the protein's conformational arrangements. Notably, the binding free energy estimation confirmed high binding affinity (varied from -11.68 to -33.16 kcal/mol) of these compounds for PfIspC. Our findings could contribute to the ongoing efforts in combating malaria and invite experimental-lab researchers for validation.
Collapse
Affiliation(s)
- Shahkaar Aziz
- Institute of Biotechnology and Genetic Engineering, The University of Agriculture, Peshawar 25130, Pakistan
| | - Muhammad Waqas
- Department of Biotechnology and Genetic Engineering, Hazara University, Mansehra 21120, Pakistan; Natural and Medical Sciences Research Center, University of Nizwa, Birkat-ul-Mouz, 616 Nizwa, Oman
| | - Hafiza Farah Naz
- Department of Biotechnology, , Quaid-i-Azam University, Islamabad 45320, Pakistan
| | - Sobia Ahsan Halim
- Natural and Medical Sciences Research Center, University of Nizwa, Birkat-ul-Mouz, 616 Nizwa, Oman
| | - Afnan Jan
- Department of Biochemistry, Faculty of Medicine, Umm Al-Qura University, Makkah, Kingdom of Saudi Arabia
| | - Abdullatif Bin Muhsinah
- Department of Pharmacognosy, College of Pharmacy, King Khalid University, Abha 61441, Saudi Arabia
| | - Ajmal Khan
- Natural and Medical Sciences Research Center, University of Nizwa, Birkat-ul-Mouz, 616 Nizwa, Oman.
| | - Ahmed Al-Harrasi
- Natural and Medical Sciences Research Center, University of Nizwa, Birkat-ul-Mouz, 616 Nizwa, Oman.
| |
Collapse
|
9
|
Liang Y, Luo H, Lin Y, Gao F. Recent advances in the characterization of essential genes and development of a database of essential genes. IMETA 2024; 3:e157. [PMID: 38868518 PMCID: PMC10989110 DOI: 10.1002/imt2.157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 10/09/2023] [Indexed: 06/14/2024]
Abstract
Over the past few decades, there has been a significant interest in the study of essential genes, which are crucial for the survival of an organism under specific environmental conditions and thus have practical applications in the fields of synthetic biology and medicine. An increasing amount of experimental data on essential genes has been obtained with the continuous development of technological methods. Meanwhile, various computational prediction methods, related databases and web servers have emerged accordingly. To facilitate the study of essential genes, we have established a database of essential genes (DEG), which has become popular with continuous updates to facilitate essential gene feature analysis and prediction, drug and vaccine development, as well as artificial genome design and construction. In this article, we summarized the studies of essential genes, overviewed the relevant databases, and discussed their practical applications. Furthermore, we provided an overview of the main applications of DEG and conducted comprehensive analyses based on its latest version. However, it should be noted that the essential gene is a dynamic concept instead of a binary one, which presents both opportunities and challenges for their future development.
Collapse
Affiliation(s)
| | - Hao Luo
- Department of PhysicsTianjin UniversityTianjinChina
| | - Yan Lin
- Department of PhysicsTianjin UniversityTianjinChina
| | - Feng Gao
- Department of PhysicsTianjin UniversityTianjinChina
- Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education)Tianjin UniversityTianjinChina
- SynBio Research PlatformCollaborative Innovation Center of Chemical Science and Engineering (Tianjin)TianjinChina
| |
Collapse
|
10
|
Payra AK, Saha B, Ghosh A. MEM-FET: Essential protein prediction using membership feature and machine learning approach. Proteins 2024; 92:60-75. [PMID: 37638618 DOI: 10.1002/prot.26577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 02/21/2023] [Accepted: 08/08/2023] [Indexed: 08/29/2023]
Abstract
Proteins are played key roles in different functionalities in our daily life. All functional roles of a protein are a bit enhanced in interaction compared to individuals. Identification of essential proteins of an organism is a time consume and costly task during observation in the wet lab. The results of observation in wet lab always ensure high reliability and accuracy in the biological ground. Essential protein prediction using computational approaches is an alternative choice in research. It proves its significance rapidly in day-to-day life as well as reduces the experimental cost of wet lab effectively. Existing computational methods were implemented using Protein interaction networks (PPIN), Sequence, Gene Expression Dataset (GED), Gene Ontology (GO), Orthologous groups, and Subcellular localized datasets. Machine learning has diverse categories of features that enable to model and predict essential macromolecules of understudied organisms. A novel methodology MEM-FET (membership feature) is predicted based on features, that is, edge clustering coefficient, Average clustering coefficient, subcellular localization, and Gene Ontology within a compartment of common neighbors. The accuracy (ACC) values of the predicted true positive (TP) essential proteins are 0.79, 0.74, 0.78, and 0.71 for YHQ, YMIPS, YDIP, and YMBD datasets. An enriched set of essential proteins are also predicted using the MEM-FET algorithm. Ensemble ML also validated the proposed model with an accuracy of 60%. It has been predicted that MEM-FET algorithms outperform other existing algorithms with an ACC value of 80% for the yeast dataset.
Collapse
Affiliation(s)
- Anjan Kumar Payra
- Department of Computer Science and Engineering, Dr. Sudhir Chandra Sur Degree Engineering College, Kolkata, India
| | - Banani Saha
- Department of Computer Science and Engineering, University of Calcutta, Kolkata, India
| | - Anupam Ghosh
- Department of Computer Science and Engineering, Netaji Subhash Engineering College, Kolkata, India
| |
Collapse
|
11
|
Li G, Luo X, Hu Z, Wu J, Peng W, Liu J, Zhu X. Essential proteins discovery based on dominance relationship and neighborhood similarity centrality. Health Inf Sci Syst 2023; 11:55. [PMID: 37981988 PMCID: PMC10654316 DOI: 10.1007/s13755-023-00252-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 10/13/2023] [Indexed: 11/21/2023] Open
Abstract
Essential proteins play a vital role in development and reproduction of cells. The identification of essential proteins helps to understand the basic survival of cells. Due to time-consuming, costly and inefficient with biological experimental methods for discovering essential proteins, computational methods have gained increasing attention. In the initial stage, essential proteins are mainly identified by the centralities based on protein-protein interaction (PPI) networks, which limit their identification rate due to many false positives in PPI networks. In this study, a purified PPI network is firstly introduced to reduce the impact of false positives in the PPI network. Secondly, by analyzing the similarity relationship between a protein and its neighbors in the PPI network, a new centrality called neighborhood similarity centrality (NSC) is proposed. Thirdly, based on the subcellular localization and orthologous data, the protein subcellular localization score and ortholog score are calculated, respectively. Fourthly, by analyzing a large number of methods based on multi-feature fusion, it is found that there is a special relationship among features, which is called dominance relationship, then, a novel model based on dominance relationship is proposed. Finally, NSC, subcellular localization score, and ortholog score are fused by the dominance relationship model, and a new method called NSO is proposed. In order to verify the performance of NSO, the seven representative methods (ION, NCCO, E_POC, SON, JDC, PeC, WDC) are compared on yeast datasets. The experimental results show that the NSO method has higher identification rate than other methods.
Collapse
Affiliation(s)
- Gaoshi Li
- Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin, 541004 China
- Guangxi Key Lab of Multi-source Information Mining & Security, Guangxi Normal University, Guilin, 541004 Guangxi China
- College of Computer Science and Engineering, Guangxi Normal University, Guilin, 541004 Guangxi China
| | - Xinlong Luo
- Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin, 541004 China
- Guangxi Key Lab of Multi-source Information Mining & Security, Guangxi Normal University, Guilin, 541004 Guangxi China
- College of Computer Science and Engineering, Guangxi Normal University, Guilin, 541004 Guangxi China
| | - Zhipeng Hu
- Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin, 541004 China
- Guangxi Key Lab of Multi-source Information Mining & Security, Guangxi Normal University, Guilin, 541004 Guangxi China
- College of Computer Science and Engineering, Guangxi Normal University, Guilin, 541004 Guangxi China
| | - Jingli Wu
- Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin, 541004 China
- Guangxi Key Lab of Multi-source Information Mining & Security, Guangxi Normal University, Guilin, 541004 Guangxi China
- College of Computer Science and Engineering, Guangxi Normal University, Guilin, 541004 Guangxi China
| | - Wei Peng
- Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, 650500 Yunnan China
| | - Jiafei Liu
- Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin, 541004 China
- Guangxi Key Lab of Multi-source Information Mining & Security, Guangxi Normal University, Guilin, 541004 Guangxi China
- College of Computer Science and Engineering, Guangxi Normal University, Guilin, 541004 Guangxi China
| | - Xiaoshu Zhu
- Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal University, Guilin, 541004 China
- Guangxi Key Lab of Multi-source Information Mining & Security, Guangxi Normal University, Guilin, 541004 Guangxi China
- College of Computer Science and Engineering, Guangxi Normal University, Guilin, 541004 Guangxi China
- School of Computer and Information Security & School of Software Engineering, Guilin University of Electronic Science and Technology, Guilin, China
| |
Collapse
|
12
|
Zhao H, Liu G, Cao X. A seed expansion-based method to identify essential proteins by integrating protein-protein interaction sub-networks and multiple biological characteristics. BMC Bioinformatics 2023; 24:452. [PMID: 38036960 PMCID: PMC10688502 DOI: 10.1186/s12859-023-05583-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2023] [Accepted: 11/24/2023] [Indexed: 12/02/2023] Open
Abstract
BACKGROUND The identification of essential proteins is of great significance in biology and pathology. However, protein-protein interaction (PPI) data obtained through high-throughput technology include a high number of false positives. To overcome this limitation, numerous computational algorithms based on biological characteristics and topological features have been proposed to identify essential proteins. RESULTS In this paper, we propose a novel method named SESN for identifying essential proteins. It is a seed expansion method based on PPI sub-networks and multiple biological characteristics. Firstly, SESN utilizes gene expression data to construct PPI sub-networks. Secondly, seed expansion is performed simultaneously in each sub-network, and the expansion process is based on the topological features of predicted essential proteins. Thirdly, the error correction mechanism is based on multiple biological characteristics and the entire PPI network. Finally, SESN analyzes the impact of each biological characteristic, including protein complex, gene expression data, GO annotations, and subcellular localization, and adopts the biological data with the best experimental results. The output of SESN is a set of predicted essential proteins. CONCLUSIONS The analysis of each component of SESN indicates the effectiveness of all components. We conduct comparison experiments using three datasets from two species, and the experimental results demonstrate that SESN achieves superior performance compared to other methods.
Collapse
Affiliation(s)
- He Zhao
- College of Computer Science and Technology, Jilin University, Changchun, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
| | - Guixia Liu
- College of Computer Science and Technology, Jilin University, Changchun, China.
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China.
| | - Xintian Cao
- College of Computer Science and Technology, Jilin University, Changchun, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
| |
Collapse
|
13
|
Toyomoto T, Ono K, Shiba T, Momitani K, Zhang T, Tsutsuki H, Ishikawa T, Hoso K, Hamada K, Rahman A, Wen L, Maeda Y, Yamamoto K, Matsuoka M, Hanaoka K, Niidome T, Akaike T, Sawa T. Alkyl gallates inhibit serine O-acetyltransferase in bacteria and enhance susceptibility of drug-resistant Gram-negative bacteria to antibiotics. Front Microbiol 2023; 14:1276447. [PMID: 37965540 PMCID: PMC10641863 DOI: 10.3389/fmicb.2023.1276447] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 10/12/2023] [Indexed: 11/16/2023] Open
Abstract
A principal concept in developing antibacterial agents with selective toxicity is blocking metabolic pathways that are critical for bacterial growth but that mammalian cells lack. Serine O-acetyltransferase (CysE) is an enzyme in many bacteria that catalyzes the first step in l-cysteine biosynthesis by transferring an acetyl group from acetyl coenzyme A (acetyl-CoA) to l-serine to form O-acetylserine. Because mammalian cells lack this l-cysteine biosynthesis pathway, developing an inhibitor of CysE has been thought to be a way to establish a new class of antibacterial agents. Here, we demonstrated that alkyl gallates such as octyl gallate (OGA) could act as potent CysE inhibitors in vitro and in bacteria. Mass spectrometry analyses indicated that OGA treatment markedly reduced intrabacterial levels of l-cysteine and its metabolites including glutathione and glutathione persulfide in Escherichia coli to a level similar to that found in E. coli lacking the cysE gene. Consistent with the reduction of those antioxidant molecules in bacteria, E. coli became vulnerable to hydrogen peroxide-mediated bacterial killing in the presence of OGA. More important, OGA treatment intensified susceptibilities of metallo-β-lactamase-expressing Gram-negative bacteria (E. coli and Klebsiella pneumoniae) to carbapenem. Structural analyses showed that alkyl gallate bound to the binding site for acetyl-CoA that limits access of acetyl-CoA to the active site. Our data thus suggest that CysE inhibitors may be used to treat infectious diseases caused by drug-resistant Gram-negative bacteria not only via direct antibacterial activity but also by enhancing therapeutic potentials of existing antibiotics.
Collapse
Affiliation(s)
- Touya Toyomoto
- Department of Microbiology, Graduate School of Medical Sciences, Kumamoto University, Kumamoto, Japan
| | - Katsuhiko Ono
- Department of Microbiology, Graduate School of Medical Sciences, Kumamoto University, Kumamoto, Japan
| | - Tomoo Shiba
- Department of Applied Biology, Graduate School of Science and Technology, Kyoto Institute of Technology, Kyoto, Japan
| | - Kenta Momitani
- Department of Applied Biology, Graduate School of Science and Technology, Kyoto Institute of Technology, Kyoto, Japan
| | - Tianli Zhang
- Department of Microbiology, Graduate School of Medical Sciences, Kumamoto University, Kumamoto, Japan
| | - Hiroyasu Tsutsuki
- Department of Microbiology, Graduate School of Medical Sciences, Kumamoto University, Kumamoto, Japan
| | - Takeshi Ishikawa
- Department of Chemistry, Biotechnology, and Chemical Engineering, Graduate School of Science and Engineering, Kagoshima University, Kagoshima, Japan
| | - Kanae Hoso
- Department of Microbiology, Graduate School of Medical Sciences, Kumamoto University, Kumamoto, Japan
| | - Koma Hamada
- Faculty of Advanced Science and Technology, Kumamoto University, Kumamoto, Japan
| | - Azizur Rahman
- Department of Microbiology, Graduate School of Medical Sciences, Kumamoto University, Kumamoto, Japan
| | - Liping Wen
- Department of Microbiology, Graduate School of Medical Sciences, Kumamoto University, Kumamoto, Japan
| | - Yosuke Maeda
- Department of Microbiology, Graduate School of Medical Sciences, Kumamoto University, Kumamoto, Japan
| | - Keiichi Yamamoto
- Department of Microbiology, Graduate School of Medical Sciences, Kumamoto University, Kumamoto, Japan
| | - Masao Matsuoka
- Department of Hematology, Rheumatology, and Infectious Diseases, Faculty of Life Sciences, Kumamoto University, Kumamoto, Japan
| | - Kenjiro Hanaoka
- Graduate School of Pharmaceutical Sciences, Keio University, Tokyo, Japan
| | - Takuro Niidome
- Faculty of Advanced Science and Technology, Kumamoto University, Kumamoto, Japan
| | - Takaaki Akaike
- Department of Environmental Medicine and Molecular Toxicology, Tohoku University Graduate School of Medicine, Sendai, Japan
| | - Tomohiro Sawa
- Department of Microbiology, Graduate School of Medical Sciences, Kumamoto University, Kumamoto, Japan
| |
Collapse
|
14
|
Han Y, Liu M, Wang Z. Key protein identification by integrating protein complex information and multi-biological features. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:18191-18206. [PMID: 38052554 DOI: 10.3934/mbe.2023808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/07/2023]
Abstract
Identifying key proteins based on protein-protein interaction networks has emerged as a prominent area of research in bioinformatics. However, current methods exhibit certain limitations, such as the omission of subcellular localization information and the disregard for the impact of topological structure noise on the reliability of key protein identification. Moreover, the influence of proteins outside a complex but interacting with proteins inside the complex on complex participation tends to be overlooked. Addressing these shortcomings, this paper presents a novel method for key protein identification that integrates protein complex information with multiple biological features. This approach offers a comprehensive evaluation of protein importance by considering subcellular localization centrality, topological centrality weighted by gene ontology (GO) similarity and complex participation centrality. Experimental results, including traditional statistical metrics, jackknife methodology metric and key protein overlap or difference, demonstrate that the proposed method not only achieves higher accuracy in identifying key proteins compared to nine classical methods but also exhibits robustness across diverse protein-protein interaction networks.
Collapse
Affiliation(s)
- Yongyin Han
- School of Computer Science and Technology, China University of Mining and Technology, China
- Xuzhou College of Industrial Technology, China
| | - Maolin Liu
- School of Computer Science and Technology, China University of Mining and Technology, China
| | - Zhixiao Wang
- School of Computer Science and Technology, China University of Mining and Technology, China
| |
Collapse
|
15
|
Li C, Guan R, Li W, Wei D, Cao S, Xu C, Chang F, Wang P, Chen L, Lei D. Single-cell RNA sequencing reveals tumor immune microenvironment in human hypopharygeal squamous cell carcinoma and lymphatic metastasis. Front Immunol 2023; 14:1168191. [PMID: 37503341 PMCID: PMC10369788 DOI: 10.3389/fimmu.2023.1168191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 06/23/2023] [Indexed: 07/29/2023] Open
Abstract
Background Human hypopharygeal squamous cell carcinoma (HSCC) is a common head and neck cancer with a poor prognosis in advanced stages. The occurrence and development of tumor is the result of mutual influence and co-evolution between tumor cells and tumor microenvironment (TME). Tumor immune microenvironment (TIME) refers to the immune microenvironment surrounding tumor cells. Studying TIME in HSCC could provide new targets and therapeutic strategies for HSCC. Methods We performed single-cell RNA sequencing (scRNA-seq) and analysis of hypopharyngeal carcinoma, paracancerous, and lymphoid tissues from five HSCC patients. Subdivide of B cells, T cells, macrophages cells, and monocytes and their distribution in three kinds of tissues as well as marker genes were analyzed. Different genes of IGHG1 plasma cells and SPP1+ macrophages between HSCC tissues, adjacent normal tissues and lymphatic tissues were analyzed. Additionally, we studied proliferating lymphocytes, T cells exhaustion, and T cell receptor (TCR) repertoire in three kinds of tissues. Results Transcriptome profiles of 132,869 single cells were obtained and grouped into seven cell clusters, including epithelial cells, lymphocytes, mononuclear phagocytics system (MPs), fibroblasts, endothelial cells (ECs), plasmacytoid dendritic cells (pDCs), and mast cells. Tumor metastasis occurred in three lymphoid tissues. Four distinct populations were identified from lymphocytes, including B cells, plasma cells, T cells and proliferating lymphocytes. We found IGHA1 and IGHG1 specific plasma cells significantly overexpressed in HSCC tissues compared with normal hypopharygeal tissues and lymphatic tissues. Five distinct populations from MPs were identified, including macrophages, monocytes, mature dendritic cells (DCs), Type 1 conventional dendritic cells (cDC1) and Type 2 conventional dendritic cells (cDC2). SPP1+ macrophages were significantly overexpressed in HSCC tissues and lymphatic tissues compared with normal hypopharygeal tissues, which are thought to be M2-type macrophages. Exhaustion of CD8+ Teff cells occurred in HSCC tissues. At last, we verified that IgA and IgG1 protein expression levels were significantly up-regulated in HSCC tissues compared to adjacent normal tissues. Conclusion Overall, this study revealed TIME in HSCC and lymphatic metastasis, and provided potential therapeutic targets for HSCC.
Collapse
|
16
|
Alotaibi BS, Ajmal A, Hakami MA, Mahmood A, Wadood A, Hu J. New drug target identification in Vibrio vulnificus by subtractive genome analysis and their inhibitors through molecular docking and molecular dynamics simulations. Heliyon 2023; 9:e17650. [PMID: 37449110 PMCID: PMC10336522 DOI: 10.1016/j.heliyon.2023.e17650] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 05/29/2023] [Accepted: 06/24/2023] [Indexed: 07/18/2023] Open
Abstract
Vibrio vulnificus is a rod shape, Gram-negative bacterium that causes sepsis (with a greater than 50% mortality rate), necrotizing fasciitis, gastroenteritis, skin, and soft tissue infection, wound infection, peritonitis, meningitis, pneumonia, keratitis, and arthritis. Based on pathogenicity V. vulnificus is categorized into three biotypes. Type 1 and type 3 cause diseases in humans while biotype 2 causes diseases in eel and fish. Due to indiscriminate use of antibiotics V. vulnificus has developed resistance to many antibiotics so curing is dramatically a challenge. V. vulnificus is resistant to cefazolin, streptomycin, tetracycline, aztreonam, tobramycin, cefepime, and gentamycin. Subtractive genome analysis is the most effective method for drug target identification. The method is based on the subtraction of homologous proteins from both pathogen and host. By this process set of proteins present only in the pathogen and perform essential functions in the pathogen can be identified. The entire proteome of Vibrio vulnificus strain ATCC 27562 was reduced step by step to a single protein predicted as the drug target. AlphaFold2 is one of the applications of deep learning algorithms in biomedicine and is correctly considered the game changer in the field of structural biology. Accuracy and speed are the major strength of AlphaFold2. In the PDB database, the crystal structure of the predicted drug target was not present, therefore the Colab notebook was used to predict the 3D structure by the AlphaFold2, and subsequently, the predicted model was validated. Potent inhibitors against the new target were predicted by virtual screening and molecular docking study. The most stable compound ZINC01318774 tightly attaches to the binding pocket of bisphosphoglycerate-independent phosphoglycerate mutase. The time-dependent molecular dynamics simulation revealed compound ZINC01318774 was superior as compared to the standard drug tetracycline in terms of stability. The availability of V. vulnificus strain ATCC 27562 has allowed in silico identification of drug target which will provide a base for the discovery of specific therapeutic targets against Vibrio vulnificus.
Collapse
Affiliation(s)
- Bader S. Alotaibi
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Al-Quwayiyah, Shaqra Univesity, Riyadh, Saudi Arabia
| | - Amar Ajmal
- Department of Biochemistry, Computational Medicinal Chemistry Laboratory, UCSS, Abdul Wali Khan University, Mardan, Pakistan
| | - Mohammed Ageeli Hakami
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Al-Quwayiyah, Shaqra Univesity, Riyadh, Saudi Arabia
| | - Arif Mahmood
- Center for Medical Genetics and Human Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, 410078, Hunan, China
| | - Abdul Wadood
- Department of Biochemistry, Computational Medicinal Chemistry Laboratory, UCSS, Abdul Wali Khan University, Mardan, Pakistan
| | - Junjian Hu
- Department of Central Laboratory, SSL, Central Hospital of Gongguan City, Affiliated Dongguan Shilong People's Hospital of Southern Medical University, Dongguan, China
| |
Collapse
|
17
|
Shami A, Alharbi NK, Al-Saeed FA, Alsaegh AA, Al Syaad KM, Abd El-Rahim IHA, Mostafa YS, Ahmed AE. In Silico Subtractive Proteomics and Molecular Docking Approaches for the Identification of Novel Inhibitors against Streptococcus pneumoniae Strain D39. Life (Basel) 2023; 13:life13051128. [PMID: 37240772 DOI: 10.3390/life13051128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 04/10/2023] [Accepted: 04/25/2023] [Indexed: 05/28/2023] Open
Abstract
Streptococcus pneumoniae is a notorious Gram-positive pathogen present asymptomatically in the nasophayrnx of humans. According to the World Health Organization (W.H.O), pneumococcus causes approximately one million deaths yearly. Antibiotic resistance in S. pneumoniae is raising considerable concern around the world. There is an immediate need to address the major issues that have arisen as a result of persistent infections caused by S. pneumoniae. In the present study, subtractive proteomics was used in which the entire proteome of the pathogen consisting of 1947 proteins is effectively decreased to a finite number of possible targets. Various kinds of bioinformatics tools and software were applied for the discovery of novel inhibitors. The CD-HIT analysis revealed 1887 non-redundant sequences from the entire proteome. These non-redundant proteins were submitted to the BLASTp against the human proteome and 1423 proteins were screened as non-homologous. Further, databases of essential genes (DEGG) and J browser identified almost 171 essential proteins. Moreover, non-homologous, essential proteins were subjected in KEGG Pathway Database which shortlisted six unique proteins. In addition, the subcellular localization of these unique proteins was checked and cytoplasmic proteins were chosen for the druggability analysis, which resulted in three proteins, namely DNA binding response regulator (SPD_1085), UDP-N-acetylmuramate-L-alanine Ligase (SPD_1349) and RNA polymerase sigma factor (SPD_0958), which can act as a promising potent drug candidate to limit the toxicity caused by S. pneumoniae. The 3D structures of these proteins were predicted by Swiss Model, utilizing the homology modeling approach. Later, molecular docking by PyRx software 0.8 version was used to screen a library of phytochemicals retrieved from PubChem and ZINC databases and already approved drugs from DrugBank database against novel druggable targets to check their binding affinity with receptor proteins. The top two molecules from each receptor protein were selected based on the binding affinity, RMSD value, and the highest conformation. Finally, the absorption, distribution, metabolism, excretion, and toxicity (ADMET) analyses were carried out by utilizing the SWISS ADME and Protox tools. This research supported the discovery of cost-effective drugs against S. pneumoniae. However, more in vivo/in vitro research should be conducted on these targets to investigate their pharmacological efficacy and their function as efficient inhibitors.
Collapse
Affiliation(s)
- Ashwag Shami
- Department of Biology, College of Sciences, Princess Nourah Bint Abdulrahman University, Riyadh 11617, Saudi Arabia
| | - Nada K Alharbi
- Department of Biology, College of Sciences, Princess Nourah Bint Abdulrahman University, Riyadh 11617, Saudi Arabia
| | - Fatimah A Al-Saeed
- Research Centre, Department of Biology, College of Science, King Khalid University, Abha 61413, Saudi Arabia
- Advanced Material Science (RCAMS), King Khalid University, Abha 61413, Saudi Arabia
| | - Aiman A Alsaegh
- Department of Laboratory Medicine, Faculty of Applied Medical Sciences, Umm Al-Qura University, Makkah Al-Mukarramah 24382, Saudi Arabia
| | - Khalid M Al Syaad
- Biology Department, Faculty of Science, King Khalid University, P.O. Box 9004, Abha 61413, Saudi Arabia
- The Research Center, Faculty of Science, King Khalid University, P.O. Box 9004, Abha 61413, Saudi Arabia
| | - Ibrahim H A Abd El-Rahim
- Department of Environmental and Health Research, Umm Al-Qura University, P.O. Box 6287, Makkah Al-Mukarramah 21955, Saudi Arabia
| | - Yasser Sabry Mostafa
- Biology Department, Faculty of Science, King Khalid University, P.O. Box 9004, Abha 61413, Saudi Arabia
| | - Ahmed Ezzat Ahmed
- Biology Department, Faculty of Science, King Khalid University, P.O. Box 9004, Abha 61413, Saudi Arabia
- Department of Theriogenology, Faculty of Veterinary Medicine, South Valley University, Qena 83523, Egypt
| |
Collapse
|
18
|
Khan K, Jalal K, Uddin R. Pangenome profiling of novel drug target against vancomycin-resistant Enterococcus faecium. J Biomol Struct Dyn 2023; 41:15647-15660. [PMID: 36935100 DOI: 10.1080/07391102.2023.2191134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2022] [Accepted: 03/07/2023] [Indexed: 03/20/2023]
Abstract
Enterococcus faecium is a frequent causative agent of nosocomial infection mainly acquired from outgoing hospital patients (Hospital Acquired Infection-HAIs). They are largely involved in the outbreaks of bacteremia, UTI, and endocarditis with a high transmissibility rate. The recent emergence of VRE strain (i.e. vancomycin resistant enterococcus) turned it into high priority pathogen for which new drug research is of dire need. Therefore, in current study, pangenome and resistome analyses were performed for available antibiotic-resistant genomes (n = 216) of E. faecium. It resulted in the prediction of around 5,059 genes as an accessory gene, 1,076 genes as core and 1,558 genes made up a unique genome fraction. Core genes common to all strains were further used for the identification of potent drug targets by applying subtractive genomics approach. Moreover, the COG functional analysis showed that these genomes are highly enriched in metabolic pathways such as in translational, ribosomal, proteins, carbohydrates and nucleotide transport metabolism. Through subtractive genomics it was observed that 431 proteins were non-homologous to the human proteome, 166 identified as essential for pathogen survival while 26 as potential and unique therapeutic targets. Finally, 3-dehydroquinate dehydrogenase was proposed as a potent drug target for further therapeutic candidate identification. Moreover, the molecular docking and dynamic simulation technique were applied to performed a virtual screening of natural product libraries (i.e., TCM and Ayurvedic compounds) along with 3-amino-4,5-dihydroxy-cyclohex-1-enecarboxylate (DHS) as a standard compound to validate the study. Consequently, Argeloside I, Apigenin-7-O-gentiobioside (from Ayurvedic library), ZINC85571062, and ZINC85570908 (TCM library) compounds were identified as potential inhibitors of 3-dehydroquinate dehydrogenase. The study proposed new compounds as novel therapeutics, however, further experimental validation is needed as a follow-up.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Kanwal Khan
- Dr. Panjwani Center for Molecular Medicine and Drug Research, International Center for Chemical and Biological Sciences, University of Karachi, Karachi, Pakistan
| | - Khurshid Jalal
- H.E.J. Research Institute of Chemistry, International Center for Chemical and Biological Sciences, University of Karachi, Karachi, Pakistan
| | - Reaz Uddin
- Dr. Panjwani Center for Molecular Medicine and Drug Research, International Center for Chemical and Biological Sciences, University of Karachi, Karachi, Pakistan
| |
Collapse
|
19
|
Seal S, Banerjee N, Mahato R, Kundu T, Sinha D, Chakraborty T, Sinha D, Sau K, Chatterjee S, Sau S. Serine 106 preserves the tertiary structure, function, and stability of a cyclophilin from Staphylococcus aureus. J Biomol Struct Dyn 2023; 41:1479-1494. [PMID: 34967275 DOI: 10.1080/07391102.2021.2021992] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
SaCyp, a staphylococcal cyclophilin involved in both protein folding and pathogenesis, has a Ser residue at position 106 and a Trp residue at position 136. While Ser 106 of SaCyp aligned with a cyclosporin A (CsA) binding Ala residue, its Trp 136 aligned with a Trp or a Phe residue of most other cyclophilins. To demonstrate the exact roles of Ser 106 and Trp 136 in SaCyp, we have elaborately studied rCyp[S106A] and rCyp[W136A], two-point mutants of a recombinant SaCyp (rCyp) harboring an Ala substitution at positions 106 and 136, respectively. Of the mutants, rCyp[W136A] showed the rCyp-like CsA binding affinity and peptidyl-prolyl cis-trans isomerase (PPIase) activity. Conversely, the PPIase activity, CsA binding affinity, stability, tertiary structure, surface hydrophobicity, and Trp accessibility of rCyp[S106A] notably differed from those of rCyp. The computational experiments also reveal that the structure, dimension, and fluctuation of SaCyp are not identical to those of SaCyp[S106A]. Furthermore, Ser at position 106 of SaCyp, compared to Ala at the same position, formed a higher number of non-covalent bonds with CsA. Collectively, Ser 106 is an indispensable residue for SaCyp that keeps its tertiary structure, function, and stability intact.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Soham Seal
- Department of Biochemistry, Bose Institute, Kolkata, West Bengal, India
| | - Nilanjan Banerjee
- Department of Biophysics, Bose Institute, Kolkata, West Bengal, India
| | - Rohit Mahato
- Department of Biochemistry, Bose Institute, Kolkata, West Bengal, India
| | - Tanmoy Kundu
- Department of Biochemistry, Bose Institute, Kolkata, West Bengal, India
| | - Debabrata Sinha
- Department of Biochemistry, Bose Institute, Kolkata, West Bengal, India
| | | | - Debasmita Sinha
- Department of Biochemistry, Bose Institute, Kolkata, West Bengal, India
| | - Keya Sau
- Department of Biotechnology, Haldia Institute of Technology, Haldia, West Bengal, India
| | | | - Subrata Sau
- Department of Biochemistry, Bose Institute, Kolkata, West Bengal, India
| |
Collapse
|
20
|
Santos AS, Borges Dos Anjos LR, Costa VAF, Freitas VAQ, Zara ALDSA, Costa CR, Neves BJ, Silva MDRR. In silico-chemogenomic repurposing of new chemical scaffolds for histoplasmosis treatment. J Mycol Med 2023; 33:101363. [PMID: 36842411 DOI: 10.1016/j.mycmed.2023.101363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 01/10/2023] [Accepted: 02/10/2023] [Indexed: 02/13/2023]
Abstract
BACKGROUND Histoplasmosis is a systemic form of endemic mycosis to the American continent and may be lethal to people living with HIV/AIDS. The drugs available for treating histoplasmosis are limited, costly, and highly toxic. New drug development is time-consuming and costly; hence, drug repositioning is an advantageous strategy for discovering new therapeutic options. OBJECTIVE This study was conducted to identify drugs that can be repositioned for treating histoplasmosis in immunocompromised patients. METHODS Homologous proteins among Histoplasma capsulatum strains were selected and used to search for homologous targets in the DrugBank and Therapeutic Target Database. Essential genes were selected using Saccharomyces cerevisiae as a model, and functional regions of the therapeutic targets were analyzed. The antifungal activity of the selected drugs was verified, and homology modeling and molecular docking were performed to verify the interactions between the drugs with low inhibitory concentration values and their corresponding targets. RESULTS We selected 149 approved drugs with potential activity against histoplasmosis, among which eight were selected for evaluating their in vitro activity. For drugs with low minimum inhibitory concentration values, such as mebendazole, everolimus, butenafine, and bifonazole, molecular docking studies were performed. A chemogenomic framework revealed lanosterol 14-α-demethylase, squalene monooxygenase, serine/threonine-protein kinase mTOR, and the β-4B tubulin chain of H. capsulatum, respectively, as the protein targets of the drugs. CONCLUSIONS Our strategy can be used to identify promising antifungal targets, and drugs with repositioning potential for treating H. capsulatum.
Collapse
Affiliation(s)
- Andressa Santana Santos
- Institute of Tropical Pathology and Public Health (IPTSP), Federal University of Goiás, Goiânia, Brazil
| | | | | | | | | | - Carolina Rodrigues Costa
- Institute of Tropical Pathology and Public Health (IPTSP), Federal University of Goiás, Goiânia, Brazil
| | - Bruno Junior Neves
- Laboratory of Cheminformatics (LabChem), Faculty of Pharmacy, Federal University of Goiás, Goiânia, Brazil
| | | |
Collapse
|
21
|
Nithya C, Kiran M, Nagarajaram HA. Dissection of hubs and bottlenecks in a protein-protein interaction network. Comput Biol Chem 2023; 102:107802. [PMID: 36603332 DOI: 10.1016/j.compbiolchem.2022.107802] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 11/20/2022] [Accepted: 12/08/2022] [Indexed: 12/23/2022]
Abstract
Analysis of degree centrality in conjunction with betweenness centrality of proteins in a human protein-protein interaction network revealed three categories of centrally important proteins: a) proteins with high degree and betweenness (hub-bottlenecks denoted as MX), b) proteins with high betweenness and low degree (non-hub-bottlenecks/pure bottlenecks denoted as PB) and c) proteins with high degree and low betweenness (hub-non-bottlenecks/pure hubs denoted as PH). When subjected to a detailed statistical analysis of their molecular-level properties, the proteins belonging to each of these categories were found to be associated with distinct canonical molecular properties, i.e., "molecular markers". The MX proteins are a) conformationally versatile, mainly comprising of essential proteins, b) the targets for interactions by the proteins of viral and bacterial pathogens, c) evolutionally constrained, involved in multiple pathways, enriched with disease genes and d) involved in the functions such as protein stabilization, phosphorylation, and mRNA slicing processes. PB proteins are a) enriched with extracellular and cancer-related proteins, b) enriched with the approved drug targets and c) involved in cell-cell signaling processes. Finally, PH are a) structurally versatile, b) enriched with essential proteins primarily involved in housekeeping processes (transcription and replication). The fact that the proteins belonging to these three categories form three distinct sets in terms of their molecular properties reveals the existence of trichotomy among hubs and bottlenecks, and this knowledge is of paramount importance while prioritizing protein targets for further studies such as drug design and disease association studies based on their network centrality values.
Collapse
Affiliation(s)
- Chandramohan Nithya
- Department of Biotechnology and Bioinformatics, School of Life Sciences, University of Hyderabad, Hyderabad, Telangana 500046, India
| | - Manjari Kiran
- Department of Systems and Computational Biology, School of Life Sciences, University of Hyderabad, Hyderabad, Telangana 500046, India
| | | |
Collapse
|
22
|
Chen S, Huang C, Wang L, Zhou S. A disease-related essential protein prediction model based on the transfer neural network. Front Genet 2023; 13:1087294. [PMID: 36685976 PMCID: PMC9845409 DOI: 10.3389/fgene.2022.1087294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 12/14/2022] [Indexed: 01/06/2023] Open
Abstract
Essential proteins play important roles in the development and survival of organisms whose mutations are proven to be the drivers of common internal diseases having higher prevalence rates. Due to high costs of traditional biological experiments, an improved Transfer Neural Network (TNN) was designed to extract raw features from multiple biological information of proteins first, and then, based on the newly-constructed Transfer Neural Network, a novel computational model called TNNM was designed to infer essential proteins in this paper. Different from traditional Markov chain, since Transfer Neural Network adopted the gradient descent algorithm to automatically obtain the transition probability matrix, the prediction accuracy of TNNM was greatly improved. Moreover, additional antecedent memory coefficient and bias term were introduced in Transfer Neural Network, which further enhanced both the robustness and the non-linear expression ability of TNNM as well. Finally, in order to evaluate the identification performance of TNNM, intensive experiments have been executed based on two well-known public databases separately, and experimental results show that TNNM can achieve better performance than representative state-of-the-art prediction models in terms of both predictive accuracies and decline rate of accuracies. Therefore, TNNM may play an important role in key protein prediction in the future.
Collapse
Affiliation(s)
- Sisi Chen
- The First Hospital of Hunan University of Chinese Medicine, Changsha, Hunan, China
| | - Chiguo Huang
- Big Data Innovation and Entrepreneurship Education Center of Hunan Province, Changsha University, Changsha, China,*Correspondence: Chiguo Huang, ; Lei Wang, ; Shunxian Zhou,
| | - Lei Wang
- The First Hospital of Hunan University of Chinese Medicine, Changsha, Hunan, China,Big Data Innovation and Entrepreneurship Education Center of Hunan Province, Changsha University, Changsha, China,*Correspondence: Chiguo Huang, ; Lei Wang, ; Shunxian Zhou,
| | - Shunxian Zhou
- The First Hospital of Hunan University of Chinese Medicine, Changsha, Hunan, China,Big Data Innovation and Entrepreneurship Education Center of Hunan Province, Changsha University, Changsha, China,College of Information Science and Engineering, Hunan Women’s University, Changsha, Hunan, China,*Correspondence: Chiguo Huang, ; Lei Wang, ; Shunxian Zhou,
| |
Collapse
|
23
|
Xue X, Zhang W, Fan A. Comparative analysis of gene ontology-based semantic similarity measurements for the application of identifying essential proteins. PLoS One 2023; 18:e0284274. [PMID: 37083829 PMCID: PMC10121005 DOI: 10.1371/journal.pone.0284274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Accepted: 03/28/2023] [Indexed: 04/22/2023] Open
Abstract
Identifying key proteins from protein-protein interaction (PPI) networks is one of the most fundamental and important tasks for computational biologists. However, the protein interactions obtained by high-throughput technology are characterized by a high false positive rate, which severely hinders the prediction accuracy of the current computational methods. In this paper, we propose a novel strategy to identify key proteins by constructing reliable PPI networks. Five Gene Ontology (GO)-based semantic similarity measurements (Jiang, Lin, Rel, Resnik, and Wang) are used to calculate the confidence scores for protein pairs under three annotation terms (Molecular function (MF), Biological process (BP), and Cellular component (CC)). The protein pairs with low similarity values are assumed to be low-confidence links, and the refined PPI networks are constructed by filtering the low-confidence links. Six topology-based centrality methods (the BC, DC, EC, NC, SC, and aveNC) are applied to test the performance of the measurements under the original network and refined network. We systematically compare the performance of the five semantic similarity metrics with the three GO annotation terms on four benchmark datasets, and the simulation results show that the performance of these centrality methods under refined PPI networks is relatively better than that under the original networks. Resnik with a BP annotation term performs best among all five metrics with the three annotation terms. These findings suggest the importance of semantic similarity metrics in measuring the reliability of the links between proteins and highlight the Resnik metric with the BP annotation term as a favourable choice.
Collapse
Affiliation(s)
- Xiaoli Xue
- School of Science, East China Jiaotong University, Nanchang, China
| | - Wei Zhang
- School of Science, East China Jiaotong University, Nanchang, China
| | - Anjing Fan
- School of Computer and Information Engineering, Anyang Normal University, Anyang, China
| |
Collapse
|
24
|
Payra AK, Saha B, Ghosh A. MM-CCNB: Essential protein prediction using MAX-MIN strategies and compartment of common neighboring approach. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 228:107247. [PMID: 36427433 DOI: 10.1016/j.cmpb.2022.107247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Revised: 10/16/2022] [Accepted: 11/14/2022] [Indexed: 06/16/2023]
Abstract
BACKGROUND AND OBJECTIVE Proteins are indispensable for the flow of the life of living organisms. Protein pairs in interaction exhibit more functional activities than individuals. These activities have been considered an essential measure in predicting their essentiality. Neighborhood approaches have been used frequently in the prediction of essentiality scores. All paired neighbors of the essential proteins are nominated for the suitable candidate seeds for prediction. Still now Jaccard's coefficient is limited to predicting functions, homologous groups, sequence analysis, etc. It really motivate us to predict essential proteins efficiently using different computational approaches. METHODS In our work, we proposed modified Jaccard's coefficient to predict essential proteins. We have proposed a novel methodology for predicting essential proteins using MAX-MIN strategies and modified Jaccard's coefficient approach. RESULTS The performance of our proposed methodology has been analyzed for Saccharomyces cerevisiae datasets with an accuracy of more than 80%. It has been observed that the proposed algorithm is outperforms with an accuracy of 0.78, 0.74, 0.79, and 0.862 for YDIP, YMIPS, YHQ, and YMBD datasets respectivly. CONCLUSIONS There are several computational approaches in the existing state-of-art model of essential protein prediction. It has been noted that our predicted methodology outperforms other existing models viz. different centralities, local interaction density combined with protein complexes, modified monkey algorithm and ortho_sim_loc methods.
Collapse
Affiliation(s)
- Anjan Kumar Payra
- Department of Computer Science & Engineering, Dr. Sudhir Chandra Sur Degree Engineering College, 540, Dum Dum Road, Near Dum Dum Jn. Station, Surermath, Kolkata 700074, India.
| | - Banani Saha
- Department of Computer Science & Engineering, University of Calcutta, Saltlake City Kolkata 700073, India
| | - Anupam Ghosh
- Department of Computer Science & Engineering, Netaji Subhash Engineering College, Techno City, Panchpota, Garia, Kolkata 700152, India.
| |
Collapse
|
25
|
Wang L, Peng J, Kuang L, Tan Y, Chen Z. Identification of Essential Proteins Based on Local Random Walk and Adaptive Multi-View Multi-Label Learning. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:3507-3516. [PMID: 34788220 DOI: 10.1109/tcbb.2021.3128638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Accumulating evidences have indicated that essential proteins play vital roles in human physiological process. In recent years, although researches on prediction of essential proteins have been developing rapidly, there are as well various limitations such as unsatisfactory data suitability, low accuracy of predictive results and so on. In this manuscript, a novel method called RWAMVL was proposed to predict essential proteins based on the Random Walk and the Adaptive Multi-View multi-label Learning. In RWAMVL, considering that the inherent noise is ubiquitous in existing datasets of known protein-protein interactions (PPIs), a variety of different features including biological features of proteins and topological features of PPI networks were obtained by adopting adaptive multi-view multi-label learning first. And then, an improved random walk method was designed to detect essential proteins based on these different features. Finally, in order to verify the predictive performance of RWAMVL, intensive experiments were done to compare it with multiple state-of-the-art predictive methods under different expeditionary frameworks. And as a result, RWAMVL was proven that it can achieve better prediction accuracy than all those competitive methods, which demonstrated as well that RWAMVL may be a potential tool for prediction of key proteins in the future.
Collapse
|
26
|
Wang C, Zhang H, Ma H, Wang Y, Cai K, Guo T, Yang Y, Li Z, Zhu Y. Inference of pan-cancer related genes by orthologs matching based on enhanced LSTM model. Front Microbiol 2022; 13:963704. [PMID: 36267181 PMCID: PMC9577021 DOI: 10.3389/fmicb.2022.963704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 08/16/2022] [Indexed: 11/13/2022] Open
Abstract
Many disease-related genes have been found to be associated with cancer diagnosis, which is useful for understanding the pathophysiology of cancer, generating targeted drugs, and developing new diagnostic and treatment techniques. With the development of the pan-cancer project and the ongoing expansion of sequencing technology, many scientists are focusing on mining common genes from The Cancer Genome Atlas (TCGA) across various cancer types. In this study, we attempted to infer pan-cancer associated genes by examining the microbial model organism Saccharomyces Cerevisiae (Yeast) by homology matching, which was motivated by the benefits of reverse genetics. First, a background network of protein-protein interactions and a pathogenic gene set involving several cancer types in humans and yeast were created. The homology between the human gene and yeast gene was then discovered by homology matching, and its interaction sub-network was obtained. This was undertaken following the principle that the homologous genes of the common ancestor may have similarities in expression. Then, using bidirectional long short-term memory (BiLSTM) in combination with adaptive integration of heterogeneous information, we further explored the topological characteristics of the yeast protein interaction network and presented a node representation score to evaluate the node ability in graphs. Finally, homologous mapping for human genes matched the important genes identified by ensemble classifiers for yeast, which may be thought of as genes connected to all types of cancer. One way to assess the performance of the BiLSTM model is through experiments on the database. On the other hand, enrichment analysis, survival analysis, and other outcomes can be used to confirm the biological importance of the prediction results. You may access the whole experimental protocols and programs at https://github.com/zhuyuan-cug/AI-BiLSTM/tree/master.
Collapse
Affiliation(s)
- Chao Wang
- Department of Surgery, Hepatic Surgery Center, Institute of Hepato-Pancreato-Biliary Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Houwang Zhang
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR, China
| | - Haishu Ma
- School of Automation, China University of Geosciences, Wuhan, China
- Hubei Key Laboratory of Advanced Control and Intelligent Automation for Complex Systems, Wuhan, China
- Engineering Research Center of Intelligent Technology for Geo-Exploration, Wuhan, China
| | - Yawen Wang
- School of Mathematics and Physics, China University of Geosciences, Wuhan, China
| | - Ke Cai
- School of Automation, China University of Geosciences, Wuhan, China
- Hubei Key Laboratory of Advanced Control and Intelligent Automation for Complex Systems, Wuhan, China
- Engineering Research Center of Intelligent Technology for Geo-Exploration, Wuhan, China
| | - Tingrui Guo
- School of Automation, China University of Geosciences, Wuhan, China
- Hubei Key Laboratory of Advanced Control and Intelligent Automation for Complex Systems, Wuhan, China
- Engineering Research Center of Intelligent Technology for Geo-Exploration, Wuhan, China
| | - Yuanhang Yang
- School of Mathematics and Physics, China University of Geosciences, Wuhan, China
| | - Zhen Li
- School of Mathematics and Physics, China University of Geosciences, Wuhan, China
| | - Yuan Zhu
- School of Automation, China University of Geosciences, Wuhan, China
- Hubei Key Laboratory of Advanced Control and Intelligent Automation for Complex Systems, Wuhan, China
- Engineering Research Center of Intelligent Technology for Geo-Exploration, Wuhan, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Shanghai, China
- *Correspondence: Yuan Zhu
| |
Collapse
|
27
|
Identification of Concomitant Inhibitors against Glutamine Synthetase and Isocitrate Lyase in Mycobacterium tuberculosis from Natural Sources. BIOMED RESEARCH INTERNATIONAL 2022; 2022:4661491. [PMID: 36225979 PMCID: PMC9550479 DOI: 10.1155/2022/4661491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Revised: 08/05/2022] [Accepted: 08/23/2022] [Indexed: 11/17/2022]
Abstract
Tuberculosis (T.B.) is a disease that occurs due to infection by the bacterium, Mycobacterium tuberculosis (Mtb), which is responsible for millions of deaths every year. Due to the emergence of multidrug and extensive drug-resistant Mtb strains, there is an urgent need to develop more powerful drugs for inclusion in the current tuberculosis treatment regime. In this study, 1778 molecules from four medicinal plants, Azadirachta indica, Camellia sinensis, Adhatoda vasica, and Ginkgo biloba, were selected and docked against two chosen drug targets, namely, Glutamine Synthetase (G.S.) and Isocitrate Lyase (I.C.L.). Molecular Docking was performed using the Glide module of the Schrӧdinger suite to identify the best-performing ligands; the complexes formed by the best-performing ligands were further investigated for their binding stability via Molecular Dynamics Simulation of 100 ns. The present study suggests that Azadiradione from Azadirachta indica possesses the potential to inhibit Glutamine Synthetase and Isocitrate Lyase of M. tuberculosis concomitantly. The excellent docking score of the ligand and the stability of receptor-ligand complexes, coupled with the complete pharmacokinetic profile of Azadiradione, support the proposal of the small molecule, Azadiradione as a novel antitubercular agent. Further, wet lab analysis of Azadiradione may lead to the possible discovery of a novel antitubercular drug.
Collapse
|
28
|
Rule-Based Pruning and In Silico Identification of Essential Proteins in Yeast PPIN. Cells 2022; 11:cells11172648. [PMID: 36078056 PMCID: PMC9454873 DOI: 10.3390/cells11172648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 08/18/2022] [Accepted: 08/22/2022] [Indexed: 11/25/2022] Open
Abstract
Proteins are vital for the significant cellular activities of living organisms. However, not all of them are essential. Identifying essential proteins through different biological experiments is relatively more laborious and time-consuming than the computational approaches used in recent times. However, practical implementation of conventional scientific methods sometimes becomes challenging due to poor performance impact in specific scenarios. Thus, more developed and efficient computational prediction models are required for essential protein identification. An effective methodology is proposed in this research, capable of predicting essential proteins in a refined yeast protein–protein interaction network (PPIN). The rule-based refinement is done using protein complex and local interaction density information derived from the neighborhood properties of proteins in the network. Identification and pruning of non-essential proteins are equally crucial here. In the initial phase, careful assessment is performed by applying node and edge weights to identify and discard the non-essential proteins from the interaction network. Three cut-off levels are considered for each node and edge weight for pruning the non-essential proteins. Once the PPIN has been filtered out, the second phase starts with two centralities-based approaches: (1) local interaction density (LID) and (2) local interaction density with protein complex (LIDC), which are successively implemented to identify the essential proteins in the yeast PPIN. Our proposed methodology achieves better performance in comparison to the existing state-of-the-art techniques.
Collapse
|
29
|
Mazaya M, Kwon YK. In Silico Pleiotropy Analysis in KEGG Signaling Networks Using a Boolean Network Model. Biomolecules 2022; 12:biom12081139. [PMID: 36009032 PMCID: PMC9406064 DOI: 10.3390/biom12081139] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 08/10/2022] [Accepted: 08/15/2022] [Indexed: 11/16/2022] Open
Abstract
Pleiotropy, which refers to the ability of different mutations on the same gene to cause different pathological effects in human genetic diseases, is important in understanding system-level biological diseases. Although some biological experiments have been proposed, still little is known about pleiotropy on gene–gene dynamics, since most previous studies have been based on correlation analysis. Therefore, a new perspective is needed to investigate pleiotropy in terms of gene–gene dynamical characteristics. To quantify pleiotropy in terms of network dynamics, we propose a measure called in silico Pleiotropic Scores (sPS), which represents how much a gene is affected against a pair of different types of mutations on a Boolean network model. We found that our model can identify more candidate pleiotropic genes that are not known to be pleiotropic than the experimental database. In addition, we found that many types of functionally important genes tend to have higher sPS values than other genes; in other words, they are more pleiotropic. We investigated the relations of sPS with the structural properties in the signaling network and found that there are highly positive relations to degree, feedback loops, and centrality measures. This implies that the structural characteristics are principles to identify new pleiotropic genes. Finally, we found some biological evidence showing that sPS analysis is relevant to the real pleiotropic data and can be considered a novel candidate for pleiotropic gene research. Taken together, our results can be used to understand the dynamics pleiotropic characteristics in complex biological systems in terms of gene–phenotype relations.
Collapse
Affiliation(s)
- Maulida Mazaya
- Research Center for Computing, National Research and Innovation Agency (BRIN), Cibinong Science Center, Jl. Raya Jakarta-Bogor KM 46, Cibinong 16911, West Java, Indonesia
| | - Yung-Keun Kwon
- School of IT Convergence, University of Ulsan, 93 Daehak-ro, Nam-gu, Ulsan 44610, Korea
- Correspondence:
| |
Collapse
|
30
|
ELIMINATOR: essentiality analysis using multisystem networks and integer programming. BMC Bioinformatics 2022; 23:324. [PMID: 35933325 PMCID: PMC9357337 DOI: 10.1186/s12859-022-04855-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Accepted: 07/21/2022] [Indexed: 11/28/2022] Open
Abstract
A gene is considered as essential when it is indispensable for cells to grow and replicate in a certain environment. However, gene essentiality is not a structural property but rather a contextual one, which depends on the specific biological conditions affecting the cell. This circumstantial essentiality of genes is what brings the attention of scientist since we can identify genes essential for cancer cells but not essential for healthy cells. This same contextuality makes their identification extremely challenging. Huge experimental efforts such as Project Achilles where the essentiality of thousands of genes is measured together with a plethora of molecular data (transcriptomics, copy number, mutations, etc.) in over one thousand cell lines can shed light on the causality behind the essentiality of a gene in a given environment. Here, we present an in-silico method for the identification of patient-specific essential genes using constraint-based modelling (CBM). Our method expands the ideas behind traditional CBM to accommodate multisystem networks. In essence, it first calculates the minimum number of lowly expressed genes required to be activated by the cell to sustain life as defined by a set of requirements; and second, it performs an exhaustive in-silico gene knockout to find those that lead to the need of activating additional lowly expressed genes. We validated the proposed methodology using a set of 452 cancer cell lines derived from the Cancer Cell Line Encyclopedia where an exhaustive experimental large-scale gene knockout study using CRISPR (Achilles Project) evaluates the impact of each removal. We also show that the integration of different essentiality predictions per gene, what we called Essentiality Congruity Score, reduces the number of false positives. Finally, we explored our method in a breast cancer patient dataset, and our results showed high concordance with previous publications. These findings suggest that identifying genes whose activity is fundamental to sustain cellular life in a patient-specific manner is feasible using in-silico methods. The patient-level gene essentiality predictions can pave the way for precision medicine by identifying potential drug targets whose deletion can induce death in tumour cells.
Collapse
|
31
|
Wu C, Feng Z, Zheng J, Zhang H, Cao J, Yan H. Star topology convolution for graph representation learning. COMPLEX INTELL SYST 2022. [DOI: 10.1007/s40747-022-00744-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
AbstractWe present a novel graph convolutional method called star topology convolution (STC). This method makes graph convolution more similar to conventional convolutional neural networks (CNNs) in Euclidean feature spaces. STC learns subgraphs which have a star topology rather than learning a fixed graph like most spectral methods. Due to the properties of a star topology, STC is graph-scale free (without a fixed graph size constraint). It has fewer parameters in its convolutional filter and is inductive, so it is more flexible and can be applied to large and evolving graphs. The convolutional filter is learnable and localized, similar to CNNs in Euclidean feature spaces, and can share weights across graphs. To test the method, STC was compared with the state-of-the-art graph convolutional methods in a supervised learning setting on nine node properties prediction benchmark datasets: Cora, Citeseer, Pubmed, PPI, Arxiv, MAG, ACM, DBLP, and IMDB. The experimental results showed that STC achieved the state-of-the-art performance on all these datasets and maintained good robustness. In an essential protein identification task, STC outperformed the state-of-the-art essential protein identification methods. An application of using pretrained STC as the embedding for feature extraction of some downstream classification tasks was introduced. The experimental results showed that STC can share weights across different graphs and be used as the embedding to improve the performance of downstream tasks.
Collapse
|
32
|
Noori S, Al‐A'araji N, Al‐Shamery E. Construction of dynamic protein interaction network based on gene expression data and quartile one principle. Proteins 2022; 90:1219-1228. [DOI: 10.1002/prot.26304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 01/12/2022] [Accepted: 01/13/2022] [Indexed: 11/09/2022]
Affiliation(s)
- Soheir Noori
- Software Department University of Babylon Hillah Babylon Iraq
- Computer Science Department University of Kerbala Karbala Iraq
| | | | - Eman Al‐Shamery
- Software Department University of Babylon Hillah Babylon Iraq
| |
Collapse
|
33
|
Zhang Z, Luo Y, Jiang M, Wu D, Zhang W, Yan W, Zhao B. An efficient strategy for identifying essential proteins based on homology, subcellular location and protein-protein interaction information. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2022; 19:6331-6343. [PMID: 35603404 DOI: 10.3934/mbe.2022296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
High throughput biological experiments are expensive and time consuming. For the past few years, many computational methods based on biological information have been proposed and widely used to understand the biological background. However, the processing of biological information data inevitably produces false positive and false negative data, such as the noise in the Protein-Protein Interaction (PPI) networks and the noise generated by the integration of a variety of biological information. How to solve these noise problems is the key role in essential protein predictions. An Identifying Essential Proteins model based on non-negative Matrix Symmetric tri-Factorization and multiple biological information (IEPMSF) is proposed in this paper, which utilizes only the PPI network proteins common neighbor characters to develop a weighted network, and uses the non-negative matrix symmetric tri-factorization method to find more potential interactions between proteins in the network so as to optimize the weighted network. Then, using the subcellular location and lineal homology information, the starting score of proteins is determined, and the random walk algorithm with restart mode is applied to the optimized network to mark and rank each protein. We tested the suggested forecasting model against current representative approaches using a public database. Experiment shows high efficiency of new method in essential proteins identification. The effectiveness of this method shows that it can dramatically solve the noise problems that existing in the multi-source biological information itself and cased by integrating them.
Collapse
Affiliation(s)
- Zhihong Zhang
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, Hunan 410022, China
| | - Yingchun Luo
- Department of Ultrasound, Hunan Provincial Maternal and Child Health Care Hospital, Changsha, Hunan 410008, China
| | - Meiping Jiang
- Department of Ultrasound, Hunan Provincial Maternal and Child Health Care Hospital, Changsha, Hunan 410008, China
| | - Dongjie Wu
- Department of Banking and Finance, Monash University, Clayton, Victoria 3168, Australia
| | - Wang Zhang
- Department of Optoelectronic Engineering, Jinan University, Guangzhou, Guangdong 510632, China
| | - Wei Yan
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, Hunan 410022, China
| | - Bihai Zhao
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, Hunan 410022, China
| |
Collapse
|
34
|
Lu H, Shang C, Zou S, Cheng L, Yang S, Wang L. A Novel Method for Predicting Essential Proteins by Integrating Multidimensional Biological Attribute Information and Topological Properties. Curr Bioinform 2022. [DOI: 10.2174/1574893617666220304201507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
Essential proteins are indispensable to the maintenance of life activities and play essential roles in the areas of synthetic biology. Identification of essential proteins by computational methods has become a hot topic in recent years because of its efficiency.
Objective:
Identification of essential proteins is of important significance and practical use in the areas of synthetic biology, drug targets, and human disease genes.
Method:
In this paper, a method called EOP(Edge clustering coefficient -Orthologous-Protein) is proposed to infer potential essential proteins by combining Multidimensional Biological Attribute Information of proteins with Topological Properties of the protein-protein interaction network.
Results:
The simulation results on the yeast protein interaction network show that the number of essential proteins identified by this method is more than the number identified by the other 12 methods(DC, IC, EC, SC, BC, CC, NC, LAC, PEC, CoEWC, POEM, DWE). Especially compared with DC(Degree Centrality), the SN(sensitivity) is 9% higher, when the candidate protein is 1%, the recognition rate is 34% higher, when the candidate protein is 5%, 10%, 15%, 20%, 25% the recognition rate is 36%, 22%, 15%, 11%, 8% higher respectively.
Conclusion:
Experimental results show that our method can achieve satisfactory prediction results, which may provide references for future research.
Collapse
Affiliation(s)
- Hanyu Lu
- College of Big Data and Information Engineering, Guizhou University, Guizhou, China
| | - Chen Shang
- College of Big Data and Information Engineering, Guizhou University, Guizhou, China
| | - Sai Zou
- College of Big Data and Information Engineering, Guizhou University, Guizhou, China
| | - Lihong Cheng
- College of Foreign Languages, Dalian Jiaotong University, China
| | - Shikong Yang
- College of Big Data and Information Engineering, Guizhou University, Guizhou, China
| | - Lei Wang
- College of Computer Engineering and Applied Mathematics, Changsha University, China
| |
Collapse
|
35
|
Khan K, Jalal K, Uddin R. An integrated in silico based subtractive genomics and reverse vaccinology approach for the identification of novel vaccine candidate and chimeric vaccine against XDR Salmonella typhi H58. Genomics 2022; 114:110301. [PMID: 35149170 DOI: 10.1016/j.ygeno.2022.110301] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 12/25/2021] [Accepted: 02/01/2022] [Indexed: 12/14/2022]
Abstract
Salmonella typhi is notorious for causing enteric fever which is also known as typhoid fever. It emerged as an extreme drug resistant strain that requires urgent attention to prevent its global spread. Statistically, about 11-17 million typhoid illnesses are reported worldwide annually. The only alternative approach for the control of this illness is proper vaccination. However, available typhoid vaccine has certain limitations such as poor long-term efficacy, and non-recommendation for below 6 years children, which opens the avenues for designing new vaccines to overcome such limitations. Computational-based reverse vaccinology along with subtractive genomics analysis is one of the robust approaches used for the prioritization of vaccine candidates through direct screening of genome sequence assemblies. In the current study, we have successfully designed a peptide-based novel antigen chimeric vaccine candidate against the XDR strain of S. typhi H58. The pipeline revealed four peptides from WP_001176621.1 i.e., peptidoglycan-associated lipoprotein Pal and two peptides from WP_000747548.1 i.e., OmpA family lipoprotein as promising target for the induction of immune response against S. typhi. The six epitopes from both proteins were found as immunogenic, antigenic, virulent, highly conserved, nontoxic, and non-allergenic among whole Salmonella H58 proteome. Furthermore, the binding interaction between a chimeric vaccine and human population alleles was unveiled through structure-based studies. So far, these proteins have never been characterized as vaccine targets against S. typhi. The current study proposed that construct V2 could be a significant vaccine candidate against S. typhi H58. However, to ascertain this, future experimental holistic studies are recommended as follow-up.
Collapse
Affiliation(s)
- Kanwal Khan
- Dr. Panjwani Center for Molecular Medicine and Drug Research, International Center for Chemical and Biological Sciences, University of Karachi, Pakistan
| | - Khurshid Jalal
- HEJ Research Institute of Chemistry International Center for Chemical and Biological Sciences, University of Karachi, Pakistan
| | - Reaz Uddin
- Dr. Panjwani Center for Molecular Medicine and Drug Research, International Center for Chemical and Biological Sciences, University of Karachi, Pakistan.
| |
Collapse
|
36
|
Zhu X, Zhu Y, Tan Y, Chen Z, Wang L. An Iterative Method for Predicting Essential Proteins Based on Multifeature Fusion and Linear Neighborhood Similarity. Front Aging Neurosci 2022; 13:799500. [PMID: 35140599 PMCID: PMC8819145 DOI: 10.3389/fnagi.2021.799500] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Accepted: 12/02/2021] [Indexed: 11/13/2022] Open
Abstract
Growing evidence have demonstrated that many biological processes are inseparable from the participation of key proteins. In this paper, a novel iterative method called linear neighborhood similarity-based protein multifeatures fusion (LNSPF) is proposed to identify potential key proteins based on multifeature fusion. In LNSPF, an original protein-protein interaction (PPI) network will be constructed first based on known protein-protein interaction data downloaded from benchmark databases, based on which, topological features will be further extracted. Next, gene expression data of proteins will be adopted to transfer the original PPI network to a weighted PPI network based on the linear neighborhood similarity. After that, subcellular localization and homologous information of proteins will be integrated to extract functional features for proteins, and based on both functional and topological features obtained above. And then, an iterative method will be designed and carried out to predict potential key proteins. At last, for evaluating the predictive performance of LNSPF, extensive experiments have been done, and compare results between LNPSF and 15 state-of-the-art competitive methods have demonstrated that LNSPF can achieve satisfactory recognition accuracy, which is markedly better than that achieved by each competing method.
Collapse
Affiliation(s)
- Xianyou Zhu
- College of Computer Science and Technology, Hengyang Normal University, Hengyang, China
| | - Yaocan Zhu
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| | - Yihong Tan
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| | - Zhiping Chen
- College of Computer Science and Technology, Hengyang Normal University, Hengyang, China
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| | - Lei Wang
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| |
Collapse
|
37
|
Predicting Essential Proteins Based on Integration of Local Fuzzy Fractal Dimension and Subcellular Location Information. Genes (Basel) 2022; 13:genes13020173. [PMID: 35205217 PMCID: PMC8872415 DOI: 10.3390/genes13020173] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Revised: 01/08/2022] [Accepted: 01/12/2022] [Indexed: 11/17/2022] Open
Abstract
Essential proteins are indispensable to cells’ survival and development. Prediction and analysis of essential proteins are crucial for uncovering the mechanisms of cells. With the help of computer science and high-throughput technologies, forecasting essential proteins by protein–protein interaction (PPI) networks has become more efficient than traditional approaches (expensive experimental methods are generally used). Many computational algorithms were employed to predict the essential proteins; however, they have various restrictions. To improve the prediction accuracy, by introducing the Local Fuzzy Fractal Dimension (LFFD) of complex networks into the analysis of the PPI network, we propose a novel algorithm named LDS, which combines the LFFD of the PPI network with the protein subcellular location information. By testing the proposed LDS algorithm on three different yeast PPI networks, the experimental results show that LDS outperforms some state-of-the-art essential protein-prediction techniques.
Collapse
|
38
|
Qi G, Jiang K, Qu J, Zhang A, Xu Z, Li Z, Zheng X, Li Z. The Material Basis and Mechanism of Xuefu Zhuyu Decoction in Treating Stable Angina Pectoris and Unstable Angina Pectoris. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE : ECAM 2022; 2022:3741027. [PMID: 35140797 PMCID: PMC8820872 DOI: 10.1155/2022/3741027] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Accepted: 01/05/2022] [Indexed: 02/07/2023]
Abstract
METHODS Firstly, we used a network proximity approach to calculate and compare the effectiveness of the formula with that of Western drugs for each type of angina, including all targets and intersecting targets, from a topological perspective. Secondly, we compared the mechanisms of action of the two angina pectoris at three levels and five aspects, including conventional and modular analysis approaches. Thirdly, based on the unique functions of each angina in the complex heterogeneous network, we designed a reverse process for finding the material basis using dynamic, static, and enriched items as well as a total item. Finally, the designed inverse process, material basis, and mechanism of action were validated. RESULTS The target network of Xuefu Zhuyu decoction is closer to the target network of each type of angina than that of Western drugs, and the intersection targets have a closer proximity. Comparison of the mechanisms of action showed that stable angina and unstable angina had 158 common targets, while the unique targets were 34 and 1, respectively. Modularity analysis showed that the GO similarity of target modules was highly correlated with KEGG similarity. We ended up with 67 compounds upregulated for stable angina and 47 compounds upregulated for unstable angina. Our results were validated by literature mining, high-volume molecular docking, and miRNA enrichment analysis. CONCLUSIONS For both types of angina pectoris, Xuefu Zhuyu decoction is superior to Western drugs. A comparison of various aspects led to the unique mechanisms of action, from which the material basis of each type of angina was deduced.
Collapse
Affiliation(s)
- Guanpeng Qi
- 1School of Pharmacy, Shenyang Pharmaceutical University, Shenyang, China
| | - Kaiwen Jiang
- 1School of Pharmacy, Shenyang Pharmaceutical University, Shenyang, China
| | - Jiaming Qu
- 1School of Pharmacy, Shenyang Pharmaceutical University, Shenyang, China
| | - Aijun Zhang
- 1School of Pharmacy, Shenyang Pharmaceutical University, Shenyang, China
| | - Ze Xu
- 1School of Pharmacy, Shenyang Pharmaceutical University, Shenyang, China
| | - Zhaohang Li
- 1School of Pharmacy, Shenyang Pharmaceutical University, Shenyang, China
| | - Xiaosong Zheng
- 2School of Medical Devices, Shenyang Pharmaceutical University, Shenyang, China
| | - Zuojing Li
- 2School of Medical Devices, Shenyang Pharmaceutical University, Shenyang, China
| |
Collapse
|
39
|
Ali A, Ahmad S, de Albuquerque PMM, Kamil A, Alshammari FA, Alouffi A, da Silva Vaz I. Prediction of Novel Drug Targets and Vaccine Candidates against Human Lice (Insecta), Acari (Arachnida), and Their Associated Pathogens. Vaccines (Basel) 2021; 10:vaccines10010008. [PMID: 35062669 PMCID: PMC8778234 DOI: 10.3390/vaccines10010008] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Revised: 12/14/2021] [Accepted: 12/20/2021] [Indexed: 01/12/2023] Open
Abstract
The emergence of drug-resistant lice, acari, and their associated pathogens (APs) is associated with economic losses; thus, it is essential to find new appropriate therapeutic approaches. In the present study, a subtractive proteomics approach was used to predict suitable therapeutics against these vectors and their infectious agents. We found 9701 proteins in the lice (Pediculus humanus var. corporis) and acari (Ixodes scapularis, Leptotrombidium deliense), and 4822 proteins in the proteomes of their APs (Babesia microti, Borreliella mayonii, Borrelia miyamotoi, Borrelia recurrentis, Rickettsia prowazekii, Orientia tsutsugamushi str. Boryong) that were non-homologous to host proteins. Among these non-homologous proteins, 365 proteins of lice and acari, and 630 proteins of APs, were predicted as essential proteins. Twelve unique essential proteins were predicted to be involved in four unique metabolic pathways of lice and acari, and 103 unique proteins were found to be involved in 75 unique metabolic pathways of APs. The sub cellular localization analysis of 115 unique essential proteins of lice and acari and their APs revealed that 61 proteins were cytoplasmic, 42 as membrane-bound proteins and 12 proteins with multiple localization. The druggability analysis of the identified 73 cytoplasmic and multiple localization essential proteins revealed 22 druggable targets and 51 novel drug targets that participate in unique pathways of lice and acari and their APs. Further, the predicted 42 membrane bound proteins could be potential vaccine candidates. Screening of useful inhibitors against these novel targets may result in finding novel compounds efficient for the control of these parasites.
Collapse
Affiliation(s)
- Abid Ali
- Department of Zoology, Abdul Wali Khan University Mardan, Mardan 23200, Pakistan;
- Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul, Porto Alegre 91501-970, Brazil; (S.A.); (P.M.M.d.A.)
| | - Shabir Ahmad
- Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul, Porto Alegre 91501-970, Brazil; (S.A.); (P.M.M.d.A.)
| | | | - Atif Kamil
- Department of Biotechnology, Abdul Wali Khan University Mardan, Mardan 23200, Pakistan;
| | - Fahdah Ayed Alshammari
- College of Sciences and Literature Microbiology, Nothern Border University, Rafha 76413, Saudi Arabia;
| | - Abdulaziz Alouffi
- King Abdulaziz City for Science and Technology, Riyadh 12354, Saudi Arabia;
- Vaccines Research for Infectious Diseases, King Saud University, Riyadh 11495, Saudi Arabia
- Veterinary Laboratories and Vaccines Center, Ministry of Environment Water & Agriculture, Riyadh 11195, Saudi Arabia
| | - Itabajara da Silva Vaz
- Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul, Porto Alegre 91501-970, Brazil; (S.A.); (P.M.M.d.A.)
- Correspondence: ; Tel.: +55-(51)-33086078; Fax: +55-(51)-33087309
| |
Collapse
|
40
|
Zhu X, He X, Kuang L, Chen Z, Lancine C. A Novel Collaborative Filtering Model-Based Method for Identifying Essential Proteins. Front Genet 2021; 12:763153. [PMID: 34745230 PMCID: PMC8566338 DOI: 10.3389/fgene.2021.763153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Accepted: 09/13/2021] [Indexed: 11/19/2022] Open
Abstract
Considering that traditional biological experiments are expensive and time consuming, it is important to develop effective computational models to infer potential essential proteins. In this manuscript, a novel collaborative filtering model-based method called CFMM was proposed, in which, an updated protein–domain interaction (PDI) network was constructed first by applying collaborative filtering algorithm on the original PDI network, and then, through integrating topological features of PDI networks with biological features of proteins, a calculative method was designed to infer potential essential proteins based on an improved PageRank algorithm. The novelties of CFMM lie in construction of an updated PDI network, application of the commodity-customer-based collaborative filtering algorithm, and introduction of the calculation method based on an improved PageRank algorithm, which ensured that CFMM can be applied to predict essential proteins without relying entirely on known protein–domain associations. Simulation results showed that CFMM can achieve reliable prediction accuracies of 92.16, 83.14, 71.37, 63.87, 55.84, and 52.43% in the top 1, 5, 10, 15, 20, and 25% predicted candidate key proteins based on the DIP database, which are remarkably higher than 14 competitive state-of-the-art predictive models as a whole, and in addition, CFMM can achieve satisfactory predictive performances based on different databases with various evaluation measurements, which further indicated that CFMM may be a useful tool for the identification of essential proteins in the future.
Collapse
Affiliation(s)
- Xianyou Zhu
- College of Computer Science and Technology, Hengyang Normal University, Hengyang, China.,Hunan Provincial Key Laboratory of Intelligent Information Processing and Application, Hengyang, China
| | - Xin He
- College of Computer, Xiangtan University, Xiangtan, China
| | - Linai Kuang
- College of Computer, Xiangtan University, Xiangtan, China
| | - Zhiping Chen
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| | - Camara Lancine
- The Social Sciences and Management University of Bamako, Bamako, Mali
| |
Collapse
|
41
|
Li S, Zhang Z, Li X, Tan Y, Wang L, Chen Z. An iteration model for identifying essential proteins by combining comprehensive PPI network with biological information. BMC Bioinformatics 2021; 22:430. [PMID: 34496745 PMCID: PMC8425031 DOI: 10.1186/s12859-021-04300-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 07/08/2021] [Indexed: 11/10/2022] Open
Abstract
Background Essential proteins have great impacts on cell survival and development, and played important roles in disease analysis and new drug design. However, since it is inefficient and costly to identify essential proteins by using biological experiments, then there is an urgent need for automated and accurate detection methods. In recent years, the recognition of essential proteins in protein interaction networks (PPI) has become a research hotspot, and many computational models for predicting essential proteins have been proposed successively. Results In order to achieve higher prediction performance, in this paper, a new prediction model called TGSO is proposed. In TGSO, a protein aggregation degree network is constructed first by adopting the node density measurement method for complex networks. And simultaneously, a protein co-expression interactive network is constructed by combining the gene expression information with the network connectivity, and a protein co-localization interaction network is constructed based on the subcellular localization data. And then, through integrating these three kinds of newly constructed networks, a comprehensive protein–protein interaction network will be obtained. Finally, based on the homology information, scores can be calculated out iteratively for different proteins, which can be utilized to estimate the importance of proteins effectively. Moreover, in order to evaluate the identification performance of TGSO, we have compared TGSO with 13 different latest competitive methods based on three kinds of yeast databases. And experimental results show that TGSO can achieve identification accuracies of 94%, 82% and 72% out of the top 1%, 5% and 10% candidate proteins respectively, which are to some degree superior to these state-of-the-art competitive models. Conclusions We constructed a comprehensive interactive network based on multi-source data to reduce the noise and errors in the initial PPI, and combined with iterative methods to improve the accuracy of necessary protein prediction, and means that TGSO may be conducive to the future development of essential protein recognition as well.
Collapse
Affiliation(s)
- Shiyuan Li
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, 410022, China.,Hunan Province Key Laboratory of Industrial Internet Technology and Security, Changsha University, Changsha, 410022, China
| | - Zhen Zhang
- College of Electronic Information and Electrical Engineering, Changsha University, Changsha, 410022, China
| | - Xueyong Li
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, 410022, China.,Hunan Province Key Laboratory of Industrial Internet Technology and Security, Changsha University, Changsha, 410022, China
| | - Yihong Tan
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, 410022, China. .,Hunan Province Key Laboratory of Industrial Internet Technology and Security, Changsha University, Changsha, 410022, China.
| | - Lei Wang
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, 410022, China.,Hunan Province Key Laboratory of Industrial Internet Technology and Security, Changsha University, Changsha, 410022, China
| | - Zhiping Chen
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, 410022, China. .,Hunan Province Key Laboratory of Industrial Internet Technology and Security, Changsha University, Changsha, 410022, China.
| |
Collapse
|
42
|
Xie J, Zhao C, Sun J, Li J, Yang F, Wang J, Nie Q. Prediction of Essential Genes in Comparison States Using Machine Learning. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:1784-1792. [PMID: 32991286 DOI: 10.1109/tcbb.2020.3027392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Identifying essential genes in comparison states (EGS) is vital to understanding cell differentiation, performing drug discovery, and identifying disease causes. Here, we present a machine learning method termed Prediction of Essential Genes in Comparison States (PreEGS). To capture the alteration of the network in comparison states, PreEGS extracts topological and gene expression features of each gene in a five-dimensional vector. PreEGS also recruits a positive sample expansion method to address the problem of unbalanced positive and negative samples, which is often encountered in practical applications. Different classifiers are applied to the simulated datasets, and the PreEGS based on the random forests model (PreEGSRF) was chosen for optimal performance. PreEGSRF was then compared with six other methods, including three machine learning methods, to predict EGS in a specific state. On real datasets with four gene regulatory networks, PreEGSRF predicted five essential genes related to leukemia and five enriched KEGG pathways. Four of the predicted essential genes and all predicted pathways were consistent with previous studies and highly correlated with leukemia. With high prediction accuracy and generalization ability, PreEGSRF is broadly applicable for the discovery of disease-causing genes, driver genes for cell fate decisions, and complex biomarkers of biological systems.
Collapse
|
43
|
Zhang Z, Jiang M, Wu D, Zhang W, Yan W, Qu X. A Novel Method for Identifying Essential Proteins Based on Non-negative Matrix Tri-Factorization. Front Genet 2021; 12:709660. [PMID: 34422014 PMCID: PMC8378176 DOI: 10.3389/fgene.2021.709660] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Accepted: 07/06/2021] [Indexed: 11/29/2022] Open
Abstract
Identification of essential proteins is very important for understanding the basic requirements to sustain a living organism. In recent years, there has been an increasing interest in using computational methods to predict essential proteins based on protein–protein interaction (PPI) networks or fusing multiple biological information. However, it has been observed that existing PPI data have false-negative and false-positive data. The fusion of multiple biological information can reduce the influence of false data in PPI, but inevitably more noise data will be produced at the same time. In this article, we proposed a novel non-negative matrix tri-factorization (NMTF)-based model (NTMEP) to predict essential proteins. Firstly, a weighted PPI network is established only using the topology features of the network, so as to avoid more noise. To reduce the influence of false data (existing in PPI network) on performance of identify essential proteins, the NMTF technique, as a widely used recommendation algorithm, is performed to reconstruct a most optimized PPI network with more potential protein–protein interactions. Then, we use the PageRank algorithm to compute the final ranking score of each protein, in which subcellular localization and homologous information of proteins were used to calculate the initial scores. In addition, extensive experiments are performed on the publicly available datasets and the results indicate that our NTMEP model has better performance in predicting essential proteins against the start-of-the-art method. In this investigation, we demonstrated that the introduction of non-negative matrix tri-factorization technology can effectively improve the condition of the protein–protein interaction network, so as to reduce the negative impact of noise on the prediction. At the same time, this finding provides a more novel angle of view for other applications based on protein–protein interaction networks.
Collapse
Affiliation(s)
- Zhihong Zhang
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China.,School of Information Technology and Management, Hunan University of Finance and Economics, Changsha, China
| | - Meiping Jiang
- Department of Ultrasound, Hunan Provincial Maternal and Child Health Care Hospital, Changsha, China
| | - Dongjie Wu
- Department of Banking and Finance, Monash University, Clayton, VIC, Australia
| | - Wang Zhang
- Department of Optoelectronic Engineering, Jinan University, Guangzhou, China
| | - Wei Yan
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| | - Xilong Qu
- School of Information Technology and Management, Hunan University of Finance and Economics, Changsha, China.,Hunan Provincial Key Laboratory of Finance and Economics Big Data Science and Technology, Hunan University of Finance and Economics, Changsha, China
| |
Collapse
|
44
|
Verma D, Gupta V. New insights into the structure and function of an emerging drug target CysE. 3 Biotech 2021; 11:373. [PMID: 34367865 DOI: 10.1007/s13205-021-02891-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2020] [Accepted: 06/09/2021] [Indexed: 11/24/2022] Open
Abstract
The antimicrobial resistant strains of several pathogens are major culprits of hospital-acquired nosocomial infections. An active and urgent action is necessary against these pathogens for the development of unique therapeutics. The cysteine biosynthetic pathway or genes (that are absent in humans) involved in the production of L-cysteine appear to be an attractive target for developing novel antibiotics. CysE, a Serine Acetyltransferase (SAT), catalyzes the first step of cysteine synthesis and is reported to be essential for the survival of persistence in several microbes including Mycobacterium tuberculosis. Structure determination provides fundamental insight into structure and function of protein and aid in drug design/discovery efforts. This review focuses on the overview of current knowledge of structure function, regulatory mechanism, and potential inhibitors (active site as well as allosteric site) of CysE. Despite having conserved structure, slight modification in CysE structure lead to altered the regulatory mechanism and hence affects the cysteine production. Due to its possible role in virulence and vital metabolism of pathogens makes it a potential target in the quest to develop novel therapeutics to treat multi-drug-resistant bacteria.
Collapse
Affiliation(s)
- Deepali Verma
- Department of Biotechnology, Jaypee Institute of Information Technology, A-10, Sector-62, Noida, Uttar Pradesh 201309 India
| | - Vibha Gupta
- Department of Biotechnology, Jaypee Institute of Information Technology, A-10, Sector-62, Noida, Uttar Pradesh 201309 India
| |
Collapse
|
45
|
Peng J, Kuang L, Zhang Z, Tan Y, Chen Z, Wang L. A Novel Model for Identifying Essential Proteins Based on Key Target Convergence Sets. Front Genet 2021; 12:721486. [PMID: 34394201 PMCID: PMC8358660 DOI: 10.3389/fgene.2021.721486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Accepted: 06/30/2021] [Indexed: 11/20/2022] Open
Abstract
In recent years, many computational models have been designed to detect essential proteins based on protein-protein interaction (PPI) networks. However, due to the incompleteness of PPI networks, the prediction accuracy of these models is still not satisfactory. In this manuscript, a novel key target convergence sets based prediction model (KTCSPM) is proposed to identify essential proteins. In KTCSPM, a weighted PPI network and a weighted (Domain-Domain Interaction) network are constructed first based on known PPIs and PDIs downloaded from benchmark databases. And then, by integrating these two kinds of networks, a novel weighted PDI network is built. Next, through assigning a unique key target convergence set (KTCS) for each node in the weighted PDI network, an improved method based on the random walk with restart is designed to identify essential proteins. Finally, in order to evaluate the predictive effects of KTCSPM, it is compared with 12 competitive state-of-the-art models, and experimental results show that KTCSPM can achieve better prediction accuracy. Considering the satisfactory predictive performance achieved by KTCSPM, it indicates that KTCSPM might be a good supplement to the future research on prediction of essential proteins.
Collapse
Affiliation(s)
- Jiaxin Peng
- College of Computer, Xiangtan University, Xiangtan, China.,College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| | - Linai Kuang
- College of Computer, Xiangtan University, Xiangtan, China
| | - Zhen Zhang
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| | - Yihong Tan
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| | - Zhiping Chen
- College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| | - Lei Wang
- College of Computer, Xiangtan University, Xiangtan, China.,College of Computer Engineering and Applied Mathematics, Changsha University, Changsha, China
| |
Collapse
|
46
|
He X, Kuang L, Chen Z, Tan Y, Wang L. Method for Identifying Essential Proteins by Key Features of Proteins in a Novel Protein-Domain Network. Front Genet 2021; 12:708162. [PMID: 34267785 PMCID: PMC8276041 DOI: 10.3389/fgene.2021.708162] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 05/31/2021] [Indexed: 11/21/2022] Open
Abstract
In recent years, due to low accuracy and high costs of traditional biological experiments, more and more computational models have been proposed successively to infer potential essential proteins. In this paper, a novel prediction method called KFPM is proposed, in which, a novel protein-domain heterogeneous network is established first by combining known protein-protein interactions with known associations between proteins and domains. Next, based on key topological characteristics extracted from the newly constructed protein-domain network and functional characteristics extracted from multiple biological information of proteins, a new computational method is designed to effectively integrate multiple biological features to infer potential essential proteins based on an improved PageRank algorithm. Finally, in order to evaluate the performance of KFPM, we compared it with 13 state-of-the-art prediction methods, experimental results show that, among the top 1, 5, and 10% of candidate proteins predicted by KFPM, the prediction accuracy can achieve 96.08, 83.14, and 70.59%, respectively, which significantly outperform all these 13 competitive methods. It means that KFPM may be a meaningful tool for prediction of potential essential proteins in the future.
Collapse
Affiliation(s)
- Xin He
- College of Computer, Xiangtan University, Xiangtan, China
| | - Linai Kuang
- College of Computer, Xiangtan University, Xiangtan, China
| | - Zhiping Chen
- College of Computer Engineering & Applied Mathematics, Changsha University, Changsha, China
| | - Yihong Tan
- College of Computer Engineering & Applied Mathematics, Changsha University, Changsha, China
| | - Lei Wang
- College of Computer, Xiangtan University, Xiangtan, China
- College of Computer Engineering & Applied Mathematics, Changsha University, Changsha, China
| |
Collapse
|
47
|
Cao H, Xu H, Ning C, Xiang L, Ren Q, Zhang T, Zhang Y, Gao R. Multi-Omics Approach Reveals the Potential Core Vaccine Targets for the Emerging Foodborne Pathogen Campylobacter jejuni. Front Microbiol 2021; 12:665858. [PMID: 34248875 PMCID: PMC8265506 DOI: 10.3389/fmicb.2021.665858] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 04/06/2021] [Indexed: 11/30/2022] Open
Abstract
Campylobacter jejuni is a leading cause of bacterial gastroenteritis in humans around the world. The emergence of bacterial resistance is becoming more serious; therefore, development of new vaccines is considered to be an alternative strategy against drug-resistant pathogen. In this study, we investigated the pangenome of 173 C. jejuni strains and analyzed the phylogenesis and the virulence factor genes. In order to acquire a high-quality pangenome, genomic relatedness was firstly performed with average nucleotide identity (ANI) analyses, and an open pangenome of 8,041 gene families was obtained with the correct taxonomy genomes. Subsequently, the virulence property of the core genome was analyzed and 145 core virulence factor (VF) genes were obtained. Upon functional genomics and immunological analyses, five core VF proteins with high antigenicity were selected as potential core vaccine targets for humans. Furthermore, functional annotations indicated that these proteins are involved in important molecular functions and biological processes, such as adhesion, regulation, and secretion. In addition, transcriptome analysis in human cells and pig intestinal loop proved that these vaccine target genes are important in the virulence of C. jejuni in different hosts. Comprehensive pangenome and relevant animal experiments will facilitate discovering the potential core vaccine targets with improved efficiency in reverse vaccinology. Likewise, this study provided some insights into the genetic polymorphism and phylogeny of C. jejuni and discovered potential vaccine candidates for humans. Prospective development of new vaccines using the targets will be an alternative to the use of antibiotics and prevent the development of multidrug-resistant C. jejuni in humans and even other animals.
Collapse
Affiliation(s)
- Hengchun Cao
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Hanxiao Xu
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Chunhui Ning
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Li Xiang
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Qiufang Ren
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Tiantian Zhang
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Yusen Zhang
- School of Mathematics and Statistics, Shandong University, Weihai, China
| | - Rui Gao
- School of Control Science and Engineering, Shandong University, Jinan, China
| |
Collapse
|
48
|
Wang N, Zeng M, Li Y, Wu FX, Li M. Essential Protein Prediction Based on node2vec and XGBoost. J Comput Biol 2021; 28:687-700. [PMID: 34152838 DOI: 10.1089/cmb.2020.0543] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Essential proteins are a vital part of the survival of organisms and cells. Identification of essential proteins lays a solid foundation for understanding protein functions and discovering drug targets. The traditional biological experiments are expensive and time-consuming. Recently, many computational methods have been proposed. However, some noises in the protein-protein interaction (PPI) networks affect the efficiency of essential protein prediction. It is necessary to construct a credible PPI network by using other useful biological information to reduce the effects of these noises. In this article, we proposed a model, Ess-NEXG, to identify essential proteins, which integrates biological information, including orthologous information, subcellular localization information, RNA-Seq information, and PPI network. In our model, first, we constructed a credible weighted PPI network by using different types of biological information. Second, we extracted the topological features of proteins in the constructed weighted PPI network by using the node2vec technique. Last, we used eXtreme Gradient Boosting (XGBoost) to predict essential proteins by using the topological features of proteins. The extensive results show that our model has better performance than other computational methods.
Collapse
Affiliation(s)
- Nian Wang
- School of Computer Science and Engineering, Central South University, Changsha, P.R. China
| | - Min Zeng
- School of Computer Science and Engineering, Central South University, Changsha, P.R. China
| | - Yiming Li
- School of Computer Science and Engineering, Central South University, Changsha, P.R. China
| | - Fang-Xiang Wu
- Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, Canada.,Department of Mechanical Engineering, University of Saskatchewan, Saskatoon, Canada
| | - Min Li
- School of Computer Science and Engineering, Central South University, Changsha, P.R. China
| |
Collapse
|
49
|
Zhong J, Tang C, Peng W, Xie M, Sun Y, Tang Q, Xiao Q, Yang J. A novel essential protein identification method based on PPI networks and gene expression data. BMC Bioinformatics 2021; 22:248. [PMID: 33985429 PMCID: PMC8120700 DOI: 10.1186/s12859-021-04175-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Accepted: 05/06/2021] [Indexed: 02/08/2023] Open
Abstract
Background Some proposed methods for identifying essential proteins have better results by using biological information. Gene expression data is generally used to identify essential proteins. However, gene expression data is prone to fluctuations, which may affect the accuracy of essential protein identification. Therefore, we propose an essential protein identification method based on gene expression and the PPI network data to calculate the similarity of "active" and "inactive" state of gene expression in a cluster of the PPI network. Our experiments show that the method can improve the accuracy in predicting essential proteins. Results In this paper, we propose a new measure named JDC, which is based on the PPI network data and gene expression data. The JDC method offers a dynamic threshold method to binarize gene expression data. After that, it combines the degree centrality and Jaccard similarity index to calculate the JDC score for each protein in the PPI network. We benchmark the JDC method on four organisms respectively, and evaluate our method by using ROC analysis, modular analysis, jackknife analysis, overlapping analysis, top analysis, and accuracy analysis. The results show that the performance of JDC is better than DC, IC, EC, SC, BC, CC, NC, PeC, and WDC. We compare JDC with both NF-PIN and TS-PIN methods, which predict essential proteins through active PPI networks constructed from dynamic gene expression. Conclusions We demonstrate that the new centrality measure, JDC, is more efficient than state-of-the-art prediction methods with same input. The main ideas behind JDC are as follows: (1) Essential proteins are generally densely connected clusters in the PPI network. (2) Binarizing gene expression data can screen out fluctuations in gene expression profiles. (3) The essentiality of the protein depends on the similarity of "active" and "inactive" state of gene expression in a cluster of the PPI network.
Collapse
Affiliation(s)
- Jiancheng Zhong
- School of Information Science and Engineering, Hunan Normal University, Changsha, 410081, China.,Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Hunan Provincial Key Laboratory of Intelligent Computing and Language Information Processing, Changsha, 410083, China
| | - Chao Tang
- School of Information Science and Engineering, Hunan Normal University, Changsha, 410081, China
| | - Wei Peng
- College of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, 650500, Yunnan, China
| | - Minzhu Xie
- School of Information Science and Engineering, Hunan Normal University, Changsha, 410081, China
| | - Yusui Sun
- School of Information Science and Engineering, Hunan Normal University, Changsha, 410081, China
| | - Qiang Tang
- College of Engineering and Design, Hunan Normal University, Changsha, 410081, China
| | - Qiu Xiao
- School of Information Science and Engineering, Hunan Normal University, Changsha, 410081, China.
| | - Jiahong Yang
- School of Information Science and Engineering, Hunan Normal University, Changsha, 410081, China.
| |
Collapse
|
50
|
Payra AK, Saha B, Ghosh A. Ortho_Sim_Loc: Essential protein prediction using orthology and priority-based similarity approach. Comput Biol Chem 2021; 92:107503. [PMID: 33962168 DOI: 10.1016/j.compbiolchem.2021.107503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Revised: 04/02/2021] [Accepted: 04/21/2021] [Indexed: 10/21/2022]
Abstract
Proteins are the essential macro-molecules of living organism. But all proteins cannot be considered as essential in different relevant studies. Essentiality of a protein is thus computed by computation methods rather than biological experiments which in turn save both time and effort. Different computational approaches are already predicted to select essential proteins successfully with different biological significances by researchers. Most of the experimental approaches return higher false negative outcomes with respect to others. In order to retain the prediction accuracy level, a novel methodology "Ortho_Sim_Loc"has been proposed which is a combined approach of Orthology, Similarity (using clustering and priority based GO-Annotation) and Subcellular localization. Ortho_Sim_Loc can predict enriched functional set essential proteins. The predicted results are validated with other existing methods like different centrality measures, LIDC. The validation results exhibits better performance of Ortho_Sim_Loc in compare to other existing computational approaches.
Collapse
Affiliation(s)
- Anjan Kumar Payra
- Department of Computer Science & Engineering, Dr. Sudhir Chandra Sur Degree Engineering College, 540, Dum Dum Road, Near Dum Dum Jn. Station, Surermath, Kolkata, 700074, India.
| | - Banani Saha
- Department of Computer Science & Engineering, University of Calcutta, Saltlake City, Kolkata, 700073, India.
| | - Anupam Ghosh
- Department of Computer Science & Engineering, Netaji Subhash Engineering College, Techno City, Panchpota, Garia, Kolkata, 700152, India.
| |
Collapse
|