1
|
Chakma V, Barman DN, Das SC, Hossain A, Momin MB, Tasneem M, Gupta SD. In silico analysis of a novel hypothetical protein (YP_498675.1) from Staphylococcus aureus unravels the protein of tryptophan synthase beta superfamily (Try-synth-beta_ II). J Genet Eng Biotechnol 2023; 21:135. [PMID: 37995054 PMCID: PMC10667181 DOI: 10.1186/s43141-023-00613-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Accepted: 11/14/2023] [Indexed: 11/24/2023]
Abstract
BACKGROUND Staphylococcus aureus is a gram-positive spherical bacteria and the most common cause of nosocomial infections in the world. Given its clinical significance, the genome sequence of S. aureus has been elucidated to enhance our comprehension of its lifestyle and pathogenicity. The research aimed to summarize a potential hypothetical protein that may play an important role in S. aureus virulence and pathogenicity, covering its anticipated structure, probable biological functions, and importance in this context. RESULTS A hypothetical protein, YP_498675.1 with 281 amino acid residues of S. aureus, was chosen for analysis and modeling by several bioinformatics tools and databases in this work. According to primary and secondary structure analyses, YP_498675.1 is a stable hydrophilic protein with a significant proportion of α-helices. Subcellular localization predictions by CELLO, PSORTb, and SOSUI server indicate that it is a cytoplasmic protein. NCBI-CDD, Pfam, and InterProScan functional genomics research revealed that the hypothetical protein may include the pyridoxal phosphate (PLP)-dependent 2, 3-diaminopropionate biosynthesis protein SbnA domain. In the homology modeling method, the HHpred server was employed to create its 3D structure using the template structure of a Staphyloferrin B precursor biosynthetic enzyme SbnA bound to PLP (PDB ID: 5D84_A), an X-ray diffraction model having 100% sequence identity with the hypothetical protein. After energy minimization, several quality assessments and validation factors determined that the generated protein model was reliable and of reasonable quality. CONCLUSION The present study has characterized and functionally annotated the hypothetical protein YP_498675.1 of S. aureus. Further experimental validation would aid in determining the actual function of YP_498675.1 as well as confirm the protein's value as a therapeutic target.
Collapse
Affiliation(s)
- Vivian Chakma
- Department of Biotechnology and Genetic Engineering, Noakhali Science and Technology University, Noakhali, 3814, Bangladesh
| | - Dhirendra Nath Barman
- Department of Biotechnology and Genetic Engineering, Noakhali Science and Technology University, Noakhali, 3814, Bangladesh
| | - Shuvo Chandra Das
- Department of Biotechnology and Genetic Engineering, Noakhali Science and Technology University, Noakhali, 3814, Bangladesh
| | - Anwar Hossain
- Department of Biotechnology and Genetic Engineering, Noakhali Science and Technology University, Noakhali, 3814, Bangladesh
| | - Monira Binte Momin
- Department of Biotechnology and Genetic Engineering, Noakhali Science and Technology University, Noakhali, 3814, Bangladesh
| | - Maisha Tasneem
- Department of Biotechnology and Genetic Engineering, Noakhali Science and Technology University, Noakhali, 3814, Bangladesh
| | - Shipan Das Gupta
- Department of Biotechnology and Genetic Engineering, Noakhali Science and Technology University, Noakhali, 3814, Bangladesh.
| |
Collapse
|
2
|
Ferreira RM, Dos Santos Silva DH, Silva KF, de Melo Monteiro J, Ferreira GF, Silva MRC, da Silva LCN, de Castro Oliveira L, Monteiro AS. Draft genome sequence of Staphylococcus aureus sequence type 5 SA01 isolated from bloodstream infection and comparative analysis with reference strains. Funct Integr Genomics 2023; 23:288. [PMID: 37653266 DOI: 10.1007/s10142-023-01204-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 08/04/2023] [Accepted: 08/07/2023] [Indexed: 09/02/2023]
Abstract
A Staphylococcus aureus isolate (SA01) obtained from bloodstream infection exhibited a remarkable drug resistance profile. In this study, we report the draft genome sequence of S. aureus ST 5 SA01, a multidrug-resistant isolate, and analyzed the genes associated with drug resistance and virulence. The genome sketch of S. aureus ST5 SA01 was sequenced with Illumina and annotated using the Prokka software. Rapid Annotation Subsystem Technology (RAST) was used to verify the gene functions in the genome subsystems. The Comprehensive Antibiotic Resistance Database (CARD) and Virulence Factor Database (VFDB) were used in the analysis. The RAST indicated a contribution of 25 proteins to host adenine, fibronectin-binding protein A (FnbA), and biofilm formation as an intercellular polysaccharide adhesive system (PIA). The MLST indicated that S. aureus ST 5 SA01 belongs to ST5 (CC5). In silico analyses also showed an extensive repertoire of genes associated with toxins, such as LukGH leukocidin, enterotoxins, and superantigen staphylococcal classes (SSL). The 11 genes for antimicrobial resistance in S. aureus ST 5 SA01 showed similarity and identity above ≥ 99% with nucleotide sequences deposited in GenBank. Although studies on ST5 clones in Brazil are scarce, monitoring the clone of S. aureus ST 5 SA01 is essential, as it has become a problem in pediatrics in several countries.
Collapse
Affiliation(s)
- Romulo Maia Ferreira
- Laboratório de Microbiologia Aplicada, Universidade CEUMA, São Luís, 65075-120, MA, Brasil
| | | | - Karinny Farias Silva
- Laboratório de Microbiologia Aplicada, Universidade CEUMA, São Luís, 65075-120, MA, Brasil
| | | | - Gabriella Freitas Ferreira
- Departamento de Farmácia, Universidade Federal de Juiz de Fora - Campus Governador Valadares, CEP 35010-180, Juiz de Fora, MG, Brasil
| | | | | | - Letícia de Castro Oliveira
- Departamento de Microbiologia, Universidade Federal Do Triângulo Mineiro, Imunologia E Parasitologia, 38025180, Uberaba, MG, Brasil
| | - Andrea Souza Monteiro
- Laboratório de Microbiologia Aplicada, Universidade CEUMA, São Luís, 65075-120, MA, Brasil
| |
Collapse
|
3
|
Sicilia C, Corral-Lugo A, Smialowski P, McConnell MJ, Martín-Galiano AJ. Unsupervised Machine Learning Organization of the Functional Dark Proteome of Gram-Negative "Superbugs": Six Protein Clusters Amenable for Distinct Scientific Applications. ACS OMEGA 2022; 7:46131-46145. [PMID: 36570227 PMCID: PMC9774411 DOI: 10.1021/acsomega.2c04076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 10/06/2022] [Indexed: 06/17/2023]
Abstract
Uncharacterized proteins have been underutilized as targets for the development of novel therapeutics for difficult-to-treat bacterial infections. To facilitate the exploration of these proteins, 2819 predicted, uncharacterized proteins (19.1% of the total) from reference strains of multidrug Acinetobacter baumannii, Klebsiella pneumoniae, and Pseudomonas aeruginosa species were organized using an unsupervised k-means machine learning algorithm. Classification using normalized values for protein length, pI, hydrophobicity, degree of conservation, structural disorder, and %AT of the coding gene rendered six natural clusters. Cluster proteins showed different trends regarding operon membership, expression, presence of unknown function domains, and interactomic relevance. Clusters 2, 4, and 5 were enriched with highly disordered proteins, nonworkable membrane proteins, and likely spurious proteins, respectively. Clusters 1, 3, and 6 showed closer distances to known antigens, antibiotic targets, and virulence factors. Up to 21.8% of proteins in these clusters were structurally covered by modeling, which allowed assessment of druggability and discontinuous B-cell epitopes. Five proteins (4 in Cluster 1) were potential druggable targets for antibiotherapy. Eighteen proteins (11 in Cluster 6) were strong B-cell and T-cell immunogen candidates for vaccine development. Conclusively, we provide a feature-based schema to fractionate the functional dark proteome of critical pathogens for fundamental and biomedical purposes.
Collapse
Affiliation(s)
- Carlos Sicilia
- Intrahospital
Infections Laboratory, National Centre for Microbiology, Instituto de Salud Carlos III (ISCIII), Majadahonda, 28220 Madrid, Spain
| | - Andrés Corral-Lugo
- Intrahospital
Infections Laboratory, National Centre for Microbiology, Instituto de Salud Carlos III (ISCIII), Majadahonda, 28220 Madrid, Spain
| | - Pawel Smialowski
- Core
Facility Bioinformatics, Biomedical Center Munich, Faculty of Medicine, Ludwig Maximilians Universität München, Munich 80539, Germany
- Institute
of Stem Cell Research, Helmholtz Center Munich, Planegg-Martinsried 82152, Germany
| | - Michael J. McConnell
- Intrahospital
Infections Laboratory, National Centre for Microbiology, Instituto de Salud Carlos III (ISCIII), Majadahonda, 28220 Madrid, Spain
| | - Antonio J. Martín-Galiano
- Intrahospital
Infections Laboratory, National Centre for Microbiology, Instituto de Salud Carlos III (ISCIII), Majadahonda, 28220 Madrid, Spain
| |
Collapse
|
4
|
Goel P, Panchal T, Kaushik N, Chauhan R, Saini S, Ahuja V, Thakur CJ. In silico functional and structural characterization revealed virulent proteins of Francisella tularensis strain SCHU4. MOLECULAR BIOLOGY RESEARCH COMMUNICATIONS 2022; 11:73-84. [PMID: 36059929 PMCID: PMC9336787 DOI: 10.22099/mbrc.2022.43128.1719] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Francisella tularensis is a pathogenic, aerobic gram-negative coccobacillus bacterium. It is the causative agent of tularemia, a rare infectious disease that can attack skin, lungs, eyes, and lymph nodes. The genome of F. tularensis has been sequenced, and ~16% of the proteome is still uncharacterized. Characterizations of these proteins are essential to find new drug targets for better therapeutics. In silico characterization of proteins has become an extremely important approach to determine the functionality of proteins as experimental functional elucidation is unable to keep pace with the current growth of the sequence database. Initially, we have annotated 577 Hypothetical Proteins (HPs) of F. tularensis strain SCHU4 with seven bioinformatics tools which characterized them based on the family, domain and motif. Out of 577 HPs, 119 HPs were annotated by five or more tools and are further screened to predict their virulence properties, subcellular localization, transmembrane helices as well as physicochemical parameters. VirulentPred predicted 66 HPs out of 119 as virulent. These virulent proteins were annotated to find the interacting partner using STRING, and proteins with high confidence interaction scores were used to predict their 3D structures using Phyre2. The three virulent proteins Q5NH99 (phosphoserine phosphatase), Q5NG42 (Cystathionine beta-synthase) and Q5NG83 (Rrf2-type helix turn helix domain) were predicted to involve in modulation of cytoskeletal and innate immunity of host, H2S (hydrogen sulfide) based antibiotic tolerance and nitrite and iron metabolism of bacteria. The above predicted virulent proteins can serve as novel drug targets in the era of antibiotic resistance.
Collapse
Affiliation(s)
- Prerna Goel
- Department of Bioinformatics, Goswami Ganesh Dutta Sanatan Dharma College, Sector 32 C, Chandigarh, India
| | - Tanya Panchal
- Department of Bioinformatics, Goswami Ganesh Dutta Sanatan Dharma College, Sector 32 C, Chandigarh, India
| | - Nandini Kaushik
- Department of Bioinformatics, Goswami Ganesh Dutta Sanatan Dharma College, Sector 32 C, Chandigarh, India
| | - Ritika Chauhan
- Department of Bioinformatics, Goswami Ganesh Dutta Sanatan Dharma College, Sector 32 C, Chandigarh, India
| | - Sandeep Saini
- Department of Bioinformatics, Goswami Ganesh Dutta Sanatan Dharma College, Sector 32 C, Chandigarh, India,Department of Biophysics, Panjab University, Sector 25, 160014, Chandigarh, India
| | - Vartika Ahuja
- Department of Bioinformatics, Goswami Ganesh Dutta Sanatan Dharma College, Sector 32 C, Chandigarh, India
| | - Chander Jyoti Thakur
- Department of Bioinformatics, Goswami Ganesh Dutta Sanatan Dharma College, Sector 32 C, Chandigarh, India,Corresponding Author: Department of Bioinformatics,Goswami Ganesh Dutta Sanatan Dharma College Sector 32 C, Chandigarh, India, 160030. Tel: +91 8699776533; Fax: +91 1722661077, E. mail:
| |
Collapse
|
5
|
Javier RA, Matías R, Alonso F, Renato C, Gloria L. A novel gene from the acidophilic bacterium Leptospirillum sp. CF-1 and its role in oxidative stress and chromate tolerance. Biol Res 2022; 55:19. [PMID: 35525996 PMCID: PMC9080137 DOI: 10.1186/s40659-022-00388-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Accepted: 04/15/2022] [Indexed: 11/16/2022] Open
Abstract
Background Acidophilic microorganisms like Leptospirillum sp. CF-1 thrive in environments with extremely low pH and high concentrations of dissolved heavy metals that can induce the generation of reactive oxygen species (ROS). Several hypothetical genes and proteins from Leptospirillum sp. CF-1 are known to be up-regulated under oxidative stress conditions. Results In the present work, the function of hypothetical gene ABH19_09590 from Leptospirillum sp. CF-1 was studied. Heterologous expression of this gene in Escherichia coli led to an increase in the ability to grow under oxidant conditions with 5 mM K2CrO4 or 5 mM H2O2. Similarly, a significant reduction in ROS production in E. coli transformed with a plasmid carrying ABH19_09590 was observed after exposure to these oxidative stress elicitors for 30 min, compared to a strain complemented with the empty vector. A co-transcriptional study using RT-PCR showed that ABH19_09590 is contained in an operon, here named the “och” operon, that also contains ABH19_09585, ABH19_09595 and ABH19_09600 genes. The expression of the och operon was significantly up-regulated in Leptospirillum sp. CF-1 exposed to 5 mM K2CrO4 for 15 and 30 min. Genes of this operon potentially encode a NADH:ubiquinone oxidoreductase, a CXXC motif-containing protein likely involved in thiol/disulfide exchange, a hypothetical protein, and a di-hydroxy-acid dehydratase. A comparative genomic analysis revealed that the och operon is a characteristic genetic determinant of the Leptospirillum genus that is not present in other acidophiles. Conclusions Altogether, these results suggest that the och operon plays a protective role against chromate and hydrogen peroxide and is an important mechanism required to face polyextremophilic conditions in acid environments.
Collapse
Affiliation(s)
- Rivera-Araya Javier
- Biology Department, Faculty of Chemistry and Biology, University of Santiago of Chile (USACH), Santiago, Chile
| | - Riveros Matías
- Biology Department, Faculty of Chemistry and Biology, University of Santiago of Chile (USACH), Santiago, Chile
| | - Ferrer Alonso
- Núcleo de Química y Bioquímica, Facultad de Ciencias, Universidad Mayor, Santiago, Chile
| | - Chávez Renato
- Biology Department, Faculty of Chemistry and Biology, University of Santiago of Chile (USACH), Santiago, Chile
| | - Levicán Gloria
- Biology Department, Faculty of Chemistry and Biology, University of Santiago of Chile (USACH), Santiago, Chile.
| |
Collapse
|
6
|
Subramanian D, Natarajan J. Leveraging big data bioinformatics approaches to extract knowledge from Staphylococcus aureus public omics data. Crit Rev Microbiol 2022; 49:391-413. [PMID: 35468027 DOI: 10.1080/1040841x.2022.2065905] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Staphylococcus aureus is a notorious pathogen posing challenges in the medical industry due to drug resistance and biofilm formation. The horizon of knowledge on S. aureus pathogenesis has expanded with the advancement of data-driven bioinformatics techniques. Mining information from sequenced genomes and their expression data is an economic approach that alleviates wastage of resources and redundancy in experiments. The current review covers how big data bioinformatics has been used in the analysis of S. aureus from publicly available -omics data to uncover mechanisms of infection and inhibition. Particularly, advances in the past two decades in biomarker discovery, host responses, phenotype identification, consolidation of information, and drug development are discussed highlighting the challenges and shortcomings. Overall, the review summarizes the diverse aspects of scrupulous re-analysis of S. aureus proteomic and transcriptomic expression datasets retrieved from public repositories in terms of the efforts taken, benefits offered, and follow-up actions. The detailed review thus serves as a reference and aid for (i) Computational biologists by briefing the approaches utilized for bacterial omics re-analysis concerning S. aureus and (ii) Experimental biologists by elucidating the potential of bioinformatics in biological research to generate reliable postulates in a prompt and economical manner.
Collapse
Affiliation(s)
- Devika Subramanian
- Data Mining and Text Mining Laboratory, Department of Bioinformatics, Bharathiar University, Coimbatore, India
| | - Jeyakumar Natarajan
- Data Mining and Text Mining Laboratory, Department of Bioinformatics, Bharathiar University, Coimbatore, India
| |
Collapse
|
7
|
Naveed M, Makhdoom SI, Abbas G, Safdari M, Farhadi A, Habtemariam S, Shabbir MA, Jabeen K, Asif MF, Tehreem S. The Virulent Hypothetical Proteins: The Potential Drug Target Involved in Bacterial Pathogenesis. Mini Rev Med Chem 2022; 22:2608-2623. [DOI: 10.2174/1389557522666220413102107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 12/01/2021] [Accepted: 01/21/2022] [Indexed: 11/22/2022]
Abstract
Abstract:
Hypothetical proteins (HPs) are non-predicted sequences that are identified only by open reading frames in sequenced genomes but their protein products remain uncharacterized by any experimental means. The genome of every species consists of HPs that are involved in various cellular processes and signaling pathways. Annotation of HPs is important as they play a key role in disease mechanisms, drug designing, vaccine production, antibiotic production, and host adaptation. In the case of bacteria, 25-50% of the genome comprises of HPs, which are involved in metabolic pathways and pathogenesis. The characterization of bacterial HPs helps to identify virulent proteins that are involved in pathogenesis. This can be done using in-silico studies, which provide sequence analogs, physiochemical properties, cellular or subcellular localization, structure and function validation, and protein-protein interactions. The most diverse types of virulent proteins are exotoxins, endotoxins, and adherent virulent factors that are encoded by virulent genes present on the chromosomal DNA of the bacteria. This review evaluates virulent HPs of pathogenic bacteria, such as Staphylococcus aureus, Chlamydia trachomatis, Fusobacterium nucleatum, and Yersinia pestis. The potential of these HPs as a drug target in bacteria-caused infectious diseases along with the mode of action and treatment approaches have been discussed.
Collapse
Affiliation(s)
- Muhammad Naveed
- Department of Biotechnology, Faculty of Life Sciences, University of Central Punjab, Pakistan
| | - Syeda Izma Makhdoom
- Department of Biotechnology, Faculty of Life Sciences, University of Central Punjab, Pakistan
| | - Ghulam Abbas
- National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Mohammadreza Safdari
- Department of Orthopedic Surgery, Faculty of Medicine, North Khorasan University of Medical Sciences, Bojnurd, Iran
| | - Amin Farhadi
- Kavian Institute of Higher Education, Mashhad, Iran
| | - Solomon Habtemariam
- Pharmacognosy Research Laboratories & Herbal Analysis Services UK, University of Greenwich, Medway Campus-Science, Grenville Building (G102/G107), Central Avenue, Chatham-Maritime, Kent, ME4 4TB, UK
| | - Muhammad Aqib Shabbir
- Department of Biotechnology, Faculty of Life Sciences, University of Central Punjab, Pakistan
| | - Khizra Jabeen
- Department of Biotechnology, Faculty of Life Sciences, University of Central Punjab, Pakistan
| | - Muhammad Farrukh Asif
- Department of Biotechnology, Faculty of Life Sciences, University of Central Punjab, Pakistan
| | - Sana Tehreem
- State Key Laboratory of Biocatalysis and Enzyme Engineering, School of Life Sciences, Hubei University, Wuhan 430062, Hubei, China
| |
Collapse
|
8
|
Shahbazi M, Tohidfar M, Azimzadeh Irani M, Moheb Seraj RG. Functional annotation and evaluation of hypothetical proteins in cyanobacterium Synechocystis sp. PCC 6803. BIOCATALYSIS AND AGRICULTURAL BIOTECHNOLOGY 2022. [DOI: 10.1016/j.bcab.2021.102246] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
|
9
|
Park A, Harris LK. Gene Expression Meta-Analysis Reveals Interferon-Induced Genes Associated With SARS Infection in Lungs. Front Immunol 2021; 12:694355. [PMID: 34367154 PMCID: PMC8342995 DOI: 10.3389/fimmu.2021.694355] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 07/05/2021] [Indexed: 01/01/2023] Open
Abstract
Background Severe Acute Respiratory Syndrome (SARS) corona virus (CoV) infections are a serious public health threat because of their pandemic-causing potential. This work is the first to analyze mRNA expression data from SARS infections through meta-analysis of gene signatures, possibly identifying therapeutic targets associated with major SARS infections. Methods This work defines 37 gene signatures representing SARS-CoV, Middle East Respiratory Syndrome (MERS)-CoV, and SARS-CoV2 infections in human lung cultures and/or mouse lung cultures or samples and compares them through Gene Set Enrichment Analysis (GSEA). To do this, positive and negative infectious clone SARS (icSARS) gene panels are defined from GSEA-identified leading-edge genes between two icSARS-CoV derived signatures, both from human cultures. GSEA then is used to assess enrichment and identify leading-edge icSARS panel genes between icSARS gene panels and 27 other SARS-CoV gene signatures. The meta-analysis is expanded to include five MERS-CoV and three SARS-CoV2 gene signatures. Genes associated with SARS infection are predicted by examining the intersecting membership of GSEA-identified leading-edges across gene signatures. Results Significant enrichment (GSEA p<0.001) is observed between two icSARS-CoV derived signatures, and those leading-edge genes defined the positive (233 genes) and negative (114 genes) icSARS panels. Non-random significant enrichment (null distribution p<0.001) is observed between icSARS panels and all verification icSARSvsmock signatures derived from human cultures, from which 51 over- and 22 under-expressed genes are shared across leading-edges with 10 over-expressed genes already associated with icSARS infection. For the icSARSvsmock mouse signature, significant, non-random significant enrichment held for only the positive icSARS panel, from which nine genes are shared with icSARS infection in human cultures. Considering other SARS strains, significant, non-random enrichment (p<0.05) is observed across signatures derived from other SARS strains for the positive icSARS panel. Five positive icSARS panel genes, CXCL10, OAS3, OASL, IFIT3, and XAF1, are found across mice and human signatures regardless of SARS strains. Conclusion The GSEA-based meta-analysis approach used here identifies genes with and without reported associations with SARS-CoV infections, highlighting this approach’s predictability and usefulness in identifying genes that have potential as therapeutic targets to preclude or overcome SARS infections.
Collapse
Affiliation(s)
- Amber Park
- Harris Interdisciplinary Research, Davenport University, Grand Rapids, MI, United States
| | - Laura K Harris
- Harris Interdisciplinary Research, Davenport University, Grand Rapids, MI, United States.,Institute for Cyber-Enabled Research, Michigan State University, East Lansing, MI, United States
| |
Collapse
|
10
|
Amatore Z, Gunn S, Harris LK. An Educational Bioinformatics Project to Improve Genome Annotation. Front Microbiol 2020; 11:577497. [PMID: 33365016 PMCID: PMC7750189 DOI: 10.3389/fmicb.2020.577497] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Accepted: 10/27/2020] [Indexed: 01/28/2023] Open
Abstract
Scientific advancement is hindered without proper genome annotation because biologists lack a complete understanding of cellular protein functions. In bacterial cells, hypothetical proteins (HPs) are open reading frames with unknown functions. HPs result from either an outdated database or insufficient experimental evidence (i.e., indeterminate annotation). While automated annotation reviews help keep genome annotation up to date, often manual reviews are needed to verify proper annotation. Students can provide the manual review necessary to improve genome annotation. This paper outlines an innovative classroom project that determines if HPs have outdated or indeterminate annotation. The Hypothetical Protein Characterization Project uses multiple well-documented, freely available, web-based, bioinformatics resources that analyze an amino acid sequence to (1) detect sequence similarities to other proteins, (2) identify domains, (3) predict tertiary structure including active site characterization and potential binding ligands, and (4) determine cellular location. Enough evidence can be generated from these analyses to support re-annotation of HPs or prioritize HPs for experimental examinations such as structural determination via X-ray crystallography. Additionally, this paper details several approaches for selecting HPs to characterize using the Hypothetical Protein Characterization Project. These approaches include student- and instructor-directed random selection, selection using differential gene expression from mRNA expression data, and selection based on phylogenetic relations. This paper also provides additional resources to support instructional use of the Hypothetical Protein Characterization Project, such as example assignment instructions with grading rubrics, links to training videos in YouTube, and several step-by-step example projects to demonstrate and interpret the range of achievable results that students might encounter. Educational use of the Hypothetical Protein Characterization Project provides students with an opportunity to learn and apply knowledge of bioinformatic programs to address scientific questions. The project is highly customizable in that HP selection and analysis can be specifically formulated based on the scope and purpose of each student's investigations. Programs used for HP analysis can be easily adapted to course learning objectives. The project can be used in both online and in-seat instruction for a wide variety of undergraduate and graduate classes as well as undergraduate capstone, honor's, and experiential learning projects.
Collapse
Affiliation(s)
- Zoie Amatore
- Science Department, Harris Interdisciplinary Research, Davenport University, Lansing, MI, United States
| | - Susan Gunn
- College of Urban Education, Davenport University, Grand Rapids, MI, United States
| | - Laura K. Harris
- Science Department, Harris Interdisciplinary Research, Davenport University, Lansing, MI, United States
| |
Collapse
|
11
|
Thakur CJ, Saini S, Notra A, Chauhan B, Arya S, Gupta R, Thakur J, Kumar V. Deciphering the functional role of hypothetical proteins from Chloroflexus aurantiacs J-10-f1 using bioinformatics approach. MOLECULAR BIOLOGY RESEARCH COMMUNICATIONS 2020; 9:129-139. [PMID: 33313333 PMCID: PMC7727763 DOI: 10.22099/mbrc.2020.36894.1495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
Chloroflexus aurantiacus J-10-f1 is an anoxygenic, photosynthetic, facultative autotrophic gram negative bacterium found from hot spring at a temperature range of 50-60°C. It can sustain itself in dark only if oxygen is available thereby exhibiting a dark orange color, however display a dark green color when grown in sunlight. Genome of the organism contains total of 3853 proteins out of which 785 (~20%) proteins are uncharacterised or hypothetical proteins (HPs). Therefore in this work we have characterized the 785 hypothetical proteins of Chloroflexus aurantiacus J-10-f1 using bioinformatics tools and databases. HPs annotated by more than five domain prediction tools were filtered and named high confidence-hypothetical proteins (HC-HPs). These HC-HPs were further annotated by calculating their physiochemical properties, homologous, subcellular locations, signal peptides and transmembrane regions. We found most of the HC-HPs were involved in photosynthesis, carbohydrate metabolism, biofuel production and cellulose synthesis processes. Furthermore, few of these HC-HPs could provide resistance to bacteria at high temperature due to their thermophilic nature. Hence these HC-HPs have the potential to be used in industrial as well as in biomedical needs. To conclude, the bioinformatics approach used in this study provides an insight to better understand the nature and role of Chloroflexus aurantiacus J-10-f1 hypothetical proteins.
Collapse
Affiliation(s)
- Chander Jyoti Thakur
- Department of Bioinformatics, GGDSD College, Sector 32-C, 160030, Chandigarh, India,Corresponding Author: Department of Bioinformatics, GGDSD College, Chandigarh, India. Tel: +91 8699776533 ; Fax: +91 172 2661077, E. mail:
| | - Sandeep Saini
- Department of Bioinformatics, GGDSD College, Sector 32-C, 160030, Chandigarh, India,Department of Biophysics, Panjab University, Sector 25, 160014, Chandigarh, India
| | - Aayushi Notra
- Department of Bioinformatics, GGDSD College, Sector 32-C, 160030, Chandigarh, India
| | - Bhavanshu Chauhan
- Department of Bioinformatics, GGDSD College, Sector 32-C, 160030, Chandigarh, India
| | - Sarthak Arya
- Department of Bioinformatics, GGDSD College, Sector 32-C, 160030, Chandigarh, India
| | - Rishabh Gupta
- Department of Bioinformatics, GGDSD College, Sector 32-C, 160030, Chandigarh, India
| | - Jyotsna Thakur
- Department of Bioinformatics, GGDSD College, Sector 32-C, 160030, Chandigarh, India
| | - Varinder Kumar
- Department of Bioinformatics, GGDSD College, Sector 32-C, 160030, Chandigarh, India
| |
Collapse
|
12
|
Goad B, Harris LK. Identification and prioritization of macrolideresistance genes with hypothetical annotation inStreptococcus pneumoniae. Bioinformation 2018; 14:488-498. [PMID: 31223208 PMCID: PMC6563660 DOI: 10.6026/97320630014488] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2018] [Accepted: 11/22/2018] [Indexed: 11/23/2022] Open
Abstract
Macrolide resistant Streptococcus pneumoniae infections have limited treatment options. While some resistance mechanisms are well
established, ample understanding is limited by incomplete genome annotation (hypothetical genes). Some hypothetical genes encode a
domain of unknown function (DUF), a conserved protein domain with uncharacterized function. Here, we identify and confirm macrolide
resistance genes. We further explore DUFs from macrolide resistance hypothetical genes to prioritize them for experimental
characterization. We found gene similarities between two macrolide resistance gene signatures from untreated and either erythromycin- or
spiramycin-treated resistant Streptococcus pneumoniae. We confirmed the association of these gene sets with macrolide resistance through
comparison to gene signatures from (i) second erythromycin resistant Streptococcus pneumoniae strain, and (ii) erythromycin-treated
sensitive Streptococcus pneumoniae strain, both from non-overlapping datasets. Examination into which cellular processes these macrolide
resistance genes belong found connections to known resistance mechanisms such as increased amino acid biosynthesis and efflux genes,
and decreased ribonucleotide biosynthesis genes, highlighting the predictive ability of the method used. 22 genes had hypothetical
annotation with 10 DUFs associated with macrolide resistance. DUF characterization could uncover novel co-therapies that restore
macrolide efficacy across multiple macrolide resistant species. Application of the methods to other antibiotic resistances could
revolutionize treatment of resistant infections
Collapse
Affiliation(s)
- Blue Goad
- Davenport University, 200 S. Grand Ave, Lansing, MI, 48933, USA
| | - Laura K Harris
- Davenport University, 200 S. Grand Ave, Lansing, MI, 48933, USA
| |
Collapse
|
13
|
da Costa WLO, Araújo CLDA, Dias LM, Pereira LCDS, Alves JTC, Araújo FA, Folador EL, Henriques I, Silva A, Folador ARC. Functional annotation of hypothetical proteins from the Exiguobacterium antarcticum strain B7 reveals proteins involved in adaptation to extreme environments, including high arsenic resistance. PLoS One 2018; 13:e0198965. [PMID: 29940001 PMCID: PMC6016940 DOI: 10.1371/journal.pone.0198965] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2018] [Accepted: 05/28/2018] [Indexed: 02/07/2023] Open
Abstract
Exiguobacterium antarcticum strain B7 is a psychrophilic Gram-positive bacterium that possesses enzymes that can be used for several biotechnological applications. However, many proteins from its genome are considered hypothetical proteins (HPs). These functionally unknown proteins may indicate important functions regarding the biological role of this bacterium, and the use of bioinformatics tools can assist in the biological understanding of this organism through functional annotation analysis. Thus, our study aimed to assign functions to proteins previously described as HPs, present in the genome of E. antarcticum B7. We used an extensive in silico workflow combining several bioinformatics tools for function annotation, sub-cellular localization and physicochemical characterization, three-dimensional structure determination, and protein-protein interactions. This genome contains 2772 genes, of which 765 CDS were annotated as HPs. The amino acid sequences of all HPs were submitted to our workflow and we successfully attributed function to 132 HPs. We identified 11 proteins that play important roles in the mechanisms of adaptation to adverse environments, such as flagellar biosynthesis, biofilm formation, carotenoids biosynthesis, and others. In addition, three predicted HPs are possibly related to arsenic tolerance. Through an in vitro assay, we verified that E. antarcticum B7 can grow at high concentrations of this metal. The approach used was important to precisely assign function to proteins from diverse classes and to infer relationships with proteins with functions already described in the literature. This approach aims to produce a better understanding of the mechanism by which this bacterium adapts to extreme environments and to the finding of targets with biotechnological interest.
Collapse
Affiliation(s)
- Wana Lailan Oliveira da Costa
- Laboratory of Genomic and Bioinformatics, Center of Genomics and System Biology, Institute of Biological Science, Federal University of Para, Belém, Pará, Brazil
| | - Carlos Leonardo de Aragão Araújo
- Laboratory of Genomic and Bioinformatics, Center of Genomics and System Biology, Institute of Biological Science, Federal University of Para, Belém, Pará, Brazil
| | - Larissa Maranhão Dias
- Laboratory of Genomic and Bioinformatics, Center of Genomics and System Biology, Institute of Biological Science, Federal University of Para, Belém, Pará, Brazil
| | - Lino César de Sousa Pereira
- Laboratory of Genomic and Bioinformatics, Center of Genomics and System Biology, Institute of Biological Science, Federal University of Para, Belém, Pará, Brazil
| | - Jorianne Thyeska Castro Alves
- Laboratory of Genomic and Bioinformatics, Center of Genomics and System Biology, Institute of Biological Science, Federal University of Para, Belém, Pará, Brazil
| | - Fabrício Almeida Araújo
- Laboratory of Genomic and Bioinformatics, Center of Genomics and System Biology, Institute of Biological Science, Federal University of Para, Belém, Pará, Brazil
| | - Edson Luiz Folador
- Biotechnology Center, Federal University of Paraiba, João Pessoa, Paraíba, Brazil
| | - Isabel Henriques
- Biology Department & CESAM, University of Aveiro, Aveiro, Portugal
| | - Artur Silva
- Laboratory of Genomic and Bioinformatics, Center of Genomics and System Biology, Institute of Biological Science, Federal University of Para, Belém, Pará, Brazil
| | - Adriana Ribeiro Carneiro Folador
- Laboratory of Genomic and Bioinformatics, Center of Genomics and System Biology, Institute of Biological Science, Federal University of Para, Belém, Pará, Brazil
- * E-mail: ,
| |
Collapse
|
14
|
Naveed M, Tehreem S, Usman M, Chaudhry Z, Abbas G. Structural and functional annotation of hypothetical proteins of human adenovirus: prioritizing the novel drug targets. BMC Res Notes 2017; 10:706. [PMID: 29212526 PMCID: PMC5719520 DOI: 10.1186/s13104-017-2992-z] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2017] [Accepted: 11/25/2017] [Indexed: 01/11/2023] Open
Abstract
OBJECTIVE Human adenoviruses are small double stranded DNA viruses that provoke vast array of human diseases. Next generation sequencing techniques increase genomic data of HAdV rapidly, which increase their serotypes. The complete genome sequence of human adenovirus shows that it contains large amount of proteins with unknown cellular or biochemical function, known as hypothetical proteins. Hence, it is indispensable to functionally and structurally annotate these proteins to get better understanding of the novel drug targets. The purpose was the characterization of 38 randomly retrieved hypothetical proteins through determination of their physiochemical properties, subcellular localization, function, structure and ligand binding sites using various sequence and structure based bioinformatics tools. RESULTS Function of six hypothetical proteins P03269, P03261, P03263, Q83127, Q1L4D7 and I6LEV1 were predicted confidently and then used further for structure analysis. We found that these proteins may act as DNA terminal protein, DNA polymerase, DNA binding protein, adenovirus E3 region protein CR1 and adenoviral protein L1. Functional and structural annotation leading to detection of binding sites by means of docking analysis can indicate potential target for therapeutics to defeat adenoviral infection.
Collapse
Affiliation(s)
- Muhammad Naveed
- Department of Biotechnology, Faculty of Life Sciences, University of Central Punjab, Lahore, 54000 Pakistan
- Department of Biochemistry and Biotechnology, University of Gujrat, Gujrat, 50700 Pakistan
| | - Sana Tehreem
- Department of Biochemistry and Biotechnology, University of Gujrat, Gujrat, 50700 Pakistan
| | - Muhammad Usman
- Department of Biochemistry and Biotechnology, University of Gujrat, Gujrat, 50700 Pakistan
| | - Zoma Chaudhry
- Department of Biochemistry and Biotechnology, University of Gujrat, Gujrat, 50700 Pakistan
| | - Ghulam Abbas
- Department of Biochemistry and Biotechnology, University of Gujrat, Gujrat, 50700 Pakistan
| |
Collapse
|
15
|
Marklevitz J, Harris LK. Improved Annotations of 23 Differentially Expressed Hypothetical Proteins in Methicillin Resistant S.aureus. Bioinformation 2017; 13:104-110. [PMID: 28539731 PMCID: PMC5429968 DOI: 10.6026/97320630013104] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2017] [Revised: 04/12/2017] [Accepted: 04/12/2017] [Indexed: 11/23/2022] Open
Abstract
Antibiotic resistant Staphylococcus aureus is a major public health concern effecting millions of people annually. Medical science has documented completely untreatable S. aureus infections. These strains are appearing in the community with increasing frequency. New diagnostic and therapeutic options are needed to combat this deadly infection. Interestingly, around 50% of the proteins in S. aureus are annotated as hypothetical. Methods to select hypothetical proteins related to antibiotic resistance have been inadequate. This study uses differential gene expression to identify hypothetical proteins related to antibiotic resistant phenotype strain variations. We apply computational tools to predict physiochemical properties, cellular location, sequence-based homologs, domains, 3D modeling, active site features, and binding partners. Nine of 23 hypothetical proteins were <100 residues, unlikely to be functional proteins based on size. Of the 14 differentially expressed hypothetical proteins examined, confident predictions on function could not be made. Most identified domains had unknown functions. Six hypothetical protein models had >50% confidence over >20% residues. These findings indicate the method of hypothetical protein identification is sufficient; however, current scientific knowledge is inadequate to properly annotate these proteins. This process should be repeated regularly until entire genomes are clearly and accurately annotated.
Collapse
Affiliation(s)
- Jessica Marklevitz
- Department of Science, Davenport University, Lansing, Michigan, United States of America
| | - Laura K. Harris
- Department of Science, Davenport University, Lansing, Michigan, United States of America
- Department of Health Informatics, Rutgers School of Health Professions, Newark, New Jersey, United States of America
| |
Collapse
|