1
|
Kim S, Yu B, Li Q, Bolton EE. PubChem synonym filtering process using crowdsourcing. J Cheminform 2024; 16:69. [PMID: 38880887 PMCID: PMC11181558 DOI: 10.1186/s13321-024-00868-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Accepted: 06/09/2024] [Indexed: 06/18/2024] Open
Abstract
PubChem ( https://pubchem.ncbi.nlm.nih.gov ) is a public chemical information resource containing more than 100 million unique chemical structures. One of the most requested tasks in PubChem and other chemical databases is to search chemicals by name (also commonly called a "chemical synonym"). PubChem performs this task by looking up chemical synonym-structure associations provided by individual depositors to PubChem. In addition, these synonyms are used for many purposes, including creating links between chemicals and PubMed articles (using Medical Subject Headings (MeSH) terms). However, these depositor-provided name-structure associations are subject to substantial discrepancies within and between depositors, making it difficult to unambiguously map a chemical name to a specific chemical structure. The present paper describes PubChem's crowdsourcing-based synonym filtering strategy, which resolves inter- and intra-depositor discrepancies in synonym-structure associations as well as in the chemical-MeSH associations. The PubChem synonym filtering process was developed based on the analysis of four crowd-voting strategies, which differ in the consistency threshold value employed (60% vs 70%) and how to resolve intra-depositor discrepancies (a single vote vs. multiple votes per depositor) prior to inter-depositor crowd-voting. The agreement of voting was determined at six levels of chemical equivalency, which considers varying isotopic composition, stereochemistry, and connectivity of chemical structures and their primary components. While all four strategies showed comparable results, Strategy I (one vote per depositor with a 60% consistency threshold) resulted in the most synonyms assigned to a single chemical structure as well as the most synonym-structure associations disambiguated at the six chemical equivalency contexts. Based on the results of this study, Strategy I was implemented in PubChem's filtering process that cleans up synonym-structure associations as well as chemical-MeSH associations. This consistency-based filtering process is designed to look for a consensus in name-structure associations but cannot attest to their correctness. As a result, it can fail to recognize correct name-structure associations (or incorrect ones), for example, when a synonym is provided by only one depositor or when many contributors are incorrect. However, this filtering process is an important starting point for quality control in name-structure associations in large chemical databases like PubChem.
Collapse
Affiliation(s)
- Sunghwan Kim
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Bo Yu
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Qingliang Li
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Evan E Bolton
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA.
| |
Collapse
|
2
|
Differential Transcriptome Profiling Unveils Novel Deregulated Gene Signatures Involved in Pathogenesis of Alzheimer's Disease. Biomedicines 2022; 10:biomedicines10030611. [PMID: 35327413 PMCID: PMC8945049 DOI: 10.3390/biomedicines10030611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Revised: 02/13/2022] [Accepted: 02/28/2022] [Indexed: 02/01/2023] Open
Abstract
Alzheimer’s disease (AD) is a neurodegenerative disorder that is characterized by a progressive loss of cognitive functions at a higher level than normal aging. Although the apolipoprotein (APOE) gene is a major risk factor in developing AD, other genes have also been reported to be linked with complex phenotypes. Therefore, this genome-wide expression study explored differentially expressed genes as possible novel biomarkers involved in AD. The mRNA expression dataset, GSE28146, containing 15 sample data composed of 7 AD cases from the hippocampus region with age-matched control (n = 8, >80 years), was analyzed. Using “affy” R-package, mRNA expression was calculated, while pathway enrichment analysis was performed to determine related biological processes. Of 58 differentially expressed genes, 44 downregulated and 14 upregulated genes were found to be significantly (p < 0.001) altered. The pathway enrichment analysis revealed two altered genes, i.e., dynein light chain 1 (DYNLL1) and kalirin (KLRN), associated with AD in the elderly population. The majority of genes were associated with retrograde endocannabinoid as well as vascular endothelial growth factors affecting the complex phenotypes. The DYNLL1 and KLRN genes may be involved with AD and Huntington’s disease (HD) phenotypes and represent a common genetic basis of these diseases. However, the hallmark of AD is dementia, while the classic motor sign of HD includes chorea. Our data warrant further investigation to identify the role of these genes in disease pathogenesis.
Collapse
|
3
|
Choudhury C, Bhardwaj A. Hybrid Dynamic Pharmacophore Models as Effective Tools to Identify Novel Chemotypes for Anti-TB Inhibitor Design: A Case Study With Mtb-DapB. Front Chem 2020; 8:596412. [PMID: 33425853 PMCID: PMC7793862 DOI: 10.3389/fchem.2020.596412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Accepted: 10/28/2020] [Indexed: 11/13/2022] Open
Abstract
Antimicrobial resistance (AMR) is one of the most serious global public health threats as it compromises the successful treatment of deadly infectious diseases like tuberculosis. New therapeutics are constantly needed but it takes a long time and is expensive to explore new biochemical space. One way to address this issue is to repurpose the validated targets and identify novel chemotypes that can simultaneously bind to multiple binding pockets of these targets as a new lead generation strategy. This study reports such a strategy, dynamic hybrid pharmacophore model (DHPM), which represents the combined interaction features of different binding pockets contrary to the conventional approaches, where pharmacophore models are generated from single binding sites. We have considered Mtb-DapB, a validated mycobacterial drug target, as our model system to explore the effectiveness of DHPMs to screen novel unexplored compounds. Mtb-DapB has a cofactor binding site (CBS) and an adjacent substrate binding site (SBS). Four different model systems of Mtb-DapB were designed where, either NADPH/NADH occupies CBS in presence/absence of an inhibitor 2, 6-PDC in the adjacent SBS. Two more model systems were designed, where 2, 6-PDC was linked to NADPH and NADH to form hybrid molecules. The six model systems were subjected to 200 ns molecular dynamics simulations and trajectories were analyzed to identify stable ligand-receptor interaction features. Based on these interactions, conventional pharmacophore models (CPM) were generated from the individual binding sites while DHPMs were created from hybrid-molecules occupying both binding sites. A huge library of 1,563,764 publicly available molecules were screened by CPMs and DHPMs. The screened hits obtained from both types of models were compared based on their Hashed binary molecular fingerprints and 4-point pharmacophore fingerprints using Tanimoto, Cosine, Dice and Tversky similarity matrices. Molecules screened by DHPM exhibited significant structural diversity, better binding strength and drug like properties as compared to the compounds screened by CPMs indicating the efficiency of DHPM to explore new chemical space for anti-TB drug discovery. The idea of DHPM can be applied for a wide range of mycobacterial or other pathogen targets to venture into unexplored chemical space.
Collapse
Affiliation(s)
- Chinmayee Choudhury
- Department of Experimental Medicine and Biotechnology, Post Graduate Institute of Medical Education and Research, Chandigarh, India
| | - Anshu Bhardwaj
- Bioinformatics Centre, Council of Scientific and Industrial Research-Institute of Microbial Technology, Chandigarh, India
| |
Collapse
|
4
|
Kumar S, Sahu P, Jena L. An In silico approach to identify potential inhibitors against multiple drug targets of Mycobacterium tuberculosis. Int J Mycobacteriol 2019; 8:252-261. [DOI: 10.4103/ijmy.ijmy_109_19] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
|
5
|
Créquit P, Mansouri G, Benchoufi M, Vivot A, Ravaud P. Mapping of Crowdsourcing in Health: Systematic Review. J Med Internet Res 2018; 20:e187. [PMID: 29764795 PMCID: PMC5974463 DOI: 10.2196/jmir.9330] [Citation(s) in RCA: 70] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Revised: 02/10/2018] [Accepted: 03/14/2018] [Indexed: 11/22/2022] Open
Abstract
Background Crowdsourcing involves obtaining ideas, needed services, or content by soliciting Web-based contributions from a crowd. The 4 types of crowdsourced tasks (problem solving, data processing, surveillance or monitoring, and surveying) can be applied in the 3 categories of health (promotion, research, and care). Objective This study aimed to map the different applications of crowdsourcing in health to assess the fields of health that are using crowdsourcing and the crowdsourced tasks used. We also describe the logistics of crowdsourcing and the characteristics of crowd workers. Methods MEDLINE, EMBASE, and ClinicalTrials.gov were searched for available reports from inception to March 30, 2016, with no restriction on language or publication status. Results We identified 202 relevant studies that used crowdsourcing, including 9 randomized controlled trials, of which only one had posted results at ClinicalTrials.gov. Crowdsourcing was used in health promotion (91/202, 45.0%), research (73/202, 36.1%), and care (38/202, 18.8%). The 4 most frequent areas of application were public health (67/202, 33.2%), psychiatry (32/202, 15.8%), surgery (22/202, 10.9%), and oncology (14/202, 6.9%). Half of the reports (99/202, 49.0%) referred to data processing, 34.6% (70/202) referred to surveying, 10.4% (21/202) referred to surveillance or monitoring, and 5.9% (12/202) referred to problem-solving. Labor market platforms (eg, Amazon Mechanical Turk) were used in most studies (190/202, 94%). The crowd workers’ characteristics were poorly reported, and crowdsourcing logistics were missing from two-thirds of the reports. When reported, the median size of the crowd was 424 (first and third quartiles: 167-802); crowd workers’ median age was 34 years (32-36). Crowd workers were mainly recruited nationally, particularly in the United States. For many studies (58.9%, 119/202), previous experience in crowdsourcing was required, and passing a qualification test or training was seldom needed (11.9% of studies; 24/202). For half of the studies, monetary incentives were mentioned, with mainly less than US $1 to perform the task. The time needed to perform the task was mostly less than 10 min (58.9% of studies; 119/202). Data quality validation was used in 54/202 studies (26.7%), mainly by attention check questions or by replicating the task with several crowd workers. Conclusions The use of crowdsourcing, which allows access to a large pool of participants as well as saving time in data collection, lowering costs, and speeding up innovations, is increasing in health promotion, research, and care. However, the description of crowdsourcing logistics and crowd workers’ characteristics is frequently missing in study reports and needs to be precisely reported to better interpret the study findings and replicate them.
Collapse
Affiliation(s)
- Perrine Créquit
- INSERM UMR1153, Methods Team, Epidemiology and Statistics Sorbonne Paris Cité Research Center, Paris Descartes University, Paris, France.,Centre d'Epidémiologie Clinique, Hôpital Hôtel Dieu, Assistance Publique des Hôpitaux de Paris, Paris, France.,Cochrane France, Paris, France
| | - Ghizlène Mansouri
- INSERM UMR1153, Methods Team, Epidemiology and Statistics Sorbonne Paris Cité Research Center, Paris Descartes University, Paris, France
| | - Mehdi Benchoufi
- Centre d'Epidémiologie Clinique, Hôpital Hôtel Dieu, Assistance Publique des Hôpitaux de Paris, Paris, France
| | - Alexandre Vivot
- INSERM UMR1153, Methods Team, Epidemiology and Statistics Sorbonne Paris Cité Research Center, Paris Descartes University, Paris, France.,Centre d'Epidémiologie Clinique, Hôpital Hôtel Dieu, Assistance Publique des Hôpitaux de Paris, Paris, France
| | - Philippe Ravaud
- INSERM UMR1153, Methods Team, Epidemiology and Statistics Sorbonne Paris Cité Research Center, Paris Descartes University, Paris, France.,Centre d'Epidémiologie Clinique, Hôpital Hôtel Dieu, Assistance Publique des Hôpitaux de Paris, Paris, France.,Cochrane France, Paris, France.,Department of Epidemiology, Columbia University, Mailman School of Public Health, New York, NY, United States
| |
Collapse
|
6
|
Janardhan S, John L, Prasanthi M, Poroikov V, Narahari Sastry G. A QSAR and molecular modelling study towards new lead finding: polypharmacological approach to Mycobacterium tuberculosis. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2017; 28:815-832. [PMID: 29183232 DOI: 10.1080/1062936x.2017.1398782] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2017] [Accepted: 10/25/2017] [Indexed: 06/07/2023]
Abstract
Developing effective inhibitors against Mycobacterium tuberculosis (Mtb) is a challenging task, primarily due to the emergence of resistant strains. In this study, we have proposed and implemented an in silico guided polypharmacological approach, which is expected to be effective against resistant strains by simultaneously inhibiting several potential Mtb drug targets. A combination of pharmacophore and QSAR based virtual screening strategy taking three key targets such as InhA (enoyl-acyl-carrier-protein reductase), GlmU (N-acetyl-glucosamine-1-phosphate uridyltransferase) and DapB (dihydrodipicolinate reductase) have resulted in initial 784 hits from Asinex database of 435,000 compounds. These hits were further subjected to docking with 33 Mtb druggable targets. About 110 potential polypharmacological hits were taken by integrating the aforementioned screening protocols. Further screening was conducted by taking various parameters and properties such as cell permeability, drug-likeness, drug-induced phospholipidosisand structural alerts. A consensus analysis has yielded 59 potential hits that pass through all the filters and can be prioritized for effective drug-resistant tuberculosis. This study proposes about nine potential hits which are expected to be promising molecules, having not only drug-like properties, but also being effective against multiple Mtb targets.
Collapse
Affiliation(s)
- S Janardhan
- a Centre for Molecular Modelling , CSIR-Indian Institute of Chemical Technology , Hyderabad - 500007 , India
| | - L John
- a Centre for Molecular Modelling , CSIR-Indian Institute of Chemical Technology , Hyderabad - 500007 , India
| | - M Prasanthi
- a Centre for Molecular Modelling , CSIR-Indian Institute of Chemical Technology , Hyderabad - 500007 , India
| | - V Poroikov
- b Institute of Biomedical Chemistry , Moscow , 119121 , Russia
| | - G Narahari Sastry
- a Centre for Molecular Modelling , CSIR-Indian Institute of Chemical Technology , Hyderabad - 500007 , India
| |
Collapse
|
7
|
Decoding the similarities and differences among mycobacterial species. PLoS Negl Trop Dis 2017; 11:e0005883. [PMID: 28854187 PMCID: PMC5595346 DOI: 10.1371/journal.pntd.0005883] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2017] [Revised: 09/12/2017] [Accepted: 08/18/2017] [Indexed: 11/19/2022] Open
Abstract
Mycobacteriaceae comprises pathogenic species such as Mycobacterium tuberculosis, M. leprae and M. abscessus, as well as non-pathogenic species, for example, M. smegmatis and M. thermoresistibile. Genome comparison and annotation studies provide insights into genome evolutionary relatedness, identify unique and pathogenicity-related genes in each species, and explore new targets that could be used for developing new diagnostics and therapeutics. Here, we present a comparative analysis of ten-mycobacterial genomes with the objective of identifying similarities and differences between pathogenic and non-pathogenic species. We identified 1080 core orthologous clusters that were enriched in proteins involved in amino acid and purine/pyrimidine biosynthetic pathways, DNA-related processes (replication, transcription, recombination and repair), RNA-methylation and modification, and cell-wall polysaccharide biosynthetic pathways. For their pathogenicity and survival in the host cell, pathogenic species have gained specific sets of genes involved in repair and protection of their genomic DNA. M. leprae is of special interest owing to its smallest genome (1600 genes and ~1300 psuedogenes), yet poor genome annotation. More than 75% of the pseudogenes were found to have a functional ortholog in the other mycobacterial genomes and belong to protein families such as transferases, oxidoreductases and hydrolases. Members of the Mycobacteriaceae family, which are known to adapt to different environmental niches, comprise bacterial species with varied genome sizes. They are unique in their cell-wall composition, which is remarkably thick and lipid-rich as compared to other bacteria. We performed a comparative analysis at the proteome level for ten mycobacterial species that differ in their pathogenicity, genome size and environmental niches. A total of 1080 orthologous clusters with representation from all ten species were obtained, and these were further examined for their domain annotations, domain architecture similarities and enriched GO terms. These core orthologous clusters are enriched in various biosynthetic pathways. The proteins that are specific to each of the ten species were also investigated for their GO functions. The M. leprae genome has a large number of pseudogenes and we searched for their functional orthologs in other mycobacterial species in order to understand the functions that are lost from the M. leprae genome. The proteins present exclusively in M. leprae genome were studied in more detail, in order to predict putative drug targets and diagnostic markers. These findings, which have implications in understanding evolution of mycobacterial genomes, identify species-specific proteins that have potential for use in developing new diagnostic tools and therapeutics.
Collapse
|
8
|
Alexander NS, Palczewski K. Crowd sourcing difficult problems in protein science . Protein Sci 2017; 26:2118-2125. [PMID: 28762619 DOI: 10.1002/pro.3247] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Accepted: 07/21/2017] [Indexed: 11/08/2022]
Abstract
Dedicated computing resources are expensive to develop, maintain, and administrate. Frequently, research groups require bursts of computing power, during which progress is still limited by available computing resources. One way to alleviate this bottleneck would be to use additional computing resources. Today, many computing devices remain idle most of the time. Passive volunteer computing exploits this unemployed reserve of computing power by allowing device-owners to donate computing time on their own devices. Another complementary way to alleviate bottlenecks in computing resources is to use more efficient algorithms. Engaging volunteer computing employs human intuition to help solve challenging problems for which efficient algorithms are difficult to develop or unavailable. Designing engaging volunteer computing projects is challenging but can result in high-quality solutions. Here, we highlight four examples.
Collapse
Affiliation(s)
- Nathan S Alexander
- Department of Pharmacology, School of Medicine, Case Western Reserve University, Cleveland, Ohio, 44106
| | - Krzysztof Palczewski
- Department of Pharmacology, School of Medicine, Case Western Reserve University, Cleveland, Ohio, 44106.,Cleveland Center for Membrane and Structural Biology, Case Western Reserve University, Cleveland, Ohio, 44106
| |
Collapse
|
9
|
Abstract
The generation of large-scale biomedical data is creating unprecedented opportunities for basic and translational science. Typically, the data producers perform initial analyses, but it is very likely that the most informative methods may reside with other groups. Crowdsourcing the analysis of complex and massive data has emerged as a framework to find robust methodologies. When the crowdsourcing is done in the form of collaborative scientific competitions, known as Challenges, the validation of the methods is inherently addressed. Challenges also encourage open innovation, create collaborative communities to solve diverse and important biomedical problems, and foster the creation and dissemination of well-curated data repositories.
Collapse
|
10
|
Data Intensive Genome Level Analysis for Identifying Novel, Non-Toxic Drug Targets for Multi Drug Resistant Mycobacterium tuberculosis. Sci Rep 2017; 7:46595. [PMID: 28425478 PMCID: PMC5397868 DOI: 10.1038/srep46595] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2016] [Accepted: 03/22/2017] [Indexed: 11/08/2022] Open
Abstract
We report the construction of a novel Systems Biology based virtual drug discovery model for the prediction of non-toxic metabolic targets in Mycobacterium tuberculosis (Mtb). This is based on a data-intensive genome level analysis and the principle of conservation of the evolutionarily important genes. In the 1623 sequenced Mtb strains, 890 metabolic genes identified through a systems approach in Mtb were evaluated for non-synonymous mutations. The 33 genes showed none or one variation in the entire 1623 strains, including 1084 Russian MDR strains. These invariant targets were further evaluated for their experimental and in silico essentiality as well as availability of their crystal structure in Protein Data Bank (PDB). Along with this, targets for the common existing antibiotics and the new Tb drug candidates were also screened for their variation across 1623 strains of Mtb for understanding the drug resistance. We propose that the reduced set of these reported targets could be a more effective starting point for medicinal chemists in generating new chemical leads. This approach has the potential of fueling the dried up Tuberculosis (Tb) drug discovery pipeline.
Collapse
|
11
|
Namasivayam AA, Morales AF, Lacave ÁMF, Tallam A, Simovic B, Alfaro DG, Bobbili DR, Martin F, Androsova G, Shvydchenko I, Park J, Calvo JV, Hoeng J, Peitsch MC, Racero MGV, Biryukov M, Talikka M, Pérez MB, Rohatgi N, Díaz-Díaz N, Mandarapu R, Ruiz RA, Davidyan S, Narayanasamy S, Boué S, Guryanova S, Arbas SM, Menon S, Xiang Y. Community-Reviewed Biological Network Models for Toxicology and Drug Discovery Applications. GENE REGULATION AND SYSTEMS BIOLOGY 2016; 10:51-66. [PMID: 27429547 PMCID: PMC4944831 DOI: 10.4137/grsb.s39076] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/04/2016] [Revised: 03/31/2016] [Accepted: 04/12/2016] [Indexed: 12/13/2022]
Abstract
Biological network models offer a framework for understanding disease by describing the relationships between the mechanisms involved in the regulation of biological processes. Crowdsourcing can efficiently gather feedback from a wide audience with varying expertise. In the Network Verification Challenge, scientists verified and enhanced a set of 46 biological networks relevant to lung and chronic obstructive pulmonary disease. The networks were built using Biological Expression Language and contain detailed information for each node and edge, including supporting evidence from the literature. Network scoring of public transcriptomics data inferred perturbation of a subset of mechanisms and networks that matched the measured outcomes. These results, based on a computable network approach, can be used to identify novel mechanisms activated in disease, quantitatively compare different treatments and time points, and allow for assessment of data with low signal. These networks are periodically verified by the crowd to maintain an up-to-date suite of networks for toxicology and drug discovery applications.
Collapse
Affiliation(s)
| | - Aishwarya Alex Namasivayam
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Campus Belval, Esch-sur-Alzette, Luxembourg
| | | | | | - Aravind Tallam
- TWINCORE, Zentrum für Experimentelle und Klinische Infektionsforschung, Hannover, Germany
| | | | | | - Dheeraj Reddy Bobbili
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Campus Belval, Esch-sur-Alzette, Luxembourg
| | - Florian Martin
- Philip Morris International R&D, Philip Morris Products S.A., Quai Jeanrenaud, Neuchâtel, Switzerland (part of Philip Morris International group of companies)
| | - Ganna Androsova
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Campus Belval, Esch-sur-Alzette, Luxembourg
| | - Irina Shvydchenko
- Kuban State University of Physical Education, Sport and Tourism, Krasnodar, Russia
| | | | - Jorge Val Calvo
- Center for Molecular Biology, “Severo Ochoa” – Spanish National Research Council, Madrid, Spain
| | - Julia Hoeng
- Philip Morris International R&D, Philip Morris Products S.A., Quai Jeanrenaud, Neuchâtel, Switzerland (part of Philip Morris International group of companies)
| | - Manuel C. Peitsch
- Philip Morris International R&D, Philip Morris Products S.A., Quai Jeanrenaud, Neuchâtel, Switzerland (part of Philip Morris International group of companies)
| | | | - Maria Biryukov
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Campus Belval, Esch-sur-Alzette, Luxembourg
| | - Marja Talikka
- Philip Morris International R&D, Philip Morris Products S.A., Quai Jeanrenaud, Neuchâtel, Switzerland (part of Philip Morris International group of companies)
| | | | - Neha Rohatgi
- Center for Systems Biology, University of Iceland, Reykjavik, Iceland
| | | | - Rajesh Mandarapu
- Prakhya Research Laboratories, Lakshminagar, Selaiyur, Chennai, Tamil Nadu, India
| | | | - Sergey Davidyan
- Institute of Biochemical Physics Russian Academy of Sciences, Moscow, Russia
| | - Shaman Narayanasamy
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Campus Belval, Esch-sur-Alzette, Luxembourg
| | - Stéphanie Boué
- Philip Morris International R&D, Philip Morris Products S.A., Quai Jeanrenaud, Neuchâtel, Switzerland (part of Philip Morris International group of companies)
| | - Svetlana Guryanova
- Institute of Bioorganic Chemistry Russian Academy of Sciences, Moscow, Russia
| | - Susana Martínez Arbas
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Campus Belval, Esch-sur-Alzette, Luxembourg
| | - Swapna Menon
- AnalyzeDat Consulting Services, Edapally Byepass Junction, Kochi, Kerala, India
| | - Yang Xiang
- Philip Morris International R&D, Philip Morris Products S.A., Quai Jeanrenaud, Neuchâtel, Switzerland (part of Philip Morris International group of companies)
| |
Collapse
|
12
|
Sridhar S, Dash P, Guruprasad K. Comparative analyses of the proteins from Mycobacterium tuberculosis and human genomes: Identification of potential tuberculosis drug targets. Gene 2016; 579:69-74. [PMID: 26762852 DOI: 10.1016/j.gene.2015.12.054] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2015] [Accepted: 12/26/2015] [Indexed: 12/13/2022]
Abstract
Tuberculosis, one of the major infectious diseases affecting human beings is caused by the bacillus Mycobacterium tuberculosis. Increased resistance to known drugs commonly used for the treatment of tuberculosis has created an urgent need to identify new targets for validation and to develop drugs. In this study, we have used various bioinformatics tools, to compare the protein sequences from twenty-three M. tuberculosis genome strains along with the known human protein sequences, in order to identify the 'conserved' M. tuberculosis proteins absent in human. Further, based on the analysis of protein interaction networks, we selected one-hundred and forty proteins that were predicted as potential M. tuberculosis drug targets and prioritized according to the ranking of 'clusters' of interacting proteins. Comparison of the predicted 140 TB targets with literature indicated that 46 of them were previously reported, thereby increasing the confidence in our predictions of the remaining 94 targets too. The analyses of the structures and functions corresponding to the predicted potential TB drug targets indicated a diverse range of proteins that included ten 'druggable' targets with some of the known drugs.
Collapse
Affiliation(s)
- Settu Sridhar
- Centre for Cellular and Molecular Biology, Uppal Road, Hyderabad 500007, India.
| | - Pallabini Dash
- Centre for Cellular and Molecular Biology, Uppal Road, Hyderabad 500007, India.
| | - Kunchur Guruprasad
- Centre for Cellular and Molecular Biology, Uppal Road, Hyderabad 500007, India.
| |
Collapse
|
13
|
Singh HN, Rajeswari MR. Identification of genes containing expanded purine repeats in the human genome and their apparent protective role against cancer. J Biomol Struct Dyn 2015; 34:689-704. [PMID: 25990537 DOI: 10.1080/07391102.2015.1049553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Purine repeat sequences present in a gene are unique as they have high propensity to form unusual DNA-triple helix structures. Friedreich's ataxia is the only human disease that is well known to be associated with DNA-triplexes formed by purine repeats. The purpose of this study was to recognize the expanded purine repeats (EPRs) in human genome and find their correlation with cancer pathogenesis. We developed "PuRepeatFinder.pl" algorithm to identify non-overlapping EPRs without pyrimidine interruptions in the human genome and customized for searching repeat lengths, n ≥ 200. A total of 1158 EPRs were identified in the genome which followed Wakeby distribution. Two hundred and ninety-six EPRs were found in geneic regions of 282 genes (EPR-genes). Gene clustering of EPR-genes was done based on their cellular function and a large number of EPR-genes were found to be enzymes/enzyme modulators. Meta-analysis of 282 EPR-genes identified only 63 EPR-genes in association with cancer, mostly in breast, lung, and blood cancers. Protein-protein interaction network analysis of all 282 EPR-genes identified proteins including those in cadherins and VEGF. The two observations, that EPRs can induce mutations under malignant conditions and that identification of some EPR-gene products in vital cell signaling-mediated pathways, together suggest the crucial role of EPRs in carcinogenesis. The new link between EPR-genes and their functionally interacting proteins throws a new dimension in the present understanding of cancer pathogenesis and can help in planning therapeutic strategies. Validation of present results using techniques like NGS is required to establish the role of the EPR genes in cancer pathology.
Collapse
Affiliation(s)
- Himanshu Narayan Singh
- a Department of Biochemistry , All India Institute of Medical Sciences , Room No: 3005A, New Delhi 110029 , India
| | - Moganty R Rajeswari
- a Department of Biochemistry , All India Institute of Medical Sciences , Room No: 3005A, New Delhi 110029 , India
| |
Collapse
|
14
|
Huo T, Liu W, Guo Y, Yang C, Lin J, Rao Z. Prediction of host - pathogen protein interactions between Mycobacterium tuberculosis and Homo sapiens using sequence motifs. BMC Bioinformatics 2015; 16:100. [PMID: 25887594 PMCID: PMC4456996 DOI: 10.1186/s12859-015-0535-y] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2014] [Accepted: 03/13/2015] [Indexed: 12/28/2022] Open
Abstract
Background Emergence of multiple drug resistant strains of M. tuberculosis (MDR-TB) threatens to derail global efforts aimed at reigning in the pathogen. Co-infections of M. tuberculosis with HIV are difficult to treat. To counter these new challenges, it is essential to study the interactions between M. tuberculosis and the host to learn how these bacteria cause disease. Results We report a systematic flow to predict the host pathogen interactions (HPIs) between M. tuberculosis and Homo sapiens based on sequence motifs. First, protein sequences were used as initial input for identifying the HPIs by ‘interolog’ method. HPIs were further filtered by prediction of domain-domain interactions (DDIs). Functional annotations of protein and publicly available experimental results were applied to filter the remaining HPIs. Using such a strategy, 118 pairs of HPIs were identified, which involve 43 proteins from M. tuberculosis and 48 proteins from Homo sapiens. A biological interaction network between M. tuberculosis and Homo sapiens was then constructed using the predicted inter- and intra-species interactions based on the 118 pairs of HPIs. Finally, a web accessible database named PATH (Protein interactions of M. tuberculosis and Human) was constructed to store these predicted interactions and proteins. Conclusions This interaction network will facilitate the research on host-pathogen protein-protein interactions, and may throw light on how M. tuberculosis interacts with its host. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0535-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Tong Huo
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300071, China. .,College of Life Sciences, Nankai University, Tianjin, 300071, China. .,Tianjin International Joint Academy of Biotechnology and Medicine, Tianjin, 300457, China.
| | - Wei Liu
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300071, China. .,College of Life Sciences, Nankai University, Tianjin, 300071, China. .,Tianjin International Joint Academy of Biotechnology and Medicine, Tianjin, 300457, China.
| | - Yu Guo
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300071, China. .,College of Pharmacy, Nankai University, Tianjin, 300071, China. .,Tianjin International Joint Academy of Biotechnology and Medicine, Tianjin, 300457, China.
| | - Cheng Yang
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300071, China. .,College of Pharmacy, Nankai University, Tianjin, 300071, China. .,Tianjin International Joint Academy of Biotechnology and Medicine, Tianjin, 300457, China.
| | - Jianping Lin
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300071, China. .,College of Pharmacy, Nankai University, Tianjin, 300071, China. .,Tianjin International Joint Academy of Biotechnology and Medicine, Tianjin, 300457, China.
| | - Zihe Rao
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, 300071, China. .,College of Life Sciences, Nankai University, Tianjin, 300071, China. .,Tianjin International Joint Academy of Biotechnology and Medicine, Tianjin, 300457, China.
| |
Collapse
|
15
|
Systems Approaches to Study Infectious Diseases. SYSTEMS AND SYNTHETIC BIOLOGY 2015. [DOI: 10.1007/978-94-017-9514-2_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
16
|
Schubert OT, Aebersold R. Microbial Proteome Profiling and Systems Biology: Applications to Mycobacterium tuberculosis. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2015; 883:235-54. [PMID: 26621471 DOI: 10.1007/978-3-319-23603-2_13] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Each year, 1.3 million people die from tuberculosis, an infectious disease caused by Mycobacterium tuberculosis. Systems biology-based strategies might significantly contribute to the knowledge-guided development of more effective vaccines and drugs to prevent and cure infectious diseases. To build models simulating the behaviour of a system in response to internal or external stimuli and to identify potential targets for therapeutic intervention, systems biology approaches require the acquisition of quantitative molecular profiles on many perturbed states. Here we review the current state of proteomic analyses in Mycobacterium tuberculosis and discuss the potential of recently emerging targeting mass spectrometry-based techniques which enable fast, sensitive and accurate protein measurements.
Collapse
Affiliation(s)
- Olga T Schubert
- Institute of Molecular Systems Biology, ETH Zurich, Zurich, CH-8093, Switzerland.
- Systems Biology Graduate School, Zurich, CH-8057, Switzerland.
| | - Ruedi Aebersold
- Institute of Molecular Systems Biology, ETH Zurich, Zurich, CH-8093, Switzerland.
- Faculty of Science, University of Zurich, Zurich, CH-8057, Switzerland.
| |
Collapse
|
17
|
Ramakrishnan G, Ochoa-Montaño B, Raghavender US, Mudgal R, Joshi AG, Chandra NR, Sowdhamini R, Blundell TL, Srinivasan N. Enriching the annotation of Mycobacterium tuberculosis H37Rv proteome using remote homology detection approaches: insights into structure and function. Tuberculosis (Edinb) 2014; 95:14-25. [PMID: 25467293 DOI: 10.1016/j.tube.2014.10.009] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2014] [Revised: 10/14/2014] [Accepted: 10/27/2014] [Indexed: 12/01/2022]
Abstract
The availability of the genome sequence of Mycobacterium tuberculosis H37Rv has encouraged determination of large numbers of protein structures and detailed definition of the biological information encoded therein; yet, the functions of many proteins in M. tuberculosis remain unknown. The emergence of multidrug resistant strains makes it a priority to exploit recent advances in homology recognition and structure prediction to re-analyse its gene products. Here we report the structural and functional characterization of gene products encoded in the M. tuberculosis genome, with the help of sensitive profile-based remote homology search and fold recognition algorithms resulting in an enhanced annotation of the proteome where 95% of the M. tuberculosis proteins were identified wholly or partly with information on structure or function. New information includes association of 244 proteins with 205 domain families and a separate set of new association of folds to 64 proteins. Extending structural information across uncharacterized protein families represented in the M. tuberculosis proteome, by determining superfamily relationships between families of known and unknown structures, has contributed to an enhancement in the knowledge of structural content. In retrospect, such superfamily relationships have facilitated recognition of probable structure and/or function for several uncharacterized protein families, eventually aiding recognition of probable functions for homologous proteins corresponding to such families. Gene products unique to mycobacteria for which no functions could be identified are 183. Of these 18 were determined to be M. tuberculosis specific. Such pathogen-specific proteins are speculated to harbour virulence factors required for pathogenesis. A re-annotated proteome of M. tuberculosis, with greater completeness of annotated proteins and domain assigned regions, provides a valuable basis for experimental endeavours designed to obtain a better understanding of pathogenesis and to accelerate the process of drug target discovery.
Collapse
Affiliation(s)
- Gayatri Ramakrishnan
- Indian Institute of Science Mathematics Initiative, Indian Institute of Science, Bangalore 560012, India; Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India.
| | | | - Upadhyayula S Raghavender
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Gandhi Krishi Vignyan Kendra Campus, Bangalore 560065, India.
| | - Richa Mudgal
- Indian Institute of Science Mathematics Initiative, Indian Institute of Science, Bangalore 560012, India; Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India.
| | - Adwait G Joshi
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Gandhi Krishi Vignyan Kendra Campus, Bangalore 560065, India; Manipal University, Manipal, Karnataka 576104, India.
| | - Nagasuma R Chandra
- Department of Biochemistry, Indian Institute of Science, Bangalore 560012, India.
| | - Ramanathan Sowdhamini
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Gandhi Krishi Vignyan Kendra Campus, Bangalore 560065, India.
| | - Tom L Blundell
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK.
| | | |
Collapse
|
18
|
Cui T, He ZG. Improved understanding of pathogenesis from protein interactions inMycobacteriumtuberculosis. Expert Rev Proteomics 2014; 11:745-55. [DOI: 10.1586/14789450.2014.971762] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
19
|
Vashisht R, Bhat AG, Kushwaha S, Bhardwaj A, Consortium OSDD, Brahmachari SK. Systems level mapping of metabolic complexity in Mycobacterium tuberculosis to identify high-value drug targets. J Transl Med 2014; 12:263. [PMID: 25304862 PMCID: PMC4201925 DOI: 10.1186/s12967-014-0263-5] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2014] [Accepted: 09/11/2014] [Indexed: 01/17/2023] Open
Abstract
BACKGROUND The effectiveness of current therapeutic regimens for Mycobacterium tuberculosis (Mtb) is diminished by the need for prolonged therapy and the rise of drug resistant/tolerant strains. This global health threat, despite decades of basic research and a wealth of legacy knowledge, is due to a lack of systems level understanding that can innovate the process of fast acting and high efficacy drug discovery. METHODS The enhanced functional annotations of the Mtb genome, which were previously obtained through a crowd sourcing approach was used to reconstruct the metabolic network of Mtb in a bottom up manner. We represent this information by developing a novel Systems Biology Spindle Map of Metabolism (SBSM) and comprehend its static and dynamic structure using various computational approaches based on simulation and design. RESULTS The reconstructed metabolism of Mtb encompasses 961 metabolites, involved in 1152 reactions catalyzed by 890 protein coding genes, organized into 50 pathways. By accounting for static and dynamic analysis of SBSM in Mtb we identified various critical proteins required for the growth and survival of bacteria. Further, we assessed the potential of these proteins as putative drug targets that are fast acting and less toxic. Further, we formulate a novel concept of metabolic persister genes (MPGs) and compared our predictions with published in vitro and in vivo experimental evidence. Through such analyses, we report for the first time that de novo biosynthesis of NAD may give rise to bacterial persistence in Mtb under conditions of metabolic stress induced by conventional anti-tuberculosis therapy. We propose such MPG's as potential combination of drug targets for existing antibiotics that can improve their efficacy and efficiency for drug tolerant bacteria. CONCLUSION The systems level framework formulated by us to identify potential non-toxic drug targets and strategies to circumvent the issue of bacterial persistence can substantially aid in the process of TB drug discovery and translational research.
Collapse
Affiliation(s)
- Rohit Vashisht
- />CSIR-Open Source Drug Discovery Unit, New Delhi, India
- />Academy of Scientific and Innovative Research, New Delhi, India
| | - Ashwini G Bhat
- />CSIR-Open Source Drug Discovery Unit, New Delhi, India
| | | | - Anshu Bhardwaj
- />CSIR-Open Source Drug Discovery Unit, New Delhi, India
| | - OSDD Consortium
- />CSIR-Open Source Drug Discovery Unit, New Delhi, India
- />Academy of Scientific and Innovative Research, New Delhi, India
- />CSIR - Institute of Genomics and Integrative Biology, New Delhi, India
| | - Samir K Brahmachari
- />CSIR-Open Source Drug Discovery Unit, New Delhi, India
- />Academy of Scientific and Innovative Research, New Delhi, India
- />CSIR - Institute of Genomics and Integrative Biology, New Delhi, India
| |
Collapse
|
20
|
Dekkers OM, Mummery CL, Rabelink TJ. A case for crowd sourcing in stem cell research. Stem Cells Transl Med 2014; 3:1259-61. [PMID: 25232185 DOI: 10.5966/sctm.2014-0125] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Thousands of patients and placebo-treated controls have been included in many clinical trials of stem cell therapy over the last decade or so, but often the study groups have been small. Their scientific value may therefore be limited and their ethical justification questionable. Would "crowd sourcing" for data sharing be a means of increasing the collective value of clinical trials? Here, we make a case for open access of all data emerging from stem cell studies (trials but also observational studies) independent of whether they are investigator-initiated or commercially driven.
Collapse
Affiliation(s)
| | - Christine L Mummery
- Anatomy and Developmental Biology, Leiden University Medical Center, Leiden, The Netherlands
| | | |
Collapse
|
21
|
Clark AM, Sarker M, Ekins S. New target prediction and visualization tools incorporating open source molecular fingerprints for TB Mobile 2.0. J Cheminform 2014; 6:38. [PMID: 25302078 PMCID: PMC4190048 DOI: 10.1186/s13321-014-0038-2] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2014] [Accepted: 06/30/2014] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND We recently developed a freely available mobile app (TB Mobile) for both iOS and Android platforms that displays Mycobacterium tuberculosis (Mtb) active molecule structures and their targets with links to associated data. The app was developed to make target information available to as large an audience as possible. RESULTS We now report a major update of the iOS version of the app. This includes enhancements that use an implementation of ECFP_6 fingerprints that we have made open source. Using these fingerprints, the user can propose compounds with possible anti-TB activity, and view the compounds within a cluster landscape. Proposed compounds can also be compared to existing target data, using a näive Bayesian scoring system to rank probable targets. We have curated an additional 60 new compounds and their targets for Mtb and added these to the original set of 745 compounds. We have also curated 20 further compounds (many without targets in TB Mobile) to evaluate this version of the app with 805 compounds and associated targets. CONCLUSIONS TB Mobile can now manage a small collection of compounds that can be imported from external sources, or exported by various means such as email or app-to-app inter-process communication. This means that TB Mobile can be used as a node within a growing ecosystem of mobile apps for cheminformatics. It can also cluster compounds and use internal algorithms to help identify potential targets based on molecular similarity. TB Mobile represents a valuable dataset, data-visualization aid and target prediction tool.
Collapse
Affiliation(s)
- Alex M Clark
- Molecular Materials Informatics, 1900 St. Jacques #302, Montreal H3J 2S1, Quebec, Canada
| | - Malabika Sarker
- SRI International, 333 Ravenswood Avenue, Menlo Park 94025, CA, USA
| | - Sean Ekins
- Collaborative Drug Discovery, 1633 Bayshore Highway, Suite 342, Burlingame 94010, CA, USA
- Collaborations in Chemistry, 5616 Hilltop Needmore Road, Fuquay-Varina 27526, NC, USA
| |
Collapse
|
22
|
Ghosh S, Baloni P, Mukherjee S, Anand P, Chandra N. A multi-level multi-scale approach to study essential genes in Mycobacterium tuberculosis. BMC SYSTEMS BIOLOGY 2013; 7:132. [PMID: 24308365 PMCID: PMC4234997 DOI: 10.1186/1752-0509-7-132] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/25/2013] [Accepted: 11/20/2013] [Indexed: 11/10/2022]
Abstract
Background The set of indispensable genes that are required by an organism to grow and sustain life are termed as essential genes. There is a strong interest in identification of the set of essential genes, particularly in pathogens, not only for a better understanding of the pathogen biology, but also for identifying drug targets and the minimal gene set for the organism. Essentiality is inherently a systems property and requires consideration of the system as a whole for their identification. The available experimental approaches capture some aspects but each method comes with its own limitations. Moreover, they do not explain the basis for essentiality in most cases. A powerful prediction method to recognize this gene pool including rationalization of the known essential genes in a given organism would be very useful. Here we describe a multi-level multi-scale approach to identify the essential gene pool in a deadly pathogen, Mycobacterium tuberculosis. Results The multi-level workflow analyses the bacterial cell by studying (a) genome-wide gene expression profiles to identify the set of genes which show consistent and significant levels of expression in multiple samples of the same condition, (b) indispensability for growth by using gene expression integrated flux balance analysis of a genome-scale metabolic model, (c) importance for maintaining the integrity and flow in a protein-protein interaction network and (d) evolutionary conservation in a set of genomes of the same ecological niche. In the gene pool identified, the functional basis for essentiality has been addressed by studying residue level conservation and the sub-structure at the ligand binding pockets, from which essential amino acid residues in that pocket have also been identified. 283 genes were identified as essential genes with high-confidence. An agreement of about 73.5% is observed with that obtained from the experimental transposon mutagenesis technique. A large proportion of the identified genes belong to the class of intermediary metabolism and respiration. Conclusions The multi-scale, multi-level approach described can be generally applied to other pathogens as well. The essential gene pool identified form a basis for designing experiments to probe their finer functional roles and also serve as a ready shortlist for identifying drug targets.
Collapse
Affiliation(s)
| | | | | | | | - Nagasuma Chandra
- Department of Biochemistry, Indian Institute of Science, Bangalore, India.
| |
Collapse
|
23
|
Masum H, Rao A, Good BM, Todd MH, Edwards AM, Chan L, Bunin BA, Su AI, Thomas Z, Bourne PE. Ten simple rules for cultivating open science and collaborative R&D. PLoS Comput Biol 2013; 9:e1003244. [PMID: 24086123 PMCID: PMC3784487 DOI: 10.1371/journal.pcbi.1003244] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Affiliation(s)
- Hassan Masum
- Waterloo Institute for Complexity and Innovation, Waterloo, Ontario, Canada
- * E-mail:
| | - Aarthi Rao
- Results for Development Institute, Washington, D.C., United States of America
| | - Benjamin M. Good
- Department of Molecular and Experimental Medicine, Scripps Research Institute, La Jolla, California, United States of America
| | - Matthew H. Todd
- School of Chemistry, University of Sydney, Sydney, New South Wales, Australia
| | - Aled M. Edwards
- Structural Genomics Consortium, University of Toronto, Toronto, Ontario, Canada
| | - Leslie Chan
- Department of Social Sciences, University of Toronto Scarborough, Toronto, Ontario, Canada
| | - Barry A. Bunin
- Collaborative Drug Discovery, Burlingame, California, United States of America
| | - Andrew I. Su
- Department of Molecular and Experimental Medicine, Scripps Research Institute, La Jolla, California, United States of America
| | - Zakir Thomas
- Council of Scientific and Industrial Research, New Delhi, India
| | - Philip E. Bourne
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California, United States of America
| |
Collapse
|
24
|
Chung BKS, Dick T, Lee DY. In silico analyses for the discovery of tuberculosis drug targets. J Antimicrob Chemother 2013; 68:2701-9. [DOI: 10.1093/jac/dkt273] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
25
|
Csermely P, Korcsmáros T, Kiss HJM, London G, Nussinov R. Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review. Pharmacol Ther 2013; 138:333-408. [PMID: 23384594 PMCID: PMC3647006 DOI: 10.1016/j.pharmthera.2013.01.016] [Citation(s) in RCA: 506] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Accepted: 01/22/2013] [Indexed: 02/02/2023]
Abstract
Despite considerable progress in genome- and proteome-based high-throughput screening methods and in rational drug design, the increase in approved drugs in the past decade did not match the increase of drug development costs. Network description and analysis not only give a systems-level understanding of drug action and disease complexity, but can also help to improve the efficiency of drug design. We give a comprehensive assessment of the analytical tools of network topology and dynamics. The state-of-the-art use of chemical similarity, protein structure, protein-protein interaction, signaling, genetic interaction and metabolic networks in the discovery of drug targets is summarized. We propose that network targeting follows two basic strategies. The "central hit strategy" selectively targets central nodes/edges of the flexible networks of infectious agents or cancer cells to kill them. The "network influence strategy" works against other diseases, where an efficient reconfiguration of rigid networks needs to be achieved by targeting the neighbors of central nodes/edges. It is shown how network techniques can help in the identification of single-target, edgetic, multi-target and allo-network drug target candidates. We review the recent boom in network methods helping hit identification, lead selection optimizing drug efficacy, as well as minimizing side-effects and drug toxicity. Successful network-based drug development strategies are shown through the examples of infections, cancer, metabolic diseases, neurodegenerative diseases and aging. Summarizing >1200 references we suggest an optimized protocol of network-aided drug development, and provide a list of systems-level hallmarks of drug quality. Finally, we highlight network-related drug development trends helping to achieve these hallmarks by a cohesive, global approach.
Collapse
Affiliation(s)
- Peter Csermely
- Department of Medical Chemistry, Semmelweis University, P.O. Box 260, H-1444 Budapest 8, Hungary.
| | | | | | | | | |
Collapse
|
26
|
Vashisht R, Bhardwaj A, Osdd Consortium, Brahmachari SK. Social networks to biological networks: systems biology of Mycobacterium tuberculosis. MOLECULAR BIOSYSTEMS 2013; 9:1584-93. [PMID: 23629487 DOI: 10.1039/c3mb25546h] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Contextualizing relevant information to construct a network that represents a given biological process presents a fundamental challenge in the network science of biology. The quality of network for the organism of interest is critically dependent on the extent of functional annotation of its genome. Mostly the automated annotation pipelines do not account for unstructured information present in volumes of literature and hence large fraction of genome remains poorly annotated. However, if used, this information could substantially enhance the functional annotation of a genome, aiding the development of a more comprehensive network. Mining unstructured information buried in volumes of literature often requires manual intervention to a great extent and thus becomes a bottleneck for most of the automated pipelines. In this review, we discuss the potential of scientific social networking as a solution for systematic manual mining of data. Focusing on Mycobacterium tuberculosis, as a case study, we discuss our open innovative approach for the functional annotation of its genome. Furthermore, we highlight the strength of such collated structured data in the context of drug target prediction based on systems level analysis of pathogen.
Collapse
|
27
|
Murthy D, Attri KS, Gokhale RS. Network, nodes and nexus: systems approach to multitarget therapeutics. Curr Opin Biotechnol 2013; 24:1129-36. [PMID: 23453398 DOI: 10.1016/j.copbio.2013.02.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2012] [Revised: 01/17/2013] [Accepted: 02/09/2013] [Indexed: 11/28/2022]
Abstract
Systems biology is revealing multiple layers of regulatory networks that manifest spatiotemporal variations. Since genes and environment also influence the emergent property of a cell, the biological output requires dynamic understanding of various molecular circuitries. The metabolic networks continually adapt and evolve to cope with the changing milieu of the system, which could also include infection by another organism. Such perturbations of the functional networks can result in disease phenotypes, for instance tuberculosis and cancer. In order to develop effective therapeutics, it is important to determine the disease progression profiles of complex disorders that can reveal dynamic aspects and to develop mutitarget systemic therapies that can help overcome pathway adaptations and redundancy.
Collapse
Affiliation(s)
- Divya Murthy
- CSIR-Institute of Genomics and Integrative Biology, Mall Road, Delhi, India
| | | | | |
Collapse
|
28
|
Slayden RA, Jackson M, Zucker J, Ramirez MV, Dawson CC, Crew R, Sampson NS, Thomas ST, Jamshidi N, Sisk P, Caspi R, Crick DC, McNeil MR, Pavelka MS, Niederweis M, Siroy A, Dona V, McFadden J, Boshoff H, Lew JM. Updating and curating metabolic pathways of TB. Tuberculosis (Edinb) 2013; 93:47-59. [PMID: 23375378 DOI: 10.1016/j.tube.2012.11.001] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2012] [Accepted: 11/25/2012] [Indexed: 01/08/2023]
Abstract
The sequencing of complete genomes has accelerated biomedical research by providing information about the overall coding capacity of bacterial chromosomes. The original TB annotation resulted in putative functional assignment of ∼60% of the genes to specific metabolic functions, however, the other 40% of the encoded ORFs where annotated as conserved hypothetical proteins, hypothetical proteins or encoding proteins of unknown function. The TB research community is now at the beginning of the next phases of post-genomics; namely reannotation and functional characterization by targeted experimentation. Arguably, this is the most significant time for basic microbiology in recent history. To foster basic TB research, the Tuberculosis Community Annotation Project (TBCAP) jamboree exercise began the reannotation effort by providing additional information for previous annotations, and refining and substantiating the functional assignment of ORFs and genes within metabolic pathways. The overall goal of the TBCAP 2012 exercise was to gather and compile various data types and use this information with oversight from the scientific community to provide additional information to support the functional annotations of encoding genes. Another objective of this effort was to standardize the publicly accessible Mycobacterium tuberculosis reference sequence and its annotation. The greatest benefit of functional annotation information of genome sequence is that it fuels TB research for drug discovery, diagnostics, vaccine development and epidemiology.
Collapse
|
29
|
Groza T, Tudorache T, Dumontier M. State of the art and open challenges in community-driven knowledge curation. J Biomed Inform 2012; 46:1-4. [PMID: 23219718 DOI: 10.1016/j.jbi.2012.11.007] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2012] [Revised: 11/17/2012] [Accepted: 11/17/2012] [Indexed: 11/19/2022]
Affiliation(s)
- Tudor Groza
- School of ITEE, The University of Queensland, Queensland, Australia.
| | | | | |
Collapse
|
30
|
Brahmachari SK. Introducing the medical bioinformatics in Journal of Translational Medicine. J Transl Med 2012; 10:202. [PMID: 23013487 PMCID: PMC3533924 DOI: 10.1186/1479-5876-10-202] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2012] [Accepted: 09/11/2012] [Indexed: 11/10/2022] Open
Abstract
The explosion of genome sequencing data along with genotype to phenotype correlation studies has created data deluge in the area of biomedical sciences. The aim of the Medical bioinformatics section is to aid the development and maturation of the field by providing a platform for the translation of these datasets into useful clinical applications. The increase in computing capabilities and availability of different data from advanced technologies will allow researchers to build System Biology models of various diseases in order to efficiently develop new therapeutic interventions and reduce the current prohibitively large costs of drug discovery. The section welcomes studies on the development of Biomedical Informatics for translational medicine and clinical applications, including tools, methodologies and data integration.
Collapse
|