Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Chen G, Cairelli MJ, Kilicoglu H, Shin D, Rindflesch TC. Augmenting microarray data with literature-based knowledge to enhance gene regulatory network inference. PLoS Comput Biol 2014;10:e1003666. [PMID: 24921649 PMCID: PMC4055569 DOI: 10.1371/journal.pcbi.1003666] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2013] [Accepted: 04/29/2014] [Indexed: 12/28/2022] Open

For:	Chen G, Cairelli MJ, Kilicoglu H, Shin D, Rindflesch TC. Augmenting microarray data with literature-based knowledge to enhance gene regulatory network inference. PLoS Comput Biol 2014;10:e1003666. [PMID: 24921649 PMCID: PMC4055569 DOI: 10.1371/journal.pcbi.1003666] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2013] [Accepted: 04/29/2014] [Indexed: 12/28/2022] Open

Number

Cited by Other Article(s)

Altay G, Zapardiel-Gonzalo J, Peters B. RNA-seq preprocessing and sample size considerations for gene network inference. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.02.522518. [PMID: 36711979 PMCID: PMC9881880 DOI: 10.1101/2023.01.02.522518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]

Abstract

Background

Gene network inference (GNI) methods have the potential to reveal functional relationships between different genes and their products. Most GNI algorithms have been developed for microarray gene expression datasets and their application to RNA-seq data is relatively recent. As the characteristics of RNA-seq data are different from microarray data, it is an unanswered question what preprocessing methods for RNA-seq data should be applied prior to GNI to attain optimal performance, or what the required sample size for RNA-seq data is to obtain reliable GNI estimates.

Results

We ran 9144 analysis of 7 different RNA-seq datasets to evaluate 300 different preprocessing combinations that include data transformations, normalizations and association estimators. We found that there was no single best performing preprocessing combination but that there were several good ones. The performance varied widely over various datasets, which emphasized the importance of choosing an appropriate preprocessing configuration before GNI. Two preprocessing combinations appeared promising in general: First, Log-2 TPM (transcript per million) with Variance-stabilizing transformation (VST) and Pearson Correlation Coefficient (PCC) association estimator. Second, raw RNA-seq count data with PCC. Along with these two, we also identified 18 other good preprocessing combinations. Any of these algorithms might perform best in different datasets. Therefore, the GNI performances of these approaches should be measured on any new dataset to select the best performing one for it. In terms of the required biological sample size of RNA-seq data, we found that between 30 to 85 samples were required to generate reliable GNI estimates.

Conclusions

This study provides practical recommendations on default choices for data preprocessing prior to GNI analysis of RNA-seq data to obtain optimal performance results.

Collapse

Basu A, Sarkar A, Bandyopadhyay S, Maulik U. In silico strategies to identify protein-protein interaction modulator in cell-to-cell transmission of SARS CoV2. Transbound Emerg Dis 2022;69:3896-3905. [PMID: 36379049 DOI: 10.1111/tbed.14760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 07/08/2022] [Accepted: 09/15/2022] [Indexed: 11/16/2022]

Sharma PP, Bansal M, Sethi A, Poonam, Pena L, Goel VK, Grishina M, Chaturvedi S, Kumar D, Rathi B. Computational methods directed towards drug repurposing for COVID-19: advantages and limitations. RSC Adv 2021;11:36181-36198. [PMID: 35492747 PMCID: PMC9043418 DOI: 10.1039/d1ra05320e] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Accepted: 10/07/2021] [Indexed: 12/19/2022] Open

Zhang R, Hristovski D, Schutte D, Kastrin A, Fiszman M, Kilicoglu H. Drug repurposing for COVID-19 via knowledge graph completion. J Biomed Inform 2021;115:103696. [PMID: 33571675 PMCID: PMC7869625 DOI: 10.1016/j.jbi.2021.103696] [Citation(s) in RCA: 53] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Revised: 12/23/2020] [Accepted: 02/01/2021] [Indexed: 02/07/2023]

Abstract

OBJECTIVE

To discover candidate drugs to repurpose for COVID-19 using literature-derived knowledge and knowledge graph completion methods.

METHODS

We propose a novel, integrative, and neural network-based literature-based discovery (LBD) approach to identify drug candidates from PubMed and other COVID-19-focused research literature. Our approach relies on semantic triples extracted using SemRep (via SemMedDB). We identified an informative and accurate subset of semantic triples using filtering rules and an accuracy classifier developed on a BERT variant. We used this subset to construct a knowledge graph, and applied five state-of-the-art, neural knowledge graph completion algorithms (i.e., TransE, RotatE, DistMult, ComplEx, and STELP) to predict drug repurposing candidates. The models were trained and assessed using a time slicing approach and the predicted drugs were compared with a list of drugs reported in the literature and evaluated in clinical trials. These models were complemented by a discovery pattern-based approach.

RESULTS

Accuracy classifier based on PubMedBERT achieved the best performance (F1 = 0.854) in identifying accurate semantic predications. Among five knowledge graph completion models, TransE outperformed others (MR = 0.923, Hits@1 = 0.417). Some known drugs linked to COVID-19 in the literature were identified, as well as others that have not yet been studied. Discovery patterns enabled identification of additional candidate drugs and generation of plausible hypotheses regarding the links between the candidate drugs and COVID-19. Among them, five highly ranked and novel drugs (i.e., paclitaxel, SB 203580, alpha 2-antiplasmin, metoclopramide, and oxymatrine) and the mechanistic explanations for their potential use are further discussed.

CONCLUSION

We showed that a LBD approach can be feasible not only for discovering drug candidates for COVID-19, but also for generating mechanistic explanations. Our approach can be generalized to other diseases as well as to other clinical questions. Source code and data are available at https://github.com/kilicogluh/lbd-covid.

Collapse

Kilicoglu H, Rosemblat G, Fiszman M, Shin D. Broad-coverage biomedical relation extraction with SemRep. BMC Bioinformatics 2020;21:188. [PMID: 32410573 PMCID: PMC7222583 DOI: 10.1186/s12859-020-3517-7] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 04/29/2020] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

In the era of information overload, natural language processing (NLP) techniques are increasingly needed to support advanced biomedical information management and discovery applications. In this paper, we present an in-depth description of SemRep, an NLP system that extracts semantic relations from PubMed abstracts using linguistic principles and UMLS domain knowledge. We also evaluate SemRep on two datasets. In one evaluation, we use a manually annotated test collection and perform a comprehensive error analysis. In another evaluation, we assess SemRep's performance on the CDR dataset, a standard benchmark corpus annotated with causal chemical-disease relationships.

RESULTS

A strict evaluation of SemRep on our manually annotated dataset yields 0.55 precision, 0.34 recall, and 0.42 F 1 score. A relaxed evaluation, which more accurately characterizes SemRep performance, yields 0.69 precision, 0.42 recall, and 0.52 F 1 score. An error analysis reveals named entity recognition/normalization as the largest source of errors (26.9%), followed by argument identification (14%) and trigger detection errors (12.5%). The evaluation on the CDR corpus yields 0.90 precision, 0.24 recall, and 0.38 F 1 score. The recall and the F 1 score increase to 0.35 and 0.50, respectively, when the evaluation on this corpus is limited to sentence-bound relationships, which represents a fairer evaluation, as SemRep operates at the sentence level.

CONCLUSIONS

SemRep is a broad-coverage, interpretable, strong baseline system for extracting semantic relations from biomedical text. It also underpins SemMedDB, a literature-scale knowledge graph based on semantic relations. Through SemMedDB, SemRep has had significant impact in the scientific community, supporting a variety of clinical and translational applications, including clinical decision making, medical diagnosis, drug repurposing, literature-based discovery and hypothesis generation, and contributing to improved health outcomes. In ongoing development, we are redesigning SemRep to increase its modularity and flexibility, and addressing weaknesses identified in the error analysis.

Collapse

de Campos LM, Cano A, Castellano JG, Moral S. Combining gene expression data and prior knowledge for inferring gene regulatory networks via Bayesian networks using structural restrictions. Stat Appl Genet Mol Biol 2019;18:sagmb-2018-0042. [PMID: 31042646 DOI: 10.1515/sagmb-2018-0042] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Kim YH, Song M. A context-based ABC model for literature-based discovery. PLoS One 2019;14:e0215313. [PMID: 31017923 PMCID: PMC6481912 DOI: 10.1371/journal.pone.0215313] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2018] [Accepted: 03/29/2019] [Indexed: 12/13/2022] Open

Abstract

Background

In the literature-based discovery, considerable research has been done based on the ABC model developed by Swanson. ABC model hypothesizes that there is a meaningful relation between entity A extracted from document set 1 and entity C extracted from document set 2 through B entities that appear commonly in both document sets. The results of ABC model are relations among entity A, B, and C, which is referred as paths. A path allows for hypothesizing the relationship between entity A and entity C, or helps discover entity B as a new evidence for the relationship between entity A and entity C. The co-occurrence based approach of ABC model is a well-known approach to automatic hypothesis generation by creating various paths. However, the co-occurrence based ABC model has a limitation, in that biological context is not considered. It focuses only on matching of B entity which commonly appears in relation between two entities. Therefore, the paths extracted by the co-occurrence based ABC model tend to include a lot of irrelevant paths, meaning that expert verification is essential.

Methods

In order to overcome this limitation of the co-occurrence based ABC model, we propose a context-based approach to connecting one entity relation to another, modifying the ABC model using biological contexts. In this study, we defined four biological context elements: cell, drug, disease, and organism. Based on these biological context, we propose two extended ABC models: a context-based ABC model and a context-assignment-based ABC model. In order to measure the performance of the both proposed models, we examined the relevance of the B entities between the well-known relations “APOE–MAPT” as well as “FUS–TARDBP”. Each relation means interaction between neurodegenerative disease associated with proteins. The interaction between APOE and MAPT is known to play a crucial role in Alzheimer’s disease as APOE affects tau-mediated neurodegeneration. It has been shown that mutation in FUS and TARDBP are associated with amyotrophic lateral sclerosis(ALS), a motor neuron disease by leading to neuronal cell death. Using these two relations, we compared both of proposed models to co-occurrence based ABC model.

Results

The precision of B entities by co-occurrence based ABC model was 27.1% for “APOE–MAPT” and 22.1% for “FUS–TARDBP”, respectively. In context-based ABC model, precision of extracted B entities was 71.4% for “APOE–MAPT”, and 77.9% for “FUS–TARDBP”. Context-assignment based ABC model achieved 89% and 97.5% precision for the two relations, respectively. Both proposed models achieved a higher precision than co-occurrence-based ABC model.

Collapse

Ko Y, Kim J, Rodriguez-Zas SL. Markov chain Monte Carlo simulation of a Bayesian mixture model for gene network inference. Genes Genomics 2019;41:547-555. [PMID: 30741379 DOI: 10.1007/s13258-019-00789-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Accepted: 01/21/2019] [Indexed: 12/31/2022]

Chen G, Jia Y, Zhu L, Li P, Zhang L, Tao C, Jim Zheng W. Gene fingerprint model for literature based detection of the associations among complex diseases: a case study of COPD. BMC Med Inform Decis Mak 2019;19:20. [PMID: 30700303 PMCID: PMC6354331 DOI: 10.1186/s12911-019-0738-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Chen G, Ramírez JC, Deng N, Qiu X, Wu C, Zheng WJ, Wu H. Restructured GEO: restructuring Gene Expression Omnibus metadata for genome dynamics analysis. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2019;2019:5289627. [PMID: 30649296 PMCID: PMC6333964 DOI: 10.1093/database/bay145] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/24/2018] [Accepted: 12/11/2018] [Indexed: 11/14/2022]

Kilicoglu H. Biomedical text mining for research rigor and integrity: tasks, challenges, directions. Brief Bioinform 2018;19:1400-1414. [PMID: 28633401 PMCID: PMC6291799 DOI: 10.1093/bib/bbx057] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2017] [Revised: 04/10/2017] [Indexed: 01/01/2023] Open

Roy S, Yun D, Madahian B, Berry MW, Deng LY, Goldowitz D, Homayouni R. Navigating the Functional Landscape of Transcription Factors via Non-Negative Tensor Factorization Analysis of MEDLINE Abstracts. Front Bioeng Biotechnol 2017;5:48. [PMID: 28894735 PMCID: PMC5581332 DOI: 10.3389/fbioe.2017.00048] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2017] [Accepted: 07/31/2017] [Indexed: 01/09/2023] Open

Ding YP, Ladeiro Y, Morilla I, Bouhnik Y, Marah A, Zaag H, Cazals-Hatem D, Seksik P, Daniel F, Hugot JP, Wainrib G, Tréton X, Ogier-Denis E. Integrative Network-based Analysis of Colonic Detoxification Gene Expression in Ulcerative Colitis According to Smoking Status. J Crohns Colitis 2017;11:474-484. [PMID: 27702825 DOI: 10.1093/ecco-jcc/jjw179] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Accepted: 10/03/2016] [Indexed: 02/08/2023]

Affiliation(s)

Yong-Ping Ding INSERM, Research Centre of Inflammation BP 416, Paris, France.,Université Paris-Diderot Sorbonne Paris-Cité, Paris, France.,Laboratory of Excellence Labex INFLAMEX, Sorbonne-Paris- Cité, Paris, France
Yannick Ladeiro INSERM, Research Centre of Inflammation BP 416, Paris, France.,Université Paris-Diderot Sorbonne Paris-Cité, Paris, France.,Laboratory of Excellence Labex INFLAMEX, Sorbonne-Paris- Cité, Paris, France
Ian Morilla INSERM, Research Centre of Inflammation BP 416, Paris, France.,Université Paris-Diderot Sorbonne Paris-Cité, Paris, France.,Laboratory of Excellence Labex INFLAMEX, Sorbonne-Paris- Cité, Paris, France.,Université Paris 13, Sorbonne Paris Cité, Villetaneuse, France
Yoram Bouhnik INSERM, Research Centre of Inflammation BP 416, Paris, France.,Université Paris-Diderot Sorbonne Paris-Cité, Paris, France.,Laboratory of Excellence Labex INFLAMEX, Sorbonne-Paris- Cité, Paris, France.,Assistance Publique Hôpitaux de Paris, Service de gastroentérologie, MICI et assistance nutritive, Hôpital Beaujon, Clichy la Garenne, France
Assiya Marah INSERM, Research Centre of Inflammation BP 416, Paris, France.,Université Paris-Diderot Sorbonne Paris-Cité, Paris, France.,Laboratory of Excellence Labex INFLAMEX, Sorbonne-Paris- Cité, Paris, France
Hatem Zaag Laboratory of Excellence Labex INFLAMEX, Sorbonne-Paris- Cité, Paris, France.,Université Paris 13, Sorbonne Paris Cité, Villetaneuse, France
Dominique Cazals-Hatem INSERM, Research Centre of Inflammation BP 416, Paris, France.,Université Paris-Diderot Sorbonne Paris-Cité, Paris, France.,Laboratory of Excellence Labex INFLAMEX, Sorbonne-Paris- Cité, Paris, France.,Assistance Publique Hôpitaux de Paris, Service d'anatomopathologie, Hôpital Beaujon, Clichy la Garenne, France
Philippe Seksik INSERM U1157, UMR 7203, F-7502, Paris, France.,Assistance Publique Hôpitaux de Paris, Hôpital Saint-Antoine, Paris, France
Fanny Daniel INSERM, Research Centre of Inflammation BP 416, Paris, France.,Université Paris-Diderot Sorbonne Paris-Cité, Paris, France.,Laboratory of Excellence Labex INFLAMEX, Sorbonne-Paris- Cité, Paris, France
Jean-Pierre Hugot INSERM, Research Centre of Inflammation BP 416, Paris, France.,Université Paris-Diderot Sorbonne Paris-Cité, Paris, France.,Laboratory of Excellence Labex INFLAMEX, Sorbonne-Paris- Cité, Paris, France.,Assistance Publique Hôpitaux de Paris, Hôpital Robert Debré, Paris, France
Gilles Wainrib Laboratory of Excellence Labex INFLAMEX, Sorbonne-Paris- Cité, Paris, France.,Département d'Informatique, Equipe DATA, Ecole Normale Supérieure, Paris, France
Xavier Tréton INSERM, Research Centre of Inflammation BP 416, Paris, France.,Université Paris-Diderot Sorbonne Paris-Cité, Paris, France.,Laboratory of Excellence Labex INFLAMEX, Sorbonne-Paris- Cité, Paris, France.,Assistance Publique Hôpitaux de Paris, Service de gastroentérologie, MICI et assistance nutritive, Hôpital Beaujon, Clichy la Garenne, France
Eric Ogier-Denis INSERM, Research Centre of Inflammation BP 416, Paris, France.,Université Paris-Diderot Sorbonne Paris-Cité, Paris, France.,Laboratory of Excellence Labex INFLAMEX, Sorbonne-Paris- Cité, Paris, France

Collapse

Supervised EEG Source Imaging with Graph Regularization in Transformed Domain. Brain Inform 2017. [DOI: 10.1007/978-3-319-70772-3_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open

Kim YH, Beak SH, Charidimou A, Song M. Discovering New Genes in the Pathways of Common Sporadic Neurodegenerative Diseases: A Bioinformatics Approach. J Alzheimers Dis 2016;51:293-312. [PMID: 26836166 DOI: 10.3233/jad-150769] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Yu C, Wang J. A Physical Mechanism and Global Quantification of Breast Cancer. PLoS One 2016;11:e0157422. [PMID: 27410227 PMCID: PMC4943646 DOI: 10.1371/journal.pone.0157422] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2016] [Accepted: 05/31/2016] [Indexed: 12/24/2022] Open

Abstract

Initiation and progression of cancer depend on many factors. Those on the genetic level are often considered crucial. To gain insight into the physical mechanisms of breast cancer, we construct a gene regulatory network (GRN) which reflects both genetic and environmental aspects of breast cancer. The construction of the GRN is based on available experimental data. Three basins of attraction, representing the normal, premalignant and cancer states respectively, were found on the phenotypic landscape. The progression of breast cancer can be seen as switching transitions between different state basins. We quantified the stabilities and kinetic paths of the three state basins to uncover the biological process of breast cancer formation. The gene expression levels at each state were obtained, which can be tested directly in experiments. Furthermore, by performing global sensitivity analysis on the landscape topography, six key genes (HER2, MDM2, TP53, BRCA1, ATM, CDK2) and four regulations (HER2⊣TP53, CDK2⊣BRCA1, ATM→MDM2, TP53→ATM) were identified as being critical for breast cancer. Interestingly, HER2 and MDM2 are the most popular targets for treating breast cancer. BRCA1 and TP53 are the most important oncogene of breast cancer and tumor suppressor gene, respectively. This further validates the feasibility of our model and the reliability of our prediction results. The regulation ATM→MDM2 has been extensive studied on DNA damage but not on breast cancer. We notice the importance of ATM→MDM2 on breast cancer. Previous studies of breast cancer have often focused on individual genes and the anti-cancer drugs are mainly used to target the individual genes. Our results show that the network-based strategy is more effective on treating breast cancer. The landscape approach serves as a new strategy for analyzing breast cancer on both the genetic and epigenetic levels and can help on designing network based medicine for breast cancer.

Collapse

Mayer G, Marcus K, Eisenacher M, Kohl M. Boolean modeling techniques for protein co-expression networks in systems medicine. Expert Rev Proteomics 2016;13:555-69. [PMID: 27105325 DOI: 10.1080/14789450.2016.1181546] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Dholaniya PS, Ghosh S, Surampudi BR, Kondapi AK. A knowledge driven supervised learning approach to identify gene network of differentially up-regulated genes during neuronal senescence in Rattus norvegicus. Biosystems 2015;135:9-14. [PMID: 26163927 DOI: 10.1016/j.biosystems.2015.07.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Revised: 05/18/2015] [Accepted: 07/06/2015] [Indexed: 12/22/2022]

Cairelli MJ, Fiszman M, Zhang H, Rindflesch TC. Networks of neuroinjury semantic predications to identify biomarkers for mild traumatic brain injury. J Biomed Semantics 2015;6:25. [PMID: 25992264 PMCID: PMC4436163 DOI: 10.1186/s13326-015-0022-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2014] [Accepted: 04/22/2015] [Indexed: 12/13/2022] Open

Abstract

Objective

Mild traumatic brain injury (mTBI) has high prevalence in the military, among athletes, and in the general population worldwide (largely due to falls). Consequences can include a range of neuropsychological disorders. Unfortunately, such neural injury often goes undiagnosed due to the difficulty in identifying symptoms, so the discovery of an effective biomarker would greatly assist diagnosis; however, no single biomarker has been identified. We identify several body substances as potential components of a panel of biomarkers to support the diagnosis of mild traumatic brain injury.

Methods

Our approach to diagnostic biomarker discovery combines ideas and techniques from systems medicine, natural language processing, and graph theory. We create a molecular interaction network that represents neural injury and is composed of relationships automatically extracted from the literature. We retrieve citations related to neurological injury and extract relationships (semantic predications) that contain potential biomarkers. After linking all relationships together to create a network representing neural injury, we filter the network by relationship frequency and concept connectivity to reduce the set to a manageable size of higher interest substances.

Results

99,437 relevant citations yielded 26,441 unique relations. 18,085 of these contained a potential biomarker as subject or object with a total of 6246 unique concepts. After filtering by graph metrics, the set was reduced to 1021 relationships with 49 unique concepts, including 17 potential biomarkers.

Conclusion

We created a network of relationships containing substances derived from 99,437 citations and filtered using graph metrics to provide a set of 17 potential biomarkers. We discuss the interaction of several of these (glutamate, glucose, and lactate) as the basis for more effective diagnosis than is currently possible. This method provides an opportunity to focus the effort of wet bench research on those substances with the highest potential as biomarkers for mTBI.

Collapse

Data Integration for Microarrays: Enhanced Inference for Gene Regulatory Networks. MICROARRAYS 2015;4:255-69. [PMID: 27600224 PMCID: PMC4996389 DOI: 10.3390/microarrays4020255] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/27/2015] [Accepted: 04/30/2015] [Indexed: 01/01/2023]

Manyam G, Birerdinc A, Baranova A. KPP: KEGG Pathway Painter. BMC SYSTEMS BIOLOGY 2015;9 Suppl 2:S3. [PMID: 25879163 PMCID: PMC4407080 DOI: 10.1186/1752-0509-9-s2-s3] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]

LGscore: A method to identify disease-related genes using biological literature and Google data. J Biomed Inform 2015;54:270-82. [DOI: 10.1016/j.jbi.2015.01.003] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2014] [Revised: 12/23/2014] [Accepted: 01/05/2015] [Indexed: 02/05/2023]

Gene Network Reconstruction by Integration of Prior Biological Knowledge. G3-GENES GENOMES GENETICS 2015;5:1075-9. [PMID: 25823587 PMCID: PMC4478538 DOI: 10.1534/g3.115.018127] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]