1
|
Henry S, Wijesinghe DS, Myers A, McInnes BT. Using Literature Based Discovery to Gain Insights Into the Metabolomic Processes of Cardiac Arrest. Front Res Metr Anal 2021; 6:644728. [PMID: 34250435 PMCID: PMC8267364 DOI: 10.3389/frma.2021.644728] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 05/07/2021] [Indexed: 12/19/2022] Open
Abstract
In this paper, we describe how we applied LBD techniques to discover lecithin cholesterol acyltransferase (LCAT) as a druggable target for cardiac arrest. We fully describe our process which includes the use of high-throughput metabolomic analysis to identify metabolites significantly related to cardiac arrest, and how we used LBD to gain insights into how these metabolites relate to cardiac arrest. These insights lead to our proposal (for the first time) of LCAT as a druggable target; the effects of which are supported by in vivo studies which were brought forth by this work. Metabolites are the end product of many biochemical pathways within the human body. Observed changes in metabolite levels are indicative of changes in these pathways, and provide valuable insights toward the cause, progression, and treatment of diseases. Following cardiac arrest, we observed changes in metabolite levels pre- and post-resuscitation. We used LBD to help discover diseases implicitly linked via these metabolites of interest. Results of LBD indicated a strong link between Fish Eye disease and cardiac arrest. Since fish eye disease is characterized by an LCAT deficiency, it began an investigation into the effects of LCAT and cardiac arrest survival. In the investigation, we found that decreased LCAT activity may increase cardiac arrest survival rates by increasing ω-3 polyunsaturated fatty acid availability in circulation. We verified the effects of ω-3 polyunsaturated fatty acids on increasing survival rate following cardiac arrest via in vivo with rat models.
Collapse
Affiliation(s)
- Sam Henry
- Department of Physics, Computer Science and Engineering, Christopher Newport University, Newport News, VA, United States
| | - D. Shanaka Wijesinghe
- Department of Pharmacotherapy and Outcomes Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Aidan Myers
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Bridget T. McInnes
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| |
Collapse
|
2
|
Current trends in cancer immunotherapy: a literature-mining analysis. Cancer Immunol Immunother 2020; 69:2425-2439. [PMID: 32556496 DOI: 10.1007/s00262-020-02630-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2020] [Accepted: 05/28/2020] [Indexed: 11/27/2022]
Abstract
Cancer immunotherapy is a rapidly growing field that is completely transforming oncology care. Mining this knowledge base for biomedically important information is becoming increasingly challenging, due to the expanding number of scientific publications, and the dynamic evolution of this subject with time. In this study, we have employed a literature-mining approach that was used to analyze the cancer immunotherapy-related publications listed in PubMed and quantify emerging trends. A total of 93,033 publications published in 5055 journals have been retrieved, and 141 meaningful topics have been identified, which were further classified into eight distinct categories. Statistical analysis indicates a mean annual increase in the number of published papers of approximately 8% in the last 20 years. The research topics that exhibited the highest trends included "immune checkpoint inhibitors," "tumor microenvironment," "HPV vaccination," "CAR T-cells," and "gene mutations/tumor profiling." The top identified cancer types included "lung," "colorectal," and "breast cancer," and a shift in popularity from hematological to solid tumors was observed. As regards clinical research, a transition from early phase clinical trials to randomized control trials was recorded, indicating that the field is entering a more advanced phase of development. Overall, this mining approach provided an unbiased analysis of the cancer immunotherapy literature in a time-conserving and scale-efficient manner.
Collapse
|
3
|
Thilakaratne M, Falkner K, Atapattu T. A systematic review on literature-based discovery workflow. PeerJ Comput Sci 2019; 5:e235. [PMID: 33816888 PMCID: PMC7924697 DOI: 10.7717/peerj-cs.235] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Accepted: 10/17/2019] [Indexed: 05/02/2023]
Abstract
As scientific publication rates increase, knowledge acquisition and the research development process have become more complex and time-consuming. Literature-Based Discovery (LBD), supporting automated knowledge discovery, helps facilitate this process by eliciting novel knowledge by analysing existing scientific literature. This systematic review provides a comprehensive overview of the LBD workflow by answering nine research questions related to the major components of the LBD workflow (i.e., input, process, output, and evaluation). With regards to the input component, we discuss the data types and data sources used in the literature. The process component presents filtering techniques, ranking/thresholding techniques, domains, generalisability levels, and resources. Subsequently, the output component focuses on the visualisation techniques used in LBD discipline. As for the evaluation component, we outline the evaluation techniques, their generalisability, and the quantitative measures used to validate results. To conclude, we summarise the findings of the review for each component by highlighting the possible future research directions.
Collapse
Affiliation(s)
- Menasha Thilakaratne
- Faculty of Engineering, Computer and Mathematical Sciences, The University of Adelaide, Adelaide, South Australia, Australia
| | - Katrina Falkner
- Faculty of Engineering, Computer and Mathematical Sciences, The University of Adelaide, Adelaide, South Australia, Australia
| | - Thushari Atapattu
- Faculty of Engineering, Computer and Mathematical Sciences, The University of Adelaide, Adelaide, South Australia, Australia
| |
Collapse
|
4
|
Alves T, Rodrigues R, Costa H, Rocha M. Development of an information retrieval tool for biomedical patents. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2018; 159:125-134. [PMID: 29650307 DOI: 10.1016/j.cmpb.2018.03.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2017] [Revised: 02/07/2018] [Accepted: 03/12/2018] [Indexed: 06/08/2023]
Abstract
BACKGROUND AND OBJECTIVE The volume of biomedical literature has been increasing in the last years. Patent documents have also followed this trend, being important sources of biomedical knowledge, technical details and curated data, which are put together along the granting process. The field of Biomedical text mining (BioTM) has been creating solutions for the problems posed by the unstructured nature of natural language, which makes the search of information a challenging task. Several BioTM techniques can be applied to patents. From those, Information Retrieval (IR) includes processes where relevant data are obtained from collections of documents. In this work, the main goal was to build a patent pipeline addressing IR tasks over patent repositories to make these documents amenable to BioTM tasks. METHODS The pipeline was developed within @Note2, an open-source computational framework for BioTM, adding a number of modules to the core libraries, including patent metadata and full text retrieval, PDF to text conversion and optical character recognition. Also, user interfaces were developed for the main operations materialized in a new @Note2 plug-in. RESULTS The integration of these tools in @Note2 opens opportunities to run BioTM tools over patent texts, including tasks from Information Extraction, such as Named Entity Recognition or Relation Extraction. We demonstrated the pipeline's main functions with a case study, using an available benchmark dataset from BioCreative challenges. Also, we show the use of the plug-in with a user query related to the production of vanillin. CONCLUSIONS This work makes available all the relevant content from patents to the scientific community, decreasing drastically the time required for this task, and provides graphical interfaces to ease the use of these tools.
Collapse
Affiliation(s)
- Tiago Alves
- Centre Biological Engineering, University of Minho, Braga 4710-057, Portugal; Silicolife Lda, Braga 4715-387, Portugal
| | - Rúben Rodrigues
- Centre Biological Engineering, University of Minho, Braga 4710-057, Portugal
| | - Hugo Costa
- Silicolife Lda, Braga 4715-387, Portugal
| | - Miguel Rocha
- Centre Biological Engineering, University of Minho, Braga 4710-057, Portugal.
| |
Collapse
|
5
|
Smart Cities: A Review and Analysis of Stakeholders’ Literature. BUSINESS & INFORMATION SYSTEMS ENGINEERING 2018. [DOI: 10.1007/s12599-018-0535-3] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
6
|
Raja K, Patrick M, Gao Y, Madu D, Yang Y, Tsoi LC. A Review of Recent Advancement in Integrating Omics Data with Literature Mining towards Biomedical Discoveries. Int J Genomics 2017; 2017:6213474. [PMID: 28331849 PMCID: PMC5346376 DOI: 10.1155/2017/6213474] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2016] [Accepted: 02/09/2017] [Indexed: 12/13/2022] Open
Abstract
In the past decade, the volume of "omics" data generated by the different high-throughput technologies has expanded exponentially. The managing, storing, and analyzing of this big data have been a great challenge for the researchers, especially when moving towards the goal of generating testable data-driven hypotheses, which has been the promise of the high-throughput experimental techniques. Different bioinformatics approaches have been developed to streamline the downstream analyzes by providing independent information to interpret and provide biological inference. Text mining (also known as literature mining) is one of the commonly used approaches for automated generation of biological knowledge from the huge number of published articles. In this review paper, we discuss the recent advancement in approaches that integrate results from omics data and information generated from text mining approaches to uncover novel biomedical information.
Collapse
Affiliation(s)
- Kalpana Raja
- Department of Dermatology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Matthew Patrick
- Department of Dermatology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Yilin Gao
- Department of Dermatology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Desmond Madu
- Department of Dermatology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Yuyang Yang
- Department of Dermatology, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Lam C. Tsoi
- Department of Dermatology, University of Michigan Medical School, Ann Arbor, MI, USA
- Department of Computational Medicine & Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA
| |
Collapse
|
7
|
Lu J, Wang W, Xu M, Li Y, Chen C, Wang X. A global view of regulatory networks in lung cancer: An approach to understand homogeneity and heterogeneity. Semin Cancer Biol 2016; 42:31-38. [PMID: 27894849 DOI: 10.1016/j.semcancer.2016.11.004] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2016] [Accepted: 11/08/2016] [Indexed: 12/12/2022]
Abstract
A number of new biotechnologies are used to identify potential biomarkers for the early detection of lung cancer, enabling a personalized therapy to be developed in response. The combinatorial cross-regulation of hundreds of biological function-specific transcription factors (TFs) is defined as the understanding of regulatory networks of molecules within the cell. Here we integrated global databases with 537 patients with lung adenocarcinoma (ADC), 140 with lung squamous carcinoma (SCC), 9 with lung large-cell carcinoma (LCC), 56 with small-cell lung cancer (SCLC), and 590 without cancer with the understanding of TF functions. The present review aims at the homogeneity or heterogeneity of gene expression profiles among subtypes of lung cancer. About 5, 136, 52, or 16 up-regulated or 19, 24, 122, or 97down-regulated type-special TF genes were identified in ADC, SCC, LCC or SCLC, respectively. DNA-binding and transcription regulator activity associated genes play a dominant role in the differentiation of subtypes in lung cancer. Subtype-specific TF gene regulatory networks with elements should be an alternative for diagnostic and therapeutic targets for early identification of lung cancer and can provide insightful clues to etiology and pathogenesis.
Collapse
Affiliation(s)
- Jiapei Lu
- Department of Pulmonary Medicine, The First Affiliated Hospital, Wenzhou Medical University, Wenzhou, China
| | - William Wang
- Department of Pulmonary Medicine, The First Affiliated Hospital, Wenzhou Medical University, Wenzhou, China
| | - Menglin Xu
- Department of Pulmonary Medicine, The First Affiliated Hospital, Wenzhou Medical University, Wenzhou, China
| | - Yuping Li
- Department of Pulmonary Medicine, The First Affiliated Hospital, Wenzhou Medical University, Wenzhou, China
| | - Chengshui Chen
- Department of Pulmonary Medicine, The First Affiliated Hospital, Wenzhou Medical University, Wenzhou, China
| | - Xiangdong Wang
- Department of Pulmonary Medicine, The First Affiliated Hospital, Wenzhou Medical University, Wenzhou, China.
| |
Collapse
|
8
|
CLIP-GENE: a web service of the condition specific context-laid integrative analysis for gene prioritization in mouse TF knockout experiments. Biol Direct 2016; 11:57. [PMID: 27776539 PMCID: PMC5078909 DOI: 10.1186/s13062-016-0158-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2016] [Accepted: 10/10/2016] [Indexed: 02/06/2023] Open
Abstract
MOTIVATION Transcriptome data from the gene knockout experiment in mouse is widely used to investigate functions of genes and relationship to phenotypes. When a gene is knocked out, it is important to identify which genes are affected by the knockout gene. Existing methods, including differentially expressed gene (DEG) methods, can be used for the analysis. However, existing methods require cutoff values to select candidate genes, which can produce either too many false positives or false negatives. This hurdle can be addressed either by improving the accuracy of gene selection or by providing a method to rank candidate genes effectively, or both. Prioritization of candidate genes should consider the goals or context of the knockout experiment. As of now, there are no tools designed for both selecting and prioritizing genes from the mouse knockout data. Hence, the necessity of a new tool arises. RESULTS In this study, we present CLIP-GENE, a web service that selects gene markers by utilizing differentially expressed genes, mouse transcription factor (TF) network, and single nucleotide variant information. Then, protein-protein interaction network and literature information are utilized to find genes that are relevant to the phenotypic differences. One of the novel features is to allow researchers to specify their contexts or hypotheses in a set of keywords to rank genes according to the contexts that the user specify. We believe that CLIP-GENE will be useful in characterizing functions of TFs in mouse experiments. AVAILABILITY http://epigenomics.snu.ac.kr/CLIP-GENE REVIEWERS: This article was reviewed by Dr. Lee and Dr. Pongor.
Collapse
|
9
|
Zhu J, Wang S, Zhang W, Qiu J, Shan Y, Yang D, Shen B. Screening key microRNAs for castration-resistant prostate cancer based on miRNA/mRNA functional synergistic network. Oncotarget 2016; 6:43819-30. [PMID: 26540468 PMCID: PMC4791269 DOI: 10.18632/oncotarget.6102] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2015] [Accepted: 10/17/2015] [Indexed: 12/18/2022] Open
Abstract
High-throughput methods have been used to explore the mechanisms by which androgen-sensitive prostate cancer (ASPC) develops into castration-resistant prostate cancer (CRPC). However, it is difficult to interpret cryptic results by routine experimental methods. In this study, we performed systematic and integrative analysis to detect key miRNAs that contribute to CRPC development. From three DNA microarray datasets, we retrieved 11 outlier microRNAs (miRNAs) that had expression discrepancies between ASPC and CRPC using a specific algorithm. Two of the miRNAs (miR-125b and miR-124) have previously been shown to be related to CRPC. Seven out of the other nine miRNAs were confirmed by quantitative PCR (Q-PCR) analysis. MiR-210, miR-218, miR-346, miR-197, and miR-149 were found to be over-expressed, while miR-122, miR-145, and let-7b were under-expressed in CRPC cell lines. GO and KEGG pathway analyses revealed that miR-218, miR-197, miR-145, miR-122, and let-7b, along with their target genes, were found to be involved in the PI3K and AKT3 signaling network, which is known to contribute to CRPC development. We then chose five miRNAs to verify the accuracy of the analysis. The target genes of each miRNA were altered significantly upon transfection of specific miRNA mimics in the C4–2 CRPC cell line, which was consistent with our pathway analysis results. Finally, we hypothesized that miR-218, miR-145, miR-197, miR-149, miR-122, and let-7b may contribute to the development of CRPC through the influence of Ras, Rho proteins, and the SCF complex. Further investigation is needed to verify the functions of the identified novel pathways in CRPC development.
Collapse
Affiliation(s)
- Jin Zhu
- Department of Urology, Second Affiliated Hospital of Soochow University, Suzhou, China
| | - Sugui Wang
- Department of Urology, Second Affiliated Hospital of Soochow University, Suzhou, China.,Department of Urology, Huai'an Hospital Affiliated to Xuzhou Medical College and Second People's Hospital of Huai'an, Huai'an, China
| | - Wenyu Zhang
- Center for Systems Biology, Soochow University, Suzhou, China
| | - Junyi Qiu
- Department of General Surgery, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Yuxi Shan
- Department of Urology, Second Affiliated Hospital of Soochow University, Suzhou, China
| | - Dongrong Yang
- Department of Urology, Second Affiliated Hospital of Soochow University, Suzhou, China
| | - Bairong Shen
- Center for Systems Biology, Soochow University, Suzhou, China
| |
Collapse
|
10
|
Pesch R, Zimmer R. Cross-species Conservation of context-specific networks. BMC SYSTEMS BIOLOGY 2016; 10:76. [PMID: 27531214 PMCID: PMC4988053 DOI: 10.1186/s12918-016-0304-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/10/2016] [Accepted: 07/04/2016] [Indexed: 11/20/2022]
Abstract
BACKGROUND Many large data compendia on context-specific high-throughput genomic and regulatory data have been made available by international research consortia such as ENCODE, TCGA, and Epigenomics Roadmap. The use of these resources is impaired by the sheer size of the available big data and big metadata. Many of these context-specific data can be modeled as data derived regulatory networks (DDRNs) representing the complex and complicated interactions between transcription factors and target genes. These DDRNs are useful for the understanding of regulatory mechanisms and helpful for interpreting biomedical data. RESULTS The Cross-species Conservation framework (CroCo) provides a network-oriented view on the ENCODE regulatory data (CroCo network repository), convenient ways to access and browse networks and metadata, and a method to combine networks across compendia, experimental techniques, and species (CroCo tool suite). DDRNs can be combined with additional information and networks derived from the literature, curated resources, and computational predictions in order to enable detailed exploration and cross checking of regulatory interactions. Applications of the CroCo framework range from simple evidence look-up for user-defined regulatory interactions to the identification of conserved sub-networks in diverse cell-lines, conditions, and even species. CONCLUSION CroCo adds an intuitive unifying view on the data from the ENCODE projects via a comprehensive repository of derived context-specific regulatory networks and enables flexible cross-context, cross-species, and cross-compendia comparison via a basis set of analysis tools. The CroCo web-application and Cytoscape plug-in are freely available at: http://services.bio.ifi.lmu.de/croco-web . The web-page links to a detailed system description, a user guide, and tutorial videos presenting common use cases of the CroCo framework.
Collapse
Affiliation(s)
- Robert Pesch
- Institute for Informatics, Ludwig-Maximilians-Universität München, Amalienstrasse 17, München, Germany
| | - Ralf Zimmer
- Institute for Informatics, Ludwig-Maximilians-Universität München, Amalienstrasse 17, München, Germany
| |
Collapse
|
11
|
Jurca G, Addam O, Aksac A, Gao S, Özyer T, Demetrick D, Alhajj R. Integrating text mining, data mining, and network analysis for identifying genetic breast cancer trends. BMC Res Notes 2016; 9:236. [PMID: 27112211 PMCID: PMC4845430 DOI: 10.1186/s13104-016-2023-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2015] [Accepted: 04/05/2016] [Indexed: 01/08/2023] Open
Abstract
Background Breast cancer is a serious disease which affects many women and may lead to death. It has received considerable attention from the research community. Thus, biomedical researchers aim to find genetic biomarkers indicative of the disease. Novel biomarkers can be elucidated from the existing literature. However, the vast amount of scientific publications on breast cancer make this a daunting task. This paper presents a framework which investigates existing literature data for informative discoveries. It integrates text mining and social network analysis in order to identify new potential biomarkers for breast cancer. Results We utilized PubMed for the testing. We investigated gene–gene interactions, as well as novel interactions such as gene-year, gene-country, and abstract-country to find out how the discoveries varied over time and how overlapping/diverse are the discoveries and the interest of various research groups in different countries. Conclusions Interesting trends have been identified and discussed, e.g., different genes are highlighted in relationship to different countries though the various genes were found to share functionality. Some text analysis based results have been validated against results from other tools that predict gene–gene relations and gene functions. Electronic supplementary material The online version of this article (doi:10.1186/s13104-016-2023-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Gabriela Jurca
- Department of Computer Science, University of Calgary, Calgary, AB, Canada
| | - Omar Addam
- Department of Computer Science, University of Calgary, Calgary, AB, Canada
| | - Alper Aksac
- Department of Computer Science, University of Calgary, Calgary, AB, Canada
| | - Shang Gao
- College of Computer Science and Technology, Jilin University, Changchun, China
| | - Tansel Özyer
- Department of Computer Engineering, TOBB University, Ankara, Turkey
| | - Douglas Demetrick
- Departments of Pathology, Oncology and Biochemistry & Molecular Biology, University of Calgary, Calgary, AB, Canada
| | - Reda Alhajj
- Department of Computer Science, University of Calgary, Calgary, AB, Canada. .,Department of Computer Science, Global University, Beirut, Lebanon.
| |
Collapse
|
12
|
Matone A, Scott-Boyer MP, Carayol J, Fazelzadeh P, Lefebvre G, Valsesia A, Charon C, Vervoort J, Astrup A, Saris WHM, Morine M, Hager J. Network Analysis of Metabolite GWAS Hits: Implication of CPS1 and the Urea Cycle in Weight Maintenance. PLoS One 2016; 11:e0150495. [PMID: 26938218 PMCID: PMC4777532 DOI: 10.1371/journal.pone.0150495] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2015] [Accepted: 02/15/2016] [Indexed: 01/09/2023] Open
Abstract
Background and Scope Weight loss success is dependent on the ability to refrain from regaining the lost weight in time. This feature was shown to be largely variable among individuals, and these differences, with their underlying molecular processes, are diverse and not completely elucidated. Altered plasma metabolites concentration could partly explain weight loss maintenance mechanisms. In the present work, a systems biology approach has been applied to investigate the potential mechanisms involved in weight loss maintenance within the Diogenes weight-loss intervention study. Methods and Results A genome wide association study identified SNPs associated with plasma glycine levels within the CPS1 (Carbamoyl-Phosphate Synthase 1) gene (rs10206976, p-value = 4.709e-11 and rs12613336, p-value = 1.368e-08). Furthermore, gene expression in the adipose tissue showed that CPS1 expression levels were associated with successful weight maintenance and with several SNPs within CPS1 (cis-eQTL). In order to contextualize these results, a gene-metabolite interaction network of CPS1 and glycine has been built and analyzed, showing functional enrichment in genes involved in lipid metabolism and one carbon pool by folate pathways. Conclusions CPS1 is the rate-limiting enzyme for the urea cycle, catalyzing carbamoyl phosphate from ammonia and bicarbonate in the mitochondria. Glycine and CPS1 are connected through the one-carbon pool by the folate pathway and the urea cycle. Furthermore, glycine could be linked to metabolic health and insulin sensitivity through the betaine osmolyte. These considerations, and the results from the present study, highlight a possible role of CPS1 and related pathways in weight loss maintenance, suggesting that it might be partly genetically determined in humans.
Collapse
Affiliation(s)
- Alice Matone
- The Microsoft Research—University of Trento Centre for Computational Systems Biology (COSBI), Rovereto, Italy
| | - Marie-Pier Scott-Boyer
- The Microsoft Research—University of Trento Centre for Computational Systems Biology (COSBI), Rovereto, Italy
| | - Jerome Carayol
- Nestlé Institute of Health Sciences SA, Lausanne, Switzerland
| | - Parastoo Fazelzadeh
- Nutrition, Metabolism & Genomics group, University of Wageningen, Wageningen, Netherlands
| | | | - Armand Valsesia
- Nestlé Institute of Health Sciences SA, Lausanne, Switzerland
| | - Celine Charon
- CEA-Genomics Institute- National Genotyping Center, Evry, France
| | - Jacques Vervoort
- Nutrition, Metabolism & Genomics group, University of Wageningen, Wageningen, Netherlands
| | - Arne Astrup
- Department of Nutrition, Exercise and Sports, University of Copenhagen, Copenhagen, Denmark
| | - Wim H. M. Saris
- Dept of Human Biology Medical and Health Science Faculty, University of Maastricht, Maastricht, Netherlands
| | - Melissa Morine
- The Microsoft Research—University of Trento Centre for Computational Systems Biology (COSBI), Rovereto, Italy
| | - Jörg Hager
- Nestlé Institute of Health Sciences SA, Lausanne, Switzerland
- * E-mail:
| |
Collapse
|
13
|
Liu Y, Liang Y, Wishart D. PolySearch2: a significantly improved text-mining system for discovering associations between human diseases, genes, drugs, metabolites, toxins and more. Nucleic Acids Res 2015; 43:W535-42. [PMID: 25925572 PMCID: PMC4489268 DOI: 10.1093/nar/gkv383] [Citation(s) in RCA: 91] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2015] [Accepted: 04/11/2015] [Indexed: 02/01/2023] Open
Abstract
PolySearch2 (http://polysearch.ca) is an online text-mining system for identifying relationships between biomedical entities such as human diseases, genes, SNPs, proteins, drugs, metabolites, toxins, metabolic pathways, organs, tissues, subcellular organelles, positive health effects, negative health effects, drug actions, Gene Ontology terms, MeSH terms, ICD-10 medical codes, biological taxonomies and chemical taxonomies. PolySearch2 supports a generalized ‘Given X, find all associated Ys’ query, where X and Y can be selected from the aforementioned biomedical entities. An example query might be: ‘Find all diseases associated with Bisphenol A’. To find its answers, PolySearch2 searches for associations against comprehensive collections of free-text collections, including local versions of MEDLINE abstracts, PubMed Central full-text articles, Wikipedia full-text articles and US Patent application abstracts. PolySearch2 also searches 14 widely used, text-rich biological databases such as UniProt, DrugBank and Human Metabolome Database to improve its accuracy and coverage. PolySearch2 maintains an extensive thesaurus of biological terms and exploits the latest search engine technology to rapidly retrieve relevant articles and databases records. PolySearch2 also generates, ranks and annotates associative candidates and present results with relevancy statistics and highlighted key sentences to facilitate user interpretation.
Collapse
Affiliation(s)
- Yifeng Liu
- Department of Computing Science, University of Alberta, Edmonton, T6G 2E8 Canada
| | - Yongjie Liang
- Department of Computing Science, University of Alberta, Edmonton, T6G 2E8 Canada
| | - David Wishart
- Department of Computing Science, University of Alberta, Edmonton, T6G 2E8 Canada Department of Biological Science, University of Alberta, Edmonton, T6G 2E6 Canada
| |
Collapse
|
14
|
Tsiliki G, Kossida S, Friesen N, Rüping S, Tzagarakis M, Karacapilidis N. A Data Mining Based Approach for Collaborative Analysis of Biomedical Data. INT J ARTIF INTELL T 2014. [DOI: 10.1142/s0218213014600100] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Biomedical research becomes increasingly multidisciplinary and collaborative in nature. At the same time, it has recently seen a vast growth in publicly and instantly available information. As the available resources become more specialized, there is a growing need for multidisciplinary collaborations between biomedical researchers to address complex research questions. We present an application of a data mining algorithm to genomic data in a collaborative decision-making support environment, as a typical example of how multidisciplinary researchers can collaborate in analyzing and interpreting biomedical data. Through the proposed approach, researchers can easily decide about which data repositories should be considered, analyze the algorithmic results, discuss the weaknesses of the patterns identified, and set up new iterations of the data mining algorithm by defining other descriptive attributes or integrating other relevant data. Evaluation results show that the proposed approach facilitates users to set their research objectives and better understand the data and methodologies used in their research.
Collapse
Affiliation(s)
- Georgia Tsiliki
- Bioinformatics and Medical Informatics Team, Biomedical Research Foundation, Academy of Athens, 4 Soranou Ephessiou 115 27, Greece
| | - Sophia Kossida
- Bioinformatics and Medical Informatics Team, Biomedical Research Foundation, Academy of Athens, 4 Soranou Ephessiou 115 27, Greece
| | - Natalja Friesen
- Knowledge Discovery Group, Fraunhofer Institute IAIS, Sankt Augustin, Germany
| | - Stefan Rüping
- Knowledge Discovery Group, Fraunhofer Institute IAIS, Sankt Augustin, Germany
| | - Manolis Tzagarakis
- University of Patras and Computer Technology Institute & Press “Diophantus”, Rio Patras, Greece
| | - Nikos Karacapilidis
- University of Patras and Computer Technology Institute & Press “Diophantus”, Rio Patras, Greece
| |
Collapse
|
15
|
Titz B, Elamin A, Martin F, Schneider T, Dijon S, Ivanov NV, Hoeng J, Peitsch MC. Proteomics for systems toxicology. Comput Struct Biotechnol J 2014; 11:73-90. [PMID: 25379146 PMCID: PMC4212285 DOI: 10.1016/j.csbj.2014.08.004] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Current toxicology studies frequently lack measurements at molecular resolution to enable a more mechanism-based and predictive toxicological assessment. Recently, a systems toxicology assessment framework has been proposed, which combines conventional toxicological assessment strategies with system-wide measurement methods and computational analysis approaches from the field of systems biology. Proteomic measurements are an integral component of this integrative strategy because protein alterations closely mirror biological effects, such as biological stress responses or global tissue alterations. Here, we provide an overview of the technical foundations and highlight select applications of proteomics for systems toxicology studies. With a focus on mass spectrometry-based proteomics, we summarize the experimental methods for quantitative proteomics and describe the computational approaches used to derive biological/mechanistic insights from these datasets. To illustrate how proteomics has been successfully employed to address mechanistic questions in toxicology, we summarized several case studies. Overall, we provide the technical and conceptual foundation for the integration of proteomic measurements in a more comprehensive systems toxicology assessment framework. We conclude that, owing to the critical importance of protein-level measurements and recent technological advances, proteomics will be an integral part of integrative systems toxicology approaches in the future.
Collapse
|
16
|
Thomas M, Daemen A, De Moor B. Maximum Likelihood Estimation of GEVD: Applications in Bioinformatics. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:673-680. [PMID: 26356338 DOI: 10.1109/tcbb.2014.2304292] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
We propose a method, maximum likelihood estimation of generalized eigenvalue decomposition (MLGEVD) that employs a well known technique relying on the generalization of singular value decomposition (SVD). The main aim of the work is to show the tight equivalence between MLGEVD and generalized ridge regression. This relationship reveals an important mathematical property of GEVD in which the second argument act as prior information in the model. Thus we show that MLGEVD allows the incorporation of external knowledge about the quantities of interest into the estimation problem. We illustrate the importance of prior knowledge in clinical decision making/identifying differentially expressed genes with case studies for which microarray data sets with corresponding clinical/literature information are available. On all of these three case studies, MLGEVD outperformed GEVD on prediction in terms of test area under the ROC curve (test AUC). MLGEVD results in significantly improved diagnosis, prognosis and prediction of therapy response.
Collapse
|
17
|
Identification of a QTL in Mus musculus for alcohol preference, withdrawal, and Ap3m2 expression using integrative functional genomics and precision genetics. Genetics 2014; 197:1377-93. [PMID: 24923803 DOI: 10.1534/genetics.114.166165] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Extensive genetic and genomic studies of the relationship between alcohol drinking preference and withdrawal severity have been performed using animal models. Data from multiple such publications and public data resources have been incorporated in the GeneWeaver database with >60,000 gene sets including 285 alcohol withdrawal and preference-related gene sets. Among these are evidence for positional candidates regulating these behaviors in overlapping quantitative trait loci (QTL) mapped in distinct mouse populations. Combinatorial integration of functional genomics experimental results revealed a single QTL positional candidate gene in one of the loci common to both preference and withdrawal. Functional validation studies in Ap3m2 knockout mice confirmed these relationships. Genetic validation involves confirming the existence of segregating polymorphisms that could account for the phenotypic effect. By exploiting recent advances in mouse genotyping, sequence, epigenetics, and phylogeny resources, we confirmed that Ap3m2 resides in an appropriately segregating genomic region. We have demonstrated genetic and alcohol-induced regulation of Ap3m2 expression. Although sequence analysis revealed no polymorphisms in the Ap3m2-coding region that could account for all phenotypic differences, there are several upstream SNPs that could. We have identified one of these to be an H3K4me3 site that exhibits strain differences in methylation. Thus, by making cross-species functional genomics readily computable we identified a common QTL candidate for two related bio-behavioral processes via functional evidence and demonstrate sufficiency of the genetic locus as a source of variation underlying two traits.
Collapse
|
18
|
Bezerra CA, Guimarães AJR. Mineração de texto aplicada às publicações científicas sobre gestão do conhecimento no período de 2003 a 2012. PERSPECTIVAS EM CIÊNCIA DA INFORMAÇÃO 2014. [DOI: 10.1590/1981-5344/1834] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Desde sua apresentação, em meados da década de 1980, o tema 'Gestão do Conhecimento' (GC) tem sido abordado sob os mais diversos aspectos. Neste sentido, uma vez observado o aumento da quantidade de trabalhos publicados em bases acadêmicas, nos últimos 10 anos, os objetivos da presente pesquisa são (1) encontrar os termos mais empregados nestes trabalhos e as (2) diferenças entre os termos encontrados em publicações brasileiras e estrangeiras. Para isto, empregou-se a técnica de mineração de textos. Os resultados mostram que temas envolvendo práticas, processos, sistemas de GC e tecnologias de informação são recorrentemente encontrados tanto em resumos de artigos nacionais como internacionais. Por outro lado, de maneira geral, os artigos nacionais se distinguem pelo fato de apresentarem termos mais associados a aspectos abstratos da GC (modelos, conhecimento tácito e organizacional), enquanto que os internacionais caracterizam-se por termos mais pragmáticos (efetividade, implementação, relacionamentos e desenvolvimento).
Collapse
|
19
|
Hsu YY, Kao HY. CoIN: a network analysis for document triage. Database (Oxford) 2013; 2013:bat076. [PMID: 24218542 PMCID: PMC3822784 DOI: 10.1093/database/bat076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2012] [Revised: 09/23/2013] [Accepted: 10/18/2013] [Indexed: 06/02/2023]
Abstract
In recent years, there was a rapid increase in the number of medical articles. The number of articles in PubMed has increased exponentially. Thus, the workload for biocurators has also increased exponentially. Under these circumstances, a system that can automatically determine in advance which article has a higher priority for curation can effectively reduce the workload of biocurators. Determining how to effectively find the articles required by biocurators has become an important task. In the triage task of BioCreative 2012, we proposed the Co-occurrence Interaction Nexus (CoIN) for learning and exploring relations in articles. We constructed a co-occurrence analysis system, which is applicable to PubMed articles and suitable for gene, chemical and disease queries. CoIN uses co-occurrence features and their network centralities to assess the influence of curatable articles from the Comparative Toxicogenomics Database. The experimental results show that our network-based approach combined with co-occurrence features can effectively classify curatable and non-curatable articles. CoIN also allows biocurators to survey the ranking lists for specific queries without reviewing meaningless information. At BioCreative 2012, CoIN achieved a 0.778 mean average precision in the triage task, thus finishing in second place out of all participants. Database URL: http://ikmbio.csie.ncku.edu.tw/coin/home.php.
Collapse
Affiliation(s)
| | - Hung-Yu Kao
- *Corresponding author: Tel: +886-06-2757575 ext 62546; Fax: +886-6-2747076;
| |
Collapse
|
20
|
Vos R, Aarts S, van Mulligen E, Metsemakers J, van Boxtel MP, Verhey F, van den Akker M. Finding potentially new multimorbidity patterns of psychiatric and somatic diseases: exploring the use of literature-based discovery in primary care research. J Am Med Inform Assoc 2013; 21:139-45. [PMID: 23775174 DOI: 10.1136/amiajnl-2012-001448] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
BACKGROUND Multimorbidity, the co-occurrence of two or more chronic medical conditions within a single individual, is increasingly becoming part of daily care of general medical practice. Literature-based discovery may help to investigate the patterns of multimorbidity and to integrate medical knowledge for improving healthcare delivery for individuals with co-occurring chronic conditions. OBJECTIVE To explore the usefulness of literature-based discovery in primary care research through the key-case of finding associations between psychiatric and somatic diseases relevant to general practice in a large biomedical literature database (Medline). METHODS By using literature based discovery for matching disease profiles as vectors in a high-dimensional associative concept space, co-occurrences of a broad spectrum of chronic medical conditions were matched for their potential in biomedicine. An experimental setting was chosen in parallel with expert evaluations and expert meetings to assess performance and to generate targets for integrating literature-based discovery in multidisciplinary medical research of psychiatric and somatic disease associations. RESULTS Through stepwise reductions a reference set of 21,945 disease combinations was generated, from which a set of 166 combinations between psychiatric and somatic diseases was selected and assessed by text mining and expert evaluation. CONCLUSIONS Literature-based discovery tools generate specific patterns of associations between psychiatric and somatic diseases: one subset was appraised as promising for further research; the other subset surprised the experts, leading to intricate discussions and further eliciting of frameworks of biomedical knowledge. These frameworks enable us to specify targets for further developing and integrating literature-based discovery in multidisciplinary research of general practice, psychology and psychiatry, and epidemiology.
Collapse
Affiliation(s)
- Rein Vos
- School for Public Health and Primary Care: CAPHRI, Maastricht University, Maastricht, The Netherlands
| | | | | | | | | | | | | |
Collapse
|
21
|
Naika M, Shameer K, Sowdhamini R. Comparative analyses of stress-responsive genes in Arabidopsis thaliana: insight from genomic data mining, functional enrichment, pathway analysis and phenomics. MOLECULAR BIOSYSTEMS 2013; 9:1888-908. [PMID: 23645342 DOI: 10.1039/c3mb70072k] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Biotic and abiotic stresses adversely affect agriculture by reducing crop growth and productivity worldwide. To investigate the abiotic stress-responsive genes in Arabidopsis thaliana, we compiled a dataset of stress signals and differentially upregulated genes (>= 2.5 fold change) from Stress-responsive transcription Factors DataBase (STIFDB) with additional set of stress signals and genes curated from PubMed and Gene Expression Omnibus. A dataset of 3091 genes differentially upregulated due to 14 different stress signals (abscisic acid, aluminum, cold, cold-drought-salt, dehydration, drought, heat, iron, light, NaCl, osmotic stress, oxidative stress, UV-B and wounding) were curated and used for the analysis. Details about stress-responsive enriched genes and their association with stress signals can be obtained from STIFDB2 database . The gene-stress-signal data were analyzed using an enrichment-based meta-analysis framework consisting of two different ontologies (Gene Ontology and Plant Ontology), biological pathway and functional domain annotations. We found several shared and distinct biological processes, cellular components and molecular functions associated with stress-responsive genes. Pathway analysis revealed that stress-responsive genes perturbed the pathways under the "Metabolic pathways" category. We also found several shared and stress-signal specific protein domains, suggesting functional mechanisms regulating stress-response. Phenomic characteristics of abiotic stress-responsive genes were ascertained for several stresses and found to be shared by multiple stresses in both anatomy and temporal categories of Plant Ontology. We found several constitutive stress-responsive genes that are differentially upregulated due to perturbation of different stress signals, for example a gene (AT1G68440) involved in phenylpropanoid metabolism and polyamine catabolism as responsive to seven different stress signals. We also performed structure-function prediction of five genes associated responsive to multiple abiotic stress signals. We envisage that results from our analysis that provide insight into functional repertoire, metabolic pathways and phenomic characteristics common and specifically associated with stress signals would help to understand abiotic stress regulome in Arabidopsis thaliana and may also help to develop an improved plant variety using molecular breeding and genetic engineering techniques that are rapidly stress-responsive and tolerant.
Collapse
Affiliation(s)
- Mahantesha Naika
- National Centre for Biological Sciences (TIFR), GKVK Campus, Bangalore, 560065, India.
| | | | | |
Collapse
|
22
|
Van Landeghem S, De Bodt S, Drebert ZJ, Inzé D, Van de Peer Y. The potential of text mining in data integration and network biology for plant research: a case study on Arabidopsis. THE PLANT CELL 2013; 25:794-807. [PMID: 23532071 PMCID: PMC3634689 DOI: 10.1105/tpc.112.108753] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2012] [Revised: 02/27/2013] [Accepted: 03/08/2013] [Indexed: 05/21/2023]
Abstract
Despite the availability of various data repositories for plant research, a wealth of information currently remains hidden within the biomolecular literature. Text mining provides the necessary means to retrieve these data through automated processing of texts. However, only recently has advanced text mining methodology been implemented with sufficient computational power to process texts at a large scale. In this study, we assess the potential of large-scale text mining for plant biology research in general and for network biology in particular using a state-of-the-art text mining system applied to all PubMed abstracts and PubMed Central full texts. We present extensive evaluation of the textual data for Arabidopsis thaliana, assessing the overall accuracy of this new resource for usage in plant network analyses. Furthermore, we combine text mining information with both protein-protein and regulatory interactions from experimental databases. Clusters of tightly connected genes are delineated from the resulting network, illustrating how such an integrative approach is essential to grasp the current knowledge available for Arabidopsis and to uncover gene information through guilt by association. All large-scale data sets, as well as the manually curated textual data, are made publicly available, hereby stimulating the application of text mining data in future plant biology studies.
Collapse
Affiliation(s)
- Sofie Van Landeghem
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
| | - Stefanie De Bodt
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
| | - Zuzanna J. Drebert
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
| | - Dirk Inzé
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
| | - Yves Van de Peer
- Department of Plant Systems Biology, VIB, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
- Address correspondence to
| |
Collapse
|
23
|
Ewald JA, Downs TM, Cetnar JP, Ricke WA. Expression microarray meta-analysis identifies genes associated with Ras/MAPK and related pathways in progression of muscle-invasive bladder transition cell carcinoma. PLoS One 2013; 8:e55414. [PMID: 23383328 PMCID: PMC3562183 DOI: 10.1371/journal.pone.0055414] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2012] [Accepted: 12/22/2012] [Indexed: 11/19/2022] Open
Abstract
The effective detection and management of muscle-invasive bladder Transition Cell Carcinoma (TCC) continues to be an urgent clinical challenge. While some differences of gene expression and function in papillary (Ta), superficial (T1) and muscle-invasive (≥T2) bladder cancers have been investigated, the understanding of mechanisms involved in the progression of bladder tumors remains incomplete. Statistical methods of pathway-enrichment, cluster analysis and text-mining can extract and help interpret functional information about gene expression patterns in large sets of genomic data. The public availability of patient-derived expression microarray data allows open access and analysis of large amounts of clinical data. Using these resources, we investigated gene expression differences associated with tumor progression and muscle-invasive TCC. Gene expression was calculated relative to Ta tumors to assess progression-associated differences, revealing a network of genes related to Ras/MAPK and PI3K signaling pathways with increased expression. Further, we identified genes within this network that are similarly expressed in superficial Ta and T1 stages but altered in muscle-invasive T2 tumors, finding 7 genes (COL3A1, COL5A1, COL11A1, FN1, ErbB3, MAPK10 and CDC25C) whose expression patterns in muscle-invasive tumors are consistent in 5 to 7 independent outside microarray studies. Further, we found increased expression of the fibrillar collagen proteins COL3A1 and COL5A1 in muscle-invasive tumor samples and metastatic T24 cells. Our results suggest that increased expression of genes involved in mitogenic signaling may support the progression of muscle-invasive bladder tumors that generally lack activating mutations in these pathways, while expression changes of fibrillar collagens, fibronectin and specific signaling proteins are associated with muscle-invasive disease. These results identify potential biomarkers and targets for TCC treatments, and provide an integrated systems-level perspective of TCC pathobiology to inform future studies.
Collapse
Affiliation(s)
- Jonathan A. Ewald
- Department of Urology, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin, United States of America
- University of Wisconsin Carbone Cancer Center, Madison, Wisconsin, United States of America
| | - Tracy M. Downs
- Department of Urology, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin, United States of America
- University of Wisconsin Carbone Cancer Center, Madison, Wisconsin, United States of America
| | - Jeremy P. Cetnar
- Department of Medicine, Hematology/Oncology Unit, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin, United States of America
- University of Wisconsin Carbone Cancer Center, Madison, Wisconsin, United States of America
| | - William A. Ricke
- Department of Urology, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin, United States of America
- University of Wisconsin Carbone Cancer Center, Madison, Wisconsin, United States of America
- * E-mail:
| |
Collapse
|
24
|
Chakrabarti A, Verbridge S, Stroock AD, Fischbach C, Varner JD. Multiscale models of breast cancer progression. Ann Biomed Eng 2012; 40:2488-500. [PMID: 23008097 DOI: 10.1007/s10439-012-0655-8] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2012] [Accepted: 09/04/2012] [Indexed: 12/13/2022]
Abstract
Breast cancer initiation, invasion and metastasis span multiple length and time scales. Molecular events at short length scales lead to an initial tumorigenic population, which left unchecked by immune action, acts at increasingly longer length scales until eventually the cancer cells escape from the primary tumor site. This series of events is highly complex, involving multiple cell types interacting with (and shaping) the microenvironment. Multiscale mathematical models have emerged as a powerful tool to quantitatively integrate the convective-diffusion-reaction processes occurring on the systemic scale, with the molecular signaling processes occurring on the cellular and subcellular scales. In this study, we reviewed the current state of the art in cancer modeling across multiple length scales, with an emphasis on the integration of intracellular signal transduction models with pro-tumorigenic chemical and mechanical microenvironmental cues. First, we reviewed the underlying biomolecular origin of breast cancer, with a special emphasis on angiogenesis. Then, we summarized the development of tissue engineering platforms which could provide high-fidelity ex vivo experimental models to identify and validate multiscale simulations. Lastly, we reviewed top-down and bottom-up multiscale strategies that integrate subcellular networks with the microenvironment. We present models of a variety of cancers, in addition to breast cancer specific models. Taken together, we expect as the sophistication of the simulations increase, that multiscale modeling and bottom-up agent-based models in particular will become an increasingly important platform technology for basic scientific discovery, as well as the identification and validation of potentially novel therapeutic targets.
Collapse
Affiliation(s)
- Anirikh Chakrabarti
- School of Chemical and Biomolecular Engineering, 244 Olin Hall, Cornell University, Ithaca, NY 14853, USA
| | | | | | | | | |
Collapse
|
25
|
Pavey SA, Bernatchez L, Aubin-Horth N, Landry CR. What is needed for next-generation ecological and evolutionary genomics? Trends Ecol Evol 2012; 27:673-8. [PMID: 22902072 DOI: 10.1016/j.tree.2012.07.014] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2012] [Revised: 07/23/2012] [Accepted: 07/25/2012] [Indexed: 11/27/2022]
Abstract
Ecological and evolutionary genomics (EEG) aims to link gene functions and genomic features to phenotypes and ecological factors. Although the rapid development of technologies allows central questions to be addressed at an unprecedented level of molecular detail, they do not alleviate one of the major challenges of EEG, which is that a large fraction of genes remains without any annotation. Here, we propose two solutions to this challenge. The first solution is in the form of a database that regroups associations between genes, organismal attributes and abiotic and biotic conditions. This database would result in an ecological annotation of genes by allowing cross-referencing across studies and taxa. Our second solution is to use new functional techniques to characterize genes implicated in the response to ecological challenges.
Collapse
Affiliation(s)
- Scott A Pavey
- Département de Biologie & Institut de Biologie Intégrative et des Systèmes (IBIS), Pavillon Charles-Eugène-Marchand, Université Laval, QC, Canada
| | | | | | | |
Collapse
|
26
|
Hur J, Xiang Z, Feldman EL, He Y. Ontology-based Brucella vaccine literature indexing and systematic analysis of gene-vaccine association network. BMC Immunol 2011; 12:49. [PMID: 21871085 PMCID: PMC3180695 DOI: 10.1186/1471-2172-12-49] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2011] [Accepted: 08/26/2011] [Indexed: 11/22/2022] Open
Abstract
Background Vaccine literature indexing is poorly performed in PubMed due to limited hierarchy of Medical Subject Headings (MeSH) annotation in the vaccine field. Vaccine Ontology (VO) is a community-based biomedical ontology that represents various vaccines and their relations. SciMiner is an in-house literature mining system that supports literature indexing and gene name tagging. We hypothesize that application of VO in SciMiner will aid vaccine literature indexing and mining of vaccine-gene interaction networks. As a test case, we have examined vaccines for Brucella, the causative agent of brucellosis in humans and animals. Results The VO-based SciMiner (VO-SciMiner) was developed to incorporate a total of 67 Brucella vaccine terms. A set of rules for term expansion of VO terms were learned from training data, consisting of 90 biomedical articles related to Brucella vaccine terms. VO-SciMiner demonstrated high recall (91%) and precision (99%) from testing a separate set of 100 manually selected biomedical articles. VO-SciMiner indexing exhibited superior performance in retrieving Brucella vaccine-related papers over that obtained with MeSH-based PubMed literature search. For example, a VO-SciMiner search of "live attenuated Brucella vaccine" returned 922 hits as of April 20, 2011, while a PubMed search of the same query resulted in only 74 hits. Using the abstracts of 14,947 Brucella-related papers, VO-SciMiner identified 140 Brucella genes associated with Brucella vaccines. These genes included known protective antigens, virulence factors, and genes closely related to Brucella vaccines. These VO-interacting Brucella genes were significantly over-represented in biological functional categories, including metabolite transport and metabolism, replication and repair, cell wall biogenesis, intracellular trafficking and secretion, posttranslational modification, and chaperones. Furthermore, a comprehensive interaction network of Brucella vaccines and genes were identified. The asserted and inferred VO hierarchies provide semantic support for inferring novel knowledge of association of vaccines and genes from the retrieved data. New hypotheses were generated based on this analysis approach. Conclusion VO-SciMiner can be used to improve the efficiency for PubMed searching in the vaccine domain.
Collapse
Affiliation(s)
- Junguk Hur
- Bioinformatics Program, University of Michigan, Ann Arbor, MI 48109, USA
| | | | | | | |
Collapse
|