1
|
Detection of pan-cancer surface protein biomarkers via a network-based approach on transcriptomics data. Brief Bioinform 2022; 23:6695270. [DOI: 10.1093/bib/bbac400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Revised: 07/28/2022] [Accepted: 08/17/2022] [Indexed: 11/13/2022] Open
Abstract
Abstract
Cell surface proteins have been used as diagnostic and prognostic markers in cancer research and as targets for the development of anticancer agents. Many of these proteins lie at the top of signaling cascades regulating cell responses and gene expression, therefore acting as ‘signaling hubs’. It has been previously demonstrated that the integrated network analysis on transcriptomic data is able to infer cell surface protein activity in breast cancer. Such an approach has been implemented in a publicly available method called ‘SURFACER’. SURFACER implements a network-based analysis of transcriptomic data focusing on the overall activity of curated surface proteins, with the final aim to identify those proteins driving major phenotypic changes at a network level, named surface signaling hubs. Here, we show the ability of SURFACER to discover relevant knowledge within and across cancer datasets. We also show how different cancers can be stratified in surface-activity-specific groups. Our strategy may identify cancer-wide markers to design targeted therapies and biomarker-based diagnostic approaches.
Collapse
|
2
|
Network for network concept offers new insights into host- SARS-CoV-2 protein interactions and potential novel targets for developing antiviral drugs. Comput Biol Med 2022; 146:105575. [PMID: 35533462 PMCID: PMC9055686 DOI: 10.1016/j.compbiomed.2022.105575] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2021] [Revised: 04/16/2022] [Accepted: 04/27/2022] [Indexed: 01/08/2023]
Abstract
SARS-CoV-2, the causal agent of COVID-19, is primarily a pulmonary virus that can directly or indirectly infect several organs. Despite many studies carried out during the current COVID-19 pandemic, some pathological features of SARS-CoV-2 have remained unclear. It has been recently attempted to address the current knowledge gaps on the viral pathogenicity and pathological mechanisms via cellular-level tropism of SARS-CoV-2 using human proteomics, visualization of virus-host protein-protein interactions (PPIs), and enrichment analysis of experimental results. The synergistic use of models and methods that rely on graph theory has enabled the visualization and analysis of the molecular context of virus/host PPIs. We review current knowledge on the SARS-COV-2/host interactome cascade involved in the viral pathogenicity through the graph theory concept and highlight the hub proteins in the intra-viral network that create a subnet with a small number of host central proteins, leading to cell disintegration and infectivity. Then we discuss the putative principle of the "gene-for-gene and "network for network" concepts as platforms for future directions toward designing efficient anti-viral therapies.
Collapse
|
3
|
A novel algorithm for finding top-k weighted overlapping densest connected subgraphs in dual networks. APPLIED NETWORK SCIENCE 2021; 6:40. [PMID: 34124340 PMCID: PMC8179714 DOI: 10.1007/s41109-021-00381-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Accepted: 05/20/2021] [Indexed: 06/12/2023]
Abstract
The use of networks for modelling and analysing relations among data is currently growing. Recently, the use of a single networks for capturing all the aspects of some complex scenarios has shown some limitations. Consequently, it has been proposed to use Dual Networks (DN), a pair of related networks, to analyse complex systems. The two graphs in a DN have the same set of vertices and different edge sets. Common subgraphs among these networks may convey some insights about the modelled scenarios. For instance, the detection of the Top-k Densest Connected subgraphs, i.e. a set k subgraphs having the largest density in the conceptual network which are also connected in the physical network, may reveal set of highly related nodes. After proposing a formalisation of the approach, we propose a heuristic to find a solution, since the problem is computationally hard. A set of experiments on synthetic and real networks is also presented to support our approach.
Collapse
|
4
|
Data science in unveiling COVID-19 pathogenesis and diagnosis: evolutionary origin to drug repurposing. Brief Bioinform 2021; 22:855-872. [PMID: 33592108 PMCID: PMC7929414 DOI: 10.1093/bib/bbaa420] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Revised: 11/09/2020] [Accepted: 12/19/2020] [Indexed: 12/20/2022] Open
Abstract
MOTIVATION The outbreak of novel severe acute respiratory syndrome coronavirus (SARS-CoV-2, also known as COVID-19) in Wuhan has attracted worldwide attention. SARS-CoV-2 causes severe inflammation, which can be fatal. Consequently, there has been a massive and rapid growth in research aimed at throwing light on the mechanisms of infection and the progression of the disease. With regard to this data science is playing a pivotal role in in silico analysis to gain insights into SARS-CoV-2 and the outbreak of COVID-19 in order to forecast, diagnose and come up with a drug to tackle the virus. The availability of large multiomics, radiological, bio-molecular and medical datasets requires the development of novel exploratory and predictive models, or the customisation of existing ones in order to fit the current problem. The high number of approaches generates the need for surveys to guide data scientists and medical practitioners in selecting the right tools to manage their clinical data. RESULTS Focusing on data science methodologies, we conduct a detailed study on the state-of-the-art of works tackling the current pandemic scenario. We consider various current COVID-19 data analytic domains such as phylogenetic analysis, SARS-CoV-2 genome identification, protein structure prediction, host-viral protein interactomics, clinical imaging, epidemiological research and drug discovery. We highlight data types and instances, their generation pipelines and the data science models currently in use. The current study should give a detailed sketch of the road map towards handling COVID-19 like situations by leveraging data science experts in choosing the right tools. We also summarise our review focusing on prime challenges and possible future research directions. CONTACT hguzzi@unicz.it, sroy01@cus.ac.in.
Collapse
|
5
|
Regional Resource Assessment During the COVID-19 Pandemic in Italy: Modeling Study. JMIR Med Inform 2021; 9:e18933. [PMID: 33629957 PMCID: PMC7945976 DOI: 10.2196/18933] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 06/05/2020] [Accepted: 01/17/2021] [Indexed: 01/13/2023] Open
Abstract
BACKGROUND COVID-19 has been declared a worldwide emergency and a pandemic by the World Health Organization. It started in China in December 2019, and it rapidly spread throughout Italy, which was the most affected country after China. The pandemic affected all countries with similarly negative effects on the population and health care structures. OBJECTIVE The evolution of the COVID-19 infections and the way such a phenomenon can be characterized in terms of resources and planning has to be considered. One of the most critical resources has been intensive care units (ICUs) with respect to the infection trend and critical hospitalization. METHODS We propose a model to estimate the needed number of places in ICUs during the most acute phase of the infection. We also define a scalable geographic model to plan emergency and future management of patients with COVID-19 by planning their reallocation in health structures of other regions. RESULTS We applied and assessed the prediction method both at the national and regional levels. ICU bed prediction was tested with respect to real data provided by the Italian government. We showed that our model is able to predict, with a reliable error in terms of resource complexity, estimation parameters used in health care structures. In addition, the proposed method is scalable at different geographic levels. This is relevant for pandemics such as COVID-19, which has shown different case incidences even among northern and southern Italian regions. CONCLUSIONS Our contribution can be useful for decision makers to plan resources to guarantee patient management, but it can also be considered as a reference model for potential upcoming waves of COVID-19 and similar emergency situations.
Collapse
|
6
|
Predicting the response of the dental pulp to SARS-CoV2 infection: a transcriptome-wide effect cross-analysis. Genes Immun 2020; 21:360-363. [PMID: 33011745 PMCID: PMC7532735 DOI: 10.1038/s41435-020-00112-6] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Revised: 08/29/2020] [Accepted: 09/21/2020] [Indexed: 12/15/2022]
Abstract
Pulpitis, inflammation of the dental pulp, is a disease that often necessitates emergency dental care. While pulpitis is considered to be a microbial disease primarily caused by bacteria, viruses have also been implicated in its pathogenesis. Here, we determined the expression of the SARS-CoV2 receptor, angiotensin converting enzyme 2 (ACE2) and its associated cellular serine protease TPMRSS2 in the dental pulp under normal and inflamed conditions. Next, we explored the relationship between the SARS-CoV-2/human interactome and genes expressed in pulpitis. Using existing datasets we show that both ACE2 and TPMRSS2 are expressed in the dental pulp and, that their expression does not change under conditions of inflammation. Furthermore, Master Regulator Analysis of the SARS-CoV2/human interactome identified 75 relevant genes whose expression values are either up-regulated or down-regulated in both the human interactome and pulpitis. Our results suggest that the dental pulp is vulnerable to SARS-CoV2 infection and that SARS-CoV-2 infection of the dental pulp may contribute to worse outcomes of pulpitis.
Collapse
|
7
|
Master Regulator Analysis of the SARS-CoV-2/Human Interactome. J Clin Med 2020; 9:E982. [PMID: 32244779 PMCID: PMC7230814 DOI: 10.3390/jcm9040982] [Citation(s) in RCA: 123] [Impact Index Per Article: 30.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Revised: 03/27/2020] [Accepted: 03/28/2020] [Indexed: 12/20/2022] Open
Abstract
The recent epidemic outbreak of a novel human coronavirus called SARS-CoV-2 causing the respiratory tract disease COVID-19 has reached worldwide resonance and a global effort is being undertaken to characterize the molecular features and evolutionary origins of this virus. In this paper, we set out to shed light on the SARS-CoV-2/host receptor recognition, a crucial factor for successful virus infection. Based on the current knowledge of the interactome between SARS-CoV-2 and host cell proteins, we performed Master Regulator Analysis to detect which parts of the human interactome are most affected by the infection. We detected, amongst others, affected apoptotic and mitochondrial mechanisms, and a downregulation of the ACE2 protein receptor, notions that can be used to develop specific therapies against this new virus.
Collapse
|
8
|
DIETOS: A dietary recommender system for chronic diseases monitoring and management. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2018; 153:93-104. [PMID: 29157465 DOI: 10.1016/j.cmpb.2017.10.014] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2016] [Revised: 10/02/2017] [Accepted: 10/10/2017] [Indexed: 06/07/2023]
Abstract
BACKGROUND AND OBJECTIVE Use of mobile and web-based applications for diet and weight management is currently increasing. However, the impact of known apps on clinical outcomes is not well-characterized so far. Moreover, availability of food recommender systems providing high quality nutritional advices to both healthy and diet-related chronic diseases users is very limited. In addition, the potentiality of nutraceutical properties of typical regional foods for improving app utility has not been exerted to this end. We present DIETOS, a recommender system for the adaptive delivery of nutrition contents to improve the quality of life of both healthy subjects and patients with diet-related chronic diseases. DIETOS provides highly specialized nutritional advices in different health conditions. METHODS DIETOS was projected to provide users with health profile and individual nutritional recommendation. Health profiling was based on user answers to dynamic real-time medical questionnaires. Furthermore, DIETOS contains catalogs of typical foods from Calabria, a southern Italian region. Several Calabrian foods have been inserted because of their nutraceutical properties widely reported in several quality studies. DIETOS includes some well known methods for user profiling (overlay profiling) and content adaptation (content selection) coming from general purpose adaptive web systems. RESULTS DIETOS has been validated for usability for both patients and specialists and for assessing the correctness of the profiling and recommendation, by enrolling 20 chronic kidney disease (CKD) patients at the Department of Nephrology and Dialysis, University Hospital, Catanzaro (Italy) and 20 age-matched healthy controls. Recruited subjects were invited to register to DIETOS and answer to medical questions to determine their health status. Based on our results, DIETOS has high specificity and sensitivity, allowing to determine a medical-controlled user's health profile and to perform a fine-grained recommendation that is better adapted to each user health status. The current version of DIETOS, available online at http://www.easyanalysis.it/dietos is not intended to be used by general users, but only for review purpose. CONCLUSIONS DIETOS is a novel food recommender system for healthy people and individuals affected by diet-related chronic diseases. The proposed system builds a users health profile and, accordingly, provides individualized nutritional recommendations, also with attention to food geographical origin.
Collapse
|
9
|
|
10
|
Abstract
Electronic medical records (EMRs) store data related to patients information enrolled during their stay in health structures. Data stored into EMRs span from data crawled from biological laboratories to textual description of diseases and diagnostic device results (e.g., biomedical images). Each EMR is related to a diagnosis related group (DRG) record. A DRG record is a record associated with a citizen that has been cured in a hospital. It contains a code, called major diagnostic category (MDC), which summarizes the treated disease and allows to reimburse costs related to patient treatments during his staying in health structures. DRGs are used for administrative process (e.g., costs and reimbursement management) as well as disease monitoring. Associating diagnostic codes with external information (such as environmental and geographical data) and with information filtered from EMRs (e.g., biological results or analytes values) can be useful to monitor citizens wellness status. We propose a methodology to analyze such data based on a multistep process. First, we cross reference data by using a semantics-based clustering procedure, extract information from EMRs, and then, cluster them by looking for similar patterns of diseases. Then, biological records in each disease cluster are analyzed to evaluate intracluster similarity by selecting analytes typologies and values. Finally, biological data is related to diagnosis codes and geometrically projected in areas of interest in order to map calculated outlier patients. We applied the methodology on two case studies: 1) diagnosis codes and biochemical analytes of 20 000 biological analyses about hospitalized patients during one observation year and 2) the correlation between cardiovascular diseases and water quality in a southern Italian region. Preliminary findings show the effectiveness of our method.
Collapse
|
11
|
Methodologies and experimental platforms for generating and analysing microarray and mass spectrometry-based omics data to support P4 medicine. Brief Bioinform 2015; 17:553-61. [DOI: 10.1093/bib/bbv076] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2015] [Indexed: 11/13/2022] Open
|
12
|
Abstract
This paper presents the design and implementation of a system for digital telecardiology on mobile devices called Remote Cardio Consultation (RCC). Using RCC may improve first intervention procedures in case of heart attack. In fact, it allows physicians to remotely consult ECG signals from a mobile device or smartphone by using a so-called app. The remote consultation is implemented by a server application collecting physician availability to answer upon client support requests. The app can be used by first intervention clinicians and allows reducing delays and decision errors in emergency interventions. Thus, best decision, certified and supported by cardiologists, can be obtained in case of heart attacks and first interventions even by base medical doctors able to produce and send an ECG. RCC tests have been performed, and the prototype is freely available as a service for testing.
Collapse
|
13
|
DMET-Miner: Efficient discovery of association rules from pharmacogenomic data. J Biomed Inform 2015; 56:273-83. [PMID: 26092773 DOI: 10.1016/j.jbi.2015.06.005] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2014] [Revised: 05/09/2015] [Accepted: 06/03/2015] [Indexed: 01/06/2023]
Abstract
Microarray platforms enable the investigation of allelic variants that may be correlated to phenotypes. Among those, the Affymetrix DMET (Drug Metabolism Enzymes and Transporters) platform enables the simultaneous investigation of all the genes that are related to drug absorption, distribution, metabolism and excretion (ADME). Although recent studies demonstrated the effectiveness of the use of DMET data for studying drug response or toxicity in clinical studies, there is a lack of tools for the automatic analysis of DMET data. In a previous work we developed DMET-Analyzer, a methodology and a supporting platform able to automatize the statistical study of allelic variants, that has been validated in several clinical studies. Although DMET-Analyzer is able to correlate a single variant for each probe (related to a portion of a gene) through the use of the Fisher test, it is unable to discover multiple associations among allelic variants, due to its underlying statistic analysis strategy that focuses on a single variant for each time. To overcome those limitations, here we propose a new analysis methodology for DMET data based on Association Rules mining, and an efficient implementation of this methodology, named DMET-Miner. DMET-Miner extends the DMET-Analyzer tool with data mining capabilities and correlates the presence of a set of allelic variants with the conditions of patient's samples by exploiting association rules. To face the high number of frequent itemsets generated when considering large clinical studies based on DMET data, DMET-Miner uses an efficient data structure and implements an optimized search strategy that reduces the search space and the execution time. Preliminary experiments on synthetic DMET datasets, show how DMET-Miner outperforms off-the-shelf data mining suites such as the FP-Growth algorithms available in Weka and RapidMiner. To demonstrate the biological relevance of the extracted association rules and the effectiveness of the proposed approach from a medical point of view, some preliminary studies on a real clinical dataset are currently under medical investigation.
Collapse
|
14
|
Analysis of miRNA, mRNA, and TF interactions through network-based methods. EURASIP JOURNAL ON BIOINFORMATICS & SYSTEMS BIOLOGY 2015; 2015:4. [PMID: 28194173 PMCID: PMC5270379 DOI: 10.1186/s13637-015-0023-8] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/14/2015] [Accepted: 05/18/2015] [Indexed: 12/29/2022]
Abstract
Recent findings have elucidated that the regulation of messenger RNA (mRNA) levels is due to the synergistic and antagonist actions of transcription factors (TFs) and microRNAs (miRNAs). Mutual interactions among these molecules are easily modeled and analyzed using graphs whose nodes are molecules, and directed edges represent the associations among them. In particular, small subgraphs having three nodes also referred to as feed-forward loops (FFLs) or regulatory loops play a crucial role in many different diseases, such as cancer. Available technological platforms enable the investigation of only a single aspect of these mechanisms, e.g., the quantification of levels of mRNA or miRNA. Consequently, there exist different data sources for investigating some aspects of this problem, e.g., miRNA-mRNA or TF-mRNA associations. The comprehensive analysis is made possible only by the integration and the analysis of these data sources. Currently, the interest of researchers in this area is growing, the number of projects is increasing, and the number of challenges and issues for computer scientists is considerable. The need for an introductive survey from a computer science point of view consequently arises. This survey starts by discussing general concepts related to production of data. Then, main existing approaches of analysis are presented and discussed. Future improvements and challenges are also discussed.
Collapse
|
15
|
Serum adipokine levels in overweight patients and their relationship with non-alcoholic fatty liver disease. Panminerva Med 2014; 56:189-193. [PMID: 24994581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
AIM Non-alcoholic fatty liver disease (NAFLD) is a relevant public health matter in Western countries. The pathogenetic link between visceral fat, insulin resistance (IR) and NAFLD has been reported in literature. However, there are contradictions on the changes of adipokine levels in serum related to the presence of NAFLD. The aim of the present study was to evaluate the serum concentrations of a selected set of adipokines, that is, adiponectin, leptin, resistin and the pro-inflammatory cytokine interleukin-6 (IL-6) in overweight patients, and to clarify their relationship with NAFLD. METHODS Fasting serum levels of adipokines were determined in 42 consecutive overweight patients and in 25 lean controls. The degree of ultrasound (US) liver steatosis was graded according to the Hamaguchi score. RESULTS Liver steatosis was detected in 33 patients (78%) by US examination. Twelve patients with elevated transaminases levels showed significantly higher values of IR, leptin and resistin levels (P<0.05). Patients with steatosis presented a significantly higher leptin and a lower adiponectin levels (P<0.05) than controls. A significant inverse correlation was found between US steatosis progression and adiponectin and resistin levels (p<0.05). Considering the multiple logistic regression, adiponectin and leptin were good predictors to detect the presence of steatosis (p<0.05). CONCLUSION Our data support the concept that adipokine level changes are closely linked with IR. In addition, serum adiponectin and leptin levels may be used as diagnostic markers to determine the presence of NAFLD in overweight patients.
Collapse
|
16
|
Tractography in amyotrophic lateral sclerosis using a novel probabilistic tool: a study with tract-based reconstruction compared to voxel-based approach. J Neurosci Methods 2014; 224:79-87. [PMID: 24406465 DOI: 10.1016/j.jneumeth.2013.12.014] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2013] [Revised: 11/19/2013] [Accepted: 12/30/2013] [Indexed: 12/11/2022]
Abstract
BACKGROUND Diffusion tensor imaging (DTI) is one of the most sensitive MRI tools for detecting subtle cerebral white matter abnormalities in amyotrophic lateral sclerosis (ALS). Nowadays a plethora of DTI tools have been proposed, but very few methods have been translated into clinical practice. NEW METHOD The aim of this study is to validate the objective measurement of fiber tracts as provided by a new unbiased and automated tractography reconstruction tool named as TRActs Constrained by UnderLying Anatomy (TRACULA). The reliability of this tract-based approach was evaluated on a dataset of 14 patients with definite ALS compared with 14 age/sex-matched healthy controls. To further corroborate these measurements, we used a well-known voxelwise approach, called tract-based spatial statistics (TBSS), on the same dataset. RESULTS TRACULA showed specific significant alterations of several DTI parameters in the corticospinal tract of the ALS group with respect to controls. COMPARISON WITH EXISTING METHOD The same finding was detected using the well-known TBSS analysis. Similarly, both methods depicted also additional microstructural changes in the cingulum. CONCLUSIONS DTI tractography metrics provided by TRACULA perfectly agree with those previously reported in several post-mortem and DTI studies, thus demonstrating the accuracy of this method in characterizing the microstructural changes occurring in ALS. With further validation (i.e. considering the heterogeneity of other clinical phenotypes), this method has the potential to become useful for clinical practice providing objective measurements that might aid radiologists in the interpretation of MR images and improve diagnostic accuracy of ALS.
Collapse
|
17
|
M-Finder: Uncovering functionally associated proteins from interactome data integrated with GO annotations. Proteome Sci 2013; 11:S3. [PMID: 24565382 PMCID: PMC3909039 DOI: 10.1186/1477-5956-11-s1-s3] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Protein-protein interactions (PPIs) play a key role in understanding the mechanisms of cellular processes. The availability of interactome data has catalyzed the development of computational approaches to elucidate functional behaviors of proteins on a system level. Gene Ontology (GO) and its annotations are a significant resource for functional characterization of proteins. Because of wide coverage, GO data have often been adopted as a benchmark for protein function prediction on the genomic scale. RESULTS We propose a computational approach, called M-Finder, for functional association pattern mining. This method employs semantic analytics to integrate the genome-wide PPIs with GO data. We also introduce an interactive web application tool that visualizes a functional association network linked to a protein specified by a user. The proposed approach comprises two major components. First, the PPIs that have been generated by high-throughput methods are weighted in terms of their functional consistency using GO and its annotations. We assess two advanced semantic similarity metrics which quantify the functional association level of each interacting protein pair. We demonstrate that these measures outperform the other existing methods by evaluating their agreement to other biological features, such as sequence similarity, the presence of common Pfam domains, and core PPIs. Second, the information flow-based algorithm is employed to discover a set of proteins functionally associated with the protein in a query and their links efficiently. This algorithm reconstructs a functional association network of the query protein. The output network size can be flexibly determined by parameters. CONCLUSIONS M-Finder provides a useful framework to investigate functional association patterns with any protein. This software will also allow users to perform further systematic analysis of a set of proteins for any specific function. It is available online at http://bionet.ecs.baylor.edu/mfinder.
Collapse
|
18
|
|
19
|
J-TM Align: Efficient Comparison of Protein Structure Based on TMAlign. Curr Bioinform 2013. [DOI: 10.2174/1574893611308020010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
20
|
AlignNemo: a local network alignment method to integrate homology and topology. PLoS One 2012; 7:e38107. [PMID: 22719866 PMCID: PMC3373574 DOI: 10.1371/journal.pone.0038107] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2011] [Accepted: 05/02/2012] [Indexed: 11/19/2022] Open
Abstract
Local network alignment is an important component of the analysis of protein-protein interaction networks that may lead to the identification of evolutionary related complexes. We present AlignNemo, a new algorithm that, given the networks of two organisms, uncovers subnetworks of proteins that relate in biological function and topology of interactions. The discovered conserved subnetworks have a general topology and need not to correspond to specific interaction patterns, so that they more closely fit the models of functional complexes proposed in the literature. The algorithm is able to handle sparse interaction data with an expansion process that at each step explores the local topology of the networks beyond the proteins directly interacting with the current solution. To assess the performance of AlignNemo, we ran a series of benchmarks using statistical measures as well as biological knowledge. Based on reference datasets of protein complexes, AlignNemo shows better performance than other methods in terms of both precision and recall. We show our solutions to be biologically sound using the concept of semantic similarity applied to Gene Ontology vocabularies. The binaries of AlignNemo and supplementary details about the algorithms and the experiments are available at: sourceforge.net/p/alignnemo.
Collapse
|
21
|
High sensitive troponin T in individuals with chest pain of presumed ischemic origin. Front Biosci (Elite Ed) 2012; 4:2322-2327. [PMID: 22652639 DOI: 10.2741/e544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
This study was aimed at assessing the bias of high sensitive cardiac troponin T vs. the standard cardiac troponin T in a selected population with chest pain of presumed cardiac origin. Serum cTnT was determined in 132 patients and in 106 apparently healthy controls by both assays. The hs-cTnT outperformed the standard generation assay by: i) allowing a larger and earlier diagnosis of AMI (74.2 percent vs. 64.3 percent patients resulted positive at the final diagnosis of AMI when tested with the hs-cTnT or the std-cTnT assay, respectively); ii) showing a better time-dependent dynamics in patients with AMI due to a higher precision at low concentrations; iii) identifying, within the controls, 6 subjects in whom a further examination revealed the presence of chronic asymptomatic cardiac ischemia. The results underscore the excellent performance of the hs-cTnT assay in our population. The use of this test can thus be strongly recommended in subjects presenting to the emergency unit with chest pain of presumed ischemic origin in order to increase the probability of earlier diagnosis of AMI, especially in non-STEMI.
Collapse
|
22
|
Semantic similarity analysis of protein data: assessment with biological features and issues. Brief Bioinform 2011; 13:569-85. [PMID: 22138322 DOI: 10.1093/bib/bbr066] [Citation(s) in RCA: 164] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
The integration of proteomics data with biological knowledge is a recent trend in bioinformatics. A lot of biological information is available and is spread on different sources and encoded in different ontologies (e.g. Gene Ontology). Annotating existing protein data with biological information may enable the use (and the development) of algorithms that use biological ontologies as framework to mine annotated data. Recently many methodologies and algorithms that use ontologies to extract knowledge from data, as well as to analyse ontologies themselves have been proposed and applied to other fields. Conversely, the use of such annotations for the analysis of protein data is a relatively novel research area that is currently becoming more and more central in research. Existing approaches span from the definition of the similarity among genes and proteins on the basis of the annotating terms, to the definition of novel algorithms that use such similarities for mining protein data on a proteome-wide scale. This work, after the definition of main concept of such analysis, presents a systematic discussion and comparison of main approaches. Finally, remaining challenges, as well as possible future directions of research are presented.
Collapse
|
23
|
A large set of estrogen receptor β-interacting proteins identified by tandem affinity purification in hormone-responsive human breast cancer cell nuclei. Proteomics Clin Appl 2011. [DOI: 10.1002/prca.201190045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
24
|
A peroxisome proliferator-activated receptor gamma (PPARG) polymorphism is associated with zoledronic acid-related osteonecrosis of the jaw in multiple myeloma patients: analysis by DMET microarray profiling. Br J Haematol 2011; 154:529-33. [PMID: 21517810 PMCID: PMC3638364 DOI: 10.1111/j.1365-2141.2011.08622.x] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
25
|
Comparative analysis of nuclear estrogen receptor alpha and beta interactomes in breast cancer cells. ACTA ACUST UNITED AC 2011; 7:667-76. [DOI: 10.1039/c0mb00145g] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
|
26
|
A large set of estrogen receptor β-interacting proteins identified by tandem affinity purification in hormone-responsive human breast cancer cell nuclei. Proteomics 2010; 11:159-65. [DOI: 10.1002/pmic.201000344] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2010] [Revised: 08/04/2010] [Accepted: 09/27/2010] [Indexed: 12/17/2022]
|
27
|
mu-CS: an extension of the TM4 platform to manage Affymetrix binary data. BMC Bioinformatics 2010; 11:315. [PMID: 20537149 PMCID: PMC2907348 DOI: 10.1186/1471-2105-11-315] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2009] [Accepted: 06/10/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND A main goal in understanding cell mechanisms is to explain the relationship among genes and related molecular processes through the combined use of technological platforms and bioinformatics analysis. High throughput platforms, such as microarrays, enable the investigation of the whole genome in a single experiment. There exist different kind of microarray platforms, that produce different types of binary data (images and raw data). Moreover, also considering a single vendor, different chips are available. The analysis of microarray data requires an initial preprocessing phase (i.e. normalization and summarization) of raw data that makes them suitable for use on existing platforms, such as the TIGR M4 Suite. Nevertheless, the annotations of data with additional information such as gene function, is needed to perform more powerful analysis. Raw data preprocessing and annotation is often performed in a manual and error prone way. Moreover, many available preprocessing tools do not support annotation. Thus novel, platform independent, and possibly open source tools enabling the semi-automatic preprocessing and annotation of microarray data are needed. RESULTS The paper presents mu-CS (Microarray Cel file Summarizer), a cross-platform tool for the automatic normalization, summarization and annotation of Affymetrix binary data. mu-CS is based on a client-server architecture. The mu-CS client is provided both as a plug-in of the TIGR M4 platform and as a Java standalone tool and enables users to read, preprocess and analyse binary microarray data, avoiding the manual invocation of external tools (e.g. the Affymetrix Power Tools), the manual loading of preprocessing libraries, and the management of intermediate files. The mu-CS server automatically updates the references to the summarization and annotation libraries that are provided to the mu-CS client before the preprocessing. The mu-CS server is based on the web services technology and can be easily extended to support more microarray vendors (e.g. Illumina). CONCLUSIONS Thus mu-CS users can directly manage binary data without worrying about locating and invoking the proper preprocessing tools and chip-specific libraries. Moreover, users of the mu-CS plugin for TM4 can manage Affymetrix binary files without using external tools, such as APT (Affymetrix Power Tools) and related libraries. Consequently, mu-CS offers four main advantages: (i) it avoids to waste time for searching the correct libraries, (ii) it reduces possible errors in the preprocessing and further analysis phases, e.g. due to the incorrect choice of parameters or the use of old libraries, (iii) it implements the annotation of preprocessed data, and finally, (iv) it may enhance the quality of further analysis since it provides the most updated annotation libraries. The mu-CS client is freely available as a plugin of the TM4 platform as well as a standalone application at the project web site (http://bioingegneria.unicz.it/M-CS).
Collapse
|
28
|
Differential transcriptional response to cisplatinum in BRCA1-defective versus BRCA1-reconstituted breast cancer cells by microarrays. Cancer Res 2009. [DOI: 10.1158/0008-5472.sabcs-5062] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Abstract
Abstract #5062
BRCA1-related breast cancer represents a distinct clinical entity characterized by basal-like molecular phenotype. HCC1937 is a BRCA1-defective breast cancer cell line which discloses higher sensitivity to cisplatinum (CDDP) as compared to the BRCA1 full-length cDNA transfected clone HCC1937/wtBRCA1. To study the molecular bases of BRCA1-related differential sensitivity to the drug, we have analyzed the whole gene expression profile of HCC1937 and HCC1937/wtBRCA1 cells following in vivo and in vitro exposure of tumour cells to CDDP.
 Differential in vivo sensitivity was evaluated in xenografted tumors in SCID mice after 4 weeks of treatment with CDDP (5 mg/kg) or vehicle. Tumors were retrieved from animals 4 hours after the last dose of drug or vehicle. In vitro data were obtained exposing HCC1937 and HCC1937/wtBRCA1 cell lines to CDDP at IC50 doses of 30 and 70µM, or at 60 or 140µM respectively, for 3, 12 and 24 hours. Gene expression profiling was performed by Affymetrix® technology using GeneArray 1.0ST. Array data were analyzed using Gene Expression Console, GeneSpring software and Ingenuity Pathway Analysis (IPA)®.
 We found up-regulation of G2/M DNA damage checkpoint pathway, in a dose- and time-dependent manner, as well as down-regulation of Notch signaling pathway in HCC1937 cells, after treatment with CDDP in vitro. Both pathways were not affected in BRCA1-reconstituted cells. Notch signaling pathway plays a key role in normal development of several tissues and its aberrant activation is linked with breast cancer tumourigenesis. Our findings suggest that sensitivity of the BRCA1-defective cells to CDDP exposure is related to the DNA-damage repair machinery due to BRCA1-deficiency but also to the modulation of survival pathways triggered by BRCA1. In vivo data analysis of HCC1937/wtBRCA1 tumours exposed to CDDP identified modulation of pathways involving AKT3 and PRKAR2B, also known as RII-BETA, a cAMP-dependent protein kinase, whereas no Notch signaling pathway modulation was detected.
 In conclusion, reconstituted expression of BRCA1 induces resistance to CDDP in vitro and in vivo but produces a different transcriptional response to drug exposure. This effect could be related to the different growth and environmental milieu as well as to the different exposure conditions. Signaling pathways involving Notch, AKT and RII-BETA are involved in these events. Our data reinforce the concept that BRCA1 offers a molecular marker for individualized therapeutical options in breast cancer patients.
Citation Information: Cancer Res 2009;69(2 Suppl):Abstract nr 5062.
Collapse
|
29
|
A grid-based protein complex predictor. Stud Health Technol Inform 2008; 138:116-124. [PMID: 18560113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Proteins interact among them and different interactions form a very huge number of possible combinations representable as protein to protein interaction (PPI) networks that are mapped into graph structures. The interest in analyzing PPI networks is related to the possibility of predicting PPI properties, starting from a set of known proteins interacting among each other. For example, predicting the configuration of a subset of nodes in a graph (representing a PPI network), allows to study the generation of protein complexes. Nevertheless, due to the huge number of possible configurations of protein interactions, automatic based computation tools are required. Available prediction tools are able to analyze and predict possible combinations of proteins in a PPI network which have biological meanings. Once obtained, the protein interactions are analyzed with respect to biological meanings representing quality measures. Nevertheless, such tools strictly depend on input configuration and require biological validation. In this paper we propose a new grid-based prediction tool that integrate of different prediction results.
Collapse
|