1
|
Lee C, Park YH, Cho B, Lee HA. A network-based approach to explore comorbidity patterns among community-dwelling older adults living alone. GeroScience 2024; 46:2253-2264. [PMID: 37924440 PMCID: PMC10828172 DOI: 10.1007/s11357-023-00987-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 10/14/2023] [Indexed: 11/06/2023] Open
Abstract
The detailed comorbidity patterns of community-dwelling older adults have not yet been explored. This study employed a network-based approach to investigate the comorbidity patterns of community-dwelling older adults living alone. The sample comprised a cross-sectional cohort of adults 65 or older living alone in a Korean city (n = 1041; mean age = 77.7 years, 77.6% women). A comorbidity network analysis that estimates networks aggregated from measures of significant co-occurrence between pairs of diseases was employed to investigate comorbid associations between 31 chronic conditions. A cluster detection algorithm was employed to identify specific clusters of comorbidities. The association strength was expressed as the observed-to-expected ratio (OER). As a result, fifteen diseases were interconnected within the network (OER > 1, p-value < .05). While hypertension had a high prevalence, osteoporosis was the most central disease, co-occurring with numerous other diseases. The strongest associations among comorbidities were found between thyroid disease and urinary incontinence, chronic otitis media and osteoporosis, gastric duodenal ulcer/gastritis and anemia, and depression and gastric duodenal ulcer/gastritis (OER > 1.85). Three distinct clusters were identified as follows: (a) cataracts, osteoporosis, chronic otitis media, osteoarthritis/rheumatism, low back pain/sciatica, urinary incontinence, post-accident sequelae, and thyroid diseases; (b) hyperlipidemia, diabetes mellitus, and hypertension; and (c) depression, skin disease, gastric duodenal ulcer/gastritis, and anemia. The results may prove valuable in guiding the early diagnosis, management, and treatment of comorbidities in older adults living alone.
Collapse
Affiliation(s)
- Chiyoung Lee
- School of Nursing & Health Studies, University of Washington Bothell, 18115 Campus Way NE, Bothell, WA, 98011, USA
| | - Yeon-Hwan Park
- College of Nursing, Seoul National University, 103 Daehak-Ro, Jongno-Gu, Seoul, 03080, Republic of Korea.
- The Research Institute of Nursing Science, College of Nursing, Seoul National University, 103 Daehak-Ro, Jongno-Gu, Seoul, 03080, Republic of Korea.
| | - Belong Cho
- Department of Family Medicine, College of Medicine, Seoul National University, 103 Daehak-Ro, Jongno-Gu, Seoul, 03080, Republic of Korea
- Health Promotion Center, Seoul National University Hospital, 101 Daehak-Ro, Jongno-Gu, Seoul, 03080, Republic of Korea
| | - Hye Ah Lee
- Clinical Trial Center, Ewha Womans University Mokdong Hospital, 1071 Anyangcheon-Ro, Yangcheon-Gu, Seoul, 07985, Republic of Korea
| |
Collapse
|
2
|
Hier DB, Yelugam R, Carrithers MD, Wunsch DC. The visualization of Orphadata neurology phenotypes. Front Digit Health 2023; 5:1064936. [PMID: 36778102 PMCID: PMC9911440 DOI: 10.3389/fdgth.2023.1064936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Accepted: 01/10/2023] [Indexed: 01/28/2023] Open
Abstract
Disease phenotypes are characterized by signs (what a physician observes during the examination of a patient) and symptoms (the complaints of a patient to a physician). Large repositories of disease phenotypes are accessible through the Online Mendelian Inheritance of Man, Human Phenotype Ontology, and Orphadata initiatives. Many of the diseases in these datasets are neurologic. For each repository, the phenotype of neurologic disease is represented as a list of concepts of variable length where the concepts are selected from a restricted ontology. Visualizations of these concept lists are not provided. We address this limitation by using subsumption to reduce the number of descriptive features from 2,946 classes into thirty superclasses. Phenotype feature lists of variable lengths were converted into fixed-length vectors. Phenotype vectors were aggregated into matrices and visualized as heat maps that allowed side-by-side disease comparisons. Individual diseases (representing a row in the matrix) were visualized as word clouds. We illustrate the utility of this approach by visualizing the neuro-phenotypes of 32 dystonic diseases from Orphadata. Subsumption can collapse phenotype features into superclasses, phenotype lists can be vectorized, and phenotypes vectors can be visualized as heat maps and word clouds.
Collapse
Affiliation(s)
- Daniel B Hier
- Applied Computational Intelligence Laboratory, Department of Electrical & Computer Engineering, Missouri University of Science & Technology, Rolla, MO, United States.,Department of Neurology and Rehabilitation, University of Illinois at Chicago, Chicago, IL, United States
| | - Raghu Yelugam
- Applied Computational Intelligence Laboratory, Department of Electrical & Computer Engineering, Missouri University of Science & Technology, Rolla, MO, United States
| | - Michael D Carrithers
- Department of Neurology and Rehabilitation, University of Illinois at Chicago, Chicago, IL, United States
| | - Donald C Wunsch
- National Institute of Diabetes and Digestive and Kidney Diseases, Liver Diseases Branch, Bethesda, MD, United States
| |
Collapse
|
3
|
A comprehensive review of Artificial Intelligence and Network based approaches to drug repurposing in Covid-19. Biomed Pharmacother 2022; 153:113350. [PMID: 35777222 PMCID: PMC9236981 DOI: 10.1016/j.biopha.2022.113350] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 06/22/2022] [Accepted: 06/24/2022] [Indexed: 11/26/2022] Open
Abstract
Conventional drug discovery and development is tedious and time-taking process; because of which it has failed to keep the required pace to mitigate threats and cater demands of viral and re-occurring diseases, such as Covid-19. The main reasons of this delay in traditional drug development are: high attrition rates, extensive time requirements, and huge financial investment with significant risk. The effective solution to de novo drug discovery is drug repurposing. Previous studies have shown that the network-based approaches and analysis are versatile platform for repurposing as the network biology is used to model the interactions between variety of biological concepts. Herein, we provide a comprehensive background of machine learning and deep learning in drug repurposing while specifically focusing on the applications of network-based approach to drug repurposing in Covid-19, data sources, and tools used. Furthermore, use of network proximity, network diffusion, and AI on network-based drug repurposing for Covid-19 is well-explained. Finally, limitations of network-based approaches in general and specific to network are stated along with future recommendations for better network-based models.
Collapse
|
4
|
Wang F, Ding Y, Lei X, Liao B, Wu FX. Identifying Gene Signatures for Cancer Drug Repositioning Based on Sample Clustering. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:953-965. [PMID: 32845842 DOI: 10.1109/tcbb.2020.3019781] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Drug repositioning is an important approach for drug discovery. Computational drug repositioning approaches typically use a gene signature to represent a particular disease and connect the gene signature with drug perturbation profiles. Although disease samples, especially from cancer, may be heterogeneous, most existing methods consider them as a homogeneous set to identify differentially expressed genes (DEGs)for further determining a gene signature. As a result, some genes that should be in a gene signature may be averaged off. In this study, we propose a new framework to identify gene signatures for cancer drug repositioning based on sample clustering (GS4CDRSC). GS4CDRSC first groups samples into several clusters based on their gene expression profiles. Second, an existing method is applied to the samples in each cluster for generating a list of DEGs. Then a weighting approach is used to identify an intergrated gene signature from all the lists of DEGs. The integrated gene signature is used to connect with drug perturbation profiles in the Connectivity Map (CMap)database to generate a list of drug candidates. GS4CDRSC has been tested with several cancer datasets and existing methods. The computational results show that GS4CDRSC outperforms those methods without the sample clustering and weighting approaches in terms of both number and rate of predicted known drugs for specific cancers.
Collapse
|
5
|
Wang F, Ding Y, Lei X, Liao B, Wu FX. Human Protein Complex-Based Drug Signatures for Personalized Cancer Medicine. IEEE J Biomed Health Inform 2021; 25:4079-4088. [PMID: 34665747 DOI: 10.1109/jbhi.2021.3120933] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Disease signature-based drug repositioning approaches typically first identify a disease signature from gene expression profiles of disease samples to represent a particular disease. Then such a disease signature is connected with the drug-induced gene expression profiles to find potential drugs for the particular disease. In order to obtain reliable disease signatures, the size of disease samples should be large enough, which is not always a single case in practice, especially for personalized medicine. On the other hand, the sample sizes of drug-induced gene expression profiles are generally large. In this study, we propose a new drug repositioning approach (HDgS), in which the drug signature is first identified from drug-induced gene expression profiles, and then connected to the gene expression profiles of disease samples to find the potential drugs for patients. In order to take the dependencies among genes into account, the human protein complexes (HPC) are used to define the drug signature. The proposed HDgS is applied to the drug-induced gene expression profiles in LINCS and several types of cancer samples. The results indicate that the HPC-based drug signature can effectively find drug candidates for patients and that the proposed HDgS can be applied for personalized medicine with even one patient sample.
Collapse
|
6
|
Emmert-Streib F. Grand Challenges for Artificial Intelligence in Molecular Medicine. FRONTIERS IN MOLECULAR MEDICINE 2021; 1:734659. [PMID: 39087080 PMCID: PMC11285658 DOI: 10.3389/fmmed.2021.734659] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 07/08/2021] [Indexed: 08/02/2024]
Affiliation(s)
- Frank Emmert-Streib
- Predictive Society and Data Analytics Lab, Faculty of Information Technolgy and Communication Sciences, Tampere University, Tampere, Finland
- Institute of Biosciences and Medical Technology, Tampere, Finland
| |
Collapse
|
7
|
Sonaye HV, Sheikh RY, Doifode CA. Drug repurposing: Iron in the fire for older drugs. Biomed Pharmacother 2021; 141:111638. [PMID: 34153846 DOI: 10.1016/j.biopha.2021.111638] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Revised: 04/19/2021] [Accepted: 04/19/2021] [Indexed: 12/22/2022] Open
Abstract
Repositioning or "repurposing" of existing therapies for indications of alternative disease is an attractive approach that can generate lower costs and require a shorter approval time than developing a de novo drug. The development of experimental drugs is time-consuming, expensive, and limited to a fairly small number of targets. The incorporation of separate and complementary data should be used, as each type of data set exposes a specific feature of organism knowledge Drug repurposing opportunities are often focused on sporadic findings or on time-consuming pre-clinical drug tests which are often not guided by hypothesis. In comparison, repurposing in-silico drugs is a new, hypothesis-driven method that takes advantage of big-data use. Nonetheless, the widespread use of omics technology, enhanced data storage, data sense, machine learning algorithms, and computational modeling all give unparalleled knowledge of the methods of action of biological processes and drugs, providing wide availability, for both disease-related data and drug-related data. This review has taken an in-depth look at the current state, possibilities, and limitations of further progress in the field of drug repositioning.
Collapse
Affiliation(s)
- H V Sonaye
- Shri Sachhidanand Shikshan Santh's Taywade College of Pharmacy, Nagpur 441111, India.
| | - R Y Sheikh
- K.E.M. Hospital Research Centre, Pune 411011, India.
| | - C A Doifode
- Shri Sachhidanand Shikshan Santh's Taywade College of Pharmacy, Nagpur 441111, India.
| |
Collapse
|
8
|
Yin MM, Cui Z, Gao MM, Liu JX, Gao YL. LWPCMF: Logistic Weighted Profile-Based Collaborative Matrix Factorization for Predicting MiRNA-Disease Associations. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:1122-1129. [PMID: 31478868 DOI: 10.1109/tcbb.2019.2937774] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
As is known to all, constructing experiments to predict unknown miRNA-disease association is time-consuming, laborious and costly. Accordingly, new prediction model should be conducted to predict novel miRNA-disease associations. What's more, the performance of this method should be high and reliable. In this paper, a new computation model Logistic Weighted Profile-based Collaborative Matrix Factorization (LWPCMF) is put forward. In this method, weighted profile (WP) is combined with collaborative matrix factorization (CMF) to increase the performance of this model. And, the neighbor information is considered. In addition, logistic function is applied to miRNA functional similarity matrix and disease semantic similarity matrix to extract valuable information. At the same time, by adding WP and logistic function, the known correlation can be protected. And, Gaussian Interaction Profile (GIP) kernels of miRNAs and diseases are added to miRNA functional similarity network and disease semantic similarity network to augment kernel similarities. Then, a five-fold cross validation is implemented to evaluate the predictive ability of this method. Besides, case studies are conducted to view the experimental results. The final result contains not only known associations but also newly predicted ones. And, the result proves that our method is better than other existing methods. This model is able to predict potential miRNA-disease associations.
Collapse
|
9
|
Jafari M, Wang Y, Amiryousefi A, Tang J. Unsupervised Learning and Multipartite Network Models: A Promising Approach for Understanding Traditional Medicine. Front Pharmacol 2020; 11:1319. [PMID: 32982738 PMCID: PMC7479204 DOI: 10.3389/fphar.2020.01319] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Accepted: 08/07/2020] [Indexed: 12/11/2022] Open
Abstract
The ultimate goal of precision medicine is to determine right treatment for right patients based on precise diagnosis. To achieve this goal, correct stratification of patients using molecular features and clinical phenotypes is crucial. During the long history of medical science, our understanding on disease classification has been improved greatly by chemistry and molecular biology. Nowadays, we gain access to large scale patient-derived data by high-throughput technologies, generating a greater need for data science including unsupervised learning and network modeling. Unsupervised learning methods such as clustering could be a better solution to stratify patients when there is a lack of predefined classifiers. In network modularity analysis, clustering methods can be also applied to elucidate the complex structure of biological and disease networks at the systems level. In this review, we went over the main points of clustering analysis and network modeling, particularly in the context of Traditional Chinese medicine (TCM). We showed that this approach can provide novel insights on the rationale of classification for TCM herbs. In a case study, using a modularity analysis of multipartite networks, we illustrated that the TCM classifications are associated with the chemical properties of the herb ingredients. We concluded that multipartite network modeling may become a suitable data integration tool for understanding the mechanisms of actions of traditional medicine.
Collapse
Affiliation(s)
- Mohieddin Jafari
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Yinyin Wang
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Ali Amiryousefi
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| | - Jing Tang
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland
| |
Collapse
|
10
|
Colmenarejo G. Machine Learning Models to Predict Childhood and Adolescent Obesity: A Review. Nutrients 2020; 12:E2466. [PMID: 32824342 PMCID: PMC7469049 DOI: 10.3390/nu12082466] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Revised: 08/09/2020] [Accepted: 08/13/2020] [Indexed: 12/19/2022] Open
Abstract
The prevalence of childhood and adolescence overweight an obesity is raising at an alarming rate in many countries. This poses a serious threat to the current and near-future health systems, given the association of these conditions with different comorbidities (cardiovascular diseases, type II diabetes, and metabolic syndrome) and even death. In order to design appropriate strategies for its prevention, as well as understand its origins, the development of predictive models for childhood/adolescent overweight/obesity and related outcomes is of extreme value. Obesity has a complex etiology, and in the case of childhood and adolescence obesity, this etiology includes also specific factors like (pre)-gestational ones; weaning; and the huge anthropometric, metabolic, and hormonal changes that during this period the body suffers. In this way, Machine Learning models are becoming extremely useful tools in this area, given their excellent predictive power; ability to model complex, nonlinear relationships between variables; and capacity to deal with high-dimensional data typical in this area. This is especially important given the recent appearance of large repositories of Electronic Health Records (EHR) that allow the development of models using datasets with many instances and predictor variables, from which Deep Learning variants can generate extremely accurate predictions. In the current work, the area of Machine Learning models to predict childhood and adolescent obesity and related outcomes is comprehensively and critically reviewed, including the latest ones using Deep Learning with EHR. These models are compared with the traditional statistical ones that used mainly logistic regression. The main features and applications appearing from these models are described, and the future opportunities are discussed.
Collapse
Affiliation(s)
- Gonzalo Colmenarejo
- Biostatistics and Bioinformatics Unit, IMDEA Food, CEI UAM+CSIC, E28049 Madrid, Spain
| |
Collapse
|
11
|
Brunson JC, Agresta TP, Laubenbacher RC. Sensitivity of comorbidity network analysis. JAMIA Open 2020; 3:94-103. [PMID: 32607491 PMCID: PMC7309234 DOI: 10.1093/jamiaopen/ooz067] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Revised: 11/12/2019] [Accepted: 12/10/2019] [Indexed: 01/10/2023] Open
Abstract
OBJECTIVES Comorbidity network analysis (CNA) is a graph-theoretic approach to systems medicine based on associations revealed from disease co-occurrence data. Researchers have used CNA to explore epidemiological patterns, differentiate populations, characterize disorders, and more; but these techniques have not been comprehensively evaluated. Our objectives were to assess the stability of common CNA techniques. MATERIALS AND METHODS We obtained seven co-occurrence data sets, most from previous CNAs, coded using several ontologies. We constructed comorbidity networks under various modeling procedures and calculated summary statistics and centrality rankings. We used regression, ordination, and rank correlation to assess these properties' sensitivity to the source of data and construction parameters. RESULTS Most summary statistics were robust to variation in link determination but somewhere sensitive to the association measure. Some more effectively than others discriminated among networks constructed from different data sets. Centrality rankings, especially among hubs, were somewhat sensitive to link determination and highly sensitive to ontology. As multivariate models incorporated additional effects, comorbid associations among low-prevalence disorders weakened while those between high-prevalence disorders shifted negative. DISCUSSION Pairwise CNA techniques are generally robust, but some analyses are highly sensitive to certain parameters. Multivariate approaches expose additional conceptual and technical limitations to the usual pairwise approach. CONCLUSION We conclude with a set of recommendations we believe will help CNA researchers improve the robustness of results and the potential of follow-up research.
Collapse
Affiliation(s)
- Jason Cory Brunson
- Center for Quantitative Medicine, UConn Health, 263 Farmington Ave, Farmington, Connecticut 06030-6033, USA
| | - Thomas P Agresta
- Center for Quantitative Medicine, UConn Health, 263 Farmington Ave, Farmington, Connecticut 06030-6033, USA
- Department of Family Medicine, UConn Health, 263 Farmington Ave, Farmington, Connecticut 06030-6033, USA
| | - Reinhard C Laubenbacher
- Center for Quantitative Medicine, UConn Health, 263 Farmington Ave, Farmington, Connecticut 06030-6033, USA
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Dr, Farmington, CT 06032, USA
| |
Collapse
|
12
|
Musa A, Tripathi S, Dehmer M, Emmert-Streib F. L1000 Viewer: A Search Engine and Web Interface for the LINCS Data Repository. Front Genet 2019; 10:557. [PMID: 31258549 PMCID: PMC6588157 DOI: 10.3389/fgene.2019.00557] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2019] [Accepted: 05/28/2019] [Indexed: 12/12/2022] Open
Abstract
The LINCS L1000 data repository contains almost two million gene expression profiles for thousands of small molecules and drugs. However, due to the complexity and the size of the data repository and a lack of an interoperable interface, the creation of pharmacologically meaningful workflows utilizing these data is severely hampered. In order to overcome this limitation, we developed the L1000 Viewer, a search engine and graphical web interface for the LINCS data repository. The web interface serves as an interactive platform allowing the user to select different forms of perturbation profiles, e.g., for specific cell lines, drugs, dosages, time points and combinations thereof. At its core, our method has a database we created from inferring and utilizing the intricate dependency graph structure among the data files. The L1000 Viewer is accessible via http://L1000viewer.bio-complexity.com/.
Collapse
Affiliation(s)
- Aliyu Musa
- Predictive Society and Data Analytics Lab, Faculty of Information Technology and Communication Sciences, Tampere University, Tampere, Finland.,Institute of Biosciences and Medical Technology, Tampere, Finland
| | - Shailesh Tripathi
- Predictive Society and Data Analytics Lab, Faculty of Information Technology and Communication Sciences, Tampere University, Tampere, Finland.,Institute for Intelligent Production, Faculty for Management, University of Applied Sciences Upper Austria, Linz, Austria
| | - Matthias Dehmer
- Institute for Intelligent Production, Faculty for Management, University of Applied Sciences Upper Austria, Linz, Austria.,Department of Mechatronics and Biomedical Computer Science, UMIT, Hall in Tyrol, Austria.,College of Computer and Control Engineering, Nankai University, Tianjin, China
| | - Frank Emmert-Streib
- Predictive Society and Data Analytics Lab, Faculty of Information Technology and Communication Sciences, Tampere University, Tampere, Finland.,Institute of Biosciences and Medical Technology, Tampere, Finland
| |
Collapse
|
13
|
Lotfi Shahreza M, Ghadiri N, Mousavi SR, Varshosaz J, Green JR. A review of network-based approaches to drug repositioning. Brief Bioinform 2019; 19:878-892. [PMID: 28334136 DOI: 10.1093/bib/bbx017] [Citation(s) in RCA: 161] [Impact Index Per Article: 32.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Indexed: 01/17/2023] Open
Abstract
Experimental drug development is time-consuming, expensive and limited to a relatively small number of targets. However, recent studies show that repositioning of existing drugs can function more efficiently than de novo experimental drug development to minimize costs and risks. Previous studies have proven that network analysis is a versatile platform for this purpose, as the biological networks are used to model interactions between many different biological concepts. The present study is an attempt to review network-based methods in predicting drug targets for drug repositioning. For each method, the preferred type of data set is described, and their advantages and limitations are discussed. For each method, we seek to provide a brief description, as well as an evaluation based on its performance metrics.We conclude that integrating distinct and complementary data should be used because each type of data set reveals a unique aspect of information about an organism. We also suggest that applying a standard set of evaluation metrics and data sets would be essential in this fast-growing research domain.
Collapse
Affiliation(s)
- Maryam Lotfi Shahreza
- Department of Electrical and Computer Engineering, Isfahan University of Technology, Isfahan, Iran
| | - Nasser Ghadiri
- Department of Electrical and Computer Engineering, Isfahan University of Technology, Isfahan, Iran
| | | | - Jaleh Varshosaz
- Drug Delivery Systems Research Center of Isfahan University of Medical Sciences
| | - James R Green
- Department of Electrical and Computer Engineering, Isfahan University of Technology, Isfahan, Iran
| |
Collapse
|
14
|
Long Noncoding RNA and Protein Interactions: From Experimental Results to Computational Models Based on Network Methods. Int J Mol Sci 2019; 20:ijms20061284. [PMID: 30875752 PMCID: PMC6471543 DOI: 10.3390/ijms20061284] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2019] [Revised: 03/09/2019] [Accepted: 03/11/2019] [Indexed: 01/13/2023] Open
Abstract
Non-coding RNAs with a length of more than 200 nucleotides are long non-coding RNAs (lncRNAs), which have gained tremendous attention in recent decades. Many studies have confirmed that lncRNAs have important influence in post-transcriptional gene regulation; for example, lncRNAs affect the stability and translation of splicing factor proteins. The mutations and malfunctions of lncRNAs are closely related to human disorders. As lncRNAs interact with a variety of proteins, predicting the interaction between lncRNAs and proteins is a significant way to depth exploration functions and enrich annotations of lncRNAs. Experimental approaches for lncRNA–protein interactions are expensive and time-consuming. Computational approaches to predict lncRNA–protein interactions can be grouped into two broad categories. The first category is based on sequence, structural information and physicochemical property. The second category is based on network method through fusing heterogeneous data to construct lncRNA related heterogeneous network. The network-based methods can capture the implicit feature information in the topological structure of related biological heterogeneous networks containing lncRNAs, which is often ignored by sequence-based methods. In this paper, we summarize and discuss the materials, interaction score calculation algorithms, advantages and disadvantages of state-of-the-art algorithms of lncRNA–protein interaction prediction based on network methods to assist researchers in selecting a suitable method for acquiring more dependable results. All the related different network data are also collected and processed in convenience of users, and are available at https://github.com/HAN-Siyu/APINet/.
Collapse
|
15
|
Alaimo S, Pulvirenti A. Network-Based Drug Repositioning: Approaches, Resources, and Research Directions. Methods Mol Biol 2019; 1903:97-113. [PMID: 30547438 DOI: 10.1007/978-1-4939-8955-3_6] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
The wealth of knowledge and omic data available in drug research allowed the rising of several computational methods in drug discovery field yielding a novel and exciting application called drug repositioning. Several computational methods try to make a high-level integration of all the knowledge in order to discover unknown mechanisms. In this chapter we present an in-depth review of data resources and computational models for drug repositioning.
Collapse
Affiliation(s)
- Salvatore Alaimo
- Department of Clinical and Experimental Medicine, University of Catania, Catania, Italy
| | - Alfredo Pulvirenti
- Department of Clinical and Experimental Medicine, University of Catania, Catania, Italy.
| |
Collapse
|
16
|
Wang F, Lei X, Wu FX. A Review of Drug Repositioning Based Chemical-induced Cell Line Expression Data. Curr Med Chem 2018; 27:5340-5350. [PMID: 30381060 DOI: 10.2174/0929867325666181101115801] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2017] [Revised: 08/10/2018] [Accepted: 10/21/2018] [Indexed: 12/14/2022]
Abstract
Drug repositioning is an important area of biomedical research. The drug repositioning studies have shifted to computational approaches. Large-scale perturbation databases, such as the Connectivity Map and the Library of Integrated Network-Based Cellular Signatures, contain a number of chemical-induced gene expression profiles and provide great opportunities for computational biology and drug repositioning. One reason is that the profiles provided by the Connectivity Map and the Library of Integrated Network-Based Cellular Signatures databases show an overall view of biological mechanism in drugs, diseases and genes. In this article, we provide a review of the two databases and their recent applications in drug repositioning.
Collapse
Affiliation(s)
- Fei Wang
- Division of Biomedical Engineering, University of Saskatchewan, Saskatoon, Canada
| | - Xiujuan Lei
- School of Computer Science, Shaanxi Normal University, Xi'an, Shaanxi, China
| | - Fang-Xiang Wu
- School of Computer Science, Shaanxi Normal University, Xi'an, Shaanxi, China
| |
Collapse
|
17
|
Brunson JC, Laubenbacher RC. Applications of network analysis to routinely collected health care data: a systematic review. J Am Med Inform Assoc 2018; 25:210-221. [PMID: 29025116 PMCID: PMC6664849 DOI: 10.1093/jamia/ocx052] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2017] [Revised: 04/18/2017] [Accepted: 04/23/2017] [Indexed: 01/21/2023] Open
Abstract
Objective To survey network analyses of datasets collected in the course of routine operations in health care settings and identify driving questions, methods, needs, and potential for future research. Materials and Methods A search strategy was designed to find studies that applied network analysis to routinely collected health care datasets and was adapted to 3 bibliographic databases. The results were grouped according to a thematic analysis of their settings, objectives, data, and methods. Each group received a methodological synthesis. Results The search found 189 distinct studies reported before August 2016. We manually partitioned the sample into 4 groups, which investigated institutional exchange, physician collaboration, clinical co-occurrence, and workplace interaction networks. Several robust and ongoing research programs were discerned within (and sometimes across) the groups. Little interaction was observed between these programs, despite conceptual and methodological similarities. Discussion We use the literature sample to inform a discussion of good practice at this methodological interface, including the concordance of motivations, study design, data, and tools and the validation and standardization of techniques. We then highlight instances of positive feedback between methodological development and knowledge domains and assess the overall cohesion of the sample.
Collapse
|
18
|
Lotfi Shahreza M, Ghadiri N, Mousavi SR, Varshosaz J, Green JR. Heter-LP: A heterogeneous label propagation algorithm and its application in drug repositioning. J Biomed Inform 2017; 68:167-183. [DOI: 10.1016/j.jbi.2017.03.006] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2016] [Revised: 02/09/2017] [Accepted: 03/10/2017] [Indexed: 12/14/2022]
|
19
|
Large-Scale Discovery of Disease-Disease and Disease-Gene Associations. Sci Rep 2016; 6:32404. [PMID: 27578529 PMCID: PMC5006166 DOI: 10.1038/srep32404] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2016] [Accepted: 08/09/2016] [Indexed: 11/08/2022] Open
Abstract
Data-driven phenotype analyses on Electronic Health Record (EHR) data have recently drawn benefits across many areas of clinical practice, uncovering new links in the medical sciences that can potentially affect the well-being of millions of patients. In this paper, EHR data is used to discover novel relationships between diseases by studying their comorbidities (co-occurrences in patients). A novel embedding model is designed to extract knowledge from disease comorbidities by learning from a large-scale EHR database comprising more than 35 million inpatient cases spanning nearly a decade, revealing significant improvements on disease phenotyping over current computational approaches. In addition, the use of the proposed methodology is extended to discover novel disease-gene associations by including valuable domain knowledge from genome-wide association studies. To evaluate our approach, its effectiveness is compared against a held-out set where, again, it revealed very compelling results. For selected diseases, we further identify candidate gene lists for which disease-gene associations were not studied previously. Thus, our approach provides biomedical researchers with new tools to filter genes of interest, thus, reducing costly lab studies.
Collapse
|
20
|
Liu W, Sun Z, Xie H. The analyses of human inherited disease and tissue-specific proteins in the interaction network. J Biomed Inform 2016; 61:10-8. [PMID: 27012904 DOI: 10.1016/j.jbi.2016.03.012] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2015] [Revised: 03/04/2016] [Accepted: 03/17/2016] [Indexed: 01/17/2023]
Abstract
OBJECTIVES With the announcement of human proteome and interaction data, it becomes possible to comprehensively investigate the tissue-expression and network properties of inherited disease proteins. In this study, our goal was to develop methods to map the disease and expression data and analyze the disorder-tissue associations. METHODS In this paper, we manually classified the human disease proteins into 22 disorder classes and systematically analyzed the properties of disease proteins in different disorder classes. Then, we investigated the similarity of different disorder classes by computing the overlap of different disorder proteins and networks. We proposed two novel measures, Enrichment Ratio and P-value for comparative analysis of disease proteins across tissues and revealed the associations between disorder classes and tissues/cells. RESULTS Compared with non-disease proteins, disease proteins tend to express in more tissues, have higher expression levels and interact with more other proteins in the network. The overlap percentages of networks are much higher than those of proteins, implying that different disorder classes usually influence each other by means of their interacting neighbors. The metabolic, muscular and hematologic proteins are related with most tissues/cells, and cancer proteins are closely associated with the disorders in immune cells. CONCLUSION This paper provided novel methods to investigate proteome-wide disease proteins and their interacting networks in order to understand different disease's associations.
Collapse
Affiliation(s)
- Wei Liu
- College of Mechanical & Electronic Engineering and Automatization, National University of Defense Technology, Changsha 410073, China.
| | - Zhiqiang Sun
- College of Mechanical & Electronic Engineering and Automatization, National University of Defense Technology, Changsha 410073, China
| | - Hongwei Xie
- College of Mechanical & Electronic Engineering and Automatization, National University of Defense Technology, Changsha 410073, China
| |
Collapse
|
21
|
Sharma A, Ghatge M, Mundkur L, Vangala RK. Translational informatics approach for identifying the functional molecular communicators linking coronary artery disease, infection and inflammation. Mol Med Rep 2016; 13:3904-12. [PMID: 27035874 PMCID: PMC4838147 DOI: 10.3892/mmr.2016.5013] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Accepted: 02/18/2016] [Indexed: 12/02/2022] Open
Abstract
Translational informatics approaches are required for the integration of diverse and accumulating data to enable the administration of effective translational medicine specifically in complex diseases such as coronary artery disease (CAD). In the current study, a novel approach for elucidating the association between infection, inflammation and CAD was used. Genes for CAD were collected from the CAD-gene database and those for infection and inflammation were collected from the UniProt database. The cytomegalovirus (CMV)-induced genes were identified from the literature and the CAD-associated clinical phenotypes were obtained from the Unified Medical Language System. A total of 55 gene ontologies (GO) termed functional communicator ontologies were identifed in the gene sets linking clinical phenotypes in the diseasome network. The network topology analysis suggested that important functions including viral entry, cell adhesion, apoptosis, inflammatory and immune responses networked with clinical phenotypes. Microarray data was extracted from the Gene Expression Omnibus (dataset: GSE48060) for highly networked disease myocardial infarction. Further analysis of differentially expressed genes and their GO terms suggested that CMV infection may trigger a xenobiotic response, oxidative stress, inflammation and immune modulation. Notably, the current study identified γ-glutamyl transferase (GGT)-5 as a potential biomarker with an odds ratio of 1.947, which increased to 2.561 following the addition of CMV and CMV-neutralizing antibody (CMV-NA) titers. The C-statistics increased from 0.530 for conventional risk factors (CRFs) to 0.711 for GGT in combination with the above mentioned infections and CRFs. Therefore, the translational informatics approach used in the current study identified a potential molecular mechanism for CMV infection in CAD, and a potential biomarker for risk prediction.
Collapse
Affiliation(s)
- Ankit Sharma
- Proteomics and Coagulation Unit, Thrombosis Research Institute, Bangalore, Karnataka 560099, India
| | - Madankumar Ghatge
- Proteomics and Coagulation Unit, Thrombosis Research Institute, Bangalore, Karnataka 560099, India
| | - Lakshmi Mundkur
- Molecular Immunology Unit, Thrombosis Research Institute, Bangalore, Karnataka 560099, India
| | - Rajani Kanth Vangala
- Proteomics and Coagulation Unit, Thrombosis Research Institute, Bangalore, Karnataka 560099, India
| |
Collapse
|
22
|
Liu W, Wu A, Pellegrini M, Wang X. Integrative analysis of human protein, function and disease networks. Sci Rep 2015; 5:14344. [PMID: 26399914 PMCID: PMC4585831 DOI: 10.1038/srep14344] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2015] [Accepted: 08/26/2015] [Indexed: 12/21/2022] Open
Abstract
Protein-protein interaction (PPI) networks serve as a powerful tool for unraveling protein functions, disease-gene and disease-disease associations. However, a direct strategy for integrating protein interaction, protein function and diseases is still absent. Moreover, the interrelated relationships among these three levels are poorly understood. Here we present a novel systematic method to integrate protein interaction, function, and disease networks. We first identified topological modules in human protein interaction data using the network topological algorithm (NeTA) we previously developed. The resulting modules were then associated with functional terms using Gene Ontology to obtain functional modules. Finally, disease modules were constructed by associating the modules with OMIM and GWAS. We found that most topological modules have cohesive structure, significant pathway annotations and good modularity. Most functional modules (70.6%) fully cover corresponding topological modules, and most disease modules (88.5%) are fully covered by the corresponding functional modules. Furthermore, we identified several protein modules of interest that we describe in detail, which demonstrate the power of our integrative approach. This approach allows us to link genes, and pathways with their corresponding disorders, which may ultimately help us to improve the prevention, diagnosis and treatment of disease.
Collapse
Affiliation(s)
- Wei Liu
- Department of Automation, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Aiping Wu
- Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100080.,Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences &Peking Union Medical College, Beijing 100005.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu 215123, China
| | - Matteo Pellegrini
- Department of Molecular, Cell and Developmental Biology, University of California, Los Angeles, CA, 90055
| | - Xiaofan Wang
- Department of Automation, Shanghai Jiao Tong University, Shanghai, 200240, China
| |
Collapse
|
23
|
Emmert-Streib F, de Matos Simoes R, Tripathi S, Glazko GV, Dehmer M. A Bayesian analysis of the chromosome architecture of human disorders by integrating reductionist data. Sci Rep 2012; 2:513. [PMID: 22822426 PMCID: PMC3400933 DOI: 10.1038/srep00513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2012] [Accepted: 06/27/2012] [Indexed: 11/09/2022] Open
Abstract
In this paper, we present a Bayesian approach to estimate a chromosome and a disorder network from the Online Mendelian Inheritance in Man (OMIM) database. In contrast to other approaches, we obtain statistic rather than deterministic networks enabling a parametric control in the uncertainty of the underlying disorder-disease gene associations contained in the OMIM, on which the networks are based. From a structural investigation of the chromosome network, we identify three chromosome subgroups that reflect architectural differences in chromosome-disorder associations that are predictively exploitable for a functional analysis of diseases.
Collapse
Affiliation(s)
- Frank Emmert-Streib
- Computational Biology and Machine Learning Lab, Center forCancer Research and Cell Biology, School of Medicine, Dentistry and Biomedical Sciences, Queen’s University Belfast, 97 Lisburn Road, Belfast, UK.
| | | | | | | | | |
Collapse
|