1
|
Ajani SN, Mulla RA, Limkar S, Ashtagi R, Wagh SK, Pawar ME. RETRACTED ARTICLE: DLMBHCO: design of an augmented bioinspired deep learning-based multidomain body parameter analysis via heterogeneous correlative body organ analysis. Soft comput 2024; 28:635. [PMID: 37362266 PMCID: PMC10248994 DOI: 10.1007/s00500-023-08613-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/23/2023] [Indexed: 06/28/2023]
Abstract
Progressive organ-level disorders in the human body are often correlated with diseases in other body parts. For instance, liver diseases can be linked with heart issues, while cancers can be linked with brain diseases (or psychological conditions). Defining such correlations is a complex task, and existing deep learning models that perform this task either showcase lower accuracy or are non-comprehensive when applied to real-time scenarios. To overcome these issues, this text proposes design of an augmented bioinspired deep learning-based multidomain body parameter analysis via heterogeneous correlative body organ analysis. The proposed model initially collects temporal and spatial data scans for different body parts and uses a multidomain feature extraction engine to convert these scans into vector sets. These vectors are processed by a Bacterial Foraging Optimizer (BFO), which assists in identification of highly variant feature sets, which are individually classified into different disease categories. A fusion of Inception Net, XCeption Net, and GoogLeNet Models is used to perform these classifications. The classified categories are linked with other disease types via temporal analysis of blood reports. The temporal analysis engine uses Modified Analytical Hierarchical Processing (MAHP) Model for calculating inter-organ disease dependency probabilities. Based on these probabilities, the model is able to generate a patient-level correlation map, which can be used by clinical experts to suggest remedial treatments, due to which the model was able to identify correlations between brain disorders and kidneys, heart diseases and lungs, heart diseases and liver, brain diseases and different types of cancers with high efficiency when evaluated under clinical scenarios. When validated on MITBIH, DEAP, CT Kidney, RIDER, and PLCO data samples, it was observed that the proposed model was capable of improving accuracy of correlation by 8.5%, while improving precision and recall by 3.2% when compared with existing correlation models under similar clinical scenarios.
Collapse
Affiliation(s)
- Samir N. Ajani
- Department of Computer Science
& Engineering (Data Science), St.
Vincent Pallotti College of Engineering and Technology,
Nagpur, Maharashtra India
| | - Rais Allauddin Mulla
- Department of Computer Engineering, Vasantdada Patil
Pratishthan College of Engineering and Visual Arts, Mumbai, Maharashtra India
| | - Suresh Limkar
- Department of Artificial Intelligence
and Data Science, AISSMS Institute of
Information Technology, Pune,
Maharashtra India
| | - Rashmi Ashtagi
- Department of Computer Engineering,
Vishwakarma Institute of Technology,
Bibwewadi, Pune, 411037 Maharashtra
India
| | - Sharmila K. Wagh
- Department of Computer Engineering,
Modern Education Society’s College of
Engineering, Pune, Maharashtra India
| | - Mahendra Eknath Pawar
- Department of Computer Engineering, Vasantdada Patil
Pratishthan College of Engineering and Visual Arts, Mumbai, Maharashtra India
| |
Collapse
|
2
|
Kartheeswaran KP, Rayan AXA, Varrieth GT. Genetically and semantically aware homogeneous network for prediction and scoring of comorbidities. Comput Biol Med 2024; 183:109252. [PMID: 39418770 DOI: 10.1016/j.compbiomed.2024.109252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Revised: 06/29/2024] [Accepted: 10/04/2024] [Indexed: 10/19/2024]
Abstract
OBJECTIVE Patients with comorbidities are highly prone to mortality risk than those suffering from a single disease. Therefore, quantification and prediction of disease comorbidities is necessary to stratify the mortality risk of the patients, predict the probability of their occurrence, design treatment strategies, and to prevent the progression of diseases. Enriching comorbidity disease relationships with rich semantics established by genetic components play a vital role in effectively quantifying and predicting comorbidities. However, the existing studies have not extensively explored the semantic richness conveyed by different types of genetic links connecting the comorbidity pairs. METHODS To solve this, a novel genetic-semantic aware weighted homogeneous network-based method, GSWHomoNet is proposed which first constructs the gene enriched comorbidity heterogeneous network, CoGHetNet with encoded genetic semantic aware weighted meta-path instance disease pair embedding to obtain an enhanced disease node embedding of the network. For enhanced comorbidity prediction and scoring, both direct and indirect semantically enriched comorbidity relationships of the disease nodes is preserved while transforming heterogeneous to homogeneous comorbidity network GSWHomoNet. The proposed GSWHomoNet not only helps discover comorbidity links transductively between known-known disease pairs but also improves the inductive link prediction between known-unknown disease pairs by supplying unknown disease nodes with semantically enriched heterogeneous structural knowledge. RESULTS The effectiveness of the proposed components is proved by AUC scores of 0.895 and 0.860, as well as AUPR scores of 0.903 and 0.873 for transductive and inductive link prediction respectively. In comorbidity scoring, GSWHomoNet outperformed other methods with a correlation result of 0.848. The effect of the improved association prediction ability of the genetic semantic aware weighted meta-path instance embedding based node embedding is proved on disease-microbe and bibliographic heterogeneous network datasets. For biological significance of GSWHomoNet-based comorbidity scoring, we compared it with gene, pathway, and protein-protein interaction (PPI) perspectives, revealing a stronger correlation with the PPI aspect. We identified a substantial number of predicted comorbidity disease pairs, with 77,456 and 48,972 pairs supported by literature evidence for transductive and inductive predictions, respectively. Additionally, we highlighted shared pathways and PPIs for these pairs, demonstrating the robustness of comorbidity predictions.
Collapse
Affiliation(s)
| | - Arockia Xavier Annie Rayan
- Department of Computer Science and Engineering, CEG Campus, Anna University, Chennai, Tamil Nadu, India.
| | | |
Collapse
|
3
|
Al Khatib HS, Neupane S, Kumar Manchukonda H, Golilarz NA, Mittal S, Amirlatifi A, Rahimi S. Patient-centric knowledge graphs: a survey of current methods, challenges, and applications. Front Artif Intell 2024; 7:1388479. [PMID: 39540199 PMCID: PMC11558794 DOI: 10.3389/frai.2024.1388479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Accepted: 09/18/2024] [Indexed: 11/16/2024] Open
Abstract
Patient-Centric Knowledge Graphs (PCKGs) represent an important shift in healthcare that focuses on individualized patient care by mapping the patient's health information holistically and multi-dimensionally. PCKGs integrate various types of health data to provide healthcare professionals with a comprehensive understanding of a patient's health, enabling more personalized and effective care. This literature review explores the methodologies, challenges, and opportunities associated with PCKGs, focusing on their role in integrating disparate healthcare data and enhancing patient care through a unified health perspective. In addition, this review also discusses the complexities of PCKG development, including ontology design, data integration techniques, knowledge extraction, and structured representation of knowledge. It highlights advanced techniques such as reasoning, semantic search, and inference mechanisms essential in constructing and evaluating PCKGs for actionable healthcare insights. We further explore the practical applications of PCKGs in personalized medicine, emphasizing their significance in improving disease prediction and formulating effective treatment plans. Overall, this review provides a foundational perspective on the current state-of-the-art and best practices of PCKGs, guiding future research and applications in this dynamic field.
Collapse
Affiliation(s)
- Hassan S. Al Khatib
- Department of Computer Science and Engineering, Mississippi State University, Starkville, MS, United States
| | | | | | | | | | | | | |
Collapse
|
4
|
Chen J, Gu Z, Lai L, Pei J. In silico protein function prediction: the rise of machine learning-based approaches. MEDICAL REVIEW (2021) 2023; 3:487-510. [PMID: 38282798 PMCID: PMC10808870 DOI: 10.1515/mr-2023-0038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 10/11/2023] [Indexed: 01/30/2024]
Abstract
Proteins function as integral actors in essential life processes, rendering the realm of protein research a fundamental domain that possesses the potential to propel advancements in pharmaceuticals and disease investigation. Within the context of protein research, an imperious demand arises to uncover protein functionalities and untangle intricate mechanistic underpinnings. Due to the exorbitant costs and limited throughput inherent in experimental investigations, computational models offer a promising alternative to accelerate protein function annotation. In recent years, protein pre-training models have exhibited noteworthy advancement across multiple prediction tasks. This advancement highlights a notable prospect for effectively tackling the intricate downstream task associated with protein function prediction. In this review, we elucidate the historical evolution and research paradigms of computational methods for predicting protein function. Subsequently, we summarize the progress in protein and molecule representation as well as feature extraction techniques. Furthermore, we assess the performance of machine learning-based algorithms across various objectives in protein function prediction, thereby offering a comprehensive perspective on the progress within this field.
Collapse
Affiliation(s)
- Jiaxiao Chen
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Zhonghui Gu
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Luhua Lai
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- BNLMS, College of Chemistry and Molecular Engineering, Peking University, Beijing, China
- Research Unit of Drug Design Method, Chinese Academy of Medical Sciences (2021RU014), Beijing, China
| | - Jianfeng Pei
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- Research Unit of Drug Design Method, Chinese Academy of Medical Sciences (2021RU014), Beijing, China
| |
Collapse
|
5
|
Liu T, Lan G, Feenstra KA, Huang Z, Heringa J. Towards a knowledge graph for pre-/probiotics and microbiota-gut-brain axis diseases. Sci Rep 2022; 12:18977. [PMID: 36347868 PMCID: PMC9643397 DOI: 10.1038/s41598-022-21735-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 09/30/2022] [Indexed: 11/09/2022] Open
Abstract
Scientific publications present biological relationships but are structured for human reading, making it difficult to use this resource for semantic integration and querying. Existing databases, on the other hand, are well structured for automated analysis, but do not contain comprehensive biological knowledge. We devised an approach for constructing comprehensive knowledge graphs from these two types of resources and applied it to investigate relationships between pre-/probiotics and microbiota-gut-brain axis diseases. To this end, we created (i) a knowledge base, dubbed ppstatement, containing manually curated detailed annotations, and (ii) a knowledge base, called ppconcept, containing automatically annotated concepts. The resulting Pre-/Probiotics Knowledge Graph (PPKG) combines these two knowledge bases with three other public databases (i.e. MeSH, UMLS and SNOMED CT). To validate the performance of PPKG and to demonstrate the added value of integrating two knowledge bases, we created four biological query cases. The query cases demonstrate that we can retrieve co-occurring concepts of interest, and also that combining the two knowledge bases leads to more comprehensive query results than utilizing them separately. The PPKG enables users to pose research queries such as "which pre-/probiotics combinations may benefit depression?", potentially leading to novel biological insights.
Collapse
Affiliation(s)
- Ting Liu
- grid.12380.380000 0004 1754 9227Department of Computer Science, Center for Integrative Bioinformatics, Vrije Universiteit Amsterdam, 1081 HV Amsterdam, The Netherlands ,grid.12380.380000 0004 1754 9227Knowledge Representation and Reasoning Group, Department of Computer Science, Vrije Universiteit Amsterdam, 1081 HV Amsterdam, The Netherlands
| | - Gongjin Lan
- grid.263817.90000 0004 1773 1790Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, 518055 China
| | - K. Anton Feenstra
- grid.12380.380000 0004 1754 9227Department of Computer Science, Center for Integrative Bioinformatics, Vrije Universiteit Amsterdam, 1081 HV Amsterdam, The Netherlands
| | - Zhisheng Huang
- grid.12380.380000 0004 1754 9227Knowledge Representation and Reasoning Group, Department of Computer Science, Vrije Universiteit Amsterdam, 1081 HV Amsterdam, The Netherlands
| | - Jaap Heringa
- grid.12380.380000 0004 1754 9227Department of Computer Science, Center for Integrative Bioinformatics, Vrije Universiteit Amsterdam, 1081 HV Amsterdam, The Netherlands
| |
Collapse
|
6
|
Sousa D, Couto FM. Biomedical Relation Extraction with Knowledge Graph-based Recommendations. IEEE J Biomed Health Inform 2022; 26:4207-4217. [PMID: 35536818 DOI: 10.1109/jbhi.2022.3173558] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Biomedical Relation Extraction (RE) systems identify and classify relations between biomedical entities to enhance our knowledge of biological and medical processes. Most state-of-the-art systems use deep learning approaches, mainly to target relations between entities of the same type, such as proteins or pharmacological substances. However, these systems are mostly restricted to what they directly identify on the text and ignore specialized domain knowledge bases, such as ontologies, that formalize and integrate biomedical information typically structured as direct acyclic graphs. On the other hand, Knowledge Graph (KG)-based recommendation systems already showed the importance of integrating KGs to add additional features to items. Typical systems have users as people and items that can range from movies to books, which people saw or read and classified according to their satisfaction rate. This work proposes to integrate KGs into biomedical RE through a recommendation model to further improve their range of action. We developed a new RE system, named K-BiOnt, by integrating a baseline state-of-the-art deep biomedical RE system with an existing KG-based recommendation state-of-the-art system. Our results show that adding recommendations from KG-based recommendation improves the system's ability to identify true relations that the baseline deep RE model could not extract from the text. All the software and data supporting our work will be made publicly available upon acceptance of this manuscript.
Collapse
|
7
|
Multitask Healthcare Management Recommendation System Leveraging Knowledge Graph. JOURNAL OF HEALTHCARE ENGINEERING 2021; 2021:1233483. [PMID: 34777727 PMCID: PMC8589481 DOI: 10.1155/2021/1233483] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/20/2021] [Accepted: 09/11/2021] [Indexed: 11/17/2022]
Abstract
In this paper, a novel multitask healthcare management recommendation system leveraging the knowledge graph is proposed, which is based on deep neural network and 5G network, and it can be applied in mobile and terminal device to free up medical resources and provide treatment programs. The technique we applied is referred to as KG-based recommendation system. When several experiments have been carried out, it is demonstrated that it is more intelligent and precise in disease prediction and treatment recommendation, similar to the state of the art. Also, it works well in the accuracy and comprehension, which is much higher and highly consistent with the predictions of the theoretical model. The fact that our work involves studies of multitask healthcare management recommendation system, which can contribute to the smart healthcare development, proves to be promising and encouraging.
Collapse
|
8
|
Alshahrani M, Thafar MA, Essack M. Application and evaluation of knowledge graph embeddings in biomedical data. PeerJ Comput Sci 2021; 7:e341. [PMID: 33816992 PMCID: PMC7959619 DOI: 10.7717/peerj-cs.341] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Accepted: 11/29/2020] [Indexed: 05/07/2023]
Abstract
Linked data and bio-ontologies enabling knowledge representation, standardization, and dissemination are an integral part of developing biological and biomedical databases. That is, linked data and bio-ontologies are employed in databases to maintain data integrity, data organization, and to empower search capabilities. However, linked data and bio-ontologies are more recently being used to represent information as multi-relational heterogeneous graphs, "knowledge graphs". The reason being, entities and relations in the knowledge graph can be represented as embedding vectors in semantic space, and these embedding vectors have been used to predict relationships between entities. Such knowledge graph embedding methods provide a practical approach to data analytics and increase chances of building machine learning models with high prediction accuracy that can enhance decision support systems. Here, we present a comparative assessment and a standard benchmark for knowledge graph-based representation learning methods focused on the link prediction task for biological relations. We systematically investigated and compared state-of-the-art embedding methods based on the design settings used for training and evaluation. We further tested various strategies aimed at controlling the amount of information related to each relation in the knowledge graph and its effects on the final performance. We also assessed the quality of the knowledge graph features through clustering and visualization and employed several evaluation metrics to examine their uses and differences. Based on this systematic comparison and assessments, we identify and discuss the limitations of knowledge graph-based representation learning methods and suggest some guidelines for the development of more improved methods.
Collapse
Affiliation(s)
- Mona Alshahrani
- Department of Computer Science and Engineering, Jubail University College, Jubail, Saudi Arabia
| | - Maha A. Thafar
- Computer, Electrical and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
- College of Computing and Information Technology, Taif University, Taif, Saudi Arabia
| | - Magbubah Essack
- Computer, Electrical and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| |
Collapse
|
9
|
Liu T, Pan X, Wang X, Feenstra KA, Heringa J, Huang Z. Predicting the relationships between gut microbiota and mental disorders with knowledge graphs. Health Inf Sci Syst 2020; 9:3. [PMID: 33262885 PMCID: PMC7686388 DOI: 10.1007/s13755-020-00128-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Accepted: 09/30/2020] [Indexed: 01/14/2023] Open
Abstract
Gut microbiota produce and modulate the production of neurotransmitters which have been implicated in mental disorders. Neurotransmitters may act as ‘matchmaker’ between gut microbiota imbalance and mental disorders. Most of the relevant research effort goes into the relationship between gut microbiota and neurotransmitters and the other between neurotransmitters and mental disorders, while few studies collect and analyze the dispersed research results in systematic ways. We therefore gather the dispersed results that in the existing studies into a structured knowledge base for identifying and predicting the potential relationships between gut microbiota and mental disorders. In this study, we propose to construct a gut microbiota knowledge graph for mental disorder, which named as MiKG4MD. It is extendable by linking to future ontologies by just adding new relationships between existing information and new entities. This extendibility is emphasized for the integration with existing popular ontologies/terminologies, e.g. UMLS, MeSH, and KEGG. We demonstrate the performance of MiKG4MD with three SPARQL query test cases. Results show that the MiKG4MD knowledge graph is an effective method to predict the relationships between gut microbiota and mental disorders.
Collapse
Affiliation(s)
- Ting Liu
- Knowledge Representation and Reasoning (KR&R) Group, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands.,Center for Integrative Bioinformatics VU (IBIVU), Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Xueli Pan
- Knowledge Representation and Reasoning (KR&R) Group, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Xu Wang
- Knowledge Representation and Reasoning (KR&R) Group, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - K Anton Feenstra
- Center for Integrative Bioinformatics VU (IBIVU), Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Jaap Heringa
- Center for Integrative Bioinformatics VU (IBIVU), Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Zhisheng Huang
- Knowledge Representation and Reasoning (KR&R) Group, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands.,Brain Protection Innovation Center, Capital Medical University, Beijing, China
| |
Collapse
|
10
|
Abstract
Knowledge-based biomedical data science involves the design and implementation of computer systems that act as if they knew about biomedicine. Such systems depend on formally represented knowledge in computer systems, often in the form of knowledge graphs. Here we survey recent progress in systems that use formally represented knowledge to address data science problems in both clinical and biological domains, as well as progress on approaches for creating knowledge graphs. Major themes include the relationships between knowledge graphs and machine learning, the use of natural language processing to construct knowledge graphs, and the expansion of novel knowledge-based approaches to clinical and biological domains.
Collapse
Affiliation(s)
- Tiffany J Callahan
- Computational Bioscience Program and Department of Pharmacology, University of Colorado Denver Anschutz Medical Campus, Aurora, Colorado 80045, USA
| | - Ignacio J Tripodi
- Department of Computer Science, University of Colorado, Boulder, Colorado 80309, USA
| | - Harrison Pielke-Lombardo
- Computational Bioscience Program and Department of Pharmacology, University of Colorado Denver Anschutz Medical Campus, Aurora, Colorado 80045, USA
| | - Lawrence E Hunter
- Computational Bioscience Program and Department of Pharmacology, University of Colorado Denver Anschutz Medical Campus, Aurora, Colorado 80045, USA
| |
Collapse
|