1
|
Ma L, Liu Y, Ren Y, Mi N, Fang J, Bao R, Xu X, Zhang H, Tang Y. Integrating bioinformatics and machine learning to uncover lncRNA LINC00269 as a key regulator in Parkinson's disease via pyroptosis pathways. Eur J Med Res 2024; 29:582. [PMID: 39696629 DOI: 10.1186/s40001-024-02201-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2024] [Accepted: 12/05/2024] [Indexed: 12/20/2024] Open
Abstract
BACKGROUND Pyroptosis, a specific type of programmed cell death, which has become a significant factor to Parkinson's disease (PD). Concurrently, long non-coding RNAs (lncRNAs) have garnered attention for their regulatory roles in neurodegenerative disorders. This study was designed to ascertain the key lncRNAs in pyroptosis pathways of PD and elucidate their regulatory mechanisms. METHODS Employing a combination of bioinformatics and machine learning, we analyzed PD data sets GSE133347 and GSE110716. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) recognized different lncRNAs. Through various algorithms such as Least Absolute Shrinkage and Selection Operator (LASSO), Random Forest (RF), and Weighted Gene Co-expression Network Analysis (WGCNA), we recognized LINC01606 and LINC00269, which are key factors during the emergence and development of PD. Furthermore, experimental validation was conducted in PD mouse models to confirm these bioinformatics findings. RESULTS The analysis showed that there were a large number of apoptosis-related gene expression changes in Parkinson's syndrome, for example, CASP1 and GSDME were up-regulated, and CASP9 and AIM2 were down-regulated. Among the lncRNAs, LINC01606 and LINC00269 were identified as potential modulators of pyroptosis. Notably, LINC00269 was observed to be significantly downregulated in the brain tissues of a PD mouse model, supporting its involvement in PD. The study also highlighted potential interactions of these lncRNAs with genes like ONECUT2, PRLR, CTNNA3, and LRP2. CONCLUSIONS This study identifies LINC00269 as a potential contributor to pyroptosis pathways in PD. While further investigation is required to fully elucidate its role, these findings provide new insights into PD pathogenesis and suggest potential avenues for future research on diagnostic and therapeutic targets. The study underscores the importance of integrating bioinformatics with experimental validation in neurodegenerative disease research.
Collapse
Affiliation(s)
- LiLi Ma
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Number 23, You Zheng Street, Nan Gang District, Harbin, 150001, Heilongjiang Province, China
- Department of Neurology, Jilin City Hospital of Chemical Industry, Jilin City, Jilin, China
| | - Yue Liu
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Harbin, 150000, Heilongjiang Province, China
| | - Yajing Ren
- School of Medical and Life Sciences, Chengdu University of TCM, Cheng du City, 611137, Sichuan Province, China
| | - Na Mi
- Department of Neurology, Chi Feng Municipal Hospital, Chi Feng City, 024000, Inner Mongolia Autonomous Region, China
| | - Jing Fang
- Department of Neurology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang Province, 150081, People's Republic of China
| | - Rui Bao
- Department of Rehabilitation, The Third Affiliated Hospital of Heilongjiang University of Chinese Medicine, Harbin, 150040, Heilongjiang Province, China
| | - Xiuzhi Xu
- General Medical Department, Heilongjiang Provincial Hospital, Harbin, 150036, Heilongjiang Province, China
| | - Hongjia Zhang
- Department of Neurology, Jilin City Hospital of Chemical Industry, Jilin City, Jilin, China.
| | - Ying Tang
- Department of Neurology, The First Affiliated Hospital of Harbin Medical University, Number 23, You Zheng Street, Nan Gang District, Harbin, 150001, Heilongjiang Province, China.
| |
Collapse
|
2
|
Zhuo E, Yang W, Wang Y, Tang Y, Wang W, Zhou L, Chen Y, Li P, Chen B, Gao W, Liu W. Global trends in machine learning applied to clinical research in liver cancer: Bibliometric and visualization analysis (2001-2024). Medicine (Baltimore) 2024; 103:e40790. [PMID: 39654222 PMCID: PMC11631000 DOI: 10.1097/md.0000000000040790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Revised: 11/09/2024] [Accepted: 11/14/2024] [Indexed: 12/12/2024] Open
Abstract
This study explores the intersection of liver cancer and machine learning through bibliometric analysis. The aim is to identify highly cited papers in the field and examine the current research landscape, highlighting emerging trends and key areas of focus in liver cancer and machine learning. By analyzing citation patterns, this study sheds light on the evolving role of machine learning in liver cancer research and its potential for future advancements.
Collapse
Affiliation(s)
- Enba Zhuo
- Department of Anesthesiology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
| | - Wenzhi Yang
- First Clinical College, Anhui Medical University, Hefei, China
| | - Yafen Wang
- Department of Radiation Oncology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
| | - Yanchao Tang
- Department of Anesthesiology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
| | - Wanrong Wang
- First Clinical College, Anhui Medical University, Hefei, China
| | - Lingyan Zhou
- First Clinical College, Anhui Medical University, Hefei, China
| | - Yanjun Chen
- First Clinical College, Anhui Medical University, Hefei, China
| | - Pengman Li
- Department of Anesthesiology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
| | - Bangjie Chen
- Department of Oncology, The First Affiliated Hospital of Anhui Medical University, Hefei, China
| | - Weimin Gao
- First Clinical College, Anhui Medical University, Hefei, China
| | - Wang Liu
- Department of General Surgery, Sanya Central Hospital (The Third People’s Hospital of Hainan Province), Sanya, China
| |
Collapse
|
3
|
Zhou K, Tian B, Lu J, Dong B, Xu H. Machine learning-guided synthesis of nanomaterials for breast cancer therapy. Sci Rep 2024; 14:25795. [PMID: 39468211 PMCID: PMC11519650 DOI: 10.1038/s41598-024-76924-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2024] [Accepted: 10/17/2024] [Indexed: 10/30/2024] Open
Abstract
Breast cancer is a common malignant tumor, which mostly occurs in female population and is caused by excessive proliferation of breast epithelial cells. Breast cancer can cause nipple discharge, breast lumps and other symptoms, but these symptoms lack certain specificity and are easily confused with other diseases, thus affecting the early treatment of the disease. Once the tumor progresses to the advanced stage, distant metastasis can occur, leading to dysfunction of the affected organs, and even threatening the patients' lives. In this study, we synthesized high drug-loading gel particles and applied them to control the release of insoluble drugs. This method is simple to prepare, cost-effective, and validates their potential in breast cancer therapy. We first characterized the morphology and physicochemical properties of gel loaded with newly synthesized compound 1 by scanning electron microscopy (SEM), Fourier-transform infrared spectroscopy (FT-IR), and thermal gravimetric analysis (TGA). Using newly synthesized insoluble compound 1 as a model drug, its efficacy in treating breast cancer was investigated. The results showed that hydrogel@compound 1 was able to significantly inhibit the proliferation, migration and invasion of breast cancer cells. Additionally, we utilized machine learning to screen three structurally similar compounds, which showed promising therapeutic effects, providing a new approach for the development of novel small-molecule drugs.
Collapse
Affiliation(s)
- Kun Zhou
- Department of General Surgery, Jing'an District Central Hospital of Shanghai, Shanghai, China
| | - Baoxing Tian
- Department of Breast Surgery, Tongren Hospital Affiliated to Shanghai Jiaotong University School of Medicine, Shanghai, China
| | - Ji Lu
- Department of General Surgery, Jing'an District Central Hospital of Shanghai, Shanghai, China
| | - Bing Dong
- Department of General Surgery, Jing'an District Central Hospital of Shanghai, Shanghai, China
| | - Han Xu
- Department of General Surgery, Jing'an District Central Hospital of Shanghai, Shanghai, China.
| |
Collapse
|
4
|
Zhang C, Yang J, Chen S, Sun L, Li K, Lai G, Peng B, Zhong X, Xie B. Artificial intelligence in ovarian cancer drug resistance advanced 3PM approach: subtype classification and prognostic modeling. EPMA J 2024; 15:525-544. [PMID: 39239109 PMCID: PMC11371997 DOI: 10.1007/s13167-024-00374-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Accepted: 07/06/2024] [Indexed: 09/07/2024]
Abstract
Background Ovarian cancer patients' resistance to first-line treatment posed a significant challenge, with approximately 70% experiencing recurrence and developing strong resistance to first-line chemotherapies like paclitaxel. Objectives Within the framework of predictive, preventive, and personalized medicine (3PM), this study aimed to use artificial intelligence to find drug resistance characteristics at the single cell, and further construct the classification strategy and deep learning prognostic models based on these resistance traits, which can better facilitate and perform 3PM. Methods This study employed "Beyondcell," an algorithm capable of predicting cellular drug responses, to calculate the similarity between the expression patterns of 21,937 cells from ovarian cancer samples and the signatures of 5201 drugs to identify drug-resistance cells. Drug resistance features were used to perform 10 multi-omics clustering on the TCGA training set to identify patient subgroups with differential drug responses. Concurrently, a deep learning prognostic model with KAN architecture which had a flexible activation function to better fit the model was constructed for this training set. The constructed patient subtype classifier and prognostic model were evaluated using three external validation sets from GEO: GSE17260, GSE26712, and GSE51088. Results This study identified that endothelial cells are resistant to paclitaxel, doxorubicin, and docetaxel, suggesting their potential as targets for cellular therapy in ovarian cancer patients. Based on drug resistance features, 10 multi-omics clustering identified four patient subtypes with differential responses to four chemotherapy drugs, in which subtype CS2 showed the highest drug sensitivity to all four drugs. The other subtypes also showed enrichment in different biological pathways and immune infiltration, allowing for targeted treatment based on their characteristics. Besides, this study applied the latest KAN architecture in artificial intelligence to replace the MLP structure in the DeepSurv prognostic model, finally demonstrating robust performance on patients' prognosis prediction. Conclusions This study, by classifying patients and constructing prognostic models based on resistance characteristics to first-line drugs, has effectively applied multi-omics data into the realm of 3PM. Supplementary Information The online version contains supplementary material available at 10.1007/s13167-024-00374-4.
Collapse
Affiliation(s)
- Cong Zhang
- Department of Epidemiology and Health Statistics, School of Public Health, Chongqing Medical University, Yixue Road, Chongqing, 400016 China
| | - Jinxiang Yang
- Department of Epidemiology and Health Statistics, School of Public Health, Chongqing Medical University, Yixue Road, Chongqing, 400016 China
| | - Siyu Chen
- Department of Epidemiology and Health Statistics, School of Public Health, Chongqing Medical University, Yixue Road, Chongqing, 400016 China
| | - Lichang Sun
- Department of Epidemiology and Health Statistics, School of Public Health, Chongqing Medical University, Yixue Road, Chongqing, 400016 China
| | - Kangjie Li
- Department of Epidemiology and Health Statistics, School of Public Health, Chongqing Medical University, Yixue Road, Chongqing, 400016 China
| | - Guichuan Lai
- Department of Epidemiology and Health Statistics, School of Public Health, Chongqing Medical University, Yixue Road, Chongqing, 400016 China
| | - Bin Peng
- Department of Epidemiology and Health Statistics, School of Public Health, Chongqing Medical University, Yixue Road, Chongqing, 400016 China
| | - Xiaoni Zhong
- Department of Epidemiology and Health Statistics, School of Public Health, Chongqing Medical University, Yixue Road, Chongqing, 400016 China
| | - Biao Xie
- Department of Epidemiology and Health Statistics, School of Public Health, Chongqing Medical University, Yixue Road, Chongqing, 400016 China
| |
Collapse
|
5
|
Mensah‐Bonsu M, Doss C, Gloster C, Muganda P. Gene expression analysis identifies hub genes and pathways distinguishing fatal from survivor outcomes of Ebola virus disease. FASEB Bioadv 2024; 6:298-310. [PMID: 39399477 PMCID: PMC11467745 DOI: 10.1096/fba.2024-00055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 06/06/2024] [Accepted: 07/02/2024] [Indexed: 10/15/2024] Open
Abstract
The Ebola virus poses a severe public health threat, yet understanding factors influencing disease outcomes remains incomplete. Our study aimed to identify critical pathways and hub genes associated with fatal and survivor Ebola disease outcomes. We analyzed differentially expressed hub genes (DEGs) between groups with fatal and survival outcomes, as well as a healthy control group. We conducted additional analysis to determine the functions and pathways associated with these DEGs. We found 13,198 DEGs in the fatal and 12,039 DEGs in the survival group compared to healthy controls, and 1873 DEGs in the acute fatal and survivor groups comparison. Upregulated DEGs in the comparison between the acute fatal and survivor groups were linked to ECM receptor interaction, complement and coagulation cascades, and PI3K-Akt signaling. Upregulated hub genes identified from the acute fatal and survivor comparison (FGB, C1QA, SERPINF2, PLAT, C9, SERPINE1, F3, VWF) were enriched in complement and coagulation cascades; the downregulated hub genes (IL1B, 1L17RE, XCL1, CXCL6, CCL4, CD8A, CD8B, CD3D) were associated with immune cell processes. Hub genes CCL2 and F2 were unique to fatal outcomes, while CXCL1, HIST1H4F, and IL1A were upregulated hub genes unique to survival outcomes compared to healthy controls. Our results demonstrate for the first time the association of EVD outcomes to specific hub genes and their associated pathways and biological processes. The identified hub genes and pathways could help better elucidate Ebola disease pathogenesis and contribute to the development of targeted interventions and personalized treatment for distinct EVD outcomes.
Collapse
Affiliation(s)
- Melvin Mensah‐Bonsu
- Applied Science and TechnologyNorth Carolina A&T State UniversityGreensboroNorth CarolinaUSA
| | - Christopher Doss
- Department of Electrical and Computer EngineeringNorth Carolina A&T State UniversityGreensboroNorth CarolinaUSA
| | - Clay Gloster
- Department of Computer Systems TechnologyNorth Carolina A&T State UniversityGreensboroNorth CarolinaUSA
| | - Perpetua Muganda
- Department of BiologyNorth Carolina A&T State UniversityGreensboroNorth CarolinaUSA
| |
Collapse
|
6
|
Li L, Sun M, Wang J, Wan S. Multi-omics based artificial intelligence for cancer research. Adv Cancer Res 2024; 163:303-356. [PMID: 39271266 DOI: 10.1016/bs.acr.2024.06.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2024]
Abstract
With significant advancements of next generation sequencing technologies, large amounts of multi-omics data, including genomics, epigenomics, transcriptomics, proteomics, and metabolomics, have been accumulated, offering an unprecedented opportunity to explore the heterogeneity and complexity of cancer across various molecular levels and scales. One of the promising aspects of multi-omics lies in its capacity to offer a holistic view of the biological networks and pathways underpinning cancer, facilitating a deeper understanding of its development, progression, and response to treatment. However, the exponential growth of data generated by multi-omics studies present significant analytical challenges. Processing, analyzing, integrating, and interpreting these multi-omics datasets to extract meaningful insights is an ambitious task that stands at the forefront of current cancer research. The application of artificial intelligence (AI) has emerged as a powerful solution to these challenges, demonstrating exceptional capabilities in deciphering complex patterns and extracting valuable information from large-scale, intricate omics datasets. This review delves into the synergy of AI and multi-omics, highlighting its revolutionary impact on oncology. We dissect how this confluence is reshaping the landscape of cancer research and clinical practice, particularly in the realms of early detection, diagnosis, prognosis, treatment and pathology. Additionally, we elaborate the latest AI methods for multi-omics integration to provide a comprehensive insight of the complex biological mechanisms and inherent heterogeneity of cancer. Finally, we discuss the current challenges of data harmonization, algorithm interpretability, and ethical considerations. Addressing these challenges necessitates a multidisciplinary collaboration, paving the promising way for more precise, personalized, and effective treatments for cancer patients.
Collapse
Affiliation(s)
- Lusheng Li
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States
| | - Mengtao Sun
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States
| | - Jieqiong Wang
- Department of Neurological Sciences, University of Nebraska Medical Center, Omaha, NE, United States
| | - Shibiao Wan
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, Omaha, NE, United States.
| |
Collapse
|
7
|
Saravanan KS, Satish KS, Saraswathy GR, Kuri U, Vastrad SJ, Giri R, Dsouza PL, Kumar AP, Nair G. Innovative target mining stratagems to navigate drug repurposing endeavours. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2024; 205:303-355. [PMID: 38789185 DOI: 10.1016/bs.pmbts.2024.03.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2024]
Abstract
The conventional theory linking a single gene with a particular disease and a specific drug contributes to the dwindling success rates of traditional drug discovery. This requires a substantial shift focussing on contemporary drug design or drug repurposing, which entails linking multiple genes to diverse physiological or pathological pathways and drugs. Lately, drug repurposing, the art of discovering new/unlabelled indications for existing drugs or candidates in clinical trials, is gaining attention owing to its success rates. The rate-limiting phase of this strategy lies in target identification, which is generally driven through disease-centric and/or drug-centric approaches. The disease-centric approach is based on exploration of crucial biomolecules such as genes or proteins underlying pathological cascades of the disease of interest. Investigating these pathological interplays aids in the identification of potential drug targets that can be leveraged for novel therapeutic interventions. The drug-centric approach involves various strategies such as exploring the mechanism of adverse drug reactions that can unearth potential targets, as these untoward reactions might be considered desirable therapeutic actions in other disease conditions. Currently, artificial intelligence is an emerging robust tool that can be used to translate the aforementioned intricate biological networks to render interpretable data for extracting precise molecular targets. Integration of multiple approaches, big data analytics, and clinical corroboration are essential for successful target mining. This chapter highlights the contemporary strategies steering target identification and diverse frameworks for drug repurposing. These strategies are illustrated through case studies curated from recent drug repurposing research inclined towards neurodegenerative diseases, cancer, infections, immunological, and cardiovascular disorders.
Collapse
Affiliation(s)
- Kamatchi Sundara Saravanan
- Department of Pharmacognosy, Faculty of Pharmacy, M.S. Ramaiah University of Applied Sciences, Bangalore, Karnataka, India
| | - Kshreeraja S Satish
- Department of Pharmacy Practice, Faculty of Pharmacy, M.S. Ramaiah University of Applied Sciences, Bangalore, Karnataka, India
| | - Ganesan Rajalekshmi Saraswathy
- Department of Pharmacy Practice, Faculty of Pharmacy, M.S. Ramaiah University of Applied Sciences, Bangalore, Karnataka, India.
| | - Ushnaa Kuri
- Department of Pharmacy Practice, Faculty of Pharmacy, M.S. Ramaiah University of Applied Sciences, Bangalore, Karnataka, India
| | - Soujanya J Vastrad
- Department of Pharmacy Practice, Faculty of Pharmacy, M.S. Ramaiah University of Applied Sciences, Bangalore, Karnataka, India
| | - Ritesh Giri
- Department of Pharmacy Practice, Faculty of Pharmacy, M.S. Ramaiah University of Applied Sciences, Bangalore, Karnataka, India
| | - Prizvan Lawrence Dsouza
- Department of Pharmacy Practice, Faculty of Pharmacy, M.S. Ramaiah University of Applied Sciences, Bangalore, Karnataka, India
| | - Adusumilli Pramod Kumar
- Department of Pharmacy Practice, Faculty of Pharmacy, M.S. Ramaiah University of Applied Sciences, Bangalore, Karnataka, India
| | - Gouri Nair
- Department of Pharmacology, Faculty of Pharmacy, M.S. Ramaiah University of Applied Sciences, Bangalore, Karnataka, India
| |
Collapse
|
8
|
Patel Y, Shah T, Dhar MK, Zhang T, Niezgoda J, Gopalakrishnan S, Yu Z. Integrated image and location analysis for wound classification: a deep learning approach. Sci Rep 2024; 14:7043. [PMID: 38528003 DOI: 10.1038/s41598-024-56626-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Accepted: 03/08/2024] [Indexed: 03/27/2024] Open
Abstract
The global burden of acute and chronic wounds presents a compelling case for enhancing wound classification methods, a vital step in diagnosing and determining optimal treatments. Recognizing this need, we introduce an innovative multi-modal network based on a deep convolutional neural network for categorizing wounds into four categories: diabetic, pressure, surgical, and venous ulcers. Our multi-modal network uses wound images and their corresponding body locations for more precise classification. A unique aspect of our methodology is incorporating a body map system that facilitates accurate wound location tagging, improving upon traditional wound image classification techniques. A distinctive feature of our approach is the integration of models such as VGG16, ResNet152, and EfficientNet within a novel architecture. This architecture includes elements like spatial and channel-wise Squeeze-and-Excitation modules, Axial Attention, and an Adaptive Gated Multi-Layer Perceptron, providing a robust foundation for classification. Our multi-modal network was trained and evaluated on two distinct datasets comprising relevant images and corresponding location information. Notably, our proposed network outperformed traditional methods, reaching an accuracy range of 74.79-100% for Region of Interest (ROI) without location classifications, 73.98-100% for ROI with location classifications, and 78.10-100% for whole image classifications. This marks a significant enhancement over previously reported performance metrics in the literature. Our results indicate the potential of our multi-modal network as an effective decision-support tool for wound image classification, paving the way for its application in various clinical contexts.
Collapse
Affiliation(s)
- Yash Patel
- Department of Computer Science, University of Wisconsin-Milwaukee, Milwaukee, WI, USA
| | - Tirth Shah
- Department of Computer Science, University of Wisconsin-Milwaukee, Milwaukee, WI, USA
| | - Mrinal Kanti Dhar
- Department of Computer Science, University of Wisconsin-Milwaukee, Milwaukee, WI, USA
| | - Taiyu Zhang
- Department of Computer Science, University of Wisconsin-Milwaukee, Milwaukee, WI, USA
| | - Jeffrey Niezgoda
- Advancing the Zenith of Healthcare (AZH) Wound and Vascular Center, Milwaukee, WI, USA
| | | | - Zeyun Yu
- Department of Computer Science, University of Wisconsin-Milwaukee, Milwaukee, WI, USA.
- Department of Biomedical Engineering, University of Wisconsin-Milwaukee, Milwaukee, WI, USA.
| |
Collapse
|
9
|
Xianyu Z, Correia C, Ung CY, Zhu S, Billadeau DD, Li H. The Rise of Hypothesis-Driven Artificial Intelligence in Oncology. Cancers (Basel) 2024; 16:822. [PMID: 38398213 PMCID: PMC10886811 DOI: 10.3390/cancers16040822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 02/12/2024] [Accepted: 02/14/2024] [Indexed: 02/25/2024] Open
Abstract
Cancer is a complex disease involving the deregulation of intricate cellular systems beyond genetic aberrations and, as such, requires sophisticated computational approaches and high-dimensional data for optimal interpretation. While conventional artificial intelligence (AI) models excel in many prediction tasks, they often lack interpretability and are blind to the scientific hypotheses generated by researchers to enable cancer discoveries. Here we propose that hypothesis-driven AI, a new emerging class of AI algorithm, is an innovative approach to uncovering the complex etiology of cancer from big omics data. This review exemplifies how hypothesis-driven AI is different from conventional AI by citing its application in various areas of oncology including tumor classification, patient stratification, cancer gene discovery, drug response prediction, and tumor spatial organization. Our aim is to stress the feasibility of incorporating domain knowledge and scientific hypotheses to craft the design of new AI algorithms. We showcase the power of hypothesis-driven AI in making novel cancer discoveries that can be overlooked by conventional AI methods. Since hypothesis-driven AI is still in its infancy, open questions such as how to better incorporate new knowledge and biological perspectives to ameliorate bias and improve interpretability in the design of AI algorithms still need to be addressed. In conclusion, hypothesis-driven AI holds great promise in the discovery of new mechanistic and functional insights that explain the complexity of cancer etiology and potentially chart a new roadmap to improve treatment regimens for individual patients.
Collapse
Affiliation(s)
- Zilin Xianyu
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA; (Z.X.); (C.C.); (C.Y.U.); (S.Z.); (D.D.B.)
| | - Cristina Correia
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA; (Z.X.); (C.C.); (C.Y.U.); (S.Z.); (D.D.B.)
| | - Choong Yong Ung
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA; (Z.X.); (C.C.); (C.Y.U.); (S.Z.); (D.D.B.)
| | - Shizhen Zhu
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA; (Z.X.); (C.C.); (C.Y.U.); (S.Z.); (D.D.B.)
- Department of Biochemistry and Molecular Biology, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA
| | - Daniel D. Billadeau
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA; (Z.X.); (C.C.); (C.Y.U.); (S.Z.); (D.D.B.)
- Department of Immunology, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA
| | - Hu Li
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA; (Z.X.); (C.C.); (C.Y.U.); (S.Z.); (D.D.B.)
| |
Collapse
|
10
|
Ung CY, Correia C, Li H, Adams CM, Westendorf JJ, Zhu S. Multiorgan locked-state model of chronic diseases and systems pharmacology opportunities. Drug Discov Today 2024; 29:103825. [PMID: 37967790 PMCID: PMC11109989 DOI: 10.1016/j.drudis.2023.103825] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 10/29/2023] [Accepted: 11/08/2023] [Indexed: 11/17/2023]
Abstract
With increasing human life expectancy, the global medical burden of chronic diseases is growing. Hence, chronic diseases are a pressing health concern and will continue to be in decades to come. Chronic diseases often involve multiple malfunctioning organs in the body. An imminent question is how interorgan crosstalk contributes to the etiology of chronic diseases. We conceived the locked-state model (LoSM), which illustrates how interorgan communication can give rise to body-wide memory-like properties that 'lock' healthy or pathological conditions. Next, we propose cutting-edge systems biology and artificial intelligence strategies to decipher chronic multiorgan locked states. Finally, we discuss the clinical implications of the LoSM and assess the power of systems-based therapies to dismantle pathological multiorgan locked states while improving treatments for chronic diseases.
Collapse
Affiliation(s)
- Choong Yong Ung
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, USA
| | - Cristina Correia
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, USA
| | - Hu Li
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, USA
| | - Christopher M Adams
- Division of Endocrinology, Diabetes, Metabolism and Nutrition, Mayo Clinic, Rochester, MN, USA; Department of Biochemistry and Molecular Biology, Mayo Clinic, Rochester, MN, USA
| | - Jennifer J Westendorf
- Department of Biochemistry and Molecular Biology, Mayo Clinic, Rochester, MN, USA; Department of Orthopedic Surgery, Mayo Clinic, Rochester, MN, USA
| | - Shizhen Zhu
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, USA; Department of Biochemistry and Molecular Biology, Mayo Clinic, Rochester, MN, USA.
| |
Collapse
|
11
|
Fernandez ME, Martinez-Romero J, Aon MA, Bernier M, Price NL, de Cabo R. How is Big Data reshaping preclinical aging research? Lab Anim (NY) 2023; 52:289-314. [PMID: 38017182 DOI: 10.1038/s41684-023-01286-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Accepted: 10/10/2023] [Indexed: 11/30/2023]
Abstract
The exponential scientific and technological progress during the past 30 years has favored the comprehensive characterization of aging processes with their multivariate nature, leading to the advent of Big Data in preclinical aging research. Spanning from molecular omics to organism-level deep phenotyping, Big Data demands large computational resources for storage and analysis, as well as new analytical tools and conceptual frameworks to gain novel insights leading to discovery. Systems biology has emerged as a paradigm that utilizes Big Data to gain insightful information enabling a better understanding of living organisms, visualized as multilayered networks of interacting molecules, cells, tissues and organs at different spatiotemporal scales. In this framework, where aging, health and disease represent emergent states from an evolving dynamic complex system, context given by, for example, strain, sex and feeding times, becomes paramount for defining the biological trajectory of an organism. Using bioinformatics and artificial intelligence, the systems biology approach is leading to remarkable advances in our understanding of the underlying mechanism of aging biology and assisting in creative experimental study designs in animal models. Future in-depth knowledge acquisition will depend on the ability to fully integrate information from different spatiotemporal scales in organisms, which will probably require the adoption of theories and methods from the field of complex systems. Here we review state-of-the-art approaches in preclinical research, with a focus on rodent models, that are leading to conceptual and/or technical advances in leveraging Big Data to understand basic aging biology and its full translational potential.
Collapse
Affiliation(s)
- Maria Emilia Fernandez
- Experimental Gerontology Section, Translational Gerontology Branch, National Institute on Aging, National Institutes of Health, Baltimore, MD, USA
| | - Jorge Martinez-Romero
- Experimental Gerontology Section, Translational Gerontology Branch, National Institute on Aging, National Institutes of Health, Baltimore, MD, USA
- Laboratory of Epidemiology and Population Science, National Institute on Aging, National Institutes of Health, Baltimore, MD, USA
| | - Miguel A Aon
- Experimental Gerontology Section, Translational Gerontology Branch, National Institute on Aging, National Institutes of Health, Baltimore, MD, USA
- Laboratory of Cardiovascular Science, National Institute on Aging, National Institutes of Health, Baltimore, MD, USA
| | - Michel Bernier
- Experimental Gerontology Section, Translational Gerontology Branch, National Institute on Aging, National Institutes of Health, Baltimore, MD, USA
| | - Nathan L Price
- Experimental Gerontology Section, Translational Gerontology Branch, National Institute on Aging, National Institutes of Health, Baltimore, MD, USA
| | - Rafael de Cabo
- Experimental Gerontology Section, Translational Gerontology Branch, National Institute on Aging, National Institutes of Health, Baltimore, MD, USA.
| |
Collapse
|
12
|
Favreau E, Geist KS, Wyatt CDR, Toth AL, Sumner S, Rehan SM. Co-expression Gene Networks and Machine-learning Algorithms Unveil a Core Genetic Toolkit for Reproductive Division of Labour in Rudimentary Insect Societies. Genome Biol Evol 2023; 15:evac174. [PMID: 36527688 PMCID: PMC9830183 DOI: 10.1093/gbe/evac174] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 12/06/2022] [Accepted: 12/10/2022] [Indexed: 12/23/2022] Open
Abstract
The evolution of eusociality requires that individuals forgo some or all their own reproduction to assist the reproduction of others in their group, such as a primary egg-laying queen. A major open question is how genes and genetic pathways sculpt the evolution of eusociality, especially in rudimentary forms of sociality-those with smaller cooperative nests when compared with species such as honeybees that possess large societies. We lack comprehensive comparative studies examining shared patterns and processes across multiple social lineages. Here we examine the mechanisms of molecular convergence across two lineages of bees and wasps exhibiting such rudimentary societies. These societies consist of few individuals and their life histories range from facultative to obligately social. Using six species across four independent origins of sociality, we conduct a comparative meta-analysis of publicly available transcriptomes. Standard methods detected little similarity in patterns of differential gene expression in brain transcriptomes among reproductive and non-reproductive individuals across species. By contrast, both supervised machine learning and consensus co-expression network approaches uncovered sets of genes with conserved expression patterns among reproductive and non-reproductive phenotypes across species. These sets overlap substantially, and may comprise a shared genetic "toolkit" for sociality across the distantly related taxa of bees and wasps and independently evolved lineages of sociality. We also found many lineage-specific genes and co-expression modules associated with social phenotypes and possible signatures of shared life-history traits. These results reveal how taxon-specific molecular mechanisms complement a core toolkit of molecular processes in sculpting traits related to the evolution of eusociality.
Collapse
Affiliation(s)
- Emeline Favreau
- Department of Genetics, Environment, Evolution, University College London, London WC1E 6BT, United Kingdom
| | - Katherine S Geist
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa 50011
| | - Christopher D R Wyatt
- Department of Genetics, Environment, Evolution, University College London, London WC1E 6BT, United Kingdom
| | - Amy L Toth
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa 50011
| | - Seirian Sumner
- Department of Genetics, Environment, Evolution, University College London, London WC1E 6BT, United Kingdom
| | - Sandra M Rehan
- Department of Biology, York University, Toronto, ON M3J 1P3, Canada
| |
Collapse
|
13
|
Wu M, Huang W, Yang N, Liu Y. Learn from antibody–drug conjugates: consideration in the future construction of peptide-drug conjugates for cancer therapy. Exp Hematol Oncol 2022; 11:93. [DOI: 10.1186/s40164-022-00347-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2022] [Accepted: 10/17/2022] [Indexed: 11/09/2022] Open
Abstract
AbstractCancer is one of the leading causes of death worldwide due to high heterogeneity. Although chemotherapy remains the mainstay of cancer therapy, non-selective toxicity and drug resistance of mono-chemotherapy incur broad criticisms. Subsequently, various combination strategies have been developed to improve clinical efficacy, also known as cocktail therapy. However, conventional “cocktail administration” is just passable, due to the potential toxicities to normal tissues and unsatisfactory synergistic effects, especially for the combined drugs with different pharmacokinetic properties. The drug conjugates through coupling the conventional chemotherapeutics to a carrier (such as antibody and peptide) provide an alternative strategy to improve therapeutic efficacy and simultaneously reduce the unspecific toxicities, by virtue of the advantages of highly specific targeting ability and potent killing effect. Although 14 antibody–drug conjugates (ADCs) have been approved worldwide and more are being investigated in clinical trials so far, several limitations have been disclosed during clinical application. Compared with ADCs, peptide-drug conjugates (PDCs) possess several advantages, including easy industrial synthesis, low cost, high tissue penetration and fast clearance. So far, only a handful of PDCs have been approved, highlighting tremendous development potential. Herein, we discuss the progress and pitfalls in the development of ADCs and underline what can learn from ADCs for the better construction of PDCs in the future.
Collapse
|
14
|
LSAP: A Machine Learning Method for Leaf-Senescence-Associated Genes Prediction. LIFE (BASEL, SWITZERLAND) 2022; 12:life12071095. [PMID: 35888183 PMCID: PMC9316258 DOI: 10.3390/life12071095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Revised: 07/16/2022] [Accepted: 07/17/2022] [Indexed: 11/16/2022]
Abstract
Plant leaves, which convert light energy into chemical energy, serve as a major food source on Earth. The decrease in crop yield and quality is caused by plant leaf premature senescence. It is important to detect senescence-associated genes. In this study, we collected 5853 genes from a leaf senescence database and developed a leaf-senescence-associated genes (SAGs) prediction model using the support vector machine (SVM) and XGBoost algorithms. This is the first computational approach for predicting SAGs with the sequence dataset. The SVM-PCA-Kmer-PC-PseAAC model achieved the best performance (F1score = 0.866, accuracy = 0.862 and receiver operating characteristic = 0.922), and based on this model, we developed a SAGs prediction tool called “SAGs_Anno”. We identified a total of 1,398,277 SAGs from 3,165,746 gene sequences from 83 species, including 12 lower plants and 71 higher plants. Interestingly, leafy species showed a higher percentage of SAGs, while leafless species showed a lower percentage of SAGs. Finally, we constructed the Leaf SAGs Annotation Platform using these available datasets and the SAGs_Anno tool, which helps users to easily predict, download, and search for plant leaf SAGs of all species. Our study will provide rich resources for plant leaf-senescence-associated genes research.
Collapse
|
15
|
Guo S, Zhang H, Chu Y, Jiang Q, Ma Y. A neural network-based framework to understand the type 2 diabetes-related alteration of the human gut microbiome. IMETA 2022; 1:e20. [PMID: 38868565 PMCID: PMC10989819 DOI: 10.1002/imt2.20] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Revised: 03/13/2022] [Accepted: 03/15/2022] [Indexed: 06/14/2024]
Abstract
The identification of microbial markers adequate to delineate the disease-related microbiome alterations from the complex human gut microbiota is of great interest. Here, we develop a framework combining neural network (NN) and random forest, resulting in 40 marker species and 90 marker genes identified from the metagenomic data set (185 healthy and 183 type 2 diabetes [T2D] samples), respectively. In terms of these markers, the NN model obtained higher accuracy in classifying the T2D-related samples than other methods; the interaction network analyses identified the key species and functional modules; the regression analysis determined that fasting blood glucose is the most significant factor (p < 0.05) in the T2D-related alteration of the human gut microbiome. We also observed that those marker species varied little across the case and control samples greatly shift in the different stages of the T2D development, suggestive of their important roles in the T2D-related microbiome alteration. Our study provides a new way of identifying the disease-related biomarkers and analyzing the role they may play in the development of the disease.
Collapse
Affiliation(s)
- Shun Guo
- Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
- Key Laboratory of Quantitative Engineering Biology, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
- Shenzhen Key Laboratory of Synthetic Genomics; Guangdong Provincial Key Laboratory of Synthetic Genomics, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
- Shenzhen Key Lab for High Performance Data Mining, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
| | - Haoran Zhang
- Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
- Key Laboratory of Quantitative Engineering Biology, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
- Shenzhen Key Laboratory of Synthetic Genomics; Guangdong Provincial Key Laboratory of Synthetic Genomics, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
| | - Yunmeng Chu
- Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
- Key Laboratory of Quantitative Engineering Biology, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
- Shenzhen Key Laboratory of Synthetic Genomics; Guangdong Provincial Key Laboratory of Synthetic Genomics, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
| | - Qingshan Jiang
- Shenzhen Key Lab for High Performance Data Mining, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
| | - Yingfei Ma
- Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
- Key Laboratory of Quantitative Engineering Biology, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
- Shenzhen Key Laboratory of Synthetic Genomics; Guangdong Provincial Key Laboratory of Synthetic Genomics, Shenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdongChina
| |
Collapse
|
16
|
You Y, Lai X, Pan Y, Zheng H, Vera J, Liu S, Deng S, Zhang L. Artificial intelligence in cancer target identification and drug discovery. Signal Transduct Target Ther 2022; 7:156. [PMID: 35538061 PMCID: PMC9090746 DOI: 10.1038/s41392-022-00994-0] [Citation(s) in RCA: 111] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2021] [Revised: 03/14/2022] [Accepted: 04/05/2022] [Indexed: 02/08/2023] Open
Abstract
Artificial intelligence is an advanced method to identify novel anticancer targets and discover novel drugs from biology networks because the networks can effectively preserve and quantify the interaction between components of cell systems underlying human diseases such as cancer. Here, we review and discuss how to employ artificial intelligence approaches to identify novel anticancer targets and discover drugs. First, we describe the scope of artificial intelligence biology analysis for novel anticancer target investigations. Second, we review and discuss the basic principles and theory of commonly used network-based and machine learning-based artificial intelligence algorithms. Finally, we showcase the applications of artificial intelligence approaches in cancer target identification and drug discovery. Taken together, the artificial intelligence models have provided us with a quantitative framework to study the relationship between network characteristics and cancer, thereby leading to the identification of potential anticancer targets and the discovery of novel drug candidates.
Collapse
Affiliation(s)
- Yujie You
- College of Computer Science, Sichuan University, Chengdu, 610065, China
| | - Xin Lai
- Laboratory of Systems Tumor Immunology, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU) and Universitätsklinikum Erlangen, Erlangen, 91052, Germany
| | - Yi Pan
- Faculty of Computer Science and Control Engineering, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Room D513, 1068 Xueyuan Avenue, Shenzhen University Town, Shenzhen, 518055, China
| | - Huiru Zheng
- School of Computing, Ulster University, Belfast, BT15 1ED, UK
| | - Julio Vera
- Laboratory of Systems Tumor Immunology, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU) and Universitätsklinikum Erlangen, Erlangen, 91052, Germany
| | - Suran Liu
- College of Computer Science, Sichuan University, Chengdu, 610065, China
| | - Senyi Deng
- Institute of Thoracic Oncology, Department of Thoracic Surgery, West China Hospital, Sichuan University, Chengdu, 610065, China.
| | - Le Zhang
- College of Computer Science, Sichuan University, Chengdu, 610065, China.
- Key Laboratory of Systems Biology, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Hangzhou, 310024, China.
- Key Laboratory of Systems Health Science of Zhejiang Province, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou, 310024, China.
| |
Collapse
|
17
|
Shehab M, Abualigah L, Shambour Q, Abu-Hashem MA, Shambour MKY, Alsalibi AI, Gandomi AH. Machine learning in medical applications: A review of state-of-the-art methods. Comput Biol Med 2022; 145:105458. [PMID: 35364311 DOI: 10.1016/j.compbiomed.2022.105458] [Citation(s) in RCA: 133] [Impact Index Per Article: 44.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Revised: 03/23/2022] [Accepted: 03/24/2022] [Indexed: 12/11/2022]
Abstract
Applications of machine learning (ML) methods have been used extensively to solve various complex challenges in recent years in various application areas, such as medical, financial, environmental, marketing, security, and industrial applications. ML methods are characterized by their ability to examine many data and discover exciting relationships, provide interpretation, and identify patterns. ML can help enhance the reliability, performance, predictability, and accuracy of diagnostic systems for many diseases. This survey provides a comprehensive review of the use of ML in the medical field highlighting standard technologies and how they affect medical diagnosis. Five major medical applications are deeply discussed, focusing on adapting the ML models to solve the problems in cancer, medical chemistry, brain, medical imaging, and wearable sensors. Finally, this survey provides valuable references and guidance for researchers, practitioners, and decision-makers framing future research and development directions.
Collapse
Affiliation(s)
- Mohammad Shehab
- Information Technology, The World Islamic Sciences and Education University. Amman, Jordan.
| | - Laith Abualigah
- Faculty of Computer Sciences and Informatics, Amman Arab University, Amman, Jordan; School of Computer Sciences, Universiti Sains Malaysia, Pulau, Pinang, 11800, Malaysia.
| | - Qusai Shambour
- Department of Software Engineering, Al-Ahliyya Amman University, Amman, Jordan.
| | - Muhannad A Abu-Hashem
- Department of Geomatics, Faculty of Architecture and Planning, King Abdulaziz University, Jeddah, Saudi Arabia.
| | | | | | - Amir H Gandomi
- Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo, NSW, 2007, Australia.
| |
Collapse
|
18
|
Correia C, Weiskittel TM, Ung CY, Villasboas Bisneto JC, Billadeau DD, Kaufmann SH, Li H. Uncovering Pharmacological Opportunities for Cancer Stem Cells-A Systems Biology View. Front Cell Dev Biol 2022; 10:752326. [PMID: 35359437 PMCID: PMC8962639 DOI: 10.3389/fcell.2022.752326] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Accepted: 02/10/2022] [Indexed: 12/14/2022] Open
Abstract
Cancer stem cells (CSCs) represent a small fraction of the total cancer cell population, yet they are thought to drive disease propagation, therapy resistance and relapse. Like healthy stem cells, CSCs possess the ability to self-renew and differentiate. These stemness phenotypes of CSCs rely on multiple molecular cues, including signaling pathways (for example, WNT, Notch and Hedgehog), cell surface molecules that interact with cellular niche components, and microenvironmental interactions with immune cells. Despite the importance of understanding CSC biology, our knowledge of how neighboring immune and tumor cell populations collectively shape CSC stemness is incomplete. Here, we provide a systems biology perspective on the crucial roles of cellular population identification and dissection of cell regulatory states. By reviewing state-of-the-art single-cell technologies, we show how innovative systems-based analysis enables a deeper understanding of the stemness of the tumor niche and the influence of intratumoral cancer cell and immune cell compositions. We also summarize strategies for refining CSC systems biology, and the potential role of this approach in the development of improved anticancer treatments. Because CSCs are amenable to cellular transitions, we envision how systems pharmacology can become a major engine for discovery of novel targets and drug candidates that can modulate state transitions for tumor cell reprogramming. Our aim is to provide deeper insights into cancer stemness from a systems perspective. We believe this approach has great potential to guide the development of more effective personalized cancer therapies that can prevent CSC-mediated relapse.
Collapse
Affiliation(s)
- Cristina Correia
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, United States
| | - Taylor M Weiskittel
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, United States
| | - Choong Yong Ung
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, United States
| | | | - Daniel D Billadeau
- Department of Immunology, Mayo Clinic, Rochester, MN, United States,Division of Oncology Research, Mayo Clinic, Rochester, MN, United States
| | - Scott H Kaufmann
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, United States,Division of Hematology, Department of Medicine, Mayo Clinic, Rochester, MN, United States,Division of Oncology Research, Mayo Clinic, Rochester, MN, United States
| | - Hu Li
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN, United States,*Correspondence: Hu Li,
| |
Collapse
|
19
|
Ung CY, Levee TM, Zhang C, Correia C, Yeo KS, Li H, Zhu S. Gene utility recapitulates chromosomal aberrancies in advanced stage neuroblastoma. Comput Struct Biotechnol J 2022; 20:3291-3303. [PMID: 35832612 PMCID: PMC9251784 DOI: 10.1016/j.csbj.2022.06.024] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 06/11/2022] [Indexed: 11/03/2022] Open
Abstract
Neuroblastoma (NB) is the most common extracranial solid tumor in children. Although only a few recurrent somatic mutations have been identified, chromosomal abnormalities, including the loss of heterozygosity (LOH) at the chromosome 1p and gains of chromosome 17q, are often seen in the high-risk cases. The biological basis and evolutionary forces that drive such genetic abnormalities remain enigmatic. Here, we conceptualize the Gene Utility Model (GUM) that seeks to identify genes driving biological signaling via their collective gene utilities and apply it to understand the impact of those differentially utilized genes on constraining the evolution of NB karyotypes. By employing a computational process-guided flow algorithm to model gene utility in protein–protein networks that built based on transcriptomic data, we conducted several pairwise comparative analyses to uncover genes with differential utilities in stage 4 NBs with distinct classification. We then constructed a utility karyotype by mapping these differentially utilized genes to their respective chromosomal loci. Intriguingly, hotspots of the utility karyotype, to certain extent, can consistently recapitulate the major chromosomal abnormalities of NBs and also provides clues to yet identified predisposition sites. Hence, our study not only provides a new look, from a gene utility perspective, into the known chromosomal abnormalities detected by integrative genomic sequencing efforts, but also offers new insights into the etiology of NB and provides a framework to facilitate the identification of novel therapeutic targets for this devastating childhood cancer.
Collapse
|
20
|
Mey F, Clauwaert J, Van Huffel K, Waegeman W, De Mey M. Improving the performance of machine learning models for biotechnology: The quest for deus ex machina. Biotechnol Adv 2021; 53:107858. [PMID: 34695560 DOI: 10.1016/j.biotechadv.2021.107858] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 10/13/2021] [Accepted: 10/14/2021] [Indexed: 11/24/2022]
Abstract
Machine learning is becoming an integral part of the Design-Build-Test-Learn cycle in biotechnology. Machine learning models learn from collected datasets such as omics data and predict a defined outcome, which has led to both production improvements and predictive tools in the field. Robust prediction of the behavior of microbial cell factories and production processes not only greatly increases our understanding of the function of such systems, but also provides significant savings of development time. However, many pitfalls when modeling biological data - bad fit, noisy data, model instability, low data quantity and imbalances in the data - cause models to suffer in their performance. Here we provide an accessible, in-depth analysis on the problems created by these pitfalls, as well as means of their detection and mediation, with a focus on supervised learning. Assessing the state of the art, we show that, currently, in-depth analyses of model performance are often absent and must be improved. This review provides a toolbox for the analysis of model robustness and performance, and simultaneously proposes a standard for the community to facilitate future work. It is further accompanied by an interactive online tutorial on the discussed issues.
Collapse
Affiliation(s)
- Friederike Mey
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, 9000 Ghent, Belgium
| | - Jim Clauwaert
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, 9000 Ghent, Belgium
| | - Kirsten Van Huffel
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, 9000 Ghent, Belgium
| | - Willem Waegeman
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, 9000 Ghent, Belgium
| | - Marjan De Mey
- Centre for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, 9000 Ghent, Belgium.
| |
Collapse
|
21
|
Weiskittel TM, Correia C, Yu GT, Ung CY, Kaufmann SH, Billadeau DD, Li H. The Trifecta of Single-Cell, Systems-Biology, and Machine-Learning Approaches. Genes (Basel) 2021; 12:1098. [PMID: 34356114 PMCID: PMC8306972 DOI: 10.3390/genes12071098] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 07/13/2021] [Accepted: 07/18/2021] [Indexed: 12/18/2022] Open
Abstract
Together, single-cell technologies and systems biology have been used to investigate previously unanswerable questions in biomedicine with unparalleled detail. Despite these advances, gaps in analytical capacity remain. Machine learning, which has revolutionized biomedical imaging analysis, drug discovery, and systems biology, is an ideal strategy to fill these gaps in single-cell studies. Machine learning additionally has proven to be remarkably synergistic with single-cell data because it remedies unique challenges while capitalizing on the positive aspects of single-cell data. In this review, we describe how systems-biology algorithms have layered machine learning with biological components to provide systems level analyses of single-cell omics data, thus elucidating complex biological mechanisms. Accordingly, we highlight the trifecta of single-cell, systems-biology, and machine-learning approaches and illustrate how this trifecta can significantly contribute to five key areas of scientific research: cell trajectory and identity, individualized medicine, pharmacology, spatial omics, and multi-omics. Given its success to date, the systems-biology, single-cell omics, and machine-learning trifecta has proven to be a potent combination that will further advance biomedical research.
Collapse
Affiliation(s)
- Taylor M. Weiskittel
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Mayo Clinic, 200 First, Street SW, Rochester, MN 55905, USA; (T.M.W.); (C.C.); (G.T.Y.); (C.Y.U.); (S.H.K.)
| | - Cristina Correia
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Mayo Clinic, 200 First, Street SW, Rochester, MN 55905, USA; (T.M.W.); (C.C.); (G.T.Y.); (C.Y.U.); (S.H.K.)
| | - Grace T. Yu
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Mayo Clinic, 200 First, Street SW, Rochester, MN 55905, USA; (T.M.W.); (C.C.); (G.T.Y.); (C.Y.U.); (S.H.K.)
| | - Choong Yong Ung
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Mayo Clinic, 200 First, Street SW, Rochester, MN 55905, USA; (T.M.W.); (C.C.); (G.T.Y.); (C.Y.U.); (S.H.K.)
| | - Scott H. Kaufmann
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Mayo Clinic, 200 First, Street SW, Rochester, MN 55905, USA; (T.M.W.); (C.C.); (G.T.Y.); (C.Y.U.); (S.H.K.)
| | - Daniel D. Billadeau
- Department of Immunology, Mayo Clinic College of Medicine and Science, Mayo Clinic, 200 First, Street SW, Rochester, MN 55905, USA;
| | - Hu Li
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Mayo Clinic, 200 First, Street SW, Rochester, MN 55905, USA; (T.M.W.); (C.C.); (G.T.Y.); (C.Y.U.); (S.H.K.)
| |
Collapse
|
22
|
Hunt C, Montgomery S, Berkenpas JW, Sigafoos N, Oakley JC, Espinosa J, Justice N, Kishaba K, Hippe K, Si D, Hou J, Ding H, Cao R. Recent Progress of Machine Learning in Gene Therapy. Curr Gene Ther 2021; 22:132-143. [PMID: 34161210 DOI: 10.2174/1566523221666210622164133] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 03/15/2021] [Accepted: 04/02/2021] [Indexed: 11/22/2022]
Abstract
With new developments in biomedical technology, it is now a viable therapeutic treatment to alter genes with techniques like CRISPR. At the same time, it is increasingly cheaper to do whole genome sequencing, resulting in rapid advancement in gene therapy and editing in precision medicine. Thus, understanding the current industry and academic applications of gene therapy provides an important backdrop to future scientific developments. Additionally, machine learning and artificial intelligence techniques allow for the reduction of time and money spent in the development of new gene therapy products and techniques. In this paper, we survey the current progress of gene therapy treatments for several diseases and explore machine learning applications in gene therapy. We also discuss the ethical implications of gene therapy and the use of machine learning in precision medicine. Machine learning and gene therapy are both topics gaining popularity in various publications, and we conclude that there is still room for continued research and application of machine learning techniques in the gene therapy field.
Collapse
Affiliation(s)
- Cassandra Hunt
- Department of Computer Science, Pacific Lutheran University, Tacoma, WA, United States
| | - Sandra Montgomery
- Department of Physics, Pacific Lutheran University, Tacoma, WA, United States
| | | | - Noel Sigafoos
- Department of Computer Science, Pacific Lutheran University, Tacoma, WA, United States
| | - John Christian Oakley
- Department of Computer Science, Pacific Lutheran University, Tacoma, WA, United States
| | - Jacob Espinosa
- Department of Mathematics, Pacific Lutheran University, Tacoma, WA, United States
| | - Nicola Justice
- Department of Mathematics, Pacific Lutheran University, Tacoma, WA, United States
| | - Kiyomi Kishaba
- Department of Humanities, Pacific Lutheran University, Tacoma, WA, United States
| | - Kyle Hippe
- Department of Computer Science, Pacific Lutheran University, Tacoma, WA, United States
| | - Dong Si
- Division of Computing Software Systems, University of Washington-Bothell, Bothell, WA, United States
| | - Jie Hou
- Department of Computer Science, Saint Louis University, St. Louis, MO, United States
| | - Hui Ding
- School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China
| | - Renzhi Cao
- Department of Computer Science, Pacific Lutheran University, Tacoma, WA, United States
| |
Collapse
|
23
|
Vatansever S, Schlessinger A, Wacker D, Kaniskan HÜ, Jin J, Zhou M, Zhang B. Artificial intelligence and machine learning-aided drug discovery in central nervous system diseases: State-of-the-arts and future directions. Med Res Rev 2021; 41:1427-1473. [PMID: 33295676 PMCID: PMC8043990 DOI: 10.1002/med.21764] [Citation(s) in RCA: 131] [Impact Index Per Article: 32.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 10/30/2020] [Accepted: 11/20/2020] [Indexed: 01/11/2023]
Abstract
Neurological disorders significantly outnumber diseases in other therapeutic areas. However, developing drugs for central nervous system (CNS) disorders remains the most challenging area in drug discovery, accompanied with the long timelines and high attrition rates. With the rapid growth of biomedical data enabled by advanced experimental technologies, artificial intelligence (AI) and machine learning (ML) have emerged as an indispensable tool to draw meaningful insights and improve decision making in drug discovery. Thanks to the advancements in AI and ML algorithms, now the AI/ML-driven solutions have an unprecedented potential to accelerate the process of CNS drug discovery with better success rate. In this review, we comprehensively summarize AI/ML-powered pharmaceutical discovery efforts and their implementations in the CNS area. After introducing the AI/ML models as well as the conceptualization and data preparation, we outline the applications of AI/ML technologies to several key procedures in drug discovery, including target identification, compound screening, hit/lead generation and optimization, drug response and synergy prediction, de novo drug design, and drug repurposing. We review the current state-of-the-art of AI/ML-guided CNS drug discovery, focusing on blood-brain barrier permeability prediction and implementation into therapeutic discovery for neurological diseases. Finally, we discuss the major challenges and limitations of current approaches and possible future directions that may provide resolutions to these difficulties.
Collapse
Affiliation(s)
- Sezen Vatansever
- Department of Genetics and Genomic SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Transformative Disease ModelingIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Icahn Institute for Data Science and Genomic TechnologyIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Avner Schlessinger
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Daniel Wacker
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of NeuroscienceIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - H. Ümit Kaniskan
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Oncological Sciences, Tisch Cancer InstituteIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Jian Jin
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Oncological Sciences, Tisch Cancer InstituteIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Ming‐Ming Zhou
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Oncological Sciences, Tisch Cancer InstituteIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Bin Zhang
- Department of Genetics and Genomic SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Transformative Disease ModelingIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Icahn Institute for Data Science and Genomic TechnologyIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| |
Collapse
|
24
|
Auslander N, Gussow AB, Koonin EV. Incorporating Machine Learning into Established Bioinformatics Frameworks. Int J Mol Sci 2021; 22:2903. [PMID: 33809353 PMCID: PMC8000113 DOI: 10.3390/ijms22062903] [Citation(s) in RCA: 44] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 03/08/2021] [Accepted: 03/10/2021] [Indexed: 12/23/2022] Open
Abstract
The exponential growth of biomedical data in recent years has urged the application of numerous machine learning techniques to address emerging problems in biology and clinical research. By enabling the automatic feature extraction, selection, and generation of predictive models, these methods can be used to efficiently study complex biological systems. Machine learning techniques are frequently integrated with bioinformatic methods, as well as curated databases and biological networks, to enhance training and validation, identify the best interpretable features, and enable feature and model investigation. Here, we review recently developed methods that incorporate machine learning within the same framework with techniques from molecular evolution, protein structure analysis, systems biology, and disease genomics. We outline the challenges posed for machine learning, and, in particular, deep learning in biomedicine, and suggest unique opportunities for machine learning techniques integrated with established bioinformatics approaches to overcome some of these challenges.
Collapse
Affiliation(s)
| | | | - Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA;
| |
Collapse
|
25
|
Shahmoradi M, Rezvani Z. Functional Prediction of Long Noncoding RNAs in Cutaneous Melanoma Using a Systems Biology Approach. Bioinform Biol Insights 2021; 15:1177932220988508. [PMID: 33613027 PMCID: PMC7868446 DOI: 10.1177/1177932220988508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Accepted: 12/20/2020] [Indexed: 11/17/2022] Open
Abstract
Cutaneous melanoma is the most aggressive type of skin cancer which its incidence has significantly increased in recent years worldwide. Thus, more investigations are required to identify the underlying mechanisms of melanoma malignant transformation and metastasis. In this context, long noncoding RNAs (lncRNAs) are a new type of noncoding transcripts that their dysregulations are associated with almost all cancers including melanoma. However, the precise functional roles of most of the significantly altered lncRNAs in melanoma have not yet been fully inspected. In this study, a comprehensive list of lncRNAs was interrogated across cutaneous melanoma samples to identify the significantly altered/dysregulated lncRNAs. To this end, lncRNAs were filtered in several steps and the selected lncRNAs projected to a bioinformatic and systems biology analysis using several publicly available databases and tools such as GEPIA and cBioPortal. According to our results, 30 lncRNAs were notably altered/dysregulated in cutaneous melanoma most of which were co-expressed with each other. Also, co-expression/alteration and differential expression analyses led to the selection of 12 out of these 30 lncRNAs as cutaneous melanoma key lncRNAs. Furthermore, functional demonstrated that these 12 lncRNAs might be involved in melanoma-relevant biological processes and pathways. In addition, the end result of our analyses demonstrated that these lncRNAs are associated with the clinicopathological features of melanoma patients. These 12 lncRNAs need to be further investigated in future studies to characterize their exact roles in melanoma development and to identify their potential for being used as drug targets and/or biomarkers for cutaneous melanoma.
Collapse
Affiliation(s)
- Mozhdeh Shahmoradi
- Division of Biotechnology, Department of Cell and Molecular Biology, Faculty of Chemistry, University of Kashan, Kashan, Iran
| | - Zahra Rezvani
- Division of Biotechnology, Department of Cell and Molecular Biology, Faculty of Chemistry, University of Kashan, Kashan, Iran
| |
Collapse
|
26
|
Monie DD, Bhandarkar AR, Parney IF, Correia C, Sarkaria JN, Vile RG, Li H. Synthetic and systems biology principles in the design of programmable oncolytic virus immunotherapies for glioblastoma. Neurosurg Focus 2021; 50:E10. [PMID: 33524942 DOI: 10.3171/2020.12.focus20855] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 12/04/2020] [Indexed: 12/11/2022]
Abstract
Oncolytic viruses (OVs) are a class of immunotherapeutic agents with promising preclinical results for the treatment of glioblastoma (GBM) but have shown limited success in recent clinical trials. Advanced bioengineering principles from disciplines such as synthetic and systems biology are needed to overcome the current challenges faced in developing effective OV-based immunotherapies for GBMs, including off-target effects and poor clinical responses. Synthetic biology is an emerging field that focuses on the development of synthetic DNA constructs that encode networks of genes and proteins (synthetic genetic circuits) to perform novel functions, whereas systems biology is an analytical framework that enables the study of complex interactions between host pathways and these synthetic genetic circuits. In this review, the authors summarize synthetic and systems biology concepts for developing programmable, logic-based OVs to treat GBMs. Programmable OVs can increase selectivity for tumor cells and enhance the local immunological response using synthetic genetic circuits. The authors discuss key principles for developing programmable OV-based immunotherapies, including how to 1) select an appropriate chassis, a vector that carries a synthetic genetic circuit, and 2) design a synthetic genetic circuit that can be programmed to sense key signals in the GBM microenvironment and trigger release of a therapeutic payload. To illustrate these principles, some original laboratory data are included, highlighting the need for systems biology studies, as well as some preliminary network analyses in preparation for synthetic biology applications. Examples from the literature of state-of-the-art synthetic genetic circuits that can be packaged into leading candidate OV chassis are also surveyed and discussed.
Collapse
Affiliation(s)
- Dileep D Monie
- Departments of1Immunology.,6Mayo Clinic Alix School of Medicine.,7Mayo Clinic Graduate School of Biomedical Sciences; and Mayo Clinic College of Medicine and Science, Rochester, Minnesota
| | | | | | - Cristina Correia
- 5Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic
| | | | | | - Hu Li
- 5Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic
| |
Collapse
|
27
|
Machine Learning and Systems Biology Approaches to Characterize Dosage-Based Gene Dependencies in Cancer Cells. JOURNAL OF BIOINFORMATICS AND SYSTEMS BIOLOGY : OPEN ACCESS 2021; 4:13-32. [PMID: 33842927 PMCID: PMC8031731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/29/2022]
Abstract
Mapping of cancer survivability factors allows for the identification of novel biological insights for drug targeting. Using genomic editing techniques, gene dependencies can be extracted in a high-throughput and quantitative manner. Dependencies have been predicted using machine learning techniques on -omics data, but the biological consequences of dependency predictor pairs has not been explored. In this work we devised a framework to explore gene dependency using an ensemble of machine learning methods, and our learned models captured meaningful biological information beyond just gene dependency prediction. We show that dosage-based dependent predictors (DDPs) primarily belonged to transcriptional regulation ontologies. We also found that anti-sense RNAs and long- noncoding RNA transcripts display DDPs. Network analyses revealed that SOX10, HLA-J, and ZEB2 act as a triad of network hubs in the dependent-predictor network. Collectively, we demonstrate the powerful combination of machine learning and systems biology approach can illuminate new insights in understanding gene dependency and guide novel targeting avenues.
Collapse
|
28
|
Agamah FE, Mazandu GK, Hassan R, Bope CD, Thomford NE, Ghansah A, Chimusa ER. Computational/in silico methods in drug target and lead prediction. Brief Bioinform 2020; 21:1663-1675. [PMID: 31711157 PMCID: PMC7673338 DOI: 10.1093/bib/bbz103] [Citation(s) in RCA: 116] [Impact Index Per Article: 23.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Revised: 07/17/2019] [Accepted: 07/18/2019] [Indexed: 01/10/2023] Open
Abstract
Drug-like compounds are most of the time denied approval and use owing to the unexpected clinical side effects and cross-reactivity observed during clinical trials. These unexpected outcomes resulting in significant increase in attrition rate centralizes on the selected drug targets. These targets may be disease candidate proteins or genes, biological pathways, disease-associated microRNAs, disease-related biomarkers, abnormal molecular phenotypes, crucial nodes of biological network or molecular functions. This is generally linked to several factors, including incomplete knowledge on the drug targets and unpredicted pharmacokinetic expressions upon target interaction or off-target effects. A method used to identify targets, especially for polygenic diseases, is essential and constitutes a major bottleneck in drug development with the fundamental stage being the identification and validation of drug targets of interest for further downstream processes. Thus, various computational methods have been developed to complement experimental approaches in drug discovery. Here, we present an overview of various computational methods and tools applied in predicting or validating drug targets and drug-like molecules. We provide an overview on their advantages and compare these methods to identify effective methods which likely lead to optimal results. We also explore major sources of drug failure considering the challenges and opportunities involved. This review might guide researchers on selecting the most efficient approach or technique during the computational drug discovery process.
Collapse
Affiliation(s)
- Francis E Agamah
- Division of Human Genetics, Department of Pathology, University of Cape Town, Observatory 7925, South Africa
| | - Gaston K Mazandu
- Division of Human Genetics, Department of Pathology, University of Cape Town, Observatory 7925, South Africa
- African Institute for Mathematical Sciences, Muizenberg, Cape Town 7945, South Africa
| | - Radia Hassan
- Division of Human Genetics, Department of Pathology, University of Cape Town, Observatory 7925, South Africa
| | - Christian D Bope
- Division of Human Genetics, Department of Pathology, University of Cape Town, Observatory 7925, South Africa
- Faculty of Sciences, University of Kinshasa, Kinshasa, Democratic Republic of Congo
| | - Nicholas E Thomford
- Division of Human Genetics, Department of Pathology, University of Cape Town, Observatory 7925, South Africa
- School of Medical Sciences, University of Cape Coast, PMB, Cape Coast, Ghana
| | - Anita Ghansah
- Noguchi Memorial Institute for Medical Research, College of Health Sciences, University of Ghana, PO Box LG 581, Legon, Ghana
| | - Emile R Chimusa
- Division of Human Genetics, Department of Pathology, University of Cape Town, Observatory 7925, South Africa
| |
Collapse
|
29
|
Rauschert S, Raubenheimer K, Melton PE, Huang RC. Machine learning and clinical epigenetics: a review of challenges for diagnosis and classification. Clin Epigenetics 2020; 12:51. [PMID: 32245523 PMCID: PMC7118917 DOI: 10.1186/s13148-020-00842-4] [Citation(s) in RCA: 98] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Accepted: 03/22/2020] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Machine learning is a sub-field of artificial intelligence, which utilises large data sets to make predictions for future events. Although most algorithms used in machine learning were developed as far back as the 1950s, the advent of big data in combination with dramatically increased computing power has spurred renewed interest in this technology over the last two decades. MAIN BODY Within the medical field, machine learning is promising in the development of assistive clinical tools for detection of e.g. cancers and prediction of disease. Recent advances in deep learning technologies, a sub-discipline of machine learning that requires less user input but more data and processing power, has provided even greater promise in assisting physicians to achieve accurate diagnoses. Within the fields of genetics and its sub-field epigenetics, both prime examples of complex data, machine learning methods are on the rise, as the field of personalised medicine is aiming for treatment of the individual based on their genetic and epigenetic profiles. CONCLUSION We now have an ever-growing number of reported epigenetic alterations in disease, and this offers a chance to increase sensitivity and specificity of future diagnostics and therapies. Currently, there are limited studies using machine learning applied to epigenetics. They pertain to a wide variety of disease states and have used mostly supervised machine learning methods.
Collapse
Affiliation(s)
- S Rauschert
- Telethon Kids Institute, University of Western Australia, Nedlands, Perth, Western Australia.
| | - K Raubenheimer
- School of Medicine, Notre Dame University, Fremantle, Western Australia
| | - P E Melton
- Centre for Genetic Origins of Health and Disease, The University of Western Australia and Curtin University, Perth, Western Australia
- School of Pharmacy and Biomedical Sciences, Curtin University, Bentley, Western Australia
- Menzies Institute for Medical Research, University of Tasmania, Hobart, Tasmania, Australia
| | - R C Huang
- Telethon Kids Institute, University of Western Australia, Nedlands, Perth, Western Australia
| |
Collapse
|
30
|
Manzoni C, Lewis PA, Ferrari R. Network Analysis for Complex Neurodegenerative Diseases. CURRENT GENETIC MEDICINE REPORTS 2020. [DOI: 10.1007/s40142-020-00181-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Abstract
Purpose of Review
Biomedicine is witnessing a paradigm shift in the way complex disorders are investigated. In particular, the need for big data interpretation has led to the development of pipelines that require the cooperation of different fields of expertise, including medicine, functional biology, informatics, mathematics and systems biology. This review sits at the crossroad of different disciplines and surveys the recent developments in the use of graph theory (in the form of network analysis) to interpret large and different datasets in the context of complex neurodegenerative diseases. It aims at a professional audience with different backgrounds.
Recent Findings
Biomedicine has entered the era of big data, and this is actively changing the way we approach and perform research. The increase in size and power of biomedical studies has led to the establishment of multi-centre, international working groups coordinating open access platforms for data generation, storage and analysis. Particularly, pipelines for data interpretation are under development, and network analysis is gaining momentum since it represents a versatile approach to study complex systems made of interconnected multiple players.
Summary
We will describe the era of big data in biomedicine and survey the major freely accessible multi-omics datasets. We will then introduce the principles of graph theory and provide examples of network analysis applied to the interpretation of complex neurodegenerative disorders.
Collapse
|
31
|
Ung CY, Ghanat Bari M, Zhang C, Liang J, Correia C, Li H. Regulostat Inferelator: a novel network biology platform to uncover molecular devices that predetermine cellular response phenotypes. Nucleic Acids Res 2019; 47:e82. [PMID: 31114928 PMCID: PMC6698671 DOI: 10.1093/nar/gkz417] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Revised: 04/18/2019] [Accepted: 05/17/2019] [Indexed: 12/24/2022] Open
Abstract
With the emergence of genome editing technologies and synthetic biology, it is now possible to engineer genetic circuits driving a cell's phenotypic response to a stressor. However, capturing a continuous response, rather than simply a binary ‘on’ or ‘off’ response, remains a bioengineering challenge. No tools currently exist to identify gene candidates responsible for predetermining and fine-tuning cell response phenotypes. To address this gap, we devised a novel Regulostat Inferelator (RSI) algorithm to decipher intrinsic molecular devices or networks that predetermine cellular phenotypic responses. The RSI algorithm is designed to extract gene expression patterns from basal transcriptomic data in order to identify ‘regulostat’ constituent gene pairs, which exhibit rheostat-like mode-of-cooperation capable of fine-tuning cellular response. Our proof-of-concept study provides computational evidence for the existence of regulostats and that these networks predetermine cellular response prior to exposure to a stressor or drug. In addition, our work, for the first time, provides evidence of context-specific, drug–regulostat interactions in predetermining drug response phenotypes in cancer cells. Given RSI-inferred regulostat networks offer insights for prioritizing gene candidates capable of rendering a resistant phenotype sensitive to a given drug, we envision that this tool will be of great value in bioengineering and medicine.
Collapse
Affiliation(s)
- Choong Yong Ung
- Center for Individualized Medicine, Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, MN 55905, USA
| | - Mehrab Ghanat Bari
- Center for Individualized Medicine, Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, MN 55905, USA
| | - Cheng Zhang
- Center for Individualized Medicine, Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, MN 55905, USA
| | - Jingjing Liang
- Department of Population and Quantitative Health Science, Case Western Reserve University, Cleveland, OH, USA
| | - Cristina Correia
- Center for Individualized Medicine, Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, MN 55905, USA
| | - Hu Li
- Center for Individualized Medicine, Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine, Rochester, MN 55905, USA
| |
Collapse
|
32
|
Akter S, Xu D, Nagel SC, Bromfield JJ, Pelch K, Wilshire GB, Joshi T. Machine Learning Classifiers for Endometriosis Using Transcriptomics and Methylomics Data. Front Genet 2019; 10:766. [PMID: 31552087 PMCID: PMC6737999 DOI: 10.3389/fgene.2019.00766] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2019] [Accepted: 07/19/2019] [Indexed: 12/29/2022] Open
Abstract
Endometriosis is a complex and common gynecological disorder yet a poorly understood disease affecting about 176 million women worldwide and causing significant impact on their quality of life and economic burden. Neither a definitive clinical symptom nor a minimally invasive diagnostic method is available, thus leading to an average of 4 to 11 years of diagnostic latency. Discovery of relevant biological patterns from microarray expression or next generation sequencing (NGS) data has been advanced over the last several decades by applying various machine learning tools. We performed machine learning analysis using 38 RNA-seq and 80 enrichment-based DNA methylation (MBD-seq) datasets. We experimented how well various supervised machine learning methods such as decision tree, partial least squares discriminant analysis (PLSDA), support vector machine, and random forest perform in classifying endometriosis from the control samples trained on both transcriptomics and methylomics data. The assessment was done from two different perspectives for improving classification performances: a) implication of three different normalization techniques and b) implication of differential analysis using the generalized linear model (GLM). Several candidate biomarker genes were identified by multiple machine learning experiments including NOTCH3, SNAPC2, B4GALNT1, SMAP2, DDB2, GTF3C5, and PTOV1 from the transcriptomics data analysis and TRPM6, RASSF2, TNIP2, RP3-522J7.6, FGD3, and MFSD14B from the methylomics data analysis. We concluded that an appropriate machine learning diagnostic pipeline for endometriosis should use TMM normalization for transcriptomics data, and quantile or voom normalization for methylomics data, GLM for feature space reduction and classification performance maximization.
Collapse
Affiliation(s)
- Sadia Akter
- Informatics Institute, University of Missouri, Columbia, MO, United States
| | - Dong Xu
- Informatics Institute, University of Missouri, Columbia, MO, United States
- Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, United States
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, United States
| | - Susan C. Nagel
- OB/GYN and Women’s Health, University of Missouri School of Medicine, Columbia, MO, United States
| | - John J. Bromfield
- OB/GYN and Women’s Health, University of Missouri School of Medicine, Columbia, MO, United States
| | - Katherine Pelch
- OB/GYN and Women’s Health, University of Missouri School of Medicine, Columbia, MO, United States
| | | | - Trupti Joshi
- Informatics Institute, University of Missouri, Columbia, MO, United States
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, United States
- Health Management and Informatics, University of Missouri, Columbia, MO, United States
| |
Collapse
|
33
|
Nakamura M, Bax HJ, Scotto D, Souri EA, Sollie S, Harris RJ, Hammar N, Walldius G, Winship A, Ghosh S, Montes A, Spicer JF, Van Hemelrijck M, Josephs DH, Lacy KE, Tsoka S, Karagiannis SN. Immune mediator expression signatures are associated with improved outcome in ovarian carcinoma. Oncoimmunology 2019; 8:e1593811. [PMID: 31069161 PMCID: PMC6492968 DOI: 10.1080/2162402x.2019.1593811] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Revised: 02/17/2019] [Accepted: 03/02/2019] [Indexed: 01/04/2023] Open
Abstract
Immune and inflammatory cascades may play multiple roles in ovarian cancer. We aimed to identify relationships between expression of immune and inflammatory mediators and patient outcomes. We interrogated differential gene expression of 44 markers and marker combinations (n = 1,978) in 1,656 ovarian carcinoma patient tumors, alongside matched 5-year overall survival (OS) data in silico. Using machine learning methods, we investigated whether genomic expression of these 44 mediators can discriminate between malignant and non-malignant tissues in 839 ovarian cancer and 115 non-malignant ovary samples. We furthermore assessed inflammation markers in 289 ovarian cancer patients’ sera in the Swedish Apolipoprotein MOrtality-related RISk (AMORIS) cohort. Expression of the 44 mediators could discriminate between malignant and non-malignant tissues with at least 96% accuracy. Higher expression of classical Th1, Th2, Th17, anti-parasitic/infection and M1 macrophage mediator signatures were associated with better OS. Contrastingly, inflammatory and angiogenic mediators, CXCL-12, C-reactive protein (CRP) and platelet-derived growth factor subunit A (PDGFA) were negatively associated with OS. Of the serum inflammatory markers in the AMORIS cohort, women with ovarian cancer who had elevated levels of haptoglobin (≥1.4 g/L) had a higher risk of dying from ovarian cancer compared to those with haptoglobin levels <1.4 g/L (HR = 2.09, 95% CI:1.38–3.16). Our findings indicate that elevated “classical” immune mediators, associated with response to pathogen antigen challenge, may confer immunological advantage in ovarian cancer, while inflammatory markers appear to have negative prognostic value. These highlight associations between immune protection, inflammation and clinical outcomes, and offer opportunities for patient stratification based on secretome markers.
Collapse
Affiliation(s)
- Mano Nakamura
- St. John's Institute of Dermatology, School of Basic and Medical Biosciences, King's College London, London, UK
| | - Heather J Bax
- St. John's Institute of Dermatology, School of Basic and Medical Biosciences, King's College London, London, UK.,School of Cancer and Pharmaceutical Sciences, King's College London, London, UK
| | - Daniele Scotto
- St. John's Institute of Dermatology, School of Basic and Medical Biosciences, King's College London, London, UK
| | - Elmira Amiri Souri
- Department of Informatics, Faculty of Natural and Mathematical Sciences, King's College London, London, UK
| | - Sam Sollie
- King's College London, School of Cancer and Pharmaceutical Sciences, Translational Oncology & Urology Research (TOUR), London, UK
| | - Robert J Harris
- St. John's Institute of Dermatology, School of Basic and Medical Biosciences, King's College London, London, UK
| | - Niklas Hammar
- Unit of Epidemiology, Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Goran Walldius
- Unit of Cardiovascular Epidemiology, Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Anna Winship
- Departments of Medical Oncology and Clinical Oncology, Guy's and St Thomas' NHS Foundation Trust, London, UK
| | - Sharmistha Ghosh
- Departments of Medical Oncology and Clinical Oncology, Guy's and St Thomas' NHS Foundation Trust, London, UK
| | - Ana Montes
- Departments of Medical Oncology and Clinical Oncology, Guy's and St Thomas' NHS Foundation Trust, London, UK
| | - James F Spicer
- School of Cancer and Pharmaceutical Sciences, King's College London, London, UK
| | - Mieke Van Hemelrijck
- King's College London, School of Cancer and Pharmaceutical Sciences, Translational Oncology & Urology Research (TOUR), London, UK.,Unit of Epidemiology, Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Debra H Josephs
- St. John's Institute of Dermatology, School of Basic and Medical Biosciences, King's College London, London, UK.,School of Cancer and Pharmaceutical Sciences, King's College London, London, UK
| | - Katie E Lacy
- St. John's Institute of Dermatology, School of Basic and Medical Biosciences, King's College London, London, UK
| | - Sophia Tsoka
- Department of Informatics, Faculty of Natural and Mathematical Sciences, King's College London, London, UK
| | - Sophia N Karagiannis
- St. John's Institute of Dermatology, School of Basic and Medical Biosciences, King's College London, London, UK
| |
Collapse
|
34
|
Zhang W, Li W, Zhang J, Wang N. Data Integration of Hybrid Microarray and Single Cell Expression Data to Enhance Gene Network Inference. Curr Bioinform 2019. [DOI: 10.2174/1574893614666190104142228] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Background:
Gene Regulatory Network (GRN) inference algorithms aim to explore
casual interactions between genes and transcriptional factors. High-throughput transcriptomics
data including DNA microarray and single cell expression data contain complementary
information in network inference.
Objective:
To enhance GRN inference, data integration across various types of expression data
becomes an economic and efficient solution.
Method:
In this paper, a novel E-alpha integration rule-based ensemble inference algorithm is
proposed to merge complementary information from microarray and single cell expression data.
This paper implements a Gradient Boosting Tree (GBT) inference algorithm to compute
importance scores for candidate gene-gene pairs. The proposed E-alpha rule quantitatively
evaluates the credibility levels of each information source and determines the final ranked list.
Results:
Two groups of in silico gene networks are applied to illustrate the effectiveness of the
proposed E-alpha integration. Experimental outcomes with size50 and size100 in silico gene
networks suggest that the proposed E-alpha rule significantly improves performance metrics
compared with single information source.
Conclusion:
In GRN inference, the integration of hybrid expression data using E-alpha rule
provides a feasible and efficient way to enhance performance metrics than solely increasing
sample sizes.
Collapse
Affiliation(s)
- Wei Zhang
- Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou, 310013, China
| | - Wenchao Li
- Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou, 310013, China
| | - Jianming Zhang
- Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou, 310013, China
| | - Ning Wang
- Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou, 310013, China
| |
Collapse
|
35
|
Samandari M, Julia MG, Rice A, Chronopoulos A, Del Rio Hernandez AE. Liquid biopsies for management of pancreatic cancer. Transl Res 2018; 201:98-127. [PMID: 30118658 DOI: 10.1016/j.trsl.2018.07.008] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/16/2018] [Revised: 06/17/2018] [Accepted: 07/17/2018] [Indexed: 02/07/2023]
Abstract
Pancreatic cancer is one of the main causes of cancer-related deaths worldwide. It is asymptomatic at an early stage, and most diagnosis occurs when the disease is already at a late stage, by which time the tumor is nonresectable. In order to increase the overall survival of patients with pancreatic cancer, as well as to decrease the cancer burden, it is necessary to perform early diagnosis, prognosis stratifications and cancer monitoring using accurate, minimally invasive, and cost-effective methods. Liquid biopsies seek to detect tumor-associated biomarkers in a variety of extractable body fluids and can help to monitor treatment response and disease progression, and even predict patient outcome. In patients with pancreatic cancer, tumor-derived materials, primarily circulating tumor DNA, circulating tumor cells and exosomes, are being studied for inclusion in the management of the disease. This review focuses on describing the biology of these biomarkers, methods for their enrichment and detection, as well as their potential for clinical application. Moreover, we discuss the future direction of liquid biopsies and introduce how they can be exploited toward point of care personalized medicine for the management of pancreatic cancer.
Collapse
Affiliation(s)
- Mohamadmahdi Samandari
- Cellular and Molecular Biomechanics Laboratory, Department of Bioengineering, Imperial College London, London SW7 2AZ, United Kingdom
| | - María Gil Julia
- Cellular and Molecular Biomechanics Laboratory, Department of Bioengineering, Imperial College London, London SW7 2AZ, United Kingdom
| | - Alistair Rice
- Cellular and Molecular Biomechanics Laboratory, Department of Bioengineering, Imperial College London, London SW7 2AZ, United Kingdom
| | - Antonios Chronopoulos
- Cellular and Molecular Biomechanics Laboratory, Department of Bioengineering, Imperial College London, London SW7 2AZ, United Kingdom
| | - Armando E Del Rio Hernandez
- Cellular and Molecular Biomechanics Laboratory, Department of Bioengineering, Imperial College London, London SW7 2AZ, United Kingdom.
| |
Collapse
|
36
|
Zomnir MG, Lipkin L, Pacula M, Dominguez Meneses E, MacLeay A, Duraisamy S, Nadhamuni N, Al Turki SH, Zheng Z, Rivera M, Nardi V, Dias-Santagata D, Iafrate AJ, Le LP, Lennerz JK. Artificial Intelligence Approach for Variant Reporting. JCO Clin Cancer Inform 2018; 2:CCI.16.00079. [PMID: 30364844 PMCID: PMC6198661 DOI: 10.1200/cci.16.00079] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Purpose Next-generation sequencing technologies are actively applied in clinical oncology. Bioinformatics pipeline analysis is an integral part of this process; however, humans cannot yet realize the full potential of the highly complex pipeline output. As a result, the decision to include a variant in the final report during routine clinical sign-out remains challenging. Methods We used an artificial intelligence approach to capture the collective clinical sign-out experience of six board-certified molecular pathologists to build and validate a decision support tool for variant reporting. We extracted all reviewed and reported variants from our clinical database and tested several machine learning models. We used 10-fold cross-validation for our variant call prediction model, which derives a contiguous prediction score from 0 to 1 (no to yes) for clinical reporting. Results For each of the 19,594 initial training variants, our pipeline generates approximately 500 features, which results in a matrix of > 9 million data points. From a comparison of naive Bayes, decision trees, random forests, and logistic regression models, we selected models that allow human interpretability of the prediction score. The logistic regression model demonstrated 1% false negativity and 2% false positivity. The final models' Youden indices were 0.87 and 0.77 for screening and confirmatory cutoffs, respectively. Retraining on a new assay and performance assessment in 16,123 independent variants validated our approach (Youden index, 0.93). We also derived individual pathologist-centric models (virtual consensus conference function), and a visual drill-down functionality allows assessment of how underlying features contributed to a particular score or decision branch for clinical implementation. Conclusion Our decision support tool for variant reporting is a practically relevant artificial intelligence approach to harness the next-generation sequencing bioinformatics pipeline output when the complexity of data interpretation exceeds human capabilities.
Collapse
Affiliation(s)
| | - Lev Lipkin
- All authors: Massachusetts General Hospital, Boston, MA
| | - Maciej Pacula
- All authors: Massachusetts General Hospital, Boston, MA
| | | | | | | | | | | | - Zongli Zheng
- All authors: Massachusetts General Hospital, Boston, MA
| | - Miguel Rivera
- All authors: Massachusetts General Hospital, Boston, MA
| | | | | | | | - Long P. Le
- All authors: Massachusetts General Hospital, Boston, MA
| | | |
Collapse
|
37
|
Quevedo-Tumailli VF, Ortega-Tenezaca B, González-Díaz H. Chromosome Gene Orientation Inversion Networks (GOINs) of Plasmodium Proteome. J Proteome Res 2018; 17:1258-1268. [DOI: 10.1021/acs.jproteome.7b00861] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Viviana F. Quevedo-Tumailli
- RNASA-IMEDIR,
Computer Science Faculty, University of A Coruña, 15071 A Coruña, Spain
- Universidad Estatal Amazónica UEA, Puyo, Pastaza, Ecuador
| | - Bernabé Ortega-Tenezaca
- RNASA-IMEDIR,
Computer Science Faculty, University of A Coruña, 15071 A Coruña, Spain
- Universidad Estatal Amazónica UEA, Puyo, Pastaza, Ecuador
- Universidad Regional Autónoma de los Andes UNIANDES-Puyo, Puyo, Pastaza,Ecuador
| | - Humbert González-Díaz
- Dept.
of Organic Chemistry II, University of the Basque Country UPV/EHU, 48940 Leioa, Biscay, Spain
- IKERBASQUE, Basque Foundation for Science, 48011 Bilbao, Biscay, Spain
| |
Collapse
|
38
|
Huang S, Cai N, Pacheco PP, Narrandes S, Wang Y, Xu W. Applications of Support Vector Machine (SVM) Learning in Cancer Genomics. Cancer Genomics Proteomics 2018; 15:41-51. [PMID: 29275361 PMCID: PMC5822181 DOI: 10.21873/cgp.20063] [Citation(s) in RCA: 378] [Impact Index Per Article: 54.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2017] [Revised: 10/03/2017] [Accepted: 10/23/2017] [Indexed: 12/23/2022] Open
Abstract
Machine learning with maximization (support) of separating margin (vector), called support vector machine (SVM) learning, is a powerful classification tool that has been used for cancer genomic classification or subtyping. Today, as advancements in high-throughput technologies lead to production of large amounts of genomic and epigenomic data, the classification feature of SVMs is expanding its use in cancer genomics, leading to the discovery of new biomarkers, new drug targets, and a better understanding of cancer driver genes. Herein we reviewed the recent progress of SVMs in cancer genomic studies. We intend to comprehend the strength of the SVM learning and its future perspective in cancer genomic applications.
Collapse
Affiliation(s)
- Shujun Huang
- College of Pharmacy, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Canada
- Research Institute of Oncology and Hematology, CancerCare Manitoba, Winnipeg, Canada
| | - Nianguang Cai
- Research Institute of Oncology and Hematology, CancerCare Manitoba, Winnipeg, Canada
| | - Pedro Penzuti Pacheco
- Research Institute of Oncology and Hematology, CancerCare Manitoba, Winnipeg, Canada
| | - Shavira Narrandes
- Research Institute of Oncology and Hematology, CancerCare Manitoba, Winnipeg, Canada
- Departments of Biochemistry and Medical Genetics, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Canada
| | - Yang Wang
- Department of Computer Science, Faculty of Sciences, University of Manitoba, Winnipeg, Canada
| | - Wayne Xu
- Research Institute of Oncology and Hematology, CancerCare Manitoba, Winnipeg, Canada
- Departments of Biochemistry and Medical Genetics, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Canada
- College of Pharmacy, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, Canada
| |
Collapse
|