Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Marchese Robinson RL, Palczewska A, Palczewski J, Kidley N. Comparison of the Predictive Performance and Interpretability of Random Forest and Linear Models on Benchmark Data Sets. J Chem Inf Model 2017;57:1773-1792. [PMID: 28715209 DOI: 10.1021/acs.jcim.6b00753] [Citation(s) in RCA: 59] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

For:	Marchese Robinson RL, Palczewska A, Palczewski J, Kidley N. Comparison of the Predictive Performance and Interpretability of Random Forest and Linear Models on Benchmark Data Sets. J Chem Inf Model 2017;57:1773-1792. [PMID: 28715209 DOI: 10.1021/acs.jcim.6b00753] [Citation(s) in RCA: 59] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Number

Cited by Other Article(s)

Zhai S, Tan Y, Zhu C, Zhang C, Gao Y, Mao Q, Zhang Y, Duan H, Yin Y. PepExplainer: An explainable deep learning model for selection-based macrocyclic peptide bioactivity prediction and optimization. Eur J Med Chem 2024;275:116628. [PMID: 38944933 DOI: 10.1016/j.ejmech.2024.116628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 06/21/2024] [Accepted: 06/24/2024] [Indexed: 07/02/2024]

Wei Z, Wang X, Lu L, Li S, Long W, Zhang L, Shen S. Construction of an Early Risk Prediction Model for Type 2 Diabetic Peripheral Neuropathy Based on Random Forest. Comput Inform Nurs 2024;42:665-674. [PMID: 38913980 DOI: 10.1097/cin.0000000000001157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]

Zhang R, Zhu H, Chen M, Sang W, Lu K, Li Z, Wang C, Zhang L, Yin FF, Yang Z. A dual-radiomics model for overall survival prediction in early-stage NSCLC patient using pre-treatment CT images. Front Oncol 2024;14:1419621. [PMID: 39206157 PMCID: PMC11349529 DOI: 10.3389/fonc.2024.1419621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Accepted: 07/26/2024] [Indexed: 09/04/2024] Open

Abstract

Introduction

Radiation therapy (RT) is one of the primary treatment options for early-stage non-small cell lung cancer (ES-NSCLC). Therefore, accurately predicting the overall survival (OS) rate following radiotherapy is crucial for implementing personalized treatment strategies. This work aims to develop a dual-radiomics (DR) model to (1) predict 3-year OS in ES-NSCLC patients receiving RT using pre-treatment CT images, and (2) provide explanations between feature importanceand model prediction performance.

Methods

The publicly available TCIA Lung1 dataset with 132 ES-NSCLC patients received RT were studied: 89/43 patients in the under/over 3-year OS group. For each patient, two types of radiomic features were examined: 56 handcrafted radiomic features (HRFs) extracted within gross tumor volume, and 512 image deep features (IDFs) extracted using a pre-trained U-Net encoder. They were combined as inputs to an explainable boosting machine (EBM) model for OS prediction. The EBM's mean absolute scores for HRFs and IDFs were used as feature importance explanations. To evaluate identified feature importance, the DR model was compared with EBM using either (1) key or (2) non-key feature type only. Comparison studies with other models, including supporting vector machine (SVM) and random forest (RF), were also included. The performance was evaluated by the area under the receiver operating characteristic curve (AUCROC), accuracy, sensitivity, and specificity with a 100-fold Monte Carlo cross-validation.

Results

The DR model showed highestperformance in predicting 3-year OS (AUCROC=0.81 ± 0.04), and EBM scores suggested that IDFs showed significantly greater importance (normalized mean score=0.0019) than HRFs (score=0.0008). The comparison studies showed that EBM with key feature type (IDFs-only demonstrated comparable AUCROC results (0.81 ± 0.04), while EBM with non-key feature type (HRFs-only) showed limited AUCROC (0.64 ± 0.10). The results suggested that feature importance score identified by EBM is highly correlated with OS prediction performance. Both SVM and RF models were unable to explain key feature type while showing limited overall AUCROC=0.66 ± 0.07 and 0.77 ± 0.06, respectively. Accuracy, sensitivity, and specificity showed a similar trend.

Discussion

In conclusion, a DR model was successfully developed to predict ES-NSCLC OS based on pre-treatment CT images. The results suggested that the feature importance from DR model is highly correlated to the model prediction power.

Collapse

Rodoplu Solovchuk D. Advances in AI-assisted biochip technology for biomedicine. Biomed Pharmacother 2024;177:116997. [PMID: 38943990 DOI: 10.1016/j.biopha.2024.116997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2024] [Revised: 06/13/2024] [Accepted: 06/15/2024] [Indexed: 07/01/2024] Open

Shah SK, Chaple DR, Masand VH, Jawarkar RD, Chaudhari S, Abiramasundari A, Zaki MEA, Al-Hussain SA. Multi-Target In-Silico modeling strategies to discover novel angiotensin converting enzyme and neprilysin dual inhibitors. Sci Rep 2024;14:15991. [PMID: 38987327 PMCID: PMC11237057 DOI: 10.1038/s41598-024-66230-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Accepted: 06/28/2024] [Indexed: 07/12/2024] Open

Lorca M, Muscia GC, Pérez-Benavente S, Bautista JM, Acosta A, González C, Sabadini G, Mella J, Asís SE, Mellado M. 2D/3D-QSAR Model Development Based on a Quinoline Pharmacophoric Core for the Inhibition of Plasmodium falciparum: An In Silico Approach with Experimental Validation. Pharmaceuticals (Basel) 2024;17:889. [PMID: 39065740 PMCID: PMC11279914 DOI: 10.3390/ph17070889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2024] [Revised: 06/19/2024] [Accepted: 06/27/2024] [Indexed: 07/28/2024] Open

Zhang R, Nolte D, Sanchez-Villalobos C, Ghosh S, Pal R. Topological regression as an interpretable and efficient tool for quantitative structure-activity relationship modeling. Nat Commun 2024;15:5072. [PMID: 38871711 DOI: 10.1038/s41467-024-49372-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Accepted: 06/04/2024] [Indexed: 06/15/2024] Open

Kumar N, Acharya V. Advances in machine intelligence-driven virtual screening approaches for big-data. Med Res Rev 2024;44:939-974. [PMID: 38129992 DOI: 10.1002/med.21995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Revised: 07/15/2023] [Accepted: 10/29/2023] [Indexed: 12/23/2023]

Chou RT, Ouattara A, Adams M, Berry AA, Takala-Harrison S, Cummings MP. Positive-unlabeled learning identifies vaccine candidate antigens in the malaria parasite Plasmodium falciparum. NPJ Syst Biol Appl 2024;10:44. [PMID: 38678051 PMCID: PMC11055854 DOI: 10.1038/s41540-024-00365-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Accepted: 03/29/2024] [Indexed: 04/29/2024] Open

Zhang S, Luo X, Mai B. Multi-task machine learning models for simultaneous prediction of tissue-to-blood partition coefficients of chemicals in mammals. ENVIRONMENTAL RESEARCH 2024;241:117603. [PMID: 37939805 DOI: 10.1016/j.envres.2023.117603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 10/25/2023] [Accepted: 11/04/2023] [Indexed: 11/10/2023]

Lu J, Ji X, Liu X, Jiang Y, Li G, Fang P, Li W, Zuo A, Guo Z, Yang S, Ji Y, Lu D. Machine learning-based radiomics strategy for prediction of acquired EGFR T790M mutation following treatment with EGFR-TKI in NSCLC. Sci Rep 2024;14:446. [PMID: 38172228 PMCID: PMC10764785 DOI: 10.1038/s41598-023-50984-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Accepted: 12/28/2023] [Indexed: 01/05/2024] Open

Affiliation(s)

Jiameng Lu Department of Respiratory, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Shandong Institute of Respiratory Diseases, Shandong Institute of Anesthesia and Respiratory Critical Medicine, 16766 Jingshilu, Lixia, Jinan, 250014, Shandong, People's Republic of China School of Microelectronics, Shandong University, Jinan, 250100, Shandong, People's Republic of China
Xiaoqing Ji Department of Nursing, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Jinan, 250014, Shandong, People's Republic of China
Xinyi Liu Graduate School of Shandong First Medical University, Jinan, 250000, Shandong, People's Republic of China
Yunxiu Jiang Graduate School of Shandong First Medical University, Jinan, 250000, Shandong, People's Republic of China
Gang Li Department of Radiology, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Shandong Medicine and Health Key Laboratory of Abdominal Medicine Imaging, Shandong Lung Cancer Institute, Shandong Institute of Neuroimmunology, Jinan, 250000, Shandong, China
Ping Fang Department of Blood Transfusion, The First Affiliated Hospital of Shandong First Medical University and Shandong Province Qianfoshan Hospital, Jinan, 250014, Shandong, China
Wei Li Department of Radiology, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Shandong Medicine and Health Key Laboratory of Abdominal Medicine Imaging, Shandong Lung Cancer Institute, Shandong Institute of Neuroimmunology, Jinan, 250000, Shandong, China
Anli Zuo Graduate School of Shandong First Medical University, Jinan, 250000, Shandong, People's Republic of China
Zihan Guo Graduate School of Shandong First Medical University, Jinan, 250000, Shandong, People's Republic of China
Shuran Yang Graduate School of Shandong First Medical University, Jinan, 250000, Shandong, People's Republic of China
Yanbo Ji Department of Nursing, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Jinan, 250014, Shandong, People's Republic of China
Degan Lu Department of Respiratory, The First Affiliated Hospital of Shandong First Medical University and Shandong Provincial Qianfoshan Hospital, Shandong Institute of Respiratory Diseases, Shandong Institute of Anesthesia and Respiratory Critical Medicine, 16766 Jingshilu, Lixia, Jinan, 250014, Shandong, People's Republic of China.

Collapse

Wang C, Liu J, Qiu C, Su X, Ma N, Li J, Wang S, Qu S. Identifying the drivers of chlorophyll-a dynamics in a landscape lake recharged by reclaimed water using interpretable machine learning. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024;906:167483. [PMID: 37832666 DOI: 10.1016/j.scitotenv.2023.167483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Revised: 09/21/2023] [Accepted: 09/28/2023] [Indexed: 10/15/2023]

Gao T, Ren H, He S, Liang D, Xu Y, Chen K, Wang Y, Zhu Y, Dong H, Xu Z, Chen W, Cheng W, Jing F, Tao X. Development of an interpretable machine learning-based intelligent system of exercise prescription for cardio-oncology preventive care: A study protocol. Front Cardiovasc Med 2023;9:1091885. [PMID: 38106819 PMCID: PMC10722170 DOI: 10.3389/fcvm.2022.1091885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 12/12/2022] [Indexed: 12/19/2023] Open

Abstract

Background

Cardiovascular disease (CVD) and cancer are the first and second causes of death in over 130 countries across the world. They are also among the top three causes in almost 180 countries worldwide. Cardiovascular complications are often noticed in cancer patients, with nearly 20% exhibiting cardiovascular comorbidities. Physical exercise may be helpful for cancer survivors and people living with cancer (PLWC), as it prevents relapses, CVD, and cardiotoxicity. Therefore, it is beneficial to recommend exercise as part of cardio-oncology preventive care.

Objective

With the progress of deep learning algorithms and the improvement of big data processing techniques, artificial intelligence (AI) has gradually become popular in the fields of medicine and healthcare. In the context of the shortage of medical resources in China, it is of great significance to adopt AI and machine learning methods for prescription recommendations. This study aims to develop an interpretable machine learning-based intelligent system of exercise prescription for cardio-oncology preventive care, and this paper presents the study protocol.

Methods

This will be a retrospective machine learning modeling cohort study with interventional methods (i.e., exercise prescription). We will recruit PLWC participants at baseline (from 1 January 2025 to 31 December 2026) and follow up over several years (from 1 January 2027 to 31 December 2028). Specifically, participants will be eligible if they are (1) PLWC in Stage I or cancer survivors from Stage I; (2) aged between 18 and 55 years; (3) interested in physical exercise for rehabilitation; (4) willing to wear smart sensors/watches; (5) assessed by doctors as suitable for exercise interventions. At baseline, clinical exercise physiologist certificated by the joint training program (from 1 January 2023 to 31 December 2024) of American College of Sports Medicine and Chinese Association of Sports Medicine will recommend exercise prescription to each participant. During the follow-up, effective exercise prescription will be determined by assessing the CVD status of the participants.

Expected outcomes

This study aims to develop not only an interpretable machine learning model to recommend exercise prescription but also an intelligent system of exercise prescription for precision cardio-oncology preventive care.

Ethics

This study is approved by Human Experimental Ethics Inspection of Guangzhou Sport University.

Clinical trial registration

http://www.chictr.org.cn, identifier ChiCTR2300077887.

Collapse

Affiliation(s)

Tianyu Gao School of Physical Education, Jinan University, Guangzhou, China
Hao Ren Institute for Healthcare Artificial Intelligence Application, Guangdong Second Provincial General Hospital, Guangzhou, China Faculty of Data Science, City University of Macau, Macao, Macao SAR, China
Shan He Guangzhou Sport University, Guangzhou, China
Deyi Liang Guangdong Women and Children Hospital, Guangzhou, China
Yuming Xu Division of Physical Education, Guangdong University of Finance and Economics, Guangzhou, China School of Education, City University of Macau, Macao, Macao SAR, China
Kecheng Chen School of Data Science, City University of Hong Kong, Hong Kong, Hong Kong SAR, China
Yufan Wang Department of Industrial Engineering and Management, School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai, China
Yuxin Zhu Syns Institute of Educational Research, Hong Kong, Hong Kong SAR, China
Heling Dong School of Physical Education, Jinan University, Guangzhou, China
Zhongzhi Xu School of Public Health, Sun Yat-Sen University, Guangzhou, China
Weiming Chen Department of Health Medicine, Guangdong Second Provincial General Hospital, Guangzhou, China
Weibin Cheng Institute for Healthcare Artificial Intelligence Application, Guangdong Second Provincial General Hospital, Guangzhou, China School of Data Science, City University of Hong Kong, Hong Kong, Hong Kong SAR, China
Fengshi Jing Institute for Healthcare Artificial Intelligence Application, Guangdong Second Provincial General Hospital, Guangzhou, China Faculty of Data Science, City University of Macau, Macao, Macao SAR, China UNC Project-China, UNC Global, School of Medicine, The University of North Carolina, Chapel Hill, NC, United States
Xiaoyu Tao Zhuhai College of Science and Technology, Zhuhai, China ZCST Health and Medicine Industry Research Institute, Zhuhai, China

Collapse

Jia X, Wang T, Zhu H. Advancing Computational Toxicology by Interpretable Machine Learning. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023;57:17690-17706. [PMID: 37224004 PMCID: PMC10666545 DOI: 10.1021/acs.est.3c00653] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 05/05/2023] [Accepted: 05/05/2023] [Indexed: 05/26/2023]

Al-Maini M, Maindarkar M, Kitas GD, Khanna NN, Misra DP, Johri AM, Mantella L, Agarwal V, Sharma A, Singh IM, Tsoulfas G, Laird JR, Faa G, Teji J, Turk M, Viskovic K, Ruzsa Z, Mavrogeni S, Rathore V, Miner M, Kalra MK, Isenovic ER, Saba L, Fouda MM, Suri JS. Artificial intelligence-based preventive, personalized and precision medicine for cardiovascular disease/stroke risk assessment in rheumatoid arthritis patients: a narrative review. Rheumatol Int 2023;43:1965-1982. [PMID: 37648884 DOI: 10.1007/s00296-023-05415-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 07/31/2023] [Indexed: 09/01/2023]

Abstract

The challenges associated with diagnosing and treating cardiovascular disease (CVD)/Stroke in Rheumatoid arthritis (RA) arise from the delayed onset of symptoms. Existing clinical risk scores are inadequate in predicting cardiac events, and conventional risk factors alone do not accurately classify many individuals at risk. Several CVD biomarkers consider the multiple pathways involved in the development of atherosclerosis, which is the primary cause of CVD/Stroke in RA. To enhance the accuracy of CVD/Stroke risk assessment in the RA framework, a proposed approach involves combining genomic-based biomarkers (GBBM) derived from plasma and/or serum samples with innovative non-invasive radiomic-based biomarkers (RBBM), such as measurements of synovial fluid, plaque area, and plaque burden. This review presents two hypotheses: (i) RBBM and GBBM biomarkers exhibit a significant correlation and can precisely detect the severity of CVD/Stroke in RA patients. (ii) Artificial Intelligence (AI)-based preventive, precision, and personalized (aiP3) CVD/Stroke risk AtheroEdge™ model (AtheroPoint™, CA, USA) that utilizes deep learning (DL) to accurately classify the risk of CVD/stroke in RA framework. The authors conducted a comprehensive search using the PRISMA technique, identifying 153 studies that assessed the features/biomarkers of RBBM and GBBM for CVD/Stroke. The study demonstrates how DL models can be integrated into the AtheroEdge™-aiP3 framework to determine the risk of CVD/Stroke in RA patients. The findings of this review suggest that the combination of RBBM with GBBM introduces a new dimension to the assessment of CVD/Stroke risk in the RA framework. Synovial fluid levels that are higher than normal lead to an increase in the plaque burden. Additionally, the review provides recommendations for novel, unbiased, and pruned DL algorithms that can predict CVD/Stroke risk within a RA framework that is preventive, precise, and personalized.

Collapse

Affiliation(s)

Mustafa Al-Maini Allergy, Clinical Immunology and Rheumatology Institute, Toronto, ON, L4Z 4C4, Canada
Mahesh Maindarkar Stroke Monitoring and Diagnostic Division, AtheroPoint™, Roseville, CA, 95661, USA Asia Pacific Vascular Society, New Delhi, 110001, India
George D Kitas Academic Affairs, Dudley Group NHS Foundation Trust, Dudley, DY1 2HQ, UK Arthritis Research UK Epidemiology Unit, Manchester University, Manchester, M13 9PL, UK
Narendra N Khanna Asia Pacific Vascular Society, New Delhi, 110001, India Department of Cardiology, Indraprastha APOLLO Hospitals, New Delhi, 110001, India
Durga Prasanna Misra Department of Immunology, SGPIMS, Lucknow, 226014, India
Amer M Johri Division of Cardiology, Department of Medicine, Queen's University, Kingston, Canada
Laura Mantella Division of Cardiology, Department of Medicine, University of Toronto, Toronto, Canada
Vikas Agarwal Department of Immunology, SGPIMS, Lucknow, 226014, India
Aman Sharma Department of Immunology, SGPIMS, Lucknow, 226014, India
Inder M Singh Stroke Monitoring and Diagnostic Division, AtheroPoint™, Roseville, CA, 95661, USA
George Tsoulfas Department of Surgery, Aristoteleion University of Thessaloniki, 54124, Thessaloniki, Greece
John R Laird Heart and Vascular Institute, Adventist Health St. Helena, St Helena, CA, 94574, USA
Gavino Faa Department of Pathology, Azienda Ospedaliero Universitaria, 09124, Cagliari, Italy
Jagjit Teji Ann and Robert H. Lurie Children's Hospital of Chicago, Chicago, IL, 60611, USA
Monika Turk The Hanse-Wissenschaftskolleg Institute for Advanced Study, 27753, Delmenhorst, Germany
Klaudija Viskovic Department of Radiology and Ultrasound, UHID, 10 000, Zagreb, Croatia
Zoltan Ruzsa Invasive Cardiology Division, University of Szeged, Szeged, Hungary
Sophie Mavrogeni Cardiology Clinic, Onassis Cardiac Surgery Centre, Athens, Greece
Vijay Rathore Nephrology Department, Kaiser Permanente, Sacramento, CA, 95823, USA
Martin Miner Men's Health Centre, Miriam Hospital Providence, Providence, RI, 02906, USA
Manudeep K Kalra Department of Radiology, Harvard Medical School, Boston, MA, USA
Esma R Isenovic Department of Radiobiology and Molecular Genetics, National Institute of the Republic of Serbia, University of Belgrade, 11000, Belgrade, Serbia
Luca Saba Department of Radiology, Azienda Ospedaliero Universitaria, 40138, Cagliari, Italy
Mostafa M Fouda Department of Electrical and Computer Engineering, Idaho State University, Pocatello, ID, 83209, USA
Jasjit S Suri Stroke Monitoring and Diagnostic Division, AtheroPoint™, Roseville, CA, 95661, USA.

Collapse

Luo L, Li B, Wang X, Cui L, Liu G. Interpretable spatial identity neural network-based epidemic prediction. Sci Rep 2023;13:18159. [PMID: 37875546 PMCID: PMC10598274 DOI: 10.1038/s41598-023-45177-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 10/17/2023] [Indexed: 10/26/2023] Open

Xiang Y, Tang YH, Lin G, Reker D. Interpretable Molecular Property Predictions Using Marginalized Graph Kernels. J Chem Inf Model 2023;63:4633-4640. [PMID: 37504964 DOI: 10.1021/acs.jcim.3c00396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]

Charvet CJ, Ofori K, Falcone C, Rigby Dames BA. Transcription, structure, and organoids translate time across the lifespan of humans and great apes. PNAS NEXUS 2023;2:pgad230. [PMID: 37554928 PMCID: PMC10406161 DOI: 10.1093/pnasnexus/pgad230] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 02/20/2023] [Accepted: 07/13/2023] [Indexed: 08/10/2023]

Rossi RJ, Tisherman RA, Jaeger JM, Domen J, Shonkoff SBC, DiGiulio DC. Historic and Contemporary Surface Disposal of Produced Water Likely Inputs Arsenic and Selenium to Surficial Aquifers. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023;57:7559-7567. [PMID: 37146013 DOI: 10.1021/acs.est.3c01219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]

Wu Y, Grant S, Chen W, Szarka A. Refining acute human exposure assessment to pesticides in surface water: An integrated data-driven modeling approach. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023;865:161190. [PMID: 36581287 DOI: 10.1016/j.scitotenv.2022.161190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Revised: 12/03/2022] [Accepted: 12/21/2022] [Indexed: 06/17/2023]

Abstract

The substantial spatial and temporal variability of pesticides has led to large uncertainties when determining their peak aqueous concentrations. There is however a lack of large-scale studies dealing with accurate determination of annual maximum daily concentration (AMDC) across the landscape and over time based on the publicly available monitoring data. We developed a novel data-driven approach that firstly used time series modeling to generate AMDCs for qualified water monitoring sites in the conterminous U.S. With feature variables such as pesticide use and land cover compiled into the dataset, machine learning models using eXtreme Gradient Boosting (XGBoost) and Random Forest Regressor (RF) were then developed to estimate AMDCs in surface waters across the U.S. Both models exhibited significant predictability, while a hybrid model consisting of the average predictions by XGBoost and RF model had the highest prediction accuracy (mean absolute error (MAE): 1.23; R²: 0.61). The analysis of permutation variable importance indicated that pesticide use and drainage area were the two most important drivers. Partial dependence analysis revealed that pesticide use, precipitation, cultivated crop land cover and solubility exhibited concentration-promoting effects, whereas drainage area and molecular weight had concentration-demoting effects. Soil adsorption coefficient (Koc) showed nonmonotonic effects. The hybrid model was used to predict and map AMDCs of four example pesticides, including 2,4-dichlorophenoxyacetic acid (2,4-D), atrazine, glyphosate and imidacloprid during 2016-2019 at national scale. The predictive capability was validated using independent monitoring datasets. The fully evaluated approach significantly reduced the uncertainties in modeling annual peak concentrations and served as a valuable solution for conducting geographically oriented, highly refined exposure assessments for pesticides.

Collapse

Zhang HS, Feng QD, Zhang DY, Zhu GL, Yang L. Bacterial community structure in geothermal springs on the northern edge of Qinghai-Tibet plateau. Front Microbiol 2023;13:994179. [PMID: 37180363 PMCID: PMC10172933 DOI: 10.3389/fmicb.2022.994179] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 12/13/2022] [Indexed: 03/19/2023] Open

Romero-Gainza E, Stewart C. AI-Driven Validation of Digital Agriculture Models. SENSORS (BASEL, SWITZERLAND) 2023;23:1187. [PMID: 36772227 PMCID: PMC9919666 DOI: 10.3390/s23031187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 01/12/2023] [Accepted: 01/16/2023] [Indexed: 06/18/2023]

Yang J, Zhang D, Cai Y, Yu K, Li M, Liu L, Chen X. Computational Prediction of Drug Phenotypic Effects Based on Substructure-Phenotype Associations. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023;20:256-265. [PMID: 35239490 DOI: 10.1109/tcbb.2022.3155453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Abstract

Identifying drug phenotypic effects, including therapeutic effects and adverse drug reactions (ADRs), is an inseparable part for evaluating the potentiality of new drug candidates (NDCs). However, current computational methods for predicting phenotypic effects of NDCs are mainly based on the overall structure of an NDC or a related target. These approaches often lead to inconsistencies between the structures and functions and limit the prediction space of NDCs. In this study, first, we constructed quantitative associations of substructure-domain, domain-ADR, and domain-ATC (Anatomical Therapeutic Chemical Classification System code) through L1LOG and L1SVM machine learning models. These associations represent relationships between phenotypes (ADRs and ATCs) and local structures of drugs and proteins. Then, based on these established associations, substructure-phenotype relationships were constructed which were utilized to quantify drug-phenotype relationships. Thus, this approach could achieve high-throughput and effective evaluations of the druggability of NDCs by referring to the established substructure-phenotype relationships and structural information of NDCs without additional prior knowledge. Using this computational pipeline, 83,205 drug-ATC relationships (including 1,479 drugs and 178 ATCs) and 306,421 drug-ADR relationships (including 1,752 drugs and 454 ADRs) were predicted in total. The prediction results were validated at four levels: five-fold cross validation, public databases, literature, and molecular docking. Furthermore, three case studies demonstrated the feasibility of our method. 79 ATCs and 269 ADRs were predicted to be related to Maraviroc, an approved drug, including the existing antiviral effect in clinical use. Additionally, we also found risk substructures of severe ADRs, for example, SUB215 (>= 1, saturated or only aromatic carbon ring size 7) can result in shock. And we analyzed the mechanism of action (MOA) of interested drugs based on the established drug-substructure-domain-protein associations. In a word, this approach through establishing drug-substructure-phenotype relationships can achieve quantitative prediction of phenotypes for a given NDC or drug without any prior knowledge except its structure information. Using that way, we can directly obtain the relationships between substructure and phenotype of a compound, which is more convenient to analyze the phenotypic mechanism of drugs and accelerate the process of rational drug design.

Collapse

Wang S, Wang J, Zhu MX, Tan Q. Machine learning for the prediction of minor amputation in University of Texas grade 3 diabetic foot ulcers. PLoS One 2022;17:e0278445. [PMID: 36472981 PMCID: PMC9725167 DOI: 10.1371/journal.pone.0278445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Accepted: 11/16/2022] [Indexed: 12/12/2022] Open

Khanna NN, Maindarkar MA, Viswanathan V, Puvvula A, Paul S, Bhagawati M, Ahluwalia P, Ruzsa Z, Sharma A, Kolluri R, Krishnan PR, Singh IM, Laird JR, Fatemi M, Alizad A, Dhanjil SK, Saba L, Balestrieri A, Faa G, Paraskevas KI, Misra DP, Agarwal V, Sharma A, Teji JS, Al-Maini M, Nicolaides A, Rathore V, Naidu S, Liblik K, Johri AM, Turk M, Sobel DW, Miner M, Viskovic K, Tsoulfas G, Protogerou AD, Mavrogeni S, Kitas GD, Fouda MM, Kalra MK, Suri JS. Cardiovascular/Stroke Risk Stratification in Diabetic Foot Infection Patients Using Deep Learning-Based Artificial Intelligence: An Investigative Study. J Clin Med 2022;11:6844. [PMID: 36431321 PMCID: PMC9693632 DOI: 10.3390/jcm11226844] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 11/15/2022] [Accepted: 11/16/2022] [Indexed: 11/22/2022] Open

Abstract

A diabetic foot infection (DFI) is among the most serious, incurable, and costly to treat conditions. The presence of a DFI renders machine learning (ML) systems extremely nonlinear, posing difficulties in CVD/stroke risk stratification. In addition, there is a limited number of well-explained ML paradigms due to comorbidity, sample size limits, and weak scientific and clinical validation methodologies. Deep neural networks (DNN) are potent machines for learning that generalize nonlinear situations. The objective of this article is to propose a novel investigation of deep learning (DL) solutions for predicting CVD/stroke risk in DFI patients. The Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) search strategy was used for the selection of 207 studies. We hypothesize that a DFI is responsible for increased morbidity and mortality due to the worsening of atherosclerotic disease and affecting coronary artery disease (CAD). Since surrogate biomarkers for CAD, such as carotid artery disease, can be used for monitoring CVD, we can thus use a DL-based model, namely, Long Short-Term Memory (LSTM) and Recurrent Neural Networks (RNN) for CVD/stroke risk prediction in DFI patients, which combines covariates such as office and laboratory-based biomarkers, carotid ultrasound image phenotype (CUSIP) lesions, along with the DFI severity. We confirmed the viability of CVD/stroke risk stratification in the DFI patients. Strong designs were found in the research of the DL architectures for CVD/stroke risk stratification. Finally, we analyzed the AI bias and proposed strategies for the early diagnosis of CVD/stroke in DFI patients. Since DFI patients have an aggressive atherosclerotic disease, leading to prominent CVD/stroke risk, we, therefore, conclude that the DL paradigm is very effective for predicting the risk of CVD/stroke in DFI patients.

Collapse

Affiliation(s)

Narendra N. Khanna Department of Cardiology, Indraprastha APOLLO Hospitals, New Delhi 110001, India
Mahesh A. Maindarkar Stroke Monitoring and Diagnostic Division, AtheroPoint™, Roseville, CA 95661, USA Department of Biomedical Engineering, North Eastern Hill University, Shillong 793022, India
Vijay Viswanathan MV Diabetes Centre, Royapuram, Chennai 600013, India
Anudeep Puvvula Stroke Monitoring and Diagnostic Division, AtheroPoint™, Roseville, CA 95661, USA Annu’s Hospitals for Skin and Diabetes, Nellore 524101, India
Sudip Paul Department of Biomedical Engineering, North Eastern Hill University, Shillong 793022, India
Mrinalini Bhagawati Department of Biomedical Engineering, North Eastern Hill University, Shillong 793022, India
Puneet Ahluwalia Max Institute of Cancer Care, Max Super Specialty Hospital, New Delhi 110017, India
Zoltan Ruzsa Invasive Cardiology Division, Faculty of Medicine, University of Szeged, 6720 Szeged, Hungary
Aditya Sharma Division of Cardiovascular Medicine, University of Virginia, Charlottesville, VA 22904, USA
Raghu Kolluri Ohio Health Heart and Vascular, Columbus, OH 43214, USA
Padukone R. Krishnan Neurology Department, Fortis Hospital, Bangalore 560076, India
Inder M. Singh Stroke Monitoring and Diagnostic Division, AtheroPoint™, Roseville, CA 95661, USA
John R. Laird Heart and Vascular Institute, Adventist Health St. Helena, St Helena, CA 94574, USA
Mostafa Fatemi Department of Physiology & Biomedical Engineering, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA
Azra Alizad Department of Radiology, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA
Surinder K. Dhanjil Stroke Monitoring and Diagnostic Division, AtheroPoint™, Roseville, CA 95661, USA
Luca Saba Department of Radiology, Azienda Ospedaliero Universitaria, 40138 Cagliari, Italy
Antonella Balestrieri Cardiovascular Prevention and Research Unit, Department of Pathophysiology, National & Kapodistrian University of Athens, 15772 Athens, Greece
Gavino Faa Department of Pathology, Azienda Ospedaliero Universitaria, 09124 Cagliari, Italy
Kosmas I. Paraskevas Department of Vascular Surgery, Central Clinic of Athens, 15772 Athens, Greece
Durga Prasanna Misra Department of Immunology, SGPGIMS, Lucknow 226014, India
Vikas Agarwal Department of Immunology, SGPGIMS, Lucknow 226014, India
Aman Sharma Department of Immunology, SGPGIMS, Lucknow 226014, India
Jagjit S. Teji Ann and Robert H. Lurie Children’s Hospital of Chicago, Chicago, IL 60611, USA
Mustafa Al-Maini Allergy, Clinical Immunology and Rheumatology Institute, Toronto, ON L4Z 4C4, Canada
Andrew Nicolaides Vascular Screening and Diagnostic Centre, University of Nicosia Medical School, Egkomi 2408, Cyprus
Vijay Rathore AtheroPoint™, Roseville, CA 95661, USA
Subbaram Naidu Electrical Engineering Department, University of Minnesota, Duluth, MN 55812, USA
Kiera Liblik Department of Medicine, Division of Cardiology, Queen’s University, Kingston, ON K7L 3N6, Canada
Amer M. Johri Department of Medicine, Division of Cardiology, Queen’s University, Kingston, ON K7L 3N6, Canada
Monika Turk The Hanse-Wissenschaftskolleg Institute for Advanced Study, 27753 Delmenhorst, Germany
David W. Sobel Rheumatology Unit, National Kapodistrian University of Athens, 15772 Athens, Greece
Martin Miner Men’s Health Centre, Miriam Hospital Providence, Providence, RI 02906, USA
Klaudija Viskovic Department of Radiology and Ultrasound, University Hospital for Infectious Diseases, 10000 Zagreb, Croatia
George Tsoulfas Department of Surgery, Aristoteleion University of Thessaloniki, 54124 Thessaloniki, Greece
Athanasios D. Protogerou Cardiovascular Prevention and Research Unit, Department of Pathophysiology, National & Kapodistrian University of Athens, 15772 Athens, Greece
Sophie Mavrogeni Cardiology Clinic, Onassis Cardiac Surgery Centre, 17674 Athens, Greece
George D. Kitas Academic Affairs, Dudley Group NHS Foundation Trust, Dudley DY1 2HQ, UK Arthritis Research UK Epidemiology Unit, Manchester University, Manchester M13 9PL, UK
Mostafa M. Fouda Department of Electrical and Computer Engineering, Idaho State University, Pocatello, ID 83209, USA
Mannudeep K. Kalra Department of Radiology, Harvard Medical School, Boston, MA 02115, USA
Jasjit S. Suri Stroke Monitoring and Diagnostic Division, AtheroPoint™, Roseville, CA 95661, USA

Collapse

Bellamy H, Rehim AA, Orhobor OI, King R. Batched Bayesian Optimization for Drug Design in Noisy Environments. J Chem Inf Model 2022;62:3970-3981. [PMID: 36044048 PMCID: PMC9472273 DOI: 10.1021/acs.jcim.2c00602] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Effah CY, Miao R, Drokow EK, Agboyibor C, Qiao R, Wu Y, Miao L, Wang Y. Machine learning-assisted prediction of pneumonia based on non-invasive measures. Front Public Health 2022;10:938801. [PMID: 35968461 PMCID: PMC9371749 DOI: 10.3389/fpubh.2022.938801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2022] [Accepted: 06/23/2022] [Indexed: 11/13/2022] Open

Abstract

Background

Pneumonia is an infection of the lungs that is characterized by high morbidity and mortality. The use of machine learning systems to detect respiratory diseases via non-invasive measures such as physical and laboratory parameters is gaining momentum and has been proposed to decrease diagnostic uncertainty associated with bacterial pneumonia. Herein, this study conducted several experiments using eight machine learning models to predict pneumonia based on biomarkers, laboratory parameters, and physical features.

Methods

We perform machine-learning analysis on 535 different patients, each with 45 features. Data normalization to rescale all real-valued features was performed. Since it is a binary problem, we categorized each patient into one class at a time. We designed three experiments to evaluate the models: (1) feature selection techniques to select appropriate features for the models, (2) experiments on the imbalanced original dataset, and (3) experiments on the SMOTE data. We then compared eight machine learning models to evaluate their effectiveness in predicting pneumonia

Results

Biomarkers such as C-reactive protein and procalcitonin demonstrated the most significant discriminating power. Ensemble machine learning models such as RF (accuracy = 92.0%, precision = 91.3%, recall = 96.0%, f1-Score = 93.6%) and XGBoost (accuracy = 90.8%, precision = 92.6%, recall = 92.3%, f1-score = 92.4%) achieved the highest performance accuracy on the original dataset with AUCs of 0.96 and 0.97, respectively. On the SMOTE dataset, RF and XGBoost achieved the highest prediction results with f1-scores of 92.0 and 91.2%, respectively. Also, AUC of 0.97 was achieved for both RF and XGBoost models.

Conclusions

Our models showed that in the diagnosis of pneumonia, individual clinical history, laboratory indicators, and symptoms do not have adequate discriminatory power. We can also conclude that the ensemble ML models performed better in this study.

Collapse

Artificial intelligence and machine-learning approaches in structure and ligand-based discovery of drugs affecting central nervous system. Mol Divers 2022;27:959-985. [PMID: 35819579 DOI: 10.1007/s11030-022-10489-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 06/21/2022] [Indexed: 12/11/2022]

Beers AT, Frey SN. Greater sage‐grouse habitat selection varies across the marginal habitat of its lagging range margin. Ecosphere 2022. [DOI: 10.1002/ecs2.4146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open

Meli R, Morris GM, Biggin PC. Scoring Functions for Protein-Ligand Binding Affinity Prediction using Structure-Based Deep Learning: A Review. FRONTIERS IN BIOINFORMATICS 2022;2:885983. [PMID: 36187180 PMCID: PMC7613667 DOI: 10.3389/fbinf.2022.885983] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 05/11/2022] [Indexed: 01/01/2023] Open

Jiménez-Luna J, Skalic M, Weskamp N. Benchmarking Molecular Feature Attribution Methods with Activity Cliffs. J Chem Inf Model 2022;62:274-283. [PMID: 35019265 DOI: 10.1021/acs.jcim.1c01163] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]

Polypharmacology: The science of multi-targeting molecules. Pharmacol Res 2022;176:106055. [PMID: 34990865 DOI: 10.1016/j.phrs.2021.106055] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 12/23/2021] [Accepted: 12/31/2021] [Indexed: 12/28/2022]

Wang S, Xia C, Zheng Q, Wang A, Tan Q. Machine Learning Models for Predicting the Risk of Hard-to-Heal Diabetic Foot Ulcers in a Chinese Population. Diabetes Metab Syndr Obes 2022;15:3347-3359. [PMID: 36341229 PMCID: PMC9628710 DOI: 10.2147/dmso.s383960] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 10/20/2022] [Indexed: 11/05/2022] Open

Abstract

BACKGROUND

Early detection of hard-to-heal diabetic foot ulcers (DFUs) is vital to prevent a poor prognosis. The purpose of this work was to employ clinical characteristics to create an optimal predictive model of hard-to-heal DFUs (failing to decrease by >50% at 4 weeks) based on machine learning algorithms.

METHODS

A total of 362 DFU patients hospitalized in two tertiary hospitals in eastern China were enrolled in this study. The training dataset and validation dataset were split at a ratio of 7:3. Univariate logistic analysis and clinical experience were utilized to screen clinical characteristics as predictive features. The following six machine learning algorithms were used to build prediction models for differentiating hard-to-heal DFUs: support vector machine, the naïve Bayesian (NB) model, k-nearest neighbor, general linear regression, adaptive boosting, and random forest. Five cross-validations were employed to realize the model's parameters. Accuracy, precision, recall, F1-scores, and AUCs were utilized to compare and evaluate the models' efficacy. On the basis of the best model identified, the significance of each characteristic was evaluated, and then an online calculator was developed.

RESULTS

Independent predictors for model establishment included sex, insulin use, random blood glucose, wound area, diabetic retinopathy, peripheral arterial disease, smoking history, serum albumin, serum creatinine, and C-reactive protein. After evaluation, the NB model was identified as the most generalizable model, with an AUC of 0.864, a recall of 0.907, and an F1-score of 0.744. Random blood glucose, C-reactive protein, and wound area were determined to be the three most important influencing factors. A corresponding online calculator was created (https://predicthardtoheal.azurewebsites.net/).

CONCLUSION

Based on clinical characteristics, machine learning algorithms can achieve acceptable predictions of hard-to-heal DFUs, with the NB model performing the best. Our online calculator can assist doctors in identifying the possibility of hard-to-heal DFUs at the time of admission to reduce the likelihood of a dismal prognosis.

Collapse

Zhan M, Chen Z, Ding C, Qu Q, Wang G, Liu S, Wen F. Risk prediction for delayed clearance of high-dose methotrexate in pediatric hematological malignancies by machine learning. Int J Hematol 2021;114:483-493. [PMID: 34170480 DOI: 10.1007/s12185-021-03184-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Revised: 06/21/2021] [Accepted: 06/21/2021] [Indexed: 10/21/2022]

Ye Z, Yang W, Yang Y, Ouyang D. Interpretable machine learning methods for in vitro pharmaceutical formulation development. FOOD FRONTIERS 2021. [DOI: 10.1002/fft2.78] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open

Vatansever S, Schlessinger A, Wacker D, Kaniskan HÜ, Jin J, Zhou M, Zhang B. Artificial intelligence and machine learning-aided drug discovery in central nervous system diseases: State-of-the-arts and future directions. Med Res Rev 2021;41:1427-1473. [PMID: 33295676 PMCID: PMC8043990 DOI: 10.1002/med.21764] [Citation(s) in RCA: 102] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 10/30/2020] [Accepted: 11/20/2020] [Indexed: 01/11/2023]

Affiliation(s)

Sezen Vatansever Department of Genetics and Genomic SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA Mount Sinai Center for Transformative Disease ModelingIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA Icahn Institute for Data Science and Genomic TechnologyIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
Avner Schlessinger Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
Daniel Wacker Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA Department of NeuroscienceIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
H. Ümit Kaniskan Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA Department of Oncological Sciences, Tisch Cancer InstituteIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
Jian Jin Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA Department of Oncological Sciences, Tisch Cancer InstituteIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
Ming‐Ming Zhou Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA Department of Oncological Sciences, Tisch Cancer InstituteIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
Bin Zhang Department of Genetics and Genomic SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA Mount Sinai Center for Transformative Disease ModelingIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA Icahn Institute for Data Science and Genomic TechnologyIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA

Collapse

Gousiadou C, Marchese Robinson RL, Kotzabasaki M, Doganis P, Wilkins TA, Jia X, Sarimveis H, Harper SL. Machine learning predictions of concentration-specific aggregate hazard scores of inorganic nanomaterials in embryonic zebrafish. Nanotoxicology 2021;15:446-476. [PMID: 33586589 DOI: 10.1080/17435390.2021.1872113] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]

Barton-Henry K, Wenz L, Levermann A. Decay radius of climate decision for solar panels in the city of Fresno, USA. Sci Rep 2021;11:8571. [PMID: 33883574 PMCID: PMC8060319 DOI: 10.1038/s41598-021-87714-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Accepted: 03/30/2021] [Indexed: 11/09/2022] Open

Wu Z, Jiang D, Hsieh CY, Chen G, Liao B, Cao D, Hou T. Hyperbolic relational graph convolution networks plus: a simple but highly efficient QSAR-modeling method. Brief Bioinform 2021;22:6235968. [PMID: 33866354 DOI: 10.1093/bib/bbab112] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Revised: 03/11/2021] [Accepted: 03/12/2021] [Indexed: 01/04/2023] Open

Alcantara RS, Day EM, Hahn ME, Grabowski AM. Sacral acceleration can predict whole-body kinetics and stride kinematics across running speeds. PeerJ 2021;9:e11199. [PMID: 33954039 PMCID: PMC8048400 DOI: 10.7717/peerj.11199] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Accepted: 03/10/2021] [Indexed: 12/31/2022] Open

Abstract

Background

Stress fractures are injuries caused by repetitive loading during activities such as running. The application of advanced analytical methods such as machine learning to data from multiple wearable sensors has allowed for predictions of biomechanical variables associated with running-related injuries like stress fractures. However, it is unclear if data from a single wearable sensor can accurately estimate variables that characterize external loading during running such as peak vertical ground reaction force (vGRF), vertical impulse, and ground contact time. Predicting these biomechanical variables with a single wearable sensor could allow researchers, clinicians, and coaches to longitudinally monitor biomechanical running-related injury risk factors without expensive force-measuring equipment.

Purpose

We quantified the accuracy of applying quantile regression forest (QRF) and linear regression (LR) models to sacral-mounted accelerometer data to predict peak vGRF, vertical impulse, and ground contact time across a range of running speeds.

Methods

Thirty-seven collegiate cross country runners (24 females, 13 males) ran on a force-measuring treadmill at 3.8-5.4 m/s while wearing an accelerometer clipped posteriorly to the waistband of their running shorts. We cross-validated QRF and LR models by training them on acceleration data, running speed, step frequency, and body mass as predictor variables. Trained models were then used to predict peak vGRF, vertical impulse, and contact time. We compared predicted values to those calculated from a force-measuring treadmill on a subset of data (n = 9) withheld during model training. We quantified prediction accuracy by calculating the root mean square error (RMSE) and mean absolute percentage error (MAPE).

Results

The QRF model predicted peak vGRF with a RMSE of 0.150 body weights (BW) and MAPE of 4.27 ± 2.85%, predicted vertical impulse with a RMSE of 0.004 BW*s and MAPE of 0.80 ± 0.91%, and predicted contact time with a RMSE of 0.011 s and MAPE of 4.68 ± 3.00%. The LR model predicted peak vGRF with a RMSE of 0.139 BW and MAPE of 4.04 ± 2.57%, predicted vertical impulse with a RMSE of 0.002 BW*s and MAPE of 0.50 ± 0.42%, and predicted contact time with a RMSE of 0.008 s and MAPE of 3.50 ± 2.27%. There were no statistically significant differences between QRF and LR model prediction MAPE for peak vGRF (p = 0.549) or vertical impulse (p = 0.073), but the LR model's MAPE for contact time was significantly lower than the QRF model's MAPE (p = 0.0497).

Conclusions

Our findings indicate that the QRF and LR models can accurately predict peak vGRF, vertical impulse, and contact time (MAPE < 5%) from a single sacral-mounted accelerometer across a range of running speeds. These findings may be beneficial for researchers, clinicians, or coaches seeking to monitor running-related injury risk factors without force-measuring equipment.

Collapse

Wang Y, Yu Y, Han W, Zhang YJ, Jiang L, Xue HD, Lei J, Jin ZY, Yu JC. CT Radiomics for Distinction of Human Epidermal Growth Factor Receptor 2 Negative Gastric Cancer. Acad Radiol 2021;28:e86-e92. [PMID: 32303442 DOI: 10.1016/j.acra.2020.02.018] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2019] [Revised: 02/12/2020] [Accepted: 02/14/2020] [Indexed: 02/07/2023]

Jiménez-Luna J, Skalic M, Weskamp N, Schneider G. Coloring Molecules with Explainable Artificial Intelligence for Preclinical Relevance Assessment. J Chem Inf Model 2021;61:1083-1094. [PMID: 33629843 DOI: 10.1021/acs.jcim.0c01344] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]

Wu Z, Zhu M, Kang Y, Leung ELH, Lei T, Shen C, Jiang D, Wang Z, Cao D, Hou T. Do we need different machine learning algorithms for QSAR modeling? A comprehensive assessment of 16 machine learning algorithms on 14 QSAR data sets. Brief Bioinform 2020;22:6032614. [PMID: 33313673 DOI: 10.1093/bib/bbaa321] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 10/09/2020] [Accepted: 10/19/2020] [Indexed: 12/18/2022] Open

Abstract

Although a wide variety of machine learning (ML) algorithms have been utilized to learn quantitative structure-activity relationships (QSARs), there is no agreed single best algorithm for QSAR learning. Therefore, a comprehensive understanding of the performance characteristics of popular ML algorithms used in QSAR learning is highly desirable. In this study, five linear algorithms [linear function Gaussian process regression (linear-GPR), linear function support vector machine (linear-SVM), partial least squares regression (PLSR), multiple linear regression (MLR) and principal component regression (PCR)], three analogizers [radial basis function support vector machine (rbf-SVM), K-nearest neighbor (KNN) and radial basis function Gaussian process regression (rbf-GPR)], six symbolists [extreme gradient boosting (XGBoost), Cubist, random forest (RF), multiple adaptive regression splines (MARS), gradient boosting machine (GBM), and classification and regression tree (CART)] and two connectionists [principal component analysis artificial neural network (pca-ANN) and deep neural network (DNN)] were employed to learn the regression-based QSAR models for 14 public data sets comprising nine physicochemical properties and five toxicity endpoints. The results show that rbf-SVM, rbf-GPR, XGBoost and DNN generally illustrate better performances than the other algorithms. The overall performances of different algorithms can be ranked from the best to the worst as follows: rbf-SVM > XGBoost > rbf-GPR > Cubist > GBM > DNN > RF > pca-ANN > MARS > linear-GPR ≈ KNN > linear-SVM ≈ PLSR > CART ≈ PCR ≈ MLR. In terms of prediction accuracy and computational efficiency, SVM and XGBoost are recommended to the regression learning for small data sets, and XGBoost is an excellent choice for large data sets. We then investigated the performances of the ensemble models by integrating the predictions of multiple ML algorithms. The results illustrate that the ensembles of two or three algorithms in different categories can indeed improve the predictions of the best individual ML algorithms.

Collapse

Multiclass machine learning vs. conventional calculators for stroke/CVD risk assessment using carotid plaque predictors with coronary angiography scores as gold standard: a 500 participants study. Int J Cardiovasc Imaging 2020;37:1171-1187. [DOI: 10.1007/s10554-020-02099-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Accepted: 11/03/2020] [Indexed: 02/07/2023]

Tinkov O, Polishchuk P, Matveieva M, Grigorev V, Grigoreva L, Porozov Y. The Influence of Structural Patterns on Acute Aquatic Toxicity of Organic Compounds. Mol Inform 2020;40:e2000209. [PMID: 33029954 DOI: 10.1002/minf.202000209] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 10/01/2020] [Indexed: 12/28/2022]

Jiménez-Luna J, Grisoni F, Schneider G. Drug discovery with explainable artificial intelligence. NAT MACH INTELL 2020. [DOI: 10.1038/s42256-020-00236-4] [Citation(s) in RCA: 152] [Impact Index Per Article: 38.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Wang Y, Liu W, Yu Y, Liu JJ, Jiang L, Xue HD, Lei J, Jin Z, Yu JC. Prediction of the Depth of Tumor Invasion in Gastric Cancer: Potential Role of CT Radiomics. Acad Radiol 2020;27:1077-1084. [PMID: 31761666 DOI: 10.1016/j.acra.2019.10.020] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Revised: 10/22/2019] [Accepted: 10/25/2019] [Indexed: 12/12/2022]

Chen CH, Tanaka K, Kotera M, Funatsu K. Comparison and improvement of the predictability and interpretability with ensemble learning models in QSPR applications. J Cheminform 2020;12:19. [PMID: 33430997 PMCID: PMC7106596 DOI: 10.1186/s13321-020-0417-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Accepted: 02/05/2020] [Indexed: 12/23/2022] Open

Luo Y, Tang Z, Hu X, Lu S, Miao B, Hong S, Bai H, Sun C, Qiu J, Liang H, Na N. Machine learning for the prediction of severe pneumonia during posttransplant hospitalization in recipients of a deceased-donor kidney transplant. ANNALS OF TRANSLATIONAL MEDICINE 2020;8:82. [PMID: 32175375 DOI: 10.21037/atm.2020.01.09] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Abstract

Background

Pneumonia accounts for the majority of infection-related deaths after kidney transplantation. We aimed to build a predictive model based on machine learning for severe pneumonia in recipients of deceased-donor transplants within the perioperative period after surgery.

Methods

We collected the features of kidney transplant recipients and used a tree-based ensemble classification algorithm (Random Forest or AdaBoost) and a nonensemble classifier (support vector machine, Naïve Bayes, or logistic regression) to build the predictive models. We used the area under the precision-recall curve (AUPRC) and the area under the receiver operating characteristic curve (AUROC) to evaluate the predictive performance via ten-fold cross validation.

Results

Five hundred nineteen patients who underwent transplantation from January 2015 to December 2018 were included. Forty-three severe pneumonia episodes (8.3%) occurred during hospitalization after surgery. Significant differences in the recipients' age, diabetes status, HBsAg level, operation time, reoperation, usage of anti-fungal drugs, preoperative albumin and immunoglobulin levels, preoperative pulmonary lesions, and delayed graft function, as well as donor age, were observed between patients with and without severe pneumonia (P<0.05). We screened eight important features correlated with severe pneumonia using the recursive feature elimination method and then constructed a predictive model based on these features. The top three features were preoperative pulmonary lesions, reoperation and recipient age (with importance scores of 0.194, 0.124 and 0.078, respectively). Among the machine learning algorithms described above, the Random Forest algorithm displayed better predictive performance, with a sensitivity of 0.67, specificity of 0.97, positive likelihood ratio of 22.33, negative likelihood ratio of 0.34, AUROC of 0.91, and AUPRC of 0.72.

Conclusions

The Random Forest model is potentially useful for predicting severe pneumonia in kidney transplant recipients. Recipients with a potential preoperative potential pulmonary infection, who are of older age and who require reoperation should be monitored carefully to prevent the occurrence of severe pneumonia.

Collapse

Neural-based approaches to overcome feature selection and applicability domain in drug-related property prediction. Appl Soft Comput 2019. [DOI: 10.1016/j.asoc.2019.105777] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]