1
|
Rouzbahani AK, Khalili-Tanha G, Rajabloo Y, Khojasteh-Leylakoohi F, Garjan HS, Nazari E, Avan A. Machine learning algorithms and biomarkers identification for pancreatic cancer diagnosis using multi-omics data integration. Pathol Res Pract 2024; 263:155602. [PMID: 39357184 DOI: 10.1016/j.prp.2024.155602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/19/2024] [Revised: 09/21/2024] [Accepted: 09/24/2024] [Indexed: 10/04/2024]
Abstract
PURPOSE Pancreatic cancer is a lethal type of cancer with most of the cases being diagnosed in an advanced stage and poor prognosis. Developing new diagnostic and prognostic markers for pancreatic cancer can significantly improve early detection and patient outcomes. These biomarkers can potentially revolutionize medical practice by enabling personalized, more effective, targeted treatments, ultimately improving patient outcomes. METHODS The search strategy was developed following PRISMA guidelines. A comprehensive search was performed across four electronic databases: PubMed, Scopus, EMBASE, and Web of Science, covering all English publications up to September 2022. The Newcastle-Ottawa Scale (NOS) was utilized to assess bias, categorizing studies as "good," "fair," or "poor" quality based on their NOS scores. Descriptive statistics for all included studies were compiled and reviewed, along with the NOS scores for each study to indicate their quality assessment. RESULTS Our results showed that SVM and RF are the most widely used algorithms in machine learning and data analysis, particularly for biomarker identification. SVM, a supervised learning algorithm, is employed for both classification and regression by mapping data points in high-dimensional space to identify the optimal separating hyperplane between classes. CONCLUSIONS The application of machine-learning algorithms in the search for novel biomarkers in pancreatic cancer represents a significant advancement in the field. By harnessing the power of artificial intelligence, researchers are poised to make strides towards earlier detection and more effective treatment, ultimately improving patient outcomes in this challenging disease.
Collapse
Affiliation(s)
- Arian Karimi Rouzbahani
- Student Research Committee, Lorestan University of Medical Sciences, Khorramabad, Iran; USERN Office, Lorestan University of Medical Sciences, Khorramabad, Iran
| | - Ghazaleh Khalili-Tanha
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran; Medical Genetics Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Yasamin Rajabloo
- Student Research Committee, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | | | - Hassan Shokri Garjan
- Department of Health Information Technology, School of Management University of Medical Sciences, Tabriz, Iran
| | - Elham Nazari
- Proteomics Research Center, Faculty of Paramedical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| | - Amir Avan
- Metabolic Syndrome Research Center, Mashhad University of Medical Sciences, Mashhad, Iran; Medical Genetics Research Center, Mashhad University of Medical Sciences, Mashhad, Iran; Basic Sciences Research Institute, Mashhad University of Medical Sciences, Mashhad, Iran.
| |
Collapse
|
2
|
Yazici H, Ugurlu O, Aygul Y, Ugur MA, Sen YK, Yildirim M. Predicting severity of acute appendicitis with machine learning methods: a simple and promising approach for clinicians. BMC Emerg Med 2024; 24:101. [PMID: 38886641 PMCID: PMC11184860 DOI: 10.1186/s12873-024-01023-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 06/12/2024] [Indexed: 06/20/2024] Open
Abstract
BACKGROUNDS Acute Appendicitis (AA) is one of the most common surgical emergencies worldwide. This study aims to investigate the predictive performances of 6 different Machine Learning (ML) algorithms for simple and complicated AA. METHODS Data regarding operated AA patients between 2012 and 2022 were analyzed retrospectively. Based on operative findings, patients were evaluated under two groups: perforated AA and none-perforated AA. The features that showed statistical significance (p < 0.05) in both univariate and multivariate analysis were included in the prediction models as input features. Five different error metrics and the area under the receiver operating characteristic curve (AUC) were used for model comparison. RESULTS A total number of 1132 patients were included in the study. Patients were divided into training (932 samples), testing (100 samples), and validation (100 samples) sets. Age, gender, neutrophil count, lymphocyte count, Neutrophil to Lymphocyte ratio, total bilirubin, C-Reactive Protein (CRP), Appendix Diameter, and PeriAppendicular Liquid Collection (PALC) were significantly different between the two groups. In the multivariate analysis, age, CRP, and PALC continued to show a significant difference in the perforated AA group. According to univariate and multivariate analysis, two data sets were used in the prediction model. K-Nearest Neighbors and Logistic Regression algorithms achieved the best prediction performance in the validation group with an accuracy of 96%. CONCLUSION The results showed that using only three input features (age, CRP, and PALC), the severity of AA can be predicted with high accuracy. The developed prediction model can be useful in clinical practice.
Collapse
Affiliation(s)
- Hilmi Yazici
- General Surgery Department, Marmara University Pendik Research and Training Hospital, Istanbul, Turkey.
| | - Onur Ugurlu
- Faculty of Engineering and Architecture, Izmir Bakircay University, Izmir, Turkey
| | - Yesim Aygul
- Department of Mathematics, Ege University, Izmir, Turkey
| | - Mehmet Alperen Ugur
- General Surgery Department, University of Health Sciences Izmir Bozyaka Research and Training Hospital, Izmir, Turkey
| | - Yigit Kaan Sen
- General Surgery Department, University of Health Sciences Izmir Bozyaka Research and Training Hospital, Izmir, Turkey
| | - Mehmet Yildirim
- General Surgery Department, University of Health Sciences Izmir Bozyaka Research and Training Hospital, Izmir, Turkey
| |
Collapse
|
3
|
Qiu C, Su K, Luo Z, Tian Q, Zhao L, Wu L, Deng H, Shen H. Developing and comparing deep learning and machine learning algorithms for osteoporosis risk prediction. Front Artif Intell 2024; 7:1355287. [PMID: 38919268 PMCID: PMC11196804 DOI: 10.3389/frai.2024.1355287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 05/31/2024] [Indexed: 06/27/2024] Open
Abstract
Introduction Osteoporosis, characterized by low bone mineral density (BMD), is an increasingly serious public health issue. So far, several traditional regression models and machine learning (ML) algorithms have been proposed for predicting osteoporosis risk. However, these models have shown relatively low accuracy in clinical implementation. Recently proposed deep learning (DL) approaches, such as deep neural network (DNN), which can discover knowledge from complex hidden interactions, offer a new opportunity to improve predictive performance. In this study, we aimed to assess whether DNN can achieve a better performance in osteoporosis risk prediction. Methods By utilizing hip BMD and extensive demographic and routine clinical data of 8,134 subjects with age more than 40 from the Louisiana Osteoporosis Study (LOS), we developed and constructed a novel DNN framework for predicting osteoporosis risk and compared its performance in osteoporosis risk prediction with four conventional ML models, namely random forest (RF), artificial neural network (ANN), k-nearest neighbor (KNN), and support vector machine (SVM), as well as a traditional regression model termed osteoporosis self-assessment tool (OST). Model performance was assessed by area under 'receiver operating curve' (AUC) and accuracy. Results By using 16 discriminative variables, we observed that the DNN approach achieved the best predictive performance (AUC = 0.848) in classifying osteoporosis (hip BMD T-score ≤ -1.0) and non-osteoporosis risk (hip BMD T-score > -1.0) subjects, compared to the other approaches. Feature importance analysis showed that the top 10 most important variables identified by the DNN model were weight, age, gender, grip strength, height, beer drinking, diastolic pressure, alcohol drinking, smoke years, and economic level. Furthermore, we performed subsampling analysis to assess the effects of varying number of sample size and variables on the predictive performance of these tested models. Notably, we observed that the DNN model performed equally well (AUC = 0.846) even by utilizing only the top 10 most important variables for osteoporosis risk prediction. Meanwhile, the DNN model can still achieve a high predictive performance (AUC = 0.826) when sample size was reduced to 50% of the original dataset. Conclusion In conclusion, we developed a novel DNN model which was considered to be an effective algorithm for early diagnosis and intervention of osteoporosis in the aging population.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Hongwen Deng
- Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, United States
| | - Hui Shen
- Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, United States
| |
Collapse
|
4
|
Males I, Boban Z, Kumric M, Vrdoljak J, Berkovic K, Pogorelic Z, Bozic J. Applying an explainable machine learning model might reduce the number of negative appendectomies in pediatric patients with a high probability of acute appendicitis. Sci Rep 2024; 14:12772. [PMID: 38834671 DOI: 10.1038/s41598-024-63513-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Accepted: 05/29/2024] [Indexed: 06/06/2024] Open
Abstract
The diagnosis of acute appendicitis and concurrent surgery referral is primarily based on clinical presentation, laboratory and radiological imaging. However, utilizing such an approach results in as much as 10-15% of negative appendectomies. Hence, in the present study, we aimed to develop a machine learning (ML) model designed to reduce the number of negative appendectomies in pediatric patients with a high clinical probability of acute appendicitis. The model was developed and validated on a registry of 551 pediatric patients with suspected acute appendicitis that underwent surgical treatment. Clinical, anthropometric, and laboratory features were included for model training and analysis. Three machine learning algorithms were tested (random forest, eXtreme Gradient Boosting, logistic regression) and model explainability was obtained. Random forest model provided the best predictions achieving mean specificity and sensitivity of 0.17 ± 0.01 and 0.997 ± 0.001 for detection of acute appendicitis, respectively. Furthermore, the model outperformed the appendicitis inflammatory response (AIR) score across most sensitivity-specificity combinations. Finally, the random forest model again provided the best predictions for discrimination between complicated appendicitis, and either uncomplicated acute appendicitis or no appendicitis at all, with a joint mean sensitivity of 0.994 ± 0.002 and specificity of 0.129 ± 0.009. In conclusion, the developed ML model might save as much as 17% of patients with a high clinical probability of acute appendicitis from unnecessary surgery, while missing the needed surgery in only 0.3% of cases. Additionally, it showed better diagnostic accuracy than the AIR score, as well as good accuracy in predicting complicated acute appendicitis over uncomplicated and negative cases bundled together. This may be useful in centers that advocate for the conservative treatment of uncomplicated appendicitis. Nevertheless, external validation is needed to support these findings.
Collapse
Affiliation(s)
- Ivan Males
- Department of Abdominal Surgery, University Hospital of Split, Spinciceva 1, 21000, Split, Croatia
| | - Zvonimir Boban
- Department of Medical Physics and Biophysics, School of Medicine, University of Split, Soltanska 2A, 21000, Split, Croatia
| | - Marko Kumric
- Department of Pathophysiology, School of Medicine, University of Split, Soltanska 2A, 21000, Split, Croatia
- Laboratory for Cardiometabolic Research, School of Medicine, University of Split, Soltanska 2A, 21000, Split, Croatia
| | - Josip Vrdoljak
- Department of Pathophysiology, School of Medicine, University of Split, Soltanska 2A, 21000, Split, Croatia
- Laboratory for Cardiometabolic Research, School of Medicine, University of Split, Soltanska 2A, 21000, Split, Croatia
| | - Karlotta Berkovic
- Department of Surgery, School of Medicine, University of Split, Soltanska 2A, 21000, Split, Croatia
| | - Zenon Pogorelic
- Department of Surgery, School of Medicine, University of Split, Soltanska 2A, 21000, Split, Croatia.
- Department of Pediatric Surgery, University Hospital of Split, Spinciceva 1, 21000, Split, Croatia.
| | - Josko Bozic
- Department of Medical Physics and Biophysics, School of Medicine, University of Split, Soltanska 2A, 21000, Split, Croatia.
- Department of Pathophysiology, School of Medicine, University of Split, Soltanska 2A, 21000, Split, Croatia.
| |
Collapse
|
5
|
Bianchi V, Giambusso M, De Iacob A, Chiarello MM, Brisinda G. Artificial intelligence in the diagnosis and treatment of acute appendicitis: a narrative review. Updates Surg 2024; 76:783-792. [PMID: 38472633 PMCID: PMC11129994 DOI: 10.1007/s13304-024-01801-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Accepted: 02/24/2024] [Indexed: 03/14/2024]
Abstract
Artificial intelligence is transforming healthcare. Artificial intelligence can improve patient care by analyzing large amounts of data to help make more informed decisions regarding treatments and enhance medical research through analyzing and interpreting data from clinical trials and research projects to identify subtle but meaningful trends beyond ordinary perception. Artificial intelligence refers to the simulation of human intelligence in computers, where systems of artificial intelligence can perform tasks that require human-like intelligence like speech recognition, visual perception, pattern-recognition, decision-making, and language processing. Artificial intelligence has several subdivisions, including machine learning, natural language processing, computer vision, and robotics. By automating specific routine tasks, artificial intelligence can improve healthcare efficiency. By leveraging machine learning algorithms, the systems of artificial intelligence can offer new opportunities for enhancing both the efficiency and effectiveness of surgical procedures, particularly regarding training of minimally invasive surgery. As artificial intelligence continues to advance, it is likely to play an increasingly significant role in the field of surgical learning. Physicians have assisted to a spreading role of artificial intelligence in the last decade. This involved different medical specialties such as ophthalmology, cardiology, urology, but also abdominal surgery. In addition to improvements in diagnosis, ascertainment of efficacy of treatment and autonomous actions, artificial intelligence has the potential to improve surgeons' ability to better decide if acute surgery is indicated or not. The role of artificial intelligence in the emergency departments has also been investigated. We considered one of the most common condition the emergency surgeons have to face, acute appendicitis, to assess the state of the art of artificial intelligence in this frequent acute disease. The role of artificial intelligence in diagnosis and treatment of acute appendicitis will be discussed in this narrative review.
Collapse
Affiliation(s)
- Valentina Bianchi
- Emergency Surgery and Trauma Center, Department of Abdominal and Endocrine Metabolic Medical and Surgical Sciences, IRCCS, Fondazione Policlinico Universitario A Gemelli, Largo Agostino Gemelli 8, 00168, Rome, Italy
| | - Mauro Giambusso
- General Surgery Operative Unit, Vittorio Emanuele Hospital, 93012, Gela, Italy
| | - Alessandra De Iacob
- Emergency Surgery and Trauma Center, Department of Abdominal and Endocrine Metabolic Medical and Surgical Sciences, IRCCS, Fondazione Policlinico Universitario A Gemelli, Largo Agostino Gemelli 8, 00168, Rome, Italy
| | - Maria Michela Chiarello
- Department of Surgery, General Surgery Operative Unit, Azienda Sanitaria Provinciale Cosenza, 87100, Cosenza, Italy
| | - Giuseppe Brisinda
- Emergency Surgery and Trauma Center, Department of Abdominal and Endocrine Metabolic Medical and Surgical Sciences, IRCCS, Fondazione Policlinico Universitario A Gemelli, Largo Agostino Gemelli 8, 00168, Rome, Italy.
- Catholic School of Medicine, University Department of Translational Medicine and Surgery, 00168, Rome, Italy.
| |
Collapse
|
6
|
Zhang L, Huang Y, Huang M, Zhao CH, Zhang YJ, Wang Y. Development of Cost-Effective Fatty Liver Disease Prediction Models in a Chinese Population: Statistical and Machine Learning Approaches. JMIR Form Res 2024; 8:e53654. [PMID: 38363597 PMCID: PMC10907948 DOI: 10.2196/53654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2023] [Revised: 01/23/2024] [Accepted: 01/29/2024] [Indexed: 02/17/2024] Open
Abstract
BACKGROUND The increasing prevalence of nonalcoholic fatty liver disease (NAFLD) in China presents a significant public health concern. Traditional ultrasound, commonly used for fatty liver screening, often lacks the ability to accurately quantify steatosis, leading to insufficient follow-up for patients with moderate-to-severe steatosis. Transient elastography (TE) provides a more quantitative diagnosis of steatosis and fibrosis, closely aligning with biopsy results. Moreover, machine learning (ML) technology holds promise for developing more precise diagnostic models for NAFLD using a variety of laboratory indicators. OBJECTIVE This study aims to develop a novel ML-based diagnostic model leveraging TE results for staging hepatic steatosis. The objective was to streamline the model's input features, creating a cost-effective and user-friendly tool to distinguish patients with NAFLD requiring follow-up. This innovative approach merges TE and ML to enhance diagnostic accuracy and efficiency in NAFLD assessment. METHODS The study involved a comprehensive analysis of health examination records from Suzhou Municipal Hospital, spanning from March to May 2023. Patient data and questionnaire responses were meticulously inputted into Microsoft Excel 2019, followed by thorough data cleaning and model development using Python 3.7, with libraries scikit-learn and numpy to ensure data accuracy. A cohort comprising 978 residents with complete medical records and TE results was included for analysis. Various classification models, including logistic regression (LR), k-nearest neighbor (KNN), support vector machine (SVM), random forest (RF), light gradient boosting machine (LightGBM), and extreme gradient boosting (XGBoost), were constructed and evaluated based on the area under the receiver operating characteristic curve (AUROC). RESULTS Among the 916 patients included in the study, 273 were diagnosed with moderate-to-severe NAFLD. The concordance rate between traditional ultrasound and TE for detecting moderate-to-severe NAFLD was 84.6% (231/273). The AUROC values for the RF, LightGBM, XGBoost, SVM, KNN, and LR models were 0.91, 0.86, 0.83, 0.88, 0.77, and 0.81, respectively. These models achieved accuracy rates of 84%, 81%, 78%, 81%, 76%, and 77%, respectively. Notably, the RF model exhibited the best performance. A simplified RF model was developed with an AUROC of 0.88, featuring 62% sensitivity and 90% specificity. This simplified model used 6 key features: waist circumference, BMI, fasting plasma glucose, uric acid, total bilirubin, and high-sensitivity C-reactive protein. This approach offers a cost-effective and user-friendly tool while streamlining feature acquisition for training purposes. CONCLUSIONS The study introduces a groundbreaking, cost-effective ML algorithm that leverages health examination data for identifying moderate-to-severe NAFLD. This model has the potential to significantly impact public health by enabling targeted investigations and interventions for NAFLD. By integrating TE and ML technologies, the study showcases innovative approaches to advancing NAFLD diagnostics.
Collapse
Affiliation(s)
- Liang Zhang
- Department of General Practice, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou, China
| | - Yueqing Huang
- Department of General Practice, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou, China
| | - Min Huang
- Department of General Practice, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou, China
| | - Chun-Hua Zhao
- Department of General Practice, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou, China
| | - Yan-Jun Zhang
- School of Information Science and Engineering, Southeast University, Nanjing, China
| | - Yi Wang
- The First Clinical Medical College, Nanjing Medical University, Nanjing, China
| |
Collapse
|
7
|
Turjo EA, Rahman MH. Assessing risk factors for malnutrition among women in Bangladesh and forecasting malnutrition using machine learning approaches. BMC Nutr 2024; 10:22. [PMID: 38303093 PMCID: PMC10832135 DOI: 10.1186/s40795-023-00808-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Accepted: 12/04/2023] [Indexed: 02/03/2024] Open
Abstract
BACKGROUND This paper presents an in-depth examination of malnutrition in women in Bangladesh. Malnutrition in women is a major public health issue related to different diseases and has negative repercussions for children, such as premature birth, decreased infection resistance, and an increased risk of death. Moreover, malnutrition is a severe problem in Bangladesh. Data from the Bangladesh Demographic Health Survey (BDHS) conducted in 2017-18 was used to identify risk factors for malnourished women and to create a machine learning-based strategy to detect their nutritional status. METHODS A total of 17022 women participants are taken to conduct the research. All the participants are from different regions and different ages. A chi-square test with a five percent significance level is used to identify possible risk variables for malnutrition in women and six machine learning-based classifiers (Naïve Bayes, two types of Decision Tree, Logistic Regression, Random Forest, and Gradient Boosting Machine) were used to predict the malnutrition of women. The models are being evaluated using different parameters like accuracy, sensitivity, specificity, positive predictive value, negative predictive value, [Formula: see text] score, and area under the curve (AUC). RESULTS Descriptive data showed that 45% of the population studied were malnourished women, and the chi-square test illustrated that all fourteen variables are significantly associated with malnutrition in women and among them, age and wealth index had the most influence on their nutritional status, while water source had the least impact. Random Forest had an accuracy of 60% and 60.2% for training and test data sets, respectively. CART and Gradient Boosting Machine also had close accuracy like Random Forest but based on other performance metrics such as kappa and [Formula: see text] scores Random Forest got the highest rank among others. Also, it had the highest accuracy and [Formula: see text] scores in k-fold validation along with the highest AUC (0.604). CONCLUSION The Random Forest (RF) approach is a reasonably superior machine learning-based algorithm for forecasting women's nutritional status in Bangladesh in comparison to other ML algorithms investigated in this work. The suggested approach will aid in forecasting which women are at high susceptibility to malnutrition, hence decreasing the strain on the healthcare system.
Collapse
Affiliation(s)
- Estiyak Ahmed Turjo
- Department of Statistics and Data Science, Jahangirnagar University, Dhaka, Bangladesh
| | - Md Habibur Rahman
- Department of Statistics and Data Science, Jahangirnagar University, Dhaka, Bangladesh.
| |
Collapse
|
8
|
Marcinkevičs R, Reis Wolfertstetter P, Klimiene U, Chin-Cheong K, Paschke A, Zerres J, Denzinger M, Niederberger D, Wellmann S, Ozkan E, Knorr C, Vogt JE. Interpretable and intervenable ultrasonography-based machine learning models for pediatric appendicitis. Med Image Anal 2024; 91:103042. [PMID: 38000257 DOI: 10.1016/j.media.2023.103042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 11/10/2023] [Accepted: 11/20/2023] [Indexed: 11/26/2023]
Abstract
Appendicitis is among the most frequent reasons for pediatric abdominal surgeries. Previous decision support systems for appendicitis have focused on clinical, laboratory, scoring, and computed tomography data and have ignored abdominal ultrasound, despite its noninvasive nature and widespread availability. In this work, we present interpretable machine learning models for predicting the diagnosis, management and severity of suspected appendicitis using ultrasound images. Our approach utilizes concept bottleneck models (CBM) that facilitate interpretation and interaction with high-level concepts understandable to clinicians. Furthermore, we extend CBMs to prediction problems with multiple views and incomplete concept sets. Our models were trained on a dataset comprising 579 pediatric patients with 1709 ultrasound images accompanied by clinical and laboratory data. Results show that our proposed method enables clinicians to utilize a human-understandable and intervenable predictive model without compromising performance or requiring time-consuming image annotation when deployed. For predicting the diagnosis, the extended multiview CBM attained an AUROC of 0.80 and an AUPR of 0.92, performing comparably to similar black-box neural networks trained and tested on the same dataset.
Collapse
Affiliation(s)
- Ričards Marcinkevičs
- Department of Computer Science, ETH Zurich, Universitätstrasse 6, Zürich, 8092, Switzerland.
| | - Patricia Reis Wolfertstetter
- Department of Pediatric Surgery and Pediatric Orthopedics, Hospital St. Hedwig of the Order of St. John of God, University Children's Hospital Regensburg (KUNO), Steinmetzstrasse 1-3, Regensburg, 93049, Germany; Faculty of Medicine, University of Regensburg, Franz-Josef-Strauss-Allee 11, Regensburg, 93053, Germany.
| | - Ugne Klimiene
- Department of Computer Science, ETH Zurich, Universitätstrasse 6, Zürich, 8092, Switzerland
| | - Kieran Chin-Cheong
- Department of Computer Science, ETH Zurich, Universitätstrasse 6, Zürich, 8092, Switzerland
| | - Alyssia Paschke
- Faculty of Medicine, University of Regensburg, Franz-Josef-Strauss-Allee 11, Regensburg, 93053, Germany
| | - Julia Zerres
- Faculty of Medicine, University of Regensburg, Franz-Josef-Strauss-Allee 11, Regensburg, 93053, Germany
| | - Markus Denzinger
- Department of Pediatric Surgery and Pediatric Orthopedics, Hospital St. Hedwig of the Order of St. John of God, University Children's Hospital Regensburg (KUNO), Steinmetzstrasse 1-3, Regensburg, 93049, Germany; Faculty of Medicine, University of Regensburg, Franz-Josef-Strauss-Allee 11, Regensburg, 93053, Germany
| | - David Niederberger
- Department of Computer Science, ETH Zurich, Universitätstrasse 6, Zürich, 8092, Switzerland
| | - Sven Wellmann
- Faculty of Medicine, University of Regensburg, Franz-Josef-Strauss-Allee 11, Regensburg, 93053, Germany; Division of Neonatology, Hospital St. Hedwig of the Order of St. John of God, University Children's Hospital Regensburg (KUNO), Steinmetzstrasse 1-3, Regensburg, 93049, Germany
| | - Ece Ozkan
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, 43 Vassar Street, Cambridge, 02139, USA
| | - Christian Knorr
- Department of Pediatric Surgery and Pediatric Orthopedics, Hospital St. Hedwig of the Order of St. John of God, University Children's Hospital Regensburg (KUNO), Steinmetzstrasse 1-3, Regensburg, 93049, Germany
| | - Julia E Vogt
- Department of Computer Science, ETH Zurich, Universitätstrasse 6, Zürich, 8092, Switzerland.
| |
Collapse
|
9
|
Issaiy M, Zarei D, Saghazadeh A. Artificial Intelligence and Acute Appendicitis: A Systematic Review of Diagnostic and Prognostic Models. World J Emerg Surg 2023; 18:59. [PMID: 38114983 PMCID: PMC10729387 DOI: 10.1186/s13017-023-00527-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Accepted: 12/06/2023] [Indexed: 12/21/2023] Open
Abstract
BACKGROUND To assess the efficacy of artificial intelligence (AI) models in diagnosing and prognosticating acute appendicitis (AA) in adult patients compared to traditional methods. AA is a common cause of emergency department visits and abdominal surgeries. It is typically diagnosed through clinical assessments, laboratory tests, and imaging studies. However, traditional diagnostic methods can be time-consuming and inaccurate. Machine learning models have shown promise in improving diagnostic accuracy and predicting outcomes. MAIN BODY A systematic review following the PRISMA guidelines was conducted, searching PubMed, Embase, Scopus, and Web of Science databases. Studies were evaluated for risk of bias using the Prediction Model Risk of Bias Assessment Tool. Data points extracted included model type, input features, validation strategies, and key performance metrics. RESULTS In total, 29 studies were analyzed, out of which 21 focused on diagnosis, seven on prognosis, and one on both. Artificial neural networks (ANNs) were the most commonly employed algorithm for diagnosis. Both ANN and logistic regression were also widely used for categorizing types of AA. ANNs showed high performance in most cases, with accuracy rates often exceeding 80% and AUC values peaking at 0.985. The models also demonstrated promising results in predicting postoperative outcomes such as sepsis risk and ICU admission. Risk of bias was identified in a majority of studies, with selection bias and lack of internal validation being the most common issues. CONCLUSION AI algorithms demonstrate significant promise in diagnosing and prognosticating AA, often surpassing traditional methods and clinical scores such as the Alvarado scoring system in terms of speed and accuracy.
Collapse
Affiliation(s)
- Mahbod Issaiy
- School of Medicine, Tehran University of Medical Sciences (TUMS), Tehran, Iran
- Systematic Review and Meta-Analysis Expert Group (SRMEG), Universal Scientific Education and Research Network (USERN), Tehran, Iran
| | - Diana Zarei
- School of Medicine, Iran University of Medical Sciences, Tehran, Iran
- Advanced Diagnostic and Interventional Radiology Research Center (ADIR), Tehran University of Medical Science, Tehran, Iran
| | - Amene Saghazadeh
- Systematic Review and Meta-Analysis Expert Group (SRMEG), Universal Scientific Education and Research Network (USERN), Tehran, Iran.
- Research Center for Immunodeficiencies, Children's Medical Center, Tehran University of Medical Sciences, Tehran, Iran.
| |
Collapse
|
10
|
Harmantepe AT, Dikicier E, Gönüllü E, Ozdemir K, Kamburoğlu MB, Yigit M. A different way to diagnosis acute appendicitis: machine learning. POLISH JOURNAL OF SURGERY 2023; 96:38-43. [PMID: 38629278 DOI: 10.5604/01.3001.0053.5994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2024]
Abstract
<b><br>Indroduction:</b> Machine learning is a branch of artificial intelligence based on the idea that systems can learn from data, identify patterns, and make decisions with minimal human intervention.</br> <b><br>Aim:</b> Our aim is to predict acute appendicitis, which is the most common indication for emergency surgery, using machine learning algorithms with an easy and inexpensive method.</br> <b><br>Materials and methods:</b> Patients who were treated surgically with a prediagnosis of acute appendicitis in a single center between 2011 and 2021 were analyzed. Patients with right lower quadrant pain were selected. A total of 189 positive and 156 negative appendectomies were found. Gender and hemogram were used as features. Machine learning algorithms and data analysis were made in Python (3.7) programming language.</br> <b><br>Results:</b> Negative appendectomies were found in 62% (n = 97) of the women and in 38% (n = 59) of the men. Positive appendectomies were present in 38% (n = 72) of the women and 62% (n = 117) of the men. The accuracy in the test data was 82.7% in logistic regression, 68.9% in support vector machines, 78.1% in k-nearest neighbors, and 83.9% in neural networks. The accuracy in the voting classifier created with logistic regression, k-nearest neighbor, support vector machines, and artificial neural networks was 86.2%. In the voting classifier, the sensitivity was 83.7% and the specificity was 88.6%.</br> <b><br>Conclusions:</b> The results of our study show that machine learning is an effective method for diagnosing acute appendicitis. This study presents a practical, easy, fast, and inexpensive method to predict the diagnosis of acute appendicitis.</br>.
Collapse
Affiliation(s)
| | - Enis Dikicier
- Sakarya University Faculty of Medicine, Department of General Surgery
| | - Emre Gönüllü
- Sakarya University Education and Research Hospital, Department of General Surgery
| | | | | | - Merve Yigit
- Sakarya University Education and Research Hospital, Department of General Surgery
| |
Collapse
|
11
|
Zhang J, Xu Z, Fu Y, Chen L. Prediction of the Risk of Bone Mineral Density Decrease in Type 2 Diabetes Mellitus Patients Based on Traditional Multivariate Logistic Regression and Machine Learning: A Preliminary Study. Diabetes Metab Syndr Obes 2023; 16:2885-2898. [PMID: 37744700 PMCID: PMC10517691 DOI: 10.2147/dmso.s422515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Accepted: 09/05/2023] [Indexed: 09/26/2023] Open
Abstract
Purpose There remains a lack of a machine learning (ML) model incorporating body composition to assess the risk of bone mineral density (BMD) decreases in type 2 diabetes mellitus (T2DM) patients. We aimed to use ML algorithms and the traditional multivariate logistic regression to establish prediction models for BMD decreases in T2DM patients over 50 years of age, and compare the performance of the two methods. Patients and Methods This cross-sectional study was conducted among 450 patients with T2DM from 1 August 2016 to 31 December 2022. The participants were divided into a normal BMD group and a decreased BMD group. Traditional multivariate logistic regression and six ML algorithms were selected to construct male and female models. Two nomograms were constructed to evaluate the risk of BMD decreases in the male and female T2DM patients, respectively. The ML models with the highest area under the curve (AUC) were compared with the traditional multivariate logistic regression models in terms of discriminant ability and clinical applicability. Results The optimal ML model was the extreme gradient boost (XGBoost) model. The AUCs of the traditional multivariate logistic regression and the XGBoost models were 0.722 and 0.800 in the male testing dataset, respectively, and 0.876 and 0.880 in the female testing dataset, respectively. The decision curve analysis results suggested that using the XGBoost models to predict the risk of BMD decreases obtained more net benefits compared with the traditional models in both sexes. Conclusion We preliminarily proved that the XGBoost models outperformed most other ML models in both sexes and achieved higher accuracy than traditional analyses. Due to the limited sample size in the study, it is necessary to validate our findings in larger prospective cohort studies.
Collapse
Affiliation(s)
- Junli Zhang
- Department of Endocrinology and Metabolism, The Third Affiliated Hospital of Soochow University, Changzhou, People’s Republic of China
| | - Zhenghui Xu
- Department of Endocrinology and Metabolism, The Third Affiliated Hospital of Soochow University, Changzhou, People’s Republic of China
| | - Yu Fu
- Department of Clinical Nutrition, The Third Affiliated Hospital of Soochow University, Changzhou, People’s Republic of China
| | - Lu Chen
- Department of Endocrinology and Metabolism, The Third Affiliated Hospital of Soochow University, Changzhou, People’s Republic of China
- Department of Clinical Nutrition, The Third Affiliated Hospital of Soochow University, Changzhou, People’s Republic of China
| |
Collapse
|
12
|
Chung J, Cheong JH, Kim JM, Lee DH, Yi HJ, Choi KS, Ahn JS, Park JC, Park W. Is Fetal-Type Posterior Cerebral Artery a Risk Factor for Recurrence in Coiled Internal Carotid Artery-Incorporating Posterior Communicating Artery Aneurysms? Analysis of Conventional Statistics, Computational Fluid Dynamics, and Random Forest With Hyper-Ensemble Approach. Neurosurgery 2023; 93:611-621. [PMID: 37057916 DOI: 10.1227/neu.0000000000002458] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 01/20/2023] [Indexed: 04/15/2023] Open
Abstract
BACKGROUND The fetal-type posterior cerebral artery (FPCA) has been regarded as the risk factor for recurrence in coiled internal carotid artery-incorporating posterior communicating artery (ICA-PCoA) aneurysm. However, it has not been proven in previous literature studies. OBJECTIVE To reveal the impact of FPCA on the recurrence of ICA-PCoA aneurysms using conventional statistical analysis, computational fluid dynamics (CFD) simulation, and random forest with hyper-ensemble approach (RF with HEA). METHODS Vascular parameters and clinical information from patients who underwent coil embolization ICA-PCoA aneurysms from January 2011 to December 2016 were obtained. Conventional statistical analysis was applied to a total of 95 cases obtained from patients with a follow-up of more than 6 months. For CFD simulation, 3 sets of three-dimensional models were used to understand the hemodynamical characteristics of various FPCAs. The RF with HEA was applied to reinforce the clinical data analysis. RESULTS The conventional statistical analysis fails to reveal that FPCA is a risk factor. CFD analysis shows that the diameter of FPCA alone is less likely to be a risk factor. The RF with HEA shows that the impact of FPCA is also minor compared with that of the packing density in the recurrence of coiled ICA-PCoA aneurysms. CONCLUSION The gathered results of all 3 analyses show more clear evidence that FPCA is not a risk factor for coiled ICA-PCoA aneurysms. Hence, we may conclude that FPCA itself is doubtful to be the major risk factor in the recurrence of coiled ICA-PCoA aneurysms.
Collapse
Affiliation(s)
- Jaewoo Chung
- Department of Neurosurgery, Dankook University, Cheonan, Republic of Korea
| | - Jin Hwan Cheong
- Department of Neurosurgery, Hanyang University Guri Hospital, Guri, Republic of Korea
| | - Jae Min Kim
- Department of Neurosurgery, Hanyang University Guri Hospital, Guri, Republic of Korea
| | - Deok Hee Lee
- Department of Radiology, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea
| | - Hyeong-Joong Yi
- Department of Neurosurgery, Hanyang University Medical Center, Seoul, Republic of Korea
| | - Kyu-Sun Choi
- Department of Neurosurgery, Hanyang University Medical Center, Seoul, Republic of Korea
| | - Jae Sung Ahn
- Department of Neurosurgery, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea
| | - Jung Cheol Park
- Department of Neurosurgery, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea
| | - Wonhyoung Park
- Department of Neurosurgery, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea
| |
Collapse
|
13
|
Chen SL, Chin SC, Chan KC, Ho CY. A Machine Learning Approach to Assess Patients with Deep Neck Infection Progression to Descending Mediastinitis: Preliminary Results. Diagnostics (Basel) 2023; 13:2736. [PMID: 37685275 PMCID: PMC10486957 DOI: 10.3390/diagnostics13172736] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 07/25/2023] [Accepted: 08/22/2023] [Indexed: 09/10/2023] Open
Abstract
BACKGROUND Deep neck infection (DNI) is a serious infectious disease, and descending mediastinitis is a fatal infection of the mediastinum. However, no study has applied artificial intelligence to assess progression to descending mediastinitis in DNI patients. Thus, we developed a model to assess the possible progression of DNI to descending mediastinitis. METHODS Between August 2017 and December 2022, 380 patients with DNI were enrolled; 75% of patients (n = 285) were assigned to the training group for validation, whereas the remaining 25% (n = 95) were assigned to the test group to determine the accuracy. The patients' clinical and computed tomography (CT) parameters were analyzed via the k-nearest neighbor method. The predicted and actual progression of DNI patients to descending mediastinitis were compared. RESULTS In the training and test groups, there was no statistical significance (all p > 0.05) noted at clinical variables (age, gender, chief complaint period, white blood cells, C-reactive protein, diabetes mellitus, and blood sugar), deep neck space (parapharyngeal, submandibular, retropharyngeal, and multiple spaces involved, ≥3), tracheostomy performance, imaging parameters (maximum diameter of abscess and nearest distance from abscess to level of sternum notch), or progression to mediastinitis. The model had a predictive accuracy of 82.11% (78/95 patients), with sensitivity and specificity of 41.67% and 87.95%, respectively. CONCLUSIONS Our model can assess the progression of DNI to descending mediastinitis depending on clinical and imaging parameters. It can be used to identify DNI patients who will benefit from prompt treatment.
Collapse
Affiliation(s)
- Shih-Lung Chen
- Department of Otorhinolaryngology & Head and Neck Surgery, Chang Gung Memorial Hospital, New Taipei City 333, Taiwan
- School of Medicine, Chang Gung University, Taoyuan 333, Taiwan
| | - Shy-Chyi Chin
- School of Medicine, Chang Gung University, Taoyuan 333, Taiwan
- Department of Medical Imaging and Intervention, Chang Gung Memorial Hospital, New Taipei City 333, Taiwan
| | - Kai-Chieh Chan
- Department of Otorhinolaryngology & Head and Neck Surgery, Chang Gung Memorial Hospital, New Taipei City 333, Taiwan
- School of Medicine, Chang Gung University, Taoyuan 333, Taiwan
| | - Chia-Ying Ho
- School of Medicine, Chang Gung University, Taoyuan 333, Taiwan
- Division of Chinese Internal Medicine, Center for Traditional Chinese Medicine, Chang Gung Memorial Hospital, Taoyuan 333, Taiwan
| |
Collapse
|
14
|
Henn J, Hatterscheidt S, Sahu A, Buness A, Dohmen J, Arensmeyer J, Feodorovici P, Sommer N, Schmidt J, Kalff JC, Matthaei H. Machine Learning for Decision-Support in Acute Abdominal Pain - Proof of Concept and Central Considerations. Zentralbl Chir 2023; 148:376-383. [PMID: 37562397 DOI: 10.1055/a-2125-1559] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/12/2023]
Abstract
Acute abdominal pain is a common presenting symptom in the emergency department and represents heterogeneous causes and diagnoses. There is often a decision to be made regarding emergency surgical care. Machine learning (ML) could be used here as a decision-support and relieve the time and personnel resource shortage.Patients with acute abdominal pain presenting to the Department of Surgery at Bonn University Hospital in 2020 and 2021 were retrospectively analyzed. Clinical parameters as well as laboratory values were used as predictors. After randomly splitting into a training and test data set (ratio 80 to 20), three ML algorithms were comparatively trained and validated. The entire procedure was repeated 20 times.A total of 1357 patients were identified and included in the analysis, with one in five (n = 276, 20.3%) requiring emergency abdominal surgery within 24 hours. Patients operated on were more likely to be male (p = 0.026), older (p = 0.006), had more gastrointestinal symptoms (nausea: p < 0.001, vomiting p < 0.001) as well as a more recent onset of pain (p < 0.001). Tenderness (p < 0.001) and guarding (p < 0.001) were more common in surgically treated patients and blood analyses showed increased inflammation levels (white blood cell count: p < 0.001, CRP: p < 0.001) and onset of organ dysfunction (creatinine: p < 0.014, quick p < 0.001). Of the three trained algorithms, the tree-based methods (h2o random forest and cforest) showed the best performance. The algorithms classified patients, i.e., predicted surgery, with a median AUC ROC of 0.81 and 0.79 and AUC PRC of 0.56 in test sets.A proof-of-concept was achieved with the development of an ML model for predicting timely surgical therapy for acute abdomen. The ML algorithm can be a valuable tool in decision-making. Especially in the context of heavily used medical resources, the algorithm can help to use these scarce resources more effectively. Technological progress, especially regarding artificial intelligence, increasingly enables evidence-based approaches in surgery but requires a strictly interdisciplinary approach. In the future, the use and handling of ML should be integrated into surgical training.
Collapse
Affiliation(s)
- Jonas Henn
- Department of General, Visceral, Thoracic and Vascular Surgery, University Hospital Bonn, Bonn, Germany
| | - Simon Hatterscheidt
- Department of General, Visceral, Thoracic and Vascular Surgery, University Hospital Bonn, Bonn, Germany
| | - Anshupa Sahu
- Institute for Medical Biometry, Informatics and Epidemiology, University Hospital Bonn, Bonn, Germany
- Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, Bonn, Germany
| | - Andreas Buness
- Institute for Medical Biometry, Informatics and Epidemiology, University Hospital Bonn, Bonn, Germany
- Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, Bonn, Germany
| | - Jonas Dohmen
- Department of General, Visceral, Thoracic and Vascular Surgery, University Hospital Bonn, Bonn, Germany
| | - Jan Arensmeyer
- Division of Thoracic Surgery, Department of General, Visceral, Thoracic and Vascular Surgery, University Hospital Bonn, Bonn, Germany
| | - Philipp Feodorovici
- Division of Thoracic Surgery, Department of General, Visceral, Thoracic and Vascular Surgery, University Hospital Bonn, Bonn, Germany
| | - Nils Sommer
- Department of General, Visceral, Thoracic and Vascular Surgery, University Hospital Bonn, Bonn, Germany
| | - Joachim Schmidt
- Division of Thoracic Surgery, Department of General, Visceral, Thoracic and Vascular Surgery, University Hospital Bonn, Bonn, Germany
- Department of Thoracic Surgery, Helios Hospital Bonn Rhein-Sieg, Bonn, Germany
| | - Jörg C Kalff
- Department of General, Visceral, Thoracic and Vascular Surgery, University Hospital Bonn, Bonn, Germany
| | - Hanno Matthaei
- Department of General, Visceral, Thoracic and Vascular Surgery, University Hospital Bonn, Bonn, Germany
| |
Collapse
|
15
|
Phan-Mai TA, Thai TT, Mai TQ, Vu KA, Mai CC, Nguyen DA. Validity of Machine Learning in Detecting Complicated Appendicitis in a Resource-Limited Setting: Findings from Vietnam. BIOMED RESEARCH INTERNATIONAL 2023; 2023:5013812. [PMID: 37090195 PMCID: PMC10121350 DOI: 10.1155/2023/5013812] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Revised: 11/30/2022] [Accepted: 03/28/2023] [Indexed: 04/25/2023]
Abstract
Background Complicated appendicitis, a potentially life-threatening condition, is common. However, the diagnosis of this condition is mainly based on physician's experiences and advanced diagnostic equipment. This study built and validated machine learning models to facilitate the detection of complicated appendicitis. Methods A retrospective cohort study was conducted based on medical charts of all patients undergoing a laparoscopic appendectomy at a city hospital during 2016-2020. The synthetic minority over-sampling technique (SMOTE) was used to adjust for the imbalance. Multiple classification approaches were used to train and validate models including support vector machine (SVM), decision tree (DT), K-nearest neighbor (KNN), logistic regression (LR), artificial neural network (ANN), and gradient boosting (GB). Results Among 1,950 patients included in the data analysis, there were 483 patients identified as having complicated appendicitis (24.8%). Based on data without SMOTE adjustment for imbalance, the accuracy levels and AUCs were high in all models using different parameters, ranging from 0.687 to 0.815. After adjusting for imbalance data using SMOTE, AUC and accuracy levels in the models using imbalance adjusted data were higher. Of these, the GB had all AUC and accuracy values of approximately 0.8 or more in both adjusted and unadjusted data. Conclusions Machine learning approaches including SVM, DT, logistic, KNN, ANN, and GB have a high level of validity in classifying patients with complicated appendicitis and patients without complicated appendicitis. Among these, GB had the highest level of validity and should be used or further validated. Our study indicates the beneficial potentials of machine learning techniques in a clinical setting in general and in the diagnosis of complicated appendicitis in particular.
Collapse
Affiliation(s)
- Tuong-Anh Phan-Mai
- General Surgery Department, Nhan dan Gia Dinh Hospital, 1 No Trang Long Street, Ward 7, Binh Thanh District, Ho Chi Minh City, Vietnam
| | - Truc Thanh Thai
- Department of Medical Statistics and Informatics, University of Medicine and Pharmacy at Ho Chi Minh City, 217 Hong Bang Street, Ward 11, District 5, Ho Chi Minh City, Vietnam
| | - Thanh Quoc Mai
- Department of Medical Statistics and Informatics, University of Medicine and Pharmacy at Ho Chi Minh City, 217 Hong Bang Street, Ward 11, District 5, Ho Chi Minh City, Vietnam
| | - Kiet Anh Vu
- Planning Department, Nhan dan Gia Dinh Hospital, 1 No Trang Long Street, Ward 7, Binh Thanh District, Ho Chi Minh City, Vietnam
| | - Cong Chi Mai
- General Surgery Department, Nhan dan Gia Dinh Hospital, 1 No Trang Long Street, Ward 7, Binh Thanh District, Ho Chi Minh City, Vietnam
| | - Dung Anh Nguyen
- General Surgery Department, Nhan dan Gia Dinh Hospital, 1 No Trang Long Street, Ward 7, Binh Thanh District, Ho Chi Minh City, Vietnam
| |
Collapse
|
16
|
Chen H, Huang L, Jiang X, Wang Y, Bian Y, Ma S, Liu X. Establishment and analysis of a disease risk prediction model for the systemic lupus erythematosus with random forest. Front Immunol 2022; 13:1025688. [PMID: 36405750 PMCID: PMC9667742 DOI: 10.3389/fimmu.2022.1025688] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 10/17/2022] [Indexed: 09/25/2023] Open
Abstract
Systemic lupus erythematosus (SLE) is a latent, insidious autoimmune disease, and with the development of gene sequencing in recent years, our study aims to develop a gene-based predictive model to explore the identification of SLE at the genetic level. First, gene expression datasets of SLE whole blood samples were collected from the Gene Expression Omnibus (GEO) database. After the datasets were merged, they were divided into training and validation datasets in the ratio of 7:3, where the SLE samples and healthy samples of the training dataset were 334 and 71, respectively, and the SLE samples and healthy samples of the validation dataset were 143 and 30, respectively. The training dataset was used to build the disease risk prediction model, and the validation dataset was used to verify the model identification ability. We first analyzed differentially expressed genes (DEGs) and then used Lasso and random forest (RF) to screen out six key genes (OAS3, USP18, RTP4, SPATS2L, IFI27 and OAS1), which are essential to distinguish SLE from healthy samples. With six key genes incorporated and five iterations of 10-fold cross-validation performed into the RF model, we finally determined the RF model with optimal mtry. The mean values of area under the curve (AUC) and accuracy of the models were over 0.95. The validation dataset was then used to evaluate the AUC performance and our model had an AUC of 0.948. An external validation dataset (GSE99967) with an AUC of 0.810, an accuracy of 0.836, and a sensitivity of 0.921 was used to assess the model's performance. The external validation dataset (GSE185047) of all SLE patients yielded an SLE sensitivity of up to 0.954. The final high-throughput RF model had a mean value of AUC over 0.9, again showing good results. In conclusion, we identified key genetic biomarkers and successfully developed a novel disease risk prediction model for SLE that can be used as a new SLE disease risk prediction aid and contribute to the identification of SLE.
Collapse
Affiliation(s)
- Huajian Chen
- School of Public Health and Management, Wenzhou Medical University, Wenzhou, China
| | - Li Huang
- School of Public Health and Management, Wenzhou Medical University, Wenzhou, China
| | - Xinyue Jiang
- School of Public Health and Management, Wenzhou Medical University, Wenzhou, China
| | - Yue Wang
- School of Public Health and Management, Wenzhou Medical University, Wenzhou, China
| | - Yan Bian
- School of Public Health and Management, Wenzhou Medical University, Wenzhou, China
| | - Shumei Ma
- School of Public Health and Management, Wenzhou Medical University, Wenzhou, China
| | - Xiaodong Liu
- School of Public Health and Management, Wenzhou Medical University, Wenzhou, China
- South Zhejiang Institute of Radiation Medicine and Nuclear Technology, Wenzhou Medical University, Wenzhou, China
- Key Laboratory of Watershed Science and Health of Zhejiang Province, Wenzhou Medical University, Wenzhou, China
| |
Collapse
|
17
|
Tracking Health, Performance and Recovery in Athletes Using Machine Learning. Sports (Basel) 2022; 10:sports10100160. [PMID: 36287773 PMCID: PMC9611450 DOI: 10.3390/sports10100160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Revised: 10/07/2022] [Accepted: 10/17/2022] [Indexed: 11/30/2022] Open
Abstract
Training and competitive periods can temporarily impair the performance of an athlete. This disruption can be short- or long-term, lasting up to several days. We analyzed the health indicators of 3661 athletes during an in-depth medical examination. At the time of inclusion in the study, the athletes were healthy. Instrumental examinations (fluorography, ultrasound examination of the abdominal cavity and pelvic organs, echocardiography, electrocardiography, and stress testing “to failure”), laboratory examinations (general urinalysis and biochemical and general clinical blood analysis), and examinations by specialists (ophthalmologist, otolaryngologist, surgeon, cardiologist, neurologist, dentist, gynecologist (women), endocrinologist, and therapist) were performed. This study analyzed the significance of determining the indicators involved in the implementation of the “catabolism” and “anabolism” phenotypes using the random forest and multinomial logistic regression machine learning methods. The use of decision forest and multinomial regression models made it possible to identify the most significant indicators of blood and urine biochemistry for the analysis of phenotypes as a characterization of the effectiveness of recovery processes in the post-competitive period in athletes. We found that the parameters of muscle metabolism, such as aspartate aminotransferase, creatine kinase, lactate dehydrogenase, and alanine aminotransferase levels, and the parameters of the ornithine cycle, such as creatinine, urea acid, and urea levels, made the most significant contribution to the classification of two types of metabolism: catabolism and anabolism.
Collapse
|
18
|
Zhang M, Ke B, Zhuo H, Guo B. Diagnostic model based on bioinformatics and machine learning to distinguish Kawasaki disease using multiple datasets. BMC Pediatr 2022; 22:512. [PMID: 36042431 PMCID: PMC9425821 DOI: 10.1186/s12887-022-03557-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Accepted: 08/17/2022] [Indexed: 12/03/2022] Open
Abstract
Background Kawasaki disease (KD), characterized by systemic vasculitis, is the leading cause of acquired heart disease in children. Herein, we developed a diagnostic model, with some prognosis ability, to help distinguish children with KD. Methods Gene expression datasets were downloaded from Gene Expression Omnibus (GEO), and gene sets with a potential pathogenic mechanism in KD were identified using differential expressed gene (DEG) screening, pathway enrichment analysis, random forest (RF) screening, and artificial neural network (ANN) construction. Results We extracted 2,017 DEGs (1,130 with upregulated and 887 with downregulated expression) from GEO. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses showed that the DEGs were significantly enriched in innate/adaptive immune response-related processes. Subsequently, the results of weighted gene co-expression network analysis and DEG screening were combined and, using RF and ANN, a model with eight genes (VPS9D1, CACNA1E, SH3GLB1, RAB32, ADM, GYG1, PGS1, and HIST2H2AC) was constructed. Classification results of the new model for KD diagnosis showed excellent performance for different datasets, including those of patients with KD, convalescents, and healthy individuals, with area under the curve values of 1, 0.945, and 0.95, respectively. Conclusions We used machine learning methods to construct and validate a diagnostic model using multiple bioinformatic datasets, and identified molecules expected to serve as new biomarkers for or therapeutic targets in KD. Supplementary Information The online version contains supplementary material available at 10.1186/s12887-022-03557-y.
Collapse
Affiliation(s)
- Mengyi Zhang
- Department of Laboratory Medicine, West China Second University Hospital, Sichuan University, No. 20, Section 3, Renmin South Road, Chengdu, 610041, PR, Sichuan Province, China.,Key Laboratory of Birth Defects and Related Diseases of Women and Children (Sichuan University), Ministry of Education, Chengdu, China
| | - Bocuo Ke
- Department of Laboratory Medicine, West China Second University Hospital, Sichuan University, No. 20, Section 3, Renmin South Road, Chengdu, 610041, PR, Sichuan Province, China.,Key Laboratory of Birth Defects and Related Diseases of Women and Children (Sichuan University), Ministry of Education, Chengdu, China
| | - Huichuan Zhuo
- Department of Laboratory Medicine, West China Second University Hospital, Sichuan University, No. 20, Section 3, Renmin South Road, Chengdu, 610041, PR, Sichuan Province, China.,Key Laboratory of Birth Defects and Related Diseases of Women and Children (Sichuan University), Ministry of Education, Chengdu, China
| | - Binhan Guo
- Department of Laboratory Medicine, West China Second University Hospital, Sichuan University, No. 20, Section 3, Renmin South Road, Chengdu, 610041, PR, Sichuan Province, China. .,Key Laboratory of Birth Defects and Related Diseases of Women and Children (Sichuan University), Ministry of Education, Chengdu, China.
| |
Collapse
|
19
|
Deep Learning Artificial Intelligence to Predict the Need for Tracheostomy in Patients of Deep Neck Infection Based on Clinical and Computed Tomography Findings—Preliminary Data and a Pilot Study. Diagnostics (Basel) 2022; 12:diagnostics12081943. [PMID: 36010293 PMCID: PMC9406478 DOI: 10.3390/diagnostics12081943] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 08/08/2022] [Accepted: 08/10/2022] [Indexed: 12/02/2022] Open
Abstract
Background: Deep neck infection (DNI) can lead to airway obstruction. Rather than intubation, some patients need tracheostomy to secure the airway. However, no study has used deep learning (DL) artificial intelligence (AI) to predict the need for tracheostomy in DNI patients. Thus, the purpose of this study was to develop a DL framework to predict the need for tracheostomy in DNI patients. Methods: 392 patients with DNI were enrolled in this study between August 2016 and April 2022; 80% of the patients (n = 317) were randomly assigned to a training group for model validation, and the remaining 20% (n = 75) were assigned to the test group to determine model accuracy. The k-nearest neighbor method was applied to analyze the clinical and computed tomography (CT) data of the patients. The predictions of the model with regard to the need for tracheostomy were compared with actual decisions made by clinical experts. Results: No significant differences were observed in clinical or CT parameters between the training group and test groups. The DL model yielded a prediction accuracy of 78.66% (59/75 cases). The sensitivity and specificity values were 62.50% and 80.60%, respectively. Conclusions: We demonstrated a DL framework to predict the need for tracheostomy in DNI patients based on clinical and CT data. The model has potential for clinical application; in particular, it may assist less experienced clinicians to determine whether tracheostomy is necessary in cases of DNI.
Collapse
|
20
|
Li L, Wang L, Lu L, Zhu T. Machine learning prediction of postoperative unplanned 30-day hospital readmission in older adult. Front Mol Biosci 2022; 9:910688. [PMID: 36032677 PMCID: PMC9399440 DOI: 10.3389/fmolb.2022.910688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 07/06/2022] [Indexed: 11/26/2022] Open
Abstract
Background: Although unplanned hospital readmission is an important indicator for monitoring the perioperative quality of hospital care, few published studies of hospital readmission have focused on surgical patient populations, especially in the elderly. We aimed to investigate if machine learning approaches can be used to predict postoperative unplanned 30-day hospital readmission in old surgical patients. Methods: We extracted demographic, comorbidity, laboratory, surgical, and medication data of elderly patients older than 65 who underwent surgeries under general anesthesia in West China Hospital, Sichuan University from July 2019 to February 2021. Different machine learning approaches were performed to evaluate whether unplanned 30-day hospital readmission can be predicted. Model performance was assessed using the following metrics: AUC, accuracy, precision, recall, and F1 score. Calibration of predictions was performed using Brier Score. A feature ablation analysis was performed, and the change in AUC with the removal of each feature was then assessed to determine feature importance. Results: A total of 10,535 unique surgeries and 10,358 unique surgical elderly patients were included. The overall 30-day unplanned readmission rate was 3.36%. The AUCs of the six machine learning algorithms predicting postoperative 30-day unplanned readmission ranged from 0.6865 to 0.8654. The RF + XGBoost algorithm overall performed the best with an AUC of 0.8654 (95% CI, 0.8484–0.8824), accuracy of 0.9868 (95% CI, 0.9834–0.9902), precision of 0.3960 (95% CI, 0.3854–0.4066), recall of 0.3184 (95% CI, 0.259–0.3778), and F1 score of 0.4909 (95% CI, 0.3907–0.5911). The Brier scores of the six machine learning algorithms predicting postoperative 30-day unplanned readmission ranged from 0.3721 to 0.0464, with RF + XGBoost showing the best calibration capability. The most five important features of RF + XGBoost were operation duration, white blood cell count, BMI, total bilirubin concentration, and blood glucose concentration. Conclusion: Machine learning algorithms can accurately predict postoperative unplanned 30-day readmission in elderly surgical patients.
Collapse
Affiliation(s)
- Linji Li
- Department of Anesthesiology, West China Hospital, Sichuan University and The Research Units of West China (2018RU012), Chinese Academy of Medical Sciences, Chengdu, China
- Department of Anesthesiology, The Second Clinical Medical College, North Sichuan Medical College, Nanchong Central Hospital, Nanchong, China
| | - Linna Wang
- College of Computer Science, Sichuan University, Chengdu, China
| | - Li Lu
- College of Computer Science, Sichuan University, Chengdu, China
- *Correspondence: Li Lu, ; Tao Zhu,
| | - Tao Zhu
- Department of Anesthesiology, West China Hospital, Sichuan University and The Research Units of West China (2018RU012), Chinese Academy of Medical Sciences, Chengdu, China
- *Correspondence: Li Lu, ; Tao Zhu,
| |
Collapse
|
21
|
Vohra R, Hussain A, Dudyala AK, Pahareeya J, Khan W. Multi-class classification algorithms for the diagnosis of anemia in an outpatient clinical setting. PLoS One 2022; 17:e0269685. [PMID: 35793343 PMCID: PMC9258850 DOI: 10.1371/journal.pone.0269685] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Accepted: 05/26/2022] [Indexed: 11/19/2022] Open
Abstract
Anemia is one of the most pressing public health issues in the world with iron deficiency a major public health issue worldwide. The highest prevalence of anemia is in developing countries. The complete blood count is a blood test used to diagnose the prevalence of anemia. While earlier studies have framed the problem of diagnosis as a binary classification problem, this paper frames it as a multi class (three classes) classification problem with mild, moderate and severe classes. The three classes for the anemia classification (mild, moderate, severe) are so chosen as the world health organization (WHO) guidelines formalize this categorization based on the Haemoglobin (HGB) values of the chosen sample of patients in the Complete Blood Count (CBC) patient data set. Complete blood count test data was collected in an outpatient clinical setting in India. We used Feature selection with Majority voting to identify the key attributes in the input patient data set. In addition, since the original data set was imbalanced we used Synthetic Minority Oversampling Technique (SMOTE) to balance the data set. Four data sets including the original data set were used to perform the data experiments. Six standard machine learning algorithms were utilised to test our four data sets, performing multi class classification. Benchmarking these algorithms was performed and tabulated using both10 fold cross validation and hold out methods. The experimental results indicated that multilayer perceptron network was predominantly giving good recall values across mild and moderate class which are early and middle stages of the disease. With a good prediction model at early stages, medical intervention can provide preventive measure from further deterioration into severe stage or recommend the use of supplements to overcome this problem.
Collapse
Affiliation(s)
- Rajan Vohra
- School of Creative Technologies, University of Bolton, Bolton, United Kingdom
- * E-mail:
| | - Abir Hussain
- Department of Electrical Engineering, University of Sharjah, Sharjah, UAE
- Department of Computer Science, Liverpool John Moore’s University, Liverpool, United Kingdom
| | - Anil Kumar Dudyala
- Department of Computer Science, National Institute of Technology Patna (NIT Patna), Patna, India
| | - Jankisharan Pahareeya
- Department of Information Technology, Rustamji Institute of Technology, BSF Academy, Tekanpur, Gwalior, India
| | - Wasiq Khan
- Department of Computer Science, Liverpool John Moore’s University, Liverpool, United Kingdom
| |
Collapse
|
22
|
Luo M, Feng Y, Luo J, Li X, Han J, Li T. Predictive performances of 6 data mining techniques for obstructive sleep apnea-hypopnea syndrome. Medicine (Baltimore) 2022; 101:e29724. [PMID: 35776998 PMCID: PMC9239632 DOI: 10.1097/md.0000000000029724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
PURPOSE This study compared the effects of 6 types of obstructive sleep apnea-hypopnea syndrome (OSAHS) prediction models to develop a reference for selecting OSAHS data mining tools in clinical practice. METHODS This cross-sectional study included 401 cases. They were randomly divided into 2 groups: training (70%) and testing (30%). Logistic regression, a Bayesian network, an artificial neural network, a support vector learning machine, C5.0, and a classification and regression tree were each adopted to establish 6 prediction models. After training, the 6 models were used to test the remaining samples and calculate the correct and error rates of each model. RESULTS Twenty-one input variables for which the difference between the patient and nonpatient groups was statistically significant were considered. The models found the abdominal circumference, neck circumference, and nocturia ≥2 per night to be the most important variables. The support vector machine, neural network, and C5.0 models performed better than the classification and regression tree, Bayesian network, and logistic regression models. CONCLUSIONS In terms of predicting the risk of OSAHS, the support vector machine, neural network, and C5.0 were superior to the classification and regression tree, Bayesian network, and logistic regression models. However, such results were obtained based on the data of a single center, so they need to be further validated by other institutions.
Collapse
Affiliation(s)
- Miao Luo
- Department of Respiratory Medicine, Hospital Affiliated Guilin Medical College, Guilin, China
| | - Yuan Feng
- Sleep Disorder Center, Nanfang Hospital, Southern Medical University, Guangzhou, China
| | - Jingying Luo
- Department of Dermatology, The Second Affiliated Hospital of Guilin Medical University, Guilin, China
| | - XiaoLin Li
- Sleep Disorder Center, Nanfang Hospital, Southern Medical University, Guangzhou, China
| | - JianFang Han
- Sleep Disorder Center, Nanfang Hospital, Southern Medical University, Guangzhou, China
| | - Taoping Li
- Sleep Disorder Center, Nanfang Hospital, Southern Medical University, Guangzhou, China
- *Correspondence: Taoping Li, Department of Sleep Disorder Center, Nanfang Hospital, Southern Medical University, No. 1838 Guangzhou Dadao Bei, Guangzhou 510515, China (e-mail: )
| |
Collapse
|
23
|
Sun D, Peng H, Wu Z. Establishment and Analysis of a Combined Diagnostic Model of Alzheimer's Disease With Random Forest and Artificial Neural Network. Front Aging Neurosci 2022; 14:921906. [PMID: 35847663 PMCID: PMC9280980 DOI: 10.3389/fnagi.2022.921906] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2022] [Accepted: 06/01/2022] [Indexed: 12/31/2022] Open
Abstract
Alzheimer's disease (AD) is a neurodegenerative condition that causes cognitive decline over time. Because existing diagnostic approaches for AD are limited, improving upon previously established diagnostic models based on genetic biomarkers is necessary. Firstly, four AD gene expression datasets were collected from the Gene Expression Omnibus (GEO) database. Two datasets were used to establish diagnostic models, and the other two datasets were used to verify the model effect. We merged GSE5281 with GSE44771 as the training dataset and found 120 DEGs. Then, we used random forest (RF) to screen 6 key genes (KLF15, MAFF, ITPKB, SST, DDIT4, and NRXN3) as being critical for separating AD and normal samples. The weights of these key genes were measured, and a diagnostic model was created using an artificial neural network (ANN). The area under the curve (AUC) of the model is 0.953, while the accuracy is 0.914. In the final step, two validation datasets were utilized to assess AUC performance. In GSE109887, our model had an AUC of 0.854, and in GSE132903, it had an AUC of 0.810. To summarize, we successfully identified key gene biomarkers and developed a new AD diagnostic model.
Collapse
|
24
|
Fitzsimmons L, Dewan M, Dexheimer JW. Diversity in Machine Learning: A Systematic Review of Text-Based Diagnostic Applications. Appl Clin Inform 2022; 13:569-582. [PMID: 35613914 DOI: 10.1055/s-0042-1749119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022] Open
Abstract
OBJECTIVE As the storage of clinical data has transitioned into electronic formats, medical informatics has become increasingly relevant in providing diagnostic aid. The purpose of this review is to evaluate machine learning models that use text data for diagnosis and to assess the diversity of the included study populations. METHODS We conducted a systematic literature review on three public databases. Two authors reviewed every abstract for inclusion. Articles were included if they used or developed machine learning algorithms to aid in diagnosis. Articles focusing on imaging informatics were excluded. RESULTS From 2,260 identified papers, we included 78. Of the machine learning models used, neural networks were relied upon most frequently (44.9%). Studies had a median population of 661.5 patients, and diseases and disorders of 10 different body systems were studied. Of the 35.9% (N = 28) of papers that included race data, 57.1% (N = 16) of study populations were majority White, 14.3% were majority Asian, and 7.1% were majority Black. In 75% (N = 21) of papers, White was the largest racial group represented. Of the papers included, 43.6% (N = 34) included the sex ratio of the patient population. DISCUSSION With the power to build robust algorithms supported by massive quantities of clinical data, machine learning is shaping the future of diagnostics. Limitations of the underlying data create potential biases, especially if patient demographics are unknown or not included in the training. CONCLUSION As the movement toward clinical reliance on machine learning accelerates, both recording demographic information and using diverse training sets should be emphasized. Extrapolating algorithms to demographics beyond the original study population leaves large gaps for potential biases.
Collapse
Affiliation(s)
- Lane Fitzsimmons
- College of Agriculture and Life Science, Cornell University, Ithaca, New York, United States
| | - Maya Dewan
- Division of Critical Care Medicine, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio, United States.,Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States
| | - Judith W Dexheimer
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, Ohio, United States.,Division of Emergency Medicine; Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, Cincinnati, Ohio, United States
| |
Collapse
|
25
|
Xia J, Wang Z, Yang D, Li R, Liang G, Chen H, Heidari AA, Turabieh H, Mafarja M, Pan Z. Performance optimization of support vector machine with oppositional grasshopper optimization for acute appendicitis diagnosis. Comput Biol Med 2022; 143:105206. [PMID: 35101730 DOI: 10.1016/j.compbiomed.2021.105206] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 12/27/2021] [Accepted: 12/30/2021] [Indexed: 12/13/2022]
Abstract
Preoperative differentiation of complicated and uncomplicated appendicitis is challenging. The research goal was to construct a new intelligent diagnostic rule that is accurate, fast, noninvasive, and cost-effective, distinguishing between complicated and uncomplicated appendicitis. Overall, 298 patients with acute appendicitis from the Wenzhou Central Hospital were recruited, and information on their demographic characteristics, clinical findings, and laboratory data was retrospectively reviewed and applied in this study. First, the most significant variables, including C-reactive protein (CRP), heart rate, body temperature, and neutrophils discriminating complicated from uncomplicated appendicitis, were identified using random forest analysis. Second, an improved grasshopper optimization algorithm-based support vector machine was used to construct the diagnostic model to discriminate complicated appendicitis (CAP) from uncomplicated appendicitis (UAP). The resultant optimal model can produce an average of 83.56% accuracy, 81.71% sensitivity, 85.33% specificity, and 0.6732 Matthews correlation coefficients. Based on existing routinely available markers, the proposed intelligent diagnosis model is highly reliable. Thus, the model can potentially be used to assist doctors in making correct clinical decisions.
Collapse
Affiliation(s)
- Jianfu Xia
- Department of General Surgery, The Second Affiliated Hospital of Shanghai University (Wenzhou Central Hospital), Wenzhou, Zhejiang, 325000, China.
| | - Zhifei Wang
- Department of Hepatobiliary, Pancreatic and Minimally Invasive Surgery, Zhejiang Provincial People's Hospital, Hangzhou, 310014, China.
| | - Daqing Yang
- Department of General Surgery, The Second Affiliated Hospital of Shanghai University (Wenzhou Central Hospital), Wenzhou, Zhejiang, 325000, China.
| | - Rizeng Li
- Department of General Surgery, The Second Affiliated Hospital of Shanghai University (Wenzhou Central Hospital), Wenzhou, Zhejiang, 325000, China.
| | - Guoxi Liang
- Department of Information Technology, Wenzhou Polytechnic, Wenzhou, 325035, China.
| | - Huiling Chen
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China.
| | - Ali Asghar Heidari
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China.
| | - Hamza Turabieh
- Department of Information Technology, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif, 21944, Taif, Saudi Arabia.
| | - Majdi Mafarja
- Department of Computer Science, Birzeit University, Birzeit, 72439, Palestine.
| | - Zhifang Pan
- The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, PR China.
| |
Collapse
|
26
|
Henn J, Buness A, Schmid M, Kalff JC, Matthaei H. Machine learning to guide clinical decision-making in abdominal surgery-a systematic literature review. Langenbecks Arch Surg 2022; 407:51-61. [PMID: 34716472 PMCID: PMC8847247 DOI: 10.1007/s00423-021-02348-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Accepted: 10/03/2021] [Indexed: 12/16/2022]
Abstract
PURPOSE An indication for surgical therapy includes balancing benefits against risk, which remains a key task in all surgical disciplines. Decisions are oftentimes based on clinical experience while guidelines lack evidence-based background. Various medical fields capitalized the application of machine learning (ML), and preliminary research suggests promising implications in surgeons' workflow. Hence, we evaluated ML's contemporary and possible future role in clinical decision-making (CDM) focusing on abdominal surgery. METHODS Using the PICO framework, relevant keywords and research questions were identified. Following the PRISMA guidelines, a systemic search strategy in the PubMed database was conducted. Results were filtered by distinct criteria and selected articles were manually full text reviewed. RESULTS Literature review revealed 4,396 articles, of which 47 matched the search criteria. The mean number of patients included was 55,843. A total of eight distinct ML techniques were evaluated whereas AUROC was applied by most authors for comparing ML predictions vs. conventional CDM routines. Most authors (N = 30/47, 63.8%) stated ML's superiority in the prediction of benefits and risks of surgery. The identification of highly relevant parameters to be integrated into algorithms allowing a more precise prognosis was emphasized as the main advantage of ML in CDM. CONCLUSIONS A potential value of ML for surgical decision-making was demonstrated in several scientific articles. However, the low number of publications with only few collaborative studies between surgeons and computer scientists underpins the early phase of this highly promising field. Interdisciplinary research initiatives combining existing clinical datasets and emerging techniques of data processing may likely improve CDM in abdominal surgery in the future.
Collapse
Affiliation(s)
- Jonas Henn
- Department of General, Visceral, Thoracic and Vascular Surgery, University of Bonn, Bonn, Germany
| | - Andreas Buness
- Institute for Medical Biometry, Informatics and Epidemiology, University of Bonn, Bonn, Germany
- Institute for Genomic Statistics and Bioinformatics, University of Bonn, Bonn, Germany
| | - Matthias Schmid
- Institute for Medical Biometry, Informatics and Epidemiology, University of Bonn, Bonn, Germany
| | - Jörg C Kalff
- Department of General, Visceral, Thoracic and Vascular Surgery, University of Bonn, Bonn, Germany
| | - Hanno Matthaei
- Department of General, Visceral, Thoracic and Vascular Surgery, University of Bonn, Bonn, Germany.
| |
Collapse
|
27
|
Lin SY, Law KM, Yeh YC, Wu KC, Lai JH, Lin CH, Hsu WH, Lin CC, Kao CH. Applying Machine Learning to Carotid Sonographic Features for Recurrent Stroke in Patients With Acute Stroke. Front Cardiovasc Med 2022; 9:804410. [PMID: 35155629 PMCID: PMC8833232 DOI: 10.3389/fcvm.2022.804410] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 01/04/2022] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND Although carotid sonographic features have been used as predictors of recurrent stroke, few large-scale studies have explored the use of machine learning analysis of carotid sonographic features for the prediction of recurrent stroke. METHODS We retrospectively collected electronic medical records of enrolled patients from the data warehouse of China Medical University Hospital, a tertiary medical center in central Taiwan, from January 2012 to November 2018. We included patients who underwent a documented carotid ultrasound within 30 days of experiencing an acute first stroke during the study period. We classified these participants into two groups: those with non-recurrent stroke (those who has not been diagnosed with acute stroke again during the study period) and those with recurrent stoke (those who has been diagnosed with acute stroke during the study period). A total of 1,235 carotid sonographic parameters were analyzed. Data on the patients' demographic characteristics and comorbidities were also collected. Python 3.7 was used as the programming language, and the scikit-learn toolkit was used to complete the derivation and verification of the machine learning methods. RESULTS In total, 2,411 patients were enrolled in this study, of whom 1,896 and 515 had non-recurrent and recurrent stroke, respectively. After extraction, 43 features of carotid sonography (36 carotid sonographic parameters and seven transcranial color Doppler sonographic parameter) were analyzed. For predicting recurrent stroke, CatBoost achieved the highest area under the curve (0.844, CIs 95% 0.824-0.868), followed by the Light Gradient Boosting Machine (0.832, CIs 95% 0.813-0.851), random forest (0.819, CIs 95% 0.802-0.846), support-vector machine (0.759, CIs 95% 0.739-0.781), logistic regression (0.781, CIs 95% 0.764-0.800), and decision tree (0.735, CIs 95% 0.717-0.755) models. CONCLUSION When using the CatBoost model, the top three features for predicting recurrent stroke were determined to be the use of anticoagulation medications, the use of NSAID medications, and the resistive index of the left subclavian artery. The CatBoost model demonstrated efficiency and achieved optimal performance in the predictive classification of non-recurrent and recurrent stroke.
Collapse
Affiliation(s)
- Shih-Yi Lin
- Graduate Institute of Biomedical Sciences, College of Medicine, China Medical University, Taichung, Taiwan
- Division of Nephrology and Kidney Institute, China Medical University Hospital, Taichung, Taiwan
| | - Kin-Man Law
- Center of Augmented Intelligence in Healthcare, China Medical University Hospital, Taichung, Taiwan
- Department of Computer Science and Engineering, National Chung Hsing University, Taichung, Taiwan
| | - Yi-Chun Yeh
- Center of Augmented Intelligence in Healthcare, China Medical University Hospital, Taichung, Taiwan
| | - Kuo-Chen Wu
- Center of Augmented Intelligence in Healthcare, China Medical University Hospital, Taichung, Taiwan
- Graduate Institute of Biomedical Electronics and Bioinformatics, National Taiwan University, Taipei, Taiwan
| | - Jhih-Han Lai
- Division of Nephrology and Kidney Institute, China Medical University Hospital, Taichung, Taiwan
| | - Chih-Hsueh Lin
- Graduate Institute of Biomedical Sciences, College of Medicine, China Medical University, Taichung, Taiwan
- Department of Family Medicine, China Medical University Hospital, Taichung, Taiwan
| | - Wu-Huei Hsu
- Graduate Institute of Biomedical Sciences, College of Medicine, China Medical University, Taichung, Taiwan
- Division of Pulmonary and Critical Care Medicine, China Medical University Hospital and China Medical University, Taichung, Taiwan
| | - Cheng-Chieh Lin
- Graduate Institute of Biomedical Sciences, College of Medicine, China Medical University, Taichung, Taiwan
- Department of Family Medicine, China Medical University Hospital, Taichung, Taiwan
| | - Chia-Hung Kao
- Graduate Institute of Biomedical Sciences, College of Medicine, China Medical University, Taichung, Taiwan
- Center of Augmented Intelligence in Healthcare, China Medical University Hospital, Taichung, Taiwan
- Department of Nuclear Medicine and Positron Emission Tomography Center, China Medical University Hospital, Taichung, Taiwan
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, Taiwan
| |
Collapse
|
28
|
Bektaş M, Tuynman JB, Costa Pereira J, Burchell GL, van der Peet DL. Machine Learning Algorithms for Predicting Surgical Outcomes after Colorectal Surgery: A Systematic Review. World J Surg 2022; 46:3100-3110. [PMID: 36109367 PMCID: PMC9636121 DOI: 10.1007/s00268-022-06728-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/22/2022] [Indexed: 01/14/2023]
Abstract
BACKGROUND Machine learning (ML) has been introduced in various fields of healthcare. In colorectal surgery, the role of ML has yet to be reported. In this systematic review, an overview of machine learning models predicting surgical outcomes after colorectal surgery is provided. METHODS Databases PubMed, EMBASE, Cochrane, and Web of Science were searched for studies using machine learning models for patients undergoing colorectal surgery. To be eligible for inclusion, studies needed to apply machine learning models for patients undergoing colorectal surgery. Absence of machine learning or colorectal surgery or studies reporting on reviews, children, study abstracts were excluded. The Probast risk of bias tool was used to evaluate the methodological quality of machine learning models. RESULTS A total of 1821 studies were analysed, resulting in the inclusion of 31 articles. A vast proportion of ML algorithms have been used to predict the course of disease and response to neoadjuvant chemoradiotherapy. Radiomics have been applied most frequently, along with predictive accuracies up to 91%. However, most studies included a retrospective study design without external validation or calibration. CONCLUSIONS Machine learning models have shown promising potential in predicting surgical outcomes after colorectal surgery. However, large-scale data is warranted to bridge the gap between calibration and external validation. Clinical implementation is needed to demonstrate the contribution of ML within daily practice.
Collapse
Affiliation(s)
- Mustafa Bektaş
- Department of Surgery, Amsterdam UMC Location Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, The Netherlands
| | - Jurriaan B. Tuynman
- Department of Surgery, Amsterdam UMC Location Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, The Netherlands
| | - Jaime Costa Pereira
- Department of Computer Science, Vrije Universiteit Amsterdam, De Boelelaan 1105, 1081 HV Amsterdam, The Netherlands
| | - George L. Burchell
- Medical Library, Amsterdam UMC Location Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, The Netherlands
| | - Donald L. van der Peet
- Department of Surgery, Amsterdam UMC Location Vrije Universiteit Amsterdam, De Boelelaan 1117, 1081 HV Amsterdam, The Netherlands
| |
Collapse
|
29
|
Jiang N, Zhao Z, Xu P. Predictive Analysis and Evaluation Model of Chronic Liver Disease Based on BP Neural Network with Improved Ant Colony Algorithm. JOURNAL OF HEALTHCARE ENGINEERING 2021; 2021:3927551. [PMID: 34820075 PMCID: PMC8608507 DOI: 10.1155/2021/3927551] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 10/24/2021] [Accepted: 10/30/2021] [Indexed: 11/28/2022]
Abstract
Timely prediction of the mechanism and characteristics of chronic liver disease using next-generation information technology is an effective way to improve the diagnosis rate of chronic liver disease. In this paper, we have proposed a modified backpropagation (BP) neural network with improved ant colony optimization algorithm to process multiple index attribute items describing chronic liver disease and construct a chronic liver disease assessment model. The proposed model is very effective in detecting chronic liver disease on time with acceptable level of accuracy and precision ratio. To verify these claims, the proposed scheme is checked experimentally where 125 groups of 20-dimensional medical test index data items of patients with chronic liver disease were analyzed. Moreover, 13-dimensional index items were preferentially selected as test index attribute items with high sensitivity to chronic liver disease using well-known ROC curves. The 13-dimensional index items were reduced to 5-dimensional comprehensive data items by principal component analysis. The proposed neural network-based model was trained with 115 sets of test indicator sample sets, and the remaining 10 sets of sample sets were used as test samples. Compared with the original 20-dimensional data as the neural network input, the proposed model not only reduces the complexity but also improves the prediction accuracy by 15.07%.
Collapse
Affiliation(s)
- Na Jiang
- Department of Gastroenterology, Hankou Hospital, Wuhan 430012, Hube, China
| | - Zhiwei Zhao
- Department of Gastroenterology, Hankou Hospital, Wuhan 430012, Hube, China
| | - Pan Xu
- Department of Gastroenterology, Hankou Hospital, Wuhan 430012, Hube, China
| |
Collapse
|
30
|
Abstract
OBJECTIVE Child undernutrition is a global public health problem with serious implications. In this study, we estimate predictive algorithms for the determinants of childhood stunting by using various machine learning (ML) algorithms. DESIGN This study draws on data from the Ethiopian Demographic and Health Survey of 2016. Five ML algorithms including eXtreme gradient boosting, k-nearest neighbours (k-NN), random forest, neural network and the generalised linear models were considered to predict the socio-demographic risk factors for undernutrition in Ethiopia. SETTING Households in Ethiopia. PARTICIPANTS A total of 9471 children below 5 years of age participated in this study. RESULTS The descriptive results show substantial regional variations in child stunting, wasting and underweight in Ethiopia. Also, among the five ML algorithms, xgbTree algorithm shows a better prediction ability than the generalised linear mixed algorithm. The best predicting algorithm (xgbTree) shows diverse important predictors of undernutrition across the three outcomes which include time to water source, anaemia history, child age greater than 30 months, small birth size and maternal underweight, among others. CONCLUSIONS The xgbTree algorithm was a reasonably superior ML algorithm for predicting childhood undernutrition in Ethiopia compared to other ML algorithms considered in this study. The findings support improvement in access to water supply, food security and fertility regulation, among others, in the quest to considerably improve childhood nutrition in Ethiopia.
Collapse
|
31
|
To Scan or Not to Scan: Development of a Clinical Decision Support Tool to Determine if Imaging Would Aid in the Diagnosis of Appendicitis. World J Surg 2021; 45:3056-3064. [PMID: 34370058 DOI: 10.1007/s00268-021-06246-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/03/2021] [Indexed: 10/20/2022]
Abstract
BACKGROUND Appendicitis is one of the most common surgically treated diseases in the world. CT scans are often over-utilized and ordered before a surgeon has evaluated the patient. Our aim was to develop a tool using machine learning (ML) algorithms that would help determine if there would be benefit in obtaining a CT scan prior to surgeon consultation. METHODS Retrospective chart review of 100 randomly selected cases who underwent appendectomy and 100 randomly selected controls was completed. Variables included components of the patient's history, laboratory values, CT readings, and pathology. Pathology was used as the gold standard for appendicitis diagnosis. All variables were then used to build the ML algorithms. Random Forest (RF), Support Vector Machine (SVM), and Bayesian Network Classifiers (BNC) models with and without CT scan results were trained and compared to CT scan results alone and the Alvarado score using area under the Receiver Operator Curve (ROC), sensitivity, and specificity measures as well as calibration indices from 500 bootstrapped samples. RESULTS Among the cases that underwent appendectomy, 88% had pathology-confirmed appendicitis. All the ML algorithms had better sensitivity, specificity, and ROC than the Alvarado score. SVM with and without CT had the best indices and could predict if imaging would aid in appendicitis diagnosis. CONCLUSION This study demonstrated that SVM with and without CT results can be used for selective imaging in the diagnosis of appendicitis. This study serves as the initial step and proof-of-concept to externally validate these results with larger and more diverse patient population.
Collapse
|
32
|
Kim S, Lee HK, Lee K. Detecting suicidal risk using MMPI-2 based on machine learning algorithm. Sci Rep 2021; 11:15310. [PMID: 34321546 PMCID: PMC8319391 DOI: 10.1038/s41598-021-94839-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Accepted: 07/13/2021] [Indexed: 12/25/2022] Open
Abstract
Minnesota Multiphasic Personality Inventory-2 (MMPI-2) is a widely used tool for early detection of psychological maladjustment and assessing the level of adaptation for a large group in clinical settings, schools, and corporations. This study aims to evaluate the utility of MMPI-2 in assessing suicidal risk using the results of MMPI-2 and suicidal risk evaluation. A total of 7,824 datasets collected from college students were analyzed. The MMPI-2-Resturcutred Clinical Scales (MMPI-2-RF) and the response results for each question of the Mini International Neuropsychiatric Interview (MINI) suicidality module were used. For statistical analysis, random forest and K-Nearest Neighbors (KNN) techniques were used with suicidal ideation and suicide attempt as dependent variables and 50 MMPI-2 scale scores as predictors. On applying the random forest method to suicidal ideation and suicidal attempts, the accuracy was 92.9% and 95%, respectively, and the Area Under the Curves (AUCs) were 0.844 and 0.851, respectively. When the KNN method was applied, the accuracy was 91.6% and 94.7%, respectively, and the AUCs were 0.722 and 0.639, respectively. The study confirmed that machine learning using MMPI-2 for a large group provides reliable accuracy in classifying and predicting the subject's suicidal ideation and past suicidal attempts.
Collapse
Affiliation(s)
- Sunhae Kim
- Department of Psychiatry, Hanyang University Medical Center, 222-1, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Republic of Korea
| | - Hye-Kyung Lee
- Department of Nursing, College of Nursing and Health, Kongju National University, Gongju, Republic of Korea
| | - Kounseok Lee
- Department of Psychiatry, Hanyang University Medical Center, 222-1, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Republic of Korea.
| |
Collapse
|
33
|
A multiple combined method for rebalancing medical data with class imbalances. Comput Biol Med 2021; 134:104527. [PMID: 34091384 DOI: 10.1016/j.compbiomed.2021.104527] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Revised: 05/24/2021] [Accepted: 05/25/2021] [Indexed: 11/24/2022]
Abstract
Most classification algorithms assume that classes are in a balanced state. However, datasets with class imbalances are everywhere. The classes of actual medical datasets are imbalanced, severely impacting identification models and even sacrificing the classification accuracy of the minority class, even though it is the most influential and representative. The medical field has irreversible characteristics. Its tolerance rate for misjudgment is relatively low, and errors may cause irreparable harm to patients. Therefore, this study proposes a multiple combined method to rebalance medical data featuring class imbalances. The combined methods include (1) resampling methods (synthetic minority oversampling technique [SMOTE] and undersampling [US]), (2) particle swarm optimization (PSO), and (3) MetaCost. This study conducted two experiments with nine medical datasets to verify and compare the proposed method with the listing methods. A decision tree is used to generate decision rules for easy understanding of the research results. The results show that (1) the proposed method with ensemble learning can improve the area under a receiver operating characteristic curve (AUC), recall, precision, and F1 metrics; (2) MetaCost can increase sensitivity; (3) SMOTE can effectively enhance AUC; (4) US can improve sensitivity, F1, and misclassification costs in data with a high-class imbalance ratio; and (5) PSO-based attribute selection can increase sensitivity and reduce data dimension. Finally, we suggest that the dataset with an imbalanced ratio >9 must use the US results to make the decision. As the imbalanced ratio is < 9, the decision-maker can simultaneously consider the results of SMOTE and US to identify the best decision.
Collapse
|
34
|
Dukhi N, Sewpaul R, Derrick Sekgala M, Olawale Awe O. Artificial Intelligence Approach for Analyzing Anaemia Prevalence in Children and Adolescents in BRICS Countries: A Review. CURRENT RESEARCH IN NUTRITION AND FOOD SCIENCE JOURNAL 2021. [DOI: 10.12944/crnfsj.9.1.01] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Anemia prevalence, especially among children and adolescents, is a serious public health burden in the BRICS countries. This article gives an overview of the current anaemia status in children and adolescents in three BRICS countries, as part of a study that utilizes an artificial intelligence approach for analyzing anaemia prevalence in children and adolescents in South Africa, India and Russia. It posits that the use of machine learning in this area of health research is still novel. The weightage assessment of the crosslink between anaemia risk indicators using a machine learning approach will assist policy makers in identifying the areas of priority to intervene in the BRICS participating countries. Health interventions utilizing artificial intelligence and more specifically, machine learning techniques, remains nascent in LMICs but could lead to improved health outcomes.
Collapse
Affiliation(s)
- Natisha Dukhi
- 1Human and Social Capabilities Division, Human Sciences Research Council, Merchant House, 116-118 Buitengracht Street, Cape Town, South Africa
| | - Ronel Sewpaul
- 1Human and Social Capabilities Division, Human Sciences Research Council, Merchant House, 116-118 Buitengracht Street, Cape Town, South Africa
| | - Machoene Derrick Sekgala
- 1Human and Social Capabilities Division, Human Sciences Research Council, Merchant House, 116-118 Buitengracht Street, Cape Town, South Africa
| | - Olushina Olawale Awe
- 2Department of Mathematical Sciences, Anchor University Lagos, Lagos, Nigeria. 3Institute of Mathematics and Statistics, Federal University of Bahia (UFBA), Salvador, Brazil
| |
Collapse
|
35
|
Prokop V, Stejskal J, Klimova V, Zitek V. The role of foreign technologies and R&D in innovation processes within catching-up CEE countries. PLoS One 2021; 16:e0250307. [PMID: 33886616 PMCID: PMC8061943 DOI: 10.1371/journal.pone.0250307] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Accepted: 04/04/2021] [Indexed: 11/18/2022] Open
Abstract
Prior research showed that there is a growing consensus among researchers, which point out a key role of external knowledge sources such as external R&D and technologies in enhancing firms´ innovation. However, firms´ from catching-up Central and Eastern European (CEE) countries have already shown in the past that their innovation models differ from those applied, for example, in Western Europe. This study therefore introduces a novel two-staged model combining artificial neural networks and random forests to reveal the importance of internal and external factors influencing firms´ innovation performance in the case of 3,361 firms from six catching-up CEE countries (Czech Republic, Slovakia, Poland, Estonia, Latvia and Lithuania), by using the World Banks´ Enterprise Survey data from 2019. We confirm the hypothesis that innovators in the catching-up CEE countries depend more on internal knowledge sources and, moreover, that participation in the firms groups represents an important factor of firms´ innovation. Surprisingly, we reject the hypothesis that foreign technologies are a crucial source of external knowledge. This study contributes to the theories of open innovation and absorptive capacity in the context of selected CEE countries and provides several practical implications for firms.
Collapse
Affiliation(s)
- Viktor Prokop
- Science and Research Centre, Faculty of Economics and Administration, University of Pardubice, Pardubice, Czech Republic
| | - Jan Stejskal
- Institute of Economic Sciences, Faculty of Economics and Administration, University of Pardubice, Pardubice, Czech Republic
- * E-mail:
| | - Viktorie Klimova
- Department of Regional Economics and Administration, Faculty of Economics and Administration, Masaryk University, Brno, Czech Republic
| | - Vladimir Zitek
- Department of Regional Economics and Administration, Faculty of Economics and Administration, Masaryk University, Brno, Czech Republic
| |
Collapse
|
36
|
Pei X, Deng Q, Liu Z, Yan X, Sun W. Machine Learning Algorithms for Predicting Fatty Liver Disease. ANNALS OF NUTRITION AND METABOLISM 2021; 77:38-45. [PMID: 33849025 DOI: 10.1159/000513654] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/29/2020] [Accepted: 12/08/2020] [Indexed: 12/15/2022]
Abstract
BACKGROUND Fatty liver disease (FLD) has become a rampant condition. It is associated with a high rate of morbidity and mortality in a population. The condition is commonly referred as FLD. Early prediction of FLD would allow patients to take necessary preventive, diagnosis, and treatment. The main objective of this research is to develop a machine learning (ML) model to predict FLD that can help medics to classify individuals at high risk of FLD, make novel diagnosis, management, and prevention for FLD. METHODS Total of 3,419 subjects were recruited with 845 having been screened for FLD. Classification models were used in the detection of the disease. These models include logistic regression (LR), random forest (RF), artificial neural networks (ANNs), k-nearest neighbors (KNNs), extreme gradient boosting (XGBoost), and linear discriminant analysis (LDA). Predictive accuracy was assessed by area under curve (AUC), sensitivity, specificity, positive predictive value, and negative predictive value. RESULTS We demonstrated that ML models give more accurate predictions, the best accuracy reached to 0.9415 in the XGBoost model. Feature importance analysis not only confirmed some well-known FLD risk factors, but also demonstrated several novel features for predicting the risk of FLD, such as hemoglobin. CONCLUSION By implementing the XGBoost model, physicians can efficiently identify FLD in general patients; this would help in prevention, early treatment, and management of FLD.
Collapse
Affiliation(s)
- Xieyi Pei
- Department of Metabolism and Endocrinology, The Second Xiangya Hospital of Central South University, Changsha, China.,National Clinical Research Center for Metabolic Diseases, Changsha, China.,Key Laboratory of Diabetes Immunology, Central South University, Ministry of Education, Changsha, China.,Department of Metabolism and Endocrinology, Loudi Central Hospital, Loudi, China
| | - Qingqing Deng
- Department of Geriatrics, The First People's Hospital of Xiangtan City, Xiangtan, China.,Department of Clinical Medicine, Jishou University School of Medicine, Jishou, China
| | - Zhuo Liu
- Department of Metabolism and Endocrinology, Loudi Central Hospital, Loudi, China
| | - Xiang Yan
- Department of Metabolism and Endocrinology, The Second Xiangya Hospital of Central South University, Changsha, China.,National Clinical Research Center for Metabolic Diseases, Changsha, China.,Key Laboratory of Diabetes Immunology, Central South University, Ministry of Education, Changsha, China
| | - Weiping Sun
- Department of Geriatrics, The First People's Hospital of Xiangtan City, Xiangtan, China.,Department of Clinical Medicine, Jishou University School of Medicine, Jishou, China
| |
Collapse
|
37
|
Wu Q, Nasoz F, Jung J, Bhattarai B, Han MV, Greenes RA, Saag KG. Machine learning approaches for the prediction of bone mineral density by using genomic and phenotypic data of 5130 older men. Sci Rep 2021; 11:4482. [PMID: 33627720 PMCID: PMC7904941 DOI: 10.1038/s41598-021-83828-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Accepted: 02/09/2021] [Indexed: 02/07/2023] Open
Abstract
The study aimed to utilize machine learning (ML) approaches and genomic data to develop a prediction model for bone mineral density (BMD) and identify the best modeling approach for BMD prediction. The genomic and phenotypic data of Osteoporotic Fractures in Men Study (n = 5130) was analyzed. Genetic risk score (GRS) was calculated from 1103 associated SNPs for each participant after a comprehensive genotype imputation. Data were normalized and divided into a training set (80%) and a validation set (20%) for analysis. Random forest, gradient boosting, neural network, and linear regression were used to develop BMD prediction models separately. Ten-fold cross-validation was used for hyper-parameters optimization. Mean square error and mean absolute error were used to assess model performance. When using GRS and phenotypic covariates as the predictors, all ML models' performance and linear regression in BMD prediction were similar. However, when replacing GRS with the 1103 individual SNPs in the model, ML models performed significantly better than linear regression (with lasso regularization), and the gradient boosting model performed the best. Our study suggested that ML models, especially gradient boosting, can improve BMD prediction in genomic data.
Collapse
Affiliation(s)
- Qing Wu
- Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, 4505 Maryland Parkway, Las Vegas, NV, 89154-4009, USA.
- Department of Epidemiology and Biostatistics, School of Public Health, University of Nevada, Las Vegas, NV, USA.
| | - Fatma Nasoz
- Department of Computer Science, University of Nevada, Las Vegas, NV, USA
- The Lincy Institute, University of Nevada, Las Vegas, NV, USA
| | - Jongyun Jung
- Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, 4505 Maryland Parkway, Las Vegas, NV, 89154-4009, USA
- Department of Epidemiology and Biostatistics, School of Public Health, University of Nevada, Las Vegas, NV, USA
| | - Bibek Bhattarai
- Department of Computer Science, University of Nevada, Las Vegas, NV, USA
| | - Mira V Han
- Nevada Institute of Personalized Medicine, University of Nevada Las Vegas, 4505 Maryland Parkway, Las Vegas, NV, 89154-4009, USA
- School of Life Sciences, University of Nevada, Las Vegas, NV, USA
| | - Robert A Greenes
- College of Health Solutions, Arizona State University, Phoenix, AZ, USA
- Department of Health Science Research, Mayo Clinic, Scottsdale, AZ, USA
| | - Kenneth G Saag
- Department of Medicine, Division of Clinical Immunology and Rheumatology, the University of Alabama at Birmingham, Birmingham, AL, USA
| |
Collapse
|
38
|
Comparison of machine learning techniques to predict unplanned readmission following total shoulder arthroplasty. J Shoulder Elbow Surg 2021; 30:e50-e59. [PMID: 32868011 DOI: 10.1016/j.jse.2020.05.013] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Revised: 04/29/2020] [Accepted: 05/04/2020] [Indexed: 02/01/2023]
Abstract
BACKGROUND Machine learning (ML) techniques have been shown to successfully predict postoperative complications for high-volume orthopedic procedures such as hip and knee arthroplasty and to stratify patients for risk-adjusted bundled payments. The latter has not been done for more heterogeneous, lower-volume procedures such as total shoulder arthroplasty (TSA) with equally limited discussion around strategies to optimize the predictive ability of ML algorithms. The purpose of this study was to (1) assess which of 5 ML algorithms best predicts 30-day readmission, (2) test select ML strategies to optimize the algorithms, and (3) report on which patient variables contribute most to risk prediction in TSA across algorithms. METHODS We identified 9043 patients in the American College of Surgeons National Surgical Quality Improvement Database who underwent primary TSA between 2011 and 2015. Predictors included demographics, comorbidities, laboratory data, and intraoperative variables. The outcome of interest was 30-day unplanned readmission. Five ML algorithms-support-vector machine (SVM), logistic regression, random forest (RF), an adaptive boosting algorithm, and neural network-were trained on the derivation cohort (2011-2014 TSA patients) to predict 30-day unplanned readmission rates. After training, weights for each respective model were fixed and the classifiers were evaluated on the 2015 TSA cohort to simulate a prospective evaluation. C-statistic and f1 scores were used to assess the performance of each classifier. After evaluation, features were removed independently to assess which features most affected classifier performance. RESULTS The derivation and validation cohorts comprised 5857 and 3186 primary TSA patients, respectively, with similar demographics, comorbidities, and 30-day unplanned readmission rates (2.9% vs. 2.7%). Of the ML algorithms, SVM performed the worst with a c-statistic of 0.54 and an f1-score of 0.07, whereas the random-forest classifier performed the best with the highest c-statistic of 0.74 and an f1-score of 0.18. In addition, SVM was most sensitive to loss of single features, whereas the performance of RF did not dramatically decrease after loss of single features. Within the trained RF classifier, 5 variables achieved weights >0.5 in descending order: high bilirubin (>1.9 mg/dL), age >65, race, chronic obstructive pulmonary disease, and American Society of Anesthesiologists' scores ≥3. In our validation cohort, we observed a 2.7% readmission rate. From this cohort, using the RF classifier we were then able to identify 436 high-risk patients with a predicted risk score >0.6, of whom 36 were readmitted (readmission rate of 8.2%). CONCLUSION Predictive analytics algorithms can achieve acceptable prediction of unplanned readmission for TSA with the RF classifier outperforming other common algorithms.
Collapse
|
39
|
Marcinkevics R, Reis Wolfertstetter P, Wellmann S, Knorr C, Vogt JE. Using Machine Learning to Predict the Diagnosis, Management and Severity of Pediatric Appendicitis. Front Pediatr 2021; 9:662183. [PMID: 33996697 PMCID: PMC8116489 DOI: 10.3389/fped.2021.662183] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/31/2021] [Accepted: 04/01/2021] [Indexed: 01/07/2023] Open
Abstract
Background: Given the absence of consolidated and standardized international guidelines for managing pediatric appendicitis and the few strictly data-driven studies in this specific, we investigated the use of machine learning (ML) classifiers for predicting the diagnosis, management and severity of appendicitis in children. Materials and Methods: Predictive models were developed and validated on a dataset acquired from 430 children and adolescents aged 0-18 years, based on a range of information encompassing history, clinical examination, laboratory parameters, and abdominal ultrasonography. Logistic regression, random forests, and gradient boosting machines were used for predicting the three target variables. Results: A random forest classifier achieved areas under the precision-recall curve of 0.94, 0.92, and 0.70, respectively, for the diagnosis, management, and severity of appendicitis. We identified smaller subsets of 6, 17, and 18 predictors for each of targets that sufficed to achieve the same performance as the model based on the full set of 38 variables. We used these findings to develop the user-friendly online Appendicitis Prediction Tool for children with suspected appendicitis. Discussion: This pilot study considered the most extensive set of predictor and target variables to date and is the first to simultaneously predict all three targets in children: diagnosis, management, and severity. Moreover, this study presents the first ML model for appendicitis that was deployed as an open access easy-to-use online tool. Conclusion: ML algorithms help to overcome the diagnostic and management challenges posed by appendicitis in children and pave the way toward a more personalized approach to medical decision-making. Further validation studies are needed to develop a finished clinical decision support system.
Collapse
Affiliation(s)
| | - Patricia Reis Wolfertstetter
- Department of Pediatric Surgery and Pediatric Orthopedics, Hospital St. Hedwig of the Order of St. John of God, University Children's Hospital Regensburg (KUNO), Regensburg, Germany
| | - Sven Wellmann
- Division of Neonatology, Hospital St. Hedwig of the Order of St. John of God, University Children's Hospital Regensburg (KUNO), University of Regensburg, Regensburg, Germany
| | - Christian Knorr
- Department of Pediatric Surgery and Pediatric Orthopedics, Hospital St. Hedwig of the Order of St. John of God, University Children's Hospital Regensburg (KUNO), Regensburg, Germany
| | - Julia E Vogt
- Department of Computer Science, ETH Zurich, Zurich, Switzerland
| |
Collapse
|
40
|
Novaes MT, Ferreira de Carvalho OL, Guimarães Ferreira PH, Nunes Tiraboschi TL, Silva CS, Zambrano JC, Gomes CM, de Paula Miranda E, Abílio de Carvalho Júnior O, de Bessa Júnior J. Prediction of secondary testosterone deficiency using machine learning: A comparative analysis of ensemble and base classifiers, probability calibration, and sampling strategies in a slightly imbalanced dataset. INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2021.100538] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
|
41
|
Carlile M, Hurt B, Hsiao A, Hogarth M, Longhurst CA, Dameff C. Deployment of artificial intelligence for radiographic diagnosis of COVID-19 pneumonia in the emergency department. J Am Coll Emerg Physicians Open 2020; 1:1459-1464. [PMID: 33392549 PMCID: PMC7771783 DOI: 10.1002/emp2.12297] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Revised: 09/30/2020] [Accepted: 10/05/2020] [Indexed: 12/17/2022] Open
Abstract
OBJECTIVE The coronavirus disease 2019 pandemic has inspired new innovations in diagnosing, treating, and dispositioning patients during high census conditions with constrained resources. Our objective is to describe first experiences of physician interaction with a novel artificial intelligence (AI) algorithm designed to enhance physician abilities to identify ground-glass opacities and consolidation on chest radiographs. METHODS During the first wave of the pandemic, we deployed a previously developed and validated deep-learning AI algorithm for assisted interpretation of chest radiographs for use by physicians at an academic health system in Southern California. The algorithm overlays radiographs with "heat" maps that indicate pneumonia probability alongside standard chest radiographs at the point of care. Physicians were surveyed in real time regarding ease of use and impact on clinical decisionmaking. RESULTS Of the 5125 total visits and 1960 chest radiographs obtained in the emergency department (ED) during the study period, 1855 were analyzed by the algorithm. Among these, emergency physicians were surveyed for their experiences on 202 radiographs. Overall, 86% either strongly agreed or somewhat agreed that the intervention was easy to use in their workflow. Of the respondents, 20% reported that the algorithm impacted clinical decisionmaking. CONCLUSIONS To our knowledge, this is the first published literature evaluating the impact of medical imaging AI on clinical decisionmaking in the emergency department setting. Urgent deployment of a previously validated AI algorithm clinically was easy to use and was found to have an impact on clinical decision making during the predicted surge period of a global pandemic.
Collapse
Affiliation(s)
- Morgan Carlile
- Department of Emergency MedicineUC San Diego HealthSan DiegoCaliforniaUSA
| | - Brian Hurt
- Department of Radiology, UC San Diego HealthSan DiegoCaliforniaUSA
| | - Albert Hsiao
- Department of Radiology, UC San Diego HealthSan DiegoCaliforniaUSA
| | - Michael Hogarth
- Division of Biomedical InformaticsDepartment of MedicineUC San Diego HealthSan DiegoCaliforniaUSA
| | - Christopher A. Longhurst
- Division of Biomedical InformaticsDepartment of MedicineUC San Diego HealthSan DiegoCaliforniaUSA
| | - Christian Dameff
- Department of Emergency MedicineUC San Diego HealthSan DiegoCaliforniaUSA
- Division of Biomedical InformaticsDepartment of MedicineUC San Diego HealthSan DiegoCaliforniaUSA
| |
Collapse
|
42
|
Wu Q, Nasoz F, Jung J, Bhattarai B, Han MV. Machine Learning Approaches for Fracture Risk Assessment: A Comparative Analysis of Genomic and Phenotypic Data in 5130 Older Men. Calcif Tissue Int 2020; 107:353-361. [PMID: 32728911 PMCID: PMC7492432 DOI: 10.1007/s00223-020-00734-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Accepted: 07/18/2020] [Indexed: 12/22/2022]
Abstract
The study aims were to develop fracture prediction models by using machine learning approaches and genomic data, as well as to identify the best modeling approach for fracture prediction. The genomic data of Osteoporotic Fractures in Men, cohort Study (n = 5130), were analyzed. After a comprehensive genotype imputation, genetic risk score (GRS) was calculated from 1103 associated Single Nucleotide Polymorphisms for each participant. Data were normalized and split into a training set (80%) and a validation set (20%) for analysis. Random forest, gradient boosting, neural network, and logistic regression were used to develop prediction models for major osteoporotic fractures separately, with GRS, bone density, and other risk factors as predictors. In model training, the synthetic minority oversampling technique was used to account for low fracture rate, and tenfold cross-validation was employed for hyperparameters optimization. In the testing, the area under curve (AUC) and accuracy were used to assess the model performance. The McNemar test was employed to examine the accuracy difference between models. The results showed that the prediction performance of gradient boosting was the best, with AUC of 0.71 and an accuracy of 0.88, and the GRS ranked as the 7th most important variable in the model. The performance of random forest and neural network were also significantly better than that of logistic regression. This study suggested that improving fracture prediction in older men can be achieved by incorporating genetic profiling and by utilizing the gradient boosting approach. This result should not be extrapolated to women or young individuals.
Collapse
Affiliation(s)
- Qing Wu
- Nevada Institute of Personalized Medicine, University of Nevada, Las Vegas, 4505 Maryland Parkway, Las Vegas, NV, 89154-4009, USA.
- Department of Epidemiology and Biostatistics, School of Public Health, University of Nevada, Las Vegas, NV, USA.
| | - Fatma Nasoz
- Department of Computer Science, University of Nevada, Las Vegas, NV, USA
- The Lincy Institute, University of Nevada, Las Vegas, NV, USA
| | - Jongyun Jung
- Nevada Institute of Personalized Medicine, University of Nevada, Las Vegas, 4505 Maryland Parkway, Las Vegas, NV, 89154-4009, USA
- Department of Epidemiology and Biostatistics, School of Public Health, University of Nevada, Las Vegas, NV, USA
| | - Bibek Bhattarai
- Department of Computer Science, University of Nevada, Las Vegas, NV, USA
| | - Mira V Han
- Nevada Institute of Personalized Medicine, University of Nevada, Las Vegas, 4505 Maryland Parkway, Las Vegas, NV, 89154-4009, USA
- School of Life Sciences, University of Nevada, Las Vegas, NV, USA
| |
Collapse
|
43
|
Wang XX, Zhang Y, Fan BF. Predicting Postherpetic Neuralgia in Patients with Herpes Zoster by Machine Learning: A Retrospective Study. Pain Ther 2020; 9:627-635. [PMID: 32915399 PMCID: PMC7648805 DOI: 10.1007/s40122-020-00196-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Indexed: 11/26/2022] Open
Abstract
Introduction Postherpetic neuralgia (PHN) is a neuropathic pain secondary to shingles. Studies have shown that early pain intervention can reduce the incidence or intensity of PHN. The aim of this study was to predict whether a patient with acute herpetic neuralgia will develop PHN and to help clinicians make better decisions. Method Five hundred two patients with shingles were reviewed and classified according to whether they had PHN. The risk factors associated with PHN were determined by univariate analysis. Logistic regression and random forest algorithms were used to do machine learning, and then the prediction accuracies of the two algorithms were compared, choosing the superior one to predict the next 60 new cases. Results Age, NRS score, rash site, Charlson comorbidity index (CCI) score, antiviral therapy and immunosuppression were found related to the occurrence of PHN. The NRS score was the most closely related factor with an importance of 0.31. As for accuracy, the random forest was 96.24%, better than that of logistic regression in which the accuracy was 92.83%. Then, the random forest model was used to predict 60 newly diagnosed patients with herpes zoster, and the accuracy rate was 88.33% with a 95% confidence interval (CI) of 77.43–95.18%. Conclusions This study provides an idea and a method in which, by analyzing the data of previous cases, we can develop a predictive model to predict whether patients with shingles will develop PHN.
Collapse
Affiliation(s)
- Xin-Xing Wang
- Graduate School of Beijing University of Chinese Medicine, Beijing, 100029, People's Republic of China
- Department of Pain, China-Japan Friendship Hospital, Beijing, 100029, People's Republic of China
| | - Yi Zhang
- Department of Pain, China-Japan Friendship Hospital, Beijing, 100029, People's Republic of China
| | - Bi-Fa Fan
- Department of Pain, China-Japan Friendship Hospital, Beijing, 100029, People's Republic of China.
| |
Collapse
|
44
|
Establishment and Analysis of a Combined Diagnostic Model of Polycystic Ovary Syndrome with Random Forest and Artificial Neural Network. BIOMED RESEARCH INTERNATIONAL 2020; 2020:2613091. [PMID: 32884937 PMCID: PMC7455828 DOI: 10.1155/2020/2613091] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 07/27/2020] [Accepted: 08/03/2020] [Indexed: 12/14/2022]
Abstract
Polycystic ovary syndrome (PCOS) is one of the most common metabolic and reproductive endocrinopathies. However, few studies have tried to develop a diagnostic model based on gene biomarkers. In this study, we applied a computational method by combining two machine learning algorithms, including random forest (RF) and artificial neural network (ANN), to identify gene biomarkers and construct diagnostic model. We collected gene expression data from Gene Expression Omnibus (GEO) database containing 76 PCOS samples and 57 normal samples; five datasets were utilized, including one dataset for screening differentially expressed genes (DEGs), two training datasets, and two validation datasets. Firstly, based on RF, 12 key genes in 264 DEGs were identified to be vital for classification of PCOS and normal samples. Moreover, the weights of these key genes were calculated using ANN with microarray and RNA-seq training dataset, respectively. Furthermore, the diagnostic models for two types of datasets were developed and named neuralPCOS. Finally, two validation datasets were used to test and compare the performance of neuralPCOS with other two set of marker genes by area under curve (AUC). Our model achieved an AUC of 0.7273 in microarray dataset, and 0.6488 in RNA-seq dataset. To conclude, we uncovered gene biomarkers and developed a novel diagnostic model of PCOS, which would be helpful for diagnosis.
Collapse
|
45
|
Talukder A, Ahammed B. Machine learning algorithms for predicting malnutrition among under-five children in Bangladesh. Nutrition 2020; 78:110861. [PMID: 32592978 DOI: 10.1016/j.nut.2020.110861] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 04/16/2020] [Accepted: 04/20/2020] [Indexed: 12/01/2022]
Abstract
OBJECTIVE The aim of this study was is to predict malnutrition status in under-five children in Bangladesh by using various machine learning (ML) algorithms. METHODS For analysis purposes, the nationally representative secondary records from the 2014 Bangladesh Demographic and Health Survey (BDHS) were used. Five well-known ML algorithms such as linear discriminant analysis (LDA), k-nearest neighbors (k-NN), support vector machines (SVM), random forest (RF), and logistic regression (LR) have been considered to accurately predict malnutrition status among children. Additionally, a systematic assessment of the algorithms was performed by using accuracy, sensitivity, specificity, and Cohen's κ statistic. RESULTS Based on various performance parameters, the best results were accomplished with the RF algorithm, which demonstrated an accuracy of 68.51%, a sensitivity of 94.66%, and a specificity of 69.76%. Additionally, a most extreme discriminative ability appeared by RF classification (Cohen's κ = 0.2434). CONCLUSION On the basis of the findings, we can presume that the RF algorithm was moderately superior to any other ML algorithms used in this study to predict malnutrition status among under-five children in Bangladesh. Finally, the present research recommends applying RF classification with RF feature selection when the prediction of malnutrition is the core interest.
Collapse
Affiliation(s)
- Ashis Talukder
- Statistics Discipline, Khulna University, Khulna, Bangladesh.
| | - Benojir Ahammed
- Statistics Discipline, Khulna University, Khulna, Bangladesh
| |
Collapse
|
46
|
Oh B, Yun JY, Yeo EC, Kim DH, Kim J, Cho BJ. Prediction of Suicidal Ideation among Korean Adults Using Machine Learning: A Cross-Sectional Study. Psychiatry Investig 2020; 17:331-340. [PMID: 32213803 PMCID: PMC7176567 DOI: 10.30773/pi.2019.0270] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Accepted: 02/07/2020] [Indexed: 12/30/2022] Open
Abstract
OBJECTIVE Suicidal ideation (SI) precedes actual suicidal event. Thus, it is important for the prevention of suicide to screen the individuals with SI. This study aimed to identify the factors associated with SI and to build prediction models in Korean adults using machine learning methods. METHODS The 2010-2013 dataset of the Korea National Health and Nutritional Examination Survey was used as the training dataset (n=16,437), and the subset collected in 2015 was used as the testing dataset (n=3,788). Various machine learning algorithms were applied and compared to the conventional logistic regression (LR)-based model. RESULTS Common risk factors for SI included stress awareness, experience of continuous depressive mood, EQ-5D score, depressive disorder, household income, educational status, alcohol abuse, and unmet medical service needs. The prediction performances of the machine learning models, as measured by the area under receiver-operating curve, ranged from 0.794 to 0.877, some of which were better than that of the conventional LR model (0.867). The Bayesian network, LogitBoost with LR, and ANN models outperformed the conventional LR model. CONCLUSION A machine learning-based approach could provide better SI prediction performance compared to a conventional LR-based model. These may help primary care physicians to identify patients at risk of SI and will facilitate the early prevention of suicide.
Collapse
Affiliation(s)
- Bumjo Oh
- Department of Family Medicine, SMG-SNU Boramae Medical Center, Seoul, Republic of Korea
| | - Je-Yeon Yun
- Seoul National University Hospital, Seoul, Republic of Korea.,Yeongeon Student Support Center, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Eun Chong Yeo
- School of Software, Hallym University, Chuncheon, Republic of Korea
| | - Dong-Hoi Kim
- School of Software, Hallym University, Chuncheon, Republic of Korea
| | - Jin Kim
- School of Software, Hallym University, Chuncheon, Republic of Korea
| | - Bum-Joo Cho
- Department of Ophthalmology, Hallym University Sacred Heart Hospital, Hallym University College of Medicine, Anyang, Republic of Korea.,Medical Artificial Intelligence Center, Hallym University Medical Center, Anyang, Republic of Korea.,Institute of New Frontier Research, Hallym University College of Medicine, Chuncheon, Republic of Korea
| |
Collapse
|
47
|
Exploring the Important Attributes of Human Immunodeficiency Virus and Generating Decision Rules. Symmetry (Basel) 2020. [DOI: 10.3390/sym12010067] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Acquired Immunodeficiency Syndrome (AIDS) is the most severe phase of Human Immunodeficiency Virus (HIV) infection. Living with HIV results in a weakened immune system, with AIDS being the final stage of HIV and puzzling the world. The current medical environment remains unable to effectively cure AIDS, with treatment depending on long-term antiretroviral therapy (ART). To effectively treat and prevent HIV, it is important to elucidate the key factors of HIV propagation. This study proposes a rough set classifier based on adding recency (R) (i.e., the last physician visit), frequency (F) (i.e., the frequency of medical visits), and monetary (M) (i.e., medication adherence) attributes and integrated attribute selection methods to generate discriminatory rules and find the core attributes of HIV. The collected data consist of 1308 HIV infection records from Taiwan. From the experimental results, the frequency of CD4+ cells in the peripheral blood is able to determine patient medication, treatment willingness, and HIV infection stages, because HIV patients are less likely to be willing to receive long-term ART. Furthermore, drug abuse is found to be the greatest cause of HIV infection. These results show that the additional RFM attributes can improve classification accuracy, with the core attributes being M, R, plasma viral load (PVL) and age. Hence, we suggest that clinical physicians use these core attributes to understand the HIV infection stages.
Collapse
|
48
|
Predicting Hepatitis B Virus Infection Based on Health Examination Data of Community Population. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2019; 16:ijerph16234842. [PMID: 31810204 PMCID: PMC6926879 DOI: 10.3390/ijerph16234842] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Revised: 11/26/2019] [Accepted: 11/27/2019] [Indexed: 01/14/2023]
Abstract
Despite a decline in the prevalence of hepatitis B in China, the disease burden remains high. Large populations unaware of infection risk often fail to meet the ideal treatment window, resulting in poor prognosis. The purpose of this study was to develop and evaluate models identifying high-risk populations who should be tested for hepatitis B surface antigen. Data came from a large community-based health screening, including 97,173 individuals, with an average age of 54.94. A total of 33 indicators were collected as model predictors, including demographic characteristics, routine blood indicators, and liver function. Borderline-Synthetic minority oversampling technique (SMOTE) was conducted to preprocess the data and then four predictive models, namely, the extreme gradient boosting (XGBoost), random forest (RF), decision tree (DT), and logistic regression (LR) algorithms, were developed. The positive rate of hepatitis B surface antigen (HBsAg) was 8.27%. The area under the receiver operating characteristic curves for XGBoost, RF, DT, and LR models were 0.779, 0.752, 0.619, and 0.742, respectively. The Borderline-SMOTE XGBoost combined model outperformed the other models, which correctly predicted 13,637/19,435 cases (sensitivity 70.8%, specificity 70.1%), and the variable importance plot of XGBoost model indicated that age was of high importance. The prediction model can be used to accurately identify populations at high risk of hepatitis B infection that should adopt timely appropriate medical treatment measures.
Collapse
|
49
|
Reismann J, Romualdi A, Kiss N, Minderjahn MI, Kallarackal J, Schad M, Reismann M. Diagnosis and classification of pediatric acute appendicitis by artificial intelligence methods: An investigator-independent approach. PLoS One 2019; 14:e0222030. [PMID: 31553729 PMCID: PMC6760759 DOI: 10.1371/journal.pone.0222030] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2019] [Accepted: 08/20/2019] [Indexed: 12/14/2022] Open
Abstract
Acute appendicitis is one of the major causes for emergency surgery in childhood and adolescence. Appendectomy is still the therapy of choice, but conservative strategies are increasingly being studied for uncomplicated inflammation. Diagnosis of acute appendicitis remains challenging, especially due to the frequently unspecific clinical picture. Inflammatory blood markers and imaging methods like ultrasound are limited as they have to be interpreted by experts and still do not offer sufficient diagnostic certainty. This study presents a method for automatic diagnosis of appendicitis as well as the differentiation between complicated and uncomplicated inflammation using values/parameters which are routinely and unbiasedly obtained for each patient with suspected appendicitis. We analyzed full blood counts, c-reactive protein (CRP) and appendiceal diameters in ultrasound investigations corresponding to children and adolescents aged 0–17 years from a hospital based population in Berlin, Germany. A total of 590 patients (473 patients with appendicitis in histopathology and 117 with negative histopathological findings) were analyzed retrospectively with modern algorithms from machine learning (ML) and artificial intelligence (AI). The discovery of informative parameters (biomarker signatures) and training of the classification model were done with a maximum of 35% of the patients. The remaining minimum 65% of patients were used for validation. At clinical relevant cut-off points the accuracy of the biomarker signature for diagnosis of appendicitis was 90% (93% sensitivity, 67% specificity), while the accuracy to correctly identify complicated inflammation was 51% (95% sensitivity, 33% specificity) on validation data. Such a test would be capable to prevent two out of three patients without appendicitis from useless surgery as well as one out of three patients with uncomplicated appendicitis. The presented method has the potential to change today’s therapeutic approach for appendicitis and demonstrates the capability of algorithms from AI and ML to significantly improve diagnostics even based on routine diagnostic parameters.
Collapse
Affiliation(s)
- Josephine Reismann
- Department of Pediatric Surgery, Charité –Universitätsmedizin Berlin, Augustenburger Platz, Berlin, Germany
| | | | - Natalie Kiss
- Department of Pediatric Surgery, Charité –Universitätsmedizin Berlin, Augustenburger Platz, Berlin, Germany
| | - Maximiliane I. Minderjahn
- Department of Pediatric Surgery, Charité –Universitätsmedizin Berlin, Augustenburger Platz, Berlin, Germany
| | | | | | - Marc Reismann
- Department of Pediatric Surgery, Charité –Universitätsmedizin Berlin, Augustenburger Platz, Berlin, Germany
- * E-mail:
| |
Collapse
|
50
|
Quantitative Analysis of Uncertainty in Medical Reporting: Creating a Standardized and Objective Methodology. J Digit Imaging 2019; 31:145-149. [PMID: 29274047 DOI: 10.1007/s10278-017-0041-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Uncertainty in text-based medical reports has long been recognized as problematic, frequently resulting in misunderstanding and miscommunication. One strategy for addressing the negative clinical ramifications of report uncertainty would be the creation of a standardized methodology for characterizing and quantifying uncertainty language, which could provide both the report author and reader with context related to the perceived level of diagnostic confidence and accuracy. A number of computerized strategies could be employed in the creation of this analysis including string search, natural language processing and understanding, histogram analysis, topic modeling, and machine learning. The derived uncertainty data offers the potential to objectively analyze report uncertainty in real time and correlate with outcomes analysis for the purpose of context and user-specific decision support at the point of care, where intervention would have the greatest clinical impact.
Collapse
|