1
|
Belladelli F, De Cobelli F, Piccolo C, Cei F, Re C, Musso G, Rosiello G, Cignoli D, Santangelo A, Fallara G, Matloob R, Bertini R, Gusmini S, Brembilla G, Lucianò R, Tenace N, Salonia A, Briganti A, Montorsi F, Larcher A, Capitanio U. A machine learning-based analysis for the definition of an optimal renal biopsy for kidney cancer. Urol Oncol 2024:S1078-1439(24)00700-2. [PMID: 39516081 DOI: 10.1016/j.urolonc.2024.10.020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Revised: 10/07/2024] [Accepted: 10/16/2024] [Indexed: 11/16/2024]
Abstract
OBJECTIVE Renal Tumor biopsy (RTB) can assist clinicians in determining the most suitable approach for treatment of renal cancer. However, RTB's limitations in accurately determining histology and grading have hindered its broader adoption and data on the concordance rate between RTB results and final pathology after surgery are unavailable. Therefore, we aimed to develop a machine learning algorithm to optimize RTB technique and to investigate the degree of concordance between RTB and surgical pathology reports. MATERIALS AND METHODS Within a prospectively maintained database, patients with indeterminate renal masses who underwent RTB at a single tertiary center were identified. We recorded and analyzed the approach (US vs. CT), the number of biopsy cores (NoC), and total core tissue length (LoC) to evaluate their impact on diagnostic outcomes. The K-Nearest Neighbors (KNN), a non-parametric supervised machine learning model, predicted the probability of obtaining pathological characterization and grading. In surgical patients, final pathology reports were compared with RTB results. RESULTS Overall, 197 patients underwent RTB. Overall, 89.8% (n=177) and 44.7% (n=88) of biopsies were informative in terms of histology and grading, respectively. The discrepancy rate between the pathology results from renal tissue biopsy (RTB) and the final pathology report following surgery was 3.6% (n=7) for histology and 5.0% (n=10) for grading. According to the machine learning model, a minimum of 2 cores providing at least 0.8 cm of total tissue should be obtained to achieve the best accuracy in characterizing the cancer. Alternatively, in cases of RTB with more than two cores, no specific minimum tissue threshold is required. CONCLUSIONS The discordance rates between RTB pathology and final surgical pathology are notably minimal. We defined an optimal renal biopsy strategy based on at least 2 cores and at least 0.8 cm of tissue or at least 3 cores and no minimum tissue threshold. PATIENTS SUMMARY RTB is a useful test for kidney cancer, but it's not always perfect. Our study shows that it usually matches up well with what doctors find during surgery. Using machine learning can make RTB even better by helping doctors know how many samples to take. This helps doctors treat kidney cancer more accurately.
Collapse
Affiliation(s)
- F Belladelli
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - F De Cobelli
- Department of Radiology, IRCCS San Raffaele Hospital, Milan, Italy
| | - C Piccolo
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - F Cei
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - C Re
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - G Musso
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - G Rosiello
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - D Cignoli
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - A Santangelo
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - G Fallara
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - R Matloob
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - R Bertini
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - S Gusmini
- Department of Radiology, IRCCS San Raffaele Hospital, Milan, Italy
| | - G Brembilla
- Department of Radiology, IRCCS San Raffaele Hospital, Milan, Italy
| | - R Lucianò
- Department of Pathology, IRCCS San Raffaele Hospital, Milan, Italy
| | - N Tenace
- Department of Pathology, IRCCS San Raffaele Hospital, Milan, Italy
| | - A Salonia
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - A Briganti
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - F Montorsi
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - A Larcher
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy
| | - U Capitanio
- URI - Urological Research Institute, Department of Urology, Division of Experimental Oncology, IRCCS San Raffaele Hospital, Milan, Italy; Department of Urology, IRCCS San Raffaele Hospital, Milan, Italy.
| |
Collapse
|
2
|
Duan M, Zhao X, Li S, Miao G, Bai L, Zhang Q, Yang W, Zhao X. Metabolic score for insulin resistance (METS-IR) predicts all-cause and cardiovascular mortality in the general population: evidence from NHANES 2001-2018. Cardiovasc Diabetol 2024; 23:243. [PMID: 38987779 PMCID: PMC11238348 DOI: 10.1186/s12933-024-02334-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Accepted: 06/25/2024] [Indexed: 07/12/2024] Open
Abstract
BACKGROUND The prevalence of obesity-associated insulin resistance (IR) is increasing along with the increase in obesity rates. In this study, we compared the predictive utility of four alternative indexes of IR [triglyceride glucose index (TyG index), metabolic score for insulin resistance (METS-IR), the triglyceride/high-density lipoprotein cholesterol (TG/HDL-C) ratio and homeostatic model assessment of insulin resistance (HOMA-IR)] for all-cause mortality and cardiovascular mortality in the general population based on key variables screened by the Boruta algorithm. The aim was to find the best replacement index of IR. METHODS In this study, 14,653 participants were screened from the National Health and Nutrition Examination Survey (2001-2018). And TyG index, METS-IR, TG/HDL-C and HOMA-IR were calculated separately for each participant according to the given formula. The predictive values of IR replacement indexes for all-cause mortality and cardiovascular mortality in the general population were assessed. RESULTS Over a median follow-up period of 116 months, a total of 2085 (10.23%) all-cause deaths and 549 (2.61%) cardiovascular disease (CVD) related deaths were recorded. Multivariate Cox regression and restricted cubic splines analysis showed that among the four indexes, only METS-IR was significantly associated with both all-cause and CVD mortality, and both showed non-linear associations with an approximate "U-shape". Specifically, baseline METS-IR lower than the inflection point (41.33) was negatively associated with mortality [hazard ratio (HR) 0.972, 95% CI 0.950-0.997 for all-cause mortality]. In contrast, baseline METS-IR higher than the inflection point (41.33) was positively associated with mortality (HR 1.019, 95% CI 1.011-1.026 for all-cause mortality and HR 1.028, 95% CI 1.014-1.043 for CVD mortality). We further stratified the METS-IR and showed that significant associations between METS-IR levels and all-cause and cardiovascular mortality were predominantly present in the nonelderly population aged < 65 years. CONCLUSIONS In conjunction with the results of the Boruta algorithm, METS-IR demonstrated a more significant association with all-cause and cardiovascular mortality in the U.S. population compared to the other three alternative IR indexes (TyG index, TG/HDL-C and HOMA-IR), particularly evident in individuals under 65 years old.
Collapse
Affiliation(s)
- Mingxuan Duan
- Henan Key Laboratory of Hereditary Cardiovascular Diseases, Department of Cardiology, Cardiovascular Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Xi Zhao
- Henan Key Laboratory of Hereditary Cardiovascular Diseases, Department of Cardiology, Cardiovascular Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Shaolin Li
- Henan Key Laboratory of Hereditary Cardiovascular Diseases, Department of Cardiology, Cardiovascular Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Guangrui Miao
- Henan Key Laboratory of Hereditary Cardiovascular Diseases, Department of Cardiology, Cardiovascular Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Linpeng Bai
- Henan Key Laboratory of Hereditary Cardiovascular Diseases, Department of Cardiology, Cardiovascular Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Qingyang Zhang
- Henan Key Laboratory of Hereditary Cardiovascular Diseases, Department of Cardiology, Cardiovascular Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Wenxuan Yang
- Henan Key Laboratory of Hereditary Cardiovascular Diseases, Department of Cardiology, Cardiovascular Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Xiaoyan Zhao
- Henan Key Laboratory of Hereditary Cardiovascular Diseases, Department of Cardiology, Cardiovascular Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, Henan, China.
| |
Collapse
|
3
|
Park S, Moon J, Eun H, Hong JH, Lee K. Artificial Intelligence-Based Diagnostic Support System for Patent Ductus Arteriosus in Premature Infants. J Clin Med 2024; 13:2089. [PMID: 38610854 PMCID: PMC11012712 DOI: 10.3390/jcm13072089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 04/02/2024] [Accepted: 04/02/2024] [Indexed: 04/14/2024] Open
Abstract
Background: Patent ductus arteriosus (PDA) is a prevalent congenital heart defect in premature infants, associated with significant morbidity and mortality. Accurate and timely diagnosis of PDA is crucial, given the vulnerability of this population. Methods: We introduce an artificial intelligence (AI)-based PDA diagnostic support system designed to assist medical professionals in diagnosing PDA in premature infants. This study utilized electronic health record (EHR) data from 409 premature infants spanning a decade at Severance Children's Hospital. Our system integrates a data viewer, data analyzer, and AI-based diagnosis supporter, facilitating comprehensive data presentation, analysis, and early symptom detection. Results: The system's performance was evaluated through diagnostic tests involving medical professionals. This early detection model achieved an accuracy rate of up to 84%, enabling detection up to 3.3 days in advance. In diagnostic tests, medical professionals using the system with the AI-based diagnosis supporter outperformed those using the system without the supporter. Conclusions: Our AI-based PDA diagnostic support system offers a comprehensive solution for medical professionals to accurately diagnose PDA in a timely manner in premature infants. The collaborative integration of medical expertise and technological innovation demonstrated in this study underscores the potential of AI-driven tools in advancing neonatal diagnosis and care.
Collapse
Affiliation(s)
- Seoyeon Park
- Department of Computer Science, Yonsei University, 50 Yonsei-ro, Seoul 03722, Republic of Korea; (S.P.); (K.L.)
| | - Junhyung Moon
- Department of Computer Science, Yonsei University, 50 Yonsei-ro, Seoul 03722, Republic of Korea; (S.P.); (K.L.)
| | - Hoseon Eun
- Department of Pediatrics, Yonsei University College of Medicine, 50-1 Yonsei-ro, Seoul 03722, Republic of Korea;
| | - Jin-Hyuk Hong
- School of Integrated Technology, Gwangju Institute of Science and Technology, 123 Cheomdangwagi-ro, Gwangju 61005, Republic of Korea;
| | - Kyoungwoo Lee
- Department of Computer Science, Yonsei University, 50 Yonsei-ro, Seoul 03722, Republic of Korea; (S.P.); (K.L.)
| |
Collapse
|
4
|
Kim J, Villarreal M, Arya S, Hernandez A, Moreira A. Bridging the Gap: Exploring Bronchopulmonary Dysplasia through the Lens of Biomedical Informatics. J Clin Med 2024; 13:1077. [PMID: 38398389 PMCID: PMC10889493 DOI: 10.3390/jcm13041077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2023] [Revised: 02/07/2024] [Accepted: 02/12/2024] [Indexed: 02/25/2024] Open
Abstract
Bronchopulmonary dysplasia (BPD), a chronic lung disease predominantly affecting premature infants, poses substantial clinical challenges. This review delves into the promise of biomedical informatics (BMI) in reshaping BPD research and care. We commence by highlighting the escalating prevalence and healthcare impact of BPD, emphasizing the necessity for innovative strategies to comprehend its intricate nature. To this end, we introduce BMI as a potent toolset adept at managing and analyzing extensive, diverse biomedical data. The challenges intrinsic to BPD research are addressed, underscoring the inadequacies of conventional approaches and the compelling need for data-driven solutions. We subsequently explore how BMI can revolutionize BPD research, encompassing genomics and personalized medicine to reveal potential biomarkers and individualized treatment strategies. Predictive analytics emerges as a pivotal facet of BMI, enabling early diagnosis and risk assessment for timely interventions. Moreover, we examine how mobile health technologies facilitate real-time monitoring and enhance patient engagement, ultimately refining BPD management. Ethical and legal considerations surrounding BMI implementation in BPD research are discussed, accentuating issues of privacy, data security, and informed consent. In summation, this review highlights BMI's transformative potential in advancing BPD research, addressing challenges, and opening avenues for personalized medicine and predictive analytics.
Collapse
Affiliation(s)
- Jennifer Kim
- Division of Neonatology, Department of Pediatrics, University of Texas Health San Antonio, San Antonio, TX 78229, USA; (J.K.); (M.V.); (A.H.)
| | - Mariela Villarreal
- Division of Neonatology, Department of Pediatrics, University of Texas Health San Antonio, San Antonio, TX 78229, USA; (J.K.); (M.V.); (A.H.)
| | - Shreyas Arya
- Division of Neonatal-Perinatal Medicine, Dayton Children’s Hospital, Dayton, OH 45404, USA
| | - Antonio Hernandez
- Division of Neonatology, Department of Pediatrics, University of Texas Health San Antonio, San Antonio, TX 78229, USA; (J.K.); (M.V.); (A.H.)
| | - Alvaro Moreira
- Division of Neonatology, Department of Pediatrics, University of Texas Health San Antonio, San Antonio, TX 78229, USA; (J.K.); (M.V.); (A.H.)
| |
Collapse
|
5
|
Wang X, Ren J, Ren H, Song W, Qiao Y, Zhao Y, Linghu L, Cui Y, Zhao Z, Chen L, Qiu L. Diabetes mellitus early warning and factor analysis using ensemble Bayesian networks with SMOTE-ENN and Boruta. Sci Rep 2023; 13:12718. [PMID: 37543637 PMCID: PMC10404250 DOI: 10.1038/s41598-023-40036-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 08/03/2023] [Indexed: 08/07/2023] Open
Abstract
Diabetes mellitus (DM) has become the third chronic non-infectious disease affecting patients after tumor, cardiovascular and cerebrovascular diseases, becoming one of the major public health issues worldwide. Detection of early warning risk factors for DM is key to the prevention of DM, which has been the focus of some previous studies. Therefore, from the perspective of residents' self-management and prevention, this study constructed Bayesian networks (BNs) combining feature screening and multiple resampling techniques for DM monitoring data with a class imbalance in Shanxi Province, China, to detect risk factors in chronic disease monitoring programs and predict the risk of DM. First, univariate analysis and Boruta feature selection algorithm were employed to conduct the preliminary screening of all included risk factors. Then, three resampling techniques, SMOTE, Borderline-SMOTE (BL-SMOTE) and SMOTE-ENN, were adopted to deal with data imbalance. Finally, BNs developed by three algorithms (Tabu, Hill-climbing and MMHC) were constructed using the processed data to find the warning factors that strongly correlate with DM. The results showed that the accuracy of DM classification is significantly improved by the BNs constructed by processed data. In particular, the BNs combined with the SMOTE-ENN resampling improved the most, and the BNs constructed by the Tabu algorithm obtained the best classification performance compared with the hill-climbing and MMHC algorithms. The best-performing joint Boruta-SMOTE-ENN-Tabu model showed that the risk factors of DM included family history, age, central obesity, hyperlipidemia, salt reduction, occupation, heart rate, and BMI.
Collapse
Affiliation(s)
- Xuchun Wang
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi, China
| | - Jiahui Ren
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi, China
| | - Hao Ren
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi, China
| | - Wenzhu Song
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi, China
| | - Yuchao Qiao
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi, China
| | - Ying Zhao
- Shanxi Centre for Disease Control and Prevention, Taiyuan, 030012, Shanxi, China
| | - Liqin Linghu
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi, China
- Shanxi Centre for Disease Control and Prevention, Taiyuan, 030012, Shanxi, China
| | - Yu Cui
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi, China
| | - Zhiyang Zhao
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi, China
| | - Limin Chen
- Shanxi Provincial People's Hospital, Taiyuan, Shanxi, China.
| | - Lixia Qiu
- Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, Shanxi, China.
| |
Collapse
|
6
|
Yan FJ, Chen XH, Quan XQ, Wang LL, Wei XY, Zhu JL. Development and validation of an interpretable machine learning model-Predicting mild cognitive impairment in a high-risk stroke population. Front Aging Neurosci 2023; 15:1180351. [PMID: 37396650 PMCID: PMC10308219 DOI: 10.3389/fnagi.2023.1180351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 05/30/2023] [Indexed: 07/04/2023] Open
Abstract
Background Mild cognitive impairment (MCI) is considered a preclinical stage of Alzheimer's disease (AD). People with MCI have a higher risk of developing dementia than healthy people. As one of the risk factors for MCI, stroke has been actively treated and intervened. Therefore, selecting the high-risk population of stroke as the research object and discovering the risk factors of MCI as early as possible can prevent the occurrence of MCI more effectively. Methods The Boruta algorithm was used to screen variables, and eight machine learning models were established and evaluated. The best performing models were used to assess variable importance and build an online risk calculator. Shapley additive explanation is used to explain the model. Results A total of 199 patients were included in the study, 99 of whom were male. Transient ischemic attack (TIA), homocysteine, education, hematocrit (HCT), diabetes, hemoglobin, red blood cells (RBC), hypertension, prothrombin time (PT) were selected by Boruta algorithm. Logistic regression (AUC = 0.8595) was the best model for predicting MCI in high-risk groups of stroke, followed by elastic network (ENET) (AUC = 0.8312), multilayer perceptron (MLP) (AUC = 0.7908), extreme gradient boosting (XGBoost) (AUC = 0.7691), and support vector machine (SVM) (AUC = 0.7527), random forest (RF) (AUC = 0.7451), K-nearest neighbors (KNN) (AUC = 0.7380), decision tree (DT) (AUC = 0.6972). The importance of variables suggests that TIA, diabetes, education, and hypertension are the top four variables of importance. Conclusion Transient ischemic attack (TIA), diabetes, education, and hypertension are the most important risk factors for MCI in high-risk groups of stroke, and early intervention should be performed to reduce the occurrence of MCI.
Collapse
Affiliation(s)
- Feng-Juan Yan
- Department of Geriatrics, Shenzhen Longhua District Central Hospital, Shenzhen, Guangdong, China
| | - Xie-Hui Chen
- Department of Geriatrics, Shenzhen Longhua District Central Hospital, Shenzhen, Guangdong, China
| | - Xiao-Qing Quan
- Department of Geriatrics, Shenzhen Longhua District Central Hospital, Shenzhen, Guangdong, China
| | - Li-Li Wang
- Department of Cardiology, Affiliated Hospital of Shandong University of Traditional Chinese Medicine, Jinan, Shandong, China
| | - Xin-Yi Wei
- Department of Cardiology, The Third Hospital of Jinan, Jinan, Shandong, China
| | - Jia-Liang Zhu
- The First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, China
| |
Collapse
|
7
|
Xiao T, Dong X, Lu Y, Zhou W. High-Resolution and Multidimensional Phenotypes Can Complement Genomics Data to Diagnose Diseases in the Neonatal Population. PHENOMICS (CHAM, SWITZERLAND) 2023; 3:204-215. [PMID: 37197647 PMCID: PMC10110825 DOI: 10.1007/s43657-022-00071-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 08/03/2022] [Accepted: 08/08/2022] [Indexed: 05/19/2023]
Abstract
Advances in genomic medicine have greatly improved our understanding of human diseases. However, phenome is not well understood. High-resolution and multidimensional phenotypes have shed light on the mechanisms underlying neonatal diseases in greater details and have the potential to optimize clinical strategies. In this review, we first highlight the value of analyzing traditional phenotypes using a data science approach in the neonatal population. We then discuss recent research on high-resolution, multidimensional, and structured phenotypes in neonatal critical diseases. Finally, we briefly introduce current technologies available for the analysis of multidimensional data and the value that can be provided by integrating these data into clinical practice. In summary, a time series of multidimensional phenome can improve our understanding of disease mechanisms and diagnostic decision-making, stratify patients, and provide clinicians with optimized strategies for therapeutic intervention; however, the available technologies for collecting multidimensional data and the best platform for connecting multiple modalities should be considered.
Collapse
Affiliation(s)
- Tiantian Xiao
- Division of Neonatology, Children’s Hospital of Fudan University, National Children’s Medical Center, 399 Wanyuan Road, Shanghai, 201102 China
- Department of Neonatology, Chengdu Women’s and Children’s Central Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610000 China
| | - Xinran Dong
- Center for Molecular Medicine, Pediatric Research Institute, Children’s Hospital of Fudan University, National Children’s Medical Center, Shanghai, 201102 China
| | - Yulan Lu
- Center for Molecular Medicine, Pediatric Research Institute, Children’s Hospital of Fudan University, National Children’s Medical Center, Shanghai, 201102 China
| | - Wenhao Zhou
- Division of Neonatology, Children’s Hospital of Fudan University, National Children’s Medical Center, 399 Wanyuan Road, Shanghai, 201102 China
- Center for Molecular Medicine, Pediatric Research Institute, Children’s Hospital of Fudan University, National Children’s Medical Center, Shanghai, 201102 China
| |
Collapse
|
8
|
Data harnessing to nurture the human mind for a tailored approach to the child. Pediatr Res 2023; 93:357-365. [PMID: 36180585 DOI: 10.1038/s41390-022-02320-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 07/06/2022] [Accepted: 09/12/2022] [Indexed: 11/08/2022]
Abstract
Big data in pediatrics is an ocean of structured and unstructured data. Big data analysis helps to dive into the ocean of data to filter out information that can guide pediatricians in their decision making, precision diagnosis, and targeted therapy. In addition, big data and its analysis have helped in the surveillance, prevention, and performance of the health system. There has been a considerable amount of work in pediatrics that we have tried to highlight in this review and some of it has been already incorporated into the health system. Work in specialties of pediatrics is still forthcoming with the creation of a common data model and amalgamation of the huge "omics" database. The physicians entrusted with the care of children must be aware of the outcome so that they can play a role to ensure that big data algorithms have a clinically relevant effect in improving the health of their patients. They will apply the outcome of big data and its analysis in patient care through clinical algorithms or with the help of embedded clinical support alerts from the electronic medical records. IMPACT: Big data in pediatrics include structured, unstructured data, waveform data, biological, and social data. Big data analytics has unraveled significant information from these databases. This is changing how pediatricians will look at the body of available evidence and translate it into their clinical practice. Data harnessed so far is implemented in certain fields while in others it is in the process of development to become a clinical adjunct to the physician. Common databases are being prepared for future work. Diagnostic and prediction models when incorporated into the health system will guide the pediatrician to a targeted approach to diagnosis and therapy.
Collapse
|
9
|
Li W, Zeng L, Yuan S, Shang Y, Zhuang W, Chen Z, Lyu J. Machine learning for the prediction of cognitive impairment in older adults. Front Neurosci 2023; 17:1158141. [PMID: 37179565 PMCID: PMC10172509 DOI: 10.3389/fnins.2023.1158141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Accepted: 04/10/2023] [Indexed: 05/15/2023] Open
Abstract
Objective The purpose of this study was to develop and validate a predictive model of cognitive impairment in older adults based on a novel machine learning (ML) algorithm. Methods The complete data of 2,226 participants aged 60-80 years were extracted from the 2011-2014 National Health and Nutrition Examination Survey database. Cognitive abilities were assessed using a composite cognitive functioning score (Z-score) calculated using a correlation test among the Consortium to Establish a Registry for Alzheimer's Disease Word Learning and Delayed Recall tests, Animal Fluency Test, and the Digit Symbol Substitution Test. Thirteen demographic characteristics and risk factors associated with cognitive impairment were considered: age, sex, race, body mass index (BMI), drink, smoke, direct HDL-cholesterol level, stroke history, dietary inflammatory index (DII), glycated hemoglobin (HbA1c), Patient Health Questionnaire-9 (PHQ-9) score, sleep duration, and albumin level. Feature selection is performed using the Boruta algorithm. Model building is performed using ten-fold cross-validation, machine learning (ML) algorithms such as generalized linear model (GLM), random forest (RF), support vector machine (SVM), artificial neural network (ANN), and stochastic gradient boosting (SGB). The performance of these models was evaluated in terms of discriminatory power and clinical application. Results The study ultimately included 2,226 older adults for analysis, of whom 384 (17.25%) had cognitive impairment. After random assignment, 1,559 and 667 older adults were included in the training and test sets, respectively. A total of 10 variables such as age, race, BMI, direct HDL-cholesterol level, stroke history, DII, HbA1c, PHQ-9 score, sleep duration, and albumin level were selected to construct the model. GLM, RF, SVM, ANN, and SGB were established to obtain the area under the working characteristic curve of the test set subjects 0.779, 0.754, 0.726, 0.776, and 0.754. Among all models, the GLM model had the best predictive performance in terms of discriminatory power and clinical application. Conclusions ML models can be a reliable tool to predict the occurrence of cognitive impairment in older adults. This study used machine learning methods to develop and validate a well performing risk prediction model for the development of cognitive impairment in the elderly.
Collapse
Affiliation(s)
- Wanyue Li
- Department of Rehabilitation, The First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, China
| | - Li Zeng
- The Second Clinical Medical College of Guizhou University of Traditional Chinese Medicine, Guiyang, Guizhou, China
| | - Shiqi Yuan
- Department of Neurology, The First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, China
| | - Yaru Shang
- Department of Rehabilitation, The First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, China
| | - Weisheng Zhuang
- Department of Rehabilitation, Henan Provincial People's Hospital, People's Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Zhuoming Chen
- Department of Rehabilitation, The First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, China
- Zhuoming Chen
| | - Jun Lyu
- Department of Clinical Research, The First Affiliated Hospital of Jinan University, Guangzhou, Guangdong, China
- Guangdong Provincial Key Laboratory of Traditional Chinese Medicine Informatization, Guangzhou, Guangdong, China
- *Correspondence: Jun Lyu
| |
Collapse
|
10
|
Xing W, He W, Li X, Chen J, Cao Y, Zhou W, Shen Q, Zhang X, Ta D. Early severity prediction of BPD for premature infants from chest X-ray images using deep learning: A study at the 28th day of oxygen inhalation. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 221:106869. [PMID: 35576685 DOI: 10.1016/j.cmpb.2022.106869] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 04/23/2022] [Accepted: 05/08/2022] [Indexed: 06/15/2023]
Abstract
BACKGROUND AND OBJECTIVE Bronchopulmonary dysplasia is a common respiratory disease in premature infants. The severity is diagnosed at the 56th day after birth or discharge by analyzing the clinical indicators, which may cause the delay of the best treatment opportunity. Thus, we proposed a deep learning-based method using chest X-ray images of the 28th day of oxygen inhalation for the early severity prediction of bronchopulmonary dysplasia in clinic. METHODS We first adopted a two-step lung field extraction method by combining digital image processing and human-computer interaction to form the one-to-one corresponding image and label. The designed XSEG-Net model was then trained for segmenting the chest X-ray images, with the results being used for the analysis of heart development and clinical severity. Therein, Six-Point cardiothoracic ratio measurement algorithm based on corner detection was designed for the analysis of heart development; and the transfer learning of deep convolutional neural network models were used for the early prediction of clinical severities. RESULTS The dice and cross-entropy loss value of the training of XSEG-Net network reached 0.9794 and 0.0146. The dice, volumetric overlap error, relative volume difference, precision, and recall were used to evaluate the trained model in testing set with the result being 98.43 ± 0.39%, 0.49 ± 0.35%, 0.49 ± 0.35%, 98.67 ± 0.40%, and 98.20 ± 0.47%, respectively. The errors between the Six-Point cardiothoracic ratio measurement method and the gold standard were 0.0122 ± 0.0084. The deep convolutional neural network model based on VGGNet had the promising prediction performance, with the accuracy, precision, sensitivity, specificity, and F1 score reaching 95.58 ± 0.48%, 95.61 ± 0.55%, 95.67 ± 0.44%, 96.98 ± 0.42%, and 95.61±0.48%, respectively. CONCLUSIONS These experimental results of the proposed methods in lung field segmentation, cardiothoracic ratio measurement and clinic severity prediction were better than previous methods, which proved that this method had great potential for clinical application.
Collapse
Affiliation(s)
- Wenyu Xing
- Center for Biomedical Engineering, School of Information Science and Technology, Fudan University, Shanghai 200438, China; Human Phenome Institute, Fudan University, Shanghai 200438, China
| | - Wen He
- Department of Respiratory, Children's Hospital of Fudan University, Shanghai 201102, China
| | - Xiaoling Li
- Department of Respiratory, Children's Hospital of Fudan University, Shanghai 201102, China
| | - Jiangang Chen
- Shanghai Key Laboratory of Multidimensional Information Processing, East China Normal University, Shanghai 200237, China
| | - Yun Cao
- Department of Neonatology, Children's Hospital of Fudan University, Shanghai 201102, China
| | - Wenhao Zhou
- Department of Neonatology, Children's Hospital of Fudan University, Shanghai 201102, China
| | - Quanli Shen
- Department of Radiology, Children's Hospital of Fudan University, Shanghai 201102, China
| | - Xiaobo Zhang
- Department of Respiratory, Children's Hospital of Fudan University, Shanghai 201102, China.
| | - Dean Ta
- Center for Biomedical Engineering, School of Information Science and Technology, Fudan University, Shanghai 200438, China; Department of Rehabilitation Medicine, Huashan Hospital, Fudan University, Shanghai 200040, China.
| |
Collapse
|
11
|
Yue S, Li S, Huang X, Liu J, Hou X, Zhao Y, Niu D, Wang Y, Tan W, Wu J. Machine learning for the prediction of acute kidney injury in patients with sepsis. J Transl Med 2022; 20:215. [PMID: 35562803 PMCID: PMC9101823 DOI: 10.1186/s12967-022-03364-0] [Citation(s) in RCA: 97] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Accepted: 03/26/2022] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Acute kidney injury (AKI) is the most common and serious complication of sepsis, accompanied by high mortality and disease burden. The early prediction of AKI is critical for timely intervention and ultimately improves prognosis. This study aims to establish and validate predictive models based on novel machine learning (ML) algorithms for AKI in critically ill patients with sepsis. METHODS Data of patients with sepsis were extracted from the Medical Information Mart for Intensive Care III (MIMIC- III) database. Feature selection was performed using a Boruta algorithm. ML algorithms such as logistic regression (LR), k-nearest neighbors (KNN), support vector machine (SVM), decision tree, random forest, Extreme Gradient Boosting (XGBoost), and artificial neural network (ANN) were applied for model construction by utilizing tenfold cross-validation. The performances of these models were assessed in terms of discrimination, calibration, and clinical application. Moreover, the discrimination of ML-based models was compared with those of Sequential Organ Failure Assessment (SOFA) and the customized Simplified Acute Physiology Score (SAPS) II model. RESULTS A total of 3176 critically ill patients with sepsis were included for analysis, of which 2397 cases (75.5%) developed AKI during hospitalization. A total of 36 variables were selected for model construction. The models of LR, KNN, SVM, decision tree, random forest, ANN, XGBoost, SOFA and SAPS II score were established and obtained area under the receiver operating characteristic curves of 0.7365, 0.6637, 0.7353, 0.7492, 0.7787, 0.7547, 0.821, 0.6457 and 0.7015, respectively. The XGBoost model had the best predictive performance in terms of discrimination, calibration, and clinical application among all models. CONCLUSION The ML models can be reliable tools for predicting AKI in septic patients. The XGBoost model has the best predictive performance, which can be used to assist clinicians in identifying high-risk patients and implementing early interventions to reduce mortality.
Collapse
Affiliation(s)
- Suru Yue
- Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
| | - Shasha Li
- Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
| | - Xueying Huang
- Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
| | - Jie Liu
- Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
| | - Xuefei Hou
- Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
| | - Yumei Zhao
- Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
| | - Dongdong Niu
- Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
| | - Yufeng Wang
- Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China
| | - Wenkai Tan
- Department of Gastroenterology, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.
| | - Jiayuan Wu
- Clinical Research Service Center, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China. .,Collaborative Innovation Engineering Technology Research Center of Clinical Medical Big Data Cloud Service in Medical Consortium of West Guangdong Province, The Affiliated Hospital of Guangdong Medical University, Zhanjiang, 524001, Guangdong Province, China.
| |
Collapse
|
12
|
Machine Learning Models for Predicting Mortality in 7472 Very Low Birth Weight Infants Using Data from a Nationwide Neonatal Network. Diagnostics (Basel) 2022; 12:diagnostics12030625. [PMID: 35328178 PMCID: PMC8947011 DOI: 10.3390/diagnostics12030625] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Revised: 02/26/2022] [Accepted: 03/01/2022] [Indexed: 11/30/2022] Open
Abstract
Statistical and analytical methods using artificial intelligence approaches such as machine learning (ML) are increasingly being applied to the field of pediatrics, particularly to neonatology. This study compared the representative ML analysis and the logistic regression (LR), which is a traditional statistical analysis method, using them to predict mortality of very low birth weight infants (VLBWI). We included 7472 VLBWI data from a nationwide Korean neonatal network. Eleven predictor variables (neonatal factors: male sex, gestational age, 5 min Apgar scores, body temperature, and resuscitation at birth; maternal factors: diabetes mellitus, hypertension, chorioamnionitis, premature rupture of membranes, antenatal steroid, and cesarean delivery) were selected based on clinical impact and statistical analysis. We compared the predicted mortality between ML methods—such as artificial neural network (ANN), random forest (RF), and support vector machine (SVM)—and LR with a randomly selected training set (80%) and a test set (20%). The model performances of area under the receiver operating curve (95% confidence interval) equaled LR 0.841 (0.811−0.872), ANN 0.845 (0.815−0.875), and RF 0.826 (0.795−0.858). The exception was SVM 0.631 (0.578−0.683). No statistically significant differences were observed between the performance of LR, ANN, and RF (i.e., p > 0.05). However, the SVM model was lower (p < 0.01). We suggest that VLBWI mortality prediction using ML methods would yield the same prediction rate as the traditional statistical LR method and may be suitable for predicting mortality. However, low prediction rates are observed in certain ML methods; hence, further research is needed on these limitations and selecting an appropriate method.
Collapse
|
13
|
Zhou Y, Gao J. Why not try to predict autism spectrum disorder with crucial biomarkers in cuproptosis signaling pathway? Front Psychiatry 2022; 13:1037503. [PMID: 36405901 PMCID: PMC9667021 DOI: 10.3389/fpsyt.2022.1037503] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 10/17/2022] [Indexed: 01/24/2023] Open
Abstract
The exact pathogenesis of autism spectrum disorder (ASD) is still unclear, yet some potential mechanisms may not have been evaluated before. Cuproptosis is a novel form of regulated cell death reported this year, and no study has reported the relationship between ASD and cuproptosis. This study aimed to identify ASD in suspected patients early using machine learning models based on biomarkers of the cuproptosis pathway. We collected gene expression profiles from brain samples from ASD model mice and blood samples from humans with ASD, selected crucial genes in the cuproptosis signaling pathway, and then analysed these genes with different machine learning models. The accuracy, sensitivity, specificity, and areas under the receiver operating characteristic curves of the machine learning models were estimated in the training, internal validation, and external validation cohorts. Differences between models were determined with Bonferroni's test. The results of screening with the Boruta algorithm showed that FDX1, DLAT, LIAS, and ATP7B were crucial genes in the cuproptosis signaling pathway for ASD. All selected genes and corresponding proteins were also expressed in the human brain. The k-nearest neighbor, support vector machine and random forest models could identify approximately 72% of patients with ASD. The artificial neural network (ANN) model was the most suitable for the present data because the accuracy, sensitivity, and specificity were 0.90, 1.00, and 0.80, respectively, in the external validation cohort. Thus, we first report the prediction of ASD in suspected patients with machine learning methods based on crucial biomarkers in the cuproptosis signaling pathway, and these findings may contribute to investigations of the potential pathogenesis and early identification of ASD.
Collapse
Affiliation(s)
- Yu Zhou
- Department of Child Rehabilitation Division, Huai'an Maternal and Child Health Care Center, Huai'an, China.,Affiliated Hospital of Yang Zhou University Medical College, Huai'an Maternal and Child Health Care Center, Huai'an, China
| | - Jing Gao
- Department of Child Rehabilitation Division, Huai'an Maternal and Child Health Care Center, Huai'an, China.,Affiliated Hospital of Yang Zhou University Medical College, Huai'an Maternal and Child Health Care Center, Huai'an, China
| |
Collapse
|