1
|
Kadomatsu Y, Emoto R, Kubo Y, Nakanishi K, Ueno H, Kato T, Nakamura S, Mizuno T, Matsui S, Chen-Yoshikawa TF. Development of a machine learning-based risk model for postoperative complications of lung cancer surgery. Surg Today 2024:10.1007/s00595-024-02878-y. [PMID: 38896280 DOI: 10.1007/s00595-024-02878-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Accepted: 04/30/2024] [Indexed: 06/21/2024]
Abstract
PURPOSE To develop a comorbidity risk score specifically for lung resection surgeries. METHODS We reviewed the medical records of patients who underwent lung resections for lung cancer, and developed a risk model using data from 2014 to 2017 (training dataset), validated using data from 2018 to 2019 (validation dataset). Forty variables were analyzed, including 35 factors related to the patient's overall condition and five factors related to surgical techniques and tumor-related factors. The risk model for postoperative complications was developed using an elastic net regularized generalized linear model. The performance of the risk model was evaluated using receiver operating characteristic curves and compared with the Charlson Comorbidity Index (CCI). RESULTS The rate of postoperative complications was 34.7% in the training dataset and 21.9% in the validation dataset. The final model consisted of 20 variables, including age, surgical-related factors, respiratory function tests, and comorbidities, such as chronic obstructive pulmonary disease, a history of ischemic heart disease, and 12 blood test results. The area under the curve (AUC) for the developed risk model was 0.734, whereas the AUC for the CCI was 0.521 in the validation dataset. CONCLUSIONS The new machine learning model could predict postoperative complications with acceptable accuracy. CLINICAL REGISTRATION NUMBER 2020-0375.
Collapse
Affiliation(s)
- Yuka Kadomatsu
- Department of Thoracic Surgery, Nagoya University Graduate School of Medicine, 65 Tsurumai-Cho, Showa-Ku, Nagoya, 466-8550, Japan.
| | - Ryo Emoto
- Department of Biostatistics, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Yoko Kubo
- Department of Preventive Medicine, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Keita Nakanishi
- Department of Thoracic Surgery, Nagoya University Graduate School of Medicine, 65 Tsurumai-Cho, Showa-Ku, Nagoya, 466-8550, Japan
| | - Harushi Ueno
- Department of Thoracic Surgery, Nagoya University Graduate School of Medicine, 65 Tsurumai-Cho, Showa-Ku, Nagoya, 466-8550, Japan
| | - Taketo Kato
- Department of Thoracic Surgery, Nagoya University Graduate School of Medicine, 65 Tsurumai-Cho, Showa-Ku, Nagoya, 466-8550, Japan
| | - Shota Nakamura
- Department of Thoracic Surgery, Nagoya University Graduate School of Medicine, 65 Tsurumai-Cho, Showa-Ku, Nagoya, 466-8550, Japan
| | - Tetsuya Mizuno
- Department of Thoracic Surgery, Nagoya University Graduate School of Medicine, 65 Tsurumai-Cho, Showa-Ku, Nagoya, 466-8550, Japan
| | - Shigeyuki Matsui
- Department of Biostatistics, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Toyofumi Fengshi Chen-Yoshikawa
- Department of Thoracic Surgery, Nagoya University Graduate School of Medicine, 65 Tsurumai-Cho, Showa-Ku, Nagoya, 466-8550, Japan
| |
Collapse
|
2
|
Kiyasseh D, Laca J, Haque TF, Otiato M, Miles BJ, Wagner C, Donoho DA, Trinh QD, Anandkumar A, Hung AJ. Human visual explanations mitigate bias in AI-based assessment of surgeon skills. NPJ Digit Med 2023; 6:54. [PMID: 36997642 PMCID: PMC10063676 DOI: 10.1038/s41746-023-00766-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 01/21/2023] [Indexed: 04/03/2023] Open
Abstract
Artificial intelligence (AI) systems can now reliably assess surgeon skills through videos of intraoperative surgical activity. With such systems informing future high-stakes decisions such as whether to credential surgeons and grant them the privilege to operate on patients, it is critical that they treat all surgeons fairly. However, it remains an open question whether surgical AI systems exhibit bias against surgeon sub-cohorts, and, if so, whether such bias can be mitigated. Here, we examine and mitigate the bias exhibited by a family of surgical AI systems-SAIS-deployed on videos of robotic surgeries from three geographically-diverse hospitals (USA and EU). We show that SAIS exhibits an underskilling bias, erroneously downgrading surgical performance, and an overskilling bias, erroneously upgrading surgical performance, at different rates across surgeon sub-cohorts. To mitigate such bias, we leverage a strategy -TWIX-which teaches an AI system to provide a visual explanation for its skill assessment that otherwise would have been provided by human experts. We show that whereas baseline strategies inconsistently mitigate algorithmic bias, TWIX can effectively mitigate the underskilling and overskilling bias while simultaneously improving the performance of these AI systems across hospitals. We discovered that these findings carry over to the training environment where we assess medical students' skills today. Our study is a critical prerequisite to the eventual implementation of AI-augmented global surgeon credentialing programs, ensuring that all surgeons are treated fairly.
Collapse
Affiliation(s)
- Dani Kiyasseh
- Department of Computing and Mathematical Sciences, California Institute of Technology, California, CA, USA.
| | - Jasper Laca
- Center for Robotic Simulation and Education, Catherine & Joseph Aresty Department of Urology, University of Southern California, California, CA, USA
| | - Taseen F Haque
- Center for Robotic Simulation and Education, Catherine & Joseph Aresty Department of Urology, University of Southern California, California, CA, USA
| | - Maxwell Otiato
- Center for Robotic Simulation and Education, Catherine & Joseph Aresty Department of Urology, University of Southern California, California, CA, USA
| | - Brian J Miles
- Department of Urology, Houston Methodist Hospital, Texas, TX, USA
| | - Christian Wagner
- Department of Urology, Pediatric Urology and Uro-Oncology, Prostate Center Northwest, St. Antonius-Hospital, Gronau, Germany
| | - Daniel A Donoho
- Division of Neurosurgery, Center for Neuroscience, Children's National Hospital, Washington DC, WA, USA
| | - Quoc-Dien Trinh
- Center for Surgery & Public Health, Department of Surgery, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Animashree Anandkumar
- Department of Computing and Mathematical Sciences, California Institute of Technology, California, CA, USA
| | - Andrew J Hung
- Center for Robotic Simulation and Education, Catherine & Joseph Aresty Department of Urology, University of Southern California, California, CA, USA.
| |
Collapse
|
3
|
Bin Zahid A, Geocadin RG. Machine learning and self-fulfilling prophecies: Primum non nocere. Resuscitation 2023; 183:109687. [PMID: 36623748 DOI: 10.1016/j.resuscitation.2022.109687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Revised: 12/26/2022] [Accepted: 12/29/2022] [Indexed: 01/09/2023]
Affiliation(s)
- Abdullah Bin Zahid
- Departments of Neurology, Anesthesiology-Critical Care Medicine and Neurosurgery, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Romergryko G Geocadin
- Departments of Neurology, Anesthesiology-Critical Care Medicine and Neurosurgery, Johns Hopkins University School of Medicine, Baltimore, MD, USA.
| |
Collapse
|
4
|
Moukheiber D, Mahindre S, Moukheiber L, Moukheiber M, Wang S, Ma C, Shih G, Peng Y, Gao M. Few-Shot Learning Geometric Ensemble for Multi-label Classification of Chest X-Rays. DATA AUGMENTATION, LABELLING, AND IMPERFECTIONS : SECOND MICCAI WORKSHOP, DALI 2022, HELD IN CONJUNCTION WITH MICCAI 2022, SINGAPORE, SEPTEMBER 22, 2022, PROCEEDINGS. DALI (WORKSHOP) (2ND : 2022 : SINGAPORE) 2022; 13567:112-122. [PMID: 36383493 PMCID: PMC9652771 DOI: 10.1007/978-3-031-17027-0_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
This paper aims to identify uncommon cardiothoracic diseases and patterns on chest X-ray images. Training a machine learning model to classify rare diseases with multi-label indications is challenging without sufficient labeled training samples. Our model leverages the information from common diseases and adapts to perform on less common mentions. We propose to use multi-label few-shot learning (FSL) schemes including neighborhood component analysis loss, generating additional samples using distribution calibration and fine-tuning based on multi-label classification loss. We utilize the fact that the widely adopted nearest neighbor-based FSL schemes like ProtoNet are Voronoi diagrams in feature space. In our method, the Voronoi diagrams in the features space generated from multi-label schemes are combined into our geometric DeepVoro Multi-label ensemble. The improved performance in multi-label few-shot classification using the multi-label ensemble is demonstrated in our experiments (The code is publicly available at https://github.com/Saurabh7/Few-shot-learning-multilabel-cxray).
Collapse
Affiliation(s)
| | - Saurabh Mahindre
- University at Buffalo, The State University of New York, Buffalo, NY, USA
| | | | | | - Song Wang
- The University of Texas at Austin, Austin, TX, USA
| | - Chunwei Ma
- University at Buffalo, The State University of New York, Buffalo, NY, USA
| | | | - Yifan Peng
- Weill Cornell Medicine, New York, NY, USA
| | - Mingchen Gao
- University at Buffalo, The State University of New York, Buffalo, NY, USA
| |
Collapse
|
5
|
Kothari R, Chiu C, Moukheiber M, Jehiro M, Bishara A, Lee C, Piracchio R, Celi LA. A descriptive appraisal of quality of reporting in a cohort of machine learning studies in anesthesiology. Anaesth Crit Care Pain Med 2022; 41:101126. [PMID: 35811037 DOI: 10.1016/j.accpm.2022.101126] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Revised: 05/18/2022] [Accepted: 05/19/2022] [Indexed: 12/13/2022]
Abstract
BACKGROUND The field of machine learning is being employed more and more in medicine. However, studies have shown that the quality of published studies frequently lacks completeness and adherence to published reporting guidelines. This assessment has not been done in the subspecialty of anesthesiology. METHODS We appraised the quality of reporting of a convenience sample of 67 peer-reviewed publications sourced from the scoping review by Hashimoto et al. Each publication was appraised on the presence of reporting elements (reporting compliance) selected from 4 peer-reviewed guidelines for reporting on machine learning studies. Results are described in several cross sections, including by section of manuscript (e.g. abstract, introduction, etc.), year of publication, impact factor of journal, and impact of publication. RESULTS On average, reporting compliance was 64% ± 13%. There was marked heterogeneity of reporting based on section of manuscript. There was a mild trend towards increased quality of reporting with increasing impact factor of journal of publication and increasing average number of citations per year since publication, and no trend regarding recency of publication. CONCLUSION The quality of reporting of machine learning studies in anesthesiology is on par with other fields, but can benefit from improvement, especially in presenting methodology, results, and discussion points, including interpretation of models and pitfalls therein. Clinicians in today's learning health systems will benefit from skills in appraisal of evidence. Several reporting guidelines have been released, and updates to mainstream guidelines are under development, which we hope will usher in improvement in reporting quality.
Collapse
Affiliation(s)
- Rishi Kothari
- Department of Anesthesia and Perioperative Care, University of California San Francisco, San Francisco, CA 4143, USA.
| | - Catherine Chiu
- Department of Anesthesia and Perioperative Care, University of California San Francisco, San Francisco, CA 4143, USA
| | - Mira Moukheiber
- Picower Institute for Learning & Memory, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Matthew Jehiro
- Department of Biostatistics, State University of New York at Buffalo, Buffalo, NY 14260, USA
| | - Andrew Bishara
- Department of Anesthesia and Perioperative Care, University of California San Francisco, San Francisco, CA 4143, USA
| | - Christine Lee
- Edwards Lifesciences, Critical Care, Irvine, CA 92614, USA
| | - Romain Piracchio
- Department of Anesthesia and Perioperative Care, University of California San Francisco, San Francisco, CA 4143, USA
| | - Leo Anthony Celi
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Department of Medicine, Beth Israel Deaconess Medical Center, Boston, MA 02215, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| |
Collapse
|