1
|
Liang Y, Chen X, Zheng R, Cheng X, Su Z, Wang X, Du H, Zhu M, Li G, Zhong Y, Cheng S, Yu B, Yang Y, Chen R, Cui L, Yao H, Gu Q, Gong C, Jun Z, Huang X, Liu D, Yan X, Wei H, Li Y, Zhang H, Liu Y, Wang F, Zhang G, Fan X, Dai H, Luo X. Validation of an AI-Powered Automated X-ray Bone Age Analyzer in Chinese Children and Adolescents: A Comparison with the Tanner-Whitehouse 3 Method. Adv Ther 2024:10.1007/s12325-024-02944-4. [PMID: 39085749 DOI: 10.1007/s12325-024-02944-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2024] [Accepted: 07/04/2024] [Indexed: 08/02/2024]
Abstract
INTRODUCTION Automated bone age assessment (BAA) is of growing interest because of its accuracy and time efficiency in daily practice. In this study, we validated the clinical applicability of a commercially available artificial intelligence (AI)-powered X-ray bone age analyzer equipped with a deep learning-based automated BAA system and compared its performance with that of the Tanner-Whitehouse 3 (TW-3) method. METHODS Radiographs prospectively collected from 30 centers across various regions in China, including 900 Chinese children and adolescents, were assessed independently by six doctors (three experts and three residents) and an AI analyzer for TW3 radius, ulna, and short bones (RUS) and TW3 carpal bone age. The experts' mean estimates were accepted as the gold standard. The performance of the AI analyzer was compared with that of each resident. RESULTS For the estimation of TW3-RUS, the AI analyzer had a mean absolute error (MAE) of 0.48 ± 0.42. The percentage of patients with an absolute error of < 1.0 years was 86.78%. The MAE was significantly lower than that of rater 1 (0.54 ± 0.49, P = 0.0068); however, it was not significant for rater 2 (0.48 ± 0.48) or rater 3 (0.49 ± 0.46). For TW3 carpal, the AI analyzer had an MAE of 0.48 ± 0.65. The percentage of patients with an absolute error of < 1.0 years was 88.78%. The MAE was significantly lower than that of rater 2 (0.58 ± 0.67, P = 0.0018) and numerically lower for rater 1 (0.54 ± 0.64) and rater 3 (0.50 ± 0.53). These results were consistent for the subgroups according to sex, and differences between the age groups were observed. CONCLUSION In this comprehensive validation study conducted in China, an AI-powered X-ray bone age analyzer showed accuracies that matched or exceeded those of doctor raters. This method may improve the efficiency of clinical routines by reducing reading time without compromising accuracy.
Collapse
Affiliation(s)
- Yan Liang
- Department of Pediatrics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China
- Hubei Key Laboratory of Pediatric Genetic Metabolic and Endocrine Rare Diseases, Wuhan, 430030, China
| | - Xiaobo Chen
- Department of Endocrinology, Children's Hospital, Capital Institute of Pediatrics, Beijing, 100020, China
| | - Rongxiu Zheng
- Department of Pediatrics, Tianjin Medical University General Hospital, Tianjin, 300052, China
| | - Xinran Cheng
- Department of Pediatric Endocrine Genetics and Metabolism, Chengdu Women's and Children's Center Hospital, Chengdu, 610074, China
| | - Zhe Su
- Department of Endocrinology, Shenzhen Children's Hospital, No. 7019 Yitian Road, Shenzhen, 518038, China
| | - Xiumin Wang
- Department of Endocrinology and Metabolism, Shanghai Children's Medical Center, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200030, China
| | - Hongwei Du
- Department of Paediatrics, First Hospital of Jilin University, Changchun, 130021, China
| | - Min Zhu
- Department of Endocrinology, Children's Hospital of Chongqing Medical University, Chongqing, 400014, China
| | - Guimei Li
- Department of Pediatrics, Shandong Provincial Hospital Affiliated to Shandong First Medical University, Jinan, 250021, China
| | - Yan Zhong
- Department of Child Health Care, Hunan Children's Hospital, Changsha, 410007, China
| | - Shengquan Cheng
- Department of Pediatrics, First Affiliated Hospital of Air Force Medical University, Xi'an, 710032, China
| | - Baosheng Yu
- Department of Pediatrics, The Second Affiliated Hospital, Nanjing Medical University, Nanjing, 210003, China
| | - Yu Yang
- Department of Endocrinology and Genetics, Jiangxi Provincial Children's Hospital, Affiliated Children's Hospital of Nanchang University, Nanchang, 330006, China
| | - Ruimin Chen
- Department of Endocrinology, Genetics and Metabolism, Fuzhou Children's Hospital of Fujian Medical University, Fuzhou, 350005, China
| | - Lanwei Cui
- Department of Pediatric, The First Affiliated Hospital of Harbin Medical University, Harbin, 150007, China
| | - Hui Yao
- Department of Endocrinology and Metabolism, Wuhan Children's Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430015, China
| | - Qiang Gu
- Department of Pediatrics, First Affiliated Hospital of Shihezi University, Shihezi, 832000, China
| | - Chunxiu Gong
- Department of Endocrine and Genetics and Metabolism, Beijing Children's Hospital, Capital Medical University, National Centre for Children's Health, Beijing, 100045, China
| | - Zhang Jun
- Department of Pediatrics, The First Affiliated Hospital, Sun Yat-Sen University, Guangzhou, 510080, China
| | - Xiaoyan Huang
- Department of Genetics, Metabolism and Endocrinology, Hainan Women and Children's Medical Center, Haikou, 570312, China
| | - Deyun Liu
- Department of Pediatrics, The Second Affiliated Hospital of Anhui Medical University, Hefei, 230601, China
| | - Xueqin Yan
- Department of Pediatrics, Boai Hospital of Zhongshan, Zhongshan, 528400, China
| | - Haiyan Wei
- Department of Endocrinology and Metabolism, Genetics, Henan Children's Hospital (Children's Hospital Affiliated to Zhengzhou University), Zhengzhou, 450018, China
| | - Yuwen Li
- Department of Pediatrics, The First Affiliated Hospital of Nanjing Medical University, Nanjing, 210029, China
| | - Huifeng Zhang
- Department of Pediatrics, The Second Hospital of Hebei Medical University, Shijiazhuang, 050000, China
| | - Yanjie Liu
- Department of Pediatrics, Inner Mongolia People's Hospital, Hohhot, 010017, China
| | - Fengyun Wang
- Department of Endocrinology, Children's Hospital of Soochow University, Suzhou, 215025, China
| | - Gaixiu Zhang
- Department of Endocrine and Genetics and Metabolism, Children's Hospital of Shanxi, Taiyuan, 030006, China
| | - Xin Fan
- Department of Pediatric, The Second Affiliated Hospital of Guangxi Medical University, Nanning, 537406, China
| | - Hongmei Dai
- Department of Pediatric, The Third Xiangya Hospital, Central South University, Changsha, 410013, China
| | - Xiaoping Luo
- Department of Pediatrics, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, China.
- Hubei Key Laboratory of Pediatric Genetic Metabolic and Endocrine Rare Diseases, Wuhan, 430030, China.
| |
Collapse
|
2
|
Gao C, Hu C, Qian Q, Li Y, Xing X, Gong P, Lin M, Ding Z. Artificial intelligence model system for bone age assessment of preschool children. Pediatr Res 2024:10.1038/s41390-024-03282-5. [PMID: 38802611 DOI: 10.1038/s41390-024-03282-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 05/04/2024] [Accepted: 05/07/2024] [Indexed: 05/29/2024]
Abstract
BACKGROUD Our study aimed to assess the impact of inter- and intra-observer variations when utilizing an artificial intelligence (AI) system for bone age assessment (BAA) of preschool children. METHODS A retrospective study was conducted involving a total sample of 53 female individuals and 41 male individuals aged 3-6 years in China. Radiographs were assessed by four mid-level radiology reviewers using the TW3 and RUS-CHN methods. Bone age (BA) was analyzed in two separate situations, with/without the assistance of AI. Following a 4-week wash-out period, radiographs were reevaluated in the same manner. Accuracy metrics, the correlation coefficient (ICC)and Bland-Altman plots were employed. RESULTS The accuracy of BAA by the reviewers was significantly improved with AI. The results of RMSE and MAE decreased in both methods (p < 0.001). When comparing inter-observer agreement in both methods and intra-observer reproducibility in two interpretations, the ICC results were improved with AI. The ICC values increased in both two interpretations for both methods and exceeded 0.99 with AI. CONCLUSION In the assessment of BA for preschool children, AI was found to be capable of reducing inter-observer variability and enhancing intra-observer reproducibility, which can be considered an important tool for clinical work by radiologists. IMPACT The RUS-CHN method is a special bone age method devised to be suitable for Chinese children. The preschool stage is a critical phase for children, marked by a high degree of variability that renders BA prediction challenging. The accuracy of BAA by the reviewers can be significantly improved with the aid of an AI model system. This study is the first to assess the impact of inter- and intra-observer variations when utilizing an AI model system for BAA of preschool children using both the TW3 and RUS-CHN methods.
Collapse
Affiliation(s)
- Chengcheng Gao
- Department of Radiology, Hangzhou First People's Hospital, Hangzhou, China
| | - Chunfeng Hu
- Department of Radiology, Hangzhou First People's Hospital, Hangzhou, China
- The Fourth School of Clinical Medicine, Zhejiang Chinese Medicine University, Hangzhou, China
| | - Qi Qian
- Department of Radiology, The Third Affiliated Hospital of Zhejiang Chinese Medicine University, Hangzhou, China
| | - Yangsheng Li
- Department of Radiology, Hangzhou First People's Hospital, Hangzhou, China
| | - Xiaowei Xing
- Rehabilitation Medicine Center, Department of Radiology, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, China
| | | | - Min Lin
- Department of Radiology, The Third Affiliated Hospital of Zhejiang Chinese Medicine University, Hangzhou, China.
- College of Humanities and Management, Zhejiang Chinese Medical University, Hangzhou, China.
| | - Zhongxiang Ding
- Department of Radiology, Hangzhou First People's Hospital, Hangzhou, China.
- Key Laboratory of Clinical Cancer Pharmacology and Toxicology Research of Zhejiang Province, Hangzhou, China.
| |
Collapse
|
3
|
Choi G, Ham S, Je BK, Rhie YJ, Ahn KS, Shim E, Lee MJ. Olecranon bone age assessment in puberty using a lateral elbow radiograph and a deep-learning model. Eur Radiol 2024:10.1007/s00330-024-10748-x. [PMID: 38676732 DOI: 10.1007/s00330-024-10748-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 02/28/2024] [Accepted: 03/21/2024] [Indexed: 04/29/2024]
Abstract
OBJECTIVES To improve pubertal bone age (BA) evaluation by developing a precise and practical elbow BA classification using the olecranon, and a deep-learning AI model. MATERIALS AND METHODS Lateral elbow radiographs taken for BA evaluation in children under 18 years were collected from January 2020 to June 2022, retrospectively. A novel classification and the olecranon BA were established based on the morphological changes in the olecranon ossification process during puberty. The olecranon BA was compared with other elbow and hand BA methods, using intraclass correlation coefficients (ICCs), and a deep-learning AI model was developed. RESULTS A total of 3508 lateral elbow radiographs (mean age 9.8 ± 1.8 years) were collected. The olecranon BA showed the highest applicability (100%) and interobserver agreement (ICC 0.993) among elbow BA methods. It showed excellent reliability with Sauvegrain (0.967 in girls, 0.969 in boys) and Dimeglio (0.978 in girls, 0.978 in boys) elbow BA methods, as well as Korean standard (KS) hand BA in boys (0.917), and good reliability with KS in girls (0.896) and Greulich-Pyle (GP)/Tanner-Whitehouse (TW)3 (0.835 in girls, 0.895 in boys) hand BA methods. The AI model for olecranon BA showed an accuracy of 0.96 and a specificity of 0.98 with EfficientDet-b4. External validation showed an accuracy of 0.86 and a specificity of 0.91. CONCLUSION The olecranon BA evaluation for puberty, requiring only a lateral elbow radiograph, showed the highest applicability and interobserver agreement, and excellent reliability with other BA evaluation methods, along with a high performance of the AI model. CLINICAL RELEVANCE STATEMENT This AI model uses a single lateral elbow radiograph to determine bone age for puberty from the olecranon ossification center and can improve pubertal bone age assessment with the highest applicability and excellent reliability compared to previous methods. KEY POINTS Elbow bone age is valuable for pubertal bone age assessment, but conventional methods have limitations. Olecranon bone age and its AI model showed high performances for pubertal bone age assessment. Olecranon bone age system is practical and accurate while requiring only a single lateral elbow radiograph.
Collapse
Affiliation(s)
- Gayoung Choi
- Department of Radiology, Korea University Ansan Hospital, Korea University College of Medicine, Seoul, Korea
| | - Sungwon Ham
- Healthcare Readiness Institute for Unified Korea, Korea University Ansan Hospital, Korea University College of Medicine, Seoul, Korea
| | - Bo-Kyung Je
- Department of Radiology, Korea University Ansan Hospital, Korea University College of Medicine, Seoul, Korea.
| | - Young-Jun Rhie
- Department of Pediatrics, Korea University Ansan Hospital, Korea University College of Medicine, Seoul, Korea
| | - Kyung-Sik Ahn
- Department of Radiology, Korea University Anam Hospital, Korea University College of Medicine, Seoul, Korea
| | - Euddeum Shim
- Department of Radiology, Korea University Ansan Hospital, Korea University College of Medicine, Seoul, Korea
| | - Mi-Jung Lee
- Department of Radiology and Research Institute of Radiological Science, Severance Hospital, Yonsei University College of Medicine, Seoul, Korea
| |
Collapse
|
4
|
Martín Pérez SE, Martín Pérez IM, Vega González JM, Molina Suárez R, León Hernández C, Rodríguez Hernández F, Herrera Perez M. Precision and Accuracy of Radiological Bone Age Assessment in Children among Different Ethnic Groups: A Systematic Review. Diagnostics (Basel) 2023; 13:3124. [PMID: 37835867 PMCID: PMC10572703 DOI: 10.3390/diagnostics13193124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 09/24/2023] [Accepted: 09/30/2023] [Indexed: 10/15/2023] Open
Abstract
AIM The aim was to identify, evaluate, and summarize the findings of relevant individual studies on the precision and accuracy of radiological BA assessment procedures among children from different ethnic groups. MATERIALS AND METHODS A qualitative systematic review was carried out following the MOOSE statement and previously registered in PROSPERO (CRD42023449512). A search was performed in MEDLINE (PubMed) (n = 561), the Cochrane Library (n = 261), CINAHL (n = 103), Web of Science (WOS) (n = 181), and institutional repositories (n = 37) using MeSH and free terms combined with the Booleans "AND" and "OR". NOS and ROBINS-E were used to assess the methodological quality and the risk of bias of the included studies, respectively. RESULTS A total of 51 articles (n = 20,100) on radiological BA assessment procedures were precise in terms of intra-observer and inter-observer reliability for all ethnic groups. In Caucasian and Hispanic children, the Greulich-Pyle Atlas (GPA) was accurate at all ages, but in youths, Tanner-Whitehouse radius-ulna-short bones 3 (TW3-RUS) could be an alternative. In Asian and Arab subjects, GPA and Tanner-Whitehouse 3 (TW3) overestimated the BA in adolescents near adulthood. In African youths, GPA overestimated the BA while TW3 was more accurate. CONCLUSION GPA and TW3 radiological BA assessment procedures are both precise but their accuracy in estimating CA among children of different ethnic groups can be altered by racial bias.
Collapse
Affiliation(s)
- Sebastián Eustaquio Martín Pérez
- Departamento de Farmacología y Medicina Física, Área de Radiología y Medicina Física, Sección de Enfermería y Fisioterapia, Facultad de Ciencias de la Salud, Universidad de La Laguna, 38200 Santa Cruz de Tenerife, Spain; (I.M.M.P.); (F.R.H.)
- Escuela de Doctorado y Estudios de Posgrado, Universidad de La Laguna, San Cristóbal de La Laguna, 38203 Santa Cruz de Tenerife, Spain
- Musculoskeletal Pain and Motor Control Research Group, Faculty of Health Sciences, Universidad Europea de Canarias, 38300 Santa Cruz de Tenerife, Spain
- Musculoskeletal Pain and Motor Control Research Group, Faculty of Sport Sciences, Universidad Europea de Madrid, 28670 Villaviciosa de Odón, Spain
| | - Isidro Miguel Martín Pérez
- Departamento de Farmacología y Medicina Física, Área de Radiología y Medicina Física, Sección de Enfermería y Fisioterapia, Facultad de Ciencias de la Salud, Universidad de La Laguna, 38200 Santa Cruz de Tenerife, Spain; (I.M.M.P.); (F.R.H.)
- Escuela de Doctorado y Estudios de Posgrado, Universidad de La Laguna, San Cristóbal de La Laguna, 38203 Santa Cruz de Tenerife, Spain
| | - Jesús María Vega González
- Institute of Legal Medicine and Forensic Sciences of Santa Cruz de Tenerife, 38230 San Cristóbal de La Laguna, Spain;
| | - Ruth Molina Suárez
- Pediatric Endocrinology Unit, Pediatric Department, Hospital Universitario de Canarias, San Cristóbal de La Laguna, 38320 Santa Cruz de Tenerife, Spain;
| | - Coromoto León Hernández
- Departamento de Ingeniería Informática y de Sistemas, Universidad de La Laguna, Apdo. 456, San Cristóbal de La Laguna, 38200 Santa Cruz de Tenerife, España;
| | - Fidel Rodríguez Hernández
- Departamento de Farmacología y Medicina Física, Área de Radiología y Medicina Física, Sección de Enfermería y Fisioterapia, Facultad de Ciencias de la Salud, Universidad de La Laguna, 38200 Santa Cruz de Tenerife, Spain; (I.M.M.P.); (F.R.H.)
| | - Mario Herrera Perez
- School of Medicine (Health Sciences), Universidad de La Laguna, 38200 Santa Cruz de Tenerife, Spain;
- Foot and Ankle Unit, Orthopedic Surgery and Traumatology Department, San Cristóbal de La Laguna, 38320 Santa Cruz de Tenerife, Spain
| |
Collapse
|
5
|
Zhang L, Chen J, Hou L, Xu Y, Liu Z, Huang S, Ou H, Meng Z, Liang L. Clinical application of artificial intelligence in longitudinal image analysis of bone age among GHD patients. Front Pediatr 2022; 10:986500. [PMID: 36440334 PMCID: PMC9691878 DOI: 10.3389/fped.2022.986500] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Accepted: 10/25/2022] [Indexed: 11/13/2022] Open
Abstract
OBJECTIVE This study aims to explore the clinical value of artificial intelligence (AI)-assisted bone age assessment (BAA) among children with growth hormone deficiency (GHD). METHODS A total of 290 bone age (BA) radiographs were collected from 52 children who participated in the study at Sun Yat-sen Memorial Hospital between January 2016 and August 2017. Senior pediatric endocrinologists independently evaluated BA according to the China 05 (CH05) method, and their consistent results were regarded as the gold standard (GS). Meanwhile, two junior pediatric endocrinologists were asked to assessed BA both with and without assistance from the AI-based BA evaluation system. Six months later, around 20% of the images assessed by the junior pediatric endocrinologists were randomly selected to be re-evaluated with the same procedure half a year later. Root mean square error (RMSE), mean absolute error (MAE), accuracy, and Bland-Altman plots were used to compare differences in BA. The intra-class correlation coefficient (ICC) and one-way repeated ANOVA were used to assess inter- and intra-observer variabilities in BAA. A boxplot of BA evaluated by different raters during the course of treatment and a mixed linear model were used to illustrate inter-rater effect over time. RESULTS A total of 52 children with GHD were included, with mean chronological age and BA by GS of 6.64 ± 2.49 and 5.85 ± 2.30 years at baseline, respectively. After incorporating AI assistance, the performance of the junior pediatric endocrinologists improved (P < 0.001), with MAE and RMSE both decreased by more than 1.65 years (Rater 1: ΔMAE = 1.780, ΔRMSE = 1.655; Rater 2: ΔMAE = 1.794, ΔRMSE = 1.719), and accuracy increasing from approximately 10% to over 91%. The ICC also increased from 0.951 to 0.990. During GHD treatment (at baseline, 6-, 12-, 18-, and 24-months), the difference decreased sharply when AI was applied. Furthermore, a significant inter-rater effect (P = 0.002) also vanished upon AI involvement. CONCLUSION AI-assisted interpretation of BA can improve accuracy and decrease variability in results among junior pediatric endocrinologists in longitudinal cohort studies, which shows potential for further clinical application.
Collapse
Affiliation(s)
- Lina Zhang
- Department of Pediatrics, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China
| | - Jia Chen
- Research Center for Healthcare Data Science, Zhejiang Lab, Hangzhou, China
| | - Lele Hou
- Department of Pediatrics, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China
| | - Yingying Xu
- Department of Pediatrics, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China
| | - Zulin Liu
- Department of Pediatrics, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China
| | - Siqi Huang
- Department of Pediatrics, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China
| | - Hui Ou
- Department of Pediatrics, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China
| | - Zhe Meng
- Department of Pediatrics, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China
| | - Liyang Liang
- Department of Pediatrics, Sun Yat-Sen Memorial Hospital, Sun Yat-Sen University, Guangzhou, China
| |
Collapse
|