1
|
A retrieval-augmented chatbot based on GPT-4 provides appropriate differential diagnosis in gastrointestinal radiology: a proof of concept study. Eur Radiol Exp 2024; 8:60. [PMID: 38755410 PMCID: PMC11098977 DOI: 10.1186/s41747-024-00457-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Accepted: 03/12/2024] [Indexed: 05/18/2024] Open
Abstract
BACKGROUND We investigated the potential of an imaging-aware GPT-4-based chatbot in providing diagnoses based on imaging descriptions of abdominal pathologies. METHODS Utilizing zero-shot learning via the LlamaIndex framework, GPT-4 was enhanced using the 96 documents from the Radiographics Top 10 Reading List on gastrointestinal imaging, creating a gastrointestinal imaging-aware chatbot (GIA-CB). To assess its diagnostic capability, 50 cases on a variety of abdominal pathologies were created, comprising radiological findings in fluoroscopy, MRI, and CT. We compared the GIA-CB to the generic GPT-4 chatbot (g-CB) in providing the primary and 2 additional differential diagnoses, using interpretations from senior-level radiologists as ground truth. The trustworthiness of the GIA-CB was evaluated by investigating the source documents as provided by the knowledge-retrieval mechanism. Mann-Whitney U test was employed. RESULTS The GIA-CB demonstrated a high capability to identify the most appropriate differential diagnosis in 39/50 cases (78%), significantly surpassing the g-CB in 27/50 cases (54%) (p = 0.006). Notably, the GIA-CB offered the primary differential in the top 3 differential diagnoses in 45/50 cases (90%) versus g-CB with 37/50 cases (74%) (p = 0.022) and always with appropriate explanations. The median response time was 29.8 s for GIA-CB and 15.7 s for g-CB, and the mean cost per case was $0.15 and $0.02, respectively. CONCLUSIONS The GIA-CB not only provided an accurate diagnosis for gastrointestinal pathologies, but also direct access to source documents, providing insight into the decision-making process, a step towards trustworthy and explainable AI. Integrating context-specific data into AI models can support evidence-based clinical decision-making. RELEVANCE STATEMENT A context-aware GPT-4 chatbot demonstrates high accuracy in providing differential diagnoses based on imaging descriptions, surpassing the generic GPT-4. It provided formulated rationale and source excerpts supporting the diagnoses, thus enhancing trustworthy decision-support. KEY POINTS • Knowledge retrieval enhances differential diagnoses in a gastrointestinal imaging-aware chatbot (GIA-CB). • GIA-CB outperformed the generic counterpart, providing formulated rationale and source excerpts. • GIA-CB has the potential to pave the way for AI-assisted decision support systems.
Collapse
|
2
|
Multicentric development and validation of a multi-scale and multi-task deep learning model for comprehensive lower extremity alignment analysis. Artif Intell Med 2024; 150:102843. [PMID: 38553152 DOI: 10.1016/j.artmed.2024.102843] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 03/11/2024] [Accepted: 03/11/2024] [Indexed: 04/02/2024]
Abstract
Osteoarthritis of the knee, a widespread cause of knee disability, is commonly treated in orthopedics due to its rising prevalence. Lower extremity misalignment, pivotal in knee injury etiology and management, necessitates comprehensive mechanical alignment evaluation via frequently-requested weight-bearing long leg radiographs (LLR). Despite LLR's routine use, current analysis techniques are error-prone and time-consuming. To address this, we conducted a multicentric study to develop and validate a deep learning (DL) model for fully automated leg alignment assessment on anterior-posterior LLR, targeting enhanced reliability and efficiency. The DL model, developed using 594 patients' LLR and a 60%/10%/30% data split for training, validation, and testing, executed alignment analyses via a multi-step process, employing a detection network and nine specialized networks. It was designed to assess all vital anatomical and mechanical parameters for standard clinical leg deformity analysis and preoperative planning. Accuracy, reliability, and assessment duration were compared with three specialized orthopedic surgeons across two distinct institutional datasets (136 and 143 radiographs). The algorithm exhibited equivalent performance to the surgeons in terms of alignment accuracy (DL: 0.21 ± 0.18°to 1.06 ± 1.3°vs. OS: 0.21 ± 0.16°to 1.72 ± 1.96°), interrater reliability (ICC DL: 0.90 ± 0.05 to 1.0 ± 0.0 vs. ICC OS: 0.90 ± 0.03 to 1.0 ± 0.0), and clinically acceptable accuracy (DL: 53.9%-100% vs OS 30.8%-100%). Further, automated analysis significantly reduced analysis time compared to manual annotation (DL: 22 ± 0.6 s vs. OS; 101.7 ± 7 s, p ≤ 0.01). By demonstrating that our algorithm not only matches the precision of expert surgeons but also significantly outpaces them in both speed and consistency of measurements, our research underscores a pivotal advancement in harnessing AI to enhance clinical efficiency and decision-making in orthopaedics.
Collapse
|
3
|
A deep learning approach for projection and body-side classification in musculoskeletal radiographs. Eur Radiol Exp 2024; 8:23. [PMID: 38353812 PMCID: PMC10866807 DOI: 10.1186/s41747-023-00417-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 11/29/2023] [Indexed: 02/16/2024] Open
Abstract
BACKGROUND The growing prevalence of musculoskeletal diseases increases radiologic workload, highlighting the need for optimized workflow management and automated metadata classification systems. We developed a large-scale, well-characterized dataset of musculoskeletal radiographs and trained deep learning neural networks to classify radiographic projection and body side. METHODS In this IRB-approved retrospective single-center study, a dataset of musculoskeletal radiographs from 2011 to 2019 was retrieved and manually labeled for one of 45 possible radiographic projections and the depicted body side. Two classification networks were trained for the respective tasks using the Xception architecture with a custom network top and pretrained weights. Performance was evaluated on a hold-out test sample, and gradient-weighted class activation mapping (Grad-CAM) heatmaps were computed to visualize the influential image regions for network predictions. RESULTS A total of 13,098 studies comprising 23,663 radiographs were included with a patient-level dataset split, resulting in 19,183 training, 2,145 validation, and 2,335 test images. Focusing on paired body regions, training for side detection included 16,319 radiographs (13,284 training, 1,443 validation, and 1,592 test images). The models achieved an overall accuracy of 0.975 for projection and 0.976 for body-side classification on the respective hold-out test sample. Errors were primarily observed in projections with seamless anatomical transitions or non-orthograde adjustment techniques. CONCLUSIONS The deep learning neural networks demonstrated excellent performance in classifying radiographic projection and body side across a wide range of musculoskeletal radiographs. These networks have the potential to serve as presorting algorithms, optimizing radiologic workflow and enhancing patient care. RELEVANCE STATEMENT The developed networks excel at classifying musculoskeletal radiographs, providing valuable tools for research data extraction, standardized image sorting, and minimizing misclassifications in artificial intelligence systems, ultimately enhancing radiology workflow efficiency and patient care. KEY POINTS • A large-scale, well-characterized dataset was developed, covering a broad spectrum of musculoskeletal radiographs. • Deep learning neural networks achieved high accuracy in classifying radiographic projection and body side. • Grad-CAM heatmaps provided insight into network decisions, contributing to their interpretability and trustworthiness. • The trained models can help optimize radiologic workflow and manage large amounts of data.
Collapse
|
4
|
Automated image quality assessment for selecting among multiple magnetic resonance image acquisitions in the German National Cohort study. Sci Rep 2023; 13:22745. [PMID: 38123791 PMCID: PMC10733361 DOI: 10.1038/s41598-023-49569-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 12/09/2023] [Indexed: 12/23/2023] Open
Abstract
In magnetic resonance imaging (MRI), the perception of substandard image quality may prompt repetition of the respective image acquisition protocol. Subsequently selecting the preferred high-quality image data from a series of acquisitions can be challenging. An automated workflow may facilitate and improve this selection. We therefore aimed to investigate the applicability of an automated image quality assessment for the prediction of the subjectively preferred image acquisition. Our analysis included data from 11,347 participants with whole-body MRI examinations performed as part of the ongoing prospective multi-center German National Cohort (NAKO) study. Trained radiologic technologists repeated any of the twelve examination protocols due to induced setup errors and/or subjectively unsatisfactory image quality and chose a preferred acquisition from the resultant series. Up to 11 quantitative image quality parameters were automatically derived from all acquisitions. Regularized regression and standard estimates of diagnostic accuracy were calculated. Controlling for setup variations in 2342 series of two or more acquisitions, technologists preferred the repetition over the initial acquisition in 1116 of 1396 series in which the initial setup was retained (79.9%, range across protocols: 73-100%). Image quality parameters then commonly showed statistically significant differences between chosen and discarded acquisitions. In regularized regression across all protocols, 'structured noise maximum' was the strongest predictor for the technologists' choice, followed by 'N/2 ghosting average'. Combinations of the automatically derived parameters provided an area under the ROC curve between 0.51 and 0.74 for the prediction of the technologists' choice. It is concluded that automated image quality assessment can, despite considerable performance differences between protocols and anatomical regions, contribute substantially to identifying the subjective preference in a series of MRI acquisitions and thus provide effective decision support to readers.
Collapse
|
5
|
Pilot study on high-resolution radiological methods for the analysis of cerebrospinal fluid (CSF) shunt valves. Z Med Phys 2023:S0939-3889(23)00146-0. [PMID: 38104007 DOI: 10.1016/j.zemedi.2023.11.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 11/23/2023] [Accepted: 11/24/2023] [Indexed: 12/19/2023]
Abstract
OBJECTIVES Despite their life-saving capabilities, cerebrospinal fluid (CSF) shunts exhibit high failure rates, with a large fraction of failures attributed to the regulating valve. Due to a lack of methods for the detailed analysis of valve malfunctions, failure mechanisms are not well understood, and valves often have to be surgically explanted on the mere suspicion of malfunction. The presented pilot study aims to demonstrate radiological methods for comprehensive analysis of CSF shunt valves, considering both the potential for failure analysis in design optimization, and for future clinical in-vivo application to reduce the number of required shunt revision surgeries. The proposed method could also be utilized to develop and support in situ repair methods (e.g. by lysis or ultrasound) of malfunctioning CSF shunt valves. MATERIALS AND METHODS The primary methods described are contrast-enhanced radiographic time series of CSF shunt valves, taken in a favorable projection geometry at low radiation dose, and the machine-learning-based diagnosis of CSF shunt valve obstructions. Complimentarily, we investigate CT-based methods capable of providing accurate ground truth for the training of such diagnostic tools. Using simulated test and training data, the performance of the machine-learning diagnostics in identifying and localizing obstructions within a shunt valve is evaluated regarding per-pixel sensitivity and specificity, the Dice similarity coefficient, and the false positive rate in the case of obstruction free test samples. RESULTS Contrast enhanced subtraction radiography allows high-resolution, time-resolved, low-dose analysis of fluid transport in CSF shunt valves. Complementarily, photon-counting micro-CT allows to investigate valve obstruction mechanisms in detail, and to generate valid ground truth for machine learning-based diagnostics. Machine-learning-based detection of valve obstructions in simulated radiographies shows promising results, with a per-pixel sensitivity >70%, per-pixel specificity >90%, a median Dice coefficient >0.8 and <10% false positives at a detection threshold of 0.5. CONCLUSIONS This ex-vivo study demonstrates obstruction detection in cerebro-spinal fluid shunt valves, combining radiological methods with machine learning under conditions compatible to future in-vivo application. Results indicate that high-resolution contrast-enhanced subtraction radiography, possibly including time-series data, combined with machine-learning image analysis, has the potential to strongly improve the diagnostics of CSF shunt valve failures. The presented method is in principle suitable for in-vivo application, considering both measurement geometry and radiological dose. Further research is needed to validate these results on real-world data and to refine the employed methods. In combination, the presented methods enable comprehensive analysis of valve failure mechanisms, paving the way for improved product development and clinical diagnostics of CSF shunt valves.
Collapse
|
6
|
Deep learning segmentation results in precise delineation of the putamen in multiple system atrophy. Eur Radiol 2023; 33:7160-7167. [PMID: 37121929 PMCID: PMC10511621 DOI: 10.1007/s00330-023-09665-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 03/05/2023] [Accepted: 03/10/2023] [Indexed: 05/02/2023]
Abstract
OBJECTIVES The precise segmentation of atrophic structures remains challenging in neurodegenerative diseases. We determined the performance of a Deep Neural Patchwork (DNP) in comparison to established segmentation algorithms regarding the ability to delineate the putamen in multiple system atrophy (MSA), Parkinson's disease (PD), and healthy controls. METHODS We retrospectively included patients with MSA and PD as well as healthy controls. A DNP was trained on manual segmentations of the putamen as ground truth. For this, the cohort was randomly split into a training (N = 131) and test set (N = 120). The DNP's performance was compared with putaminal segmentations as derived by Automatic Anatomic Labelling, Freesurfer and Fastsurfer. For validation, we assessed the diagnostic accuracy of the resulting segmentations in the delineation of MSA vs. PD and healthy controls. RESULTS A total of 251 subjects (61 patients with MSA, 158 patients with PD, and 32 healthy controls; mean age of 61.5 ± 8.8 years) were included. Compared to the dice-coefficient of the DNP (0.96), we noted significantly weaker performance for AAL3 (0.72; p < .001), Freesurfer (0.82; p < .001), and Fastsurfer (0.84, p < .001). This was corroborated by the superior diagnostic performance of MSA vs. PD and HC of the DNP (AUC 0.93) versus the AUC of 0.88 for AAL3 (p = 0.02), 0.86 for Freesurfer (p = 0.048), and 0.85 for Fastsurfer (p = 0.04). CONCLUSION By utilization of a DNP, accurate segmentations of the putamen can be obtained even if substantial atrophy is present. This allows for more precise extraction of imaging parameters or shape features from the putamen in relevant patient cohorts. CLINICAL RELEVANCE STATEMENT Deep learning-based segmentation of the putamen was superior to currently available algorithms and is beneficial for the diagnosis of multiple system atrophy. KEY POINTS • A Deep Neural Patchwork precisely delineates the putamen and performs equal to human labeling in multiple system atrophy, even when pronounced putaminal volume loss is present. • The Deep Neural Patchwork-based segmentation was more capable to differentiate between multiple system atrophy and Parkinson's disease than the AAL3 atlas, Freesurfer, or Fastsurfer.
Collapse
|
7
|
Performance of ChatGPT, human radiologists, and context-aware ChatGPT in identifying AO codes from radiology reports. Sci Rep 2023; 13:14215. [PMID: 37648742 PMCID: PMC10468502 DOI: 10.1038/s41598-023-41512-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 08/28/2023] [Indexed: 09/01/2023] Open
Abstract
While radiologists can describe a fracture's morphology and complexity with ease, the translation into classification systems such as the Arbeitsgemeinschaft Osteosynthesefragen (AO) Fracture and Dislocation Classification Compendium is more challenging. We tested the performance of generic chatbots and chatbots aware of specific knowledge of the AO classification provided by a vector-index and compared it to human readers. In the 100 radiological reports we created based on random AO codes, chatbots provided AO codes significantly faster than humans (mean 3.2 s per case vs. 50 s per case, p < .001) though not reaching human performance (max. chatbot performance of 86% correct full AO codes vs. 95% in human readers). In general, chatbots based on GPT 4 outperformed the ones based on GPT 3.5-Turbo. Further, we found that providing specific knowledge substantially enhances the chatbot's performance and consistency as the context-aware chatbot based on GPT 4 provided 71% consistent correct full AO codes for the compared to the 2% consistent correct full AO codes for the generic ChatGPT 4. This provides evidence, that refining and providing specific context to ChatGPT will be the next essential step in harnessing its power.
Collapse
|
8
|
A Context-based Chatbot Surpasses Trained Radiologists and Generic ChatGPT in Following the ACR Appropriateness Guidelines. Radiology 2023; 308:e230970. [PMID: 37489981 DOI: 10.1148/radiol.230970] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/26/2023]
Abstract
Background Radiological imaging guidelines are crucial for accurate diagnosis and optimal patient care as they result in standardized decisions and thus reduce inappropriate imaging studies. Purpose In the present study, we investigated the potential to support clinical decision-making using an interactive chatbot designed to provide personalized imaging recommendations from American College of Radiology (ACR) appropriateness criteria documents using semantic similarity processing. Methods We utilized 209 ACR appropriateness criteria documents as specialized knowledge base and employed LlamaIndex, a framework that allows to connect large language models with external data, and the ChatGPT 3.5-Turbo to create an appropriateness criteria contexted chatbot (accGPT). Fifty clinical case files were used to compare the accGPT's performance against general radiologists at varying experience levels and to generic ChatGPT 3.5 and 4.0. Results All chatbots reached at least human performance level. For the 50 case files, the accGPT performed best in providing correct recommendations that were "usually appropriate" according to the ACR criteria and also did provide the highest proportion of consistently correct answers in comparison with generic chatbots and radiologists. Further, the chatbots provided substantial time and cost savings, with an average decision time of 5 minutes and a cost of 0.19 € for all cases, compared to 50 minutes and 29.99 € for radiologists (both p < 0.01). Conclusion ChatGPT-based algorithms have the potential to substantially improve the decision-making for clinical imaging studies in accordance with ACR guidelines. Specifically, a context-based algorithm performed superior to its generic counterpart, demonstrating the value of tailoring AI solutions to specific healthcare applications.
Collapse
|
9
|
Where Position Matters-Deep-Learning-Driven Normalization and Coregistration of Computed Tomography in the Postoperative Analysis of Deep Brain Stimulation. Neuromodulation 2023; 26:302-309. [PMID: 36424266 DOI: 10.1016/j.neurom.2022.10.042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Revised: 09/12/2022] [Accepted: 10/06/2022] [Indexed: 11/23/2022]
Abstract
INTRODUCTION Recent developments in the postoperative evaluation of deep brain stimulation surgery on the group level warrant the detection of achieved electrode positions based on postoperative imaging. Computed tomography (CT) is a frequently used imaging modality, but because of its idiosyncrasies (high spatial accuracy at low soft tissue resolution), it has not been sufficient for the parallel determination of electrode position and details of the surrounding brain anatomy (nuclei). The common solution is rigid fusion of CT images and magnetic resonance (MR) images, which have much better soft tissue contrast and allow accurate normalization into template spaces. Here, we explored a deep-learning approach to directly relate positions (usually the lead position) in postoperative CT images to the native anatomy of the midbrain and group space. MATERIALS AND METHODS Deep learning is used to create derived tissue contrasts (white matter, gray matter, cerebrospinal fluid, brainstem nuclei) based on the CT image; that is, a convolution neural network (CNN) takes solely the raw CT image as input and outputs several tissue probability maps. The ground truth is based on coregistrations with MR contrasts. The tissue probability maps are then used to either rigidly coregister or normalize the CT image in a deformable way to group space. The CNN was trained in 220 patients and tested in a set of 80 patients. RESULTS Rigorous validation of such an approach is difficult because of the lack of ground truth. We examined the agreements between the classical and proposed approaches and considered the spread of implantation locations across a group of identically implanted subjects, which serves as an indicator of the accuracy of the lead localization procedure. The proposed procedure agrees well with current magnetic resonance imaging-based techniques, and the spread is comparable or even lower. CONCLUSIONS Postoperative CT imaging alone is sufficient for accurate localization of the midbrain nuclei and normalization to the group space. In the context of group analysis, it seems sufficient to have a single postoperative CT image of good quality for inclusion. The proposed approach will allow researchers and clinicians to include cases that were not previously suitable for analysis.
Collapse
|
10
|
Analysis of the accuracy of computer‐assisted
DCIA
flap mandibular reconstruction applying a novel approach based on geometric morphometrics. Head Neck 2022; 44:2810-2819. [DOI: 10.1002/hed.27196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Revised: 07/27/2022] [Accepted: 08/31/2022] [Indexed: 11/06/2022] Open
|
11
|
Development and evaluation of machine learning models based on X-ray radiomics for the classification and differentiation of malignant and benign bone tumors. Eur Radiol 2022; 32:6247-6257. [PMID: 35396665 PMCID: PMC9381439 DOI: 10.1007/s00330-022-08764-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Revised: 01/03/2022] [Accepted: 02/17/2022] [Indexed: 01/19/2023]
Abstract
OBJECTIVES To develop and validate machine learning models to distinguish between benign and malignant bone lesions and compare the performance to radiologists. METHODS In 880 patients (age 33.1 ± 19.4 years, 395 women) diagnosed with malignant (n = 213, 24.2%) or benign (n = 667, 75.8%) primary bone tumors, preoperative radiographs were obtained, and the diagnosis was established using histopathology. Data was split 70%/15%/15% for training, validation, and internal testing. Additionally, 96 patients from another institution were obtained for external testing. Machine learning models were developed and validated using radiomic features and demographic information. The performance of each model was evaluated on the test sets for accuracy, area under the curve (AUC) from receiver operating characteristics, sensitivity, and specificity. For comparison, the external test set was evaluated by two radiology residents and two radiologists who specialized in musculoskeletal tumor imaging. RESULTS The best machine learning model was based on an artificial neural network (ANN) combining both radiomic and demographic information achieving 80% and 75% accuracy at 75% and 90% sensitivity with 0.79 and 0.90 AUC on the internal and external test set, respectively. In comparison, the radiology residents achieved 71% and 65% accuracy at 61% and 35% sensitivity while the radiologists specialized in musculoskeletal tumor imaging achieved an 84% and 83% accuracy at 90% and 81% sensitivity, respectively. CONCLUSIONS An ANN combining radiomic features and demographic information showed the best performance in distinguishing between benign and malignant bone lesions. The model showed lower accuracy compared to specialized radiologists, while accuracy was higher or similar compared to residents. KEY POINTS • The developed machine learning model could differentiate benign from malignant bone tumors using radiography with an AUC of 0.90 on the external test set. • Machine learning models that used radiomic features or demographic information alone performed worse than those that used both radiomic features and demographic information as input, highlighting the importance of building comprehensive machine learning models. • An artificial neural network that combined both radiomic and demographic information achieved the best performance and its performance was compared to radiology readers on an external test set.
Collapse
|
12
|
Magnetic resonance angiography-derived flow parameters to assess thoracic aortic disease risk. Eur J Cardiothorac Surg 2021; 61:403-404. [PMID: 34893800 DOI: 10.1093/ejcts/ezab533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Accepted: 11/14/2021] [Indexed: 11/14/2022] Open
|
13
|
Multitask Deep Learning for Segmentation and Classification of Primary Bone Tumors on Radiographs. Radiology 2021; 301:398-406. [PMID: 34491126 DOI: 10.1148/radiol.2021204531] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Background An artificial intelligence model that assesses primary bone tumors on radiographs may assist in the diagnostic workflow. Purpose To develop a multitask deep learning (DL) model for simultaneous bounding box placement, segmentation, and classification of primary bone tumors on radiographs. Materials and Methods This retrospective study analyzed bone tumors on radiographs acquired prior to treatment and obtained from patient data from January 2000 to June 2020. Benign or malignant bone tumors were diagnosed in all patients by using the histopathologic findings as the reference standard. By using split-sample validation, 70% of the patients were assigned to the training set, 15% were assigned to the validation set, and 15% were assigned to the test set. The final performance was evaluated on an external test set by using geographic validation, with accuracy, sensitivity, specificity, and 95% CIs being used for classification, the intersection over union (IoU) being used for bounding box placements, and the Dice score being used for segmentations. Results Radiographs from 934 patients (mean age, 33 years ± 19 [standard deviation]; 419 women) were evaluated in the internal data set, which included 667 benign bone tumors and 267 malignant bone tumors. Six hundred fifty-four patients were in the training set, 140 were in the validation set, and 140 were in the test set. One hundred eleven patients were in the external test set. The multitask DL model achieved 80.2% (89 of 111; 95% CI: 72.8, 87.6) accuracy, 62.9% (22 of 35; 95% CI: 47, 79) sensitivity, and 88.2% (67 of 76; CI: 81, 96) specificity in the classification of bone tumors as malignant or benign. The model achieved an IoU of 0.52 ± 0.34 for bounding box placements and a mean Dice score of 0.60 ± 0.37 for segmentations. The model accuracy was higher than that of two radiologic residents (71.2% and 64.9%; P = .002 and P < .001, respectively) and was comparable with that of two musculoskeletal fellowship-trained radiologists (83.8% and 82.9%; P = .13 and P = .25, respectively) in classifying a tumor as malignant or benign. Conclusion The developed multitask deep learning model allowed for accurate and simultaneous bounding box placement, segmentation, and classification of primary bone tumors on radiographs. © RSNA, 2021 Online supplemental material is available for this article. See also the editorial by Carrino in this issue.
Collapse
|
14
|
A new look at the heart-novel imaging techniques. Herzschrittmacherther Elektrophysiol 2017; 29:14-23. [PMID: 29242981 DOI: 10.1007/s00399-017-0546-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Accepted: 11/24/2017] [Indexed: 01/20/2023]
Abstract
The development and successful implementation of cutting-edge imaging technologies to visualise cardiac anatomy and function is a key component of effective diagnostic efforts in cardiology. Here, we describe a number of recent exciting advances in the field of cardiology spanning from macro- to micro- to nano-scales of observation, including magnetic resonance imaging, computed tomography, optical mapping, photoacoustic imaging, and electron tomography. The methodologies discussed are currently making the transition from scientific research to routine clinical use, albeit at different paces. We discuss the most likely trajectory of this transition into clinical research and standard diagnostics, and highlight the key challenges and opportunities associated with each of the methodologies.
Collapse
|
15
|
Four-dimensional magnetic resonance imaging-derived ascending aortic flow eccentricity and flow compression are linked to aneurysm morphology†. Interact Cardiovasc Thorac Surg 2015; 20:582-7; discussion 587-8. [PMID: 25636325 DOI: 10.1093/icvts/ivu446] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2014] [Accepted: 11/20/2014] [Indexed: 11/12/2022] Open
Abstract
OBJECTIVES The impact of specific blood flow patterns within ascending aortic and/or aortic root aneurysms on aortic morphology is unknown. We investigated the interrelation of ascending aortic flow compression/peripheralization and aneurysm morphology with respect to sinotubuar junction (STJ) definition. METHODS Thirty-one patients (aortic root/ascending aortic aneurysm >45 mm) underwent flow-sensitive 4D magnetic resonance thoracic aortic flow measurement at 3 Tesla (Siemens, Germany) at two different institutions (Freiburg, Germany, and San Francisco, CA, USA). Time-resolved image data post-processing and visualization of mid-systolic, mid-ascending aortic flow were performed using local vector fields. The Flow Compression Index (FCI) was calculated individually as a fraction of the area of high-velocity mid-systolic flow over the complete cross-sectional ascending aortic area. According to aortic aneurysm morphology, patients were grouped as (i) small root, eccentric ascending aortic aneurysm (STJ definition) and (ii) enlarged aortic root, non-eccentric ascending aortic aneurysm with diffuse root and tubular enlargement. RESULTS The mean FCI over all patients was 0.47 ± 0.5 (0.37-0.99). High levels of flow compression/peripheralization (FCI <0.6) were linked to eccentric aneurysm morphology (Group A, n = 11), while low levels or absence of aortic flow compression/peripheralization (FCI >0.8) occurred more often in Group B (n = 20). The FCI was 0.48 ± 0.05 in Group A and 0.78 ± 0.14 in Group B (P < 0.001). Distribution of bicuspid aortic valve (P = 0.6) and type of valve dysfunction (P = 0.22 for aortic stenosis) was not found to be different between groups. CONCLUSIONS Irrespective of aortic valve morphology and function, ascending aortic blood flow patterns are linked to distinct patterns of ascending aortic aneurysm morphology. Implementation of quantitative local blood flow analyses might help to improve aneurysm risk stratification in the future.
Collapse
|
16
|
Reproducibility study of four-dimensional flow MRI of arterial and portal venous liver hemodynamics: influence of spatio-temporal resolution. Magn Reson Med 2013; 72:477-84. [PMID: 24018798 DOI: 10.1002/mrm.24939] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2013] [Revised: 08/01/2013] [Accepted: 08/08/2013] [Indexed: 12/22/2022]
Abstract
PURPOSE To evaluate influence of variation in spatio-temporal resolution and scan-rescan reproducibility on three-dimensional (3D) visualization and quantification of arterial and portal venous (PV) liver hemodynamics at four-dimensional (4D) flow MRI. METHODS Scan-rescan reproducibility of 3D hemodynamic analysis of the liver was evaluated in 10 healthy volunteers using 4D flow MRI at 3T with three different spatio-temporal resolutions (2.4 × 2.0 × 2.4 mm(3), 61.2 ms; 2.5 × 2.0 × 2.4 mm(3), 81.6 ms; 2.6 × 2.5 × 2.6 mm(3), 80 ms) and thus different total scan times. Qualitative flow analysis used 3D streamlines and time-resolved particle traces. Quantitative evaluation was based on maximum and mean velocities, flow volume, and vessel lumen area in the hepatic arterial and PV systems. RESULTS 4D flow MRI showed good interobserver variability for assessment of arterial and PV liver hemodynamics. 3D flow visualization revealed limitations for the left intrahepatic PV branch. Lower spatio-temporal resolution resulted in underestimation of arterial velocities (mean 15%, P < 0.05). For the PV system, hemodynamic analyses showed significant differences in the velocities for intrahepatic portal vein vessels (P < 0.05). Scan-rescan reproducibility was good except for flow volumes in the arterial system. CONCLUSION 4D flow MRI for assessment of liver hemodynamics can be performed with low interobserver variability and good reproducibility. Higher spatio-temporal resolution is necessary for complete assessment of the hepatic blood flow required for clinical applications.
Collapse
|
17
|
Mural thrombus in the chronically dissected thoracic Marfan's aorta – impact on reoperation and distal aortic size. Thorac Cardiovasc Surg 2013. [DOI: 10.1055/s-0032-1332545] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
18
|
Secondary interventions and conventional surgical procedures on the descending thoracic aorta in the Marfan-Syndrome. Thorac Cardiovasc Surg 2013. [DOI: 10.1055/s-0032-1332532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
19
|
Late complications and distal growth rates of Marfan aortas after proximal aortic repair†. Eur J Cardiothorac Surg 2013; 44:163-71. [DOI: 10.1093/ejcts/ezs674] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
20
|
Interdependencies of aortic arch secondary flow patterns, geometry, and age analysed by 4-dimensional phase contrast magnetic resonance imaging at 3 Tesla. Eur Radiol 2011; 22:1122-30. [PMID: 22207269 DOI: 10.1007/s00330-011-2353-6] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2011] [Revised: 10/25/2011] [Accepted: 10/29/2011] [Indexed: 10/14/2022]
Abstract
OBJECTIVE It was the aim to analyse the impact of age, aortic arch geometry, and size on secondary flow patterns such as helix and vortex flow derived from flow-sensitive magnetic resonance imaging (4D PC-MRI). METHODS 62 subjects (age range = 20-80 years) without circumscribed pathologies of the thoracic aorta (ascending aortic (AAo) diameter: 3.2 ± 0.6 cm [range 2.2-5.1]) were examined by 4D PC-MRI after IRB-approval and written informed consent. Blood flow visualisation based on streamlines and time-resolved 3D particle traces was performed. Aortic diameter, shape (gothic, crook-shaped, cubic), angle, and age were correlated with existence and extent of secondary flow patterns (helicity, vortices); statistical modelling was performed. RESULTS Helical flow was the typical pattern in standard crook-shaped aortic arches. With altered shapes and increasing age, helicity was less common. AAo diameter and age had the highest correlation (r = 0.69 and 0.68, respectively) with number of detected vortices. None of the other arch geometric or demographic variables (for all, P ≥ 0.177) improved statistical modelling. CONCLUSION Substantially different secondary flow patterns can be observed in the normal thoracic aorta. Age and the AAo diameter were the parameters correlating best with presence and amount of vortices. Findings underline the importance of age- and geometry-matched control groups for haemodynamic studies. KEY POINTS • Secondary blood flow patterns (helices, vortices) are commonly observed in the aorta • Secondary flow patterns predominantly depend on patient age and aortic diameter • Geometric factors show a lesser impact on blood flow patterns than age and diameter • Future analyses of flow patterns should incorporate age- and diameter dependencies.
Collapse
|
21
|
Three-dimensional analysis of segmental wall shear stress in the aorta by flow-sensitive four-dimensional-MRI. J Magn Reson Imaging 2009; 30:77-84. [PMID: 19557849 DOI: 10.1002/jmri.21790] [Citation(s) in RCA: 121] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
PURPOSE To assess the distribution and regional differences of flow and vessel wall parameters such as wall shear stress (WSS) and oscillatory shear index (OSI) in the entire thoracic aorta. MATERIALS AND METHODS Thirty-one healthy volunteers (mean age = 23.7 +/- 3.3 years) were examined by flow-sensitive four-dimensional (4D)-MRI at 3T. For eight retrospectively positioned 2D analysis planes distributed along the thoracic aorta, flow parameters and vectorial WSS and OSI were assessed in 12 segments along the vascular circumference. RESULTS Mean absolute time-averaged WSS ranged between 0.25 +/- 0.04 N/m(2) and 0.33 +/- 0.07 N/m(2) and incorporated a substantial circumferential component (-0.05 +/- 0.04 to 0.07 +/- 0.02 N/m(2)). For each analysis plane, a segment with lowest absolute WSS and highest OSI was identified which differed significantly from mean values within the plane (P < 0.05). The distribution of atherogenic low WSS and high OSI closely resembled typical locations of atherosclerotic lesions at the inner aortic curvature and supraaortic branches. CONCLUSION The normal distribution of vectorial WSS and OSI in the entire thoracic aorta derived from flow-sensitive 4D-MRI data provides a reference constituting an important perquisite for the examination of patients with aortic disease. Marked regional differences in absolute WSS and OSI may help explaining why atherosclerotic lesions predominantly develop and progress at specific locations in the aorta.
Collapse
|
22
|
Durchführbakeit der in-vivo Ganzherz-Flussmessung mit fluss-sensitiver 4D-MRT an 3Tesla. ROFO-FORTSCHR RONTG 2009. [DOI: 10.1055/s-0029-1221573] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
23
|
Quantitative 2D and 3D phase contrast MRI: optimized analysis of blood flow and vessel wall parameters. Magn Reson Med 2009; 60:1218-31. [PMID: 18956416 DOI: 10.1002/mrm.21778] [Citation(s) in RCA: 341] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Quantification of CINE phase contrast (PC)-MRI data is a challenging task because of the limited spatiotemporal resolution and signal-to-noise ratio (SNR). The method presented in this work combines B-spline interpolation and Green's theorem to provide optimized quantification of blood flow and vessel wall parameters. The B-spline model provided optimal derivatives of the measured three-directional blood velocities onto the vessel contour, as required for vectorial wall shear stress (WSS) computation. Eight planes distributed along the entire thoracic aorta were evaluated in a 19-volunteer study using both high-spatiotemporal-resolution planar two-dimensional (2D)-CINE-PC ( approximately 1.4 x 1.4 mm(2)/24.4 ms) and lower-resolution 3D-CINE-PC ( approximately 2.8 x 1.6 x 3 mm(3)/48.6 ms) with three-directional velocity encoding. Synthetic data, error propagation, and interindividual, intermodality, and interobserver variability were used to evaluate the reliability and reproducibility of the method. While the impact of MR measurement noise was only minor, the limited resolution of PC-MRI introduced systematic WSS underestimations. In vivo data demonstrated close agreement for flow and WSS between 2D- and 3D-CINE-PC as well as observers, and confirmed the reliability of the method. WSS analysis along the aorta revealed the presence of a circumferential WSS component accounting for 10-20%. Initial results in a patient with atherosclerosis suggest the potential of the method for understanding the formation and progression of cardiovascular diseases.
Collapse
|
24
|
Analyse der Durchmesser-Abhängigkeit aortalen Hämodynamik mittels fluss-sensitiver 4D-MRT bei 3 Tesla. ROFO-FORTSCHR RONTG 2008. [DOI: 10.1055/s-2008-1073809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
25
|
Die Analyse aortaler Hämodynamik und Gefäßwandparameter mittels fluss-sensitiver in-vivo 4D-MRT bei 3 Tesla. ROFO-FORTSCHR RONTG 2007; 179:463-72. [PMID: 17436180 DOI: 10.1055/s-2007-962941] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Modern phase contrast MR imaging at 3 Tesla allows the depiction of 3D morphology as well as the acquisition of time-resolved blood flow velocities in 3 directions. In combination with state-of-the-art visualization and data processing software, the qualitative and quantitative analysis of hemodynamic changes associated with vascular pathologies is possible. The 4D nature of the acquired data permits free orientation within the vascular system of interest and offers the opportunity to quantify blood flow and derived vessel wall parameters at any desired location within the data volume without being dependent on predefined 2D slices. The technique has the potential of overcoming the limitations of current diagnostic strategies and of implementing new diagnostic parameters. In light of the recent discussions regarding the influence of the wall shear stress and the oscillatory shear index on the genesis of arteriosclerosis and dilatative vascular processes, flow-sensitive 4D MRI may provide the missing diagnostic link. Instead of relying on experience-based parameters such as aneurysm size, new hemodynamic considerations can deepen our understanding of vascular pathologies. This overview reviews the underlying methodology at 3T, the literature on time-resolved 3D MR velocity mapping, and presents case examples. By presenting the pre- and postoperative assessment of hemodynamics in a thoracic aortic aneurysm and the detailed analysis of blood flow in a patient with coarctation we underline the potential of time-resolved 3D phase contrast MR at 3T for hemodynamic assessment of vascular pathologies, especially in the thoracic aorta.
Collapse
|