151. Wu Y 吴奕忱, Li S 李晟. Complexity Matters: Normalization to Prototypical Viewpoint Induces Memory Distortion along the Vertical Axis of Scenes. J Neurosci 2024; 44:e1175232024. PMID: 38777600; PMCID: PMC11223457; DOI: 10.1523/jneurosci.1175-23.2024. Received 26 Jun 2023; revised 24 Apr 2024; accepted 13 May 2024. Open access.
Abstract
Scene memory is prone to systematic distortions that potentially arise from experience with the external world. Boundary transformation, a well-known memory distortion effect along the near-far axis of three-dimensional space, represents the observer's erroneous recall of a scene's viewing distance. Researchers have argued that normalization to a prototypical viewpoint with a high-probability viewing distance drives this phenomenon. Here, we hypothesized that a prototypical viewpoint also exists in the vertical angle of view (AOV) dimension and could cause memory distortion along a scene's vertical axis. Two behavioral experiments with human subjects of both sexes revealed a systematic memory distortion in the vertical AOV in both forced-choice (n = 79) and free-adjustment (n = 30) tasks. Regression analysis implied that the complexity-information asymmetry along a scene's vertical axis and independent subjective AOV ratings from a large set of online participants (n = 1,208) could jointly predict AOV biases. In a functional magnetic resonance imaging experiment (n = 24), we demonstrated the involvement of areas in the ventral visual pathway (V3/V4, PPA, and OPA) in AOV bias judgment. Additionally, in a magnetoencephalography experiment (n = 20), we could significantly decode the subjects' AOV bias judgments ∼140 ms after scene onset, and the low-level visual complexity information around a similar temporal interval. These findings suggest that the AOV bias is driven by a normalization process and is associated with neural activity in the early stage of scene processing.
Affiliation(s)
- Yichen Wu 吴奕忱
- School of Psychological and Cognitive Sciences, Peking University, Beijing 100871, China
- Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing 100871, China
- PKU-IDG/McGovern Institute for Brain Research, Peking University, Beijing 100871, China
- National Key Laboratory of General Artificial Intelligence, Peking University, Beijing 100871, China
- Sheng Li 李晟
- School of Psychological and Cognitive Sciences, Peking University, Beijing 100871, China
- Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing 100871, China
- PKU-IDG/McGovern Institute for Brain Research, Peking University, Beijing 100871, China
- National Key Laboratory of General Artificial Intelligence, Peking University, Beijing 100871, China
152. Yang W, Qiu H, Luo X, Xie S. SGK-Net: A Novel Navigation Scene Graph Generation Network. Sensors (Basel) 2024; 24:4329. PMID: 39001108; PMCID: PMC11244408; DOI: 10.3390/s24134329. Received 24 Apr 2024; revised 26 Jun 2024; accepted 2 Jul 2024.
Abstract
Scene graphs can enhance the understanding capability of intelligent ships in navigation scenes. However, the complex entity relationships and the significant noise in contextual information within navigation scenes pose challenges for navigation scene graph generation (NSGG). To address these issues, this paper proposes a novel NSGG network named SGK-Net. The network comprises three innovative modules. The Semantic-Guided Multimodal Fusion (SGMF) module uses prior information on relationship semantics to fuse multimodal information and construct relationship features, thereby elucidating the relationships between entities and reducing the semantic ambiguity caused by complex relationships. The Graph Structure Learning-based Structure Evolution (GSLSE) module, based on graph structure learning, reduces redundancy in relationship features and optimizes the computational complexity of subsequent contextual message passing. The Key Entity Message Passing (KEMP) module takes full advantage of contextual information to refine relationship features, thereby reducing noise interference from non-key nodes. Furthermore, this paper constructs the first Ship Navigation Scene Graph Simulation dataset, named SNSG-Sim, which provides a foundational dataset for research on ship navigation SGG. Experimental results on the SNSG-Sim dataset demonstrate that our method achieves improvements of 8.31% (R@50) in the PredCls task and 7.94% (R@50) in the SGCls task over the baseline method, validating its effectiveness in navigation scene graph generation.
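The key-entity idea behind KEMP can be illustrated with a toy message-passing loop in which only messages originating from designated key nodes update a node's feature. The graph, scalar features, and `alpha` blending factor below are invented for illustration and are not the paper's actual formulation:

```python
def key_entity_message_passing(features, edges, key_nodes, rounds=2, alpha=0.5):
    # features: node -> scalar feature (scalars stand in for feature vectors)
    # edges: (src, dst) pairs; only messages whose source is a key node are
    # aggregated, mirroring the suppression of noise from non-key nodes.
    feats = dict(features)
    for _ in range(rounds):
        updated = dict(feats)
        for node in feats:
            msgs = [feats[s] for s, d in edges if d == node and s in key_nodes]
            if msgs:
                # Blend the old feature with the mean incoming key-node message.
                updated[node] = (1 - alpha) * feats[node] + alpha * sum(msgs) / len(msgs)
        feats = updated
    return feats

# "noise" sends a message to "buoy" but is not a key node, so it is ignored.
feats = key_entity_message_passing(
    {"ship": 1.0, "buoy": 0.0, "noise": 10.0},
    edges=[("ship", "buoy"), ("noise", "buoy")],
    key_nodes={"ship"},
)
```

After two rounds, `buoy` has been pulled toward the key node's feature while the non-key `noise` node has had no influence on it.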
Affiliation(s)
- Wenbin Yang
- School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
- Hao Qiu
- School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
- Xiangfeng Luo
- School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
- Shaorong Xie
- School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
153. Zuo X, Li HY, Gao S, Zhang P, Du WR. NALA: a Nesterov accelerated look-ahead optimizer for deep learning. PeerJ Comput Sci 2024; 10:e2167. PMID: 38983239; PMCID: PMC11232586; DOI: 10.7717/peerj-cs.2167. Received 28 Nov 2023; accepted 9 Jun 2024.
Abstract
Adaptive gradient algorithms have been used successfully in deep learning. Previous work reveals that adaptive gradient algorithms mainly borrow the moving-average idea of heavy-ball acceleration to estimate the first- and second-order moments of the gradient for accelerating convergence. However, Nesterov acceleration, which uses the gradient at an extrapolation point, can in theory achieve a faster convergence rate than heavy-ball acceleration. In this article, a new optimization algorithm for deep learning, called NALA, is proposed; it combines an adaptive gradient algorithm with Nesterov acceleration through a look-ahead scheme. NALA iteratively updates two sets of weights: the 'fast weights' in its inner loop and the 'slow weights' in its outer loop. Concretely, NALA first updates the fast weights k times using the Adam optimizer in the inner loop, and then updates the slow weights once in the direction of Nesterov's accelerated gradient (NAG) in the outer loop. We compare NALA with several popular optimization algorithms on a range of image classification tasks on public datasets. The experimental results show that NALA achieves faster convergence and higher accuracy than the other popular optimization algorithms.
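The look-ahead scheme described in the abstract can be sketched in plain Python on a scalar problem. The Adam inner loop is standard; the outer update here is a plain interpolation toward the fast weights, standing in for the paper's Nesterov-direction step, whose exact form the abstract does not give:

```python
import math

def adam_step(w, grad, m, v, t, lr=0.1, b1=0.9, b2=0.999, eps=1e-8):
    # One Adam update on a scalar weight (bias-corrected moments).
    m = b1 * m + (1 - b1) * grad
    v = b2 * v + (1 - b2) * grad * grad
    m_hat = m / (1 - b1 ** t)
    v_hat = v / (1 - b2 ** t)
    return w - lr * m_hat / (math.sqrt(v_hat) + eps), m, v

def lookahead_adam(grad_fn, w0, outer_steps=60, k=5, alpha=0.5):
    # k fast (Adam) steps in the inner loop, then one slow-weight update
    # in the outer loop.  NALA replaces this plain interpolation with an
    # update in the NAG direction.
    slow, m, v, t = w0, 0.0, 0.0, 0
    for _ in range(outer_steps):
        fast = slow
        for _ in range(k):
            t += 1
            fast, m, v = adam_step(fast, grad_fn(fast), m, v, t)
        slow = slow + alpha * (fast - slow)
    return slow

# Minimize f(w) = (w - 3)^2; its gradient is 2 * (w - 3).
w_star = lookahead_adam(lambda w: 2.0 * (w - 3.0), w0=0.0)
```

On this toy quadratic the slow weights settle close to the minimizer at w = 3.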
Affiliation(s)
- Xuan Zuo
- School of Automation, Northwestern Polytechnical University, Xi’an, Shaanxi, China
- Hui-Yan Li
- China Academy of Aerospace Systems Science and Engineering, Beijing, China
- Shan Gao
- School of Automation, Northwestern Polytechnical University, Xi’an, Shaanxi, China
- Pu Zhang
- School of Automation and Information Engineering, Xi’an University of Technology, Xi’an, Shaanxi, China
- Wan-Ru Du
- China Academy of Aerospace Systems Science and Engineering, Beijing, China
154
|
Malinverni ES, Abate D, Agapiou A, Stefano FD, Felicetti A, Paolanti M, Pierdicca R, Zingaretti P. SIGNIFICANCE deep learning based platform to fight illicit trafficking of Cultural Heritage goods. Sci Rep 2024; 14:15081. [PMID: 38956250 PMCID: PMC11219783 DOI: 10.1038/s41598-024-65885-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 06/25/2024] [Indexed: 07/04/2024] Open
Abstract
The illicit traffic of cultural goods remains a persistent global challenge, despite the proliferation of comprehensive legislative frameworks developed to address and prevent cultural property crimes. Online platforms, especially social media and e-commerce, have facilitated illegal trade and pose significant challenges for law enforcement agencies. To address this issue, the European project SIGNIFICANCE was launched with the aim of combating illicit traffic of Cultural Heritage (CH) goods. This paper presents the outcomes of the project, introducing a user-friendly platform that employs Artificial Intelligence (AI) and Deep Learning (DL) to prevent and combat illicit activities. The platform enables authorities to identify, track, and block illegal activities in the online domain, thereby aiding successful prosecutions of criminal networks. Moreover, it incorporates an ontology-based approach, providing comprehensive information on the cultural significance, provenance, and legal status of identified artefacts. This enables users to access valuable contextual information during the scraping and classification phases, facilitating informed decision-making and targeted actions. To accomplish these objectives, computationally intensive tasks are executed on the HPC CyClone infrastructure, optimizing computing resources, time, and cost efficiency. Notably, the infrastructure supports algorithm modelling and training, as well as web, dark web, and social media scraping and data classification. Preliminary results indicate a 10-15% increase in the identification of illicit artefacts, demonstrating the platform's effectiveness in enhancing law enforcement capabilities.
Affiliation(s)
- Eva Savina Malinverni
- Dipartimento di Ingegneria Civile, Edile e dell'Architettura (DICEA), Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy
- Dante Abate
- Eratosthenes Center of Excellence, Limassol, 3012, Cyprus
- Antonia Agapiou
- The Cyprus Institute (CyI), Athalassa Campus, Nicosia, Cyprus
- Francesco Di Stefano
- Dipartimento di Ingegneria Civile, Edile e dell'Architettura (DICEA), Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy
- Andrea Felicetti
- VRAI - Vision Robotics and Artificial Intelligence Lab, Dipartimento di Ingegneria dell'Informazione (DII), Università Politecnica delle Marche, 60131, Ancona, Italy
- Marina Paolanti
- Department of Political Sciences, Communication and International Relations, University of Macerata, 62100, Macerata, Italy
- Roberto Pierdicca
- Dipartimento di Ingegneria Civile, Edile e dell'Architettura (DICEA), Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy
- Primo Zingaretti
- VRAI - Vision Robotics and Artificial Intelligence Lab, Dipartimento di Ingegneria dell'Informazione (DII), Università Politecnica delle Marche, 60131, Ancona, Italy
155. Oku T, Furuya S, Lee A, Altenmüller E. Video-based diagnosis support system for pianists with Musician's dystonia. Front Neurol 2024; 15:1409962. PMID: 39015318; PMCID: PMC11250081; DOI: 10.3389/fneur.2024.1409962. Received 31 Mar 2024; accepted 18 Jun 2024. Open access.
Abstract
Background: Musician's dystonia is a task-specific movement disorder that deteriorates fine motor control of skilled movements in musical performance. Although this disorder threatens professional careers, its diagnosis is challenging for clinicians who have no specialized knowledge of musical performance. Objectives: To support diagnostic evaluation, the present study proposes a novel approach that uses a machine learning-based algorithm to identify the symptomatic movements of Musician's dystonia. Methods: We propose an algorithm that identifies dystonic movements using an anomaly detection method with an autoencoder trained on the hand kinematics of healthy pianists. A unique feature of the algorithm is that it requires only video footage of the hand, which can be acquired with a commercially available camera. We also measured hand biomechanical functions to assess the contribution of peripheral factors and improve the identification of dystonic symptoms. Results: The proposed algorithm identified Musician's dystonia with an accuracy and specificity of 90% based only on video footage of the hands. In addition, we identified a degradation of biomechanical functions involved in controlling multiple fingers, which is not specific to musical performance. By contrast, there were no dystonia-specific malfunctions of hand biomechanics, including the strength and agility of individual digits. Conclusion: These findings demonstrate the effectiveness of the present technique in aiding the accurate diagnosis of Musician's dystonia.
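The anomaly-detection logic described in Methods (flag inputs that a model trained only on healthy data reconstructs poorly) can be sketched with a trivial stand-in for the autoencoder. The features, values, and the 1.5 threshold margin below are illustrative only, not the study's actual model:

```python
def fit_mean_model(healthy_samples):
    # Stand-in for the trained autoencoder: "reconstruct" any input as
    # the feature-wise mean of the healthy training data.
    n = len(healthy_samples)
    dim = len(healthy_samples[0])
    return [sum(s[i] for s in healthy_samples) / n for i in range(dim)]

def reconstruction_error(sample, model):
    # Euclidean distance between a sample and its "reconstruction".
    return sum((a - b) ** 2 for a, b in zip(sample, model)) ** 0.5

def is_dystonic(sample, model, threshold):
    # Anomaly detection: inputs the model reconstructs poorly are
    # flagged as lying outside the healthy distribution.
    return reconstruction_error(sample, model) > threshold

healthy = [[1.0, 0.9], [1.1, 1.0], [0.9, 1.1]]   # toy kinematic features
model = fit_mean_model(healthy)
# Threshold derived from the healthy reconstruction errors plus a margin.
thr = max(reconstruction_error(s, model) for s in healthy) * 1.5
```

A real autoencoder replaces the mean model, but the decision rule (reconstruction error versus a threshold calibrated on healthy data) is the same.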
Affiliation(s)
- Takanori Oku
- College of Engineering and Design, Shibaura Institute of Technology, Tokyo, Japan
- Sony Computer Science Laboratories, Inc., Tokyo, Japan
- NeuroPiano Institute, Kyoto, Japan
- Shinichi Furuya
- Sony Computer Science Laboratories, Inc., Tokyo, Japan
- NeuroPiano Institute, Kyoto, Japan
- Institute of Music Physiology and Musicians’ Medicine, University of Music, Drama and Media, Hanover, Germany
- André Lee
- Institute of Music Physiology and Musicians’ Medicine, University of Music, Drama and Media, Hanover, Germany
- Department of Neurology, Klinikum rechts der Isar, Technical University of Munich, München, Germany
- Eckart Altenmüller
- Institute of Music Physiology and Musicians’ Medicine, University of Music, Drama and Media, Hanover, Germany
156. Tian Y, Deng N, Xu J, Wen Z. A fine-grained dataset for sewage outfalls objective detection in natural environments. Sci Data 2024; 11:724. PMID: 38956054; PMCID: PMC11219831; DOI: 10.1038/s41597-024-03574-9. Received 3 Apr 2024; accepted 24 Jun 2024. Open access.
Abstract
Pollution sources release contaminants into water bodies via sewage outfalls (SOs). Interpreting SOs from high-resolution images is laborious and expensive because it requires specialized knowledge and must be done by hand. Integrating unmanned aerial vehicles (UAVs) and deep learning technology could assist in constructing an automated SOs detection tool that captures this specialized knowledge. Achieving this objective requires high-quality image datasets for model training and testing. However, no satisfactory dataset of SOs exists. This study presents a high-quality dataset named images for sewage outfalls objective detection (iSOOD). The 10,481 images in iSOOD were captured using UAVs and handheld cameras by individuals across river basins in China. This study has carefully annotated these images to ensure accuracy and consistency. iSOOD has undergone technical validation using the YOLOv10 series of object detection models. Our study provides a high-quality SOs dataset for enhancing deep learning models with UAVs to achieve efficient and intelligent river basin management.
Affiliation(s)
- Yuqing Tian
- School of Environment, Tsinghua University, Beijing, 100084, PR China
- Ning Deng
- School of Environment, Tsinghua University, Beijing, 100084, PR China
- Jie Xu
- Changjiang Basin Ecology and Environment Monitoring and Scientific Research Center, Changjiang Basin Ecology and Environment Administration, Ministry of Ecology and Environment, Wuhan, 430010, China
- Zongguo Wen
- School of Environment, Tsinghua University, Beijing, 100084, PR China
157. Vardhan M, Tanade C, Chen SJ, Mahmood O, Chakravartti J, Jones WS, Kahn AM, Vemulapalli S, Patel M, Leopold JA, Randles A. Diagnostic Performance of Coronary Angiography Derived Computational Fractional Flow Reserve. J Am Heart Assoc 2024; 13:e029941. PMID: 38904250; PMCID: PMC11255717; DOI: 10.1161/jaha.123.029941. Received 12 Jun 2023; accepted 18 Apr 2024.
Abstract
Background: Computational fluid dynamics can compute fractional flow reserve (FFR) accurately. However, existing models are limited by either the intravascular hemodynamic phenomarkers that can be captured or the fidelity of the geometries that can be modeled. Methods and Results: This study aimed to validate a new coronary angiography-based FFR framework, FFRHARVEY, and examine intravascular hemodynamics to identify new biomarkers that could augment FFR in discerning unrevascularized patients requiring intervention. A two-center cohort was used to examine the diagnostic performance of FFRHARVEY compared with reference wire-based FFR (FFRINVASIVE). Additional biomarkers, longitudinal vorticity, velocity, and wall shear stress, were evaluated for their ability to augment FFR and indicate major adverse cardiac events. A total of 160 patients with 166 lesions were investigated. FFRHARVEY was compared with FFRINVASIVE by investigators blinded to the invasive FFR results, with a per-stenosis area under the curve of 0.91, positive predictive value of 90.2%, negative predictive value of 89.6%, sensitivity of 79.3%, and specificity of 95.4%. The percentage of discrepancy for continuous values of FFR was 6.63%. We identified a hemodynamic phenomarker, longitudinal vorticity, as a metric indicative of major adverse cardiac events in unrevascularized gray-zone cases. Conclusions: FFRHARVEY had high performance (area under the curve: 0.91; positive predictive value: 90.2%; negative predictive value: 89.6%) compared with FFRINVASIVE. The proposed framework provides a robust and accurate way to compute a complete set of intravascular phenomarkers, of which longitudinal vorticity was specifically shown to differentiate vessels predisposed to major adverse cardiac events.
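The reported per-stenosis statistics follow from a standard 2×2 confusion matrix. The counts below are our reconstruction, chosen so they are consistent with the reported percentages over the 166 lesions; they are not taken from the paper:

```python
def diagnostic_metrics(tp, fp, tn, fn):
    # Standard confusion-matrix metrics used to compare the
    # angiography-derived FFR against the wire-based reference.
    return {
        "sensitivity": tp / (tp + fn),   # true-positive rate
        "specificity": tn / (tn + fp),   # true-negative rate
        "ppv": tp / (tp + fp),           # positive predictive value
        "npv": tn / (tn + fn),           # negative predictive value
    }

# Hypothetical counts: 46 + 5 + 103 + 12 = 166 lesions, and they
# reproduce the reported 79.3% / 95.4% / 90.2% / 89.6%.
m = diagnostic_metrics(tp=46, fp=5, tn=103, fn=12)
```

Rounding each metric to one decimal place recovers the values quoted in the abstract.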
Affiliation(s)
- Cyrus Tanade
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- S. James Chen
- Department of Medicine, University of Colorado, Aurora, CO, USA
- Andrew M. Kahn
- Division of Cardiovascular Medicine, University of California San Diego, La Jolla, CA, USA
- Manesh Patel
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- Jane A. Leopold
- Division of Cardiovascular Medicine, Brigham and Women’s Hospital, Boston, MA, USA
158. García-Ruiz P, Romero-Ramirez FJ, Muñoz-Salinas R, Marín-Jiménez MJ, Medina-Carnicer R. Large-Scale Indoor Camera Positioning Using Fiducial Markers. Sensors (Basel) 2024; 24:4303. PMID: 39001083; PMCID: PMC11244017; DOI: 10.3390/s24134303. Received 17 Jun 2024; revised 27 Jun 2024; accepted 30 Jun 2024.
Abstract
Estimating the pose of a large set of fixed indoor cameras is a requirement for certain applications in augmented reality, autonomous navigation, video surveillance, and logistics. However, accurately mapping the positions of these cameras remains an unsolved problem. While providing partial solutions, existing alternatives are limited by their dependence on distinct environmental features, the requirement for large overlapping camera views, and specific conditions. This paper introduces a novel approach to estimating the pose of a large set of cameras using a small subset of fiducial markers printed on regular pieces of paper. By placing the markers in areas visible to multiple cameras, we can obtain an initial estimation of the pair-wise spatial relationship between them. The markers can be moved throughout the environment to obtain the relationship between all cameras, thus creating a graph connecting all cameras. In the final step, our method performs a full optimization, minimizing the reprojection errors of the observed markers and enforcing physical constraints, such as camera and marker coplanarity and control points. We validated our approach using novel artificial and real datasets with varying levels of complexity. Our experiments demonstrated superior performance over existing state-of-the-art techniques and increased effectiveness in real-world applications. Accompanying this paper, we provide the research community with access to our code, tutorials, and an application framework to support the deployment of our methodology.
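The step of chaining pair-wise relationships into a graph over all cameras can be sketched as a connectivity check: cameras that co-observe a marker placement get an edge, and every camera's pose is recoverable only if the resulting graph is connected. Camera names and observations below are invented for illustration:

```python
from collections import deque

def camera_graph(observations):
    # observations: list of sets, each the cameras that saw one marker
    # placement; co-observing cameras get a relative-pose edge.
    adj = {}
    for seen_by in observations:
        for a in seen_by:
            for b in seen_by:
                if a != b:
                    adj.setdefault(a, set()).add(b)
    return adj

def all_connected(adj, cameras):
    # BFS from an arbitrary camera; poses can be chained to every
    # other camera only if the graph is connected.
    if not cameras:
        return True
    start = next(iter(cameras))
    seen, queue = {start}, deque([start])
    while queue:
        u = queue.popleft()
        for v in adj.get(u, ()):
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return seen == set(cameras)

# Three marker placements chain four cameras together.
obs = [{"cam0", "cam1"}, {"cam1", "cam2"}, {"cam2", "cam3"}]
adj = camera_graph(obs)
```

In the paper's pipeline the edges carry relative poses refined by a final global optimization; here they only record which pairs can be related at all.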
Affiliation(s)
- Pablo García-Ruiz
- Departamento de Informática y Análisis Numérico, Edificio Einstein, Campus de Rabanales, Universidad de Córdoba, 14071 Córdoba, Spain
- Francisco J. Romero-Ramirez
- Departamento de Teoría de la Señal y Comunicaciones y Sistemas Telemáticos y Computación, Campus de Fuenlabrada, Universidad Rey Juan Carlos, 28942 Fuenlabrada, Spain
- Rafael Muñoz-Salinas
- Departamento de Informática y Análisis Numérico, Edificio Einstein, Campus de Rabanales, Universidad de Córdoba, 14071 Córdoba, Spain
- Instituto Maimónides de Investigación en Biomedicina (IMIBIC), Avenida Menéndez Pidal s/n, 14004 Córdoba, Spain
- Manuel J. Marín-Jiménez
- Departamento de Informática y Análisis Numérico, Edificio Einstein, Campus de Rabanales, Universidad de Córdoba, 14071 Córdoba, Spain
- Instituto Maimónides de Investigación en Biomedicina (IMIBIC), Avenida Menéndez Pidal s/n, 14004 Córdoba, Spain
- Rafael Medina-Carnicer
- Departamento de Informática y Análisis Numérico, Edificio Einstein, Campus de Rabanales, Universidad de Córdoba, 14071 Córdoba, Spain
- Instituto Maimónides de Investigación en Biomedicina (IMIBIC), Avenida Menéndez Pidal s/n, 14004 Córdoba, Spain
159. Gao Y, Zhang J, Zou C, Bi L, Huang C, Nie J, Yan Y, Yu X, Zhang F, Yao F, Ding L. A method for calculating vector forces at human-mattress interface during sleeping positions utilizing image registration. Sci Rep 2024; 14:15238. PMID: 38956282; PMCID: PMC11220148; DOI: 10.1038/s41598-024-66035-8. Received 25 Mar 2024; accepted 26 Jun 2024. Open access.
Abstract
The vector forces at the human-mattress interface are not only crucial for understanding the distribution of vertical and shear forces exerted on the human body during sleep but also serve as a significant input for biomechanical models of sleeping positions, whose accuracy determines the credibility of predicted musculoskeletal loads. In this study, we introduce a novel method for calculating the interface vector forces. By recording indentations after supine and lateral positions using a vacuum mattress and a 3D scanner, we use image registration techniques to align the body pressure distribution with the mattress deformation scans, thereby calculating the vector force for each unit area (36.25 mm × 36.25 mm). The method was validated with five participants from two perspectives: (1) the mean sum of the vertical force components is 98.67% ± 7.21% of body weight, exhibiting good consistency, and the mean ratio of the horizontal force component to body weight is 2.18% ± 1.77%; (2) the muscle activity predicted by the sleeping-position model with the vector forces as input aligns with the measured muscle activity (%MVC), with correlation coefficients over 0.7. The proposed method contributes to the understanding of vector force distribution and the analysis of musculoskeletal loads during sleep, providing valuable insights for mattress design and evaluation.
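The first validation check can be sketched directly from per-cell force vectors: sum the vertical components and compare against body weight, and take the net in-plane component as shear. The cell values and body weight below are toy numbers, not measured data:

```python
def force_components(cell_forces):
    # cell_forces: (fx, fy, fz) vectors in newtons, one per
    # 36.25 mm x 36.25 mm cell; z is vertical.
    fx_sum = sum(f[0] for f in cell_forces)
    fy_sum = sum(f[1] for f in cell_forces)
    fz_sum = sum(f[2] for f in cell_forces)
    shear = (fx_sum ** 2 + fy_sum ** 2) ** 0.5   # net horizontal force
    return fz_sum, shear

def validation_ratios(cell_forces, body_weight_n):
    # The paper's checks: vertical components should sum to ~100% of
    # body weight, while the net horizontal component stays small.
    fz_sum, shear = force_components(cell_forces)
    return fz_sum / body_weight_n, shear / body_weight_n

# Three toy cells whose horizontal components cancel out.
cells = [(1.0, 0.0, 200.0), (-1.0, 2.0, 300.0), (0.0, -2.0, 190.0)]
v_ratio, s_ratio = validation_ratios(cells, body_weight_n=700.0)
```

Here the vertical sum recovers about 98.6% of the toy body weight, with zero net shear.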
Affiliation(s)
- Ying Gao
- Beijing Advanced Innovation Center for Biomedical Engineering, Key Laboratory for Biomechanics and Mechanobiology of Ministry of Education, School of Biological Science and Medical Engineering, Beihang University, Beijing, 100191, China
- Jing Zhang
- Beijing Advanced Innovation Center for Biomedical Engineering, Key Laboratory for Biomechanics and Mechanobiology of Ministry of Education, School of Biological Science and Medical Engineering, Beihang University, Beijing, 100191, China
- Chengzhao Zou
- Beijing Advanced Innovation Center for Biomedical Engineering, Key Laboratory for Biomechanics and Mechanobiology of Ministry of Education, School of Biological Science and Medical Engineering, Beihang University, Beijing, 100191, China
- Liwen Bi
- Beijing Advanced Innovation Center for Biomedical Engineering, Key Laboratory for Biomechanics and Mechanobiology of Ministry of Education, School of Biological Science and Medical Engineering, Beihang University, Beijing, 100191, China
- Chengzhen Huang
- Beijing Advanced Innovation Center for Biomedical Engineering, Key Laboratory for Biomechanics and Mechanobiology of Ministry of Education, School of Biological Science and Medical Engineering, Beihang University, Beijing, 100191, China
- Jiachen Nie
- Beijing Advanced Innovation Center for Biomedical Engineering, Key Laboratory for Biomechanics and Mechanobiology of Ministry of Education, School of Biological Science and Medical Engineering, Beihang University, Beijing, 100191, China
- Yongli Yan
- Beijing Advanced Innovation Center for Biomedical Engineering, Key Laboratory for Biomechanics and Mechanobiology of Ministry of Education, School of Biological Science and Medical Engineering, Beihang University, Beijing, 100191, China
- Xinli Yu
- Beijing Advanced Innovation Center for Biomedical Engineering, Key Laboratory for Biomechanics and Mechanobiology of Ministry of Education, School of Biological Science and Medical Engineering, Beihang University, Beijing, 100191, China
- Fujun Zhang
- De Rucci Healthy Sleep Co., Ltd, Dongguan, 523960, Guangdong, China
- Fanglai Yao
- De Rucci Healthy Sleep Co., Ltd, Dongguan, 523960, Guangdong, China
- Li Ding
- Beijing Advanced Innovation Center for Biomedical Engineering, Key Laboratory for Biomechanics and Mechanobiology of Ministry of Education, School of Biological Science and Medical Engineering, Beihang University, Beijing, 100191, China
160. Fujimoto K, Ashida H. Influence of scene aspect ratio and depth cues on verticality perception bias. J Vis 2024; 24:12. PMID: 39028900; DOI: 10.1167/jov.24.7.12. Open access.
Abstract
Perceiving verticality is crucial for accurate spatial orientation. Previous research has revealed that tilted scenes can bias verticality perception. This bias can be represented as the sum of multiple periodic functions involved in the perception of visual orientation, but the specific factors affecting each periodicity remain uncertain. This study investigated the influence of the width and depth of an indoor scene on each periodic component of the bias. Participants were presented with an indoor scene showing a rectangular checkerboard room (Experiment 1), a rectangular aperture in the wall (Experiment 2), or a rectangular dotted room (Experiment 3), with various aspect ratios. The stimuli were presented at roll orientations ranging from 90° clockwise to 90° counterclockwise, and participants reported their subjective visual vertical (SVV). The contributions of the 45°, 90°, and 180° periodicities to the SVV error were assessed with a weighted vector sum model. In Experiment 1, the periodic components of the SVV error increased with the aspect ratio. In Experiments 2 and 3, only the 90° component increased with the aspect ratio. These findings suggest that extended transverse surfaces may modulate the periodic components of verticality perception.
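A weighted sum of periodic components can be sketched as follows: a term with period p (in degrees of roll) contributes a sinusoid of frequency 360/p. The pure-sine form with zero phases is our simplification for illustration; the study's actual model parameterization may differ:

```python
import math

def svv_bias(theta_deg, w45, w90, w180):
    # Weighted sum of periodic components of the SVV error.
    # Periods of 45, 90, and 180 degrees correspond to sin(8*theta),
    # sin(4*theta), and sin(2*theta) respectively.
    return (w45 * math.sin(math.radians(8 * theta_deg))
            + w90 * math.sin(math.radians(4 * theta_deg))
            + w180 * math.sin(math.radians(2 * theta_deg)))

# Example: predicted bias at a 20-degree roll with illustrative weights.
bias_at_20 = svv_bias(20.0, w45=0.5, w90=1.0, w180=0.8)
```

Each component repeats with exactly its nominal period, so isolating one weight and shifting the roll angle by that period leaves the bias unchanged.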
Affiliation(s)
- Kanon Fujimoto
- Department of Psychology, Graduate School of Letters, Kyoto University, Japan
- Hiroshi Ashida
- Department of Psychology, Graduate School of Letters, Kyoto University, Japan
161. Ji H, Han P, Li J, Liu X, Liu L. Transformer Discharge Carbon-Trace Detection Based on Improved MSRCR Image-Enhancement Algorithm and YOLOv8 Model. Sensors (Basel) 2024; 24:4309. PMID: 39001089; PMCID: PMC11244463; DOI: 10.3390/s24134309. Received 25 May 2024; revised 17 Jun 2024; accepted 27 Jun 2024.
Abstract
It is difficult to visually detect internal defects in a large transformer with a metal enclosure. For convenient internal inspection, a micro-robot was adopted, and an inspection method based on an image-enhancement algorithm and an improved deep-learning network is proposed in this paper. Considering the dim environment inside the transformer and the irregular imaging distance and fluctuating supplementary lighting during image acquisition by the inspection robot, an improved MSRCR algorithm for image enhancement is proposed; it analyzes the local contrast of the image and enhances details at multiple scales. A white-balance algorithm is also introduced to enhance contrast and brightness and to correct overexposure and color distortion. To improve target recognition of complex carbon-trace defects, the SimAM attention mechanism is incorporated into the Backbone network of the YOLOv8 model to enhance the extraction of carbon-trace features, and the DyHead dynamic detection head is constructed at the output of the model to improve the perception of local carbon traces of different sizes. To increase the defect recognition speed of the inspection robot, the YOLOv8 model is pruned to remove redundant parameters, yielding a lightweight model and improving detection efficiency. To verify the effectiveness of the improved algorithm, the detection model was trained and validated on the carbon-trace dataset. The results show that the MSH-YOLOv8 algorithm achieves an accuracy of 91.80%, which is 3.4 percentage points higher than the original YOLOv8 algorithm, with a significant advantage over other mainstream target-detection algorithms. Meanwhile, the FPS of the proposed algorithm reaches 99.2, indicating that model computation and complexity were successfully reduced, which meets the requirements for engineering applications of the transformer internal-inspection robot.
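The multi-scale surround-subtraction idea behind MSRCR can be sketched on a 1-D signal: at each scale, subtract the log of a blurred "surround" from the log of the signal, then average across scales. The box filter, the scale choices, and the omission of MSRCR's color-restoration and gain/offset steps are simplifications for illustration:

```python
import math

def box_blur(signal, radius):
    # Crude surround estimate; a box filter stands in for the Gaussian
    # surround used by retinex-style algorithms.
    out = []
    n = len(signal)
    for i in range(n):
        lo, hi = max(0, i - radius), min(n, i + radius + 1)
        out.append(sum(signal[lo:hi]) / (hi - lo))
    return out

def multi_scale_retinex(signal, radii=(1, 2, 4)):
    # MSR: average, over several scales, of log(signal) - log(surround).
    # MSRCR adds color restoration and gain/offset terms on top of this.
    blurred = [box_blur(signal, r) for r in radii]
    return [
        sum(math.log(s) - math.log(b[i]) for b in blurred) / len(radii)
        for i, s in enumerate(signal)
    ]

# A dim signal with one bright local detail in the middle.
enhanced = multi_scale_retinex([10.0, 10.0, 80.0, 10.0, 10.0])
```

The bright detail keeps a positive response relative to its surround while flat regions are pushed toward zero or below, which is the local-contrast behavior the paper exploits.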
Affiliation(s)
- Hongxin Ji
- School of Electrical Engineering, China University of Mining and Technology, Xuzhou 221116, China
- Peilin Han
- School of Electrical Engineering, China University of Mining and Technology, Xuzhou 221116, China
- Jiaqi Li
- School of Electrical Engineering, China University of Mining and Technology, Xuzhou 221116, China
- Xinghua Liu
- College of Mechanical and Electronic Engineering, Shandong Agricultural University, Tai'an 271018, China
- Liqing Liu
- State Grid Tianjin Electric Power Research Institute, Tianjin 300180, China
162
Lin J, Yin H, Wu Y, Luo J, Ye Q, Zhou B, Xie M, Ye C, Liang J, Li X, Bin W, Yang Z. Stitching method for panoramic nail fold images based on capillary contour enhancement. J Biophotonics 2024:e202400105. [PMID: 38955359 DOI: 10.1002/jbio.202400105] [Received: 03/16/2024] [Revised: 05/21/2024] [Accepted: 06/12/2024] [Indexed: 07/04/2024]
Abstract
Nail fold capillaroscopy is an important means of monitoring human health, and panoramic nail fold images improve the efficiency and accuracy of examinations. However, the acquisition of panoramic nail fold images is seldom studied, and few matching feature points are available when standard image stitching is applied to such images. This paper therefore presents a method for panoramic nail fold image stitching based on vascular contour enhancement, which first addresses the shortage of matching feature points by pre-processing the images with contrast-limited adaptive histogram equalization (CLAHE), bilateral filtering (BF), and sharpening algorithms. The panoramic images of the nail fold blood vessels are then stitched using the speeded-up robust features (SURF), fast library for approximate nearest neighbors (FLANN), and random sample consensus (RANSAC) algorithms. The experimental results show that the panoramic image stitched by this algorithm has a field-of-view width of 7.43 mm, which improves the efficiency and accuracy of diagnosis.
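The stitching stage relies on RANSAC to reject the false matches that survive FLANN matching. A minimal sketch of the consensus principle on a toy line-fitting problem (the paper estimates homographies between overlapping frames instead; `ransac_line` and its parameters are illustrative):

```python
import numpy as np

def ransac_line(points, iters=200, tol=0.1, seed=0):
    # Fit y = a*x + b by random sample consensus: repeatedly fit a model
    # to a random minimal sample (2 points) and keep the model with the
    # largest consensus set of inliers.
    rng = np.random.default_rng(seed)
    best_inliers = np.zeros(len(points), dtype=bool)
    for _ in range(iters):
        i, j = rng.choice(len(points), size=2, replace=False)
        (x1, y1), (x2, y2) = points[i], points[j]
        if x1 == x2:
            continue
        a = (y2 - y1) / (x2 - x1)
        b = y1 - a * x1
        inliers = np.abs(points[:, 1] - (a * points[:, 0] + b)) < tol
        if inliers.sum() > best_inliers.sum():
            best_inliers = inliers
    # Refit on the consensus set with least squares.
    x, y = points[best_inliers].T
    a, b = np.polyfit(x, y, 1)
    return a, b, best_inliers
```

Gross outliers (here, mismatched feature points) never join the consensus set, so the final least-squares refit is unaffected by them.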
Affiliation(s)
- Jianan Lin
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan, China
- Hao Yin
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan, China
- Yanxiong Wu
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan, China
- Ji Hua Laboratory, Foshan, Guangdong, China
- Jiaxiong Luo
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan, China
- Qianyao Ye
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan, China
- Bin Zhou
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan, China
- Mugui Xie
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan, China
- Cong Ye
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan, China
- Junzhao Liang
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan, China
- Xiaosong Li
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan, China
- Wei Bin
- State Key Laboratory of Traditional Chinese Medicine Syndrome/Health Construction Center, The Second Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, China
- Zhimin Yang
- State Key Laboratory of Traditional Chinese Medicine Syndrome/Health Construction Center, The Second Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangzhou, China
163
Matloob Abbasi M, Iqbal S, Aurangzeb K, Alhussein M, Khan TM. LMBiS-Net: A lightweight bidirectional skip connection based multipath CNN for retinal blood vessel segmentation. Sci Rep 2024; 14:15219. [PMID: 38956117 PMCID: PMC11219784 DOI: 10.1038/s41598-024-63496-9] [Received: 10/31/2023] [Accepted: 05/29/2024] [Indexed: 07/04/2024]
Abstract
Blinding eye diseases are often related to changes in retinal structure, which can be detected by analysing retinal blood vessels in fundus images. However, existing techniques struggle to accurately segment these delicate vessels. Although deep learning has shown promise in medical image segmentation, its reliance on specific operations can limit its ability to capture crucial details such as vessel edges. This paper introduces LMBiS-Net, a lightweight convolutional neural network designed for the segmentation of retinal vessels. LMBiS-Net achieves exceptional performance with a remarkably low number of learnable parameters (only 0.172 million). The network uses multipath feature extraction blocks and incorporates bidirectional skip connections for information flow between the encoder and decoder. In addition, we have optimised the efficiency of the model by carefully selecting the number of filters to avoid filter overlap. This optimisation significantly reduces training time and improves computational efficiency. To assess LMBiS-Net's robustness and ability to generalise to unseen data, we conducted comprehensive evaluations on four publicly available datasets: DRIVE, STARE, CHASE_DB1, and HRF. The proposed LMBiS-Net obtains sensitivity values of 83.60%, 84.37%, 86.05%, and 83.48%; specificity values of 98.83%, 98.77%, 98.96%, and 98.77%; accuracy scores of 97.08%, 97.69%, 97.75%, and 96.90%; and AUC values of 98.80%, 98.82%, 98.71%, and 88.77% on the DRIVE, STARE, CHASE_DB1, and HRF datasets, respectively. In addition, it records F1 scores of 83.43%, 84.44%, 83.54%, and 78.73% on the same datasets. Our evaluations demonstrate that LMBiS-Net achieves high segmentation accuracy while exhibiting both robustness and generalisability across retinal image datasets, making it a promising tool for various clinical applications.
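The 0.172-million-parameter budget comes almost entirely from how filter counts are chosen, since a 2-D convolution costs k·k·C_in·C_out weights plus C_out biases. A quick counting sketch (the layer widths below are hypothetical, not LMBiS-Net's actual configuration) shows why narrow channel widths matter:

```python
def conv_params(c_in, c_out, k=3, bias=True):
    # Learnable parameters of one 2-D convolution layer:
    # k*k weights per (input-channel, output-channel) pair, plus biases.
    return k * k * c_in * c_out + (c_out if bias else 0)

# Two hypothetical 4-layer encoders: a slim one and a conventional wide one.
slim = sum(conv_params(a, b) for a, b in [(1, 16), (16, 16), (16, 32), (32, 32)])
wide = sum(conv_params(a, b) for a, b in [(1, 64), (64, 64), (64, 128), (128, 128)])
print(slim, wide)  # the slim stack is ~16x smaller
```

Only configurations in the tens of thousands of parameters per stage, like the slim one, can keep a whole network under a 0.172 M budget.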
Affiliation(s)
- Mufassir Matloob Abbasi
- Department of Electrical Engineering, Abasyn University Islamabad Campus (AUIC), Islamabad, 44000, Pakistan
- Shahzaib Iqbal
- Department of Electrical Engineering, Abasyn University Islamabad Campus (AUIC), Islamabad, 44000, Pakistan.
- Khursheed Aurangzeb
- Department of Computer Engineering, College of Computer and Information Sciences, King Saud University, Riyadh, P. O. Box 51178, 11543, Saudi Arabia
- Musaed Alhussein
- Department of Computer Engineering, College of Computer and Information Sciences, King Saud University, Riyadh, P. O. Box 51178, 11543, Saudi Arabia
- Tariq M Khan
- School of Computer Science and Engineering, University of New South Wales, Sydney, NSW, Australia
164
Kang Q, Lao Q, Gao J, Liu J, Yi H, Ma B, Zhang X, Li K. Deblurring masked image modeling for ultrasound image analysis. Med Image Anal 2024; 97:103256. [PMID: 39047605 DOI: 10.1016/j.media.2024.103256] [Received: 10/16/2023] [Revised: 03/19/2024] [Accepted: 06/24/2024] [Indexed: 07/27/2024]
Abstract
Recently, large pretrained vision foundation models based on masked image modeling (MIM) have attracted unprecedented attention and achieved remarkable performance across various tasks. However, MIM for ultrasound imaging remains relatively unexplored, and, most importantly, current MIM approaches fail to account for the gap between natural images and ultrasound, as well as the intrinsic imaging characteristics of the ultrasound modality, such as its high noise-to-signal ratio. In this paper, motivated by this unique property of ultrasound, we propose a deblurring MIM approach specialized for ultrasound, which incorporates a deblurring task into the pretraining proxy task. The deblurring task encourages the pretraining to better recover the subtle details within ultrasound images that are vital for subsequent downstream analysis. Furthermore, we employ a multi-scale hierarchical encoder to extract both local and global contextual cues for improved performance, especially on pixel-wise tasks such as segmentation. We conduct extensive experiments involving 280,000 ultrasound images for pretraining and evaluate the downstream transfer performance of the pretrained model on various disease diagnoses (nodule, Hashimoto's thyroiditis) and task types (classification, segmentation). The experimental results demonstrate the efficacy of the proposed deblurring MIM, which achieves state-of-the-art performance across a wide range of downstream tasks and datasets. Overall, our work highlights the potential of deblurring MIM for ultrasound image analysis, presenting an ultrasound-specific vision foundation model.
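The proxy task combines two corruptions: patch masking (standard MIM) and blurring (the ultrasound-specific addition), with the sharp original as the reconstruction target. A toy data-preparation sketch of that pairing, with an illustrative 4-neighbour average standing in for a proper blur kernel (`make_mim_pair` and all constants are assumptions, not the paper's pipeline):

```python
import numpy as np

def make_mim_pair(img, patch=4, mask_ratio=0.75, seed=0):
    # One (input, target) pair for a deblurring masked-image-modeling task:
    # the input is a blurred copy with most patches zeroed out; the target
    # is the original sharp image, so the network must inpaint the masked
    # patches *and* undo the blur to recover subtle details.
    rng = np.random.default_rng(seed)
    h, w = img.shape
    blurred = img.astype(float).copy()
    # Cheap blur stand-in: average interior pixels with their 4 neighbours.
    blurred[1:-1, 1:-1] = (img[1:-1, 1:-1] + img[:-2, 1:-1] + img[2:, 1:-1]
                           + img[1:-1, :-2] + img[1:-1, 2:]) / 5.0
    mask_small = rng.random((h // patch, w // patch)) < mask_ratio
    mask = np.kron(mask_small.astype(int), np.ones((patch, patch), int)).astype(bool)
    return np.where(mask, 0.0, blurred), img.astype(float), mask
```

Plain MIM would use `img` itself (masked) as input; feeding the blurred copy instead is what turns reconstruction into joint inpainting plus deblurring.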
Affiliation(s)
- Qingbo Kang
- Department of Ultrasonography, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, China; West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, China; Shanghai Artificial Intelligence Laboratory, Shanghai, 200030, China
- Qicheng Lao
- School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, 100876, China; Shanghai Artificial Intelligence Laboratory, Shanghai, 200030, China.
- Jun Gao
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, China; College of Computer Science, Sichuan University, Chengdu, Sichuan, 610041, China
- Jingyan Liu
- Department of Ultrasonography, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, China
- Huahui Yi
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, China
- Buyun Ma
- Department of Ultrasonography, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, China
- Xiaofan Zhang
- Shanghai Artificial Intelligence Laboratory, Shanghai, 200030, China; Shanghai Jiao Tong University, Shanghai, 200240, China
- Kang Li
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, China; Shanghai Artificial Intelligence Laboratory, Shanghai, 200030, China.
165
Nie T, Zhao Y, Yao S. ELA-Net: An Efficient Lightweight Attention Network for Skin Lesion Segmentation. Sensors (Basel) 2024; 24:4302. [PMID: 39001081 PMCID: PMC11243870 DOI: 10.3390/s24134302] [Received: 06/04/2024] [Revised: 06/25/2024] [Accepted: 06/27/2024] [Indexed: 07/16/2024]
Abstract
In clinical settings limited by equipment, lightweight skin lesion segmentation is pivotal, as it allows the model to be integrated into diverse medical devices and improves operational efficiency. However, a lightweight design may suffer accuracy degradation, especially on complex images such as skin lesion images with irregular regions and blurred or oversized boundaries. To address these challenges, we propose an efficient lightweight attention network (ELANet) for the skin lesion segmentation task. In ELANet, the two different attention mechanisms of the bilateral residual module (BRM) provide complementary information, enhancing sensitivity to features in the spatial and channel dimensions, respectively, and multiple BRMs are stacked for efficient feature extraction. In addition, the network acquires global information and improves segmentation accuracy by passing feature maps of different scales through multi-scale attention fusion (MAF) operations. Finally, we evaluate ELANet on three publicly available datasets, ISIC2016, ISIC2017, and ISIC2018; the experimental results show that our algorithm achieves mIoU scores of 89.87%, 81.85%, and 82.87% on the three datasets with only 0.459 M parameters, an excellent balance between accuracy and model size that is superior to many existing segmentation methods.
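The headline metric, mean intersection-over-union, is straightforward to compute from discrete masks; a minimal implementation (the class-skipping convention when a class is absent from both masks is one common choice, not necessarily the paper's exact protocol):

```python
import numpy as np

def mean_iou(pred, target, num_classes=2):
    # Mean intersection-over-union across classes; a class absent from
    # both prediction and ground truth is skipped rather than scored 0.
    ious = []
    for c in range(num_classes):
        p, t = pred == c, target == c
        union = np.logical_or(p, t).sum()
        if union:
            ious.append(np.logical_and(p, t).sum() / union)
    return float(np.mean(ious))
```

Because IoU averages over classes rather than pixels, a tiny lesion mis-segmented against a large background drags the score down far more than plain pixel accuracy would.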
Affiliation(s)
- Tianyu Nie
- School of Geography and Information Engineering, China University of Geosciences, Wuhan 430074, China
- Yishi Zhao
- School of Computer Science, China University of Geosciences, Wuhan 430074, China
- Engineering Research Center of Natural Resource Information Management and Digital Twin Engineering Software, Ministry of Education, Wuhan 430074, China
- Shihong Yao
- School of Geography and Information Engineering, China University of Geosciences, Wuhan 430074, China
166
Chiu CH, Chen YJ, Wu Y, Shi Y, Ho TY. Achieve fairness without demographics for dermatological disease diagnosis. Med Image Anal 2024; 95:103188. [PMID: 38718715 DOI: 10.1016/j.media.2024.103188] [Received: 12/28/2023] [Revised: 04/29/2024] [Accepted: 04/29/2024] [Indexed: 06/01/2024]
Abstract
In medical image diagnosis, fairness has become increasingly crucial. Without bias mitigation, deploying unfair AI would harm the interests of the underprivileged population and potentially tear society apart. Recent research addresses prediction biases in deep learning models concerning demographic groups (e.g., gender, age, and race) by utilizing demographic (sensitive attribute) information during training. However, many sensitive attributes naturally exist in dermatological disease images. If the trained model only targets fairness for a specific attribute, it remains unfair for other attributes. Moreover, training a model that can accommodate multiple sensitive attributes is impractical due to privacy concerns. To overcome this, we propose a method enabling fair predictions for sensitive attributes during the testing phase without using such information during training. Inspired by prior work highlighting the impact of feature entanglement on fairness, we enhance the model features by capturing the features related to the sensitive and target attributes and regularizing the feature entanglement between corresponding classes. This ensures that the model can only classify based on the features related to the target attribute without relying on features associated with sensitive attributes, thereby improving fairness and accuracy. Additionally, we use disease masks from the Segment Anything Model (SAM) to enhance the quality of the learned feature. Experimental results demonstrate that the proposed method can improve fairness in classification compared to state-of-the-art methods in two dermatological disease datasets.
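Fairness of this kind is typically audited after training by comparing performance across demographic groups that the model never saw; a common summary is the worst-group accuracy and the best-to-worst gap. A minimal audit sketch (the metric choice and the toy data are illustrative, not the paper's evaluation protocol):

```python
import numpy as np

def worst_group_accuracy(correct, groups):
    # Per-group accuracy over held-out predictions; `groups` holds the
    # sensitive attribute, used only at audit time, never for training.
    accs = {g: correct[groups == g].mean() for g in np.unique(groups)}
    return min(accs.values()), max(accs.values()) - min(accs.values())

# Toy audit: 8 test cases, two demographic groups "a" and "b".
correct = np.array([1, 1, 0, 1, 1, 1, 0, 0], dtype=float)
groups = np.array(["a", "a", "a", "a", "b", "b", "b", "b"])
worst, gap = worst_group_accuracy(correct, groups)
```

A method that "achieves fairness without demographics" aims to shrink `gap` for every plausible grouping at once, which is exactly why per-attribute debiasing during training is insufficient.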
Affiliation(s)
- Ching-Hao Chiu
- National Tsing Hua University, No. 101, Section 2, Kuang-Fu Road, Hsinchu, Taiwan.
- Yu-Jen Chen
- National Tsing Hua University, No. 101, Section 2, Kuang-Fu Road, Hsinchu, Taiwan
- Yawen Wu
- University of Notre Dame, Holy Cross Dr, Notre Dame, IN, USA
- Yiyu Shi
- University of Notre Dame, Holy Cross Dr, Notre Dame, IN, USA
- Tsung-Yi Ho
- The Chinese University of Hong Kong, Shatin, NT, Hong Kong
167
Fu J, Yang Z, Melemenidis S, Viswanathan V, Dutt S, Manjappa R, Lau B, Soto LA, Ashraf MR, Skinner L, Yu SJ, Surucu M, Casey KM, Rankin EB, Graves E, Lu W, Loo BW, Gu X. Exploring Deep Learning for Estimating the Isoeffective Dose of FLASH Irradiation From Mouse Intestinal Histological Images. Int J Radiat Oncol Biol Phys 2024; 119:1001-1010. [PMID: 38171387 DOI: 10.1016/j.ijrobp.2023.12.032] [Received: 07/05/2023] [Revised: 12/09/2023] [Accepted: 12/23/2023] [Indexed: 01/05/2024]
Abstract
PURPOSE Ultrahigh-dose-rate (FLASH) irradiation has been reported to reduce normal tissue damage compared with conventional dose rate (CONV) irradiation without compromising tumor control. This proof-of-concept study aims to develop a deep learning (DL) approach to quantify the FLASH isoeffective dose (the CONV dose that would be required to produce the same effect as a given physical FLASH dose) from postirradiation mouse intestinal histology images. METHODS AND MATERIALS Eighty-four healthy C57BL/6J female mice underwent 16 MeV electron CONV (0.12 Gy/s; n = 41) or FLASH (200 Gy/s; n = 43) single-fraction whole-abdominal irradiation. Physical dose ranged from 12 to 16 Gy for FLASH and 11 to 15 Gy for CONV in 1 Gy increments. Four days after irradiation, 9 jejunum cross-sections from each mouse were hematoxylin and eosin stained and digitized for histological analysis. The CONV data set was randomly split into training (n = 33) and testing (n = 8) sets. ResNet101-based DL models were retrained on the CONV training set to estimate dose from histological features. The classical manual crypt counting (CC) approach was implemented for comparison. Cross-section-wise mean squared error was computed to evaluate the dose estimation accuracy of both approaches. The validated DL model was then applied to the FLASH data set to map the physical FLASH dose to the isoeffective dose. RESULTS The DL model achieved a cross-section-wise mean squared error of 0.20 Gy² on the CONV testing set, compared with 0.40 Gy² for the CC approach. Isoeffective doses estimated by the DL model for FLASH doses of 12, 13, 14, 15, and 16 Gy were 12.19 ± 0.46, 12.54 ± 0.37, 12.69 ± 0.26, 12.84 ± 0.26, and 13.03 ± 0.28 Gy, respectively. CONCLUSIONS Our proposed DL model achieved accurate CONV dose estimation, and its results indicate that in the physical dose range of 13 to 16 Gy, the biologic response of small intestinal tissue to FLASH irradiation corresponds to a lower isoeffective dose than the physical dose. Our DL approach can serve as a tool for studying isoeffective doses of other dose-modifying interventions.
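The underlying logic, calibrate a tissue-damage readout against CONV dose and then invert that calibration for FLASH samples, can be sketched with the classical crypt-count readout that the DL model replaces with learned histological features. All numbers below are illustrative, not the study's data:

```python
import numpy as np

# Hypothetical CONV calibration: surviving crypt counts fall roughly
# linearly with dose over this range (illustrative values only).
conv_dose = np.array([11.0, 12.0, 13.0, 14.0, 15.0])
conv_crypts = np.array([120.0, 100.0, 80.0, 60.0, 40.0])

a, b = np.polyfit(conv_dose, conv_crypts, 1)  # crypts ~ a*dose + b

def isoeffective_dose(flash_crypts):
    # Invert the CONV calibration: the CONV dose that would produce the
    # same tissue effect (crypt count) as the observed FLASH response.
    return (flash_crypts - b) / a
```

A FLASH section showing more surviving crypts than its physical dose predicts maps to a lower isoeffective dose, which is the normal-tissue-sparing signal the study quantifies.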
Affiliation(s)
- Jie Fu
- Department of Radiation Oncology, University of Washington, Seattle, Washington; Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California
- Zi Yang
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California; Medical Artificial Intelligence and Automation (MAIA) Laboratory, Department of Radiation Oncology, University of Texas Southwestern Medical Center, Dallas, Texas
- Stavros Melemenidis
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California
- Vignesh Viswanathan
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California
- Suparna Dutt
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California
- Rakesh Manjappa
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California
- Brianna Lau
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California
- Luis A Soto
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California
- M Ramish Ashraf
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California
- Lawrie Skinner
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California
- Shu-Jung Yu
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California
- Murat Surucu
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California
- Kerriann M Casey
- Department of Comparative Medicine, Stanford University School of Medicine, Stanford, California
- Erinn B Rankin
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California
- Edward Graves
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California
- Weiguo Lu
- Medical Artificial Intelligence and Automation (MAIA) Laboratory, Department of Radiation Oncology, University of Texas Southwestern Medical Center, Dallas, Texas
- Billy W Loo
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California.
- Xuejun Gu
- Department of Radiation Oncology, Stanford University School of Medicine, Stanford, California.
168
Li X, Chi X, Huang P, Liang Q, Liu J. Deep neural network for the prediction of KRAS, NRAS, and BRAF genotypes in left-sided colorectal cancer based on histopathologic images. Comput Med Imaging Graph 2024; 115:102384. [PMID: 38759471 DOI: 10.1016/j.compmedimag.2024.102384] [Received: 11/24/2023] [Revised: 04/14/2024] [Accepted: 04/14/2024] [Indexed: 05/19/2024]
Abstract
BACKGROUND The KRAS, NRAS, and BRAF genotypes are critical for selecting targeted therapies for patients with metastatic colorectal cancer (mCRC). Here, we aimed to develop a deep learning model that utilizes pathologic whole-slide images (WSIs) to accurately predict the status of KRAS, NRAS, and BRAFV600E. METHODS 129 patients with left-sided colon cancer and rectal cancer from the Third Affiliated Hospital of Sun Yat-sen University were assigned to the training and testing cohorts. Utilizing three convolutional neural networks (ResNet18, ResNet50, and Inception v3), we extracted 206 pathological features from H&E-stained WSIs, serving as the foundation for constructing specific pathological models. A clinical feature model was then developed, with carcinoembryonic antigen (CEA) identified through comprehensive multiple regression analysis as the key biomarker. Subsequently, these two models were combined to create a clinical-pathological integrated model, resulting in a total of three genetic prediction models. RESULTS 103 patients were evaluated in the training cohort (1,782,302 image tiles), while the remaining 26 patients were enrolled in the testing cohort (489,481 image tiles). Compared with the clinical model and the pathology model, the combined model, which incorporated CEA levels and pathological signatures, showed increased predictive ability, with an area under the curve (AUC) of 0.96 in the training cohort and 0.83 in the testing cohort, accompanied by a high positive predictive value (PPV, 0.92). CONCLUSION The combined model demonstrated a considerable ability to accurately predict the status of KRAS, NRAS, and BRAFV600E in patients with left-sided colorectal cancer, with potential to assist doctors in developing targeted treatment strategies for mCRC patients and to identify mutations without the need for confirmatory genetic testing.
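The AUC values reported above have a simple rank-based reading: the probability that a randomly chosen mutated case receives a higher model score than a randomly chosen wild-type case. A direct implementation of that (Mann-Whitney) formulation, with toy scores for illustration:

```python
import numpy as np

def auc(scores, labels):
    # Area under the ROC curve via the rank (Mann-Whitney) formulation:
    # the probability that a random positive outscores a random negative,
    # with ties counted as half a win.
    pos, neg = scores[labels == 1], scores[labels == 0]
    wins = (pos[:, None] > neg[None, :]).sum() \
        + 0.5 * (pos[:, None] == neg[None, :]).sum()
    return wins / (len(pos) * len(neg))
```

An AUC of 0.83 on the testing cohort means roughly five out of six random mutated/wild-type pairs are ranked correctly; the PPV of 0.92 additionally depends on the chosen decision threshold and the mutation prevalence.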
Affiliation(s)
- Xuejie Li
- Department of Gastrointestinal Surgery, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, PR China
- Xianda Chi
- Department of Gastrointestinal Surgery, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, PR China
- Pinjie Huang
- Department of Anaesthesia, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, PR China
- Qiong Liang
- Department of Pathology, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, PR China.
- Jianpei Liu
- Department of Gastrointestinal Surgery, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, Guangdong, PR China.
169
Zhu Z, Nan L, Xie H, Chen H, Wang J, Wei M, Qin J. CSDN: Cross-Modal Shape-Transfer Dual-Refinement Network for Point Cloud Completion. IEEE Trans Vis Comput Graph 2024; 30:3545-3563. [PMID: 37018698 DOI: 10.1109/tvcg.2023.3236061] [Indexed: 06/19/2023]
Abstract
How would you repair a physical object with missing parts? You might imagine its original shape from previously captured images, first recover its overall (global) but coarse shape, and then refine its local details. We are motivated to imitate this physical repair procedure to address point cloud completion. To this end, we propose a cross-modal shape-transfer dual-refinement network (termed CSDN), a coarse-to-fine paradigm with full-cycle participation of images, for high-quality point cloud completion. CSDN mainly consists of "shape fusion" and "dual-refinement" modules to tackle the cross-modal challenge. The first module transfers the intrinsic shape characteristics of single images to guide the geometry generation of the missing regions of point clouds, in which we propose IPAdaIN to embed the global features of both the image and the partial point cloud into the completion. The second module refines the coarse output by adjusting the positions of the generated points: the local refinement unit exploits the geometric relation between the generated and the input points via graph convolution, while the global constraint unit uses the input image to fine-tune the generated offsets. Unlike most existing approaches, CSDN not only explores the complementary information from images but also effectively exploits cross-modal data throughout the whole coarse-to-fine completion procedure. Experimental results indicate that CSDN performs favorably against twelve competitors on the cross-modal benchmark.
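IPAdaIN builds on adaptive instance normalization (AdaIN): one feature set is re-normalized to match the per-channel statistics of another, which is how image-derived shape characteristics can steer point-cloud feature generation. This sketch shows only the underlying AdaIN operation; how IPAdaIN conditions it on the global image and partial-cloud codes is specific to the paper:

```python
import numpy as np

def adain(content, style, eps=1e-5):
    # Adaptive instance normalization over (channels, elements) feature
    # maps: strip the content features' per-channel mean/std and replace
    # them with the style (here: image-derived) features' statistics.
    c_mu = content.mean(-1, keepdims=True)
    c_sd = content.std(-1, keepdims=True)
    s_mu = style.mean(-1, keepdims=True)
    s_sd = style.std(-1, keepdims=True)
    return s_sd * (content - c_mu) / (c_sd + eps) + s_mu
```

After the transfer, the content features carry the style branch's channel statistics while keeping their own spatial arrangement, the mechanism that lets one modality guide another.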
170
Liu M, Yang Z, Han W, Xie S. Progressive Neighbor-masked Contrastive Learning for Fusion-style Deep Multi-view Clustering. Neural Netw 2024; 179:106503. [PMID: 38986189 DOI: 10.1016/j.neunet.2024.106503] [Received: 02/26/2024] [Revised: 05/09/2024] [Accepted: 06/29/2024] [Indexed: 07/12/2024]
Abstract
Fusion-style Deep Multi-view Clustering (FDMC) can efficiently integrate comprehensive feature information from latent embeddings of multiple views and has drawn much attention recently. However, existing FDMC methods suffer from the interference of view-specific information for fusion representation, affecting the learning of discriminative cluster structure. In this paper, we propose a new framework of Progressive Neighbor-masked Contrastive Learning for FDMC (PNCL-FDMC) to tackle the aforementioned issues. Specifically, by using neighbor-masked contrastive learning, PNCL-FDMC can explicitly maintain the local structure during the embedding aggregation, which is beneficial to the common semantics enhancement on the fusion view. Based on the consistent aggregation, the fusion view is further enhanced by diversity-aware cluster structure enhancement. In this process, the enhanced cluster assignments and cluster discrepancies are employed to guide the weighted neighbor-masked contrastive alignment of semantic structure between individual views and the fusion view. Extensive experiments validate the effectiveness of the proposed framework, revealing its ability in discriminative representation learning and improving clustering performance.
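The neighbor-masking idea can be isolated in a small InfoNCE sketch: each anchor's known nearest neighbours are removed from the negative set, so aligning two views cannot push semantically close samples apart and the local structure survives the aggregation. This is a generic illustration of the mechanism, not the paper's progressive multi-view formulation; `nm_contrastive_loss` and its arguments are assumptions:

```python
import numpy as np

def nm_contrastive_loss(z1, z2, neighbors, tau=0.5):
    # InfoNCE between two views of a batch. For anchor i, position i in
    # the other view is the positive; entries listed in neighbors[i] are
    # masked out of the denominator so neighbours never act as negatives.
    z1 = z1 / np.linalg.norm(z1, axis=1, keepdims=True)
    z2 = z2 / np.linalg.norm(z2, axis=1, keepdims=True)
    sim = z1 @ z2.T / tau
    n = len(z1)
    mask = np.zeros_like(sim, dtype=bool)
    for i, nbrs in enumerate(neighbors):
        mask[i, nbrs] = True                      # neighbours: not negatives
    mask[np.arange(n), np.arange(n)] = False      # keep the positive pair
    sim = np.where(mask, -np.inf, sim)            # exp(-inf) = 0 below
    log_prob = sim[np.arange(n), np.arange(n)] - np.log(np.exp(sim).sum(axis=1))
    return float(-log_prob.mean())
```

Masking a neighbour strictly removes a repulsive term from the denominator, so the loss can only decrease, which is the formal sense in which local structure is protected.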
Affiliation(s)
- Mingyang Liu
- School of Automation, Guangdong Key Laboratory of IoT Information Technology, Guangdong University of Technology, Guangzhou, 510006, China.
- Zuyuan Yang
- School of Automation, Guangdong Key Laboratory of IoT Information Technology, Guangdong University of Technology, Guangzhou, 510006, China; Guangdong-Hong Kong-Macao Joint Laboratory for Smart Discrete Manufacturing, Guangzhou 510006, China.
- Wei Han
- Guangzhou Railway Polytechnic, Guangzhou, 511300, China.
- Shengli Xie
- School of Automation, Guangdong Key Laboratory of IoT Information Technology, Guangdong University of Technology, Guangzhou, 510006, China; Key Laboratory of iDetection and Manufacturing-IoT, Ministry of Education, Guangzhou 510006, China.
171
Sheng X, Li L, Liu D, Li H. VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision. IEEE Trans Pattern Anal Mach Intell 2024; 46:4579-4596. [PMID: 38252583 DOI: 10.1109/tpami.2024.3356548] [Indexed: 01/24/2024]
Abstract
Almost all digital videos are coded into compact representations before being transmitted. Such compact representations need to be decoded back to pixels before being displayed to humans and - as usual - before being enhanced/analyzed by machine vision algorithms. Intuitively, it is more efficient to enhance/analyze the coded representations directly without decoding them into pixels. Therefore, we propose a versatile neural video coding (VNVC) framework, which targets learning compact representations to support both reconstruction and direct enhancement/analysis, thereby being versatile for both human and machine vision. Our VNVC framework has a feature-based compression loop. In the loop, one frame is encoded into compact representations and decoded to an intermediate feature that is obtained before performing reconstruction. The intermediate feature can be used as reference in motion compensation and motion estimation through feature-based temporal context mining and cross-domain motion encoder-decoder to compress the following frames. The intermediate feature is directly fed into video reconstruction, video enhancement, and video analysis networks to evaluate its effectiveness. The evaluation shows that our framework with the intermediate feature achieves high compression efficiency for video reconstruction and satisfactory task performances with lower complexities.
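The core economy of the framework, transmit a compact representation and hand the decoded *feature* (not pixels) to reconstruction, enhancement, and analysis heads, can be illustrated with a toy feature codec. Uniform quantization stands in for VNVC's learned entropy coding and motion pipeline; `code_features` and its `step` parameter are illustrative assumptions:

```python
import numpy as np

def code_features(feat, step=0.5):
    # Toy feature-space codec: uniform scalar quantization of an
    # intermediate feature map. `symbols` is what would be entropy-coded
    # and transmitted; `decoded` is the decoder-side feature that can be
    # fed straight into downstream heads without a return trip to pixels.
    symbols = np.round(feat / step).astype(int)
    decoded = symbols * step
    return symbols, decoded
```

The coarser the step, the fewer bits the symbols need but the larger the feature distortion; the decoded feature is bounded within half a step of the original, which is what keeps downstream task performance usable.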
172
Jiang Z, Sun S, Peng H, Liu Z, Wang J. Multiple-in-Single-Out Object Detector Leveraging Spiking Neural Membrane Systems and Multiple Transformers. Int J Neural Syst 2024; 34:2450035. [PMID: 38616293 DOI: 10.1142/s0129065724500357] [Indexed: 04/16/2024]
Abstract
Most existing multi-scale object detectors depend on multi-level feature maps. The Feature Pyramid Networks (FPN) is a significant architecture for object detection that utilizes these multi-level feature maps. However, the use of FPN also increases the detector's complexity. For object detection methods that only use a single-level feature map, the detection performance is limited to some extent because the single-level feature map cannot balance deep semantic information and shallow detail information. We introduce a novel detector - the Spiking Neural P Multiple-in-Single-out (SNPMiSo) detector to address these challenges. The SNPMiSo detector is constructed based on SNP-like neurons. In SNPMiSo, we employ two kinds of Transformers to boost the important features across different-level feature maps separately. After enhancing the features, we use an incremental upsampling module to upsample and merge the two feature maps. This combined feature map is input into the NAF dilated residual module and the NAF dual-branch detection head. This process allows us to extract multi-scale features and carry out detection tasks. Our tests show promising results: On the COCO dataset, SNPMiSo attains an Average Precision (AP) of 38.7, an improvement of 1.0 AP over YOLOF. In addition, SNPMiSo demonstrates a quicker detection speed, outperforming some advanced multi-level and single-level object detectors.
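The incremental-upsampling step, bring the deeper (coarser) feature map to the shallower map's resolution and merge the two, is easy to sketch with nearest-neighbour repetition and element-wise addition. SNPMiSo's actual module is learned; the function below only illustrates the resolution-matching and merging idea:

```python
import numpy as np

def upsample_and_merge(deep, shallow):
    # Nearest-neighbour upsample the coarser feature map to the shallow
    # map's spatial size, then merge by element-wise addition so deep
    # semantics and shallow detail share one map.
    rh = shallow.shape[0] // deep.shape[0]
    rw = shallow.shape[1] // deep.shape[1]
    up = np.kron(deep, np.ones((rh, rw)))
    return up + shallow
```

The merged single map is what lets a single-level head see both deep semantic context and shallow detail, sidestepping the full FPN pyramid.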
Affiliation(s)
- Zhengyuan Jiang
- School of Computer and Software Engineering, Xihua University, Chengdu 610039, P. R. China
- Siyan Sun
- School of Computer and Software Engineering, Xihua University, Chengdu 610039, P. R. China
- Hong Peng
- School of Computer and Software Engineering, Xihua University, Chengdu 610039, P. R. China
- Zhicai Liu
- School of Computer and Software Engineering, Xihua University, Chengdu 610039, P. R. China
- Jun Wang
- School of Electrical Engineering and Electronic Information, Xihua University, Chengdu 610039, P. R. China
173
Leventhal S, Gyulassy A, Heimann M, Pascucci V. Exploring Classification of Topological Priors With Machine Learning for Feature Extraction. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2024; 30:3959-3972. [PMID: 37027638 DOI: 10.1109/tvcg.2023.3248632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
In many scientific endeavors, increasingly abstract representations of data allow for new interpretive methodologies and conceptualization of phenomena. For example, moving from raw imaged pixels to segmented and reconstructed objects gives researchers new insights and means to direct their studies toward relevant areas. Thus, the development of new and improved methods for segmentation remains an active area of research. With advances in machine learning and neural networks, scientists have focused on employing deep neural networks such as U-Net to obtain pixel-level segmentations, namely, defining associations between pixels and corresponding/referent objects and gathering those objects afterward. Topological analysis, such as the use of the Morse-Smale complex to encode regions of uniform gradient flow behavior, offers an alternative approach: first create geometric priors, and then apply machine learning to classify. This approach is empirically motivated, since phenomena of interest often appear as subsets of topological priors in many applications. Using topological elements not only reduces the learning space but also introduces the ability to use learnable geometries and connectivity to aid the classification of the segmentation target. In this article, we describe an approach to creating learnable topological elements, explore the application of ML techniques to classification tasks in a number of areas, and demonstrate this approach as a viable alternative to pixel-level classification, achieving similar accuracy with improved execution time and only marginal training data requirements.
174
Wang G, Datta A, Lindquist MA. Improved fMRI-based pain prediction using Bayesian group-wise functional registration. Biostatistics 2024; 25:885-903. [PMID: 37805937 DOI: 10.1093/biostatistics/kxad026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 08/22/2023] [Accepted: 08/27/2023] [Indexed: 10/10/2023] Open
Abstract
In recent years, the field of neuroimaging has undergone a paradigm shift, moving away from the traditional brain mapping approach towards the development of integrated, multivariate brain models that can predict categories of mental events. However, large interindividual differences in both brain anatomy and functional localization after standard anatomical alignment remain a major limitation in performing this type of analysis, as it leads to feature misalignment across subjects in subsequent predictive models. This article addresses this problem by developing and validating a new computational technique for reducing misalignment across individuals in functional brain systems by spatially transforming each subject's functional data to a common latent template map. Our proposed Bayesian functional group-wise registration approach allows us to assess differences in brain function across subjects and individual differences in activation topology. We achieve the probabilistic registration with inverse-consistency by utilizing the generalized Bayes framework with a loss function for the symmetric group-wise registration. It models the latent template with a Gaussian process, which helps capture spatial features in the template, producing a more precise estimation. We evaluate the method in simulation studies and apply it to data from an fMRI study of thermal pain, with the goal of using functional brain activity to predict physical pain. We find that the proposed approach allows for improved prediction of reported pain scores over conventional approaches.
Affiliation(s)
- Guoqing Wang
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, 615 N Wolfe St, Baltimore, MD 21205, USA
- Abhirup Datta
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, 615 N Wolfe St, Baltimore, MD 21205, USA
- Martin A Lindquist
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, 615 N Wolfe St, Baltimore, MD 21205, USA
175
Huang C, Shi Y, Zhang B, Lyu K. Uncertainty-aware prototypical learning for anomaly detection in medical images. Neural Netw 2024; 175:106284. [PMID: 38593560 DOI: 10.1016/j.neunet.2024.106284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 03/14/2024] [Accepted: 03/29/2024] [Indexed: 04/11/2024]
Abstract
Anomalous object detection (AOD) in medical images aims to recognize anomalous lesions and is crucial for the early clinical diagnosis of various cancers. However, it is a difficult task for two reasons: (1) the diversity of anomalous lesions and (2) the ambiguity of the boundary between anomalous lesions and their normal surroundings. Unlike existing single-modality AOD models based on deterministic mapping, we constructed a jointly probabilistic and deterministic AOD model. Specifically, we designed an uncertainty-aware prototype learning framework that accounts for the diversity and ambiguity of anomalous lesions. A prototypical learning transformer (Pformer) is established to extract and store the prototype features of different anomalous lesions. Moreover, a Bayesian neural uncertainty quantizer, a probabilistic model, is designed to model distributions over the model's outputs and thereby measure the uncertainty of its detection result for each pixel. Essentially, the uncertainty of the anomaly detection result for a pixel reflects the anomalous ambiguity of that pixel. Furthermore, an uncertainty-guided reasoning transformer (Uformer) is devised to exploit this ambiguity, encouraging the model to focus on pixels with high uncertainty. Notably, the prototypical representations stored in Pformer are also utilized in anomaly reasoning, enabling the model to perceive the diversity of anomalous objects. Extensive experiments on five benchmark datasets demonstrate the superiority of our proposed method. The source code will be available at github.com/umchaohuang/UPformer.
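The role of a per-pixel uncertainty estimate (measuring how much stochastic predictions disagree) can be approximated with a Monte-Carlo-style toy. The linear "detector" and dropout rate below are assumptions for illustration only, not the paper's Bayesian neural uncertainty quantizer.

```python
import numpy as np

def stochastic_forward(x, rng, drop=0.3):
    """One stochastic pass: a fixed sigmoid 'detector' with random dropout,
    standing in for sampling from a posterior over model outputs."""
    mask = rng.random(x.shape) > drop
    return 1 / (1 + np.exp(-(x * mask)))      # per-pixel anomaly probability

def predict_with_uncertainty(x, n=50, seed=0):
    """Mean over stochastic passes is the prediction; the spread across
    passes is the per-pixel uncertainty."""
    rng = np.random.default_rng(seed)
    samples = np.stack([stochastic_forward(x, rng) for _ in range(n)])
    return samples.mean(axis=0), samples.std(axis=0)

scores = np.array([[4.0, 0.0], [-4.0, 0.2]])  # toy per-pixel logits
mean, unc = predict_with_uncertainty(scores)
```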
Affiliation(s)
- Chao Huang
- PAMI Research Group, Department of Computer and Information Science, University of Macau, Taipa, 519000, Macao Special Administrative Region of China; Shenzhen Campus of Sun Yat-sen University, School of Cyber Science and Technology, Shenzhen, 518107, China
- Yushu Shi
- Shenzhen Campus of Sun Yat-sen University, School of Cyber Science and Technology, Shenzhen, 518107, China
- Bob Zhang
- PAMI Research Group, Department of Computer and Information Science, University of Macau, Taipa, 519000, Macao Special Administrative Region of China
- Ke Lyu
- School of Engineering Sciences, University of the Chinese Academy of Sciences, Beijing, 100049, China; Pengcheng Laboratory, Shenzhen, 518055, China
176
de Silva A, Zhao M, Stewart D, Khan FH, Dusek G, Davis J, Pang A. RipViz: Finding Rip Currents by Learning Pathline Behavior. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2024; 30:3930-3944. [PMID: 37022897 DOI: 10.1109/tvcg.2023.3243834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
We present RipViz, a hybrid machine learning and flow analysis feature detection method that extracts rip currents from stationary videos. Rip currents are dangerously strong currents that can drag beachgoers out to sea. Most people are either unaware of them or do not know what they look like. In some instances, even trained personnel such as lifeguards have difficulty identifying them. RipViz produces a simple, easy-to-understand visualization of the rip current's location overlaid on the source video. With RipViz, we first obtain an unsteady 2D vector field from the stationary video using optical flow, and the movement at each pixel is analyzed over time. At each seed point, sequences of short pathlines, rather than a single long pathline, are traced across the frames of the video to better capture the quasi-periodic flow behavior of wave activity. Because of the motion on the beach, in the surf zone, and in the surrounding areas, these pathlines may still appear very cluttered and incomprehensible. Furthermore, lay audiences are not familiar with pathlines and may not know how to interpret them. To address this, we treat rip currents as a flow anomaly in an otherwise normal flow. To learn the normal flow behavior, we train an LSTM autoencoder with pathline sequences from normal ocean, foreground, and background movements. At test time, we use the trained LSTM autoencoder to detect anomalous pathlines (i.e., those in the rip zone). The origination points of such anomalous pathlines, over the course of the video, are then presented as points within the rip zone. RipViz is fully automated and does not require user input. Feedback from domain experts suggests that RipViz has the potential for wider use.
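The detection principle (learn a compact code for normal pathlines, then flag pathlines that the learned model reconstructs poorly) can be sketched by substituting a linear (PCA) autoencoder for the LSTM autoencoder. The wave-like synthetic pathlines below are hypothetical stand-ins for optical-flow traces.

```python
import numpy as np

def fit_linear_autoencoder(X, k=2):
    """PCA as a linear stand-in for the LSTM autoencoder: learn a k-dim
    code from 'normal' pathline sequences (rows of X)."""
    mu = X.mean(axis=0)
    _, _, Vt = np.linalg.svd(X - mu, full_matrices=False)
    return mu, Vt[:k]

def reconstruction_error(X, mu, basis):
    """Anomalous pathlines reconstruct poorly from the normal-flow code."""
    Z = (X - mu) @ basis.T
    Xr = Z @ basis + mu
    return np.linalg.norm(X - Xr, axis=1)

rng = np.random.default_rng(1)
t = np.linspace(0, 1, 20)
# Quasi-periodic "normal" pathlines: phase-shifted waves span a 2D subspace
normal = np.stack([np.sin(2 * np.pi * (t + p)) for p in rng.random(100)])
mu, basis = fit_linear_autoencoder(normal, k=2)
rip = rng.normal(size=(1, 20))            # erratic pathline, unlike training flow
err_normal = reconstruction_error(normal, mu, basis).mean()
err_rip = reconstruction_error(rip, mu, basis)[0]
```

Thresholding the reconstruction error then separates rip-zone pathlines from normal wave motion, mirroring the anomaly-detection step above.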
177
Yousef AM, Deliyski DD, Zacharias SRC, Naghibolhosseini M. Detection of Vocal Fold Image Obstructions in High-Speed Videoendoscopy During Connected Speech in Adductor Spasmodic Dysphonia: A Convolutional Neural Networks Approach. J Voice 2024; 38:951-962. [PMID: 35304042 PMCID: PMC9474736 DOI: 10.1016/j.jvoice.2022.01.028] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Revised: 01/30/2022] [Accepted: 01/30/2022] [Indexed: 01/10/2023]
Abstract
OBJECTIVE Adductor spasmodic dysphonia (AdSD) is a neurogenic voice disorder affecting intrinsic laryngeal muscle control. AdSD leads to involuntary laryngeal spasms that reveal themselves only during connected speech. Laryngeal high-speed videoendoscopy (HSV) coupled with a flexible fiberoptic endoscope provides a unique opportunity to study voice production and visualize the vocal fold vibrations in AdSD during speech. The goal of this study is to automatically detect instances during which the image of the vocal folds is optically obstructed in HSV recordings obtained during connected speech. METHODS HSV data were recorded from vocally normal adults and patients with AdSD during reading of the "Rainbow Passage", six CAPE-V sentences, and production of the vowel /i/. A convolutional neural network was developed and trained as a classifier to detect obstructed/unobstructed vocal folds in HSV frames. Manually labelled data were used for training, validation, and testing of the network. Moreover, a comprehensive robustness evaluation was conducted to compare the performance of the developed classifier with visual analysis of HSV data. RESULTS The developed convolutional neural network was able to automatically detect vocal fold obstructions in HSV data from vocally normal participants and AdSD patients. The trained network was tested successfully and showed an overall classification accuracy of 94.18% on the testing dataset. The robustness evaluation showed an average overall accuracy of 94.81% on a large number of HSV frames, demonstrating that the introduced technique is highly robust while maintaining a high level of accuracy. CONCLUSIONS The proposed approach can be used for efficient analysis of HSV data to study laryngeal maneuvers in patients with AdSD during connected speech. Additionally, this method will facilitate the development of vocal fold vibratory measures for HSV frames with an unobstructed view of the vocal folds. Indicating the parts of connected speech that provide an unobstructed view of the vocal folds can aid in developing optimal passages for precise HSV examination during connected speech and subject-specific clinical voice assessment protocols.
Affiliation(s)
- Ahmed M Yousef
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
- Dimitar D Deliyski
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
- Stephanie R C Zacharias
- Head and Neck Regenerative Medicine Program, Mayo Clinic, Scottsdale, Arizona; Department of Otolaryngology-Head and Neck Surgery, Mayo Clinic, Phoenix, Arizona
- Maryam Naghibolhosseini
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
178
Zhang O, Dahlquist N, Leete Z, Xu M, Schneider D, Yang C. Long-term imaging of three-dimensional hyphal development using the ePetri dish. BIOMEDICAL OPTICS EXPRESS 2024; 15:4292-4299. [PMID: 39022548 PMCID: PMC11249690 DOI: 10.1364/boe.530483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Revised: 06/07/2024] [Accepted: 06/07/2024] [Indexed: 07/20/2024]
Abstract
Imaging three-dimensional microbial development and behavior over extended periods is crucial for advancing microbiological studies. Here, we introduce an upgraded ePetri dish system specifically designed for extended microbial culturing and 3D imaging, addressing the limitations of existing methods. Our approach includes a sealed growth chamber to enable long-term culturing, and a multi-step reconstruction algorithm that integrates 3D deconvolution, image filtering, and ridge and skeleton detection for detailed visualization of the hyphal network. The system effectively monitored the development of Aspergillus brasiliensis hyphae over a seven-day period, demonstrating the growth medium's stability within the chamber. The system's 3D imaging capability was validated in a volume of 5.5 mm × 4 mm × 0.5 mm, revealing a radial growth pattern of fungal hyphae. Additionally, we show that the system can identify potential filter failures that are undetectable with 2D imaging. With these capabilities, the upgraded ePetri dish represents a significant advancement in long-term 3D microbial imaging, promising new insights into microbial development and behavior across various microbiological research areas.
Affiliation(s)
- Oumeng Zhang
- Division of Engineering and Applied Science, California Institute of Technology, 1200 E California Blvd., Pasadena, CA 91125, USA
- Nic Dahlquist
- Mango Inc, 1314 Westwood Blvd., Los Angeles, CA 90024, USA
- Zachary Leete
- Mango Inc, 1314 Westwood Blvd., Los Angeles, CA 90024, USA
- Michael Xu
- Mango Inc, 1314 Westwood Blvd., Los Angeles, CA 90024, USA
- Dean Schneider
- Mango Inc, 1314 Westwood Blvd., Los Angeles, CA 90024, USA
- Changhuei Yang
- Division of Engineering and Applied Science, California Institute of Technology, 1200 E California Blvd., Pasadena, CA 91125, USA
179
Yoshioka H, Jin R, Hisaka A, Suzuki H. Disease progression modeling with temporal realignment: An emerging approach to deepen knowledge on chronic diseases. Pharmacol Ther 2024; 259:108655. [PMID: 38710372 DOI: 10.1016/j.pharmthera.2024.108655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 04/22/2024] [Accepted: 05/01/2024] [Indexed: 05/08/2024]
Abstract
The recent development of the first disease-modifying drug for Alzheimer's disease represents a major advancement in dementia treatment. Behind this breakthrough is a quarter century of research efforts to understand the disease not by a particular symptom at a given moment, but by long-term sequential changes in multiple biomarkers. Disease progression modeling with temporal realignment (DPM-TR) is an emerging computational approach proposed with this biomarker-based disease concept. By integrating short-term clinical observations of multiple disease biomarkers in a data-driven manner, DPM-TR provides a way to understand the progression of chronic diseases over decades and predict individual disease stages more accurately. DPM-TR has been developed primarily in the area of neurodegenerative diseases but has recently been extended to non-neurodegenerative diseases, including chronic obstructive pulmonary, autoimmune, and ophthalmologic diseases. This review focuses on opportunities for DPM-TR in clinical practice and drug development and discusses its current status and challenges.
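The essence of temporal realignment (placing each subject's short observation window onto a common long-term disease trajectory) can be sketched as a one-parameter grid search. The logistic template and the synthetic subjects below are illustrative assumptions, not a clinical model.

```python
import numpy as np

def template(t):
    """Assumed common long-term trajectory of a biomarker (logistic curve
    progressing over roughly 20 'years' of disease time)."""
    return 1 / (1 + np.exp(-(t - 10) / 2))

def align_subject(times, values, grid=np.linspace(0, 20, 201)):
    """Find the disease-time offset that best aligns a subject's short
    observation window onto the template (least-squares grid search)."""
    errs = [np.sum((template(times + s) - values) ** 2) for s in grid]
    return grid[int(np.argmin(errs))]

# Two subjects, each observed over only 3 'years', at different true stages
obs_t = np.array([0.0, 1.0, 2.0])
early = template(obs_t + 4.0)    # truly 4 years into the disease
late = template(obs_t + 13.0)    # truly 13 years in
s_early = align_subject(obs_t, early)
s_late = align_subject(obs_t, late)
```

Stitching many such realigned windows together is what lets short clinical observations reconstruct a decades-long progression curve.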
Affiliation(s)
- Hideki Yoshioka
- Office of Regulatory Science Research, Pharmaceuticals and Medical Devices Agency, Tokyo, Japan; Laboratory of Clinical Pharmacology and Pharmacometrics, Graduate School of Pharmaceutical Sciences, Chiba University, Chiba, Japan
- Ryota Jin
- Laboratory of Clinical Pharmacology and Pharmacometrics, Graduate School of Pharmaceutical Sciences, Chiba University, Chiba, Japan
- Akihiro Hisaka
- Laboratory of Clinical Pharmacology and Pharmacometrics, Graduate School of Pharmaceutical Sciences, Chiba University, Chiba, Japan
- Hiroshi Suzuki
- Executive Director, Pharmaceuticals and Medical Devices Agency, Tokyo, Japan; Department of Pharmacy, The University of Tokyo Hospital, Faculty of Medicine, The University of Tokyo, Tokyo, Japan
180
Pundhir A, Sagar S, Singh P, Raman B. Echoes of images: multi-loss network for image retrieval in vision transformers. Med Biol Eng Comput 2024; 62:2037-2058. [PMID: 38436836 DOI: 10.1007/s11517-024-03055-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 02/16/2024] [Indexed: 03/05/2024]
Abstract
This paper introduces a novel approach to enhance content-based image retrieval, validated on two benchmark datasets: ISIC-2017 and ISIC-2018. These datasets comprise skin lesion images that are crucial for innovations in skin cancer diagnosis and treatment. We advocate the use of a pre-trained Vision Transformer (ViT), a relatively uncharted concept in the realm of image retrieval, particularly in medical scenarios. In contrast to the traditionally employed Convolutional Neural Networks (CNNs), our findings suggest that ViT offers a more comprehensive understanding of image context, which is essential in medical imaging. We further incorporate a weighted multi-loss function, examining losses such as triplet loss, distillation loss, contrastive loss, and cross-entropy loss. We investigate the most resilient combination of these losses to create a robust multi-loss function, enhancing the robustness of the learned feature space and improving precision and recall in the retrieval process. Rather than using all the loss functions, the proposed multi-loss function combines only cross-entropy loss, triplet loss, and distillation loss, improving mean average precision by 6.52% and 3.45% on ISIC-2017 and ISIC-2018, respectively. Another innovation in our methodology is a two-branch network strategy that concurrently boosts image retrieval and classification. Through our experiments, we underscore the effectiveness and the pitfalls of diverse loss configurations in image retrieval. Furthermore, our approach underlines the advantages of retrieval-based classification through majority voting rather than relying solely on the classification head, leading to enhanced prediction for melanoma - the most lethal type of skin cancer. Our results surpass existing state-of-the-art techniques on the ISIC-2017 and ISIC-2018 datasets, improving mean average precision by 1.01% and 4.36%, respectively, emphasizing the efficacy and promise of Vision Transformers paired with our tailor-made weighted loss function, especially in medical contexts. The proposed approach's effectiveness is substantiated through thorough ablation studies and an array of quantitative and qualitative outcomes. To promote reproducibility and support forthcoming research, our source code will be accessible on GitHub.
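The reported loss combination (cross-entropy plus triplet plus distillation) can be written down concretely. The NumPy sketch below uses illustrative weights and toy inputs; the actual weights, logits, and embeddings are learned quantities in the paper.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def cross_entropy(logits, label):
    """Standard classification loss on a single example."""
    return -np.log(softmax(logits)[label])

def triplet(anchor, pos, neg, margin=0.2):
    """Pull the positive embedding closer than the negative by a margin."""
    d_ap = np.linalg.norm(anchor - pos)
    d_an = np.linalg.norm(anchor - neg)
    return max(0.0, d_ap - d_an + margin)

def distillation(student_logits, teacher_logits, T=2.0):
    """KL divergence between temperature-softened distributions."""
    p = softmax(teacher_logits / T)
    q = softmax(student_logits / T)
    return float(np.sum(p * (np.log(p) - np.log(q))))

def multi_loss(logits, label, anchor, pos, neg, teacher_logits,
               w=(1.0, 0.5, 0.5)):
    """Weighted sum of the three losses the abstract reports as the most
    effective combination (weights here are illustrative)."""
    return (w[0] * cross_entropy(logits, label)
            + w[1] * triplet(anchor, pos, neg)
            + w[2] * distillation(logits, teacher_logits))

logits = np.array([2.0, 0.5])
total = multi_loss(logits, 0,
                   anchor=np.array([0.0, 0.0]),
                   pos=np.array([0.1, 0.0]),
                   neg=np.array([1.0, 1.0]),
                   teacher_logits=np.array([2.2, 0.3]))
```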
Affiliation(s)
- Anshul Pundhir
- Department of Computer Science and Engineering, Indian Institute of Technology, Roorkee, 247667, Uttarakhand, India
- Shivam Sagar
- Department of Electrical Engineering, Indian Institute of Technology, Roorkee, 247667, Uttarakhand, India
- Pradeep Singh
- Department of Computer Science and Engineering, Indian Institute of Technology, Roorkee, 247667, Uttarakhand, India
- Balasubramanian Raman
- Department of Computer Science and Engineering, Indian Institute of Technology, Roorkee, 247667, Uttarakhand, India
181
Ye Q, Yin H, Lin J, Liang J, Xie M, Ye C, Zhou B, Huang A, Wu Z, Li X, Wu Y. Improved nested U-structure for accurate nailfold capillary segmentation. Microvasc Res 2024; 154:104680. [PMID: 38484792 DOI: 10.1016/j.mvr.2024.104680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 02/28/2024] [Accepted: 03/11/2024] [Indexed: 03/18/2024]
Abstract
Changes in the structure and function of nailfold capillaries may be indicators of numerous diseases. Noninvasive diagnostic tools are commonly used for the extraction of morphological information from segmented nailfold capillaries to study physiological and pathological changes therein. However, current segmentation methods for nailfold capillaries cannot accurately separate capillaries from the background, resulting in issues such as unclear segmentation boundaries. Therefore, improving the accuracy of nailfold capillary segmentation is necessary to facilitate more efficient clinical diagnosis and research. Herein, we propose a nailfold capillary image segmentation method based on a U2-Net backbone network combined with a Transformer structure. This method integrates the U2-Net and Transformer networks to establish a decoder-encoder network, which inserts Transformer layers into the nested two-layer U-shaped architecture of the U2-Net. This structure effectively extracts multiscale features within stages and aggregates multilevel features across stages to generate high-resolution feature maps. The experimental results demonstrate an overall accuracy of 98.23%, a Dice coefficient of 88.56%, and an IoU of 80.41% compared to the ground truth. Furthermore, our proposed method improves the overall accuracy by approximately 2%, 3%, and 5% compared to the original U2-Net, Res-Unet, and U-Net, respectively. These results indicate that the Transformer-U2Net network performs well in nailfold capillary image segmentation and provides more detailed and accurate information on the segmented nailfold capillary structure, which may aid clinicians in the more precise diagnosis and treatment of nailfold capillary-related diseases.
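The Dice coefficient and IoU reported above are simple set-overlap measures on binary masks. A minimal NumPy implementation, with a toy shifted-square example standing in for a predicted capillary mask:

```python
import numpy as np

def dice(pred, gt):
    """Dice coefficient between two binary masks: 2|A∩B| / (|A| + |B|)."""
    inter = np.logical_and(pred, gt).sum()
    return 2.0 * inter / (pred.sum() + gt.sum())

def iou(pred, gt):
    """Intersection over union between two binary masks: |A∩B| / |A∪B|."""
    inter = np.logical_and(pred, gt).sum()
    union = np.logical_or(pred, gt).sum()
    return inter / union

gt = np.zeros((8, 8), dtype=bool); gt[2:6, 2:6] = True      # 16-px "capillary"
pred = np.zeros((8, 8), dtype=bool); pred[3:7, 2:6] = True  # shifted by one row
d, j = dice(pred, gt), iou(pred, gt)
```

The two metrics are monotonically related (Dice = 2·IoU / (1 + IoU)), which is why papers often report both.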
Affiliation(s)
- Qianyao Ye
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan 528000, China
- Hao Yin
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan 528000, China
- Jianan Lin
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan 528000, China
- Junzhao Liang
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan 528000, China
- Mugui Xie
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan 528000, China
- Cong Ye
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan 528000, China
- Bin Zhou
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan 528000, China
- An Huang
- School of Mechatronic Engineering and Automation, Foshan University, Foshan 528000, China
- Zhiwei Wu
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan 528000, China
- Xiaosong Li
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan 528000, China
- Yanxiong Wu
- School of Physics and Optoelectronic Engineering, Foshan University, Foshan 528000, China; Ji Hua Laboratory, Foshan, Guangdong 528200, China
182
Wang X, Lv Q, Chen G, Zhang J, Wei Z, Dong J, Fu H, Zhu Z, Liu J, Jin X. MobileSky: Real-Time Sky Replacement for Mobile AR. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2024; 30:4304-4320. [PMID: 37030763 DOI: 10.1109/tvcg.2023.3257840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
We present MobileSky, the first automatic method for real-time, high-quality sky replacement for mobile AR applications. The primary challenge of this task is extracting sky regions from the camera feed both quickly and accurately. While the problem of sky replacement is not new, previous methods mainly target extraction quality rather than efficiency, limiting their applicability to our task. We aim to provide high-quality, spatially and temporally consistent sky mask maps for all camera frames in real time. To this end, we develop a novel framework that combines a new deep semantic network, called FSNet, with novel post-processing refinement steps. By leveraging IMU data, we also propose new sky-aware constraints, such as temporal consistency, position consistency, and color consistency, to help refine the weakly classified parts of the segmentation output. Experiments show that our method achieves an average of around 30 FPS on off-the-shelf smartphones and outperforms state-of-the-art sky replacement methods in terms of execution speed and quality. Moreover, our mask maps are visually more stable across frames. Our fast sky replacement method enables several applications, such as AR advertising, art making, generating fantasy celestial objects, visually learning about weather phenomena, and advanced video-based visual effects. To facilitate future research, we also create a new video dataset containing annotated sky regions with IMU data.
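One generic form of temporal-consistency refinement is an exponential moving average over per-frame sky probabilities, which suppresses frame-to-frame flicker in the mask. This is a simplified sketch of the general idea, not FSNet or its IMU-based constraints.

```python
import numpy as np

def temporally_smooth(masks, alpha=0.6):
    """Exponential moving average over per-frame sky-probability maps:
    a simple stand-in for temporal-consistency refinement."""
    out, state = [], masks[0].astype(float)
    for m in masks:
        state = alpha * state + (1 - alpha) * m
        out.append(state)
    return np.stack(out)

# A pixel that flickers between sky (1) and non-sky (0) across five frames
flicker = np.array([[[1.0]], [[0.0]], [[1.0]], [[0.0]], [[1.0]]])
smooth = temporally_smooth(flicker)
jitter_raw = np.abs(np.diff(flicker[:, 0, 0])).max()
jitter_smooth = np.abs(np.diff(smooth[:, 0, 0])).max()
```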
183
Kim S, Park H, Kang M, Jin KH, Adeli E, Pohl KM, Park SH. Federated learning with knowledge distillation for multi-organ segmentation with partially labeled datasets. Med Image Anal 2024; 95:103156. [PMID: 38603844 DOI: 10.1016/j.media.2024.103156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 03/11/2024] [Accepted: 03/20/2024] [Indexed: 04/13/2024]
Abstract
The state-of-the-art multi-organ CT segmentation relies on deep learning models, which only generalize when trained on large samples of carefully curated data. However, it is challenging to train a single model that can segment all organs and types of tumors, since most large datasets are partially labeled or are acquired across multiple institutes that may differ in their acquisitions. A possible solution is federated learning, which is often used to train models on multi-institutional datasets where the data is not shared across sites. However, a federated model's predictions can become unreliable after it is locally updated at individual sites, due to 'catastrophic forgetting'. Here, we address this issue by using knowledge distillation (KD), so that local training is regularized with the knowledge of a global model and of pre-trained organ-specific segmentation models. We implement the models in a multi-head U-Net architecture that learns a shared embedding space for the different organ segmentations, thereby obtaining multi-organ predictions without repeated processing. We evaluate the proposed method using 8 publicly available abdominal CT datasets covering 7 different organs. Of those datasets, 889 CTs were used for training, 233 for internal testing, and 30 for external testing. Experimental results verified that our proposed method substantially outperforms other state-of-the-art methods in terms of accuracy, inference time, and the number of parameters.
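The core KD regularizer, a KL divergence that keeps the locally updated model close to the global model's softened predictions, can be written in a few lines. The temperature and toy logits below are illustrative assumptions, not the paper's training configuration.

```python
import numpy as np

def softmax(z, T=1.0):
    e = np.exp(z / T - (z / T).max())
    return e / e.sum()

def kd_loss(local_logits, global_logits, T=2.0):
    """KL divergence from the global model's temperature-softened predictions
    to the local model's: penalizes drift ('forgetting') during local updates."""
    p = softmax(global_logits, T)
    q = softmax(local_logits, T)
    return float(np.sum(p * (np.log(p) - np.log(q))))

g = np.array([2.0, 0.5, -1.0])          # global model's logits for one voxel
same = kd_loss(g, g)                    # no drift, no penalty
drifted = kd_loss(np.array([-1.0, 0.5, 2.0]), g)  # class flip, large penalty
```

Adding this term to the local site's segmentation loss is what anchors each local update to the globally shared knowledge.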
Affiliation(s)
- Soopil Kim
- Department of Robotics and Mechatronics Engineering, Daegu Gyeongbuk Institute of Science and Technology, Republic of Korea; Department of Psychiatry and Behavioral Sciences, Stanford University, CA 94305, USA
- Heejung Park
- Department of Robotics and Mechatronics Engineering, Daegu Gyeongbuk Institute of Science and Technology, Republic of Korea
- Myeongkyun Kang
- Department of Robotics and Mechatronics Engineering, Daegu Gyeongbuk Institute of Science and Technology, Republic of Korea; Department of Psychiatry and Behavioral Sciences, Stanford University, CA 94305, USA
- Kyong Hwan Jin
- School of Electrical Engineering, Korea University, Republic of Korea
- Ehsan Adeli
- Department of Psychiatry and Behavioral Sciences, Stanford University, CA 94305, USA
- Kilian M Pohl
- Department of Psychiatry and Behavioral Sciences, Stanford University, CA 94305, USA
- Sang Hyun Park
- Department of Robotics and Mechatronics Engineering, Daegu Gyeongbuk Institute of Science and Technology, Republic of Korea
184
Kang S, Oh HS. Probabilistic Principal Curves on Riemannian Manifolds. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2024; 46:4843-4849. [PMID: 38265902 DOI: 10.1109/tpami.2024.3357801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2024]
Abstract
This paper studies a new curve-fitting approach to data on Riemannian manifolds. We define a principal curve based on a mixture model for observations and unobserved latent variables and propose a new algorithm to estimate the principal curve for given data points on Riemannian manifolds.
185
Zou J, Song Y, Liu L, Aviles-Rivero AI, Qin J. Unsupervised lung CT image registration via stochastic decomposition of deformation fields. Comput Med Imaging Graph 2024; 115:102397. [PMID: 38735104 DOI: 10.1016/j.compmedimag.2024.102397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2023] [Revised: 01/30/2024] [Accepted: 05/01/2024] [Indexed: 05/14/2024]
Abstract
We address the problem of lung CT image registration, which underpins various diagnoses and treatments for lung diseases. The crux of the problem is the large deformation that the lungs undergo during respiration. This physiological process imposes several challenges from a learning point of view. In this paper, we propose a novel training scheme, called stochastic decomposition, which enables deep networks to effectively learn such a difficult deformation field during lung CT image registration. The key idea is to stochastically decompose the deformation field and supervise the registration with synthetic data that exhibit the corresponding appearance discrepancy. The stochastic decomposition reveals all possible decompositions of the deformation field. At the learning level, these decompositions can be seen as a prior that reduces the ill-posedness of the registration and thereby boosts performance. We demonstrate the effectiveness of our framework on lung CT data and show, through extensive numerical and visual results, that our technique outperforms existing methods.
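The decomposition idea, splitting a deformation field into components that sum back to the original so that training can sample many partial deformations, can be sketched as follows. The per-voxel random fraction is one simple decomposition scheme, assumed here purely for illustration.

```python
import numpy as np

def stochastic_decompose(field, rng):
    """Split a dense deformation field into two random components that sum
    back to the original: u = u1 + u2, with u1 a random fraction per voxel."""
    frac = rng.uniform(0.0, 1.0, size=field.shape)
    u1 = frac * field
    return u1, field - u1

rng = np.random.default_rng(42)
u = rng.normal(scale=3.0, size=(2, 16, 16))  # toy 2D displacement field (dy, dx)
u1, u2 = stochastic_decompose(u, rng)        # one sampled decomposition of u
```

Resampling `frac` at every training step exposes the network to a different decomposition of the same large deformation each time.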
Collapse
Affiliation(s)
- Jing Zou
- Center for Smart Health, School of Nursing, The Hong Kong Polytechnic University, Hung Hom, Hong Kong, China
| | - Youyi Song
- Department of Data Science, School of Science, China Pharmaceutical University, Nanjing, 210009, China
| | - Lihao Liu
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, CB3 0WA, UK
| | - Angelica I Aviles-Rivero
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, CB3 0WA, UK
| | - Jing Qin
- Center for Smart Health, School of Nursing, The Hong Kong Polytechnic University, Hung Hom, Hong Kong, China.
| |
Collapse
|
186
|
Kolluru C, Joseph N, Seckler J, Fereidouni F, Levenson R, Shoffstall A, Jenkins M, Wilson D. NerveTracker: a Python-based software toolkit for visualizing and tracking groups of nerve fibers in serial block-face microscopy with ultraviolet surface excitation images. JOURNAL OF BIOMEDICAL OPTICS 2024; 29:076501. [PMID: 38912214 PMCID: PMC11188586 DOI: 10.1117/1.jbo.29.7.076501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 05/10/2024] [Accepted: 05/13/2024] [Indexed: 06/25/2024]
Abstract
Significance Information about the spatial organization of fibers within a nerve is crucial to our understanding of nerve anatomy and its response to neuromodulation therapies. A serial block-face microscopy method [three-dimensional microscopy with ultraviolet surface excitation (3D-MUSE)] has been developed to image nerves over extended depths ex vivo. To routinely visualize and track nerve fibers in these datasets, a dedicated and customizable software tool is required. Aim Our objective was to develop custom software that includes image processing and visualization methods to perform microscopic tractography along the length of a peripheral nerve sample. Approach We modified common computer vision algorithms (optic flow and structure tensor) to track groups of peripheral nerve fibers along the length of the nerve. Interactive streamline visualization and manual editing tools are provided. Optionally, deep learning segmentation of fascicles (fiber bundles) can be applied to constrain the tracts from inadvertently crossing into the epineurium. As an example, we performed tractography on vagus and tibial nerve datasets and assessed accuracy by comparing the resulting nerve tracts with segmentations of fascicles as they split and merge with each other in the nerve sample stack. Results We found that a normalized Dice overlap (Dice_norm) metric had a mean value above 0.75 across several millimeters along the nerve. We also found that the tractograms were robust to changes in certain image properties (e.g., downsampling in-plane and out-of-plane), which resulted in only a 2% to 9% change to the mean Dice_norm values. In a vagus nerve sample, tractography allowed us to readily identify that subsets of fibers from four distinct fascicles merge into a single fascicle as we move ∼5 mm along the nerve's length. Conclusions Overall, we demonstrated the feasibility of performing automated microscopic tractography on 3D-MUSE datasets of peripheral nerves.
The software should be applicable to other imaging approaches. The code is available at https://github.com/ckolluru/NerveTracker.
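Of the two tracking ingredients named in the abstract, the structure tensor is the easier to sketch: smooth the outer products of the image gradients and read the dominant orientation off the principal eigenvector. A minimal version, with a box blur standing in for the toolkit's actual filtering (function names are illustrative):

```python
import numpy as np

def box_blur(a, passes=4):
    """Crude separable smoothing by repeated 4-neighbour averaging."""
    for _ in range(passes):
        a = (a + np.roll(a, 1, 0) + np.roll(a, -1, 0)
               + np.roll(a, 1, 1) + np.roll(a, -1, 1)) / 5.0
    return a

def dominant_orientation(img):
    """Dominant gradient orientation (radians) from the structure tensor.

    J = [[<Ix Ix>, <Ix Iy>], [<Ix Iy>, <Iy Iy>]], smoothed componentwise;
    the principal-eigenvector angle is 0.5 * atan2(2 Jxy, Jxx - Jyy).
    """
    Iy, Ix = np.gradient(img.astype(float))   # axis 0 = rows (y), axis 1 = cols (x)
    Jxx = box_blur(Ix * Ix).mean()
    Jyy = box_blur(Iy * Iy).mean()
    Jxy = box_blur(Ix * Iy).mean()
    return 0.5 * np.arctan2(2.0 * Jxy, Jxx - Jyy)
```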
Collapse
Affiliation(s)
- Chaitanya Kolluru
- Case Western Reserve University, Department of Biomedical Engineering, Cleveland, Ohio, United States
| | - Naomi Joseph
- Case Western Reserve University, Department of Biomedical Engineering, Cleveland, Ohio, United States
| | - James Seckler
- Case Western Reserve University, Department of Biomedical Engineering, Cleveland, Ohio, United States
| | - Farzad Fereidouni
- UC Davis Medical Center, Department of Pathology and Laboratory Medicine, Sacramento, California, United States
| | - Richard Levenson
- UC Davis Medical Center, Department of Pathology and Laboratory Medicine, Sacramento, California, United States
| | - Andrew Shoffstall
- Case Western Reserve University, Department of Biomedical Engineering, Cleveland, Ohio, United States
- Louis Stokes Cleveland VA Medical Center, Cleveland, Ohio, United States
| | - Michael Jenkins
- Case Western Reserve University, Department of Biomedical Engineering, Cleveland, Ohio, United States
- Louis Stokes Cleveland VA Medical Center, Cleveland, Ohio, United States
- Case Western Reserve University, Department of Pediatrics, Cleveland, Ohio, United States
| | - David Wilson
- Case Western Reserve University, Department of Biomedical Engineering, Cleveland, Ohio, United States
- Case Western Reserve University, Department of Radiology, Cleveland, Ohio, United States
| |
Collapse
|
187
|
António J, Valente J, Mora C, Almeida A, Jardim S. DarwinGSE: Towards better image retrieval systems for intellectual property datasets. PLoS One 2024; 19:e0304915. [PMID: 38950045 PMCID: PMC11216576 DOI: 10.1371/journal.pone.0304915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 05/20/2024] [Indexed: 07/03/2024] Open
Abstract
A trademark's image is usually the first type of indirect contact between a consumer and a product or a service. Companies rely on graphical trademarks as a symbol of quality and instant recognition, seeking to protect them from copyright infringements. A popular defense mechanism is graphical searching, where an image is compared to a large database to find potential conflicts with similar trademarks. Although image retrieval is not a new subject, the state of the art lacks reliable solutions in the Industrial Property (IP) sector, where datasets are practically unrestricted in content, with abstract images for which modeling human perception is a challenging task. Existing Content-based Image Retrieval (CBIR) systems still present several problems, particularly in terms of efficiency and reliability. In this paper, we propose a new CBIR system that overcomes these major limitations. It follows a modular methodology, composed of a set of individual components tasked with the retrieval, maintenance and gradual optimization of trademark image searching, working on large-scale, unlabeled datasets. Its generalization capacity is achieved using multiple feature descriptions, weighted separately, and combined to represent a single similarity score. Images are evaluated for general features, edge maps, and regions of interest, using a method based on Watershedding K-Means segments. We propose an image recovery process that relies on a new similarity measure between all feature descriptions. New trademark images are added every day to ensure up-to-date results. The proposed system showcases timely retrieval, with 95% of searches returning results within 10 seconds and a mean average precision of 93.7%, supporting its applicability to real-world IP protection scenarios.
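The weighted fusion of per-descriptor similarities into a single score can be sketched as follows; the descriptor names, the cosine-similarity choice, and the weight normalisation are illustrative assumptions, not the paper's actual measure.

```python
import numpy as np

def cosine(a, b):
    """Cosine similarity between two feature vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))

def combined_similarity(query_feats, cand_feats, weights):
    """Weighted fusion of per-descriptor similarities into one score.

    query_feats / cand_feats: dict name -> vector (e.g. 'global', 'edges',
    'regions'); weights: dict name -> non-negative weight, normalised here.
    """
    total = sum(weights.values())
    return sum(weights[k] / total * cosine(query_feats[k], cand_feats[k])
               for k in query_feats)
```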
Collapse
Affiliation(s)
- João António
- Techframe-Information Systems, SA, São Domingos de Rana, Portugal
| | - Jorge Valente
- Techframe-Information Systems, SA, São Domingos de Rana, Portugal
| | - Carlos Mora
- Smart Cities Research Center, Polytechnic Institute of Tomar, Tomar, Portugal
| | - Artur Almeida
- Techframe-Information Systems, SA, São Domingos de Rana, Portugal
| | - Sandra Jardim
- Smart Cities Research Center, Polytechnic Institute of Tomar, Tomar, Portugal
| |
Collapse
|
188
|
Day TG, Matthew J, Budd SF, Venturini L, Wright R, Farruggia A, Vigneswaran TV, Zidere V, Hajnal JV, Razavi R, Simpson JM, Kainz B. Interaction between clinicians and artificial intelligence to detect fetal atrioventricular septal defects on ultrasound: how can we optimize collaborative performance? ULTRASOUND IN OBSTETRICS & GYNECOLOGY : THE OFFICIAL JOURNAL OF THE INTERNATIONAL SOCIETY OF ULTRASOUND IN OBSTETRICS AND GYNECOLOGY 2024; 64:28-35. [PMID: 38197584 DOI: 10.1002/uog.27577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 12/19/2023] [Accepted: 12/30/2023] [Indexed: 01/11/2024]
Abstract
OBJECTIVES Artificial intelligence (AI) has shown promise in improving the performance of fetal ultrasound screening in detecting congenital heart disease (CHD). The effect of giving AI advice to human operators has not been studied in this context. Giving additional information about AI model workings, such as confidence scores for AI predictions, may be a way of further improving performance. Our aims were to investigate whether AI advice improved overall diagnostic accuracy (using a single CHD lesion as an exemplar), and to determine what, if any, additional information given to clinicians optimized the overall performance of the clinician-AI team. METHODS An AI model was trained to classify a single fetal CHD lesion (atrioventricular septal defect (AVSD)), using a retrospective cohort of 121 130 cardiac four-chamber images extracted from 173 ultrasound scan videos (98 with normal hearts, 75 with AVSD); a ResNet50 model architecture was used. Temperature scaling of model prediction probability was performed on a validation set, and gradient-weighted class activation maps (grad-CAMs) produced. Ten clinicians (two consultant fetal cardiologists, three trainees in pediatric cardiology and five fetal cardiac sonographers) were recruited from a center of fetal cardiology to participate. Each participant was shown 2000 fetal four-chamber images in a random order (1000 normal and 1000 AVSD). The dataset comprised 500 images, each shown in four conditions: (1) image alone without AI output; (2) image with binary AI classification; (3) image with AI model confidence; and (4) image with grad-CAM image overlays. The clinicians were asked to classify each image as normal or AVSD. RESULTS A total of 20 000 image classifications were recorded from 10 clinicians. 
The AI model alone achieved an accuracy of 0.798 (95% CI, 0.760-0.832), a sensitivity of 0.868 (95% CI, 0.834-0.902) and a specificity of 0.728 (95% CI, 0.702-0.754), and the clinicians without AI achieved an accuracy of 0.844 (95% CI, 0.834-0.854), a sensitivity of 0.827 (95% CI, 0.795-0.858) and a specificity of 0.861 (95% CI, 0.828-0.895). Showing a binary (normal or AVSD) AI model output resulted in significant improvement in accuracy to 0.865 (P < 0.001). This effect was seen in both experienced and less-experienced participants. Giving incorrect AI advice resulted in a significant deterioration in overall accuracy, from 0.761 to 0.693 (P < 0.001), which was driven by an increase in both Type-I and Type-II errors by the clinicians. This effect was worsened by showing model confidence (accuracy, 0.649; P < 0.001) or grad-CAM (accuracy, 0.644; P < 0.001). CONCLUSIONS AI has the potential to improve performance when used in collaboration with clinicians, even if the model performance does not reach expert level. Giving additional information about model workings such as model confidence and class activation map image overlays did not improve overall performance, and actually worsened performance for images for which the AI model was incorrect. © 2024 The Authors. Ultrasound in Obstetrics & Gynecology published by John Wiley & Sons Ltd on behalf of International Society of Ultrasound in Obstetrics and Gynecology.
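Temperature scaling, used in the study to expose model confidence, fits a single scalar T on validation logits so that softmax confidence matches observed accuracy without changing the predicted class. A minimal sketch, with grid search standing in for the usual gradient-based fit:

```python
import numpy as np

def nll(logits, labels, T):
    """Mean negative log-likelihood of temperature-scaled softmax outputs."""
    z = logits / T
    z = z - z.max(axis=1, keepdims=True)          # numerical stability
    logp = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
    return -logp[np.arange(len(labels)), labels].mean()

def fit_temperature(logits, labels):
    """Pick the single temperature minimising validation NLL."""
    grid = np.linspace(0.25, 10.0, 400)
    return float(grid[np.argmin([nll(logits, labels, T) for T in grid])])
```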
Collapse
Affiliation(s)
- T G Day
- School of Biomedical Engineering and Imaging Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
- Department of Congenital Heart Disease, Evelina London Children's Healthcare, Guy's and St Thomas' NHS Foundation Trust, London, UK
| | - J Matthew
- School of Biomedical Engineering and Imaging Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
| | - S F Budd
- School of Biomedical Engineering and Imaging Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
| | - L Venturini
- School of Biomedical Engineering and Imaging Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
| | - R Wright
- School of Biomedical Engineering and Imaging Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
| | - A Farruggia
- School of Biomedical Engineering and Imaging Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
| | - T V Vigneswaran
- School of Biomedical Engineering and Imaging Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
- Department of Congenital Heart Disease, Evelina London Children's Healthcare, Guy's and St Thomas' NHS Foundation Trust, London, UK
| | - V Zidere
- School of Biomedical Engineering and Imaging Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
- Department of Congenital Heart Disease, Evelina London Children's Healthcare, Guy's and St Thomas' NHS Foundation Trust, London, UK
- Harris Birthright Research Centre, King's College London NHS Foundation Trust, London, UK
| | - J V Hajnal
- School of Biomedical Engineering and Imaging Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
| | - R Razavi
- School of Biomedical Engineering and Imaging Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
- Department of Congenital Heart Disease, Evelina London Children's Healthcare, Guy's and St Thomas' NHS Foundation Trust, London, UK
| | - J M Simpson
- School of Biomedical Engineering and Imaging Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
- Department of Congenital Heart Disease, Evelina London Children's Healthcare, Guy's and St Thomas' NHS Foundation Trust, London, UK
| | - B Kainz
- School of Biomedical Engineering and Imaging Sciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
- Department Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität, Erlangen-Nürnberg, Germany
- Department of Computing, Faculty of Engineering, Imperial College London, London, UK
| |
Collapse
|
189
|
Sung C, Oh JS, Park BS, Kim SS, Song SY, Lee JJ. Diagnostic performance of a deep-learning model using 18F-FDG PET/CT for evaluating recurrence after radiation therapy in patients with lung cancer. Ann Nucl Med 2024; 38:516-524. [PMID: 38589677 DOI: 10.1007/s12149-024-01925-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Accepted: 03/21/2024] [Indexed: 04/10/2024]
Abstract
OBJECTIVE We developed a deep learning model for distinguishing radiation therapy (RT)-related changes and tumour recurrence in patients with lung cancer who underwent RT, and evaluated its performance. METHODS We retrospectively recruited 308 patients with lung cancer with RT-related changes observed on 18F-fluorodeoxyglucose positron emission tomography-computed tomography (18F-FDG PET/CT) performed after RT. Patients were labelled as positive or negative for tumour recurrence through histologic diagnosis or clinical follow-up after 18F-FDG PET/CT. A two-dimensional (2D) slice-based convolutional neural network (CNN) model was created with a total of 3329 slices as input, and performance was evaluated with five independent test sets. RESULTS For the five independent test sets, the area under the curve (AUC) of the receiver operating characteristic curve, sensitivity, and specificity were in the range of 0.98-0.99, 95-98%, and 87-95%, respectively. The region identified by the model was confirmed as the actual recurrent tumour through explainable artificial intelligence (AI) using gradient-weighted class activation mapping (Grad-CAM). CONCLUSION The 2D slice-based CNN model using 18F-FDG PET imaging was able to distinguish well between RT-related changes and tumour recurrence in patients with lung cancer.
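The reported AUC can be computed directly from per-slice scores via the rank-sum (Mann-Whitney) identity, without building the ROC curve point by point. A minimal sketch (ties are broken arbitrarily here, unlike a careful implementation):

```python
import numpy as np

def roc_auc(labels, scores):
    """AUC as the probability that a random positive slice outscores a
    random negative one (rank-sum identity; ties broken arbitrarily)."""
    labels = np.asarray(labels)
    scores = np.asarray(scores)
    order = np.argsort(scores)
    ranks = np.empty(len(scores))
    ranks[order] = np.arange(1, len(scores) + 1)
    n_pos = labels.sum()
    n_neg = len(labels) - n_pos
    return (ranks[labels == 1].sum() - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)
```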
Collapse
Affiliation(s)
- Changhwan Sung
- Department of Nuclear Medicine, Asan Medical Center, University of Ulsan College of Medicine, 88 Olympic-Ro 43-Gil, Songpa-Gu, Seoul, 05505, Korea
| | - Jungsu S Oh
- Department of Nuclear Medicine, Asan Medical Center, University of Ulsan College of Medicine, 88 Olympic-Ro 43-Gil, Songpa-Gu, Seoul, 05505, Korea
| | - Byung Soo Park
- Department of Nuclear Medicine, Asan Medical Center, University of Ulsan College of Medicine, 88 Olympic-Ro 43-Gil, Songpa-Gu, Seoul, 05505, Korea
| | - Su Ssan Kim
- Department of Radiation Oncology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
| | - Si Yeol Song
- Department of Radiation Oncology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
| | - Jong Jin Lee
- Department of Nuclear Medicine, Asan Medical Center, University of Ulsan College of Medicine, 88 Olympic-Ro 43-Gil, Songpa-Gu, Seoul, 05505, Korea.
| |
Collapse
|
190
|
Tian X, Zhang Z, Wang C, Zhang W, Qu Y, Ma L, Wu Z, Xie Y, Tao D. Variational Distillation for Multi-View Learning. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2024; 46:4551-4566. [PMID: 38133979 DOI: 10.1109/tpami.2023.3343717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2023]
Abstract
Information Bottleneck (IB) provides an information-theoretic principle for multi-view learning by revealing the various components contained in each viewpoint. This highlights the necessity of capturing their distinct roles to achieve view-invariant and predictive representations, which remains under-explored due to the technical intractability of modeling and organizing innumerable mutual information (MI) terms. Recent studies show that sufficiency and consistency play such key roles in multi-view representation learning, and could be preserved via a variational distillation framework. But when generalized to arbitrary viewpoints, such a strategy fails, as the mutual information terms of consistency become complicated. This paper presents Multi-View Variational Distillation (MV²D), tackling the above limitations for generalized multi-view learning. Uniquely, MV²D can recognize useful consistent information and prioritize diverse components by their generalization ability. This guides an analytical and scalable solution to achieving both sufficiency and consistency. Additionally, by rigorously reformulating the IB objective, MV²D tackles the difficulties in MI optimization and fully realizes the theoretical advantages of the information bottleneck principle. We extensively evaluate our model on diverse tasks to verify its effectiveness, where the considerable gains provide key insights into achieving generalized multi-view representations under a rigorous information-theoretic principle.
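For reference, the classic information-bottleneck objective the abstract builds on, stated in the notation of the IB literature rather than this paper's:

```latex
% A stochastic encoder p(z|x) is sought that compresses the view X while
% staying predictive of the target Y; beta trades the two terms off:
\min_{p(z \mid x)} \; I(X; Z) \;-\; \beta \, I(Z; Y)
% In the multi-view setting with views X_1, X_2, sufficiency asks
% I(Z; Y) = I(X; Y), while consistency asks Z to retain the information
% shared across views, related to I(X_1; X_2); the abstract's point is that
% organizing these MI terms tractably for arbitrary numbers of views is hard.
```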
Collapse
|
191
|
Rekik W, Le Hégarat-Mascle S, Ezzedini S, de Marco G. Detection of atypical attentional behaviors in young subjects. J Neurosci Methods 2024; 407:110141. [PMID: 38641265 DOI: 10.1016/j.jneumeth.2024.110141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Revised: 04/09/2024] [Accepted: 04/16/2024] [Indexed: 04/21/2024]
Abstract
BACKGROUND Vigilance ability refers to the accuracy and speed with which a person performs a cognitive-motor task, either voluntarily (endogenous mode) or following a warning stimulus (exogenous mode). In the context of a force production task, our study focuses on the impact of the states of vigilance by proposing an original approach that allows distinguishing between good (inlier) and poor (outlier) participants. We assume that the use of an external signal and duration of the temporal preparation (foreperiod) increase the speed and the precision of motor responses. Our objective is particularly challenging in the context of a limited dataset with a high level of noise. NEW METHOD Our original methodological approach consists of coupling the RANSAC (RANdom SAmple Consensus) algorithm with a statistical machine learning algorithm to handle noise. COMPARISON WITH EXISTING METHODS Our clustering approach, based on the coupling of RANSAC methodology with ensemble classifiers, overcomes the limitations of conventional supervised algorithms that are not robust to outliers (such as K-Nearest Neighbors) and/or not adapted to few-shot learning (such as Support Vector Machines and Artificial Neural Networks). RESULTS The clustering results were validated in terms of reaction time distributions and force error distributions with respect to participant groups. We show that the use of an external signal and duration of the temporal preparation (foreperiod) increase the speed and the precision of motor responses. CONCLUSION Our study has allowed us to detect atypical attentional patterns and succeeds in separating the inliers from the outliers.
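The RANSAC side of the coupling is easy to sketch: repeatedly fit a model to minimal random samples and keep the largest consensus set, flagging everything outside it as outliers. A minimal line-fitting version (the paper couples this with ensemble classifiers; the threshold and variable names here are illustrative):

```python
import numpy as np

def ransac_line(x, y, n_iter=200, tol=2.0, rng=None):
    """Minimal RANSAC line fit returning a boolean inlier mask.

    Each iteration fits a line through two random points and counts how
    many points fall within `tol` residual; the largest consensus set wins.
    """
    if rng is None:
        rng = np.random.default_rng(0)
    best_inliers = np.zeros(len(x), dtype=bool)
    for _ in range(n_iter):
        i, j = rng.choice(len(x), size=2, replace=False)
        if x[i] == x[j]:
            continue                      # degenerate vertical sample
        a = (y[j] - y[i]) / (x[j] - x[i])
        b = y[i] - a * x[i]
        inliers = np.abs(y - (a * x + b)) < tol
        if inliers.sum() > best_inliers.sum():
            best_inliers = inliers
    return best_inliers
```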
Collapse
Affiliation(s)
- Wafa Rekik
- Research Laboratory COSIM, Higher School of Communications of Tunis, University of Carthage, Route de Raoued 3.5 Km, Cité El Ghazala, Ariana 2088, Tunisia.
| | | | | | | |
Collapse
|
192
|
Cho SM, Joo HH, Golla P, Sahu M, Shankar A, Trakimas DR, Creighton F, Akst L, Taylor RH, Galaiya D. Tremor Assessment in Robot-Assisted Microlaryngeal Surgery Using Computer Vision-Based Tool Tracking. Otolaryngol Head Neck Surg 2024; 171:188-196. [PMID: 38488231 PMCID: PMC11211051 DOI: 10.1002/ohn.714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 01/30/2024] [Accepted: 02/09/2024] [Indexed: 04/16/2024]
Abstract
OBJECTIVE Use microscopic video-based tracking of laryngeal surgical instruments to investigate the effect of robot assistance on instrument tremor. STUDY DESIGN Experimental trial. SETTING Tertiary Academic Medical Center. METHODS In this randomized cross-over trial, 36 videos were recorded from 6 surgeons performing left and right cordectomies on cadaveric pig larynges. These recordings captured 3 distinct conditions: without robotic assistance, with robot-assisted scissors, and with robot-assisted graspers. To assess tool tremor, we employed computer vision-based algorithms for tracking surgical tools. Absolute tremor bandpower and normalized path length were utilized as quantitative measures. Wilcoxon rank sum exact tests were employed for statistical analyses and comparisons between trials. Additionally, surveys were administered to assess the perceived ease of use of the robotic system. RESULTS Absolute tremor bandpower showed a significant decrease when using robot-assisted instruments compared to freehand instruments (P = .012). Normalized path length significantly decreased with robot-assisted compared to freehand trials (P = .001). For the scissors, robot-assisted trials resulted in a significant decrease in absolute tremor bandpower (P = .002) and normalized path length (P < .001). For the graspers, there was no significant difference in absolute tremor bandpower (P = .4), but there was a significantly lower normalized path length in the robot-assisted trials (P = .03). CONCLUSION This study demonstrated that computer-vision-based approaches can be used to assess tool motion in simulated microlaryngeal procedures. The results suggest that robot assistance is capable of reducing instrument tremor.
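The two quantitative measures can be sketched under common definitions; the band limits and normalisation below are assumptions for illustration, and the paper's exact formulas may differ.

```python
import numpy as np

def band_power(signal, fs, f_lo=4.0, f_hi=12.0):
    """Fraction of periodogram power in an assumed tremor band (4-12 Hz)."""
    sig = signal - np.mean(signal)
    psd = np.abs(np.fft.rfft(sig)) ** 2
    freqs = np.fft.rfftfreq(len(sig), d=1.0 / fs)
    band = (freqs >= f_lo) & (freqs <= f_hi)
    return psd[band].sum() / (psd.sum() + 1e-12)

def normalized_path_length(xy, fs):
    """Tool-tip path length per second of tracked (N, 2) positions
    (one common normalisation; others divide by straight-line distance)."""
    steps = np.linalg.norm(np.diff(xy, axis=0), axis=1)
    return steps.sum() * fs / len(xy)
```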
Collapse
Affiliation(s)
- Sue M. Cho
- Department of Computer Science, Johns Hopkins, Baltimore, Maryland, USA
| | - Henry H. Joo
- Department of Otolaryngology–Head & Neck Surgery, Johns Hopkins, Baltimore, Maryland, USA
| | - Pranathi Golla
- Department of Mechanical Engineering, Johns Hopkins, Baltimore, Maryland, USA
| | - Manish Sahu
- Department of Computer Science, Johns Hopkins, Baltimore, Maryland, USA
| | - Ahjeetha Shankar
- Department of Otolaryngology–Head & Neck Surgery, Johns Hopkins, Baltimore, Maryland, USA
| | - Danielle R. Trakimas
- Department of Otolaryngology–Head & Neck Surgery, Johns Hopkins, Baltimore, Maryland, USA
| | - Francis Creighton
- Department of Otolaryngology–Head & Neck Surgery, Johns Hopkins, Baltimore, Maryland, USA
| | - Lee Akst
- Department of Otolaryngology–Head & Neck Surgery, Johns Hopkins, Baltimore, Maryland, USA
| | - Russell H. Taylor
- Department of Computer Science, Johns Hopkins, Baltimore, Maryland, USA
| | - Deepa Galaiya
- Department of Otolaryngology–Head & Neck Surgery, Johns Hopkins, Baltimore, Maryland, USA
| |
Collapse
|
193
|
Teufel T, Shu H, Soberanis-Mukul RD, Mangulabnan JE, Sahu M, Vedula SS, Ishii M, Hager G, Taylor RH, Unberath M. OneSLAM to map them all: a generalized approach to SLAM for monocular endoscopic imaging based on tracking any point. Int J Comput Assist Radiol Surg 2024; 19:1259-1266. [PMID: 38775904 DOI: 10.1007/s11548-024-03171-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 04/30/2024] [Indexed: 07/10/2024]
Abstract
PURPOSE Monocular SLAM algorithms are the key enabling technology for image-based surgical navigation systems for endoscopic procedures. Due to the visual feature scarcity and unique lighting conditions encountered in endoscopy, classical SLAM approaches perform inconsistently. Many of the recent approaches to endoscopic SLAM rely on deep learning models. They show promising results when optimized on singular domains such as arthroscopy, sinus endoscopy, colonoscopy or laparoscopy, but are limited by an inability to generalize to different domains without retraining. METHODS To address this generality issue, we propose OneSLAM, a monocular SLAM algorithm for surgical endoscopy that works out of the box across several endoscopic domains, including sinus endoscopy, colonoscopy, arthroscopy and laparoscopy. Our pipeline builds upon robust tracking-any-point (TAP) foundation models to reliably track sparse correspondences across multiple frames and runs local bundle adjustment to jointly optimize camera poses and a sparse 3D reconstruction of the anatomy. RESULTS We compare the performance of our method against three strong baselines previously proposed for monocular SLAM in endoscopy and general scenes. In all four tested domains, OneSLAM matches or outperforms existing approaches tailored to that specific data, generalizing across domains without the need for retraining. CONCLUSION OneSLAM benefits from the convincing performance of TAP foundation models and generalizes to endoscopic sequences of different anatomies, all while matching or outperforming domain-specific SLAM approaches. Future research on global loop closure will investigate how to reliably detect loops in endoscopic scenes to reduce accumulated drift and enhance long-term navigation capabilities.
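The sparse-correspondence stage can be illustrated with a toy tracker: follow one point through a frame sequence by correlating a template around its last known position. In the actual pipeline a TAP foundation model replaces this brute-force correlation search; all names and window sizes here are illustrative.

```python
import numpy as np

def track_template(frames, init_yx, half=5, search=3):
    """Track one point through frames by exhaustive template matching.

    A zero-mean template from frame 0 is correlated against patches in a
    small search window around the previous estimate; the best-scoring
    offset becomes the new position.
    """
    y, x = init_yx
    tmpl = frames[0][y - half:y + half + 1, x - half:x + half + 1].astype(float)
    tmpl = tmpl - tmpl.mean()
    track = [(y, x)]
    for f in frames[1:]:
        best, best_yx = -np.inf, (y, x)
        for dy in range(-search, search + 1):
            for dx in range(-search, search + 1):
                yy, xx = y + dy, x + dx
                patch = f[yy - half:yy + half + 1,
                          xx - half:xx + half + 1].astype(float)
                score = ((patch - patch.mean()) * tmpl).sum()
                if score > best:
                    best, best_yx = score, (yy, xx)
        y, x = best_yx
        track.append((y, x))
    return track
```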
Collapse
Affiliation(s)
- Timo Teufel
- Johns Hopkins University, Baltimore, MD, 21211, USA.
| | - Hongchao Shu
- Johns Hopkins University, Baltimore, MD, 21211, USA
| | | | | | - Manish Sahu
- Johns Hopkins University, Baltimore, MD, 21211, USA
| | | | - Masaru Ishii
- Johns Hopkins Medical Institutions, Baltimore, MD, 21287, USA
| | | | - Russell H Taylor
- Johns Hopkins University, Baltimore, MD, 21211, USA
- Johns Hopkins Medical Institutions, Baltimore, MD, 21287, USA
| | - Mathias Unberath
- Johns Hopkins University, Baltimore, MD, 21211, USA
- Johns Hopkins Medical Institutions, Baltimore, MD, 21287, USA
| |
Collapse
|
194
|
Tian H, Zhang B, Zhang Z, Xu Z, Jin L, Bian Y, Wu J. DenseNet model incorporating hybrid attention mechanisms and clinical features for pancreatic cystic tumor classification. J Appl Clin Med Phys 2024; 25:e14380. [PMID: 38715381 PMCID: PMC11244679 DOI: 10.1002/acm2.14380] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 02/18/2024] [Accepted: 04/15/2024] [Indexed: 07/14/2024] Open
Abstract
PURPOSE The aim of this study is to develop a deep learning model capable of discriminating between pancreatic serous cystic neoplasms (SCN) and mucinous cystic neoplasms (MCN) by leveraging patient-specific clinical features and imaging outcomes. The intent is to offer valuable diagnostic support to clinicians in their clinical decision-making processes. METHODS The construction of the deep learning model involved utilizing a dataset comprising abdominal magnetic resonance T2-weighted images obtained from patients diagnosed with pancreatic cystic tumors at Changhai Hospital. The dataset comprised 207 patients with SCN and 93 patients with MCN, encompassing a total of 1761 images. The foundational architecture employed was DenseNet-161, augmented with a hybrid attention mechanism module. This integration aimed to enhance the network's attentiveness toward channel and spatial features, thereby amplifying its performance. Additionally, clinical features were incorporated prior to the fully connected layer of the network to actively contribute to subsequent decision-making processes, thereby significantly augmenting the model's classification accuracy. The final patient classification outcomes were derived using a joint voting methodology, and the model underwent comprehensive evaluation. RESULTS Using five-fold cross validation, the accuracy of the classification model in this paper was 92.44%, with an AUC value of 0.971, a precision rate of 0.956, a recall rate of 0.919, a specificity of 0.933, and an F1-score of 0.936. CONCLUSION This study demonstrates that the DenseNet model, which incorporates hybrid attention mechanisms and clinical features, is effective for distinguishing between SCN and MCN, and has potential application for the diagnosis of pancreatic cystic tumors in clinical practice.
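The two fusion ideas, hybrid (channel plus spatial) attention and concatenating clinical covariates before the classifier head, can be sketched with plain arrays. The gating forms and weight shapes below are assumptions standing in for trained parameters, not the paper's exact architecture.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def hybrid_attention(feat, w_ch, w_sp):
    """Channel-then-spatial attention over a (C, H, W) feature map.

    Squeeze-and-excite-style channel gates from global average pooling,
    then a spatial gate from the channel mean; w_ch (C, C) and the scalar
    w_sp are placeholders for learned parameters.
    """
    ch = sigmoid(w_ch @ feat.mean(axis=(1, 2)))   # (C,) channel gates
    feat = feat * ch[:, None, None]
    sp = sigmoid(w_sp * feat.mean(axis=0))        # (H, W) spatial gate
    return feat * sp[None, :, :]

def fuse_with_clinical(feat, clinical):
    """Concatenate pooled image features with clinical covariates before
    the fully connected classifier head, as the abstract describes."""
    return np.concatenate([feat.mean(axis=(1, 2)), clinical])
```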
Collapse
Affiliation(s)
- Hui Tian
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China
| | - Bo Zhang
- School of Medical Technology, Binzhou Polytechnic, Shandong, China
| | - Zhiwei Zhang
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China
| | - Zhenshun Xu
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China
| | - Liang Jin
- Department of Radiology, Huadong Hospital, Fudan University, Shanghai, China
| | - Yun Bian
- Department of Radiology, Changhai Hospital, The Navy Military Medical University, Shanghai, China
| | - Jie Wu
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China
| |
Collapse
|
195
|
Vun DSY, Bowers R, McGarry A. Vision-based motion capture for the gait analysis of neurodegenerative diseases: A review. Gait Posture 2024; 112:95-107. [PMID: 38754258 DOI: 10.1016/j.gaitpost.2024.04.029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 04/25/2024] [Accepted: 04/26/2024] [Indexed: 05/18/2024]
Abstract
BACKGROUND Developments in vision-based systems and human pose estimation algorithms have the potential to support the detection, monitoring and early treatment of neurodegenerative diseases through gait analysis. However, the gap between the technology available and actual clinical practice is evident, as most clinicians still rely on subjective observational gait analysis or objective marker-based analysis that is time-consuming. RESEARCH QUESTION This paper aims to examine the main developments of vision-based motion capture and how such advances may be integrated into clinical practice. METHODS The literature review was conducted in six online databases using Boolean search terms. A commercial system search was also included. A predetermined methodological criterion was then used to assess the quality of the selected articles. RESULTS A total of seventeen studies were evaluated, with thirteen studies focusing on gait classification systems and four studies on gait measurement systems. Of the gait classification systems, nine studies utilized artificial intelligence-assisted techniques, while four studies employed statistical techniques. The results revealed high correlations of gait features identified by classifier models with existing clinical rating scales. These systems demonstrated generally high classification accuracies and were effective in diagnosing disease severity levels. Gait measurement systems that extract spatiotemporal and kinematic joint information from video data generally produced accurate measurements of gait parameters with low mean absolute errors and high intra- and inter-rater reliability. SIGNIFICANCE Low cost, portable vision-based systems can provide proof of concept for the quantification of gait, expansion of gait assessment tools, remote gait analysis of neurodegenerative diseases and a point of care system for orthotic evaluation.
However, certain challenges, including small sample sizes, occlusion risks, and selection bias in training models, need to be addressed. Nevertheless, these systems can serve as complementary tools, equipping clinicians with essential gait information to objectively assess disease severity and tailor personalized treatment for enhanced patient care.
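The gait measurement systems above derive spatiotemporal parameters from pose-estimation keypoints. As an illustration only (the frame rate, the single-ankle keypoint format, and the heel-strike heuristic below are assumptions, not taken from any reviewed system), stride time and cadence might be sketched from an ankle trajectory like this:

```python
import numpy as np

def heel_strikes(ankle_y, fps, min_gap_s=0.4):
    """Approximate foot-contact frames as local maxima of the ankle's
    image-space y coordinate (y grows downward, so ground contact is
    the lowest visible point), at least min_gap_s apart."""
    y = np.asarray(ankle_y, dtype=float)
    peaks = [i for i in range(1, len(y) - 1)
             if y[i] >= y[i - 1] and y[i] > y[i + 1]]
    gap, contacts = int(min_gap_s * fps), []
    for p in peaks:
        if not contacts or p - contacts[-1] >= gap:
            contacts.append(p)
    return contacts

def gait_parameters(ankle_y, fps):
    hs = heel_strikes(ankle_y, fps)
    stride_s = np.diff(hs) / fps            # time between same-foot contacts
    return {"stride_time_s": float(stride_s.mean()),
            "cadence_spm": float(60.0 / stride_s.mean() * 2)}  # assumes symmetric gait
```

On a synthetic 1 Hz ankle oscillation sampled at 30 fps this recovers a 1.0 s stride time and a cadence of 120 steps/min.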
Affiliation(s)
- David Sing Yee Vun, National Centre for Prosthetics and Orthotics, Department of Biomedical Engineering, University of Strathclyde, Glasgow, UK
- Robert Bowers, National Centre for Prosthetics and Orthotics, Department of Biomedical Engineering, University of Strathclyde, Glasgow, UK
- Anthony McGarry, National Centre for Prosthetics and Orthotics, Department of Biomedical Engineering, University of Strathclyde, Glasgow, UK
196
Tzitzimpasis P, Ries M, Raaymakers BW, Zachiu C. Generalized div-curl based regularization for physically constrained deformable image registration. Sci Rep 2024; 14:15002. PMID: 38951683. PMCID: PMC11217375. DOI: 10.1038/s41598-024-65896-3.
Abstract
Variational image registration methods commonly employ a similarity metric together with a regularization term that renders the minimization problem well-posed. However, many frequently used regularizations, such as smoothness or curvature, do not necessarily reflect the physics underlying anatomical deformations, which can make the accurate estimation of complex deformations particularly challenging. Here, we present a new, highly flexible regularization inspired by the physics of fluid dynamics that allows independent penalties to be applied to the divergence and curl of the deformations and/or their nth-order derivatives. The complexity of the proposed generalized div-curl regularization makes the problem particularly challenging for conventional optimization techniques. To this end, we develop a transformation model and an optimization scheme that use the divergence and curl components of the deformation as control parameters for the registration. We demonstrate that the original unconstrained minimization problem reduces to a constrained problem, for which we propose the augmented Lagrangian method. In doing so, the equations of motion simplify greatly and become manageable. Our experiments indicate that the proposed framework can be applied to a variety of registration problems and produces highly accurate deformations with the desired physical properties.
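The quantities this regularization penalizes are standard vector-calculus operators. A minimal numerical sketch (grid spacing, weights, and function names are illustrative assumptions, not the paper's implementation) of independent divergence and curl penalties for a 2-D deformation field u = (ux, uy):

```python
import numpy as np

def div_curl(ux, uy, h=1.0):
    """Divergence and scalar curl of a 2-D vector field on a regular grid,
    via central finite differences (axis 0 = y, axis 1 = x)."""
    dux_dx = np.gradient(ux, h, axis=1)
    dux_dy = np.gradient(ux, h, axis=0)
    duy_dx = np.gradient(uy, h, axis=1)
    duy_dy = np.gradient(uy, h, axis=0)
    return dux_dx + duy_dy, duy_dx - dux_dy

def div_curl_penalty(ux, uy, w_div=1.0, w_curl=1.0, h=1.0):
    # independent quadratic penalties on compression (div) and rotation (curl)
    d, c = div_curl(ux, uy, h)
    return w_div * np.mean(d**2) + w_curl * np.mean(c**2)
```

For a rigid rotation field (ux, uy) = (-y, x) the divergence vanishes and the curl is 2 everywhere, so a div-only penalty leaves such deformations unpunished while a curl penalty does not.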
Affiliation(s)
- Paris Tzitzimpasis, Department of Radiotherapy, UMC Utrecht, 3584 CX, Utrecht, The Netherlands
- Mario Ries, Imaging Division, UMC Utrecht, 3584 CX, Utrecht, The Netherlands
- Bas W Raaymakers, Department of Radiotherapy, UMC Utrecht, 3584 CX, Utrecht, The Netherlands
- Cornel Zachiu, Department of Radiotherapy, UMC Utrecht, 3584 CX, Utrecht, The Netherlands
197
Thandiackal K, Piccinelli L, Gupta R, Pati P, Goksel O. Multi-Scale Feature Alignment for Continual Learning of Unlabeled Domains. IEEE Trans Med Imaging 2024; 43:2599-2609. PMID: 38381642. DOI: 10.1109/tmi.2024.3368365.
Abstract
Methods for unsupervised domain adaptation (UDA) help to improve the performance of deep neural networks on unseen domains without any labeled data. This is crucial especially in medical disciplines such as histopathology, where large datasets with detailed annotations are scarce. While the majority of existing UDA methods focus on adaptation from a labeled source to a single unlabeled target domain, many real-world applications with a long life cycle involve more than one target domain. Thus, the ability to sequentially adapt to multiple target domains becomes essential. In settings where the data from previously seen domains cannot be stored, e.g., due to data protection regulations, this becomes a challenging continual learning problem. To this end, we propose to use generative feature-driven image replay in conjunction with a dual-purpose discriminator that not only enables the generation of images with realistic features for replay, but also promotes feature alignment during domain adaptation. We evaluate our approach extensively on a sequence of three histopathological datasets for tissue-type classification, achieving state-of-the-art results. We present detailed ablation experiments studying our proposed method components and demonstrate a possible use case of our continual UDA method for an unsupervised patch-based segmentation task on high-resolution tissue images. Our code is available at: https://github.com/histocartography/multi-scale-feature-alignment.
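The "dual-purpose" discriminator plays two adversarial roles: judging replayed features as real vs. generated, and judging features as source- vs. target-like. A schematic sketch of those objectives (loss names, argument conventions, and the exact pairing of terms are assumptions; see the linked repository for the authors' actual code):

```python
import numpy as np

def bce(logits, label):
    """Numerically stable binary cross-entropy against a constant 0/1 label:
    log(1 + e^x) - label*x equals -log sigmoid(x) for label=1 and
    -log(1 - sigmoid(x)) for label=0."""
    return float(np.mean(np.logaddexp(0.0, logits) - label * logits))

def dual_discriminator_loss(d_real_src, d_replayed, d_target):
    # role (a): distinguish real source features from generated (replayed) ones
    realism = bce(d_real_src, 1.0) + bce(d_replayed, 0.0)
    # role (b): distinguish source-like features from target-domain ones
    domain = bce(d_real_src, 1.0) + bce(d_target, 0.0)
    return realism + domain

def alignment_loss(d_target):
    # the feature extractor is trained so target features fool the domain head
    return bce(d_target, 1.0)
```

The encoder minimizing `alignment_loss` while the discriminator minimizes `dual_discriminator_loss` is the usual adversarial alignment game, here shown on raw logit arrays rather than network outputs.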
198
Huang Z, Zhao R, Leung FHF, Banerjee S, Lam KM, Zheng YP, Ling SH. Landmark Localization From Medical Images With Generative Distribution Prior. IEEE Trans Med Imaging 2024; 43:2679-2692. PMID: 38421850. DOI: 10.1109/tmi.2024.3371948.
Abstract
In medical image analysis, anatomical landmarks usually carry strong prior knowledge of their structural information. In this paper, we propose to improve medical landmark localization by modeling the underlying landmark distribution via normalizing flows. Specifically, we introduce the flow-based landmark distribution prior as a learnable objective function into a regression-based landmark localization framework. Moreover, we employ an integral operation to make the mapping from heatmaps to coordinates differentiable, further enhancing heatmap-based localization with the learned distribution prior. Our proposed Normalizing Flow-based Distribution Prior (NFDP) employs a straightforward, non-problem-tailored backbone (i.e., ResNet18), yet delivers high-fidelity outputs across three X-ray-based landmark localization datasets. Remarkably, NFDP achieves this with minimal additional computational burden, since the normalizing-flow module is detached from the framework at inference time. Compared with existing techniques, NFDP provides a superior balance between prediction accuracy and inference speed, making it a highly efficient and effective approach. The source code of this paper is available at https://github.com/jacksonhzx95/NFDP.
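The "integral operation" that makes the heatmap-to-coordinate mapping differentiable is commonly realized as a soft-argmax: the expected coordinate under a softmax of the heatmap. A minimal sketch of that operation alone (the normalizing-flow prior is omitted; shapes and the temperature parameter are illustrative assumptions):

```python
import numpy as np

def soft_argmax_2d(heatmap, beta=1.0):
    """Differentiable expected (x, y) coordinate of a 2-D heatmap.

    Unlike a hard argmax, this is a smooth function of every heatmap
    value, so gradients flow back through the coordinate prediction.
    """
    h, w = heatmap.shape
    p = np.exp(beta * (heatmap - heatmap.max()))   # stabilized softmax
    p /= p.sum()                                   # normalize over all pixels
    ys, xs = np.mgrid[0:h, 0:w]                    # pixel coordinate grids
    return float((p * xs).sum()), float((p * ys).sum())
```

As the temperature `beta` grows, the expectation approaches the hard argmax of the heatmap; at small `beta` it blends information from the whole map.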
199
Zhu K, Shen Z, Wang M, Jiang L, Zhang Y, Yang T, Zhang H, Zhang M. Visual Knowledge Domain of Artificial Intelligence in Computed Tomography: A Review Based on Bibliometric Analysis. J Comput Assist Tomogr 2024; 48:652-662. PMID: 38271538. DOI: 10.1097/rct.0000000000001585.
Abstract
Artificial intelligence (AI)-assisted medical imaging technology is a research area of great interest that has developed rapidly over the last decade, yet no bibliometric analysis of published studies in this field has been conducted. The present review focuses on AI-related studies on computed tomography imaging in the Web of Science database and uses CiteSpace and VOSviewer to generate a knowledge map and conduct basic information analysis, co-word analysis, and co-citation analysis. A total of 7265 documents were included, and the number of documents published showed an overall upward trend. Scholars from the United States and China have made outstanding contributions, but extensive cooperation in this field is generally lacking. In recent years, the most active and challenging research topics have been the optimization and upgrading of algorithms and the translation of theoretical models into practical clinical applications. This review will help researchers understand the developments, hot topics, and research frontiers in this field and provide reference and guidance for future studies.
200
Rossi L, Fiorentino MC, Mancini A, Paolanti M, Rosati R, Zingaretti P. Generalizability and robustness evaluation of attribute-based zero-shot learning. Neural Netw 2024; 175:106278. PMID: 38581809. DOI: 10.1016/j.neunet.2024.106278.
Abstract
In the field of deep learning, large quantities of data are typically required to effectively train models. This challenge has given rise to techniques like zero-shot learning (ZSL), which trains models on a set of "seen" classes and evaluates them on a set of "unseen" classes. Although ZSL has shown considerable potential, particularly with the employment of generative methods, its generalizability to real-world scenarios remains uncertain. The hypothesis of this work is that the performance of ZSL models is systematically influenced by the chosen "splits", in particular by the statistical properties of the classes and attributes used in training. In this paper, we test this hypothesis by introducing the concepts of generalizability and robustness in attribute-based ZSL and carry out a variety of experiments to stress-test ZSL models against different splits. Our aim is to lay the groundwork for future research on ZSL models' generalizability, robustness, and practical applications. We evaluate the accuracy of state-of-the-art models on benchmark datasets and identify consistent trends in generalizability and robustness. We analyze how these properties vary with the dataset type, differentiating between coarse- and fine-grained datasets, and our findings indicate significant room for improvement in both generalizability and robustness. Furthermore, our results demonstrate the effectiveness of dimensionality reduction techniques in improving the performance of state-of-the-art models on fine-grained datasets.
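The attribute-based ZSL setting the paper stress-tests can be summarized in a toy form: an image's predicted attribute vector is matched against per-class attribute signatures of the unseen classes. A minimal sketch (the cosine matching rule, the signatures, and all data below are made-up illustrations, not the paper's evaluated models):

```python
import numpy as np

def zsl_predict(pred_attrs, class_attrs):
    """Index of the unseen class whose attribute signature has the highest
    cosine similarity to the predicted attribute vector."""
    a = pred_attrs / np.linalg.norm(pred_attrs)
    c = class_attrs / np.linalg.norm(class_attrs, axis=1, keepdims=True)
    return int(np.argmax(c @ a))

# Hypothetical unseen-class signatures: rows = classes, columns = attributes
# (e.g. "striped", "spotted", "four-legged", "winged").
signatures = np.array([[1.0, 0.0, 1.0, 0.0],
                       [0.0, 1.0, 0.0, 1.0]])
```

The paper's point is that which classes and attributes end up in the seen/unseen split (the rows and columns available at training time) systematically changes how well such a matcher transfers.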
Affiliation(s)
- Luca Rossi, Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy
- Maria Chiara Fiorentino, Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy
- Adriano Mancini, Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy
- Marina Paolanti, Dipartimento di Scienze politiche, della Comunicazione e delle Relazioni Internazionali, Università di Macerata, 62100, Macerata, Italy
- Riccardo Rosati, Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy
- Primo Zingaretti, Dipartimento di Ingegneria dell'Informazione, Università Politecnica delle Marche, Via Brecce Bianche 12, 60131, Ancona, Italy