1
|
McCoy JA, Levine LD, Wan G, Chivers C, Teel J, La Cava WG. Intrapartum electronic fetal heart rate monitoring to predict acidemia at birth with the use of deep learning. Am J Obstet Gynecol 2024:S0002-9378(24)00528-3. [PMID: 38663662 DOI: 10.1016/j.ajog.2024.04.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Revised: 04/17/2024] [Accepted: 04/18/2024] [Indexed: 05/01/2024]
Abstract
BACKGROUND Electronic fetal monitoring is used in most US hospital births but has significant limitations in achieving its intended goal of preventing intrapartum hypoxic-ischemic injury. Novel deep learning techniques can improve complex data processing and pattern recognition in medicine. OBJECTIVE This study aimed to apply deep learning approaches to develop and validate a model to predict fetal acidemia from electronic fetal monitoring data. STUDY DESIGN The database was created using intrapartum electronic fetal monitoring data from 2006 to 2020 from a large, multisite academic health system. Data were divided into training and testing sets with equal distribution of acidemic cases. Several different deep learning architectures were explored. The primary outcome was umbilical artery acidemia, which was investigated at 4 clinically meaningful thresholds: 7.20, 7.15, 7.10, and 7.05, along with base excess. The receiver operating characteristic curves were generated with the area under the receiver operating characteristic assessed to determine the performance of the models. External validation was performed using a publicly available Czech database of electronic fetal monitoring data. RESULTS A total of 124,777 electronic fetal monitoring files were available, of which 77,132 had <30% missingness in the last 60 minutes of the electronic fetal monitoring tracing. Of these, 21,041 were matched to a corresponding umbilical cord gas result, of which 10,182 were time-stamped within 30 minutes of the last electronic fetal monitoring reading and composed the final dataset. The prevalence rates of the outcomes in the data were 20.9% with a pH of <7.2, 9.1% with a pH of <7.15, 3.3% with a pH of <7.10, and 1.3% with a pH of <7.05. The best performing model achieved an area under the receiver operating characteristic of 0.85 at a pH threshold of <7.05. When predicting the joint outcome of both pH of <7.05 and base excess of less than -10 meq/L, an area under the receiver operating characteristic of 0.89 was achieved. When predicting both pH of <7.20 and base excess of less than -10 meq/L, an area under the receiver operating characteristic of 0.87 was achieved. At a pH of <7.15 and a positive predictive value of 30%, the model achieved a sensitivity of 90% and a specificity of 48%. CONCLUSION The application of deep learning methods to intrapartum electronic fetal monitoring analysis achieves promising performance in predicting fetal acidemia. This technology could help improve the accuracy and consistency of electronic fetal monitoring interpretation.
Collapse
Affiliation(s)
- Jennifer A McCoy
- Maternal Fetal Medicine Research Program, Department of Obstetrics and Gynecology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA.
| | - Lisa D Levine
- Maternal Fetal Medicine Research Program, Department of Obstetrics and Gynecology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | - Guangya Wan
- School of Data Science, University of Virginia, Charlottesville, VA
| | | | - Joseph Teel
- Department of Family Medicine and Community Health, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | - William G La Cava
- Computational Health Informatics Program, Department of Pediatrics, Boston Children's Hospital, Harvard Medical School, Boston, MA
| |
Collapse
|
2
|
Francis F, Luz S, Wu H, Stock SJ, Townsend R. Machine learning on cardiotocography data to classify fetal outcomes: A scoping review. Comput Biol Med 2024; 172:108220. [PMID: 38489990 DOI: 10.1016/j.compbiomed.2024.108220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 02/02/2024] [Accepted: 02/25/2024] [Indexed: 03/17/2024]
Abstract
INTRODUCTION Uterine contractions during labour constrict maternal blood flow and oxygen delivery to the developing baby, causing transient hypoxia. While most babies are physiologically adapted to withstand such intrapartum hypoxia, those exposed to severe hypoxia or with poor physiological reserves may experience neurological injury or death during labour. Cardiotocography (CTG) monitoring was developed to identify babies at risk of hypoxia by detecting changes in fetal heart rate (FHR) patterns. CTG monitoring is in widespread use in intrapartum care for the detection of fetal hypoxia, but the clinical utility is limited by a relatively poor positive predictive value (PPV) of an abnormal CTG and significant inter and intra observer variability in CTG interpretation. Clinical risk and human factors may impact the quality of CTG interpretation. Misclassification of CTG traces may lead to both under-treatment (with the risk of fetal injury or death) or over-treatment (which may include unnecessary operative interventions that put both mother and baby at risk of complications). Machine learning (ML) has been applied to this problem since early 2000 and has shown potential to predict fetal hypoxia more accurately than visual interpretation of CTG alone. To consider how these tools might be translated for clinical practice, we conducted a review of ML techniques already applied to CTG classification and identified research gaps requiring investigation in order to progress towards clinical implementation. MATERIALS AND METHOD We used identified keywords to search databases for relevant publications on PubMed, EMBASE and IEEE Xplore. We used Preferred Reporting Items for Systematic Review and Meta-Analysis for Scoping Reviews (PRISMA-ScR). Title, abstract and full text were screened according to the inclusion criteria. RESULTS We included 36 studies that used signal processing and ML techniques to classify CTG. Most studies used an open-access CTG database and predominantly used fetal metabolic acidosis as the benchmark for hypoxia with varying pH levels. Various methods were used to process and extract CTG signals and several ML algorithms were used to classify CTG. We identified significant concerns over the practicality of using varying pH levels as the CTG classification benchmark. Furthermore, studies needed to be more generalised as most used the same database with a low number of subjects for an ML study. CONCLUSION ML studies demonstrate potential in predicting fetal hypoxia from CTG. However, more diverse datasets, standardisation of hypoxia benchmarks and enhancement of algorithms and features are needed for future clinical implementation.
Collapse
Affiliation(s)
| | | | - Honghan Wu
- Institute of Health Informatics, University College London, UK
| | | | | |
Collapse
|
3
|
Cao Z, Wang G, Xu L, Li C, Hao Y, Chen Q, Li X, Liu G, Wei H. Intelligent antepartum fetal monitoring via deep learning and fusion of cardiotocographic signals and clinical data. Health Inf Sci Syst 2023; 11:16. [PMID: 36950107 PMCID: PMC10025176 DOI: 10.1007/s13755-023-00219-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Accepted: 02/27/2023] [Indexed: 03/21/2023] Open
Abstract
Purpose Cardiotocography (CTG), which measures uterine contraction (UC) and fetal heart rate (FHR), is a crucial tool for assessing fetal health during pregnancy. However, traditional computerized cardiotocography (cCTG) approaches have non-negligible calibration errors in feature extraction and heavily rely on the expertise and prior experience to define diagnostic features from CTG or FHR signals. Although previous works have studied deep learning methods for extracting CTG or FHR features, these methods still neglect the clinical information of pregnant women. Methods In this paper, we proposed a multimodal deep learning architecture (MMDLA) for intelligent antepartum fetal monitoring that is capable of performing automatic CTG feature extraction, fusion with clinical data and classification. The multimodal feature fusion was achieved by concatenating high-level CTG features, which were extracted from preprocessed CTG signals via a convolution neural network (CNN) with six convolution layers and five fully connected layers, and the clinical data of pregnant women. Eventually, light gradient boosting machine (LGBM) was implemented as fetal status assessment classifier. The effectiveness of MMDLA was evaluated using a dataset of 16,355 cases, each of which includes FHR signal, UC signal and pertinent clinical data like maternal age and gestational age. Results With an accuracy of 90.77% and an area under the curve (AUC) value of 0.9201, the multimodal features performed admirably. The data imbalance issue was also effectively resolved by the LGBM classifier, with a normal-F1 value of 0.9376 and an abnormal-F1 value of 0.8223. Conclusion In summary, the proposed MMDLA is conducive to the realization of intelligent antepartum fetal monitoring.
Collapse
Affiliation(s)
- Zhen Cao
- School of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou, 510006 China
| | - Guoqiang Wang
- School of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou, 510006 China
| | - Ling Xu
- School of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou, 510006 China
| | - Chaowei Li
- School of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou, 510006 China
- Nvogene Co., Ltd., Tianjing, China
| | - Yuexing Hao
- Department of Human Centered Design, Cornell University, Ithaca, NY USA
| | - Qinqun Chen
- School of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou, 510006 China
| | - Xia Li
- Guangzhou Medical University Second Affiliated Hospital, Guangzhou, China
| | - Guiqing Liu
- The First Affiliated Hospital, Guangzhou University of Chinese Medicine, Guangzhou, China
| | - Hang Wei
- School of Medical Information Engineering, Guangzhou University of Chinese Medicine, Guangzhou, 510006 China
| |
Collapse
|
4
|
Ma Z, Wang J, Yue J, Lin Y. A homologous and heterogeneous multi-view inter-patient adaptive network for arrhythmia detection. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 241:107740. [PMID: 37567144 DOI: 10.1016/j.cmpb.2023.107740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 07/17/2023] [Accepted: 07/30/2023] [Indexed: 08/13/2023]
Abstract
BACKGROUND AND OBJECTIVE Electrocardiogram (ECG) is a widely used diagnostic tool for arrhythmia assessment in clinical practice. However, current arrhythmia detection algorithms rely heavily on signal-based data, while cardiologists often use image-based data. This discrepancy, combined with individual differences in physiological signals, poses challenges for accurate arrhythmia detection. To address these challenges and improve arrhythmia detection performance, we propose a homologous and heterogeneous multi-view inter-patient adaptive network. METHODS We designed a multi-view representation learning module to capture dynamic and morphological characteristics from ECG signals and electrocardiographic images. Expert knowledge was also elicited to gain internally-invariant characteristics of each category. Finally, we designed a new loss function that aligns the embedding of the source and target domains in the feature space to minimize the negative effects of individual differences. RESULTS Experiments on the MIT-BIH arrhythmia database demonstrate that our proposed method outperforms state-of-the-art baselines in terms of accuracy, positive predictive value, sensitivity and F1-score. These results indicate the effectiveness of our method in accurately detecting arrhythmias. CONCLUSIONS Our homologous and heterogeneous multi-view inter-patient adaptive network successfully addresses the challenges of utilizing both ECG signal and electrocardiographic image data for arrhythmia detection and overcoming individual differences in physiological signals. Our proposed method has the potential to improve early diagnosis and treatment outcomes of arrhythmias in clinical practice.
Collapse
Affiliation(s)
- Zhaoyang Ma
- School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China; Beijing Key Laboratory of Traffic Data Analysis and Mining, Beijing 100044, China.
| | - Jing Wang
- School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China; Beijing Key Laboratory of Traffic Data Analysis and Mining, Beijing 100044, China; CAAC Key Laboratory of Intelligent Passenger Service of Civil Aviation, Beijing 100044, China.
| | - Jinghang Yue
- School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China; Beijing Key Laboratory of Traffic Data Analysis and Mining, Beijing 100044, China.
| | - Youfang Lin
- School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China; Beijing Key Laboratory of Traffic Data Analysis and Mining, Beijing 100044, China; CAAC Key Laboratory of Intelligent Passenger Service of Civil Aviation, Beijing 100044, China.
| |
Collapse
|
5
|
Ben M’Barek I, Jauvion G, Vitrou J, Holmström E, Koskas M, Ceccaldi PF. DeepCTG® 1.0: an interpretable model to detect fetal hypoxia from cardiotocography data during labor and delivery. Front Pediatr 2023; 11:1190441. [PMID: 37397139 PMCID: PMC10311205 DOI: 10.3389/fped.2023.1190441] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 05/30/2023] [Indexed: 07/04/2023] Open
Abstract
Introduction Cardiotocography, which consists in monitoring the fetal heart rate as well as uterine activity, is widely used in clinical practice to assess fetal wellbeing during labor and delivery in order to detect fetal hypoxia and intervene before permanent damage to the fetus. We present DeepCTG® 1.0, a model able to predict fetal acidosis from the cardiotocography signals. Materials and methods DeepCTG® 1.0 is based on a logistic regression model fed with four features extracted from the last available 30 min segment of cardiotocography signals: the minimum and maximum values of the fetal heart rate baseline, and the area covered by accelerations and decelerations. Those four features have been selected among a larger set of 25 features. The model has been trained and evaluated on three datasets: the open CTU-UHB dataset, the SPaM dataset and a dataset built in hospital Beaujon (Clichy, France). Its performance has been compared with other published models and with nine obstetricians who have annotated the CTU-UHB cases. We have also evaluated the impact of two key factors on the performance of the model: the inclusion of cesareans in the datasets and the length of the cardiotocography segment used to compute the features fed to the model. Results The AUC of the model is 0.74 on the CTU-UHB and Beaujon datasets, and between 0.77 and 0.87 on the SPaM dataset. It achieves a much lower false positive rate (12% vs. 25%) than the most frequent annotation among the nine obstetricians for the same sensitivity (45%). The performance of the model is slightly lower on the cesarean cases only (AUC = 0.74 vs. 0.76) and feeding the model with shorter CTG segments leads to a significant decrease in its performance (AUC = 0.68 with 10 min segments). Discussion Although being relatively simple, DeepCTG® 1.0 reaches a good performance: it compares very favorably to clinical practice and performs slightly better than other published models based on similar approaches. It has the important characteristic of being interpretable, as the four features it is based on are known and understood by practitioners. The model could be improved further by integrating maternofetal clinical factors, using more advanced machine learning or deep learning approaches and having a more robust evaluation of the model based on a larger dataset with more pathological cases and covering more maternity centers.
Collapse
Affiliation(s)
- Imane Ben M’Barek
- Department of Gynecology Obstetrics, Assistance Publique des Hôpitaux de Paris -Beaujon, Clichy, France
- Health Simulation Department, iLumens, Université Paris Cité, Paris, France
| | | | - Juliette Vitrou
- Department of Gynecology Obstetrics, Assistance Publique des Hôpitaux de Paris -Beaujon, Clichy, France
| | - Emilia Holmström
- Department of Gynecology Obstetrics, Assistance Publique des Hôpitaux de Paris -Beaujon, Clichy, France
| | - Martin Koskas
- Department of Gynecology-Obstetrics and Reproduction, Assistance Publique des Hôpitaux de Paris -Bichat, Paris, France
| | | |
Collapse
|
6
|
Ben M'Barek I, Jauvion G, Ceccaldi P. Computerized cardiotocography analysis during labor - A state-of-the-art review. Acta Obstet Gynecol Scand 2022; 102:130-137. [PMID: 36541016 PMCID: PMC9889319 DOI: 10.1111/aogs.14498] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 12/01/2022] [Accepted: 12/02/2022] [Indexed: 12/24/2022]
Abstract
Cardiotocography is defined as the recording of fetal heart rate and uterine contractions and is widely used during labor as a screening tool to determine fetal wellbeing. The visual interpretation of the cardiotocography signals by the practitioners, following common guidelines, is subject to a high interobserver variability, and the efficiency of cardiotocography monitoring is still debated. Since the 1990s, researchers and practitioners work on designing reliable computer-aided systems to assist practitioners in cardiotocography interpretation during labor. Several systems are integrated in the monitoring devices, mostly based on the guidelines, but they have not clearly demonstrated yet their usefulness. In the last decade, the availability of large clinical databases as well as the emergence of machine learning and deep learning methods in healthcare has led to a surge of studies applying those methods to cardiotocography signals analysis. The state-of-the-art systems perform well to detect fetal hypoxia when evaluated on retrospective cohorts, but several challenges remain to be tackled before they can be used in clinical practice. First, the development and sharing of large, open and anonymized multicentric databases of perinatal and cardiotocography data during labor is required to build more accurate systems. Also, the systems must produce interpretable indicators along with the prediction of the risk of fetal hypoxia in order to be appropriated and trusted by practitioners. Finally, common standards should be built and agreed on to evaluate and compare those systems on retrospective cohorts and to validate their use in clinical practice.
Collapse
Affiliation(s)
- Imane Ben M'Barek
- Department of Obstetrics and GynecologyAssistance Publique Hôpitaux de Paris – Hôpital BeaujonClichy La GarenneFrance,Université Paris CitéParisFrance,Health Simulation Department, iLumensUniversité Paris CitéParisFrance
| | | | - Pierre‐François Ceccaldi
- Université Paris CitéParisFrance,Health Simulation Department, iLumensUniversité Paris CitéParisFrance,Department of Gynecology‐Obstetrics and Reproductive MedicineHôpital FochSuresnesFrance
| |
Collapse
|
7
|
Zhang Y, Deng Y, Zhou Z, Zhang X, Jiao P, Zhao Z. Multimodal learning for fetal distress diagnosis using a multimodal medical information fusion framework. Front Physiol 2022; 13:1021400. [DOI: 10.3389/fphys.2022.1021400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 10/25/2022] [Indexed: 11/09/2022] Open
Abstract
Cardiotocography (CTG) monitoring is an important medical diagnostic tool for fetal well-being evaluation in late pregnancy. In this regard, intelligent CTG classification based on Fetal Heart Rate (FHR) signals is a challenging research area that can assist obstetricians in making clinical decisions, thereby improving the efficiency and accuracy of pregnancy management. Most existing methods focus on one specific modality, that is, they only detect one type of modality and inevitably have limitations such as incomplete or redundant source domain feature extraction, and poor repeatability. This study focuses on modeling multimodal learning for Fetal Distress Diagnosis (FDD); however, exists three major challenges: unaligned multimodalities; failure to learn and fuse the causality and inclusion between multimodal biomedical data; modality sensitivity, that is, difficulty in implementing a task in the absence of modalities. To address these three issues, we propose a Multimodal Medical Information Fusion framework named MMIF, where the Category Constrained-Parallel ViT model (CCPViT) was first proposed to explore multimodal learning tasks and address the misalignment between multimodalities. Based on CCPViT, a cross-attention-based image-text joint component is introduced to establish a Multimodal Representation Alignment Network model (MRAN), explore the deep-level interactive representation between cross-modal data, and assist multimodal learning. Furthermore, we designed a simple-structured FDD test model based on the highly modal alignment MMIF, realizing task delegation from multimodal model training (image and text) to unimodal pathological diagnosis (image). Extensive experiments, including model parameter sensitivity analysis, cross-modal alignment assessment, and pathological diagnostic accuracy evaluation, were conducted to show our models’ superior performance and effectiveness.
Collapse
|
8
|
Accessing Artificial Intelligence for Fetus Health Status Using Hybrid Deep Learning Algorithm (AlexNet-SVM) on Cardiotocographic Data. SENSORS 2022; 22:s22145103. [PMID: 35890783 PMCID: PMC9319518 DOI: 10.3390/s22145103] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Revised: 06/25/2022] [Accepted: 07/04/2022] [Indexed: 12/22/2022]
Abstract
Artificial intelligence is serving as an impetus in digital health, clinical support, and health informatics for an informed patient’s outcome. Previous studies only consider classification accuracies of cardiotocographic (CTG) datasets and disregard computational time, which is a relevant parameter in a clinical environment. This paper proposes a modified deep neural algorithm to classify untapped pathological and suspicious CTG recordings with the desired time complexity. In our newly developed classification algorithm, AlexNet architecture is merged with support vector machines (SVMs) at the fully connected layers to reduce time complexity. We used an open-source UCI (Machine Learning Repository) dataset of cardiotocographic (CTG) recordings. We divided 2126 CTG recordings into 3 classes (Normal, Pathological, and Suspected), including 23 attributes that were dynamically programmed and fed to our algorithm. We employed a deep transfer learning (TL) mechanism to transfer prelearned features to our model. To reduce time complexity, we implemented a strategy wherein layers in the convolutional base were partially trained to leave others in the frozen states. We used an ADAM optimizer for the optimization of hyperparameters. The presented algorithm also outperforms the leading architectures (RCNNs, ResNet, DenseNet, and GoogleNet) with respect to real-time accuracies, sensitivities, and specificities of 99.72%, 96.67%, and 99.6%, respectively, making it a viable candidate for clinical settings after real-time validation.
Collapse
|