1
|
Matikolaie FS, Tadj C. Machine Learning-Based Cry Diagnostic System for Identifying Septic Newborns. J Voice 2024; 38:963.e1-963.e14. [PMID: 35193790 DOI: 10.1016/j.jvoice.2021.12.021] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 12/28/2021] [Accepted: 12/29/2021] [Indexed: 10/19/2022]
Abstract
BACKGROUND AND OBJECTIVE Processing the newborns' cry audio signal (CAS) provides valuable information about the newborns' condition. This information can be used to diagnose the disease. This article analyzes the CASs of newborns under two months old using machine learning approaches to develop an automatic diagnostic system for identifying septic infants from healthy ones. Septic infants have not been studied in this context. METHODOLOGY The proposed features include Mel frequency cepstral coefficients and the prosodic features of tilt, rhythm, and intensity. The performance of each feature set was evaluated using a collection of classifiers, including Support Vector Machine (SVM), decision tree, and discriminant analysis. We also examined the majority voting method for improving the classification results and feature manipulation and multiple classifier framework, which has not previously been reported in the literature on developing an automatic diagnostic system based on the infant's CAS. We tested our methodology on two datasets of expiration and inspiration episodes of newborns' CASs. RESULTS AND CONCLUSION The framework of the concatenation of all feature sets using quadratic SVM resulted in the best F-score with 86% for the expiration dataset. Furthermore, the framework of tilt feature set with quadratic discriminant with 83.90% resulted in the best F-score for inspiration. We found out that septic infants cry differently than healthy infants through these experiments. Thus, our proposed method can be used as a noninvasive tool for identifying septic infants from healthy ones only based on their CAS.
Collapse
Affiliation(s)
| | - Chakib Tadj
- Department of Electrical Engineering, École De Technologie Supérieure, Montreal, QC, H3C 1K3, Canada
| |
Collapse
|
2
|
Hammoud M, Getahun MN, Baldycheva A, Somov A. Machine learning-based infant crying interpretation. Front Artif Intell 2024; 7:1337356. [PMID: 38390346 PMCID: PMC10882089 DOI: 10.3389/frai.2024.1337356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2023] [Accepted: 01/08/2024] [Indexed: 02/24/2024] Open
Abstract
Crying is an inevitable character trait that occurs throughout the growth of infants, under conditions where the caregiver may have difficulty interpreting the underlying cause of the cry. Crying can be treated as an audio signal that carries a message about the infant's state, such as discomfort, hunger, and sickness. The primary infant caregiver requires traditional ways of understanding these feelings. Failing to understand them correctly can cause severe problems. Several methods attempt to solve this problem; however, proper audio feature representation and classifiers are necessary for better results. This study uses time-, frequency-, and time-frequency-domain feature representations to gain in-depth information from the data. The time-domain features include zero-crossing rate (ZCR) and root mean square (RMS), the frequency-domain feature includes the Mel-spectrogram, and the time-frequency-domain feature includes Mel-frequency cepstral coefficients (MFCCs). Moreover, time-series imaging algorithms are applied to transform 20 MFCC features into images using different algorithms: Gramian angular difference fields, Gramian angular summation fields, Markov transition fields, recurrence plots, and RGB GAF. Then, these features are provided to different machine learning classifiers, such as decision tree, random forest, K nearest neighbors, and bagging. The use of MFCCs, ZCR, and RMS as features achieved high performance, outperforming state of the art (SOTA). Optimal parameters are found via the grid search method using 10-fold cross-validation. Our MFCC-based random forest (RF) classifier approach achieved an accuracy of 96.39%, outperforming SOTA, the scalogram-based shuffleNet classifier, which had an accuracy of 95.17%.
Collapse
Affiliation(s)
- Mohammed Hammoud
- Digital Engineering CREI, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Melaku N Getahun
- Digital Engineering CREI, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Anna Baldycheva
- Engineering Department, University of Exeter, Exeter, United Kingdom
| | - Andrey Somov
- Digital Engineering CREI, Skolkovo Institute of Science and Technology, Moscow, Russia
| |
Collapse
|
3
|
Ozseven T. Infant cry classification by using different deep neural network models and hand-crafted features. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
|
4
|
Khalilzad Z, Tadj C. Using CCA-Fused Cepstral Features in a Deep Learning-Based Cry Diagnostic System for Detecting an Ensemble of Pathologies in Newborns. Diagnostics (Basel) 2023; 13:diagnostics13050879. [PMID: 36900023 PMCID: PMC10000938 DOI: 10.3390/diagnostics13050879] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Revised: 02/14/2023] [Accepted: 02/21/2023] [Indexed: 03/02/2023] Open
Abstract
Crying is one of the means of communication for a newborn. Newborn cry signals convey precious information about the newborn's health condition and their emotions. In this study, cry signals of healthy and pathologic newborns were analyzed for the purpose of developing an automatic, non-invasive, and comprehensive Newborn Cry Diagnostic System (NCDS) that identifies pathologic newborns from healthy infants. For this purpose, Mel-frequency Cepstral Coefficients (MFCC) and Gammatone Frequency Cepstral Coefficients (GFCC) were extracted as features. These feature sets were also combined and fused through Canonical Correlation Analysis (CCA), which provides a novel manipulation of the features that have not yet been explored in the literature on NCDS designs, to the best of our knowledge. All the mentioned feature sets were fed to the Support Vector Machine (SVM) and Long Short-term Memory (LSTM). Furthermore, two Hyperparameter optimization methods, Bayesian and grid search, were examined to enhance the system's performance. The performance of our proposed NCDS was evaluated with two different datasets of inspiratory and expiratory cries. The CCA fusion feature set using the LSTM classifier accomplished the best F-score in the study, with 99.86% for the inspiratory cry dataset. The best F-score regarding the expiratory cry dataset, 99.44%, belonged to the GFCC feature set employing the LSTM classifier. These experiments suggest the high potential and value of using the newborn cry signals in the detection of pathologies. The framework proposed in this study can be implemented as an early diagnostic tool for clinical studies and help in the identification of pathologic newborns.
Collapse
|
5
|
Khalilzad Z, Kheddache Y, Tadj C. An Entropy-Based Architecture for Detection of Sepsis in Newborn Cry Diagnostic Systems. ENTROPY (BASEL, SWITZERLAND) 2022; 24:1194. [PMID: 36141080 PMCID: PMC9498202 DOI: 10.3390/e24091194] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 08/18/2022] [Accepted: 08/22/2022] [Indexed: 06/16/2023]
Abstract
The acoustic characteristics of cries are an exhibition of an infant's health condition and these characteristics have been acknowledged as indicators for various pathologies. This study focused on the detection of infants suffering from sepsis by developing a simplified design using acoustic features and conventional classifiers. The features for the proposed framework were Mel-frequency Cepstral Coefficients (MFCC), Spectral Entropy Cepstral Coefficients (SENCC) and Spectral Centroid Cepstral Coefficients (SCCC), which were classified through K-nearest Neighborhood (KNN) and Support Vector Machine (SVM) classification methods. The performance of the different combinations of the feature sets was also evaluated based on several measures such as accuracy, F1-score and Matthews Correlation Coefficient (MCC). Bayesian Hyperparameter Optimization (BHPO) was employed to tailor the classifiers uniquely to fit each experiment. The proposed methodology was tested on two datasets of expiratory cries (EXP) and voiced inspiratory cries (INSV). The highest accuracy and F-score were 89.99% and 89.70%, respectively. This framework also implemented a novel feature selection method based on Fuzzy Entropy (FE) as a final experiment. By employing FE, the number of features was reduced by more than 40%, whereas the evaluation measures were not hindered for the EXP dataset and were even enhanced for the INSV dataset. Therefore, it was deduced through these experiments that an entropy-based framework is successful for identifying sepsis in neonates and has the advantage of achieving high performance with conventional machine learning (ML) approaches, which makes it a reliable means for the early diagnosis of sepsis in deprived areas of the world.
Collapse
|
6
|
Vaishnavi V, Suveetha Dhanaselvam P. Neonatal cry signal prediction and classification via dense convolution neural network. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-212473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The study of neonatal cry signals is always an interesting topic and still researcher works interminably to develop some module to predict the actual reason for the baby cry. It is really hard to predict the reason for their cry. The main focus of this paper is to develop a Dense Convolution Neural network (DCNN) to predict the cry. The target cry signal is categorized into five class based on their sound as “Eair”, “Eh”, “Neh”, “Heh” and “Owh”. Prediction of these signals helps in the detection of infant cry reason. The audio and speech features (AS Features) were exacted using Mel-Bark frequency cepstral coefficient from the spectrogram cry signal and fed into DCNN network. The systematic DCNN architecture is modelled with modified activation layer to classify the cry signal. The cry signal is collected in different growth phase of the infants and tested in proposed DCNN architecture. The performance of the system is calculated through parameters accuracy, specificity and sensitivity are calculated. The output of proposed system yielded a balanced accuracy of 92.31%. The highest accuracy level 95.31%, highest specificity level 94.58% and highest sensitivity level 93% attain through proposed technique. From this study, it is concluded that the proposed technique is more efficient in detecting cry signal compared to the existing techniques.
Collapse
Affiliation(s)
- V. Vaishnavi
- Department of Electronics and Communication Engineering, St. Peter’s Engineering College, Chennai
| | - P. Suveetha Dhanaselvam
- Department Electrical and Electronics Engineering, Sethu Institute of Technology, Virudhunagar
| |
Collapse
|
7
|
Comparative Spectrographic Analysis of the Newborns' Cry in the Presence of Tight Intrapartum Nuchal Cord vs. Normal using the Neonat App. Preliminary Results. ACTA ACUST UNITED AC 2019; 55:medicina55120779. [PMID: 31835374 PMCID: PMC6956181 DOI: 10.3390/medicina55120779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2019] [Revised: 12/03/2019] [Accepted: 12/05/2019] [Indexed: 11/16/2022]
Abstract
Background and objectives: The objective of this study was to contribute to the evaluation of the newborn (NB) cry as a means of communication and diagnosis. Materials and Methods: The study implied the recording of the spontaneous cry of 101 NBs with no intrapartum events (control sample), and of 72 NBs with nuchal cord (study sample) from the "Bega" University Clinic of Obstetrics-Gynecology and Neonatology of Timisoara, Romania. The sound analysis was based upon: Imagistic highlighting methods, descriptive statistics, and data mining techniques. Results: The differences between the cry of NBs with no intrapartum events and that of NBs affected by nuchal cord are statistically significant regarding the volume unit meter (VUM) (p = 0.0021) and the peak point meter (PPM) (p = 0.041). Conclusions: While clinically there are no differences between the two groups, the cry recorded from the study group (nuchal cord group) shows distinctive characteristics compared to the cry recorded from the control group (eventless intrapartum NBs group).
Collapse
|
8
|
Vignolo L, Albornoz E, Martínez C. Exploring feature extraction methods for infant mood classification. AI COMMUN 2019. [DOI: 10.3233/aic-190620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Leandro D. Vignolo
- Research institute for signals, systems and computational intelligence, sinc(i), FICH – Universidad Nacional del Litoral – CONICET, Santa Fe, Argentina. E-mails: , ,
| | - Enrique M. Albornoz
- Research institute for signals, systems and computational intelligence, sinc(i), FICH – Universidad Nacional del Litoral – CONICET, Santa Fe, Argentina. E-mails: , ,
| | - César E. Martínez
- Research institute for signals, systems and computational intelligence, sinc(i), FICH – Universidad Nacional del Litoral – CONICET, Santa Fe, Argentina. E-mails: , ,
| |
Collapse
|
9
|
Cabon S, Porée F, Simon A, Rosec O, Pladys P, Carrault G. Video and audio processing in paediatrics: a review. Physiol Meas 2019; 40:02TR02. [PMID: 30669130 DOI: 10.1088/1361-6579/ab0096] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
OBJECTIVE Video and sound acquisition and processing technologies have seen great improvements in recent decades, with many applications in the biomedical area. The aim of this paper is to review the overall state of the art of advances within these topics in paediatrics and to evaluate their potential application for monitoring in the neonatal intensive care unit (NICU). APPROACH For this purpose, more than 150 papers dealing with video and audio processing were reviewed. For both topics, clinical applications are described according to the considered cohorts-full-term newborns, infants and toddlers or preterm newborns. Then, processing methods are presented, in terms of data acquisition, feature extraction and characterization. MAIN RESULTS The paper first focuses on the exploitation of video recordings; these began to be automatically processed in the 2000s and we show that they have mainly been used to characterize infant motion. Other applications, including respiration and heart rate estimation and facial analysis, are also presented. Audio processing is then reviewed, with a focus on the analysis of crying. The first studies in this field focused on induced-pain cries and the newest ones deal with spontaneous cries; the analyses are mainly based on frequency features. Then, some papers dealing with non-cry signals are also discussed. SIGNIFICANCE Finally, we show that even if recent improvements in digital video and signal processing allow for increased automation of processing, the context of the NICU makes a fully automated analysis of long recordings problematic. A few proposals for overcoming some of the limitations are given.
Collapse
Affiliation(s)
- S Cabon
- Univ Rennes, CHU Rennes, INSERM, LTSI - UMR 1099, F-35000 Rennes, France. Voxygen, F-22560 Pleumeur-Bodou, France
| | | | | | | | | | | |
Collapse
|
10
|
Ozturk Y, Bizzego A, Esposito G, Furlanello C, Venuti P. Physiological and self-report responses of parents of children with autism spectrum disorder to children crying. RESEARCH IN DEVELOPMENTAL DISABILITIES 2018; 73:31-39. [PMID: 29245046 DOI: 10.1016/j.ridd.2017.12.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/24/2017] [Revised: 11/22/2017] [Accepted: 12/01/2017] [Indexed: 06/07/2023]
Abstract
Little is known about the physiological response of parents of children with Autism Spectrum Disorder (ASD) to crying of children who have already received the diagnosis of ASD. This study aimed to compare cardiac dynamics via Inter-Beat Interval (IBI) and self-reported emotional states of parents of children with ASD and of parents with typically developing (TD) children while listening to crying of children with ASD (ASD cry) and of typically developing children (TD cry). Analyses revealed higher IBI in parents of children with ASD than IBI in parents of TD children while listening to both cry groups; however no differences on self-reported emotional states were observed. Parents of children with ASD were calmer (higher IBI) than parents of TD children while listening to crying. However, ASD cry did not elicit different IBI compared to TD cry. ASD cry and TD cry were differentiated based on parents' self-responses about what they felt during the listening of crying, their physiological responses showed no differences. These results highlight the similarities and differences between self-reported emotional states and physiological responses of parents of children with ASD, and also point to the importance of monitoring parents' physiological responses in addition to their subjective responses.
Collapse
Affiliation(s)
- Yagmur Ozturk
- Department of Brain and Behavioral Science, Psychology Section, University of Pavia, Pavia, Italy
| | - Andrea Bizzego
- FBK - Fondazione Bruno Kessler, Trento, Italy; Department of Information Engineering and Computer Science, University of Trento, Povo, Italy; SKIL Telecom Italia, Trento, Italy
| | - Gianluca Esposito
- Affiliative Behavior and Physiology Lab, Department of Psychology and Cognitive Sciences, University of Trento, Rovereto, Italy; Social and Affiliative Neuroscience Lab, Division of Psychology, Nanyang Technological University, Singapore
| | | | - Paola Venuti
- Observation, Diagnosis and Education, Lab, Department of Psychology and Cognitive Science, University of Trento, Rovereto, Italy.
| |
Collapse
|
11
|
Saraswathy J, Hariharan M, Khairunizam W, Sarojini J, Thiyagar N, Sazali Y, Nisha S. Time–frequency analysis in infant cry classification using quadratic time frequency distributions. Biocybern Biomed Eng 2018. [DOI: 10.1016/j.bbe.2018.05.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
|