1

Cui Q, Zhang X, Zhang Y, Zheng C, Xie L, Yan Y, Wu EQ, Yin E. A simplified adversarial architecture for cross-subject silent speech recognition using electromyography. J Neural Eng 2024; 21:056001. PMID: 39178906. DOI: 10.1088/1741-2552/ad7321.
Abstract
Objective. The decline in the performance of electromyography (EMG)-based silent speech recognition is widely attributed to disparities in speech patterns, articulation habits, and individual physiology among speakers. Feature alignment by learning a discriminative network that resolves domain offsets across speakers is an effective way to address this problem. The prevailing adversarial network, with a branching discriminator specializing in domain discrimination, contributes insufficiently directly to the classifier's categorical predictions. Approach. To this end, we propose a simplified discrepancy-based adversarial network with a streamlined end-to-end structure for EMG-based cross-subject silent speech recognition. Highly aligned features across subjects are obtained by introducing a nuclear-norm Wasserstein discrepancy metric at the back end of the classification network, which can be used for both classification and domain discrimination. Given the low-level and implicitly noisy nature of myoelectric signals, we devise a cascaded adaptive rectification network as the front-end feature extraction network, adaptively reshaping the intermediate feature map with automatically learnable channel-wise thresholds. The resulting features effectively filter out domain-specific information between subjects while retaining the domain-invariant features critical for cross-subject recognition. Main results. A series of sentence-level classification experiments with 100 Chinese sentences demonstrates the efficacy of our method, which achieves an average accuracy of 89.46% on 40 new subjects after training with data from 60 subjects. Notably, our method achieves a 10.07% improvement over the state-of-the-art model when tested on 10 new subjects with 20 subjects used for training, surpassing that model's result even when it is trained on three times as many subjects. Significance. Our study demonstrates the improved classification performance of the proposed adversarial architecture on cross-subject myoelectric signals, a promising prospect for EMG-based interactive speech applications.
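The abstract does not spell out the nuclear-norm Wasserstein discrepancy; as a rough, hypothetical illustration of the nuclear-norm ingredient only, the sketch below compares the nuclear norms of softmax prediction matrices from source- and target-subject batches (the function names, shapes, and use of an absolute difference are assumptions, not the authors' formulation).

```python
import numpy as np

def softmax(logits):
    # Row-wise softmax over class logits.
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def nuclear_norm(mat):
    # Sum of singular values of a (batch x classes) prediction matrix.
    return np.linalg.svd(mat, compute_uv=False).sum()

def nuclear_norm_discrepancy(src_logits, tgt_logits):
    # Absolute difference of the two batches' nuclear norms; a large value
    # suggests the classifier treats the two domains very differently.
    return abs(nuclear_norm(softmax(src_logits)) - nuclear_norm(softmax(tgt_logits)))

rng = np.random.default_rng(0)
src = rng.normal(size=(32, 100))   # 32 samples, 100 sentence classes
tgt = rng.normal(size=(32, 100))
d = nuclear_norm_discrepancy(src, tgt)
print(round(float(d), 4))
```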
Affiliation(s)
- Qiang Cui
- Defense Innovation Institute, Academy of Military Sciences (AMS), Beijing 100071, People's Republic of China
- Intelligent Game and Decision Laboratory, Beijing 100071, People's Republic of China
- Tianjin Artificial Intelligence Innovation Center (TAIIC), Tianjin 300450, People's Republic of China
- Xingyu Zhang
- Defense Innovation Institute, Academy of Military Sciences (AMS), Beijing 100071, People's Republic of China
- Intelligent Game and Decision Laboratory, Beijing 100071, People's Republic of China
- Tianjin Artificial Intelligence Innovation Center (TAIIC), Tianjin 300450, People's Republic of China
- Yakun Zhang
- Defense Innovation Institute, Academy of Military Sciences (AMS), Beijing 100071, People's Republic of China
- Intelligent Game and Decision Laboratory, Beijing 100071, People's Republic of China
- Tianjin Artificial Intelligence Innovation Center (TAIIC), Tianjin 300450, People's Republic of China
- Changyan Zheng
- Defense Innovation Institute, Academy of Military Sciences (AMS), Beijing 100071, People's Republic of China
- High-tech Institute, Weifang 261000, People's Republic of China
- Liang Xie
- Defense Innovation Institute, Academy of Military Sciences (AMS), Beijing 100071, People's Republic of China
- Intelligent Game and Decision Laboratory, Beijing 100071, People's Republic of China
- Tianjin Artificial Intelligence Innovation Center (TAIIC), Tianjin 300450, People's Republic of China
- Ye Yan
- Defense Innovation Institute, Academy of Military Sciences (AMS), Beijing 100071, People's Republic of China
- Intelligent Game and Decision Laboratory, Beijing 100071, People's Republic of China
- Tianjin Artificial Intelligence Innovation Center (TAIIC), Tianjin 300450, People's Republic of China
- Edmond Q Wu
- Department of Automation, Shanghai Jiao Tong University, Shanghai 200240, People's Republic of China
- Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai Jiao Tong University, Shanghai 200240, People's Republic of China
- Shanghai Engineering Research Center of Intelligent Control and Management, Shanghai Jiao Tong University, Shanghai 200240, People's Republic of China
- Erwei Yin
- Defense Innovation Institute, Academy of Military Sciences (AMS), Beijing 100071, People's Republic of China
- Intelligent Game and Decision Laboratory, Beijing 100071, People's Republic of China
- Tianjin Artificial Intelligence Innovation Center (TAIIC), Tianjin 300450, People's Republic of China

2

Kwon J, Hwang J, Sung JE, Im CH. Speech synthesis from three-axis accelerometer signals using conformer-based deep neural network. Comput Biol Med 2024; 182:109090. PMID: 39232406. DOI: 10.1016/j.compbiomed.2024.109090.
Abstract
Silent speech interfaces (SSIs) have emerged as innovative non-acoustic communication methods, and our previous study demonstrated the significant potential of three-axis accelerometer-based SSIs to identify silently spoken words with high classification accuracy. The developed accelerometer-based SSI, with only four accelerometers and a small training dataset, outperformed a conventional surface electromyography (sEMG)-based SSI. In this study, motivated by these promising initial results, we investigated the feasibility of synthesizing spoken speech from three-axis accelerometer signals, aiming to assess the potential of accelerometer-based SSIs for practical silent communication. Nineteen healthy individuals participated in our experiments. Five accelerometers were attached to the face to acquire speech-related facial movements while the participants read 270 Korean sentences aloud. For speech synthesis, we used a convolution-augmented Transformer (Conformer)-based deep neural network to convert the accelerometer signals into a Mel spectrogram, from which an audio waveform was synthesized using HiFi-GAN. To evaluate the quality of the generated Mel spectrograms, ten-fold cross-validation was performed with the Mel cepstral distortion (MCD) as the evaluation metric. An average MCD of 5.03 ± 0.65 was achieved using the four accelerometer positions optimized in our previous study. Furthermore, the quality of the generated Mel spectrograms was significantly enhanced by adding one more accelerometer under the chin, achieving an average MCD of 4.86 ± 0.65 (p < 0.001, Wilcoxon signed-rank test). Although an objective comparison is difficult, these results surpass those obtained with conventional SSIs based on sEMG, electromagnetic articulography, and electropalatography, while using the fewest sensors and a similar or smaller number of sentences to train the model. Our proposed approach will contribute to the widespread adoption of accelerometer-based SSIs, leveraging the advantages of accelerometers such as low power consumption, invulnerability to physiological artifacts, and high portability.
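The Mel cepstral distortion used as the evaluation metric here has a standard closed form; a minimal sketch (excluding the energy coefficient, a common convention that the abstract does not state explicitly):

```python
import numpy as np

def mel_cepstral_distortion(ref_mcep, syn_mcep):
    """Frame-averaged MCD in dB between two time-aligned mel-cepstral
    sequences of shape (frames, coeffs); coefficient 0 (energy) is
    conventionally excluded, which we assume here."""
    diff = ref_mcep[:, 1:] - syn_mcep[:, 1:]
    const = 10.0 * np.sqrt(2.0) / np.log(10.0)   # ≈ 6.142 dB
    return const * np.mean(np.sqrt(np.sum(diff ** 2, axis=1)))

ref = np.zeros((4, 25))
syn = np.zeros((4, 25))
syn[:, 1] = 1.0   # one coefficient off by 1 in every frame
print(round(mel_cepstral_distortion(ref, syn), 4))
```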
Affiliation(s)
- Jinuk Kwon
- Department of Electronic Engineering, Hanyang University, Seoul, South Korea
- Jihun Hwang
- Department of Electronic Engineering, Hanyang University, Seoul, South Korea
- Jee Eun Sung
- Department of Communication Disorders, Ewha Womans University, Seoul, South Korea
- Chang-Hwan Im
- Department of Electronic Engineering, Hanyang University, Seoul, South Korea; Department of Biomedical Engineering, Hanyang University, Seoul, South Korea; Department of Artificial Intelligence, Hanyang University, Seoul, South Korea; Department of HY-KIST Bio-Convergence, Hanyang University, Seoul, South Korea

3

Elbourhamy DM. Automated sentiment analysis of visually impaired students' audio feedback in virtual learning environments. PeerJ Comput Sci 2024; 10:e2143. PMID: 38983237. PMCID: PMC11232573. DOI: 10.7717/peerj-cs.2143.
Abstract
This research introduces an intelligent model for predicting and analyzing sentiment in audio feedback from students with visual impairments in a virtual learning environment. Sentiment is divided into five classes: highly positive, positive, neutral, negative, and highly negative. The model sources data from educational platforms used after the COVID-19 outbreak (Microsoft Teams) and offers automated evaluation and visualization of audio feedback, which enhances students' performance. It also gives educators better insight into the sentiment of visually impaired e-learning students. Support vector machine (SVM) and artificial neural network (ANN) algorithms were largely successful at using the sentiment responses from the assessment to point out deficiencies in computer literacy and to forecast performance. The model performed well in predicting student performance with ANN algorithms on combined structured and unstructured data, especially by the ninth week, compared with unstructured data alone. Overall, the findings carry inclusive policy implications for educating students with visual impairments and highlight the role of technology in enhancing their learning experience.
Affiliation(s)
- Doaa Mohamed Elbourhamy
- Educational Technology and Computer Department, Faculty of Specific Education, Kafrelshiekh University, Egypt

4

Zhou C, Li X, Feng F, Zhang J, Lyu H, Wu W, Tang X, Luo B, Li D, Xiang W, Yao D. Inter-patient ECG heartbeat classification for arrhythmia classification: a new approach of multi-layer perceptron with weight capsule and sequence-to-sequence combination. Front Physiol 2023; 14:1247587. PMID: 37841320. PMCID: PMC10569428. DOI: 10.3389/fphys.2023.1247587.
Abstract
Objective: The objective of this research is to construct a method that alleviates the problem of sample imbalance in classification, especially arrhythmia classification, and improves model performance without using data augmentation. Methods: We developed a new multi-layer perceptron (MLP) block and used a Weight Capsule (WCapsule) network with MLP combined with a sequence-to-sequence (Seq2Seq) network to classify arrhythmias. Our work is based on the MIT-BIH arrhythmia database; the original electrocardiogram (ECG) data are classified according to the criteria recommended by the Association for the Advancement of Medical Instrumentation (AAMI). Results: The proposed model is evaluated using the inter-patient paradigm and shows an accuracy (ACC) of 99.88% under sample imbalance. For class N, sensitivity (SEN) is 99.79%, positive predictive value (PPV) is 99.90%, and specificity (SPEC) is 99.19%. For class S, SEN is 97.66%, PPV is 96.14%, and SPEC is 99.85%. For class V, SEN is 99.97%, PPV is 99.07%, and SPEC is 99.94%. For class F, SEN is 97.94%, PPV is 98.70%, and SPEC is 99.99%. When using only half of the training samples, the SEN of classes N and V is 0.97% and 5.27% higher, respectively, than that of a traditional machine learning algorithm. Conclusion: The proposed method, which combines MLP and a weight capsule network with a Seq2Seq network, effectively addresses sample imbalance in arrhythmia classification and performs well. It also shows promising potential with fewer samples.
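The per-class SEN, PPV, and SPEC values reported above all derive from a confusion matrix; a generic sketch with toy numbers (not the paper's data):

```python
import numpy as np

def class_metrics(cm):
    """Per-class sensitivity, positive predictive value, and specificity
    from a confusion matrix cm[true, predicted]."""
    cm = np.asarray(cm, dtype=float)
    tp = np.diag(cm)
    fn = cm.sum(axis=1) - tp          # missed members of each class
    fp = cm.sum(axis=0) - tp          # false alarms for each class
    tn = cm.sum() - tp - fn - fp
    return tp / (tp + fn), tp / (tp + fp), tn / (tn + fp)

# Toy 3-class example (rows: true N, S, V; columns: predicted).
cm = [[90,  5,  5],
      [ 2, 45,  3],
      [ 1,  4, 45]]
sen, ppv, spec = class_metrics(cm)
print(np.round(sen, 3))   # per-class sensitivity
```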
Affiliation(s)
- Chenchen Zhou
- Key Laboratory of Electronic and Information Engineering, State Ethnic Affairs Commission, Southwest Minzu University, Chengdu, China
- Guangxi Key Laboratory of Digital Infrastructure, Guangxi Information Center, Nanning, China
- Xiangkui Li
- Key Laboratory of Electronic and Information Engineering, State Ethnic Affairs Commission, Southwest Minzu University, Chengdu, China
- School of Computer Science and Technology, Harbin University of Science and Technology, Harbin, China
- Fan Feng
- Guangxi Key Laboratory of Digital Infrastructure, Guangxi Information Center, Nanning, China
- Jian Zhang
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
- He Lyu
- Key Laboratory of Electronic and Information Engineering, State Ethnic Affairs Commission, Southwest Minzu University, Chengdu, China
- Weixuan Wu
- Key Laboratory of Electronic and Information Engineering, State Ethnic Affairs Commission, Southwest Minzu University, Chengdu, China
- Xuezhi Tang
- Key Laboratory of Electronic and Information Engineering, State Ethnic Affairs Commission, Southwest Minzu University, Chengdu, China
- Bin Luo
- Sichuan Huhui Software Co., Ltd., Mianyang, China
- Dong Li
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
- Med-X Center for Informatics, Sichuan University, Chengdu, China
- Wei Xiang
- Key Laboratory of Electronic and Information Engineering, State Ethnic Affairs Commission, Southwest Minzu University, Chengdu, China
- Dengju Yao
- School of Computer Science and Technology, Harbin University of Science and Technology, Harbin, China

5

Cao B, Ravi S, Sebkhi N, Bhavsar A, Inan OT, Xu W, Wang J. MagTrack: a wearable tongue motion tracking system for silent speech interfaces. J Speech Lang Hear Res 2023; 66:3206-3221. PMID: 37146629. PMCID: PMC10555459. DOI: 10.1044/2023_jslhr-22-00319.
Abstract
Purpose: Current electromagnetic tongue tracking devices are not amenable to daily use and are thus unsuitable for silent speech interfaces and other applications. We recently developed MagTrack, a novel wearable electromagnetic articulograph for tongue tracking. This study aimed to validate MagTrack for potential silent speech interface applications. Method: We conducted two experiments: (a) classification of eight isolated vowels in consonant-vowel-consonant form and (b) continuous silent speech recognition. In these experiments, we used data from healthy adult speakers collected with MagTrack. Vowel classification performance was measured by accuracy, and continuous silent speech recognition by phoneme error rate. The performance was then compared with results using data collected with a commercial electromagnetic articulograph in a prior study. Results: Isolated vowel classification using MagTrack achieved an average accuracy of 89.74% when leveraging all MagTrack signals (x, y, z coordinates; orientation; and magnetic signals), which outperformed the accuracy obtained with commercial electromagnetic articulograph data (only y, z coordinates) in our previous study. Continuous speech recognition from two subjects using MagTrack achieved phoneme error rates of 73.92% and 66.73%, respectively; the commercial electromagnetic articulograph achieved 64.53% from the same subject (66.73% using MagTrack data). Conclusions: MagTrack showed results comparable to the commercial electromagnetic articulograph when using the same localized information, and adding raw magnetic signals would improve MagTrack's performance. Our preliminary testing demonstrated MagTrack's potential as a lightweight wearable device for silent speech interfaces. This work also lays the foundation for MagTrack's potential in other applications, including visual feedback-based speech therapy and second language learning.
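Phoneme error rate, the metric used for continuous recognition here, is conventionally the Levenshtein edit distance divided by the reference length; a minimal sketch (the example phone strings are made up):

```python
def edit_distance(ref, hyp):
    # Levenshtein distance over token sequences (substitutions,
    # insertions, and deletions all cost 1), single rolling row.
    d = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        prev, d[0] = d[0], i
        for j, h in enumerate(hyp, 1):
            prev, d[j] = d[j], min(d[j] + 1, d[j - 1] + 1, prev + (r != h))
    return d[-1]

def phoneme_error_rate(ref_phones, hyp_phones):
    # PER = edit distance / reference length, usually reported in percent.
    return 100.0 * edit_distance(ref_phones, hyp_phones) / len(ref_phones)

ref = "HH AH L OW".split()
hyp = "HH AA L OW W".split()
print(round(phoneme_error_rate(ref, hyp), 1))  # 1 substitution + 1 insertion
```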
Affiliation(s)
- Beiming Cao
- Department of Electrical and Computer Engineering, The University of Texas at Austin
- Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin
- Shravan Ravi
- Department of Computer Science, The University of Texas at Austin
- Nordine Sebkhi
- School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta
- Arpan Bhavsar
- School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta
- Omer T. Inan
- School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta
- Wen Xu
- Division of Computer Science, Texas Woman's University, Denton
- Jun Wang
- Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin
- Department of Neurology, The University of Texas at Austin

6

Csapó TG, Gosztolya G, Tóth L, Shandiz AH, Markó A. Optimizing the ultrasound tongue image representation for residual network-based articulatory-to-acoustic mapping. Sensors (Basel) 2022; 22:8601. PMID: 36433196. PMCID: PMC9696288. DOI: 10.3390/s22228601.
Abstract
Within speech processing, articulatory-to-acoustic mapping (AAM) methods can use ultrasound tongue imaging (UTI) as input. (Micro)convex transducers, which provide a wedge-shaped visual image, are most commonly used. However, this image is optimized for visual inspection by the human eye, and the signal is often post-processed by the equipment. With newer ultrasound equipment, it is now possible to access the raw scanline data (i.e., the ultrasound echo return) without any internal post-processing. In this study, we compared the raw scanline representation with the wedge-shaped processed UTI as input to the residual network applied for AAM, and we also investigated the optimal input image size. We found no significant difference between the performance attained using the raw data and the wedge-shaped image extrapolated from it. The optimal pixel size was 64 × 43 for the raw scanline input and 64 × 64 when transformed to a wedge. It is therefore unnecessary to use the full original 64 × 842 pixel raw scanline; a smaller image is sufficient. This allows smaller networks to be built and will benefit the development of session- and speaker-independent methods for practical applications. The target application of AAM systems is a "silent speech interface", which could aid communication for the speech-impaired, in military applications, or in extremely noisy conditions.
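How a wedge image is extrapolated from raw scanlines is not detailed in the abstract; the sketch below shows one plausible nearest-neighbour polar-to-Cartesian conversion (the field of view, apex placement, and output size are assumptions, not the equipment's actual geometry):

```python
import numpy as np

def scanlines_to_wedge(raw, fov_deg=90.0, out_size=64):
    """Nearest-neighbour conversion of raw scanline data (lines x samples)
    to a wedge-shaped image, roughly as a (micro)convex probe displays it.
    The probe apex is placed at the top-centre of the output image."""
    n_lines, n_samples = raw.shape
    half_fov = np.deg2rad(fov_deg) / 2.0
    out = np.zeros((out_size, out_size), dtype=raw.dtype)
    ys, xs = np.mgrid[0:out_size, 0:out_size]
    x = xs - (out_size - 1) / 2.0        # horizontal offset from the apex
    y = ys.astype(float) + 1.0           # depth (shifted to avoid r == 0)
    r_norm = np.hypot(x, y) / np.hypot(x, y).max()
    theta = np.arctan2(x, y)             # angle from the vertical axis
    line_idx = np.round((theta / half_fov + 1) / 2 * (n_lines - 1)).astype(int)
    samp_idx = np.round(r_norm * (n_samples - 1)).astype(int)
    valid = np.abs(theta) <= half_fov    # pixels inside the wedge only
    out[valid] = raw[line_idx[valid], samp_idx[valid]]
    return out

raw = np.random.default_rng(1).random((64, 842))   # 64 scanlines, 842 echoes
wedge = scanlines_to_wedge(raw)
print(wedge.shape)  # (64, 64)
```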
Affiliation(s)
- Tamás Gábor Csapó
- Department of Telecommunications and Media Informatics, Budapest University of Technology and Economics, H-1117 Budapest, Hungary
- Gábor Gosztolya
- ELRN-SZTE Research Group on Artificial Intelligence, H-6720 Szeged, Hungary
- László Tóth
- Institute of Informatics, University of Szeged, H-6720 Szeged, Hungary
- Alexandra Markó
- MTA-ELTE Lendület Lingual Articulation Research Group, H-1088 Budapest, Hungary

7

Wu J, Zhang Y, Xie L, Yan Y, Zhang X, Liu S, An X, Yin E, Ming D. A novel silent speech recognition approach based on parallel inception convolutional neural network and Mel frequency spectral coefficient. Front Neurorobot 2022; 16:971446. PMID: 36119717. PMCID: PMC9478652. DOI: 10.3389/fnbot.2022.971446.
Abstract
Silent speech recognition breaks the limitations of automatic speech recognition when acoustic signals cannot be produced or captured clearly, but it still has a long way to go before being ready for real-life applications. To address this issue, we propose a novel silent speech recognition framework based on surface electromyography (sEMG) signals. In our approach, a new deep learning architecture, the parallel inception convolutional neural network (PICNN), is proposed and implemented in our silent speech recognition system, with six inception modules processing the six channels of sEMG data separately and simultaneously. Meanwhile, Mel frequency spectral coefficients (MFSCs) are employed to extract speech-related sEMG features for the first time. We further design and generate a 100-class dataset containing daily-life assistance demands for elderly and disabled individuals. Experimental results obtained from 28 subjects confirm that our silent speech recognition method outperforms state-of-the-art machine learning algorithms and deep learning architectures, achieving a best recognition accuracy of 90.76%. With sEMG data collected from four new subjects, efficient subject-based transfer learning is conducted to further improve the cross-subject recognition ability of the proposed model. These promising results show that our sEMG-based silent speech recognition system can achieve high recognition accuracy and steady performance in practical applications.
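MFSCs are commonly computed as log mel-filterbank energies, i.e., MFCCs without the final DCT step; a generic sketch (the sampling rate and filter count are illustrative assumptions, not the paper's settings):

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sr):
    # Triangular filters evenly spaced on the mel scale.
    mel_pts = np.linspace(hz_to_mel(0), hz_to_mel(sr / 2), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        l, c, r = bins[i - 1], bins[i], bins[i + 1]
        for k in range(l, c):
            fb[i - 1, k] = (k - l) / max(c - l, 1)   # rising slope
        for k in range(c, r):
            fb[i - 1, k] = (r - k) / max(r - c, 1)   # falling slope
    return fb

def mfsc(frame, sr=1000, n_filters=26):
    # Log mel-filterbank energies of one windowed frame (no DCT,
    # unlike MFCCs). sr = 1000 Hz is an assumed sEMG sampling rate.
    power = np.abs(np.fft.rfft(frame)) ** 2
    fb = mel_filterbank(n_filters, len(frame), sr)
    return np.log(fb @ power + 1e-10)

frame = np.sin(2 * np.pi * 100 * np.arange(256) / 1000.0) * np.hamming(256)
print(mfsc(frame).shape)  # (26,)
```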
Affiliation(s)
- Jinghan Wu
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China
- Tianjin Artificial Intelligence Innovation Center (TAIIC), Tianjin, China
- Yakun Zhang
- Tianjin Artificial Intelligence Innovation Center (TAIIC), Tianjin, China
- Defense Innovation Institute, Academy of Military Sciences (AMS), Beijing, China
- Liang Xie
- Tianjin Artificial Intelligence Innovation Center (TAIIC), Tianjin, China
- Defense Innovation Institute, Academy of Military Sciences (AMS), Beijing, China
- Ye Yan
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China
- Tianjin Artificial Intelligence Innovation Center (TAIIC), Tianjin, China
- Defense Innovation Institute, Academy of Military Sciences (AMS), Beijing, China
- Xu Zhang
- Department of Electronic Science and Technology, University of Science and Technology of China, Hefei, China
- Shuang Liu
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China
- Xingwei An
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China
- Correspondence: Xingwei An
- Erwei Yin
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China
- Tianjin Artificial Intelligence Innovation Center (TAIIC), Tianjin, China
- Defense Innovation Institute, Academy of Military Sciences (AMS), Beijing, China
- Dong Ming
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin, China

8

Cao B, Wisler A, Wang J. Speaker adaptation on articulation and acoustics for articulation-to-speech synthesis. Sensors (Basel) 2022; 22:6056. PMID: 36015817. PMCID: PMC9416444. DOI: 10.3390/s22166056.
Abstract
Silent speech interfaces (SSIs) convert non-audio bio-signals, such as articulatory movement, to speech. This technology has the potential to recover the speech ability of individuals who have lost their voice but can still articulate (e.g., laryngectomees). Articulation-to-speech (ATS) synthesis is an SSI design with the advantages of easy implementation and low latency, and it is therefore becoming more popular. Current ATS studies focus on speaker-dependent (SD) models to avoid the large variations in articulatory patterns and acoustic features across speakers. However, these designs are limited by the small amount of data available from individual speakers. Speaker-adaptation designs that include multiple speakers' data can address this limitation, yet few prior studies have investigated their performance in ATS. In this paper, we investigated speaker adaptation on both the input articulation and the output acoustic signals (with and without direct inclusion of data from test speakers) using a publicly available electromagnetic articulography (EMA) dataset. We used Procrustes matching for articulation adaptation and voice conversion for voice adaptation. ATS model performance was measured objectively by mel-cepstral distortion (MCD); the synthetic speech samples were generated and are provided in the supplementary material. The results demonstrate the improvement brought by both Procrustes matching and voice conversion to speaker-independent ATS. With direct inclusion of target-speaker data in training, speaker-adaptive ATS achieved performance comparable to speaker-dependent ATS. To our knowledge, this is the first study to demonstrate that speaker-adaptive ATS can achieve performance not statistically different from that of speaker-dependent ATS.
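Procrustes matching, used here for articulation adaptation, aligns one speaker's sensor positions onto another's with a rigid transform; a minimal orthogonal-Procrustes sketch (rotation plus translation only; whether the authors also fit scaling is not stated in the abstract):

```python
import numpy as np

def procrustes_align(source, target):
    """Rigid alignment (rotation + translation, no scaling here) of one
    speaker's articulator positions onto another's via orthogonal Procrustes."""
    mu_s, mu_t = source.mean(axis=0), target.mean(axis=0)
    a, b = source - mu_s, target - mu_t
    u, _, vt = np.linalg.svd(a.T @ b)
    rot = u @ vt
    if np.linalg.det(rot) < 0:    # guard against a reflection solution
        u[:, -1] *= -1
        rot = u @ vt
    return (source - mu_s) @ rot + mu_t

rng = np.random.default_rng(2)
target = rng.normal(size=(50, 2))            # e.g. tongue-sensor y/z positions
angle = np.deg2rad(30)
R = np.array([[np.cos(angle), -np.sin(angle)],
              [np.sin(angle),  np.cos(angle)]])
source = target @ R + np.array([1.0, -2.0])  # rotated + shifted copy
aligned = procrustes_align(source, target)
print(np.allclose(aligned, target, atol=1e-8))  # True
```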
Affiliation(s)
- Beiming Cao
- Department of Electrical and Computer Engineering, University of Texas at Austin, Austin, TX 78712, USA
- Department of Speech, Language, and Hearing Sciences, University of Texas at Austin, Austin, TX 78712, USA
- Alan Wisler
- Department of Mathematics and Statistics, Utah State University, Logan, UT 84322, USA
- Jun Wang
- Department of Speech, Language, and Hearing Sciences, University of Texas at Austin, Austin, TX 78712, USA
- Department of Neurology, Dell Medical School, University of Texas at Austin, Austin, TX 78712, USA

9

Sang Y, Chen X. Human-computer interactive physical education teaching method based on speech recognition engine technology. Front Public Health 2022; 10:941083. PMID: 35923977. PMCID: PMC9339716. DOI: 10.3389/fpubh.2022.941083.
Abstract
With the advent of the era of artificial intelligence, speech recognition engine technology has had a profound impact on social production, daily life, education, and other fields. Voice interaction is the most basic and practical type of human-computer interaction. To build an intelligent, automated physical education teaching mode, this paper combines human-computer interaction based on speech recognition technology with physical education teaching. Students provide input through voice signals, and the system receives, analyzes, and recognizes the signals and feeds information back to the students in multiple forms. To process the external speech signal, the system extracts speech information using the Mel cepstral coefficient algorithm. Comparing the speech recognition rate and anti-noise rate of a hidden Markov model, a probabilistic statistical neural network, and a hybrid model (a combination of the two), the speech recognition engine adopts the hybrid model, whose recognition rate is 98.3% and whose average anti-noise rate reaches 85%. Compared with traditional teaching, the human-computer interactive physical education method is superior in the acquisition of physical knowledge and skills, satisfaction with physical education courses, and active learning ability. It effectively resolves the drawbacks of traditional physical education and uses human-computer interaction technology rationally: without departing from the principles of physical education, it diversifies instruction, improves teaching quality, and strengthens students' individual development and autonomous learning ability. The combination of human-computer interaction and physical education based on recognition engine technology is therefore a trend in today's physical education development.
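The HMM recognizer compared in this abstract scores observation sequences with the forward algorithm; a toy discrete-observation sketch (the model parameters are made up, not the paper's acoustic models):

```python
import numpy as np

def forward_log_likelihood(pi, A, B, obs):
    """Log-likelihood of a discrete observation sequence under an HMM
    with initial probs pi, transition matrix A (rows: from-state), and
    emission matrix B, using the scaled forward algorithm."""
    alpha = pi * B[:, obs[0]]
    loglik = np.log(alpha.sum())
    alpha /= alpha.sum()
    for o in obs[1:]:
        alpha = (alpha @ A) * B[:, o]
        s = alpha.sum()
        loglik += np.log(s)
        alpha /= s        # rescale to avoid underflow on long sequences
    return loglik

# Toy 2-state, 3-symbol model; a word recognizer would score each
# candidate word's HMM and pick the highest likelihood.
pi = np.array([0.6, 0.4])
A = np.array([[0.7, 0.3], [0.4, 0.6]])
B = np.array([[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]])
obs = [0, 1, 2, 1]
print(round(forward_log_likelihood(pi, A, B, obs), 4))
```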
Affiliation(s)
- Yunpeng Sang
- Sport Department, Changshu Institute of Technology, Suzhou, China
- Xingquan Chen
- Physical Education College, Sichuan University, Chengdu, China
- Correspondence: Xingquan Chen

10

Wagner C, Schaffer P, Amini Digehsara P, Bärhold M, Plettemeier D, Birkholz P. Silent speech command word recognition using stepped frequency continuous wave radar. Sci Rep 2022; 12:4192. PMID: 35273225. PMCID: PMC8913675. DOI: 10.1038/s41598-022-07842-9.
Abstract
Recovering speech in the absence of the acoustic speech signal itself, i.e., silent speech, holds great potential for restoring or enhancing oral communication in those who have lost it. Radar is a relatively unexplored silent speech sensing modality, even though it has the advantage of being fully non-invasive. We therefore built custom stepped-frequency continuous-wave radar hardware to measure the changes in the transmission spectra during speech between three antennas, located on both cheeks and the chin, with a measurement update rate of 100 Hz. We then recorded a command-word corpus of 40 phonetically balanced, two-syllable German words and the German digits zero to nine for two individual speakers, and evaluated both the speaker-dependent multi-session and inter-session recognition accuracies on this 50-word corpus using a bidirectional long short-term memory network. We obtained recognition accuracies of 99.17% and 88.87% for the speaker-dependent multi-session and inter-session cases, respectively. These results show that the transmission spectra are very well suited to discriminating individual words from one another, even across sessions, which is one of the key challenges for fully non-invasive silent speech interfaces.
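The bidirectional LSTM used for classification concatenates a forward and a backward pass over the feature sequence; a minimal NumPy sketch of that forward computation with random, untrained weights (the dimensions are illustrative, not the paper's network):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_forward(x_seq, W, U, b):
    """Forward pass of one LSTM layer. W: (4h, d), U: (4h, h), b: (4h,).
    Gate order in the stacked weights: input, forget, cell, output."""
    hidden = U.shape[1]
    h, c = np.zeros(hidden), np.zeros(hidden)
    outputs = []
    for x in x_seq:
        z = W @ x + U @ h + b
        i = sigmoid(z[:hidden])
        f = sigmoid(z[hidden:2 * hidden])
        g = np.tanh(z[2 * hidden:3 * hidden])
        o = sigmoid(z[3 * hidden:])
        c = f * c + i * g
        h = o * np.tanh(c)
        outputs.append(h)
    return np.array(outputs)

def blstm_forward(x_seq, params_fwd, params_bwd):
    # Run the sequence forwards and backwards, concatenate per-step states.
    fwd = lstm_forward(x_seq, *params_fwd)
    bwd = lstm_forward(x_seq[::-1], *params_bwd)[::-1]
    return np.concatenate([fwd, bwd], axis=1)

rng = np.random.default_rng(3)
d, hdim, T = 6, 8, 100   # e.g. 6 spectral features per 100 Hz frame
mk = lambda: (rng.normal(size=(4 * hdim, d)) * 0.1,
              rng.normal(size=(4 * hdim, hdim)) * 0.1,
              np.zeros(4 * hdim))
x = rng.normal(size=(T, d))
y = blstm_forward(x, mk(), mk())
print(y.shape)  # (100, 16)
```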
Affiliation(s)
- Christoph Wagner
- Institute of Acoustics and Speech Communication, Chair for Speech Technology and Cognitive Systems, Technische Universität Dresden, 01069 Dresden, Germany
- Petr Schaffer
- Institute of Communication Technology, Chair of Radio Frequency and Photonics Engineering, Technische Universität Dresden, 01069 Dresden, Germany
- Pouriya Amini Digehsara
- Institute of Acoustics and Speech Communication, Chair for Speech Technology and Cognitive Systems, Technische Universität Dresden, 01069 Dresden, Germany
- Michael Bärhold
- Institute of Communication Technology, Chair of Radio Frequency and Photonics Engineering, Technische Universität Dresden, 01069 Dresden, Germany
- Dirk Plettemeier
- Institute of Communication Technology, Chair of Radio Frequency and Photonics Engineering, Technische Universität Dresden, 01069 Dresden, Germany
- Peter Birkholz
- Institute of Acoustics and Speech Communication, Chair for Speech Technology and Cognitive Systems, Technische Universität Dresden, 01069 Dresden, Germany

11
|
Zhou Z, Tam VWL, Lam EY. A Portable Sign Language Collection and Translation Platform with Smart Watches Using a BLSTM-Based Multi-Feature Framework. MICROMACHINES 2022; 13:mi13020333. [PMID: 35208457 PMCID: PMC8877205 DOI: 10.3390/mi13020333] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Revised: 02/12/2022] [Accepted: 02/17/2022] [Indexed: 11/16/2022]
Abstract
Continuous sign language recognition (CSLR) using different types of sensors to precisely recognize sign language in real time is a very challenging but important research direction in sensor technology. Many previous methods are vision-based, with computationally intensive algorithms to process a large number of image/video frames possibly contaminated with noise, which can result in a large translation delay. On the other hand, gesture-based CSLR relying on hand movement data captured on wearable devices may require fewer computation resources and less translation time, making it more efficient for providing instant translation during real-world communication. However, the insufficient amount of information provided by the wearable sensors often affects the overall performance of such systems. To tackle this issue, we propose a bidirectional long short-term memory (BLSTM)-based multi-feature framework for conducting gesture-based CSLR precisely with two smart watches. In this framework, multiple sets of input features are extracted from the collected gesture data to provide a diverse spectrum of valuable information to the underlying BLSTM model for CSLR. To demonstrate the effectiveness of the proposed framework, we test it on an extremely challenging and radically new dataset of Hong Kong sign language (HKSL), in which hand movement data are collected from 6 individual signers for 50 different sentences. The experimental results reveal that the proposed framework attains a much lower word error rate compared with other existing machine learning or deep learning approaches for gesture-based CSLR. Based on this framework, we further propose a portable sign language collection and translation platform, which can simplify the procedure of collecting gesture-based sign language datasets and recognize sign language from smart watch data in real time, in order to break the communication barrier for sign language users.
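The multi-feature idea in this abstract is simply to derive several complementary feature sets from each window of wearable-sensor data before feeding the sequence model. A minimal sketch, with illustrative feature choices that are not the paper's exact sets:

```python
import numpy as np

def multi_feature_sets(imu):
    """Extract several complementary feature sets from a (T, C) IMU window.
    The specific features below are illustrative placeholders."""
    stats = np.concatenate([imu.mean(0), imu.std(0),
                            imu.min(0), imu.max(0)])   # per-channel summary stats
    deltas = np.diff(imu, axis=0).mean(0)              # mean first difference (trend)
    energy = (imu ** 2).mean(0)                        # per-channel signal energy
    return {"stats": stats, "deltas": deltas, "energy": energy}

# Hypothetical 100-sample window from a 6-axis smartwatch IMU.
window = np.random.default_rng(1).standard_normal((100, 6))
feats = multi_feature_sets(window)
```

Each feature set would then be concatenated (or fed as a separate stream) into the BLSTM per time window.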
12
Wang N, Xu H, Xu F, Cheng L. The algorithmic composition for music copyright protection under deep learning and blockchain. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107763] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
13
Dam Deformation Interpretation and Prediction Based on a Long Short-Term Memory Model Coupled with an Attention Mechanism. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11146625] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
An accurate dam deformation prediction model is vital to a dam safety monitoring system, as it helps assess and manage dam risks. Most traditional dam deformation prediction algorithms ignore the interpretation and evaluation of variables and lack qualitative measures. This paper proposes a data processing framework that uses a long short-term memory (LSTM) model coupled with an attention mechanism to predict the deformation response of a dam structure. First, the random forest (RF) model is introduced to assess the relative importance of impact factors and screen input variables. Second, the density-based spatial clustering of applications with noise (DBSCAN) method is used to identify and filter equipment-based abnormal values to reduce the random error in the measurements. Finally, the coupled model is used to focus on important factors in the time dimension in order to obtain more accurate nonlinear prediction results. The results of the case study show that, of all tested methods, the proposed coupled method performed best. In addition, it was found that temperature and water level both have significant impacts on dam deformation and can serve as reliable metrics for dam management.
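The attention coupling described here reduces, in its simplest form, to weighting the LSTM's hidden states across time and summing them into a context vector. A minimal numpy sketch of that step, with an illustrative dot-product scoring function (the paper's exact scoring form is not specified in the abstract):

```python
import numpy as np

def temporal_attention(H, w):
    """Attention over time steps: alpha_t = softmax(w . h_t),
    context = sum_t alpha_t * h_t.  H: (T, d) hidden states, w: (d,) query."""
    scores = H @ w
    alpha = np.exp(scores - scores.max())
    alpha /= alpha.sum()                 # attention weights sum to 1
    context = alpha @ H                  # (d,) weighted summary for prediction
    return context, alpha

rng = np.random.default_rng(2)
H = rng.standard_normal((30, 8))         # 30 monitoring time steps, 8 features
context, alpha = temporal_attention(H, rng.standard_normal(8))
```

The context vector then feeds the final regression layer, letting the model emphasize the time steps most relevant to the deformation response.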
14
Vojtech JM, Chan MD, Shiwani B, Roy SH, Heaton JT, Meltzner GS, Contessa P, De Luca G, Patel R, Kline JC. Surface Electromyography-Based Recognition, Synthesis, and Perception of Prosodic Subvocal Speech. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:2134-2153. [PMID: 33979177 PMCID: PMC8740708 DOI: 10.1044/2021_jslhr-20-00257] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
Purpose This study aimed to evaluate a novel communication system designed to translate surface electromyographic (sEMG) signals from articulatory muscles into speech using a personalized, digital voice. The system was evaluated for word recognition, prosodic classification, and listener perception of synthesized speech. Method sEMG signals were recorded from the face and neck as speakers with (n = 4) and without (n = 4) laryngectomy subvocally recited (silently mouthed) a speech corpus comprising 750 phrases (150 phrases with variable phrase-level stress). Corpus tokens were then translated into speech via personalized voice synthesis (n = 8 synthetic voices) and compared against phrases produced by each speaker when using their typical mode of communication (n = 4 natural voices, n = 4 electrolaryngeal [EL] voices). Naïve listeners (n = 12) evaluated synthetic, natural, and EL speech for acceptability and intelligibility in a visual sort-and-rate task, as well as phrasal stress discriminability via a classification mechanism. Results Recorded sEMG signals were processed to translate sEMG muscle activity into lexical content and categorize variations in phrase-level stress, achieving a mean accuracy of 96.3% (SD = 3.10%) and 91.2% (SD = 4.46%), respectively. Synthetic speech was significantly higher in acceptability and intelligibility than EL speech, also leading to greater phrasal stress classification accuracy, whereas natural speech was rated as the most acceptable and intelligible, with the greatest phrasal stress classification accuracy. Conclusion This proof-of-concept study establishes the feasibility of using subvocal sEMG-based alternative communication not only for lexical recognition but also for prosodic communication in healthy individuals, as well as those living with vocal impairments and residual articulatory function. Supplemental Material https://doi.org/10.23641/asha.14558481.
Affiliation(s)
- James T. Heaton
- Massachusetts General Hospital Department of Surgery, Boston
- Rupal Patel
- VocaliD, Inc., Belmont, MA
- Northeastern University, Boston, MA

15
Zhang R, Guo Z, Meng Y, Wang S, Li S, Niu R, Wang Y, Guo Q, Li Y. Comparison of ARIMA and LSTM in Forecasting the Incidence of HFMD Combined and Uncombined with Exogenous Meteorological Variables in Ningbo, China. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18116174. [PMID: 34200378 PMCID: PMC8201362 DOI: 10.3390/ijerph18116174] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 05/26/2021] [Accepted: 06/03/2021] [Indexed: 11/30/2022]
Abstract
Background: This study intends to identify the best model for predicting the incidence of hand, foot and mouth disease (HFMD) in Ningbo by comparing Autoregressive Integrated Moving Average (ARIMA) and Long Short-Term Memory Neural Network (LSTM) models combined and uncombined with exogenous meteorological variables. Methods: The data of daily HFMD incidence in Ningbo from January 2014 to November 2017 were set as the training set, and the data of December 2017 were set as the test set. ARIMA and LSTM models combined and uncombined with exogenous meteorological variables were adopted to fit the daily incidence of HFMD by using the data of the training set. The forecasting performances of the four fitted models were verified by using the data of the test set. Root mean square error (RMSE) was selected as the main measure to evaluate the performance of the models. Results: The RMSE for multivariate LSTM, univariate LSTM, ARIMA and ARIMAX (Autoregressive Integrated Moving Average Model with Exogenous Input Variables) was 10.78, 11.20, 12.43 and 14.73, respectively. The LSTM model with exogenous meteorological variables has the best performance among the four models, and meteorological variables can increase the prediction accuracy of the LSTM model. For the ARIMA model, exogenous meteorological variables did not increase the prediction accuracy but became an interference factor of the model. Conclusions: Multivariate LSTM is the best among the four models to fit the daily incidence of HFMD in Ningbo. It can provide a scientific method to build an HFMD early warning system, and the methodology can also be applied to other communicable diseases.
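The model comparison here rests entirely on RMSE over the held-out December data. For reference, the metric and the selection step look like this (the forecast numbers below are toy values, not the study's data):

```python
import numpy as np

def rmse(y_true, y_pred):
    """Root mean square error, the study's model-selection metric."""
    y_true, y_pred = np.asarray(y_true, float), np.asarray(y_pred, float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

# Toy hold-out series: two hypothetical forecasts of daily HFMD counts.
observed   = [12, 15, 11, 9, 14]
forecast_a = [11, 14, 12, 10, 13]   # e.g. a multivariate LSTM
forecast_b = [15, 10, 15, 6, 18]    # e.g. a univariate ARIMA
best = min(("A", rmse(observed, forecast_a)),
           ("B", rmse(observed, forecast_b)),
           key=lambda kv: kv[1])    # lower RMSE wins
```

Here forecast A is off by exactly 1 each day, so its RMSE is 1.0 and it is selected.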
Affiliation(s)
- Rui Zhang
- Chinese Center for Disease Control and Prevention, Beijing 102206, China
- Zhen Guo
- Institute of Medical Information and Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China
- Yujie Meng
- Chinese Center for Disease Control and Prevention, Beijing 102206, China
- Songwang Wang
- Chinese Center for Disease Control and Prevention, Beijing 102206, China
- Shaoqiong Li
- Chinese Center for Disease Control and Prevention, Beijing 102206, China
- Ran Niu
- National Institute for Nutrition and Health, Chinese Center for Disease Control and Prevention, Beijing 100050, China
- Yu Wang
- National Institute of Environmental Health, Chinese Center for Disease Control and Prevention, Beijing 100021, China
- Qing Guo
- Chinese Center for Disease Control and Prevention, Beijing 102206, China
- Correspondence: (Q.G.); (Y.L.); Tel.: +86-10-5890-0410 (Q.G.); Fax: +86-10-5890-0445 (Q.G.)
- Yonghong Li
- National Institute of Environmental Health, Chinese Center for Disease Control and Prevention, Beijing 100021, China
- Correspondence: (Q.G.); (Y.L.); Tel.: +86-10-5890-0410 (Q.G.); Fax: +86-10-5890-0445 (Q.G.)

16
Sebkhi N, Bhavsar A, Anderson DV, Wang J, Inan OT. Inertial Measurements for Tongue Motion Tracking Based on Magnetic Localization with Orientation Compensation. IEEE SENSORS JOURNAL 2021; 21:7964-7971. [PMID: 33746627 PMCID: PMC7978385 DOI: 10.1109/jsen.2020.3046469] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Permanent magnet localization (PML) is designed for applications requiring non-line-of-sight motion tracking with millimetric accuracy. Current PML-based tongue tracking is not only impractical for daily use due to the many sensors placed around the mouth, but also requires a large training set of tracer motion. Our method was designed to overcome these shortcomings by generating a local magnetic field and removing the need for the localization to be trained with tracer rotations. An inertial measurement unit (IMU) is used as a tracer that moves in a local magnetic field generated by a magnet strip. The magnetic strength can be optimized to enable the strip to be placed further away from the tracer, thus hidden from view. The tracer is small (6×6×0.8 mm³) to reduce hindrance to natural tongue movements, and the strip is designed to be worn as a neckband. The IMU's magnetometer measures the local magnetic field, which is compensated for the tracer's orientation by using the IMU's accelerometer and gyroscope. The orientation-compensated magnetic measurements are then fed into a localization algorithm that estimates the tracer's 3D position. The objective of this study is to evaluate the tracking accuracy of our method. In an 8×8×5 cm³ volume, positional errors of 1.6 mm (median) and 2.4 mm (third quartile, Q3) were achieved on a tracer being rotated ±50° along both pitch and roll. These results indicate this technology is promising for tongue tracking applications.
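The orientation-compensation step amounts to estimating the tracer's tilt (pitch/roll from the accelerometer and gyroscope) and rotating the magnetometer reading back into a tilt-free frame before localization. A minimal sketch, with an illustrative rotation convention and made-up field values:

```python
import numpy as np

def rot_xy(pitch, roll):
    """Rotation applying roll about x then pitch about y (angles in rad).
    The axis convention here is illustrative, not the paper's."""
    cp, sp = np.cos(pitch), np.sin(pitch)
    cr, sr = np.cos(roll), np.sin(roll)
    Rx = np.array([[1, 0, 0], [0, cr, -sr], [0, sr, cr]])   # roll
    Ry = np.array([[cp, 0, sp], [0, 1, 0], [-sp, 0, cp]])   # pitch
    return Ry @ Rx

def compensate(b_sensor, pitch, roll):
    """Undo the tracer's tilt: rotate the magnetometer reading back to the
    reference frame (inverse of a rotation matrix is its transpose)."""
    return rot_xy(pitch, roll).T @ b_sensor

b_ref = np.array([20.0, -5.0, 43.0])          # local field in reference frame (uT)
pitch, roll = np.deg2rad(40.0), np.deg2rad(-35.0)
b_meas = rot_xy(pitch, roll) @ b_ref          # what the tilted tracer measures
b_rec = compensate(b_meas, pitch, roll)       # recovered tilt-free measurement
```

With the tilt known, the compensated reading matches the untilted field, which is what lets the localization model skip training over tracer rotations.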
Affiliation(s)
- Nordine Sebkhi
- School of Electrical & Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA
- Arpan Bhavsar
- School of Electrical & Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA
- David V Anderson
- School of Electrical & Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA
- Jun Wang
- Department of Speech, Language, and Hearing Sciences, University of Texas at Austin, TX, USA
- Omer T Inan
- School of Electrical & Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA

17
Lee W, Seong JJ, Ozlu B, Shim BS, Marakhimov A, Lee S. Biosignal Sensors and Deep Learning-Based Speech Recognition: A Review. SENSORS (BASEL, SWITZERLAND) 2021; 21:1399. [PMID: 33671282 PMCID: PMC7922488 DOI: 10.3390/s21041399] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Revised: 02/01/2021] [Accepted: 02/12/2021] [Indexed: 11/16/2022]
Abstract
Voice is one of the essential mechanisms for communicating and expressing one's intentions as a human being. There are several causes of voice inability, including disease, accident, vocal abuse, medical surgery, ageing, and environmental pollution, and the risk of voice loss continues to increase. Novel approaches to speech recognition and production are needed because voice loss seriously undermines quality of life and sometimes leads to isolation from society. In this review, we survey mouth interface technologies, which are mouth-mounted devices for speech recognition, production, and volitional control, and the corresponding research to develop artificial mouth technologies based on various sensors, including electromyography (EMG), electroencephalography (EEG), electropalatography (EPG), electromagnetic articulography (EMA), permanent magnet articulography (PMA), gyros, images and 3-axial magnetic sensors, especially with deep learning techniques. We examine in particular the deep learning technologies related to voice recognition, including visual speech recognition and silent speech interfaces, analyze their workflows, and systematize them into a taxonomy. Finally, we discuss methods to solve the communication problems of people with speaking disabilities and future research with respect to deep learning components.
Affiliation(s)
- Wookey Lee
- Biomedical Science and Engineering & Dept. of Industrial Security Governance & IE, Inha University, 100 Inharo, Incheon 22212, Korea
- Jessica Jiwon Seong
- Department of Industrial Security Governance, Inha University, 100 Inharo, Incheon 22212, Korea
- Busra Ozlu
- Biomedical Science and Engineering & Department of Chemical Engineering, Inha University, 100 Inharo, Incheon 22212, Korea
- Bong Sup Shim
- Biomedical Science and Engineering & Department of Chemical Engineering, Inha University, 100 Inharo, Incheon 22212, Korea
- Suan Lee
- School of Computer Science, Semyung University, Jecheon 27136, Korea

18
Automated Atrial Fibrillation Detection using a Hybrid CNN-LSTM Network on Imbalanced ECG Datasets. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2020.102194] [Citation(s) in RCA: 70] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
19
Tang Z, Chai X, Wang Y, Cao S. Gene Regulatory Network Construction Based on a Particle Swarm Optimization of a Long Short-term Memory Network. Curr Bioinform 2020. [DOI: 10.2174/1574893614666191023115224] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
The Gene Regulatory Network (GRN) is a model for studying the function and behavior of genes by treating the genome as a whole, which can reveal the gene expression mechanism. However, due to the dynamics, nonlinearity, and complexity of gene expression data, it is a challenging task to construct a GRN precisely. In the circulating cooling water system, the Slime-Forming Bacteria (SFB) is one of the bacteria that helps to form dirt. In order to explore the microbial fouling mechanism of SFB, constructing a GRN for the fouling-forming genes of SFB is significant.
Objective:
Propose an effective GRN construction method and construct a GRN for the fouling-forming genes of SFB.
Methods:
In this paper, a combination method of Long Short-Term Memory Network (LSTM) and Mean Impact Value (MIV) was applied for GRN reconstruction. Firstly, LSTM was employed to establish a gene expression prediction model. To improve the performance of LSTM, a Particle Swarm Optimization (PSO) was introduced to optimize the weight and learning rate. Then, the MIV was used to infer the regulation among genes. In view of the fouling-forming problem of SFB, we have designed electromagnetic field experiments and transcriptome sequencing experiments to locate the fouling-forming genes and obtain gene expression data.
Results:
In order to test the proposed approach, the proposed method was applied to three datasets: a simulated dataset and two real biology datasets. By comparing with other methods, the experimental results indicate that the proposed method has higher modeling accuracy and it can be used to effectively construct a GRN. At last, a GRN for fouling-forming genes of SFB was constructed using the proposed approach.
Conclusion:
The experiments indicated that the proposed approach can reconstruct a GRN precisely, and compared with other approaches, the proposed approach performs better in extracting the regulations among genes.
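The PSO step in the Methods amounts to a population of candidate settings (here, weight and learning rate) moving toward the best-found minimum of a validation loss. A minimal global-best PSO sketch; the quadratic loss is a toy surrogate for the LSTM's validation error, and all coefficients are conventional defaults rather than the paper's values:

```python
import numpy as np

def pso(loss, dim, n=20, iters=60, lo=-5.0, hi=5.0, seed=3):
    """Minimal global-best particle swarm optimizer."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(lo, hi, (n, dim))             # particle positions
    v = np.zeros((n, dim))                        # particle velocities
    pbest = x.copy()                              # each particle's best position
    pbest_f = np.array([loss(p) for p in x])
    g = pbest[pbest_f.argmin()].copy()            # swarm's best position
    for _ in range(iters):
        r1, r2 = rng.random((n, dim)), rng.random((n, dim))
        v = 0.7 * v + 1.5 * r1 * (pbest - x) + 1.5 * r2 * (g - x)
        x = np.clip(x + v, lo, hi)
        f = np.array([loss(p) for p in x])
        better = f < pbest_f
        pbest[better], pbest_f[better] = x[better], f[better]
        g = pbest[pbest_f.argmin()].copy()
    return g, float(pbest_f.min())

# Toy surrogate loss with optimum at (1, 2), standing in for validation error
# as a function of (initial weight scale, learning rate).
best_x, best_f = pso(lambda p: (p[0] - 1) ** 2 + (p[1] - 2) ** 2, dim=2)
```

In the paper's setting each loss evaluation would be an LSTM training/validation run, so the swarm size and iteration count trade search quality against compute.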
Affiliation(s)
- Zhenhao Tang
- School of Automation Engineering, Northeast Electric Power University, Jilin, China
- Xiangying Chai
- School of Automation Engineering, Northeast Electric Power University, Jilin, China
- Yu Wang
- School of Automation Engineering, Northeast Electric Power University, Jilin, China
- Shengxian Cao
- School of Automation Engineering, Northeast Electric Power University, Jilin, China

20
Kolokas N, Drosou A, Tzovaras D. Text synthesis from keywords: a comparison of recurrent-neural-network-based architectures and hybrid approaches. Neural Comput Appl 2020. [DOI: 10.1007/s00521-019-04435-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
21
Bozec A, Culié D, Poissonnet G, Dassonville O. Current Role of Total Laryngectomy in the Era of Organ Preservation. Cancers (Basel) 2020; 12:cancers12030584. [PMID: 32138168 PMCID: PMC7139381 DOI: 10.3390/cancers12030584] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Revised: 02/26/2020] [Accepted: 02/27/2020] [Indexed: 01/02/2023] Open
Abstract
In this article, we aimed to discuss the role of total laryngectomy (TL) in the management of patients with larynx cancer (LC) in the era of organ preservation. Before the 1990s, TL followed by radiotherapy (RT) was the standard treatment for patients with locally advanced LC. Over the last 30 years, various types of larynx preservation (LP) programs associating induction or concurrent chemotherapy (CT) with RT have been developed, with the aim of treating locally advanced LC patients while preserving the larynx and its functions. Overall, more than two-thirds of patients included in an LP program will not require TL and will preserve a functional larynx. However, despite these advances, the larynx is the only tumor site in the upper aero-digestive tract for which prognosis has not improved during recent decades. Indeed, none of these LP protocols have shown any survival advantage compared to primary radical surgery, and it appears that certain LC patients do not benefit from an LP program. This is the case for patients with T4a LC (extra-laryngeal tumor extension through the thyroid cartilage) or with poor pretreatment laryngeal function, for whom primary TL is still the preferred therapeutic option. Moreover, TL is the standard salvage therapy for patients with recurrent tumor after an LP protocol.
Affiliation(s)
- Alexandre Bozec
- Correspondence: Tel.: +0033-4-92-03-17-66; Fax: +0033-4-92-03-17-64

22
Oh SL, Ng EYK, Tan RS, Acharya UR. Automated beat-wise arrhythmia diagnosis using modified U-net on extended electrocardiographic recordings with heterogeneous arrhythmia types. Comput Biol Med 2018; 105:92-101. [PMID: 30599317 DOI: 10.1016/j.compbiomed.2018.12.012] [Citation(s) in RCA: 82] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Revised: 12/18/2018] [Accepted: 12/18/2018] [Indexed: 01/10/2023]
Abstract
Abnormality of the cardiac conduction system can induce arrhythmia - abnormal heart rhythm - that can frequently lead to other cardiac diseases and complications, and is sometimes life-threatening. These conduction system perturbations can manifest as morphological changes on the surface electrocardiographic (ECG) signal. Assessment of these morphological changes can be challenging and time-consuming, as ECG signal features are often low in amplitude and subtle. The main aim of this study is to develop an automated computer-aided diagnostic (CAD) system that can expedite the process of arrhythmia diagnosis, as an aid to clinicians to provide appropriate and timely intervention to patients. We propose an autoencoder of ECG signals that can diagnose normal sinus beats, atrial premature beats (APB), premature ventricular contractions (PVC), left bundle branch block (LBBB) and right bundle branch block (RBBB). Apart from the first, the rest are morphological beat-to-beat elements that characterize and constitute complex arrhythmia. The novelty of this work lies in how we modified the U-net model to perform beat-wise analysis on heterogeneously segmented ECGs of variable lengths derived from the MIT-BIH arrhythmia database. The proposed system has demonstrated self-learning ability in generating class activation maps, and these generated maps faithfully reflect the cardiac conditions in each ECG cardiac cycle. It has attained a high classification accuracy of 97.32% in diagnosing cardiac conditions, and 99.3% for R peak detection using a ten-fold cross validation strategy. Our developed model can help physicians to screen ECG accurately, potentially resulting in timely intervention of patients with arrhythmia.
Affiliation(s)
- Shu Lih Oh
- Department of Electronics and Computer Engineering, Ngee Ann Polytechnic, Singapore
- Eddie Y K Ng
- School of Mechanical and Aerospace Engineering, Nanyang Technological University, Singapore
- Ru San Tan
- National Heart Centre Singapore, Singapore
- U Rajendra Acharya
- Department of Electronics and Computer Engineering, Ngee Ann Polytechnic, Singapore; Department of Biomedical Engineering, School of Science and Technology, Singapore University of Social Sciences, Singapore; School of Medicine, Faculty of Health and Medical Sciences, Taylor's University, 47500 Subang Jaya, Malaysia

23
Automated diagnosis of arrhythmia using combination of CNN and LSTM techniques with variable length heart beats. Comput Biol Med 2018; 102:278-287. [PMID: 29903630 DOI: 10.1016/j.compbiomed.2018.06.002] [Citation(s) in RCA: 227] [Impact Index Per Article: 37.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2018] [Revised: 06/01/2018] [Accepted: 06/02/2018] [Indexed: 11/22/2022]
Abstract
Arrhythmia is a cardiac conduction disorder characterized by irregular heartbeats. Abnormalities in the conduction system can manifest in the electrocardiographic (ECG) signal. However, it can be challenging and time-consuming to visually assess the ECG signals due to the very low amplitudes. Implementing an automated system in the clinical setting can potentially help expedite diagnosis of arrhythmia and improve accuracy. In this paper, we propose an automated system using a combination of convolutional neural network (CNN) and long short-term memory (LSTM) for diagnosis of normal sinus rhythm, left bundle branch block (LBBB), right bundle branch block (RBBB), atrial premature beats (APB) and premature ventricular contraction (PVC) on ECG signals. The novelty of this work is that we used ECG segments of variable length from the MIT-BIH arrhythmia PhysioBank database. The proposed system demonstrated high classification performance in the handling of variable-length data, achieving an accuracy of 98.10%, sensitivity of 97.50% and specificity of 98.70% using a ten-fold cross validation strategy. Our proposed model can aid clinicians to detect common arrhythmias accurately on routine screening ECG.
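The handling of variable-length segments is the crux here: convolutional filters slide over however many samples a beat contains, and a global pooling step absorbs the remaining length difference before the recurrent/classification stage. A minimal numpy sketch of that front end, with illustrative kernel sizes and random weights rather than the paper's trained CNN:

```python
import numpy as np

def conv1d_valid(x, k):
    """'Valid' 1-D convolution (cross-correlation) of signal x with kernel k."""
    n = len(x) - len(k) + 1
    return np.array([x[i:i + len(k)] @ k for i in range(n)])

def beat_embedding(beat, kernels):
    """Fixed-size embedding of a variable-length ECG beat: convolve each
    kernel over the beat, rectify, then global-average-pool per channel
    (one simple way to absorb differing beat lengths)."""
    return np.array([np.maximum(conv1d_valid(beat, k), 0).mean()
                     for k in kernels])

rng = np.random.default_rng(4)
kernels = [rng.standard_normal(7) for _ in range(8)]       # 8 learned filters
emb_short = beat_embedding(rng.standard_normal(120), kernels)  # short beat
emb_long = beat_embedding(rng.standard_normal(300), kernels)   # longer beat
```

Both beats, despite different lengths, map to embeddings of identical size, which is what lets a downstream LSTM or dense classifier treat them uniformly.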