1. Lee GW, Kim HK. Cluster-Based Pairwise Contrastive Loss for Noise-Robust Speech Recognition. Sensors (Basel) 2024; 24:2573. PMID: 38676191; PMCID: PMC11054889; DOI: 10.3390/s24082573.
Abstract
This paper addresses a joint training approach applied to a pipeline comprising speech enhancement (SE) and automatic speech recognition (ASR) models, where an acoustic tokenizer is included in the pipeline to convey linguistic information from the ASR model to the SE model. The acoustic tokenizer takes the outputs of the ASR encoder and provides pseudo-labels through K-means clustering. To transfer the linguistic information, represented by these pseudo-labels, from the acoustic tokenizer to the SE model, a cluster-based pairwise contrastive (CBPC) loss function is proposed; it is a self-supervised contrastive loss and is combined with an information noise contrastive estimation (infoNCE) loss function. This combined loss function prevents the SE model from overfitting to outlier samples and captures the pronunciation variability among samples with the same pseudo-label. The effectiveness of the proposed CBPC loss function is evaluated on a noisy LibriSpeech dataset by measuring both speech quality scores and the word error rate (WER). The experimental results reveal that the proposed joint training approach using the CBPC loss function achieves a lower WER than conventional joint training approaches. In addition, the speech quality scores of the SE model trained using the proposed approach are shown to be higher than those of the standalone SE model and of SE models trained using conventional joint training approaches. An ablation study is also conducted to investigate the effects of different combinations of loss functions on the speech quality scores and WER; it reveals that the proposed CBPC loss function combined with infoNCE contributes to a reduced WER and an increase in most of the speech quality scores.
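To make the combined objective concrete, the following is a minimal PyTorch-style sketch of an infoNCE term plus a cluster-based pairwise term driven by K-means pseudo-labels. It is an illustration under stated assumptions (L2-normalized encoder embeddings, one pseudo-label per sample), not the authors' implementation; all names are invented.

```python
import torch
import torch.nn.functional as F

def info_nce(anchor, positive, negatives, tau=0.07):
    """Standard infoNCE: pull each anchor toward its positive, away from negatives."""
    pos = F.cosine_similarity(anchor, positive, dim=-1) / tau                 # (B,)
    neg = F.cosine_similarity(anchor.unsqueeze(1), negatives, dim=-1) / tau   # (B, N)
    logits = torch.cat([pos.unsqueeze(1), neg], dim=1)                        # (B, 1+N)
    target = torch.zeros(anchor.size(0), dtype=torch.long)                    # positive is index 0
    return F.cross_entropy(logits, target)

def cluster_pairwise_contrastive(emb, pseudo_labels, tau=0.07):
    """Cluster-based pairwise term: samples sharing a K-means pseudo-label are
    mutual positives; every other sample in the batch acts as a negative."""
    emb = F.normalize(emb, dim=-1)
    sim = emb @ emb.t() / tau                                  # (B, B) similarities
    eye = torch.eye(len(emb), dtype=torch.bool)
    pos_mask = (pseudo_labels.unsqueeze(0) == pseudo_labels.unsqueeze(1)) & ~eye
    log_prob = sim - torch.logsumexp(sim.masked_fill(eye, float('-inf')), dim=1, keepdim=True)
    n_pos = pos_mask.sum(1).clamp(min=1)                       # avoid divide-by-zero rows
    return -(log_prob * pos_mask.float()).sum(1).div(n_pos).mean()

emb = torch.randn(8, 128)               # toy SE-encoder embeddings
labels = torch.randint(0, 3, (8,))      # toy K-means pseudo-labels
loss = cluster_pairwise_contrastive(emb, labels)
```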
Affiliation(s)
- Geon Woo Lee
- AI Graduate School, Gwangju Institute of Science and Technology, Gwangju 61005, Republic of Korea
- Hong Kook Kim
- AI Graduate School, Gwangju Institute of Science and Technology, Gwangju 61005, Republic of Korea
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Republic of Korea
- AunionAI Co., Ltd., Gwangju 61005, Republic of Korea
2. Zhang Z, Tian Y, Zhou T, Zhao Y, Zhang J, Li J. Towards an Environmentally Robust Speech Assistant System for Emergency Medical Services. Stud Health Technol Inform 2024; 310:1071-1075. PMID: 38269979; DOI: 10.3233/shti231129.
Abstract
Automated speech recognition technology with robust performance in various environments is highly needed by emergency clinicians, but there are few successful cases. One main challenge is the wide variety of environmental interference involved in a typical prehospital care emergency service, such as background noise and overlapping speech. To solve this problem, we try to establish an environmentally robust speech assistant system with the help of the proposed personalized speech enhancement (PSE) method, which utilizes the target physician's voiceprint feature to suppress non-target signal components. We demonstrate its potential value on both a general public test set and our real EMS test set by evaluating objective speech quality metrics, DNSMOS, and recognition accuracy. We hope the proposed method will raise EMS efficiency and security against non-target speech.
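The voiceprint-conditioned idea can be sketched as follows: a speaker embedding of the enrolled physician is broadcast over time and concatenated with the mixture's spectral frames, so the predicted mask favors the target voice. This is a generic PSE skeleton under assumed shapes and layer sizes, not the authors' system.

```python
import torch
import torch.nn as nn

class PersonalizedEnhancer(nn.Module):
    """Toy PSE sketch: a d-vector-style speaker embedding of the target talker
    is concatenated with each spectral frame, steering the mask toward the
    enrolled voice."""
    def __init__(self, n_freq=257, emb_dim=128, hidden=256):
        super().__init__()
        self.rnn = nn.LSTM(n_freq + emb_dim, hidden, batch_first=True)
        self.mask = nn.Linear(hidden, n_freq)

    def forward(self, mix_mag, spk_emb):                # (B, T, F), (B, emb_dim)
        cond = spk_emb.unsqueeze(1).expand(-1, mix_mag.size(1), -1)
        h, _ = self.rnn(torch.cat([mix_mag, cond], dim=-1))
        return mix_mag * torch.sigmoid(self.mask(h))    # masked magnitudes

net = PersonalizedEnhancer()
out = net(torch.rand(2, 100, 257), torch.randn(2, 128))   # (2, 100, 257)
```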
Affiliation(s)
- Zhenchuan Zhang
- Research Center for Healthcare Data Science, Zhejiang Lab, Hangzhou, China
- Yu Tian
- Engineering Research Center of EMR and Intelligent Expert System, Ministry of Education, College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China
- Tianshu Zhou
- Research Center for Healthcare Data Science, Zhejiang Lab, Hangzhou, China
- Yinghao Zhao
- Research Center for Healthcare Data Science, Zhejiang Lab, Hangzhou, China
- Jungen Zhang
- Hangzhou Emergency Medical Center of Zhejiang Province, China
- Jingsong Li
- Research Center for Healthcare Data Science, Zhejiang Lab, Hangzhou, China
- Engineering Research Center of EMR and Intelligent Expert System, Ministry of Education, College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China
3. Koh HI, Na S, Kim MN. Speech Perception Improvement Algorithm Based on a Dual-Path Long Short-Term Memory Network. Bioengineering (Basel) 2023; 10:1325. PMID: 38002449; PMCID: PMC10669314; DOI: 10.3390/bioengineering10111325.
Abstract
Current deep learning-based speech enhancement methods focus on enhancing the time-frequency representation of the signal. However, conventional methods can lead to speech damage due to resolution-mismatch problems that emphasize only specific information in the time or frequency domain. To address these challenges, this paper introduces a speech enhancement model designed with a dual-path structure that identifies key speech characteristics in both the time and time-frequency domains. Specifically, the time path aims to model semantic features hidden in the waveform, while the time-frequency path attempts to compensate for the spectral details via a spectral extension block. The two paths enhance temporal and spectral features, respectively, via LSTM-based mask functions, offering a comprehensive approach to speech enhancement. Experimental results show that the proposed dual-path LSTM network consistently outperforms conventional single-domain speech enhancement methods in terms of speech quality and intelligibility.
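A toy skeleton of such a dual-path design is sketched below: one LSTM predicts a mask over framed waveform samples (time path) and another predicts a mask over STFT magnitudes (time-frequency path). The fusion of the two outputs, the spectral extension block, and all sizes are omitted or assumed; this is not the paper's architecture.

```python
import torch
import torch.nn as nn

class DualPathLSTM(nn.Module):
    """Toy dual-path sketch: an LSTM-derived mask over framed waveform samples
    (time path) and another over STFT magnitudes (time-frequency path)."""
    def __init__(self, frame=512, hop=256, hidden=256):
        super().__init__()
        self.frame, self.hop = frame, hop
        self.time_lstm = nn.LSTM(frame, hidden, batch_first=True)
        self.time_mask = nn.Linear(hidden, frame)
        n_freq = frame // 2 + 1
        self.tf_lstm = nn.LSTM(n_freq, hidden, batch_first=True)
        self.tf_mask = nn.Linear(hidden, n_freq)

    def forward(self, wav):                                  # wav: (B, T)
        frames = wav.unfold(1, self.frame, self.hop)         # (B, L, frame)
        h, _ = self.time_lstm(frames)
        t_out = frames * torch.sigmoid(self.time_mask(h))   # masked time path
        spec = torch.stft(wav, self.frame, self.hop,
                          window=torch.hann_window(self.frame),
                          return_complex=True)
        mag = spec.abs().transpose(1, 2)                     # (B, L', n_freq)
        h, _ = self.tf_lstm(mag)
        tf_out = mag * torch.sigmoid(self.tf_mask(h))        # masked T-F path
        return t_out, tf_out                                 # fused downstream

out_t, out_tf = DualPathLSTM()(torch.randn(2, 16000))
```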
Affiliation(s)
- Hyeong Il Koh
- Department of Medical & Biological Engineering, Graduate School, Kyungpook National University, Daegu 41944, Republic of Korea
- Sungdae Na
- Department of Biomedical Engineering, Kyungpook National University Hospital, Daegu 41944, Republic of Korea
- Myoung Nam Kim
- Department of Biomedical Engineering, School of Medicine, Kyungpook National University, Daegu 41944, Republic of Korea
4. Song Y, Madhu N. Investigations on the Optimal Estimation of Speech Envelopes for the Two-Stage Speech Enhancement. Sensors (Basel) 2023; 23:6438. PMID: 37514732; PMCID: PMC10384514; DOI: 10.3390/s23146438.
Abstract
Using the source-filter model of speech production, clean speech signals can be decomposed into an excitation component and an envelope component that is related to the phoneme being uttered. Therefore, restoring the envelope of degraded speech during speech enhancement can improve the intelligibility and quality of the output. As the number of phonemes in spoken speech is limited, they can be adequately represented by a correspondingly limited number of envelopes. This can be exploited to improve the estimation of speech envelopes from a degraded signal in a data-driven manner. The improved envelopes are then used in a second stage to refine the final speech estimate. Envelopes are typically derived from the linear prediction coefficients (LPCs) or from the cepstral coefficients (CCs). The improved envelope is obtained either by mapping the degraded envelope onto pre-trained codebooks (classification approach) or by directly estimating it from the degraded envelope (regression approach). In this work, we first investigate the optimal features for envelope representation and codebook generation by a series of oracle tests. We demonstrate that CCs provide better envelope representation than LPCs. Further, we demonstrate that a unified speech codebook is advantageous compared to the typical codebook that manually splits speech and silence into separate entries. Next, we investigate low-complexity neural network architectures to map degraded envelopes to the optimal codebook entry in practical systems. We confirm that simple recurrent neural networks yield good performance with low complexity and a small number of parameters. We also demonstrate that, with a careful choice of feature and architecture, a regression approach can further improve performance at a lower computational cost. However, as also seen from the oracle tests, the benefit of the two-stage framework is now chiefly limited by the statistical noise floor estimate, leading to only a limited improvement in extremely adverse conditions. This highlights the need for further research on the joint estimation of speech and noise for optimal enhancement.
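As an illustration of the envelope representation discussed here, the sketch below derives a smooth spectral envelope by low-quefrency liftering of the real cepstrum, i.e., keeping only the first few cepstral coefficients; the cutoff and FFT size are arbitrary choices, not the paper's settings.

```python
import numpy as np

def cepstral_envelope(frame, n_ceps=20, n_fft=512):
    """Smooth spectral envelope via low-quefrency liftering of the real cepstrum.
    Keeping only the first n_ceps coefficients discards excitation detail and
    retains the phoneme-related envelope."""
    spec = np.abs(np.fft.rfft(frame, n_fft)) + 1e-12
    ceps = np.fft.irfft(np.log(spec), n_fft)       # real cepstrum
    lifter = np.zeros(n_fft)
    lifter[:n_ceps] = 1.0
    lifter[-(n_ceps - 1):] = 1.0                   # keep symmetric low quefrencies
    env = np.exp(np.fft.rfft(ceps * lifter, n_fft).real)
    return env                                     # (n_fft//2 + 1,) envelope

env = cepstral_envelope(np.random.randn(512))      # envelope of a toy frame
```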
Affiliation(s)
- Yanjue Song
- IDLab, Ghent University-imec, 9000 Gent, Belgium
- Nilesh Madhu
- IDLab, Ghent University-imec, 9000 Gent, Belgium
5. Rascon C. Characterization of Deep Learning-Based Speech-Enhancement Techniques in Online Audio Processing Applications. Sensors (Basel) 2023; 23:4394. PMID: 37177598; PMCID: PMC10181690; DOI: 10.3390/s23094394.
Abstract
Deep learning-based speech-enhancement techniques have recently been an area of growing interest, since their impressive performance can potentially benefit a wide variety of digital voice communication systems. However, such performance has been evaluated mostly in offline audio-processing scenarios (i.e., feeding the model, in one go, a complete audio recording, which may extend several seconds). It is of significant interest to evaluate and characterize the current state of the art in applications that process audio online (i.e., feeding the model a sequence of segments of audio data and concatenating the results at the output end). Although evaluations and comparisons between speech-enhancement techniques have been carried out before, as far as the author knows, the work presented here is the first to evaluate the performance of such techniques in relation to their online applicability. Specifically, this work measures how the output signal-to-interference ratio (as a separation metric), the response time, and the memory usage (as online metrics) are impacted by the input length (the size of the audio segments), in addition to the amount of noise, the amount and number of interferences, and the amount of reverberation. Three popular models, chosen for their availability in public repositories and their online viability, were evaluated: MetricGAN+, Spectral Feature Mapping with Mimic Loss, and Demucs-Denoiser. The characterization was carried out using a systematic evaluation protocol based on the SpeechBrain framework. Several intuitions are presented and discussed, and some recommendations for future work are proposed.
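The online evaluation protocol described above can be approximated with a loop of the following shape, which feeds fixed-size segments to a model, concatenates the outputs, and reports a real-time factor; the identity "model" is a stand-in for an actual enhancer.

```python
import time
import numpy as np

def run_online(model, audio, seg_len, sr=16000):
    """Feed audio to `model` in consecutive segments and time each call;
    real-time operation requires processing time < seg_len / sr."""
    out, times = [], []
    for start in range(0, len(audio) - seg_len + 1, seg_len):
        seg = audio[start:start + seg_len]
        t0 = time.perf_counter()
        out.append(model(seg))                 # enhance one segment
        times.append(time.perf_counter() - t0)
    rtf = np.mean(times) / (seg_len / sr)      # real-time factor (<1 is viable)
    return np.concatenate(out), rtf

enhanced, rtf = run_online(lambda x: x, np.random.randn(16000 * 4), seg_len=4096)
print(f"real-time factor: {rtf:.3f}")
```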
Affiliation(s)
- Caleb Rascon
- Computer Science Department, Instituto de Investigaciones en Matematicas Aplicadas y en Sistemas, Universidad Nacional Autonoma de Mexico, Mexico City 3000, Mexico
6. Chen H, Zhang X. CGA-MGAN: Metric GAN Based on Convolution-Augmented Gated Attention for Speech Enhancement. Entropy (Basel) 2023; 25:628. PMID: 37190416; PMCID: PMC10137386; DOI: 10.3390/e25040628.
Abstract
In recent years, neural networks based on attention mechanisms have seen increasing use in speech recognition, separation, and enhancement, as well as other fields. In particular, the convolution-augmented transformer has performed well, as it can combine the advantages of convolution and self-attention. Recently, the gated attention unit (GAU) was proposed; compared with traditional multi-head self-attention, approaches with GAU are effective and computationally efficient. In this paper, we propose a network for speech enhancement called CGA-MGAN, a kind of MetricGAN based on convolution-augmented gated attention. CGA-MGAN captures local and global correlations in speech signals at the same time by fusing convolution and gated attention units. Experiments on Voice Bank + DEMAND show that our proposed CGA-MGAN model achieves excellent performance (3.47 PESQ, 0.96 STOI, and 11.09 dB SSNR) with a relatively small model size (1.14 M).
Affiliation(s)
- Haozhe Chen
- Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China
- Key Laboratory of Electromagnetic Radiation and Sensing Technology, Chinese Academy of Sciences, Beijing 100190, China
- School of Electronic, Electrical, and Communication Engineering, University of Chinese Academy of Sciences, Beijing 100049, China
- Xiaojuan Zhang
- Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China
- Key Laboratory of Electromagnetic Radiation and Sensing Technology, Chinese Academy of Sciences, Beijing 100190, China
7. Chai S, Guo C, Guan C, Fang L. Deep Learning-Based Speech Enhancement of an Extrinsic Fabry-Perot Interferometric Fiber Acoustic Sensor System. Sensors (Basel) 2023; 23:3574. PMID: 37050634; PMCID: PMC10098526; DOI: 10.3390/s23073574.
Abstract
To achieve high-quality voice communication without noise interference in flammable, explosive, and strong electromagnetic environments, the speech enhancement technology of a fiber-optic extrinsic Fabry-Perot interferometric (EFPI) acoustic sensor based on deep learning is studied in this paper. The combination of a complex-valued convolutional neural network and a long short-term memory (CV-CNN-LSTM) model is proposed for speech enhancement in the EFPI acoustic sensing system. Moreover, the 3 × 3 coupler algorithm is used to demodulate voice signals. Then, the short-time Fourier transform (STFT) spectrogram features of the voice signals are divided into a training set and a test set. The training set is input into the established CV-CNN-LSTM model for model training, and the test set is input into the trained model for testing. The experimental findings reveal that the proposed CV-CNN-LSTM model demonstrates exceptional speech enhancement performance, boasting an average Perceptual Evaluation of Speech Quality (PESQ) score of 3.148. In comparison to the CV-CNN and CV-LSTM models, this innovative model achieves a remarkable PESQ score improvement of 9.7% and 11.4%, respectively. Furthermore, the average Short-Time Objective Intelligibility (STOI) score witnesses significant enhancements of 4.04 and 2.83 when contrasted with the CV-CNN and CV-LSTM models, respectively.
Affiliation(s)
- Shiyi Chai
- School of Science, Hubei University of Technology, Wuhan 430068, China
- Hubei Engineering Technology Research Center of Energy Photoelectric Device and System, Hubei University of Technology, Wuhan 430068, China
- Can Guo
- School of Science, Hubei University of Technology, Wuhan 430068, China
- Hubei Engineering Technology Research Center of Energy Photoelectric Device and System, Hubei University of Technology, Wuhan 430068, China
- Chenggang Guan
- Hubei Engineering Technology Research Center of Energy Photoelectric Device and System, Hubei University of Technology, Wuhan 430068, China
- Li Fang
- School of Science, Hubei University of Technology, Wuhan 430068, China
8. Pandey A, Wang D. Attentive Training: A New Training Framework for Speech Enhancement. IEEE/ACM Trans Audio Speech Lang Process 2023; 31:1360-1370. PMID: 37899765; PMCID: PMC10602021; DOI: 10.1109/taslp.2023.3260711.
Abstract
Dealing with speech interference in a speech enhancement system requires either speaker separation or target speaker extraction. Speaker separation has multiple output streams with arbitrary assignments, while target speaker extraction requires additional cueing for speaker selection. Neither is suitable for a standalone speech enhancement system with one output stream. In this study, we propose a novel training framework, called attentive training, to extend speech enhancement to deal with speech interruptions. Attentive training is based on the observation that, in the real world, multiple talkers are very unlikely to start speaking at the same time; therefore, a deep neural network can be trained to create a representation of the first speaker and utilize it to attend to, or track, that speaker in a multitalker noisy mixture. We present experimental results and comparisons to demonstrate the effectiveness of attentive training for speech enhancement.
Affiliation(s)
- Ashutosh Pandey
- Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210 USA
- DeLiang Wang
- Department of Computer Science and Engineering and the Center for Cognitive and Brain Sciences, The Ohio State University, Columbus, OH 43210 USA
9. Tan K, Mao W, Guo X, Lu H, Zhang C, Cao Z, Wang X. CST: Complex Sparse Transformer for Low-SNR Speech Enhancement. Sensors (Basel) 2023; 23:2376. PMID: 36904579; PMCID: PMC10007472; DOI: 10.3390/s23052376.
Abstract
Speech enhancement for audio with a low SNR is challenging. Existing speech enhancement methods are mainly designed for high-SNR audio and usually use RNNs to model audio sequence features, which prevents the model from learning long-distance dependencies and thus limits its performance in low-SNR speech enhancement tasks. We design a complex transformer module with sparse attention to overcome this problem. Unlike the traditional transformer model, ours is extended to effectively model complex-domain sequences: a sparse attention mask balances the model's attention to long-distance and nearby relations, a pre-layer positional embedding module enhances the model's perception of position information, and a channel attention module enables the model to dynamically adjust the weight distribution between channels according to the input audio. The experimental results show that, in low-SNR speech enhancement tests, our models achieve noticeable improvements in speech quality and intelligibility.
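A sparse attention mask of the kind described, balancing nearby and long-distance relations, can be built as a banded-plus-strided boolean pattern; the band width and stride below are illustrative, not the paper's values.

```python
import numpy as np

def sparse_attention_mask(seq_len, local_width=8, stride=32):
    """Boolean mask: True where attention is allowed. Each frame attends to a
    local band of neighbors (nearby relations) plus every stride-th frame
    (long-distance relations), instead of the full seq_len x seq_len grid."""
    idx = np.arange(seq_len)
    local = np.abs(idx[:, None] - idx[None, :]) <= local_width
    strided = (idx[None, :] % stride) == 0
    return local | strided

mask = sparse_attention_mask(128)
print(mask.mean())   # fraction of attention links retained
```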
Affiliation(s)
- Kaijun Tan
- Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China
- University of Chinese Academy of Sciences, Beijing 100089, China
- Wenyu Mao
- Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China
- Chinese Association of Artificial Intelligence, Beijing 100876, China
- Xiaozhou Guo
- Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China
- University of Chinese Academy of Sciences, Beijing 100089, China
- Huaxiang Lu
- Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China
- University of Chinese Academy of Sciences, Beijing 100089, China
- Materials and Optoelectronics Research Center, University of Chinese Academy of Sciences, Beijing 100083, China
- College of Microelectronics, University of Chinese Academy of Sciences, Beijing 100083, China
- Semiconductor Neural Network Intelligent Perception and Computing Technology Beijing Key Laboratory, Beijing 100083, China
- Chi Zhang
- Nanjing Research Institute of Information Technology, Nanjing 210009, China
- Zhanzhong Cao
- Nanjing Research Institute of Information Technology, Nanjing 210009, China
- Xingang Wang
- Nanjing Research Institute of Information Technology, Nanjing 210009, China
10. Ye M, Wan H. Improved Transformer-Based Dual-Path Network with Amplitude and Complex Domain Feature Fusion for Speech Enhancement. Entropy (Basel) 2023; 25:228. PMID: 36832595; PMCID: PMC9955017; DOI: 10.3390/e25020228.
Abstract
Most previous speech enhancement methods predict only amplitude features, but more and more studies have shown that phase information is crucial for speech quality. Recently, some methods have adopted complex features, but complex masks are difficult to estimate. Removing noise while maintaining good speech quality at low signal-to-noise ratios remains a problem. This study proposes a dual-path network structure for speech enhancement that models complex spectra and amplitudes simultaneously, and introduces an attention-aware feature fusion module to fuse the two features and facilitate overall spectrum recovery. In addition, we improve a transformer-based feature extraction module that can efficiently extract local and global features. The proposed network achieves better performance than the baseline models in experiments on the Voice Bank + DEMAND dataset. We also conducted ablation experiments to verify the effectiveness of the dual-path structure, the improved transformer, and the fusion module, and investigated the effect of the input-mask multiplication strategy on the results.
11. Zheng C, Zhang H, Liu W, Luo X, Li A, Li X, Moore BCJ. Sixty Years of Frequency-Domain Monaural Speech Enhancement: From Traditional to Deep Learning Methods. Trends Hear 2023; 27:23312165231209913. PMID: 37956661; PMCID: PMC10658184; DOI: 10.1177/23312165231209913.
Abstract
Frequency-domain monaural speech enhancement has been extensively studied for over 60 years, and a great number of methods have been proposed and applied to many devices. In the last decade, monaural speech enhancement has made tremendous progress with the advent and development of deep learning, and performance using such methods has been greatly improved relative to traditional methods. This survey paper first provides a comprehensive overview of traditional and deep-learning methods for monaural speech enhancement in the frequency domain. The fundamental assumptions of each approach are then summarized and analyzed to clarify their limitations and advantages. A comprehensive evaluation of some typical methods was conducted using the WSJ + Deep Noise Suppression (DNS) challenge and Voice Bank + DEMAND datasets to give an intuitive and unified comparison. The benefits of monaural speech enhancement methods using objective metrics relevant for normal-hearing and hearing-impaired listeners were evaluated. The objective test results showed that compression of the input features was important for simulated normal-hearing listeners but not for simulated hearing-impaired listeners. Potential future research and development topics in monaural speech enhancement are suggested.
Affiliation(s)
- Chengshi Zheng
- Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
- Huiyong Zhang
- Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
- Wenzhe Liu
- Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
- Xiaoxue Luo
- Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
- Andong Li
- Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
- Xiaodong Li
- Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
- Brian C. J. Moore
- Cambridge Hearing Group, Department of Psychology, University of Cambridge, Cambridge, UK
12. Hao X, Zhu D, Wang X, Yang L, Zeng H. A Speech Enhancement Algorithm for Speech Reconstruction Based on Laser Speckle Images. Sensors (Basel) 2022; 23:330. PMID: 36616925; PMCID: PMC9823416; DOI: 10.3390/s23010330.
Abstract
In the optical system for reconstructing speech signals based on laser speckle images, resonance between the sound source and nearby objects leads to a frequency response problem, which seriously affects the accuracy of the reconstructed speech. In this paper, we propose a speech enhancement algorithm to reduce this frequency response. The results show that, after using the speech enhancement algorithm, the frequency-spectrum correlation coefficient between the reconstructed and original sinusoidal signals improves by up to 82.45%, and that for real speech signals improves by up to 56.40%. This proves that the speech enhancement algorithm is a valuable tool for solving the frequency response problem and improving the accuracy of reconstructed speech.
Affiliation(s)
- Xueying Hao
- Graduate Department, Wuhan Research Institute of Posts and Telecommunications, Wuhan 430074, China
- Dali Zhu
- Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China
- School of Cyber Security, University of Chinese Academy of Sciences, Beijing 100049, China
- Xianlan Wang
- Graduate Department, Wuhan Research Institute of Posts and Telecommunications, Wuhan 430074, China
- Long Yang
- Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China
- School of Cyber Security, University of Chinese Academy of Sciences, Beijing 100049, China
- Hualin Zeng
- Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China
- School of Cyber Security, University of Chinese Academy of Sciences, Beijing 100049, China
13. Liu TH, Chi JZ, Wu BL, Chen YS, Huang CH, Chu YS. Design and Implementation of Machine Tool Life Inspection System Based on Sound Sensing. Sensors (Basel) 2022; 23:284. PMID: 36616882; PMCID: PMC9823646; DOI: 10.3390/s23010284.
Abstract
The main causes of damage to industrial machinery are aging, corrosion, and the wear of parts, which affect the accuracy of machinery and product precision. Identifying problems early and predicting a machine's life cycle for early maintenance can avoid costly plant failures. Compared with other sensing and monitoring instruments, sound sensors are inexpensive, portable, and require less computational data. This paper proposes a machine tool life cycle model with noise reduction. The life cycle model uses Mel-Frequency Cepstral Coefficients (MFCC) to extract audio features and a Deep Neural Network (DNN) to learn the relationship between audio features and life cycle, determining the audio signal corresponding to the degree of aging. The noise reduction model simulates the actual environment by adding noise, extracts features using Power-Normalized Cepstral Coefficients (PNCC), and designs a mask as the DNN's learning target to eliminate the effect of noise. The noise reduction model improves Short-Time Objective Intelligibility (STOI) by 6.8% and Perceptual Evaluation of Speech Quality (PESQ) by 3.9%. The life cycle model's accuracy before denoising is 76%; after adding the noise reduction system, it increases to 80%.
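A minimal sketch of the feature side of such a life-cycle model is shown below: clip-level MFCC statistics feeding a small dense network. The layer sizes, the number of aging stages, and the use of librosa are assumptions made for illustration, not the authors' configuration.

```python
import numpy as np
import librosa
import torch.nn as nn

sr = 16000
y = np.random.randn(2 * sr).astype(np.float32)          # stand-in machine-sound clip
mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13)      # (13, n_frames)
feat = np.concatenate([mfcc.mean(axis=1), mfcc.std(axis=1)])  # 26-dim clip vector

# small DNN mapping the clip-level feature to, e.g., four hypothetical aging stages
dnn = nn.Sequential(nn.Linear(26, 64), nn.ReLU(), nn.Linear(64, 4))
```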
Affiliation(s)
- Tsung-Hsien Liu
- Communications Engineering Department, National Chung Cheng University, Chiayi 62102, Taiwan
- Jun-Zhe Chi
- Electrical Engineering Department, National Chung Cheng University, Chiayi 62102, Taiwan
- Bo-Lin Wu
- Electrical Engineering Department, National Chung Cheng University, Chiayi 62102, Taiwan
- Yee-Shao Chen
- Electrical Engineering Department, National Chung Cheng University, Chiayi 62102, Taiwan
- Chung-Hsun Huang
- Electrical Engineering Department, National Chung Cheng University, Chiayi 62102, Taiwan
- Yuan-Sun Chu
- Electrical Engineering Department, National Chung Cheng University, Chiayi 62102, Taiwan
14. Wang H, Zhang X, Wang D. Fusing Bone-Conduction and Air-Conduction Sensors for Complex-Domain Speech Enhancement. IEEE/ACM Trans Audio Speech Lang Process 2022; 30:3134-3143. PMID: 37124143; PMCID: PMC10147322; DOI: 10.1109/taslp.2022.3209943.
Abstract
Speech enhancement aims to improve the listening quality and intelligibility of noisy speech in adverse environments. It proves to be challenging to perform speech enhancement in very low signal-to-noise ratio (SNR) conditions. Conventional speech enhancement utilizes air-conduction (AC) microphones, which are sensitive to background noise but capable of capturing full-band signals. On the other hand, bone-conduction (BC) sensors are unaffected by acoustic noise, but the recorded speech has limited bandwidth. This study proposes an attention-based fusion method to combine the strengths of AC and BC signals and perform complex spectral mapping for speech enhancement. Experiments on the EMSB dataset demonstrate that the proposed approach effectively leverages the advantages of AC and BC sensors and outperforms a recent time-domain baseline in all conditions. We also show that the sensor fusion method is superior to its single-sensor counterparts, especially in low-SNR conditions. As the amount of BC data is very limited, we additionally propose a semi-supervised technique that utilizes both parallel and non-parallel recordings of AC and BC speech. With additional AC speech from the AISHELL-1 dataset, we achieve performance similar to supervised learning with only 50% of the parallel data.
Affiliation(s)
- Heming Wang
- Department of Computer Science and Engineering, The Ohio State University, OH 43210 USA
- Xueliang Zhang
- Department of Computer Science, Inner Mongolia University, Hohhot 010021, China
- DeLiang Wang
- Department of Computer Science and Engineering and the Center for Cognitive and Brain Sciences, The Ohio State University, Columbus, OH 43210 USA
15. Lee GW, Kim HK. Two-Step Joint Optimization with Auxiliary Loss Function for Noise-Robust Speech Recognition. Sensors (Basel) 2022; 22:5381. PMID: 35891070; PMCID: PMC9324918; DOI: 10.3390/s22145381.
Abstract
In this paper, a new two-step joint optimization approach based on the asynchronous subregion optimization method is proposed for training a pipeline model composed of two different models. The first step of the proposed joint optimization approach trains the front-end model only, and the second step trains all the parameters of the combined model together. In the asynchronous subregion optimization method, the first step supports only the goal of the front-end model; in contrast, the first step of the proposed approach works with a new loss function that makes the front-end model support the goal of the back-end model. The proposed optimization approach was applied here to a pipeline composed of a deep complex convolutional recurrent network (DCCRN)-based speech enhancement model and a conformer-transducer-based ASR model as the front-end and back-end, respectively. The performance of the proposed two-step joint optimization approach was then evaluated on the LibriSpeech automatic speech recognition (ASR) corpus in noisy environments by measuring the character error rate (CER) and word error rate (WER). In addition, an ablation study was carried out to examine the effectiveness of the proposed optimization approach on each of the processing blocks in the conformer-transducer ASR model. The ablation study showed that the conformer-transducer-based ASR model with the joint network trained only by the proposed optimization approach achieved the lowest average CER and WER. Moreover, the proposed optimization approach reduced the average CER and WER on the Test-Noisy dataset under matched noise conditions by 0.30% and 0.48%, respectively, compared to the separate optimization of speech enhancement and ASR. Compared to the conventional two-step joint optimization approach, the proposed approach provided average CER and WER reductions of 0.22% and 0.31%, respectively. Under mismatched noise conditions, the proposed approach achieved an average CER and WER lower than those of the conventional optimization approach by 0.32% and 0.43%, respectively.
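Schematically, the two-step schedule can be written as below: step one updates only the speech enhancement (SE) front-end with a loss that also reflects the back-end's goal, and step two fine-tunes the whole pipeline. The models, loss callables, and epoch counts are stand-ins, not the paper's configuration.

```python
import torch

def two_step_joint_training(se, asr, loader, aux_loss, asr_loss, epochs=(5, 10)):
    """Step 1: train the SE front-end alone with an ASR-aware auxiliary loss.
    Step 2: unfreeze everything and train the pipeline jointly."""
    opt1 = torch.optim.Adam(se.parameters(), lr=1e-3)
    for _ in range(epochs[0]):                           # step 1: front-end only
        for noisy, clean, text in loader:
            enhanced = se(noisy)
            loss = aux_loss(enhanced, clean, asr, text)  # SE loss + ASR-aware term
            opt1.zero_grad(); loss.backward(); opt1.step()

    params = list(se.parameters()) + list(asr.parameters())
    opt2 = torch.optim.Adam(params, lr=1e-4)
    for _ in range(epochs[1]):                           # step 2: joint fine-tuning
        for noisy, clean, text in loader:
            loss = asr_loss(asr(se(noisy)), text)
            opt2.zero_grad(); loss.backward(); opt2.step()
```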
Affiliation(s)
- Geon Woo Lee
- AI Graduate School, Gwangju Institute of Science and Technology, Gwangju 61005, Korea
- Hong Kook Kim
- AI Graduate School, Gwangju Institute of Science and Technology, Gwangju 61005, Korea
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Korea
16. Tao T, Zheng H, Yang J, Guo Z, Zhang Y, Ao J, Chen Y, Lin W, Tan X. Sound Localization and Speech Enhancement Algorithm Based on Dual-Microphone. Sensors (Basel) 2022; 22:715. PMID: 35161469; PMCID: PMC8840739; DOI: 10.3390/s22030715.
Abstract
To reduce the complexity and cost of the microphone array, this paper proposes a dual-microphone-based sound localization and speech enhancement algorithm. Based on the time delay estimation of the signals received by the two microphones, the paper combines energy-difference estimation and controllable beam response power to compute the 3D coordinates of the acoustic source and realize dual-microphone sound localization. Based on the azimuth angle of the acoustic source and the analysis of the independent quantity of the speech signal, separation of the speaker's signal is realized. On this basis, post-Wiener filtering is used to amplify and suppress the voice signal of the speaker, which helps to achieve speech enhancement. Experimental results show that the dual-microphone sound localization algorithm proposed in this paper can accurately identify the sound location, and the speech enhancement algorithm is more robust and adaptable than the original algorithm.
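The abstract does not spell out the time-delay estimator; a common choice for dual-microphone delay estimation, shown purely for illustration, is GCC-PHAT:

```python
import numpy as np

def gcc_phat(x1, x2, fs, max_tau=None):
    """Time-delay estimate between two microphone signals via GCC-PHAT: the
    cross-spectrum is whitened to unit magnitude, and the peak of its inverse
    FFT gives the delay."""
    n = len(x1) + len(x2)
    X1, X2 = np.fft.rfft(x1, n), np.fft.rfft(x2, n)
    R = X1 * np.conj(X2)
    cc = np.fft.irfft(R / (np.abs(R) + 1e-12), n)
    max_shift = n // 2 if max_tau is None else min(int(fs * max_tau), n // 2)
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    return (np.argmax(np.abs(cc)) - max_shift) / fs    # delay in seconds

tau = gcc_phat(np.random.randn(4096), np.random.randn(4096), fs=16000)
```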
17. Ali MN, Falavigna D, Brutti A. Time-Domain Joint Training Strategies of Speech Enhancement and Intent Classification Neural Models. Sensors (Basel) 2022; 22:374. PMID: 35009917; DOI: 10.3390/s22010374.
Abstract
Robustness against background noise and reverberation is essential for many real-world speech-based applications. One way to achieve this robustness is to employ a speech enhancement front-end that, independently of the back-end, removes environmental perturbations from the target speech signal. However, although the enhancement front-end typically increases speech quality from an intelligibility perspective, it tends to introduce distortions that deteriorate the performance of subsequent processing modules. In this paper, we investigate strategies for jointly training neural models for both speech enhancement and the back-end, optimizing a combined loss function. In this way, the enhancement front-end is guided by the back-end to provide more effective enhancement. Differently from typical state-of-the-art approaches operating on spectral features or neural embeddings, we operate in the time domain, processing raw waveforms in both components. As the application scenario, we consider intent classification in noisy environments. In particular, the front-end speech enhancement module is based on Wave-U-Net, while the intent classifier is implemented as a temporal convolutional network. Exhaustive experiments are reported on versions of the Fluent Speech Commands corpus contaminated with noises from the Microsoft Scalable Noisy Speech Dataset, providing insight into the most promising training approaches.
18. Kang Y, Zheng N, Meng Q. Deep Learning-Based Speech Enhancement With a Loss Trading Off the Speech Distortion and the Noise Residue for Cochlear Implants. Front Med (Lausanne) 2021; 8:740123. PMID: 34820392; PMCID: PMC8606413; DOI: 10.3389/fmed.2021.740123.
Abstract
The cochlea plays a key role in the transmission from acoustic vibration to the neural stimulation upon which the brain perceives sound. A cochlear implant (CI) is an auditory prosthesis that replaces damaged cochlear hair cells to achieve acoustic-to-neural conversion. However, the CI is a very coarse bionic imitation of the normal cochlea. The highly resolved time-frequency-intensity information transmitted by the normal cochlea, which is vital to high-quality auditory perception such as speech perception in challenging environments, cannot be guaranteed by CIs. Although CI recipients with state-of-the-art commercial CI devices achieve good speech perception in quiet backgrounds, they usually suffer from poor speech perception in noisy environments. Therefore, noise suppression or speech enhancement (SE) is one of the most important technologies for CIs. In this study, we introduce recent progress in deep learning (DL), mostly neural network (NN)-based SE front ends for CIs, and discuss how the hearing properties of CI recipients can be utilized to optimize DL-based SE. In particular, different loss functions are introduced to supervise the NN training, and a set of objective and subjective experiments is presented. The results verify that CI recipients are more sensitive to residual noise than to SE-induced speech distortion, which has been common knowledge in CI research. Furthermore, speech reception threshold (SRT) in noise tests demonstrate that the intelligibility of the denoised speech can be significantly improved when the NN is trained with a loss function biased toward more noise suppression rather than one paying equal attention to noise residue and speech distortion.
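The distortion/residue trade-off described can be expressed as a weighted sum over a time-frequency mask; in the sketch below, alpha > 0.5 biases training toward noise suppression, in the spirit of the paper's finding. The exact loss used by the authors may differ.

```python
import torch

def tradeoff_loss(mask, mag_clean, mag_noise, alpha=0.7):
    """Weighted loss over a T-F mask: speech distortion is the clean energy the
    mask removes; noise residue is the noise energy it lets through. alpha > 0.5
    weights noise suppression more heavily than distortion."""
    speech_distortion = ((1 - mask) * mag_clean).pow(2).mean()
    noise_residue = (mask * mag_noise).pow(2).mean()
    return (1 - alpha) * speech_distortion + alpha * noise_residue

loss = tradeoff_loss(torch.rand(4, 100, 257), torch.rand(4, 100, 257), torch.rand(4, 100, 257))
```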
Affiliation(s)
- Yuyong Kang
- Guangdong Key Laboratory of Intelligent Information Processing, College of Electronics and Information Engineering, Shenzhen University, Shenzhen, China
- Nengheng Zheng
- Guangdong Key Laboratory of Intelligent Information Processing, College of Electronics and Information Engineering, Shenzhen University, Shenzhen, China
- Pengcheng Laboratory, Shenzhen, China
- Qinglin Meng
- Acoustics Laboratory, School of Physics and Optoelectronics, South China University of Technology, Guangzhou, China
19. Gnanamanickam J, Natarajan Y, K R SP. A Hybrid Speech Enhancement Algorithm for Voice Assistance Application. Sensors (Basel) 2021; 21:7025. PMID: 34770332; DOI: 10.3390/s21217025.
Abstract
In recent years, speech recognition technology has become increasingly common. Speech quality and intelligibility are critical for the convenience and accuracy of information transmission in speech recognition. The speech processing systems used to converse or store speech are usually designed for an environment without any background noise. However, in a real-world atmosphere, background interference in the form of background noise and channel noise drastically reduces the performance of speech recognition systems, resulting in imprecise information transfer and exhausting the listener. When a communication system's input or output signals are affected by noise, speech enhancement techniques try to improve its performance. To ensure the correctness of the text produced from speech, it is necessary to reduce the external noise in the speech audio. Reducing the external noise in audio is difficult, as the speech can consist of single, continuous, or spontaneous words. In automatic speech recognition, various typical speech enhancement algorithms are available and have gained considerable attention; however, these algorithms work well only on simple and continuous audio signals. Thus, in this study, a hybridized speech recognition algorithm to enhance speech recognition accuracy is proposed. Non-linear spectral subtraction, a well-known speech enhancement algorithm, is optimized with the Hidden Markov Model and tested with 6660 medical speech transcription audio files and 1440 Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS) audio files. The performance of the proposed model is compared with those of various typical speech enhancement algorithms, such as the iterative signal enhancement algorithm, subspace-based speech enhancement, and non-linear spectral subtraction. The proposed cascaded hybrid algorithm achieves a minimum word error rate of 9.5% and 7.6% for medical speech and RAVDESS speech, respectively. Cascading the speech enhancement and speech-to-text conversion architectures results in higher accuracy for enhanced speech recognition. The evaluation results support incorporating the proposed method into real-time medical automatic speech recognition applications, where the complexity of the terms involved is high.
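For reference, the spectral-subtraction family at the core of the hybrid works frame by frame roughly as below; the over-subtraction factor, the spectral floor, and the omission of the HMM stage are simplifications, not the authors' exact algorithm.

```python
import numpy as np

def spectral_subtraction(noisy_frame, noise_mag, over_sub=2.0, floor=0.02):
    """Magnitude-domain spectral subtraction on one complex STFT frame:
    subtract an over-estimated noise magnitude, clamp to a spectral floor
    (the nonlinear step), and keep the noisy phase."""
    mag, phase = np.abs(noisy_frame), np.angle(noisy_frame)
    cleaned = mag - over_sub * noise_mag
    cleaned = np.maximum(cleaned, floor * mag)   # floor limits musical noise
    return cleaned * np.exp(1j * phase)

frame = np.fft.rfft(np.random.randn(512))
out = spectral_subtraction(frame, noise_mag=0.1 * np.ones(len(frame)))
```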
20. Kuruvila I, Muncke J, Fischer E, Hoppe U. Extracting the Auditory Attention in a Dual-Speaker Scenario From EEG Using a Joint CNN-LSTM Model. Front Physiol 2021; 12:700655. PMID: 34408661; PMCID: PMC8365753; DOI: 10.3389/fphys.2021.700655.
Abstract
The human brain performs remarkably well in segregating a particular speaker from interfering ones in a multispeaker scenario. We can quantitatively evaluate this segregation capability by modeling the relationship between the speech signals present in an auditory scene and the listener's cortical signals measured using electroencephalography (EEG). This has opened up avenues to integrate neuro-feedback into hearing aids, where the device can infer the user's attention and enhance the attended speaker. Commonly used algorithms to infer auditory attention are based on linear systems theory, where cues such as speech envelopes are mapped onto the EEG signals. Here, we present a joint convolutional neural network (CNN)-long short-term memory (LSTM) model to infer auditory attention. Our joint CNN-LSTM model takes the EEG signals and the spectrogram of the multiple speakers as inputs and classifies the attention to one of the speakers. We evaluated the reliability of our network using three different datasets comprising 61 subjects, where each subject undertook a dual-speaker experiment. The three datasets analyzed corresponded to speech stimuli presented in three different languages, namely German, Danish, and Dutch. Using the proposed joint CNN-LSTM model, we obtained a median decoding accuracy of 77.2% at a trial duration of 3 s. Furthermore, we evaluated the amount of sparsity that the model can tolerate by means of magnitude pruning and found a tolerance of up to 50% sparsity without substantial loss of decoding accuracy.
Affiliation(s)
- Ivine Kuruvila
- Department of Audiology, ENT-Clinic, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany
- Jan Muncke
- Department of Audiology, ENT-Clinic, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany
- Ulrich Hoppe
- Department of Audiology, ENT-Clinic, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany
21. Chu K, Collins L, Mainsah B. A Causal Deep Learning Framework for Classifying Phonemes in Cochlear Implants. Proc IEEE Int Conf Acoust Speech Signal Process 2021; 2021:6498-6502. PMID: 34512195; PMCID: PMC8425961; DOI: 10.1109/icassp39728.2021.9413986.
Abstract
Speech intelligibility in cochlear implant (CI) users degrades considerably in listening environments with reverberation and noise. Previous research in automatic speech recognition (ASR) has shown that phoneme-based speech enhancement algorithms improve ASR system performance in reverberant environments as compared to a global model. However, phoneme-specific speech processing has not yet been implemented in CIs. In this paper, we propose a causal deep learning framework for classifying phonemes using features extracted at the time-frequency resolution of a CI processor. We trained and tested long short-term memory networks to classify phonemes and manner of articulation in anechoic and reverberant conditions. The results showed that CI-inspired features provide slightly higher levels of performance than traditional ASR features. To the best of our knowledge, this study is the first to provide a classification framework with the potential to categorize phonetic units in real-time in a CI.
Affiliation(s)
- Kevin Chu
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, USA
- Leslie Collins
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, USA
- Boyla Mainsah
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, USA
22. Tan K, Wang D.
Abstract
The use of deep neural networks (DNNs) has dramatically elevated the performance of speech enhancement over the last decade. However, achieving strong enhancement performance typically requires a large DNN, which is both memory- and computation-intensive, making it difficult to deploy such speech enhancement systems on devices with limited hardware resources or in applications with strict latency requirements. In this study, we propose two compression pipelines to reduce the model size for DNN-based speech enhancement, which incorporate three different techniques: sparse regularization, iterative pruning, and clustering-based quantization. We systematically investigate these techniques and evaluate the proposed compression pipelines. Experimental results demonstrate that our approach reduces the sizes of four different models by large margins without significantly sacrificing their enhancement performance. In addition, we find that the proposed approach performs well on speaker separation, which further demonstrates its effectiveness for compressing speech separation models.
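The pruning and clustering-based quantization steps can be sketched in a few lines of NumPy; the threshold, cluster count, and the retraining that normally accompanies iterative pruning are omitted here.

```python
import numpy as np

def magnitude_prune(w, sparsity=0.5):
    """Zero the smallest-magnitude weights (iterated with retraining in practice)."""
    thresh = np.quantile(np.abs(w), sparsity)
    return np.where(np.abs(w) < thresh, 0.0, w)

def kmeans_quantize(w, n_clusters=16, iters=20):
    """Clustering-based quantization: nonzero weights share k centroid values,
    so each weight is stored as a small cluster index plus a codebook."""
    nz = w[w != 0]
    centroids = np.linspace(nz.min(), nz.max(), n_clusters)
    for _ in range(iters):
        assign = np.argmin(np.abs(nz[:, None] - centroids[None, :]), axis=1)
        for k in range(n_clusters):
            if np.any(assign == k):
                centroids[k] = nz[assign == k].mean()
    wq = w.copy()
    wq[w != 0] = centroids[assign]
    return wq

wq = kmeans_quantize(magnitude_prune(np.random.randn(1024)))
```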
Affiliation(s)
- Ke Tan
- Department of Computer Science and Engineering, The Ohio State University, Columbus, OH, 43210-1277 USA
- DeLiang Wang
- Department of Computer Science and Engineering and the Center for Cognitive and Brain Sciences, The Ohio State University, Columbus, OH 43210-1277, USA
23. Zhou Y, Wang H, Chu Y, Liu H. A Robust Dual-Microphone Generalized Sidelobe Canceller Using a Bone-Conduction Sensor for Speech Enhancement. Sensors (Basel) 2021; 21:1878. PMID: 33800201; PMCID: PMC7962448; DOI: 10.3390/s21051878.
Abstract
The use of multiple spatially distributed microphones allows spatial filtering to be performed along with conventional temporal filtering, which can better reject interference signals, leading to an overall improvement in speech quality. In this paper, we propose a novel dual-microphone generalized sidelobe canceller (GSC) algorithm assisted by a bone-conduction (BC) sensor for speech enhancement, named the BC-assisted GSC (BCA-GSC) algorithm. The BC sensor is relatively insensitive to ambient noise compared to a conventional air-conduction (AC) microphone. Hence, BC speech can be analyzed to generate very accurate voice activity detection (VAD), even in a high-noise environment. The proposed algorithm incorporates the VAD information obtained from the BC speech into the adaptive blocking matrix (ABM) and the adaptive noise canceller (ANC) in the GSC. By using the VAD to control the ABM and combining the VAD with the signal-to-interference ratio (SIR) to control the ANC, the proposed method suppresses interference and improves the overall performance of the GSC significantly. Experiments verify that the proposed GSC system not only improves speech quality remarkably but also boosts speech intelligibility.
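In schematic form, a VAD-gated dual-microphone GSC update looks as follows, with a fixed beamformer, a blocking branch, and an NLMS noise canceller adapted only when the BC-derived VAD flags speech absence; this scalar sketch ignores the SIR-based control and the filter lengths of the actual BCA-GSC.

```python
import numpy as np

def gsc_step(x_main, x_ref, w_anc, vad, mu=0.01):
    """One simplified dual-mic GSC update: fixed beam = mean of the two mics,
    blocking branch = their difference, NLMS canceller adapted only when the
    bone-conduction VAD reports no target speech (avoiding target leakage)."""
    fixed = 0.5 * (x_main + x_ref)        # fixed beamformer output
    blocked = x_main - x_ref              # blocking branch (target suppressed)
    y = fixed - w_anc * blocked           # noise-cancelled output
    if not vad:                           # adapt only in noise-only periods
        w_anc += mu * y * blocked / (blocked**2 + 1e-8)
    return y, w_anc

w = 0.0
for xm, xr in zip(np.random.randn(100), np.random.randn(100)):
    y, w = gsc_step(xm, xr, w, vad=False)
```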
Affiliation(s)
- Yi Zhou
- School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
- Haiping Wang
- School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
- Yijing Chu
- State Key Laboratory of Subtropical Building Science, South China University of Technology, Guangzhou 510641, China
- Hongqing Liu
- School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
24. Li L, Rehr R, Bruns P, Gerkmann T, Röder B. A Survey on Probabilistic Models in Human Perception and Machines. Front Robot AI 2021; 7:85. PMID: 33501252; PMCID: PMC7805657; DOI: 10.3389/frobt.2020.00085.
Abstract
Extracting information from noisy signals is of fundamental importance for both biological and artificial perceptual systems. To provide tractable solutions to this challenge, the fields of human perception and machine signal processing (SP) have developed powerful computational models, including Bayesian probabilistic models. However, little true integration between these fields exists in their applications of the probabilistic models for solving analogous problems, such as noise reduction, signal enhancement, and source separation. In this mini review, we briefly introduce and compare selective applications of probabilistic models in machine SP and human psychophysics. We focus on audio and audio-visual processing, using examples of speech enhancement, automatic speech recognition, audio-visual cue integration, source separation, and causal inference to illustrate the basic principles of the probabilistic approach. Our goal is to identify commonalities between probabilistic models addressing brain processes and those aiming at building intelligent machines. These commonalities could constitute the closest points for interdisciplinary convergence.
Affiliation(s)
- Lux Li
- Biological Psychology and Neuropsychology, University of Hamburg, Hamburg, Germany
- Robert Rehr
- Signal Processing (SP), Department of Informatics, University of Hamburg, Hamburg, Germany
| | - Patrick Bruns
- Biological Psychology and Neuropsychology, University of Hamburg, Hamburg, Germany
| | - Timo Gerkmann
- Signal Processing (SP), Department of Informatics, University of Hamburg, Hamburg, Germany
| | - Brigitte Röder
- Biological Psychology and Neuropsychology, University of Hamburg, Hamburg, Germany
| |
Collapse
|
25
|
Gößling N, Marquardt D, Doclo S. Perceptual Evaluation of Binaural MVDR-Based Algorithms to Preserve the Interaural Coherence of Diffuse Noise Fields. Trends Hear 2020; 24:2331216520919573. [PMID: 32339061 PMCID: PMC7225838 DOI: 10.1177/2331216520919573] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
Besides improving speech intelligibility in background noise, another important objective of noise reduction algorithms for binaural hearing devices is preserving the spatial impression for the listener. In this study, we evaluate the performance of several recently proposed noise reduction algorithms based on the binaural minimum-variance-distortionless-response (MVDR) beamformer, which trade off noise reduction performance against preservation of the interaural coherence (IC) of diffuse noise fields. Aiming at a perceptually optimized result, this trade-off is determined from the IC discrimination ability of the human auditory system. The algorithms are evaluated with normal-hearing participants in an anechoic scenario and a reverberant cafeteria scenario, in terms of both speech intelligibility, using a matrix sentence test, and spatial quality, using a MUlti Stimulus test with Hidden Reference and Anchor (MUSHRA). The results show that all the binaural noise reduction algorithms improve speech intelligibility compared with the unprocessed microphone signals, and that partially preserving the IC of the diffuse noise field leads to a significant improvement in perceived spatial quality over the binaural MVDR beamformer while hardly affecting speech intelligibility.
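As a rough illustration of the trade-off being evaluated, the sketch below mixes a scaled reference-microphone signal back into a binaural MVDR output per frequency bin, in the spirit of "partial noise estimation" approaches. The mixing parameter eta and the reference-channel convention are assumptions for illustration; the evaluated algorithms and their perceptually tuned trade-off parameters differ in detail.

```python
import numpy as np

def mvdr_with_ic_tradeoff(X, R_noise, d, ref=0, eta=0.2):
    """One frequency bin and one ear (sketch).

    X       : (M,) noisy microphone spectra
    R_noise : (M, M) noise covariance matrix
    d       : (M,) relative transfer function of the target
    eta     : 0 = full noise reduction; toward 1 = more of the
              unprocessed signal, preserving the noise IC
    """
    Rinv_d = np.linalg.solve(R_noise, d)
    w = Rinv_d / (d.conj() @ Rinv_d)      # MVDR: distortionless in d
    return (1.0 - eta) * (w.conj() @ X) + eta * X[ref]
```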
Collapse
Affiliation(s)
- Nico Gößling
- Department of Medical Physics and Acoustics and Cluster of Excellence Hearing4all, University of Oldenburg
| | - Daniel Marquardt
- Starkey Hearing Technologies, Eden Prairie, Minnesota, United States
| | - Simon Doclo
- Department of Medical Physics and Acoustics and Cluster of Excellence Hearing4all, University of Oldenburg
| |
Collapse
|
26
|
Kim SM. Wearable Hearing Device Spectral Enhancement Driven by Non-Negative Sparse Coding-Based Residual Noise Reduction. Sensors (Basel) 2020; 20:E5751. [PMID: 33050447 DOI: 10.3390/s20205751] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 10/02/2020] [Accepted: 10/06/2020] [Indexed: 11/17/2022]
Abstract
This paper proposes a novel technique to improve a spectral statistical filter for speech enhancement, to be applied in wearable hearing devices such as hearing aids. The proposed method is implemented considering a 32-channel uniform polyphase discrete Fourier transform filter bank, for which the overall algorithm processing delay is 8 ms in accordance with the hearing device requirements. The proposed speech enhancement technique, which exploits the concepts of both non-negative sparse coding (NNSC) and spectral statistical filtering, provides an online unified framework to overcome the problem of residual noise in spectral statistical filters under noisy environments. First, the spectral gain attenuator of the statistical Wiener filter is obtained using the a priori signal-to-noise ratio (SNR) estimated through a decision-directed approach. Next, the spectrum estimated using the Wiener spectral gain attenuator is decomposed by applying the NNSC technique to the target speech and residual noise components. These components are used to develop an NNSC-based Wiener spectral gain attenuator to achieve enhanced speech. The performance of the proposed NNSC-Wiener filter was evaluated through a perceptual evaluation of the speech quality scores under various noise conditions with SNRs ranging from -5 to 20 dB. The results indicated that the proposed NNSC-Wiener filter can outperform the conventional Wiener filter and NNSC-based speech enhancement methods at all SNRs.
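The decision-directed a priori SNR estimate and the resulting Wiener gain mentioned above follow a standard recursion; a minimal per-frame sketch is shown below. The NNSC decomposition of the filtered spectrum, which is the paper's contribution, is not reproduced here, and the smoothing constant and floor are typical values rather than the paper's.

```python
import numpy as np

def wiener_dd_frame(noisy_psd, noise_psd, xi_carry, alpha=0.98, xi_min=1e-3):
    """One frame of decision-directed Wiener filtering (per-bin arrays).

    xi_carry : G_prev**2 * gamma_prev carried over from the last frame
    """
    gamma = noisy_psd / np.maximum(noise_psd, 1e-12)     # a posteriori SNR
    xi = alpha * xi_carry + (1.0 - alpha) * np.maximum(gamma - 1.0, 0.0)
    xi = np.maximum(xi, xi_min)                          # a priori SNR
    G = xi / (1.0 + xi)                                  # Wiener gain
    return G, G**2 * gamma                               # gain, next carry
```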
Collapse
|
27
|
Shankar N, Bhat GS, Panahi IMS. Real-time single-channel deep neural network-based speech enhancement on edge devices. Interspeech 2020; 2020:3281-3285. [PMID: 33898608 PMCID: PMC8064406 DOI: 10.21437/interspeech.2020-1901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
In this paper, we present a deep neural network architecture comprising both convolutional neural network (CNN) and recurrent neural network (RNN) layers for real-time single-channel speech enhancement (SE). The proposed neural network model enhances the noisy speech magnitude spectrum frame by frame. The developed model is implemented on a smartphone (edge device) to demonstrate the real-time usability of the proposed method. Perceptual evaluation of speech quality (PESQ) and short-time objective intelligibility (STOI) test results are used to compare the proposed algorithm with previously published conventional and deep learning-based SE methods. Subjective ratings show the performance improvement of the proposed model over the other baseline SE methods.
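A minimal PyTorch sketch of this kind of CNN+RNN mask estimator follows. It is frame-causal (frequency-only convolutions plus a unidirectional GRU), matching the frame-by-frame constraint, but the layer sizes and the mask formulation are illustrative assumptions rather than the published architecture.

```python
import torch
import torch.nn as nn

class CRNNMask(nn.Module):
    """Frame-causal CNN+GRU magnitude-mask estimator (sketch)."""
    def __init__(self, n_bins=161, conv_ch=64, hidden=128):
        super().__init__()
        # convolutions act along frequency only, so no future frames
        # are touched and the model stays frame-by-frame causal
        self.conv = nn.Sequential(
            nn.Conv2d(1, conv_ch, kernel_size=(1, 3), padding=(0, 1)),
            nn.ReLU(),
            nn.Conv2d(conv_ch, 1, kernel_size=(1, 3), padding=(0, 1)),
            nn.ReLU(),
        )
        self.rnn = nn.GRU(n_bins, hidden, batch_first=True)
        self.out = nn.Linear(hidden, n_bins)

    def forward(self, mag):                    # mag: (batch, time, bins)
        h = self.conv(mag.unsqueeze(1)).squeeze(1)
        h, _ = self.rnn(h)                     # unidirectional GRU
        mask = torch.sigmoid(self.out(h))      # bounded per-bin gain
        return mask * mag                      # enhanced magnitude
```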
Collapse
Affiliation(s)
- Nikhil Shankar
- Department of Electrical and Computer Engineering, The University of Texas at Dallas, Richardson, TX-75080, USA
| | - Gautam Shreedhar Bhat
- Department of Electrical and Computer Engineering, The University of Texas at Dallas, Richardson, TX-75080, USA
| | - Issa M S Panahi
- Department of Electrical and Computer Engineering, The University of Texas at Dallas, Richardson, TX-75080, USA
| |
Collapse
|
28
|
Zhou Y, Chen Y, Ma Y, Liu H. A Real-Time Dual-Microphone Speech Enhancement Algorithm Assisted by Bone Conduction Sensor. Sensors (Basel) 2020; 20:E5050. [PMID: 32899533 PMCID: PMC7571026 DOI: 10.3390/s20185050] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Revised: 09/02/2020] [Accepted: 09/03/2020] [Indexed: 11/16/2022]
Abstract
Speech quality and intelligibility are usually impaired by interfering background noise during internet voice calls. To solve this problem in the context of wearable smart devices, this paper introduces a dual-microphone, bone-conduction (BC) sensor assisted beamformer and a simple recurrent unit (SRU)-based neural network postfilter for real-time speech enhancement. Assisted by the BC sensor, which is less sensitive to environmental noise than the regular air-conduction (AC) microphone, accurate voice activity detection (VAD) can be obtained from the BC signal and incorporated into the adaptive noise canceller (ANC) and the adaptive blocking matrix (ABM). The SRU-based postfilter consists of a recurrent neural network with a small number of parameters, which improves computational efficiency. Sub-band signal processing is designed to compress the input features of the neural network, and the scale-invariant signal-to-distortion ratio (SI-SDR) is adopted as the loss function to minimize distortion of the desired speech signal. Experimental results demonstrate that the proposed real-time speech enhancement system provides significant improvements in speech sound quality and intelligibility for all noise types and levels, compared with an AC-only beamformer with a postfiltering algorithm.
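The SI-SDR objective named above has a simple closed form; a numpy sketch follows. The negative of this value would be minimized during training.

```python
import numpy as np

def si_sdr(estimate, target, eps=1e-8):
    """Scale-invariant SDR in dB between a time-domain estimate and target."""
    estimate = estimate - estimate.mean()
    target = target - target.mean()
    # orthogonal projection of the estimate onto the target direction
    s_target = (estimate @ target) / (target @ target + eps) * target
    e_noise = estimate - s_target
    return 10.0 * np.log10((s_target @ s_target) / (e_noise @ e_noise + eps))
```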
Collapse
Affiliation(s)
- Yi Zhou
- School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China; (Y.Z.); (Y.C.)
| | - Yufan Chen
- School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China; (Y.Z.); (Y.C.)
| | - Yongbao Ma
- Suresense Technology, Chongqing 400065, China;
| | - Hongqing Liu
- School of Communication and Information Engineering, Chongqing University of Posts and Telecommunications, Chongqing 400065, China; (Y.Z.); (Y.C.)
| |
Collapse
|
29
|
Wang ZQ, Wang P, Wang D. Complex Spectral Mapping for Single- and Multi-Channel Speech Enhancement and Robust ASR. IEEE/ACM Trans Audio Speech Lang Process 2020; 28:1778-1787. [PMID: 33748326 PMCID: PMC7971156 DOI: 10.1109/taslp.2020.2998279] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
This study proposes a complex spectral mapping approach for single- and multi-channel speech enhancement, where deep neural networks (DNNs) are used to predict the real and imaginary (RI) components of the direct-path signal from noisy and reverberant ones. The proposed system contains two DNNs. The first one performs single-channel complex spectral mapping. The estimated complex spectra are used to compute a minimum variance distortionless response (MVDR) beamformer. The RI components of the beamforming results, which encode spatial information, are then combined with the RI components of the mixture to train the second DNN for multi-channel complex spectral mapping. With the estimated complex spectra, we also propose a novel method of time-varying beamforming. State-of-the-art performance is obtained on the speech enhancement and recognition tasks of the CHiME-4 corpus. More specifically, our system obtains 6.82%, 3.19%, and 2.00% word error rates (WER) on the single-, two-, and six-microphone tasks of CHiME-4, respectively, significantly surpassing the current best results of 9.15%, 3.91%, and 2.24% WER.
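A sketch of the beamforming step is given below: DNN-estimated speech spectra define speech and noise covariances at each frequency bin, from which a time-invariant MVDR is computed. The eigenvector-based steering choice and the diagonal loading are common conventions assumed here, not necessarily the paper's exact derivation.

```python
import numpy as np

def mvdr_from_estimates(Y, S_hat, ref=0):
    """One frequency bin. Y, S_hat: (frames, mics) noisy and
    DNN-estimated speech STFT coefficients."""
    N_hat = Y - S_hat                                    # noise estimate
    T, M = Y.shape
    Phi_s = np.einsum('tm,tn->mn', S_hat, S_hat.conj()) / T
    Phi_n = np.einsum('tm,tn->mn', N_hat, N_hat.conj()) / T
    # steering vector: principal eigenvector of the speech covariance,
    # normalized to the reference microphone (a common convention)
    _, vecs = np.linalg.eigh(Phi_s)
    d = vecs[:, -1]
    d = d / d[ref]
    Rinv_d = np.linalg.solve(Phi_n + 1e-6 * np.eye(M), d)
    w = Rinv_d / (d.conj() @ Rinv_d)
    return Y @ w.conj()                                  # output per frame
```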
Collapse
Affiliation(s)
- Zhong-Qiu Wang
- Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210-1277 USA
| | - Peidong Wang
- Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210-1277 USA
| | - DeLiang Wang
- Department of Computer Science and Engineering & the Center for Cognitive and Brain Sciences, The Ohio State University, Columbus, OH 43210-1277 USA
| |
Collapse
|
30
|
Abstract
Two signal-processing procedures for separating the continuously-voiced speech of competing talkers are described and evaluated. With competing sentences, each spoken on a monotone, the procedures improved the intelligibility of the target talker both for listeners with normal hearing and for listeners with moderate-to-severe hearing losses of cochlear origin. However, with intoned sentences, benefits were smaller for normal-hearing listeners and were inconsistent for impaired listeners. It is argued that smaller benefits arise with intoned sentences because harmonics of the two voices are blurred together during spectral analysis, limiting the extent to which spectral contrast can be recovered in the processed signal. This is particularly disadvantageous to impaired listeners who have reduced spectro-temporal resolution. This paper discusses other substantial problems to be overcome before the feasibility of the procedures as components of a speech-enhancement system for hearing-impaired listeners could be demonstrated.
Collapse
Affiliation(s)
- Quentin Summerfield
- MRC Institute of Hearing Research, University of Nottingham, Nottingham, England
| | - Richard J. Stubbs
- MRC Institute of Hearing Research, University of Nottingham, Nottingham, England
| |
Collapse
|
31
|
Chen Y, Chen W, Zhang P, Chen P. [Research progress of microphone array based front-end speech enhancement technology for cochlear implant]. Sheng Wu Yi Xue Gong Cheng Xue Za Zhi 2019; 36:696-704. [PMID: 31441274 PMCID: PMC10319500 DOI: 10.7507/1001-5515.201805050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Indexed: 11/03/2022]
Abstract
Microphone array based methods have gradually been applied to front-end speech enhancement and speech recognition improvement for cochlear implants in recent years. By placing several microphones at different locations in space, this approach collects multi-channel signals that carry rich spatial position and orientation information. A microphone array can also form specific beamforming patterns to enhance the desired signal and suppress ambient noise, which makes it particularly suitable for face-to-face conversation by cochlear implant users, and its application value has attracted increasing attention from researchers. In this paper, we describe the principle of the microphone array method, analyze the microphone-array-based speech enhancement technologies in the present literature, and further present the technical difficulties and development trends.
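The simplest instance of the beamforming idea surveyed in this review is the frequency-domain delay-and-sum beamformer sketched below; the geometry and steering conventions are generic textbook assumptions rather than anything specific to the reviewed systems.

```python
import numpy as np

def delay_and_sum(X, freqs, mic_pos, look_dir, c=343.0):
    """Frequency-domain delay-and-sum beamformer (sketch).

    X        : (mics, bins, frames) STFT of the array signals
    freqs    : (bins,) bin centre frequencies in Hz
    mic_pos  : (mics, 3) microphone coordinates in metres
    look_dir : (3,) unit vector toward the desired talker
    """
    delays = mic_pos @ look_dir / c                        # per-mic delays (s)
    align = np.exp(2j * np.pi * np.outer(delays, freqs))   # undo the delays
    return (align[:, :, None] * X).mean(axis=0)            # align and average
```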
Collapse
Affiliation(s)
- Yousheng Chen
- Shenzhen Institute of Information Technology, Shenzhen, Guangdong 518000, P.R.China
| | - Weifang Chen
- Shenzhen Institute of Information Technology, Shenzhen, Guangdong 518000, P.R.China
| | - Pu Zhang
- Shenzhen Institute of Information Technology, Shenzhen, Guangdong 518000, P.R.China
| | - Peipei Chen
- Shenzhen Institute of Information Technology, Shenzhen, Guangdong 518000, P.R.China
| |
Collapse
|
32
|
Chen Y, Chen Y. [Research of front-end speech enhancement and beamforming algorithm based on dual microphone for cochlear implant]. Sheng Wu Yi Xue Gong Cheng Xue Za Zhi 2019; 36:468-477. [PMID: 31232551 DOI: 10.7507/1001-5515.201810025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Speech enhancement methods based on microphone arrays use multiple microphones to record the speech signal simultaneously. Because spatial information is added, these methods can improve speech recognition for cochlear implants in noisy environments. Owing to size constraints, the number of microphones used in a cochlear implant cannot be large, which limits the design of microphone array beamforming. To balance the size limitation of the cochlear implant against the spatial orientation information available during signal acquisition, we propose a speech enhancement and beamforming algorithm based on dual thin uni-directional/omni-directional microphone pairs (TP) in this paper. Each TP microphone contains two sound tubes for signal acquisition, which increases the overall spatial orientation information. In this paper, we discuss the beamforming characteristics obtained with different gain vectors and the influence of the inter-microphone distance on beamforming, which provides valuable theoretical analysis and engineering parameters for applying dual-microphone speech enhancement technology in cochlear implants.
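To illustrate the kind of design analysis described (beamforming characteristics as a function of the gain vector and inter-microphone distance), the sketch below evaluates the directivity of a two-element endfire pair over arrival angle. The frequency, spacing, and the delay-and-subtract weights, which steer a null to the rear and yield a cardioid-like pattern, are illustrative values only.

```python
import numpy as np

def pair_response(f, d, weights, angles, c=343.0):
    """Magnitude response of a two-mic endfire pair at frequency f,
    for plane waves arriving from `angles` (0 rad = look direction)."""
    tau = d * np.cos(angles) / c                   # inter-mic delay
    v = np.stack([np.ones_like(tau),               # mic 0: reference
                  np.exp(-2j * np.pi * f * tau)])  # mic 1: delayed arrival
    return np.abs(weights @ v)

angles = np.linspace(0.0, np.pi, 181)
f, d = 1000.0, 0.012                               # Hz, metres (assumed)
# delay-and-subtract weights -> rear-facing null (cardioid-like)
w = np.array([1.0, -np.exp(-2j * np.pi * f * d / 343.0)])
pattern = pair_response(f, d, w, angles)           # response vs. angle
```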
Collapse
Affiliation(s)
- Yousheng Chen
- Shenzhen Institute of Information Technology, Shenzhen, Guangdong 518000, P.R.China
| | - Yan Chen
- Shenzhen Institute of Information Technology, Shenzhen, Guangdong 518000, P.R.China
| |
Collapse
|
33
|
Flanagan S, Zorilă TC, Stylianou Y, Moore BCJ. Speech Processing to Improve the Perception of Speech in Background Noise for Children With Auditory Processing Disorder and Typically Developing Peers. Trends Hear 2019; 22:2331216518756533. [PMID: 29441834 PMCID: PMC5815419 DOI: 10.1177/2331216518756533] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Auditory processing disorder (APD) may be diagnosed when a child has listening difficulties but has normal audiometric thresholds. For adults with normal hearing and with mild-to-moderate hearing impairment, an algorithm called spectral shaping with dynamic range compression (SSDRC) has been shown to increase the intelligibility of speech when background noise is added after the processing. Here, we assessed the effect of such processing using 8 children with APD and 10 age-matched control children. The loudness of the processed and unprocessed sentences was matched using a loudness model. The task was to repeat back sentences produced by a female speaker when presented with either speech-shaped noise (SSN) or a male competing speaker (CS) at two signal-to-background ratios (SBRs). Speech identification was significantly better with SSDRC processing than without, for both groups. The benefit of SSDRC processing was greater for the SSN than for the CS background. For the SSN, scores were similar for the two groups at both SBRs. For the CS, the APD group performed significantly more poorly than the control group. The overall improvement produced by SSDRC processing could be useful for enhancing communication in a classroom where the teacher's voice is broadcast using a wireless system.
Collapse
Affiliation(s)
- Sheila Flanagan
- Department of Experimental Psychology, University of Cambridge, UK
| | | | - Yannis Stylianou
- Toshiba Research Europe Ltd., Cambridge Research Laboratory, UK; Department of Computer Science, University of Crete, Heraklion, Greece
| | - Brian C J Moore
- Department of Experimental Psychology, University of Cambridge, UK
| |
Collapse
|
34
|
Abstract
For supervised speech enhancement, contextual information is important for accurate mask estimation or spectral mapping. However, commonly used deep neural networks (DNNs) are limited in capturing temporal contexts. To leverage long-term contexts for tracking a target speaker, we treat speech enhancement as a sequence-to-sequence mapping, and present a novel convolutional neural network (CNN) architecture for monaural speech enhancement. The key idea is to systematically aggregate contexts through dilated convolutions, which significantly expand receptive fields. The CNN model additionally incorporates gating mechanisms and residual learning. Our experimental results suggest that the proposed model generalizes well to untrained noises and untrained speakers. It consistently outperforms a DNN, a unidirectional long short-term memory (LSTM) model and a bidirectional LSTM model in terms of objective speech intelligibility and quality metrics. Moreover, the proposed model has far fewer parameters than DNN and LSTM models.
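A minimal PyTorch sketch of one dilated, gated, residual convolution block of the kind described follows; channel counts, the kernel width, and the number of stacked blocks are assumptions, not the published configuration.

```python
import torch
import torch.nn as nn

class GatedResBlock(nn.Module):
    """One dilated, gated, residual 1-D convolution block (sketch)."""
    def __init__(self, ch=64, dilation=1):
        super().__init__()
        self.filt = nn.Conv1d(ch, ch, 3, padding=dilation, dilation=dilation)
        self.gate = nn.Conv1d(ch, ch, 3, padding=dilation, dilation=dilation)

    def forward(self, x):                     # x: (batch, ch, time)
        h = torch.tanh(self.filt(x)) * torch.sigmoid(self.gate(x))
        return x + h                          # residual connection

# Stacking blocks with dilations 1, 2, 4, ... grows the receptive field
# exponentially with depth, which is how long-term context is aggregated
# at modest parameter cost.
net = nn.Sequential(*[GatedResBlock(dilation=2**i) for i in range(6)])
```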
Collapse
Affiliation(s)
- Ke Tan
- Department of Computer Science and Engineering, The Ohio State University, Columbus, OH, 43210-1277 USA
| | - Jitong Chen
- Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210-1277, USA . He is now with Silicon Valley AI Lab at Baidu Research, 1195 Bordeaux Drive, Sunnyvale, CA 94089, USA
| | - DeLiang Wang
- Department of Computer Science and Engineering and the Center for Cognitive and Brain Sciences, The Ohio State University, Columbus, OH 43210-1277, USA
| |
Collapse
|
35
|
Abstract
Speech separation is the task of separating target speech from background interference. Traditionally, speech separation is studied as a signal processing problem. A more recent approach formulates speech separation as a supervised learning problem, where the discriminative patterns of speech, speakers, and background noise are learned from training data. Over the past decade, many supervised separation algorithms have been put forward. In particular, the recent introduction of deep learning to supervised speech separation has dramatically accelerated progress and boosted separation performance. This paper provides a comprehensive overview of the research on deep learning based supervised speech separation in the last several years. We first introduce the background of speech separation and the formulation of supervised separation. Then, we discuss three main components of supervised separation: learning machines, training targets, and acoustic features. Much of the overview is on separation algorithms where we review monaural methods, including speech enhancement (speech-nonspeech separation), speaker separation (multitalker separation), and speech dereverberation, as well as multimicrophone techniques. The important issue of generalization, unique to supervised learning, is discussed. This overview provides a historical perspective on how advances are made. In addition, we discuss a number of conceptual issues, including what constitutes the target source.
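Among the training targets this overview covers, the ideal ratio mask (IRM) is a representative example; a short sketch of its standard form follows (the exponent beta = 0.5 is a common choice).

```python
import numpy as np

def ideal_ratio_mask(speech_mag, noise_mag, beta=0.5):
    """Ideal ratio mask: per-TF-unit fraction of speech energy,
    computed from the premixed speech and noise magnitudes."""
    s2, n2 = speech_mag**2, noise_mag**2
    return (s2 / (s2 + n2 + 1e-12)) ** beta
```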
Collapse
Affiliation(s)
- DeLiang Wang
- Department of Computer Science and Engineering and the Center for Cognitive and Brain Sciences, The Ohio State University, Columbus, OH 43210 USA, and also with the Center of Intelligent Acoustics and Immersive Communications, Northwestern Polytechnical University, Xi'an 710072, China
| | - Jitong Chen
- Department of Computer Science and Engineering, The Ohio State University, Columbus, OH 43210 USA. He is now with Silicon Valley AI Lab, Baidu Research, Sunnyvale, CA 94089 USA
| |
Collapse
|
36
|
Abstract
State-of-the-art noise power spectral density (PSD) estimation techniques for speech enhancement utilize the so-called speech presence probability (SPP). However, in highly non-stationary environments, SPP-based techniques can still suffer from inaccurate estimation, leading to a significant amount of residual noise or speech distortion. In this paper, we propose to improve speech enhancement by deploying a bone-conduction (BC) sensor, which is known to be relatively insensitive to environmental noise compared with the regular air-conduction (AC) microphone. A strategy is suggested to utilize the BC sensor characteristics to assist the AC microphone in better SPP-based noise estimation. To our knowledge, no previous work has incorporated the BC sensor in this noise-estimation role; consequently, the proposed strategy can be combined with other BC sensor assisted speech enhancement techniques. We show the feasibility and potential of the proposed method for improving enhanced speech quality through both objective and subjective tests.
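For orientation, a generic SPP-weighted noise PSD update is sketched below, together with one simplified, assumed way a BC-derived VAD could bias the AC-based speech presence probability. Both functions are illustrative; the second in particular is a hypothetical fusion rule, not the paper's published strategy.

```python
import numpy as np

def spp_noise_update(noisy_psd, noise_psd, p_speech, alpha=0.8):
    """Recursive noise PSD update weighted by speech presence probability:
    track the periodogram where speech is likely absent, hold the old
    estimate where it is likely present."""
    target = p_speech * noise_psd + (1.0 - p_speech) * noisy_psd
    return alpha * noise_psd + (1.0 - alpha) * target

def bc_biased_spp(p_speech_ac, bc_vad, lo=0.1, hi=0.5):
    """Assumed fusion rule: let the noise-robust BC VAD veto or support
    the AC-microphone SPP (thresholds lo and hi are hypothetical)."""
    return np.where(bc_vad, np.maximum(p_speech_ac, hi),
                    np.minimum(p_speech_ac, lo))
```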
Collapse
Affiliation(s)
- Ching-Hua Lee
- Department of Electrical and Computer Engineering, University of California, San Diego
| | - Bhaskar D Rao
- Department of Electrical and Computer Engineering, University of California, San Diego
| | - Harinath Garudadri
- Department of Electrical and Computer Engineering, University of California, San Diego
| |
Collapse
|
37
|
Shankar N, Kucuk A, Reddy CKA, Bhat GS, Panahi IMS. Influence of MVDR beamformer on a Speech Enhancement based Smartphone application for Hearing Aids. Annu Int Conf IEEE Eng Med Biol Soc 2018; 2018:417-420. [PMID: 30440422 PMCID: PMC7398114 DOI: 10.1109/embc.2018.8512369] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
This paper presents the minimum variance distortionless response (MVDR) beamformer combined with a speech enhancement (SE) gain function as a real-time smartphone application that serves as an assistive device for hearing aids. Beamforming techniques have been shown to improve the signal-to-noise ratio (SNR) in noisy conditions. In the proposed algorithm, the MVDR beamformer is used as an SNR booster for the SE method. The proposed SE gain is based on the log-spectral amplitude estimator to improve speech quality in the presence of different background noises. Objective evaluation and intelligibility measures support the theoretical analysis and show significant improvements of the proposed method in comparison with existing methods. Subjective test results show the effectiveness of the application in real-world noisy conditions at SNR levels of -5 dB, 0 dB, and 5 dB.
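The log-spectral amplitude gain used as the SE stage has the classical Ephraim-Malah closed form; a sketch follows, applied per time-frequency bin to the beamformer output.

```python
import numpy as np
from scipy.special import exp1

def lsa_gain(xi, gamma):
    """Log-spectral amplitude gain: G = xi/(1+xi) * exp(E1(v)/2),
    with v = xi * gamma / (1 + xi); xi is the a priori SNR and
    gamma the a posteriori SNR (per-bin arrays)."""
    v = np.maximum(xi * gamma / (1.0 + xi), 1e-10)
    return xi / (1.0 + xi) * np.exp(0.5 * exp1(v))

# usage: enhanced_spectrum = lsa_gain(xi, gamma) * beamformer_output
```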
Collapse
|
38
|
Abstract
This study examined the perceptual consequences of three speech enhancement schemes based on multiband nonlinear expansion of temporal envelope fluctuations between 10 and 20 Hz: (a) "idealized" envelope expansion of the speech before the addition of stationary background noise, (b) envelope expansion of the noisy speech, and (c) envelope expansion of only those time-frequency segments of the noisy speech that exhibited signal-to-noise ratios (SNRs) above -10 dB. Linear processing was considered as a reference condition. The performance was evaluated by measuring consonant recognition and consonant confusions in normal-hearing and hearing-impaired listeners using consonant-vowel nonsense syllables presented in background noise. Envelope expansion of the noisy speech showed no significant effect on the overall consonant recognition performance relative to linear processing. In contrast, SNR-based envelope expansion of the noisy speech improved the overall consonant recognition performance equivalent to a 1- to 2-dB improvement in SNR, mainly by improving the recognition of some of the stop consonants. The effect of the SNR-based envelope expansion was similar to the effect of envelope-expanding the clean speech before the addition of noise.
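As a rough sketch of envelope expansion in one analysis band, the code below boosts the 10 to 20 Hz envelope fluctuations extracted with a Hilbert transform and re-imposes the modified envelope on the band signal. The filter orders, the additive (rather than strictly nonlinear) expansion rule, and the gain are assumptions for illustration, not the study's processing.

```python
import numpy as np
from scipy.signal import butter, sosfilt, hilbert

def expand_band(x, fs, band, mod_band=(10.0, 20.0), gain=2.0):
    """Expand 10-20 Hz envelope fluctuations of one frequency band."""
    sos = butter(4, band, btype='bandpass', fs=fs, output='sos')
    xb = sosfilt(sos, x)                      # band-limited signal
    env = np.abs(hilbert(xb))                 # Hilbert envelope
    sos_m = butter(2, mod_band, btype='bandpass', fs=fs, output='sos')
    fluct = sosfilt(sos_m, env)               # 10-20 Hz envelope component
    env_new = np.maximum(env + (gain - 1.0) * fluct, 0.0)
    return xb * env_new / np.maximum(env, 1e-8)   # re-impose envelope

# usage: y = expand_band(x, 16000, (1000.0, 2000.0))
```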
Collapse
Affiliation(s)
- Alan Wiinberg
- Hearing Systems Group, Department of Electrical Engineering, Technical University of Denmark, Lyngby, Denmark
| | - Johannes Zaar
- Hearing Systems Group, Department of Electrical Engineering, Technical University of Denmark, Lyngby, Denmark
| | - Torsten Dau
- Hearing Systems Group, Department of Electrical Engineering, Technical University of Denmark, Lyngby, Denmark
| |
Collapse
|
39
|
Chen YY. Speech Enhancement of Mobile Devices Based on the Integration of a Dual Microphone Array and a Background Noise Elimination Algorithm. Sensors (Basel) 2018; 18:E1467. [PMID: 29738481 DOI: 10.3390/s18051467] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/19/2018] [Revised: 04/29/2018] [Accepted: 05/03/2018] [Indexed: 11/21/2022]
Abstract
Mobile devices are often used in our daily lives for speech and communication. Their speech quality is frequently degraded by the environmental noise surrounding users, and an effective background noise reduction solution for this speech enhancement problem is not easily developed. For these reasons, a methodology is systematically proposed to eliminate the effects of background noise on the speech communication of mobile devices. This methodology integrates a dual microphone array with a background noise elimination algorithm comprising a whitening process, a speech modelling method, and an H2 estimator. Because a dual microphone array is adopted, a low-cost design can be obtained for the speech enhancement of mobile devices. Practical tests demonstrate that the proposed method is immune to random background noise and that noiseless speech can be obtained after executing the denoising process.
Collapse
|
40
|
Abstract
In this paper, we propose a new speech enhancement algorithm based on wavelet packet decomposition and mask filtering. In traditional mask filtering, such as the ideal binary mask (IBM), the basic idea is to classify speech components as the target signal and non-speech components as background noise. However, speech and non-speech components cannot be separated cleanly into target signal and background noise, so the IBM suffers from residual noise and signal loss. To overcome this problem, the proposed algorithm uses a semi-soft mask filter combined with an exponential filter: the semi-soft mask minimizes signal loss, while the exponential filter removes residual noise. We performed experiments using various types of speech and noise signals, and the results show that the proposed algorithm outperforms traditional speech enhancement algorithms.
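The contrast between the hard IBM decision and a semi-soft mask can be sketched as below. The exponential decay below the local criterion is an assumed form consistent with the description, with an illustrative slope, not the paper's exact filter.

```python
import numpy as np

def ibm(snr_db, thresh=0.0):
    """Ideal binary mask: keep TF units above the local SNR criterion."""
    return (snr_db > thresh).astype(float)

def semi_soft_mask(snr_db, thresh=0.0, slope=0.5):
    """Semi-soft mask: unity above the criterion, smoothly decaying
    below it instead of hard zeroing, trading residual noise against
    signal loss (decay slope is an assumed parameter)."""
    return np.where(snr_db > thresh, 1.0,
                    np.exp(slope * (snr_db - thresh)))
```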
Collapse
Affiliation(s)
- Gihyoun Lee
- Department of Medical & Biological Engineering, Graduate School, Kyungpook National University, Daegu, Korea
| | - Sung Dae Na
- Department of Medical & Biological Engineering, Graduate School, Kyungpook National University, Daegu, Korea
| | - KiWoong Seong
- Department of Biomedical Engineering, Kyungpook National University Hospital, Daegu, Korea
| | - Jin-Ho Cho
- School of Electronics Engineering, College of IT Engineering, Kyungpook National University, Daegu, Korea
| | - Myoung Nam Kim
- Department of Biomedical Engineering, School of Medicine, Kyungpook National University, Daegu, Korea
| |
Collapse
|
41
|
Chen F, Li S, Li C, Liu M, Li Z, Xue H, Jing X, Wang J. A Novel Method for Speech Acquisition and Enhancement by 94 GHz Millimeter-Wave Sensor. Sensors (Basel) 2015; 16:E50. [PMID: 26729126 DOI: 10.3390/s16010050] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/09/2015] [Revised: 12/10/2015] [Accepted: 12/23/2015] [Indexed: 12/02/2022]
Abstract
In order to improve the speech acquisition ability of a non-contact method, a 94 GHz millimeter wave (MMW) radar sensor was employed to detect speech signals. This novel non-contact speech acquisition method was shown to have high directional sensitivity, and to be immune to strong acoustical disturbance. However, MMW radar speech is often degraded by combined sources of noise, which mainly include harmonic, electrical circuit and channel noise. In this paper, an algorithm combining empirical mode decomposition (EMD) and mutual information entropy (MIE) was proposed for enhancing the perceptibility and intelligibility of radar speech. Firstly, the radar speech signal was adaptively decomposed into oscillatory components called intrinsic mode functions (IMFs) by EMD. Secondly, MIE was used to determine the number of reconstructive components, and then an adaptive threshold was employed to remove the noise from the radar speech. The experimental results show that human speech can be effectively acquired by a 94 GHz MMW radar sensor when the detection distance is 20 m. Moreover, the noise of the radar speech is greatly suppressed and the speech sounds become more pleasant to human listeners after being enhanced by the proposed algorithm, suggesting that this novel speech acquisition and enhancement method will provide a promising alternative for various applications associated with speech detection.
Collapse
|
42
|
Abstract
Current cochlear implant (CI) strategies carry speech information via the waveform envelope in frequency subbands. CIs require efficient speech processing to maximize information transfer to the brain, especially in background noise, where the speech envelope is not robust to noise interference. In such conditions, the envelope, after decomposition into frequency bands, may be enhanced by sparse transformations, such as nonnegative matrix factorization (NMF). Here, a novel CI processing algorithm is described, which works by applying NMF to the envelope matrix (envelopogram) of 22 frequency channels in order to improve performance in noisy environments. It is evaluated for speech in eight-talker babble noise. The critical sparsity constraint parameter was first tuned using objective measures and then evaluated with subjective speech perception experiments for both normal hearing and CI subjects. Results from vocoder simulations with 10 normal hearing subjects showed that the algorithm significantly enhances speech intelligibility with the selected sparsity constraints. Results from eight CI subjects showed no significant overall improvement compared with the standard advanced combination encoder algorithm, but a trend toward improved word identification of about 10 percentage points at +15 dB signal-to-noise ratio (SNR) was observed. Additionally, the spread of speech perception performance narrowed considerably, from 40%-93% with the advanced combination encoder to 80%-100% with the suggested NMF coding strategy.
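Conceptually, the strategy factorizes the 22-channel envelopogram and resynthesizes it from a small number of nonnegative components; a sketch with scikit-learn follows. The component count and initialization are illustrative, and the paper's specific sparsity constraint is not reproduced here.

```python
import numpy as np
from sklearn.decomposition import NMF

def nmf_envelope_denoise(envelopogram, n_components=10):
    """Reconstruct a (channels x frames) nonnegative envelope matrix
    from a low-rank NMF fit; the product W @ H serves as the denoised
    envelopogram sent on to the CI channels."""
    model = NMF(n_components=n_components, init='nndsvda', max_iter=500)
    W = model.fit_transform(envelopogram)     # channel basis vectors
    H = model.components_                     # time activations
    return W @ H
```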
Collapse
Affiliation(s)
- Hongmei Hu
- Institute of Sound and Vibration Research, University of Southampton, UK; Medizinische Physik, Universität Oldenburg and Cluster of Excellence "Hearing4all", Oldenburg, Germany
| | - Mark E Lutman
- Institute of Sound and Vibration Research, University of Southampton, UK
| | - Stephan D Ewert
- Medizinische Physik, Universität Oldenburg and Cluster of Excellence "Hearing4all", Oldenburg, Germany
| | - Guoping Li
- Institute of Sound and Vibration Research, University of Southampton, UK; The Ear Institute, Faculty of Brain Sciences, University College London, UK
| | - Stefan Bleeck
- Institute of Sound and Vibration Research, University of Southampton, UK
| |
Collapse
|
43
|
Goldsworthy RL. Two-microphone spatial filtering improves speech reception for cochlear-implant users in reverberant conditions with multiple noise sources. Trends Hear 2014; 18:18/0/2331216514555489. [PMID: 25330772 PMCID: PMC4227667 DOI: 10.1177/2331216514555489] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
This study evaluates a spatial-filtering algorithm as a method to improve speech reception for cochlear-implant (CI) users in reverberant environments with multiple noise sources. The algorithm was designed to filter sounds using phase differences between two microphones situated 1 cm apart in a behind-the-ear hearing-aid capsule. Speech reception thresholds (SRTs) were measured using a Coordinate Response Measure for six CI users in 27 listening conditions including each combination of reverberation level (T60=0, 270, and 540 ms), number of noise sources (1, 4, and 11), and signal-processing algorithm (omnidirectional response, dipole-directional response, and spatial-filtering algorithm). Noise sources were time-reversed speech segments randomly drawn from the Institute of Electrical and Electronics Engineers sentence recordings. Target speech and noise sources were processed using a room simulation method allowing precise control over reverberation times and sound-source locations. The spatial-filtering algorithm was found to provide improvements in SRTs on the order of 6.5 to 11.0 dB across listening conditions compared with the omnidirectional response. This result indicates that such phase-based spatial filtering can improve speech reception for CI users even in highly reverberant conditions with multiple noise sources.
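A simplified version of such phase-based two-microphone filtering is sketched below: time-frequency units whose inter-microphone phase difference is consistent with a frontal (near zero-delay) target are kept, and the rest are attenuated. The binary decision rule and tolerance are assumptions; the evaluated algorithm's actual gain rule is more refined.

```python
import numpy as np

def phase_mask(X1, X2, freqs, d=0.01, c=343.0, tol=0.5):
    """Phase-difference spatial filter for a frontal target (sketch).

    X1, X2 : (bins, frames) STFTs of the two closely spaced mics
    freqs  : (bins,) bin frequencies in Hz; d: mic spacing in metres
    """
    dphi = np.angle(X1 * np.conj(X2))               # observed phase diff
    max_phi = 2 * np.pi * freqs[:, None] * d / c    # endfire upper bound
    keep = np.abs(dphi) <= tol * np.maximum(max_phi, 1e-6)
    return keep.astype(float) * X1                  # masked reference mic
```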
Collapse
|
44
|
Abstract
This paper presents a single-channel speech enhancement system using subband Kalman filtering, in which optimal autoregressive (AR) coefficients and variances for speech and noise are estimated via weighted linear prediction (WLP) and a noise weighting function (NWF). The system is applied to normal and oesophageal speech signals. The method is evaluated by the perceptual evaluation of speech quality (PESQ) score and signal-to-noise ratio (SNR) improvement for normal speech, and by the harmonic-to-noise ratio (HNR) for oesophageal speech (OES). Compared with previous systems, normal speech shows a 30% increase in PESQ score and a 4 dB SNR improvement, while OES shows a 3 dB HNR improvement.
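For reference, a scalar-observation Kalman filter driven by an AR(p) speech model, of the kind applied per subband above, can be sketched as follows; the AR coefficients and noise variances are assumed to be supplied by the WLP/NWF estimation stage.

```python
import numpy as np

def kalman_ar_enhance(y, a, q, r):
    """Kalman filter with an AR(p) speech model (sketch).

    y : noisy observations; a : AR coefficients (p,)
    q : process (excitation) variance; r : observation noise variance
    """
    p = len(a)
    F = np.zeros((p, p)); F[0] = a; F[1:, :-1] = np.eye(p - 1)  # companion
    H = np.zeros(p); H[0] = 1.0          # observe the current sample
    Q = np.zeros((p, p)); Q[0, 0] = q
    x = np.zeros(p); P = np.eye(p)
    out = np.zeros_like(y, dtype=float)
    for t, yt in enumerate(y):
        x = F @ x; P = F @ P @ F.T + Q                 # predict
        k = P @ H / (H @ P @ H + r)                    # Kalman gain
        x = x + k * (yt - H @ x)                       # update state
        P = P - np.outer(k, H) @ P
        out[t] = x[0]                                  # filtered speech
    return out
```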
Collapse
Affiliation(s)
- Rizwan Ishaq
- Deustotech-LIFE, University of Deusto, Bilbao, Spain
| | | |
Collapse
|
45
|
Abstract
Most noise reduction algorithms rely on obtaining reliable estimates of the SNR of each frequency bin. For that reason, much work has been done in analyzing the behavior and performance of SNR estimation algorithms in the context of improving speech quality and reducing speech distortions (e.g., musical noise). Comparatively little work has been reported, however, regarding the analysis and investigation of the effect of errors in SNR estimation on speech intelligibility. It is not known, for instance, whether it is the errors in SNR overestimation, errors in SNR underestimation, or both that are harmful to speech intelligibility. Errors in SNR estimation produce concomitant errors in the computation of the gain (suppression) function, and the impact of gain estimation errors on speech intelligibility is unclear. The present study assesses the effect of SNR estimation errors on gain function estimation via sensitivity analysis. Intelligibility listening studies were conducted to validate the sensitivity analysis. Results indicated that speech intelligibility is severely compromised when SNR and gain over-estimation errors are introduced in spectral components with negative SNR. A theoretical upper bound on the gain function is derived that can be used to constrain the values of the gain function so as to ensure that SNR overestimation errors are minimized. Speech enhancement algorithms that can limit the values of the gain function to fall within this upper bound can improve speech intelligibility.
Collapse
Affiliation(s)
| | - Philipos C. Loizou
- Address correspondence to: Philipos C. Loizou, Ph.D., Department of Electrical Engineering, University of Texas at Dallas, 800 West Campbell Road (EC33), Richardson, TX 75080-0688, Phone: (972) 883-4617, Fax: (972) 883-2710
| |
Collapse
|
46
|
Abstract
Making meaningful comparisons between the performance of the various speech enhancement algorithms proposed over the years has been elusive due to the lack of a common speech database, differences in the types of noise used, and differences in testing methodology. To facilitate such comparisons, we report on the development of a noisy speech corpus suitable for the evaluation of speech enhancement algorithms. This corpus is subsequently used for the subjective evaluation of 13 speech enhancement methods encompassing four classes of algorithms: spectral subtractive, subspace, statistical-model based, and Wiener-type algorithms. The subjective evaluation was performed by Dynastat, Inc. using the ITU-T P.835 methodology, which is designed to evaluate speech quality along three dimensions: signal distortion, noise distortion, and overall quality. This paper reports the results of the subjective tests.
Collapse
Affiliation(s)
- Yi Hu
- Department of Electrical Engineering, The University of Texas at Dallas, Richardson, Texas 75083-0688, USA
| | | |
Collapse
|
47
|
Abstract
This paper focuses on optimal estimators of the magnitude spectrum for speech enhancement. We present an analytical solution for estimating in the MMSE sense the magnitude spectrum when the clean speech DFT coefficients are modeled by a Laplacian distribution and the noise DFT coefficients are modeled by a Gaussian distribution. Furthermore, we derive the MMSE estimator under speech presence uncertainty and a Laplacian statistical model. Results indicated that the Laplacian-based MMSE estimator yielded less residual noise in the enhanced speech than the traditional Gaussian-based MMSE estimator. Overall, the present study demonstrates that the assumed distribution of the DFT coefficients can have a significant effect on the quality of the enhanced speech.
Collapse
Affiliation(s)
| | - Philipos C. Loizou
- * Corresponding author: Department of Electrical Engineering, University of Texas at Dallas, Richardson, Texas 75083-0688. Phone: (972) 883-4617, Fax: (972) 883-2710
| |
Collapse
|