1
|
Zhang Y, Pu T, Zhou C, Cai H. An Improved Glottal Flow Model Based on Seq2Seq LSTM for Simulation of Vocal Fold Vibration. J Voice 2024; 38:983-992. [PMID: 35534328 DOI: 10.1016/j.jvoice.2022.03.029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 03/29/2022] [Accepted: 03/30/2022] [Indexed: 10/18/2022]
Abstract
OBJECTIVES An improved data-driven glottal flow model for fluid-structure interaction (FSI) simulation of the vocal fold vibration is proposed in this paper. This model aims to improve the prediction performance of the previously developed deep neural network (DNN) based empirical flow model (EFM)1 on accuracy and efficiency. METHODS A Seq2Seq long short-term memory (LSTM) network is employed in the present model to infer the flow rate and pressure distribution from the subglottal pressure and cross-section area distribution of the glottis. The training data is collected from the generalized glottal shape library generated in Zhang et al.1 RESULTS AND CONCLUSIONS: Compared to the EFM, the present model not only discards the time-consuming optimization process, but also drastically reduces the errors, therefore the prediction performance can be greatly improved. The present model is evaluated by coupling with a solid dynamics solver for FSI simulation, and the results demonstrate a great improvement on accuracy and efficiency.
Collapse
Affiliation(s)
- Yang Zhang
- College of Astronautics, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China.
| | - Tianmei Pu
- College of General Aviation and Flight, Nanjing University of Aeronautics and Astronautics, Nanjing 213300, China
| | - Chunhua Zhou
- Department of Aerodynamics, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
| | - Hongming Cai
- College of Astronautics, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
| |
Collapse
|
2
|
Palaparthi A, Alluri RK, Titze IR. Deep Learning for Neuromuscular Control of Vocal Source for Voice Production. APPLIED SCIENCES (BASEL, SWITZERLAND) 2024; 14:769. [PMID: 39071945 PMCID: PMC11281313 DOI: 10.3390/app14020769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
A computational neuromuscular control system that generates lung pressure and three intrinsic laryngeal muscle activations (cricothyroid, thyroarytenoid, and lateral cricoarytenoid) to control the vocal source was developed. In the current study, LeTalker, a biophysical computational model of the vocal system was used as the physical plant. In the LeTalker, a three-mass vocal fold model was used to simulate self-sustained vocal fold oscillation. A constant/ǝ/vowel was used for the vocal tract shape. The trachea was modeled after MRI measurements. The neuromuscular control system generates control parameters to achieve four acoustic targets (fundamental frequency, sound pressure level, normalized spectral centroid, and signal-to-noise ratio) and four somatosensory targets (vocal fold length, and longitudinal fiber stress in the three vocal fold layers). The deep-learning-based control system comprises one acoustic feedforward controller and two feedback (acoustic and somatosensory) controllers. Fifty thousand steady speech signals were generated using the LeTalker for training the control system. The results demonstrated that the control system was able to generate the lung pressure and the three muscle activations such that the four acoustic and four somatosensory targets were reached with high accuracy. After training, the motor command corrections from the feedback controllers were minimal compared to the feedforward controller except for thyroarytenoid muscle activation.
Collapse
Affiliation(s)
- Anil Palaparthi
- Utah Center for Vocology, University of Utah, Salt Lake City, UT 84112, USA
| | - Rishi K. Alluri
- School of Biological Sciences, University of Utah, Salt Lake City, UT 84112, USA
| | - Ingo R. Titze
- Utah Center for Vocology, University of Utah, Salt Lake City, UT 84112, USA
| |
Collapse
|
3
|
Zhang Y, Jiang W, Sun L, Wang J, Zheng X, Xue Q. A Deep Learning-Based Generalized Empirical Flow Model of Glottal Flow During Normal Phonation. J Biomech Eng 2022; 144:091001. [PMID: 35171218 PMCID: PMC8990722 DOI: 10.1115/1.4053862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Revised: 02/10/2022] [Indexed: 11/08/2022]
Abstract
This paper proposes a deep learning-based generalized empirical flow model (EFM) that can provide a fast and accurate prediction of the glottal flow during normal phonation. The approach is based on the assumption that the vibration of the vocal folds can be represented by a universal kinematics equation (UKE), which is used to generate a glottal shape library. For each shape in the library, the ground truth values of the flow rate and pressure distribution are obtained from the high-fidelity Navier-Stokes (N-S) solution. A fully connected deep neural network (DNN) is then trained to build the empirical mapping between the shapes and the flow rate and pressure distributions. The obtained DNN-based EFM is coupled with a finite element method (FEM)-based solid dynamics solver for fluid-structure-interaction (FSI) simulation of phonation. The EFM is evaluated by comparing the N-S solutions in both static glottal shapes and FSI simulations. The results demonstrate a good prediction performance in accuracy and efficiency.
Collapse
Affiliation(s)
- Yang Zhang
- Department of Mechanical Engineering, University of Maine, Orono, ME 04469
| | - Weili Jiang
- Department of Mechanical Engineering, University of Maine, 204 Crosby Hall, Orono, ME 04473
| | - Luning Sun
- Department of Aerospace and Mechanical Engineering, University of Notre Dame, Notre Dame, IN 46556
| | - Jianxun Wang
- Department of Aerospace and Mechanical Engineering, University of Notre Dame, Notre Dame, IN 46556
| | - Xudong Zheng
- Department of Mechanical Engineering, University of Maine, Room 213 A, Boardman Hall, Orono, ME 04473
| | - Qian Xue
- Department of Mechanical Engineering, University of Maine, Room 213, Boardman Hall, Orono, ME 04473
| |
Collapse
|
4
|
Calvache C, Solaque L, Velasco A, Peñuela L. Biomechanical Models to Represent Vocal Physiology: A Systematic Review. J Voice 2021; 37:465.e1-465.e18. [PMID: 33678534 DOI: 10.1016/j.jvoice.2021.02.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 01/26/2021] [Accepted: 02/02/2021] [Indexed: 11/24/2022]
Abstract
Biomechanical modeling allows obtaining information on physical phenomena that cannot be directly observed. This study aims to review models that represent voice production. A systematic review of the literature was conducted using PubMed/Medline, SCOPUS, and IEEE Xplore databases. To select the papers, we used the protocol PRISMA Statement. A total of 53 publications were included in this review. This article considers a taxonomic classification of models found in the literature. We propose four categories in the taxonomy: (1) Models representing the Source (Vocal folds); (2) Models representing the Filter (Vocal Tract); (3) Models representing the Source - Filter Interaction; and (4) Models representing the Airflow - Source Interaction. We include a bibliographic analysis with the evolution of the publications per category. We provide an analysis of the number as well of publications in journals per year. Moreover, we present an analysis of the term occurrence and its frequency of usage, as found in the literature. In each category, different types of vocal production models are mentioned and analyzed. The models account for the analysis of evidence about aerodynamic, biomechanical, and acoustic phenomena and their correlation with the physiological processes involved in the production of the human voice. This review gives an insight into the state of the art related to the mathematical modeling of voice production, analyzed from the viewpoint of vocal physiology.
Collapse
Affiliation(s)
- Carlos Calvache
- Vocology Center, Bogotá, Colombia; Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia.
| | - Leonardo Solaque
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| | - Alexandra Velasco
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| | - Lina Peñuela
- Department of Mechatronics Engineering, Universidad Militar Nueva Granada, Bogotá, Colombia
| |
Collapse
|
5
|
Li Z, Chen Y, Chang S, Rousseau B, Luo H. A one-dimensional flow model enhanced by machine learning for simulation of vocal fold vibration. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:1712. [PMID: 33765799 PMCID: PMC7954577 DOI: 10.1121/10.0003561] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 01/25/2021] [Accepted: 02/01/2021] [Indexed: 06/02/2023]
Abstract
A one-dimensional (1D) unsteady and viscous flow model that is derived from the momentum and mass conservation equations is described, and to enhance this physics-based model, a machine learning approach is used to determine the unknown modeling parameters. Specifically, an idealized larynx model is constructed and ten cases of three-dimensional (3D) fluid-structure interaction (FSI) simulations are performed. The flow data are then extracted to train the 1D flow model using a sparse identification approach for nonlinear dynamical systems. As a result of training, we obtain the analytical expressions for the entrance effect and pressure loss in the glottis, which are then incorporated in the flow model to conveniently handle different glottal shapes due to vocal fold vibration. We apply the enhanced 1D flow model in the FSI simulation of both idealized vocal fold geometries and subject-specific anatomical geometries reconstructed from the magnetic resonance imaging images of rabbits' larynges. The 1D flow model is evaluated in both of these setups and shown to have robust performance. Therefore, it provides a fast simulation tool that is superior to the previous 1D models.
Collapse
Affiliation(s)
- Zheng Li
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| | - Ye Chen
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| | - Siyuan Chang
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| | - Bernard Rousseau
- Department of Communication Science and Disorders, University of Pittsburgh, Pittsburgh, Pennsylvania 15213, USA
| | - Haoxiang Luo
- Department of Mechanical Engineering, Vanderbilt University, 2301 Vanderbilt Place, Nashville, Tennessee 37235-1592, USA
| |
Collapse
|
6
|
Two-Phase Flow Simulations Using 1D Centerline-Based C- and U-Shaped Pipe Meshes. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11052020] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
This study aims to investigate the pressure changes, bubble dynamics, and flow physics inside the U- and C-shaped pipes with four different gravitational directions. The simulation is performed using a 1D centerline-based mesh generation technique along with a two-fluid model in the open-source software, OpenFOAM v.6. The continuity and momentum equations of the two-fluid model are discretized using the pressure-implicit method for the pressure-linked equation algorithm. The static and hydrostatic pressures in the two-phase flow were consistent with those of single-phase flow. The dynamic pressure in the two-phase flow was strongly influenced by the effect of the buoyancy force. In particular, if the direction of buoyancy force is the same as the flow direction, the dynamic pressure of the air phase increases, and that of the water phase decreases to satisfy the law of conservation of mass. Dean flows are observed on the transverse plane of the curve regions in both C-shaped and U-shaped pipes. The turbulent kinetic energy is stronger in a two-phase flow than in a single-phase flow. Using the 1D centerline-based mesh generation technique, we demonstrate the changes in pressure and the turbulent kinetic energy of the single- and two-phase flows, which could be observed in curve pipes.
Collapse
|
7
|
Gulbulak U, Gecgel O, Ertas A. A deep learning application to approximate the geometric orifice and coaptation areas of the polymeric heart valves under time - varying transvalvular pressure. J Mech Behav Biomed Mater 2021; 117:104371. [PMID: 33610020 DOI: 10.1016/j.jmbbm.2021.104371] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 12/29/2020] [Accepted: 01/26/2021] [Indexed: 11/20/2022]
Abstract
Machine learning and deep learning frameworks have been presented as a substitute for lengthy computational analysis, such as finite element analysis, computational fluid dynamics, and fluid-structure interaction. In this study, our objective was to apply a deep learning framework to predict the geometric orifice (GOA) and the coaptation areas (CA) of the polymeric heart valves under the time-varying transvalvular pressure. 377 different valve geometries were generated by changing the control coordinates of the attachment and the belly curve. The GOA and the CA values were obtained at the maximum and the minimum transvalvular pressure, respectively. The results showed that the applied framework can accurately predict the GOA and the CA despite being trained with a relatively smaller data set. The presented framework can reduce the required time of the lengthy FE frameworks.
Collapse
Affiliation(s)
- Utku Gulbulak
- Department of Mechanical Engineering, Texas Tech University, Lubbock, TX, 79409, USA.
| | - Ozhan Gecgel
- Department of Mechanical Engineering, Texas Tech University, Lubbock, TX, 79409, USA
| | - Atila Ertas
- Department of Mechanical Engineering, Texas Tech University, Lubbock, TX, 79409, USA
| |
Collapse
|