1
|
Rusho RZ, Ahmed AH, Kruger S, Alam W, Meyer D, Howard D, Story B, Jacob M, Lingala SG. Prospectively accelerated dynamic speech magnetic resonance imaging at 3 T using a self-navigated spiral-based manifold regularized scheme. NMR IN BIOMEDICINE 2024; 37:e5135. [PMID: 38440911 DOI: 10.1002/nbm.5135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 02/05/2024] [Accepted: 02/06/2024] [Indexed: 03/06/2024]
Abstract
This work develops and evaluates a self-navigated variable density spiral (VDS)-based manifold regularization scheme to prospectively improve dynamic speech magnetic resonance imaging (MRI) at 3 T. Short readout duration spirals (1.3-ms long) were used to minimize sensitivity to off-resonance. A custom 16-channel speech coil was used for improved parallel imaging of vocal tract structures. The manifold model leveraged similarities between frames sharing similar vocal tract postures without explicit motion binning. The self-navigating capability of VDS was leveraged to learn the Laplacian structure of the manifold. Reconstruction was posed as a sensitivity-encoding-based nonlocal soft-weighted temporal regularization scheme. Our approach was compared with view-sharing, low-rank, temporal finite difference, extra dimension-based sparsity reconstruction constraints. Undersampling experiments were conducted on five volunteers performing repetitive and arbitrary speaking tasks at different speaking rates. Quantitative evaluation in terms of mean square error over moving edges was performed in a retrospective undersampling experiment on one volunteer. For prospective undersampling, blinded image quality evaluation in the categories of alias artifacts, spatial blurring, and temporal blurring was performed by three experts in voice research. Region of interest analysis at articulator boundaries was performed in both experiments to assess articulatory motion. Improved performance with manifold reconstruction constraints was observed over existing constraints. With prospective undersampling, a spatial resolution of 2.4 × 2.4 mm2/pixel and a temporal resolution of 17.4 ms/frame for single-slice imaging, and 52.2 ms/frame for concurrent three-slice imaging, were achieved. We demonstrated implicit motion binning by analyzing the mechanics of the Laplacian matrix. Manifold regularization demonstrated superior image quality scores in reducing spatial and temporal blurring compared with all other reconstruction constraints. While it exhibited faint (nonsignificant) alias artifacts that were similar to temporal finite difference, it provided statistically significant improvements compared with the other constraints. In conclusion, the self-navigated manifold regularized scheme enabled robust high spatiotemporal resolution dynamic speech MRI at 3 T.
Collapse
Affiliation(s)
- Rushdi Zahid Rusho
- Roy J. Carver Department of Biomedical Engineering, University of Iowa, Iowa City, Iowa, USA
| | - Abdul Haseeb Ahmed
- Department of Electrical and Computer Engineering, University of Iowa, Iowa City, Iowa, USA
| | - Stanley Kruger
- Department of Radiology, University of Iowa, Iowa City, Iowa, USA
| | - Wahidul Alam
- Roy J. Carver Department of Biomedical Engineering, University of Iowa, Iowa City, Iowa, USA
| | - David Meyer
- Janette Ogg Voice Research Center, Shenandoah University, Winchester, Virginia, USA
| | - David Howard
- Department of Electronic Engineering, Royal Holloway, University of London, London, UK
| | - Brad Story
- Department of Speech, Language, and Hearing Sciences, University of Arizona, Tucson, Arizona, USA
| | - Mathews Jacob
- Department of Electrical and Computer Engineering, University of Iowa, Iowa City, Iowa, USA
| | - Sajan Goud Lingala
- Roy J. Carver Department of Biomedical Engineering, University of Iowa, Iowa City, Iowa, USA
- Department of Radiology, University of Iowa, Iowa City, Iowa, USA
| |
Collapse
|
2
|
Kuwabara MS, Sitzman TJ, Szymanski KA, Perry JL, Miller JH, Cornejo P. The Pediatric Neuroradiologist's Practical Guide to Capture and Evaluate Pre- and Postoperative Velopharyngeal Insufficiency. AJNR Am J Neuroradiol 2023; 45:9-15. [PMID: 38164545 PMCID: PMC10756579 DOI: 10.3174/ajnr.a8055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 10/03/2023] [Indexed: 01/03/2024]
Abstract
Up to 30% of children with cleft palate will develop a severe speech disorder known as velopharyngeal insufficiency. Management of velopharyngeal insufficiency typically involves structural and functional assessment of the velum and pharynx by endoscopy and/or videofluoroscopy. These methods cannot provide direct evaluation of underlying velopharyngeal musculature. MR imaging offers an ideal imaging method, providing noninvasive, high-contrast, high-resolution imaging of soft-tissue anatomy. Furthermore, focused-speech MR imaging techniques can evaluate the function of the velum and pharynx during sustained speech production, providing critical physiologic information that supplements anatomic findings. The use of MR imaging for velopharyngeal evaluation is relatively novel, with limited literature describing its use in clinical radiology. Here we provide a practical approach to perform and interpret velopharyngeal MR imaging examinations. This article discusses the velopharyngeal MR imaging protocol, methods for interpreting velopharyngeal anatomy, and examples illustrating its clinical applications. This knowledge will provide radiologists with a new, noninvasive tool to offer to referring specialists.
Collapse
Affiliation(s)
- Michael S Kuwabara
- From the Radiology Department (M.S.K., J.H.M., P.C.), Phoenix Children's Hospital, Phoenix, Arizona
| | - Thomas J Sitzman
- Plastic Surgery Division (T.J.S.), Phoenix Children's Hospital, Phoenix, Arizona
| | - Kathryn A Szymanski
- Creighton University School of Medicine (K.A.S.), Phoenix Regional Campus, Phoenix, Arizona
| | - Jamie L Perry
- Department of Communication Sciences and Disorders (J.L.P.), East Carolina University, Greenville, North Carolina
| | - Jeffrey H Miller
- From the Radiology Department (M.S.K., J.H.M., P.C.), Phoenix Children's Hospital, Phoenix, Arizona
| | - Patricia Cornejo
- From the Radiology Department (M.S.K., J.H.M., P.C.), Phoenix Children's Hospital, Phoenix, Arizona
| |
Collapse
|
3
|
Zhao Z, Lim Y, Byrd D, Narayanan S, Nayak KS. Improved 3D real-time MRI of speech production. Magn Reson Med 2021; 85:3182-3195. [PMID: 33452722 DOI: 10.1002/mrm.28651] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2020] [Revised: 10/29/2020] [Accepted: 11/26/2020] [Indexed: 01/21/2023]
Abstract
PURPOSE To provide 3D real-time MRI of speech production with improved spatio-temporal sharpness using randomized, variable-density, stack-of-spiral sampling combined with a 3D spatio-temporally constrained reconstruction. METHODS We evaluated five candidate (k, t) sampling strategies using a previously proposed gradient-echo stack-of-spiral sequence and a 3D constrained reconstruction with spatial and temporal penalties. Regularization parameters were chosen by expert readers based on qualitative assessment. We experimentally determined the effect of spiral angle increment and kz temporal order. The strategy yielding highest image quality was chosen as the proposed method. We evaluated the proposed and original 3D real-time MRI methods in 2 healthy subjects performing speech production tasks that invoke rapid movements of articulators seen in multiple planes, using interleaved 2D real-time MRI as the reference. We quantitatively evaluated tongue boundary sharpness in three locations at two speech rates. RESULTS The proposed data-sampling scheme uses a golden-angle spiral increment in the kx -ky plane and variable-density, randomized encoding along kz . It provided a statistically significant improvement in tongue boundary sharpness score (P < .001) in the blade, body, and root of the tongue during normal and 1.5-times speeded speech. Qualitative improvements were substantial during natural speech tasks of alternating high, low tongue postures during vowels. The proposed method was also able to capture complex tongue shapes during fast alveolar consonant segments. Furthermore, the proposed scheme allows flexible retrospective selection of temporal resolution. CONCLUSION We have demonstrated improved 3D real-time MRI of speech production using randomized, variable-density, stack-of-spiral sampling with a 3D spatio-temporally constrained reconstruction.
Collapse
Affiliation(s)
- Ziwei Zhao
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Yongwan Lim
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Dani Byrd
- Department of Linguistics, Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, CA, USA
| | - Shrikanth Narayanan
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA.,Department of Linguistics, Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, CA, USA
| | - Krishna S Nayak
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
4
|
Lim Y, Zhu Y, Lingala SG, Byrd D, Narayanan S, Nayak KS. 3D dynamic MRI of the vocal tract during natural speech. Magn Reson Med 2018; 81:1511-1520. [PMID: 30390319 DOI: 10.1002/mrm.27570] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Revised: 09/25/2018] [Accepted: 09/26/2018] [Indexed: 12/19/2022]
Abstract
PURPOSE To develop and evaluate a technique for 3D dynamic MRI of the full vocal tract at high temporal resolution during natural speech. METHODS We demonstrate 2.4 × 2.4 × 5.8 mm3 spatial resolution, 61-ms temporal resolution, and a 200 × 200 × 70 mm3 FOV. The proposed method uses 3D gradient-echo imaging with a custom upper-airway coil, a minimum-phase slab excitation, stack-of-spirals readout, pseudo golden-angle view order in kx -ky , linear Cartesian order along kz , and spatiotemporal finite difference constrained reconstruction, with 13-fold acceleration. This technique is evaluated using in vivo vocal tract airway data from 2 healthy subjects acquired at 1.5T scanner, 1 with synchronized audio, with 2 tasks during production of natural speech, and via comparison with interleaved multislice 2D dynamic MRI. RESULTS This technique captured known dynamics of vocal tract articulators during natural speech tasks including tongue gestures during the production of consonants "s" and "l" and of consonant-vowel syllables, and was additionally consistent with 2D dynamic MRI. Coordination of lingual (tongue) movements for consonants is demonstrated via volume-of-interest analysis. Vocal tract area function dynamics revealed critical lingual constriction events along the length of the vocal tract for consonants and vowels. CONCLUSION We demonstrate feasibility of 3D dynamic MRI of the full vocal tract, with spatiotemporal resolution adequate to visualize lingual movements for consonants and vocal tact shaping during natural productions of consonant-vowel syllables, without requiring multiple repetitions.
Collapse
Affiliation(s)
- Yongwan Lim
- Ming Hsieh Department of Electrical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California
| | - Yinghua Zhu
- Ming Hsieh Department of Electrical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California
| | - Sajan Goud Lingala
- Department of Biomedical Engineering, College of Engineering, University of Iowa, Iowa City, Iowa
| | - Dani Byrd
- Department of Linguistics, Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, California
| | - Shrikanth Narayanan
- Ming Hsieh Department of Electrical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California
| | - Krishna Shrinivas Nayak
- Ming Hsieh Department of Electrical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California
| |
Collapse
|
5
|
Giammarinaro B, Coulouvrat F, Pinton G. Numerical Simulation of Focused Shock Shear Waves in Soft Solids and a Two-Dimensional Nonlinear Homogeneous Model of the Brain. J Biomech Eng 2016; 138:041003. [PMID: 26833489 DOI: 10.1115/1.4032643] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2015] [Indexed: 12/22/2022]
Abstract
Shear waves that propagate in soft solids, such as the brain, are strongly nonlinear and can develop into shock waves in less than one wavelength. We hypothesize that these shear shock waves could be responsible for certain types of traumatic brain injuries (TBI) and that the spherical geometry of the skull bone could focus shear waves deep in the brain, generating diffuse axonal injuries. Theoretical models and numerical methods that describe nonlinear polarized shear waves in soft solids such as the brain are presented. They include the cubic nonlinearities that are characteristic of soft solids and the specific types of nonclassical attenuation and dispersion observed in soft tissues and the brain. The numerical methods are validated with analytical solutions, where possible, and with self-similar scaling laws where no known solutions exist. Initial conditions based on a human head X-ray microtomography (CT) were used to simulate focused shear shock waves in the brain. Three regimes are investigated with shock wave formation distances of 2.54 m, 0.018 m, and 0.0064 m. We demonstrate that under realistic loading scenarios, with nonlinear properties consistent with measurements in the brain, and when the shock wave propagation distance and focal distance coincide, nonlinear propagation can easily overcome attenuation to generate shear shocks deep inside the brain. Due to these effects, the accelerations in the focal are larger by a factor of 15 compared to acceleration at the skull surface. These results suggest that shock wave focusing could be responsible for diffuse axonal injuries.
Collapse
|
6
|
Fu M, Barlaz MS, Holtrop JL, Perry JL, Kuehn DP, Shosted RK, Liang ZP, Sutton BP. High-frame-rate full-vocal-tract 3D dynamic speech imaging. Magn Reson Med 2016; 77:1619-1629. [PMID: 27099178 DOI: 10.1002/mrm.26248] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2015] [Revised: 03/25/2016] [Accepted: 03/28/2016] [Indexed: 11/08/2022]
Abstract
PURPOSE To achieve high temporal frame rate, high spatial resolution and full-vocal-tract coverage for three-dimensional dynamic speech MRI by using low-rank modeling and sparse sampling. METHODS Three-dimensional dynamic speech MRI is enabled by integrating a novel data acquisition strategy and an image reconstruction method with the partial separability model: (a) a self-navigated sparse sampling strategy that accelerates data acquisition by collecting high-nominal-frame-rate cone navigator sand imaging data within a single repetition time, and (b) are construction method that recovers high-quality speech dynamics from sparse (k,t)-space data by enforcing joint low-rank and spatiotemporal total variation constraints. RESULTS The proposed method has been evaluated through in vivo experiments. A nominal temporal frame rate of 166 frames per second (defined based on a repetition time of 5.99 ms) was achieved for an imaging volume covering the entire vocal tract with a spatial resolution of 2.2 × 2.2 × 5.0 mm3 . Practical utility of the proposed method was demonstrated via both validation experiments and a phonetics investigation. CONCLUSION Three-dimensional dynamic speech imaging is possible with full-vocal-tract coverage, high spatial resolution and high nominal frame rate to provide dynamic speech data useful for phonetic studies. Magn Reson Med 77:1619-1629, 2017. © 2016 International Society for Magnetic Resonance in Medicine.
Collapse
Affiliation(s)
- Maojing Fu
- Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA.,Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Marissa S Barlaz
- Linguistics, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Joseph L Holtrop
- Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA.,Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Jamie L Perry
- Communication Sciences and Disorders, East Carolina University, Greenville, North Carolina, USA
| | - David P Kuehn
- Speech and Hearing Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Ryan K Shosted
- Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA.,Linguistics, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Zhi-Pei Liang
- Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA.,Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Bradley P Sutton
- Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA.,Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| |
Collapse
|
7
|
Freitas AC, Wylezinska M, Birch MJ, Petersen SE, Miquel ME. Comparison of Cartesian and Non-Cartesian Real-Time MRI Sequences at 1.5T to Assess Velar Motion and Velopharyngeal Closure during Speech. PLoS One 2016; 11:e0153322. [PMID: 27073905 PMCID: PMC4830548 DOI: 10.1371/journal.pone.0153322] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2015] [Accepted: 03/28/2016] [Indexed: 11/19/2022] Open
Abstract
Dynamic imaging of the vocal tract using real-time MRI has been an active and growing area of research, having demonstrated great potential to become routinely performed in the clinical evaluation of speech and swallowing disorders. Although many technical advances have been made in regards to acquisition and reconstruction methodologies, there is still no consensus in best practice protocols. This study aims to compare Cartesian and non-Cartesian real-time MRI sequences, regarding image quality and temporal resolution trade-off, for dynamic speech imaging. Five subjects were imaged at 1.5T, while performing normal phonation, in order to assess velar motion and velopharyngeal closure. Data was acquired using both Cartesian and non-Cartesian (spiral and radial) real-time sequences at five different spatial-temporal resolution sets, between 10 fps (1.7×1.7×10 mm3) and 25 fps (1.5×1.5×10 mm3). Only standard scanning resources provided by the MRI scanner manufacturer were used to ensure easy applicability to clinical evaluation and reproducibility. Data sets were evaluated by comparing measurements of the velar structure, dynamic contrast-to-noise ratio and image quality visual scoring. Results showed that for all proposed sequences, FLASH spiral acquisitions provided higher contrast-to-noise ratio, up to a 170.34% increase at 20 fps, than equivalent bSSFP Cartesian acquisitions for the same spatial-temporal resolution. At higher frame rates (22 and 25 fps), spiral protocols were optimal and provided higher CNR and visual scoring than equivalent radial protocols. Comparison of dynamic imaging at 10 and 22 fps for radial and spiral acquisitions revealed no significant difference in CNR performance, thus indicating that temporal resolution can be doubled without compromising spatial resolution (1.9×1.9 mm2) or CNR. In summary, this study suggests that the use of FLASH spiral protocols should be preferred over bSSFP Cartesian for the dynamic imaging of velopharyngeal closure, as it allows for an improvement in CNR and overall image quality without compromising spatial-temporal resolution.
Collapse
Affiliation(s)
- Andreia C. Freitas
- NIHR Cardiovascular Biomedical Research Unit at Barts, William Harvey Research Institute, Queen Mary University of London, London, United Kingdom
- Clinical Physics, Barts Health NHS Trust, London, United Kingdom
| | - Marzena Wylezinska
- NIHR Cardiovascular Biomedical Research Unit at Barts, William Harvey Research Institute, Queen Mary University of London, London, United Kingdom
| | - Malcolm J. Birch
- Clinical Physics, Barts Health NHS Trust, London, United Kingdom
| | - Steffen E. Petersen
- NIHR Cardiovascular Biomedical Research Unit at Barts, William Harvey Research Institute, Queen Mary University of London, London, United Kingdom
| | - Marc E. Miquel
- NIHR Cardiovascular Biomedical Research Unit at Barts, William Harvey Research Institute, Queen Mary University of London, London, United Kingdom
- Clinical Physics, Barts Health NHS Trust, London, United Kingdom
| |
Collapse
|