1
|
Yagiz E, Garg P, Cen SY, Nayak KS, Tian Y. Simultaneous multi-slice cardiac real-time MRI at 0.55T. Magn Reson Med 2024. [PMID: 39506513 DOI: 10.1002/mrm.30364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 09/26/2024] [Accepted: 10/16/2024] [Indexed: 11/08/2024]
Abstract
PURPOSE To determine the feasibility of simultaneous multi-slice (SMS) real-time MRI (RT-MRI) at 0.55T for the evaluation of cardiac function. METHODS Cardiac CINE MRI is routinely used to evaluate left-ventricular (LV) function. The standard is sequential multi-slice balanced SSFP (bSSFP) over a stack of short-axis slices using electrocardiogram (ECG) gating and breath-holds. SMS has been used in CINE imaging to reduce the number of breath-holds by a factor of 2-4 at 1.5T, 3T, and recently at 0.55T. This work aims to determine if SMS is similarly effective in the RT-MRI evaluation of cardiac function. We used an SMS bSSFP pulse sequence with golden-angle spirals at 0.55T with an SMS factor of three. We cover the LV with three acquisitions for SMS, and nine for single-band (SB). Imaging was performed on 9 healthy volunteers and 1 patient with myocardial fibrosis and sternal wires. A spatio-temporal constrained reconstruction is used, with regularization parameters selected by a board-certified cardiologist. Images were quantitatively analyzed with a normalized contrast and an Edge Sharpness (ES) score. RESULTS There was a statistically significant 2-fold difference in contrast between SMS and SB and no significant difference in ES score. The contrast for SMS and SB were 13.38/29.05 at mid-diastole and 10.79/22.26 at end-systole; the ES scores for SMS and SB were 1.77/1.83 at mid-diastole and 1.50/1.72 at end-systole. CONCLUSIONS SMS cardiac RT-MRI at 0.55T is feasible and provides sufficient blood-myocardium contrast to evaluate LV function in three slices simultaneously without any gating or periodic motion assumptions.
Collapse
Affiliation(s)
- Ecrin Yagiz
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, USA
| | - Parveen Garg
- Division of Cardiology, Department of Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Steven Y Cen
- Department of Neurology, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
- Department of Radiology, Keck School of Medicine, University of Southern California, Los Angeles, California, USA
| | - Krishna S Nayak
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, USA
| | - Ye Tian
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, USA
| |
Collapse
|
2
|
Jin R, Li Y, Shosted RK, Xing F, Gilbert I, Perry JL, Woo J, Liang ZP, Sutton BP. Optimization of 3D dynamic speech MRI: Poisson-disc undersampling and locally higher-rank reconstruction through partial separability model with regional optimized temporal basis. Magn Reson Med 2024; 91:61-74. [PMID: 37677043 PMCID: PMC10847962 DOI: 10.1002/mrm.29812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 07/09/2023] [Accepted: 07/12/2023] [Indexed: 09/09/2023]
Abstract
PURPOSE To improve the spatiotemporal qualities of images and dynamics of speech MRI through an improved data sampling and image reconstruction approach. METHODS For data acquisition, we used a Poisson-disc random under sampling scheme that reduced the undersampling coherence. For image reconstruction, we proposed a novel locally higher-rank partial separability model. This reconstruction model represented the oral and static regions using separate low-rank subspaces, therefore, preserving their distinct temporal signal characteristics. Regional optimized temporal basis was determined from the regional-optimized virtual coil approach. Overall, we achieved a better spatiotemporal image reconstruction quality with the potential of reducing total acquisition time by 50%. RESULTS The proposed method was demonstrated through several 2-mm isotropic, 64 mm total thickness, dynamic acquisitions with 40 frames per second and compared to the previous approach using a global subspace model along with other k-space sampling patterns. Individual timeframe images and temporal profiles of speech samples were shown to illustrate the ability of the Poisson-disc under sampling pattern in reducing total acquisition time. Temporal information of sagittal and coronal directions was also shown to illustrate the effectiveness of the locally higher-rank operator and regional optimized temporal basis. To compare the reconstruction qualities of different regions, voxel-wise temporal SNR analysis were performed. CONCLUSION Poisson-disc sampling combined with a locally higher-rank model and a regional-optimized temporal basis can drastically improve the spatiotemporal image quality and provide a 50% reduction in overall acquisition time.
Collapse
Affiliation(s)
- Riwei Jin
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Yudu Li
- Beckman Institute for Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
- National Center for Supercomputing Applications, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Ryan K Shosted
- Department of Linguistics, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Fangxu Xing
- Gordon Center for Medical Imaging, Department of Radiology, Massachusetts General Hospital/Harvard Medical School, Boston, Massachusetts, USA
| | - Imani Gilbert
- Department of Communication Sciences and Disorders, East Carolina University, Greenville, North Carolina, USA
| | - Jamie L Perry
- Department of Communication Sciences and Disorders, East Carolina University, Greenville, North Carolina, USA
| | - Jonghye Woo
- Gordon Center for Medical Imaging, Department of Radiology, Massachusetts General Hospital/Harvard Medical School, Boston, Massachusetts, USA
| | - Zhi-Pei Liang
- Beckman Institute for Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
- National Center for Supercomputing Applications, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
- Department of Electrical and Computer Engineering, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Bradley P Sutton
- Department of Bioengineering, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
- National Center for Supercomputing Applications, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
- Carle Illinois College of Medicine, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| |
Collapse
|
3
|
Jin R, Shosted RK, Xing F, Gilbert IR, Perry JL, Woo J, Liang ZP, Sutton BP. Enhancing linguistic research through 2-mm isotropic 3D dynamic speech MRI optimized by sparse temporal sampling and low-rank reconstruction. Magn Reson Med 2023; 89:652-664. [PMID: 36289572 PMCID: PMC9712260 DOI: 10.1002/mrm.29486] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 08/17/2022] [Accepted: 09/16/2022] [Indexed: 12/13/2022]
Abstract
PURPOSE To enable a more comprehensive view of articulations during speech through near-isotropic 3D dynamic MRI with high spatiotemporal resolution and large vocal-tract coverage. METHODS Using partial separability model-based low-rank reconstruction coupled with a sparse acquisition of both spatial and temporal models, we are able to achieve near-isotropic resolution 3D imaging with a high frame rate. The total acquisition time of the speech acquisition is shortened by introducing a sparse temporal sampling that interleaves one temporal navigator with four randomized phase and slice-encoded imaging samples. Memory and computation time are improved through compressing coils based on the region of interest for low-rank constrained reconstruction with an edge-preserving spatial penalty. RESULTS The proposed method has been evaluated through experiments on several speech samples, including a standard reading passage. A near-isotropic 1.875 × 1.875 × 2 mm3 spatial resolution, 64-mm through-plane coverage, and a 35.6-fps temporal resolution are achieved. Investigations and analysis on specific speech samples support novel insights into nonsymmetric tongue movement, velum raising, and coarticulation events with adequate visualization of rapid articulatory movements. CONCLUSION Three-dimensional dynamic images of the vocal tract structures during speech with high spatiotemporal resolution and axial coverage is capable of enhancing linguistic research, enabling visualization of soft tissue motions that are not possible with other modalities.
Collapse
Affiliation(s)
- Riwei Jin
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA,Beckman Institute for Advanced Science and Technology, University of Illinois Urbana Champaign, Urbana, IL
| | - Ryan K. Shosted
- Department of Linguistics, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Fangxu Xing
- Gordon Center for Medical Imaging, Department of Radiology, Massachusetts General Hospital/Harvard Medical School, Boston, Massachusetts 02114, USA
| | - Imani R. Gilbert
- Department of Communication Sciences and Disorders, East Carolina University, Greenville, North Carolina 27858, USA
| | - Jamie L. Perry
- Department of Communication Sciences and Disorders, East Carolina University, Greenville, North Carolina 27858, USA
| | - Jonghye Woo
- Gordon Center for Medical Imaging, Department of Radiology, Massachusetts General Hospital/Harvard Medical School, Boston, Massachusetts 02114, USA
| | - Zhi-Pei Liang
- Beckman Institute for Advanced Science and Technology, University of Illinois Urbana Champaign, Urbana, IL,Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Bradley P. Sutton
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA,Beckman Institute for Advanced Science and Technology, University of Illinois Urbana Champaign, Urbana, IL
| |
Collapse
|
4
|
Lu Y, Wiltshire CEE, Watkins KE, Chiew M, Goldstein L. Characteristics of articulatory gestures in stuttered speech: A case study using real-time magnetic resonance imaging. JOURNAL OF COMMUNICATION DISORDERS 2022; 97:106213. [PMID: 35397388 DOI: 10.1016/j.jcomdis.2022.106213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 02/09/2022] [Accepted: 03/14/2022] [Indexed: 06/14/2023]
Abstract
INTRODUCTION Most of the previous articulatory studies of stuttering have focussed on the fluent speech of people who stutter. However, to better understand what causes the actual moments of stuttering, it is necessary to probe articulatory behaviors during stuttered speech. We examined the supralaryngeal articulatory characteristics of stuttered speech using real-time structural magnetic resonance imaging (RT-MRI). We investigated how articulatory gestures differ across stuttered and fluent speech of the same speaker. METHODS Vocal tract movements of an adult man who stutters during a pseudoword reading task were recorded using RT-MRI. Four regions of interest (ROIs) were defined on RT-MRI image sequences around the lips, tongue tip, tongue body, and velum. The variation of pixel intensity in each ROI over time provided an estimate of the movement of these four articulators. RESULTS All disfluencies occurred on syllable-initial consonants. Three articulatory patterns were identified. Pattern 1 showed smooth gestural formation and release like fluent speech. Patterns 2 and 3 showed delayed release of gestures due to articulator fixation or oscillation respectively. Block and prolongation corresponded to either pattern 1 or 2. Repetition corresponded to pattern 3 or a mix of patterns. Gestures for disfluent consonants typically exhibited a greater constriction than fluent gestures, which was rarely corrected during disfluencies. Gestures for the upcoming vowel were initiated and executed during these consonant disfluencies, achieving a tongue body position similar to the fluent counterpart. CONCLUSION Different perceptual types of disfluencies did not necessarily result from distinct articulatory patterns, highlighting the importance of collecting articulatory data of stuttering. Disfluencies on syllable-initial consonants were related to the delayed release and the overshoot of consonant gestures, rather than the delayed initiation of vowel gestures. This suggests that stuttering does not arise from problems with planning the vowel gestures, but rather with releasing the overly constricted consonant gestures.
Collapse
Affiliation(s)
- Yijing Lu
- Department of Linguistics, University of Southern California, United States.
| | - Charlotte E E Wiltshire
- Wellcome Centre for Integrative Neuroimaging, Department of Experimental Psychology, University of Oxford, United Kingdom.
| | - Kate E Watkins
- Wellcome Centre for Integrative Neuroimaging, Department of Experimental Psychology, University of Oxford, United Kingdom.
| | - Mark Chiew
- Wellcome Centre for Integrative Neuroimaging, Nuffield Department of Clinical Neurosciences, University of Oxford, United Kingdom.
| | - Louis Goldstein
- Department of Linguistics, University of Southern California, United States.
| |
Collapse
|
5
|
Ismail TF, Strugnell W, Coletti C, Božić-Iven M, Weingärtner S, Hammernik K, Correia T, Küstner T. Cardiac MR: From Theory to Practice. Front Cardiovasc Med 2022; 9:826283. [PMID: 35310962 PMCID: PMC8927633 DOI: 10.3389/fcvm.2022.826283] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Accepted: 01/17/2022] [Indexed: 01/10/2023] Open
Abstract
Cardiovascular disease (CVD) is the leading single cause of morbidity and mortality, causing over 17. 9 million deaths worldwide per year with associated costs of over $800 billion. Improving prevention, diagnosis, and treatment of CVD is therefore a global priority. Cardiovascular magnetic resonance (CMR) has emerged as a clinically important technique for the assessment of cardiovascular anatomy, function, perfusion, and viability. However, diversity and complexity of imaging, reconstruction and analysis methods pose some limitations to the widespread use of CMR. Especially in view of recent developments in the field of machine learning that provide novel solutions to address existing problems, it is necessary to bridge the gap between the clinical and scientific communities. This review covers five essential aspects of CMR to provide a comprehensive overview ranging from CVDs to CMR pulse sequence design, acquisition protocols, motion handling, image reconstruction and quantitative analysis of the obtained data. (1) The basic MR physics of CMR is introduced. Basic pulse sequence building blocks that are commonly used in CMR imaging are presented. Sequences containing these building blocks are formed for parametric mapping and functional imaging techniques. Commonly perceived artifacts and potential countermeasures are discussed for these methods. (2) CMR methods for identifying CVDs are illustrated. Basic anatomy and functional processes are described to understand the cardiac pathologies and how they can be captured by CMR imaging. (3) The planning and conduct of a complete CMR exam which is targeted for the respective pathology is shown. Building blocks are illustrated to create an efficient and patient-centered workflow. Further strategies to cope with challenging patients are discussed. (4) Imaging acceleration and reconstruction techniques are presented that enable acquisition of spatial, temporal, and parametric dynamics of the cardiac cycle. The handling of respiratory and cardiac motion strategies as well as their integration into the reconstruction processes is showcased. (5) Recent advances on deep learning-based reconstructions for this purpose are summarized. Furthermore, an overview of novel deep learning image segmentation and analysis methods is provided with a focus on automatic, fast and reliable extraction of biomarkers and parameters of clinical relevance.
Collapse
Affiliation(s)
- Tevfik F. Ismail
- School of Biomedical Engineering and Imaging Sciences, King's College London, London, United Kingdom
- Cardiology Department, Guy's and St Thomas' Hospital, London, United Kingdom
| | - Wendy Strugnell
- Queensland X-Ray, Mater Hospital Brisbane, Brisbane, QLD, Australia
| | - Chiara Coletti
- Magnetic Resonance Systems Lab, Delft University of Technology, Delft, Netherlands
| | - Maša Božić-Iven
- Magnetic Resonance Systems Lab, Delft University of Technology, Delft, Netherlands
- Computer Assisted Clinical Medicine, Heidelberg University, Mannheim, Germany
| | | | - Kerstin Hammernik
- Lab for AI in Medicine, Technical University of Munich, Munich, Germany
- Department of Computing, Imperial College London, London, United Kingdom
| | - Teresa Correia
- School of Biomedical Engineering and Imaging Sciences, King's College London, London, United Kingdom
- Centre of Marine Sciences, Faro, Portugal
| | - Thomas Küstner
- Medical Image and Data Analysis (MIDAS.lab), Department of Diagnostic and Interventional Radiology, University Hospital of Tübingen, Tübingen, Germany
| |
Collapse
|
6
|
Nayak KS, Lim Y, Campbell-Washburn AE, Steeden J. Real-Time Magnetic Resonance Imaging. J Magn Reson Imaging 2022; 55:81-99. [PMID: 33295674 PMCID: PMC8435094 DOI: 10.1002/jmri.27411] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2020] [Revised: 10/06/2020] [Accepted: 10/09/2020] [Indexed: 01/03/2023] Open
Abstract
Real-time magnetic resonance imaging (RT-MRI) allows for imaging dynamic processes as they occur, without relying on any repetition or synchronization. This is made possible by modern MRI technology such as fast-switching gradients and parallel imaging. It is compatible with many (but not all) MRI sequences, including spoiled gradient echo, balanced steady-state free precession, and single-shot rapid acquisition with relaxation enhancement. RT-MRI has earned an important role in both diagnostic imaging and image guidance of invasive procedures. Its unique diagnostic value is prominent in areas of the body that undergo substantial and often irregular motion, such as the heart, gastrointestinal system, upper airway vocal tract, and joints. Its value in interventional procedure guidance is prominent for procedures that require multiple forms of soft-tissue contrast, as well as flow information. In this review, we discuss the history of RT-MRI, fundamental tradeoffs, enabling technology, established applications, and current trends. LEVEL OF EVIDENCE: 5 TECHNICAL EFFICACY STAGE: 1.
Collapse
Affiliation(s)
- Krishna S. Nayak
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California, USA,Address reprint requests to: K.S.N., 3740 McClintock Ave, EEB 400C, Los Angeles, CA 90089-2564, USA.
| | - Yongwan Lim
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, California, USA
| | - Adrienne E. Campbell-Washburn
- Cardiovascular Branch, Division of Intramural Research, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Jennifer Steeden
- Institute of Cardiovascular Science, Centre for Cardiovascular Imaging, University College London, London, UK
| |
Collapse
|
7
|
Contrast-Enhanced T1-Weighted Head and Neck MRI: Prospective Intraindividual Image Quality Comparison of Spiral GRE, Cartesian GRE, and Cartesian TSE Sequences. AJR Am J Roentgenol 2021; 218:132-139. [PMID: 34406050 DOI: 10.2214/ajr.21.26413] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
BACKGROUND. Sequences with noncartesian k-space sampling may improve image quality of head and neck MRI. OBJECTIVE. The purpose of this study was to compare intraindividually the image quality of a spiral gradient-recalled echo (GRE) sequence and conventional cartesian GRE and cartesian turbo spin-echo (TSE) sequences for contrast-enhanced T1-weighted head and neck MRI. METHODS. This prospective study included patients referred for contrast-enhanced head and neck MRI from August 2020 to May 2021. Patients underwent 1.5-T MRI including contrast-enhanced spiral GRE (2 minutes 28 seconds), cartesian GRE (4 minutes 27 seconds), and cartesian TSE (3 minutes 41 seconds) sequences, acquired in rotating order across patients. Three radiologists independently assessed image quality measures, including conspicuity of prespecified lesions, using 5-point Likert scales. One reader measured maximal extent of dental material artifact and contrast-to-noise ratio (CNR). RESULTS. Thirty-one patients (13 men, 18 women; mean age, 63.8 years) were enrolled. Nineteen patients had a focal lesion; 22 had dental material. Interreader agreement for image quality measures was substantial to excellent (Krippendorff alpha, 0.681-1.000). Scores for overall image quality (whole head and neck, neck only, and head only), pulsation artifact, muscular contour delineation, vessel contour delineation, motion artifact, and differentiation between mucosa and pharyngeal muscles were significantly better for spiral GRE than for cartesian GRE and cartesian TSE for all readers (p < .05). Scores for lesion conspicuity (whole head and neck, neck only, and head only), quality of fat suppression, flow artifact, and foldover artifact were not significantly different between spiral GRE and the cartesian sequences for any reader (p > .05). Dental material artifact scores were significantly worse for spiral GRE than the other sequences for all readers (p < .05). The mean maximum extent of dental material artifact was 39.6 ± 25.5 (SD) mm for spiral GRE, 35.6 ± 24.3 mm for cartesian GRE, and 29.6 ± 21.4 mm for cartesian TSE; the mean CNR was 221.1 ± 94.5 for spiral GRE, 151.8 ± 85.7 for cartesian GRE, and 153.0 ± 63.2 for cartesian TSE (p < .001 between spiral GRE and other sequences for both measures). CONCLUSION. Three-dimensional spiral GRE improves subjective image quality and CNR of head and neck MRI with shorter scan time versus cartesian sequences, though it exhibits larger dental material artifact. CLINICAL IMPACT. A spiral sequence may help overcome certain challenges of conventional cartesian sequences for head and neck MRI.
Collapse
|
8
|
Tian Y, Lim Y, Zhao Z, Byrd D, Narayanan S, Nayak KS. Aliasing artifact reduction in spiral real-time MRI. Magn Reson Med 2021; 86:916-925. [PMID: 33728700 DOI: 10.1002/mrm.28746] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Revised: 01/09/2021] [Accepted: 02/02/2021] [Indexed: 12/17/2022]
Abstract
PURPOSE To mitigate a common artifact in spiral real-time MRI, caused by aliasing of signal outside the desired FOV. This artifact frequently occurs in midsagittal speech real-time MRI. METHODS Simulations were performed to determine the likely origin of the artifact. Two methods to mitigate the artifact are proposed. The first approach, denoted as "large FOV" (LF), keeps an FOV that is large enough to include the artifact signal source during reconstruction. The second approach, denoted as "estimation-subtraction" (ES), estimates the artifact signal source before subtracting a synthetic signal representing that source in multicoil k-space raw data. Twenty-five midsagittal speech-production real-time MRI data sets were used to evaluate both of the proposed methods. Reconstructions without and with corrections were evaluated by two expert readers using a 5-level Likert scale assessing artifact severity. Reconstruction time was also compared. RESULTS The origin of the artifact was found to be a combination of gradient nonlinearity and imperfect anti-aliasing in spiral sampling. The LF and ES methods were both able to substantially reduce the artifact, with an averaged qualitative score improvement of 1.25 and 1.35 Likert levels for LF correction and ES correction, respectively. Average reconstruction time without correction, with LF correction, and with ES correction were 160.69 ± 1.56, 526.43 ± 5.17, and 171.47 ± 1.71 ms/frame. CONCLUSION Both proposed methods were able to reduce the spiral aliasing artifacts, with the ES-reduction method being more effective and more time efficient.
Collapse
Affiliation(s)
- Ye Tian
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, USA
| | - Yongwan Lim
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, USA
| | - Ziwei Zhao
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, USA
| | - Dani Byrd
- Department of Linguistics, Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, California, USA
| | - Shrikanth Narayanan
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, USA.,Department of Linguistics, Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, California, USA
| | - Krishna S Nayak
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, USA
| |
Collapse
|
9
|
Zhao Z, Lim Y, Byrd D, Narayanan S, Nayak KS. Improved 3D real-time MRI of speech production. Magn Reson Med 2021; 85:3182-3195. [PMID: 33452722 DOI: 10.1002/mrm.28651] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2020] [Revised: 10/29/2020] [Accepted: 11/26/2020] [Indexed: 01/21/2023]
Abstract
PURPOSE To provide 3D real-time MRI of speech production with improved spatio-temporal sharpness using randomized, variable-density, stack-of-spiral sampling combined with a 3D spatio-temporally constrained reconstruction. METHODS We evaluated five candidate (k, t) sampling strategies using a previously proposed gradient-echo stack-of-spiral sequence and a 3D constrained reconstruction with spatial and temporal penalties. Regularization parameters were chosen by expert readers based on qualitative assessment. We experimentally determined the effect of spiral angle increment and kz temporal order. The strategy yielding highest image quality was chosen as the proposed method. We evaluated the proposed and original 3D real-time MRI methods in 2 healthy subjects performing speech production tasks that invoke rapid movements of articulators seen in multiple planes, using interleaved 2D real-time MRI as the reference. We quantitatively evaluated tongue boundary sharpness in three locations at two speech rates. RESULTS The proposed data-sampling scheme uses a golden-angle spiral increment in the kx -ky plane and variable-density, randomized encoding along kz . It provided a statistically significant improvement in tongue boundary sharpness score (P < .001) in the blade, body, and root of the tongue during normal and 1.5-times speeded speech. Qualitative improvements were substantial during natural speech tasks of alternating high, low tongue postures during vowels. The proposed method was also able to capture complex tongue shapes during fast alveolar consonant segments. Furthermore, the proposed scheme allows flexible retrospective selection of temporal resolution. CONCLUSION We have demonstrated improved 3D real-time MRI of speech production using randomized, variable-density, stack-of-spiral sampling with a 3D spatio-temporally constrained reconstruction.
Collapse
Affiliation(s)
- Ziwei Zhao
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Yongwan Lim
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Dani Byrd
- Department of Linguistics, Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, CA, USA
| | - Shrikanth Narayanan
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA.,Department of Linguistics, Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, CA, USA
| | - Krishna S Nayak
- Ming Hsieh Department of Electrical and Computer Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
10
|
Ruthven M, Freitas AC, Boubertakh R, Miquel ME. Application of radial GRAPPA techniques to single- and multislice dynamic speech MRI using a 16-channel neurovascular coil. Magn Reson Med 2019; 82:948-958. [PMID: 31016802 DOI: 10.1002/mrm.27779] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2019] [Revised: 03/07/2019] [Accepted: 03/29/2019] [Indexed: 12/11/2022]
Abstract
PURPOSE To investigate: (1) the feasibility of using through-time radial GeneRalized Autocalibrating Partially Parallel Acquisitions (rGRAPPA) and hybrid radial GRAPPA (h-rGRAPPA) in single- and multislice dynamic speech MRI; (2) whether single-slice dynamic speech MRI at a rate of 15 frames per second (fps) or higher and with adequate image quality can be achieved using these radial GRAPPA techniques. METHODS Seven healthy adult volunteers were imaged at 3T using a 16-channel neurovascular coil and 2 spoiled gradient echo sequences (radial trajectory, field of view = 192 × 192 mm2 , acquired pixel size = 2.4 × 2.4 mm2 ). One sequence imaged a single slice at 16.8 fps, the other imaged 2 interleaved slices at 7.8 fps per slice. Image sets were reconstructed using rGRAPPA and h-rGRAPPA, and their image quality was compared using the root mean square error, structural similarity index, and visual assessments. RESULTS Image quality deteriorated when fewer than 170 calibration frames were used in the rGRAPPA reconstruction. rGRAPPA image sets demonstrated: (1) in 97% of cases, a similar image quality to h-rGRAPPA image sets reconstructed using a k-space segment size of 4, (2) in 98% of cases, a better image quality than h-rGRAPPA image sets reconstructed using a k-space segment size of 32. CONCLUSION This study confirmed: (1) the feasibility of using rGRAPPA and h-rGRAPPA in single- and multislice dynamic speech MRI, (2) that single-slice speech imaging at a frame rate higher than 15 fps and with adequate image quality can be achieved using these radial GRAPPA techniques.
Collapse
Affiliation(s)
- Matthieu Ruthven
- Clinical Physics, Barts Health NHS Trust, St Bartholomew's Hospital, London, United Kingdom
| | - Andreia C Freitas
- William Harvey Research Institute, Queen Mary University of London, Charterhouse Square, London, United Kingdom.,ISR-Lisboa/LARSyS and Department of Bioengineering, Instituto Superior Técnico - Universidade de Lisboa, Lisbon, Portugal
| | - Redha Boubertakh
- Clinical Physics, Barts Health NHS Trust, St Bartholomew's Hospital, London, United Kingdom.,William Harvey Research Institute, Queen Mary University of London, Charterhouse Square, London, United Kingdom
| | - Marc E Miquel
- Clinical Physics, Barts Health NHS Trust, St Bartholomew's Hospital, London, United Kingdom.,William Harvey Research Institute, Queen Mary University of London, Charterhouse Square, London, United Kingdom
| |
Collapse
|
11
|
Krohn S, Joseph AA, Voit D, Michaelis T, Merboldt KD, Buergers R, Frahm J. Multi-slice real-time MRI of temporomandibular joint dynamics. Dentomaxillofac Radiol 2019; 48:20180162. [PMID: 30028188 PMCID: PMC6398907 DOI: 10.1259/dmfr.20180162] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2018] [Revised: 05/29/2018] [Accepted: 06/28/2018] [Indexed: 12/12/2022] Open
Abstract
OBJECTIVES The purpose of this work was to improve the clinical versatility of high-speed real-time MRI studies of temporomandibular joint (TMJ) dynamics by simultaneous recordings of multiple MRI movies in different sections. METHODS Real-time MRI at 3 T was realized using highly undersampled radial FLASH acquisitions and image reconstruction by regularized nonlinear inversion (NLINV). Multi-slice real-time MRI of two, three or four slices at 0.75 mm resolution and 6 to 8 mm thickness was accomplished at 50.0 ms, 33.3 ms or 25.5 ms temporal resolution, respectively, yielding simultaneous movies at 2 × 10, 3 × 10 or 4 × 10 frames per second in a frame-interleaved acquisition mode. Real-time MRI movies were evaluated by three blinded raters for visibility of the anterior and posterior border of disc, shape of the disk body and condyle head as well as movement of the disc and condyle (1 = excellent, 5 = no visibility). RESULTS Effective delineation of the disk atop the mandibular condyle was achieved by T1-weighted images with opposed-phase water-fat contrast. Compared to 8 mm sections, multi-slice recordings with 6 mm thickness provided sharper delineation of relevant structures as confirmed by inter-rater evaluation. Respective dual-slice and triple-slice recordings of a single TMJ as well as dual-slice recordings of both joints (one slice per TMJ) received the highest visibility ratings of ≤ 2 corresponding to high confidence in diagnostic content. CONCLUSIONS The improved access to TMJ dynamics by multi-slice real-time MRI will contribute to more effective treatment of temporomandibular disorders.
Collapse
Affiliation(s)
- Sebastian Krohn
- Department of Prosthodontics, University Medical Center, Göttingen, Germany
| | - Arun A Joseph
- Biomedizinische NMR, MPI für biophysikalische Chemie, Göttingen, Germany
- DZHK, German Center for Cardiovascular Research, Göttingen, Germany
| | - Dirk Voit
- Biomedizinische NMR, MPI für biophysikalische Chemie, Göttingen, Germany
| | - Thomas Michaelis
- Biomedizinische NMR, MPI für biophysikalische Chemie, Göttingen, Germany
| | | | - Ralf Buergers
- Department of Prosthodontics, University Medical Center, Göttingen, Germany
| | - Jens Frahm
- Biomedizinische NMR, MPI für biophysikalische Chemie, Göttingen, Germany
- DZHK, German Center for Cardiovascular Research, Göttingen, Germany
| |
Collapse
|
12
|
Kim YC. Fast upper airway magnetic resonance imaging for assessment of speech production and sleep apnea. PRECISION AND FUTURE MEDICINE 2018. [DOI: 10.23838/pfm.2018.00100] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
|
13
|
Lim Y, Zhu Y, Lingala SG, Byrd D, Narayanan S, Nayak KS. 3D dynamic MRI of the vocal tract during natural speech. Magn Reson Med 2018; 81:1511-1520. [PMID: 30390319 DOI: 10.1002/mrm.27570] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2018] [Revised: 09/25/2018] [Accepted: 09/26/2018] [Indexed: 12/19/2022]
Abstract
PURPOSE To develop and evaluate a technique for 3D dynamic MRI of the full vocal tract at high temporal resolution during natural speech. METHODS We demonstrate 2.4 × 2.4 × 5.8 mm3 spatial resolution, 61-ms temporal resolution, and a 200 × 200 × 70 mm3 FOV. The proposed method uses 3D gradient-echo imaging with a custom upper-airway coil, a minimum-phase slab excitation, stack-of-spirals readout, pseudo golden-angle view order in kx -ky , linear Cartesian order along kz , and spatiotemporal finite difference constrained reconstruction, with 13-fold acceleration. This technique is evaluated using in vivo vocal tract airway data from 2 healthy subjects acquired at 1.5T scanner, 1 with synchronized audio, with 2 tasks during production of natural speech, and via comparison with interleaved multislice 2D dynamic MRI. RESULTS This technique captured known dynamics of vocal tract articulators during natural speech tasks including tongue gestures during the production of consonants "s" and "l" and of consonant-vowel syllables, and was additionally consistent with 2D dynamic MRI. Coordination of lingual (tongue) movements for consonants is demonstrated via volume-of-interest analysis. Vocal tract area function dynamics revealed critical lingual constriction events along the length of the vocal tract for consonants and vowels. CONCLUSION We demonstrate feasibility of 3D dynamic MRI of the full vocal tract, with spatiotemporal resolution adequate to visualize lingual movements for consonants and vocal tact shaping during natural productions of consonant-vowel syllables, without requiring multiple repetitions.
Collapse
Affiliation(s)
- Yongwan Lim
- Ming Hsieh Department of Electrical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California
| | - Yinghua Zhu
- Ming Hsieh Department of Electrical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California
| | - Sajan Goud Lingala
- Department of Biomedical Engineering, College of Engineering, University of Iowa, Iowa City, Iowa
| | - Dani Byrd
- Department of Linguistics, Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, California
| | - Shrikanth Narayanan
- Ming Hsieh Department of Electrical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California
| | - Krishna Shrinivas Nayak
- Ming Hsieh Department of Electrical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California
| |
Collapse
|
14
|
Ramanarayanan V, Tilsen S, Proctor M, Töger J, Goldstein L, Nayak KS, Narayanan S. Analysis of speech production real-time MRI. COMPUT SPEECH LANG 2018. [DOI: 10.1016/j.csl.2018.04.002] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
15
|
Dietz B, Fallone BG, Wachowicz K. Nomenclature for real‐time magnetic resonance imaging. Magn Reson Med 2018; 81:1483-1484. [DOI: 10.1002/mrm.27487] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2018] [Revised: 07/11/2018] [Accepted: 07/18/2018] [Indexed: 12/15/2022]
Affiliation(s)
- Bryson Dietz
- Division of Medical Physics, Department of Oncology University of Alberta, Cross Cancer Institute Edmonton Canada
| | - B. Gino Fallone
- Department of Medical Physics Cross Cancer Institute Edmonton Canada
- Departments of Oncology and Physics University of Alberta Edmonton Canada
| | - Keith Wachowicz
- Department of Medical Physics Cross Cancer Institute Edmonton Canada
- Departments of Oncology and Physics University of Alberta Edmonton Canada
| |
Collapse
|
16
|
Feng X, Blemker SS, Inouye J, Pelland CM, Zhao L, Meyer CH. Assessment of velopharyngeal function with dual-planar high-resolution real-time spiral dynamic MRI. Magn Reson Med 2018; 80:1467-1474. [PMID: 29508458 DOI: 10.1002/mrm.27139] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2017] [Revised: 01/25/2018] [Accepted: 01/25/2018] [Indexed: 02/05/2023]
Abstract
PURPOSE To develop a real-time dynamic MRI method for comprehensive evaluation of velum movement during speech. METHODS Dynamic MRI has been used to study velopharyngeal insufficiency (VPI) by imaging the movement of the velum during speech, because it can provide good anatomic details with no exposed radiation. To be able to comprehensively evaluate dynamic velum movement, a real-time spiral non-balanced SSFP sequence was developed with simultaneous dual-planar coverage and improved spatial and temporal resolution using a combination of parallel imaging and spatial and temporal compressed sensing to achieve 6 × acceleration. New off-resonance correction and post-processing methods were also developed to reduce blurring and slice crosstalk. RESULTS The method demonstrated good image quality for visualizing dynamic velum movement with reduced blurring and improved image homogeneity. Spatial resolution of 1.2*1.2 mm2 with 150 mm FOV and temporal resolution of 20 frames-per-second with simultaneous dual-planar coverage was achieved. CONCLUSIONS This work describes a new technique for studying speech disorders using dual-planar accelerated spiral dynamic MRI.
Collapse
Affiliation(s)
- Xue Feng
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, USA
| | - Silvia S Blemker
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, USA.,Department of Mechanical and Aerospace Engineering, University of Virginia, Charlottesville, Virginia, USA
| | - Josh Inouye
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, USA
| | - Catherine M Pelland
- Department of Mechanical and Aerospace Engineering, University of Virginia, Charlottesville, Virginia, USA
| | - Li Zhao
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, USA
| | - Craig H Meyer
- Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, USA.,Department of Radiology, University of Virginia, Charlottesville, Virginia, USA
| |
Collapse
|
17
|
Töger J, Sorensen T, Somandepalli K, Toutios A, Lingala SG, Narayanan S, Nayak K. Test-retest repeatability of human speech biomarkers from static and real-time dynamic magnetic resonance imaging. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:3323. [PMID: 28599561 PMCID: PMC5436977 DOI: 10.1121/1.4983081] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Static anatomical and real-time dynamic magnetic resonance imaging (RT-MRI) of the upper airway is a valuable method for studying speech production in research and clinical settings. The test-retest repeatability of quantitative imaging biomarkers is an important parameter, since it limits the effect sizes and intragroup differences that can be studied. Therefore, this study aims to present a framework for determining the test-retest repeatability of quantitative speech biomarkers from static MRI and RT-MRI, and apply the framework to healthy volunteers. Subjects (n = 8, 4 females, 4 males) are imaged in two scans on the same day, including static images and dynamic RT-MRI of speech tasks. The inter-study agreement is quantified using intraclass correlation coefficient (ICC) and mean within-subject standard deviation (σe). Inter-study agreement is strong to very strong for static measures (ICC: min/median/max 0.71/0.89/0.98, σe: 0.90/2.20/6.72 mm), poor to strong for dynamic RT-MRI measures of articulator motion range (ICC: 0.26/0.75/0.90, σe: 1.6/2.5/3.6 mm), and poor to very strong for velocities (ICC: 0.21/0.56/0.93, σe: 2.2/4.4/16.7 cm/s). In conclusion, this study characterizes repeatability of static and dynamic MRI-derived speech biomarkers using state-of-the-art imaging. The introduced framework can be used to guide future development of speech biomarkers. Test-retest MRI data are provided free for research use.
Collapse
Affiliation(s)
- Johannes Töger
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| | - Tanner Sorensen
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| | - Krishna Somandepalli
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| | - Asterios Toutios
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| | - Sajan Goud Lingala
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| | - Shrikanth Narayanan
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| | - Krishna Nayak
- Ming Hsieh Department of Electrical Engineering, University of Southern California, 3740 McClintock Avenue, EEB 400, Los Angeles, California 90089-2560, USA
| |
Collapse
|
18
|
Fu M, Barlaz MS, Holtrop JL, Perry JL, Kuehn DP, Shosted RK, Liang ZP, Sutton BP. High-frame-rate full-vocal-tract 3D dynamic speech imaging. Magn Reson Med 2016; 77:1619-1629. [PMID: 27099178 DOI: 10.1002/mrm.26248] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2015] [Revised: 03/25/2016] [Accepted: 03/28/2016] [Indexed: 11/08/2022]
Abstract
PURPOSE To achieve high temporal frame rate, high spatial resolution and full-vocal-tract coverage for three-dimensional dynamic speech MRI by using low-rank modeling and sparse sampling. METHODS Three-dimensional dynamic speech MRI is enabled by integrating a novel data acquisition strategy and an image reconstruction method with the partial separability model: (a) a self-navigated sparse sampling strategy that accelerates data acquisition by collecting high-nominal-frame-rate cone navigator sand imaging data within a single repetition time, and (b) are construction method that recovers high-quality speech dynamics from sparse (k,t)-space data by enforcing joint low-rank and spatiotemporal total variation constraints. RESULTS The proposed method has been evaluated through in vivo experiments. A nominal temporal frame rate of 166 frames per second (defined based on a repetition time of 5.99 ms) was achieved for an imaging volume covering the entire vocal tract with a spatial resolution of 2.2 × 2.2 × 5.0 mm3 . Practical utility of the proposed method was demonstrated via both validation experiments and a phonetics investigation. CONCLUSION Three-dimensional dynamic speech imaging is possible with full-vocal-tract coverage, high spatial resolution and high nominal frame rate to provide dynamic speech data useful for phonetic studies. Magn Reson Med 77:1619-1629, 2017. © 2016 International Society for Magnetic Resonance in Medicine.
Collapse
Affiliation(s)
- Maojing Fu
- Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA.,Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Marissa S Barlaz
- Linguistics, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Joseph L Holtrop
- Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA.,Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Jamie L Perry
- Communication Sciences and Disorders, East Carolina University, Greenville, North Carolina, USA
| | - David P Kuehn
- Speech and Hearing Science, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Ryan K Shosted
- Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA.,Linguistics, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Zhi-Pei Liang
- Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA.,Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Bradley P Sutton
- Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA.,Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| |
Collapse
|
19
|
Toutios A, Narayanan SS. Advances in real-time magnetic resonance imaging of the vocal tract for speech science and technology research. APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING 2016; 5:e6. [PMID: 27833745 PMCID: PMC5100697 DOI: 10.1017/atsip.2016.5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Real-time magnetic resonance imaging (rtMRI) of the moving vocal tract during running speech production is an important emerging tool for speech production research providing dynamic information of a speaker's upper airway from the entire mid-sagittal plane or any other scan plane of interest. There have been several advances in the development of speech rtMRI and corresponding analysis tools, and their application to domains such as phonetics and phonological theory, articulatory modeling, and speaker characterization. An important recent development has been the open release of a database that includes speech rtMRI data from five male and five female speakers of American English each producing 460 phonetically balanced sentences. The purpose of the present paper is to give an overview and outlook of the advances in rtMRI as a tool for speech research and technology development.
Collapse
Affiliation(s)
- Asterios Toutios
- Signal Analysis and Interpretation Laboratory (SAIL), University of Southern California (USC), 3740 McClintock Avenue, Los Angeles, CA 90089, USA
| | - Shrikanth S Narayanan
- Signal Analysis and Interpretation Laboratory (SAIL), University of Southern California (USC), 3740 McClintock Avenue, Los Angeles, CA 90089, USA
| |
Collapse
|
20
|
Lingala SG, Zhu Y, Kim YC, Toutios A, Narayanan S, Nayak KS. A fast and flexible MRI system for the study of dynamic vocal tract shaping. Magn Reson Med 2016; 77:112-125. [PMID: 26778178 DOI: 10.1002/mrm.26090] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2015] [Revised: 11/06/2015] [Accepted: 11/24/2015] [Indexed: 11/07/2022]
Abstract
PURPOSE The aim of this work was to develop and evaluate an MRI-based system for study of dynamic vocal tract shaping during speech production, which provides high spatial and temporal resolution. METHODS The proposed system utilizes (a) custom eight-channel upper airway coils that have high sensitivity to upper airway regions of interest, (b) two-dimensional golden angle spiral gradient echo acquisition, (c) on-the-fly view-sharing reconstruction, and (d) off-line temporal finite difference constrained reconstruction. The system also provides simultaneous noise-cancelled and temporally aligned audio. The system is evaluated in 3 healthy volunteers, and 1 tongue cancer patient, with a broad range of speech tasks. RESULTS We report spatiotemporal resolutions of 2.4 × 2.4 mm2 every 12 ms for single-slice imaging, and 2.4 × 2.4 mm2 every 36 ms for three-slice imaging, which reflects roughly 7-fold acceleration over Nyquist sampling. This system demonstrates improved temporal fidelity in capturing rapid vocal tract shaping for tasks, such as producing consonant clusters in speech, and beat-boxing sounds. Novel acoustic-articulatory analysis was also demonstrated. CONCLUSION A synergistic combination of custom coils, spiral acquisitions, and constrained reconstruction enables visualization of rapid speech with high spatiotemporal resolution in multiple planes. Magn Reson Med 77:112-125, 2017. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Sajan Goud Lingala
- Electrical Engineering, University of Southern California, Los Angeles, CA
| | - Yinghua Zhu
- Electrical Engineering, University of Southern California, Los Angeles, CA
| | | | - Asterios Toutios
- Electrical Engineering, University of Southern California, Los Angeles, CA
| | | | - Krishna S Nayak
- Electrical Engineering, University of Southern California, Los Angeles, CA
| |
Collapse
|
21
|
Lingala SG, Sutton BP, Miquel ME, Nayak KS. Recommendations for real-time speech MRI. J Magn Reson Imaging 2016; 43:28-44. [PMID: 26174802 PMCID: PMC5079859 DOI: 10.1002/jmri.24997] [Citation(s) in RCA: 68] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2015] [Accepted: 06/23/2015] [Indexed: 11/11/2022] Open
Abstract
Real-time magnetic resonance imaging (RT-MRI) is being increasingly used for speech and vocal production research studies. Several imaging protocols have emerged based on advances in RT-MRI acquisition, reconstruction, and audio-processing methods. This review summarizes the state-of-the-art, discusses technical considerations, and provides specific guidance for new groups entering this field. We provide recommendations for performing RT-MRI of the upper airway. This is a consensus statement stemming from the ISMRM-endorsed Speech MRI summit held in Los Angeles, February 2014. A major unmet need identified at the summit was the need for consensus on protocols that can be easily adapted by researchers equipped with conventional MRI systems. To this end, we provide a discussion of tradeoffs in RT-MRI in terms of acquisition requirements, a priori assumptions, artifacts, computational load, and performance for different speech tasks. We provide four recommended protocols and identify appropriate acquisition and reconstruction tools. We list pointers to open-source software that facilitate implementation. We conclude by discussing current open challenges in the methodological aspects of RT-MRI of speech.
Collapse
Affiliation(s)
| | - Brad P. Sutton
- University of Illinois at Urbana-Champaign, Urbana-Champaign, Illinois, USA
| | | | | |
Collapse
|
22
|
Nunthayanon K, Honda EI, Shimazaki K, Ohmori H, Inoue-Arai MS, Kurabayashi T, Ono T. Use of an advanced 3-T MRI movie to investigate articulation. Oral Surg Oral Med Oral Pathol Oral Radiol 2015; 119:684-94. [PMID: 25956219 DOI: 10.1016/j.oooo.2015.03.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2014] [Revised: 02/20/2015] [Accepted: 03/09/2015] [Indexed: 10/23/2022]
Abstract
OBJECTIVE To develop a magnetic resonance imaging (MRI) movie to reveal the dynamic movement of articulators and teeth. STUDY DESIGN Five healthy females with normal occlusion participated in this study. Various concentrations of MRI contrast media (ferric ammonium citrate [FAC]) were tested for visualization of teeth, according to facial markers and with the use of a gel. Custom-made circuitry was connected to synchronize pronunciation of fricative sounds (/asa/) with scans. Three gradient echo sequences (True fast imaging with steady state precession [true FISP], FISP, and fast low angle shot [FLASH]) with a segmented cine were tested with the use of repetition times (TRs) of 9 ms and 31.5 ms. The MRI movie images were superimposed over the boundaries of teeth. The images produced during pronunciation, using the two different TRs (9 ms and 31 ms), were compared to assess the position of the lips and the tongue. RESULTS Images obtained using the FLASH sequence, with a TR of 9 ms or 31.5 ms, can be used for diagnostic purposes. A TR of 9 ms, with 161 continuous images acquired, produced the highest-quality images of teeth, with few artifacts present. Pronunciation of the consonant "s" was clearly discernable. CONCLUSIONS Our 3-T MRI movie system, with a temporal resolution less than 9 ms, can provide detailed information pertaining to variations in speech or oropharyngeal function.
Collapse
Affiliation(s)
- Kulthida Nunthayanon
- Graduate student, Graduate School, Orthodontic Science, Tokyo Medical and Dental University, 1-5-45, Yushima, Bunkyo-ku, Tokyo, 113-8549, Japan; Lecturer, Faculty of Dentistry, Naresuan University, Phitsanulok, 65000, Thailand.
| | - Ei-ichi Honda
- Professor, Graduate School, Oral and Maxillofacial Radiology, University of Tokushima, 3-18-15, Kuramoto-cho, Tokushima, 770-8504, Japan; Lecturer, Graduate School, Oral and Maxillofacial Radiology, Tokyo Medical and Dental University, 1-5-45, Yushima, Bunkyo-ku, Tokyo, 113-8549, Japan
| | - Kazuo Shimazaki
- Assistant professor, Graduate School, Orthodontic Science, Tokyo Medical and Dental University, 1-5-45, Yushima, Bunkyo-ku, Tokyo, 113-8549, Japan
| | - Hiroko Ohmori
- Staff, Graduate School, Orthodontic Science, Tokyo Medical and Dental University, 1-5-45, Yushima, Bunkyo-ku, Tokyo, 113-8549, Japan
| | - Maristela Sayuri Inoue-Arai
- Lecturer, Maxillofacial Orthognathics, Graduate School, Tokyo Medical and Dental University, 1-5-45, Yushima, Bunkyo-ku, Tokyo, 113-8549, Japan
| | - Tohru Kurabayashi
- Professor, Graduate School, Oral and Maxillofacial Radiology, Tokyo Medical and Dental University, 1-5-45, Yushima, Bunkyo-ku, Tokyo, 113-8549, Japan
| | - Takashi Ono
- Professor, Graduate School, Orthodontic Science, Tokyo Medical and Dental University, 1-5-45, Yushima, Bunkyo-ku, 113-8549, Tokyo, Japan
| |
Collapse
|
23
|
Fu M, Zhao B, Carignan C, Shosted RK, Perry JL, Kuehn DP, Liang ZP, Sutton BP. High-resolution dynamic speech imaging with joint low-rank and sparsity constraints. Magn Reson Med 2015; 73:1820-32. [PMID: 24912452 PMCID: PMC4261062 DOI: 10.1002/mrm.25302] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Revised: 04/11/2014] [Accepted: 05/05/2014] [Indexed: 11/11/2022]
Abstract
PURPOSE To enable dynamic speech imaging with high spatiotemporal resolution and full-vocal-tract spatial coverage, leveraging recent advances in sparse sampling. METHODS An imaging method is developed to enable high-speed dynamic speech imaging exploiting low-rank and sparsity of the dynamic images of articulatory motion during speech. The proposed method includes: (a) a novel data acquisition strategy that collects spiral navigators with high temporal frame rate and (b) an image reconstruction method that derives temporal subspaces from navigators and reconstructs high-resolution images from sparsely sampled data with joint low-rank and sparsity constraints. RESULTS The proposed method has been systematically evaluated and validated through several dynamic speech experiments. A nominal imaging speed of 102 frames per second (fps) was achieved for a single-slice imaging protocol with a spatial resolution of 2.2 × 2.2 × 6.5 mm(3) . An eight-slice imaging protocol covering the entire vocal tract achieved a nominal imaging speed of 12.8 fps with the identical spatial resolution. The effectiveness of the proposed method and its practical utility was also demonstrated in a phonetic investigation. CONCLUSION High spatiotemporal resolution with full-vocal-tract spatial coverage can be achieved for dynamic speech imaging experiments with low-rank and sparsity constraints.
Collapse
Affiliation(s)
- Maojing Fu
- Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois
- Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Bo Zhao
- Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois
- Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | | | - Ryan K. Shosted
- Department of Linguistics, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Jamie L. Perry
- Department of Communication Sciences and Disorders, East Carolina University, Greenville, North Carolina
| | - David P. Kuehn
- Department of Speech and Hearing Science, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Zhi-Pei Liang
- Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois
- Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois
| | - Bradley P. Sutton
- Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois
| |
Collapse
|
24
|
Whole heart coronary imaging with flexible acquisition window and trigger delay. PLoS One 2015; 10:e0112020. [PMID: 25719750 PMCID: PMC4342264 DOI: 10.1371/journal.pone.0112020] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2014] [Accepted: 08/27/2014] [Indexed: 11/18/2022] Open
Abstract
Coronary magnetic resonance imaging (MRI) requires a correctly timed trigger delay derived from a scout cine scan to synchronize k-space acquisition with the quiescent period of the cardiac cycle. However, heart rate changes between breath-held cine and free-breathing coronary imaging may result in inaccurate timing errors. Additionally, the determined trigger delay may not reflect the period of minimal motion for both left and right coronary arteries or different segments. In this work, we present a whole-heart coronary imaging approach that allows flexible selection of the trigger delay timings by performing k-space sampling over an enlarged acquisition window. Our approach addresses coronary motion in an interactive manner by allowing the operator to determine the temporal window with minimal cardiac motion for each artery region. An electrocardiogram-gated, k-space segmented 3D radial stack-of-stars sequence that employs a custom rotation angle is developed. An interactive reconstruction and visualization platform is then employed to determine the subset of the enlarged acquisition window for minimal coronary motion. Coronary MRI was acquired on eight healthy subjects (5 male, mean age = 37 ± 18 years), where an enlarged acquisition window of 166–220 ms was set 50 ms prior to the scout-derived trigger delay. Coronary visualization and sharpness scores were compared between the standard 120 ms window set at the trigger delay, and those reconstructed using a manually adjusted window. The proposed method using manual adjustment was able to recover delineation of five mid and distal right coronary artery regions that were otherwise not visible from the standard window, and the sharpness scores improved in all coronary regions using the proposed method. This paper demonstrates the feasibility of a whole-heart coronary imaging approach that allows interactive selection of any subset of the enlarged acquisition window for a tailored reconstruction for each branch region.
Collapse
|
25
|
Scott AD, Wylezinska M, Birch MJ, Miquel ME. Speech MRI: morphology and function. Phys Med 2014; 30:604-18. [PMID: 24880679 DOI: 10.1016/j.ejmp.2014.05.001] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/03/2014] [Revised: 04/24/2014] [Accepted: 05/01/2014] [Indexed: 11/27/2022] Open
Abstract
Magnetic Resonance Imaging (MRI) plays an increasing role in the study of speech. This article reviews the MRI literature of anatomical imaging, imaging for acoustic modelling and dynamic imaging. It describes existing imaging techniques attempting to meet the challenges of imaging the upper airway during speech and examines the remaining hurdles and future research directions.
Collapse
Affiliation(s)
- Andrew D Scott
- Clinical Physics, Barts Health NHS Trust, London EC1A 7BE, United Kingdom; NIHR Cardiovascular Biomedical Research Unit, The Royal Brompton Hospital, Sydney Street, London SW3 6NP, United Kingdom
| | - Marzena Wylezinska
- Clinical Physics, Barts Health NHS Trust, London EC1A 7BE, United Kingdom; Barts and The London NIHR CVBRU, London Chest Hospital, London E2 9JX, United Kingdom
| | - Malcolm J Birch
- Clinical Physics, Barts Health NHS Trust, London EC1A 7BE, United Kingdom
| | - Marc E Miquel
- Clinical Physics, Barts Health NHS Trust, London EC1A 7BE, United Kingdom; Barts and The London NIHR CVBRU, London Chest Hospital, London E2 9JX, United Kingdom.
| |
Collapse
|
26
|
Proctor M, Bresch E, Byrd D, Nayak K, Narayanan S. Paralinguistic mechanisms of production in human "beatboxing": a real-time magnetic resonance imaging study. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 133:1043-54. [PMID: 23363120 PMCID: PMC3574116 DOI: 10.1121/1.4773865] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Real-time magnetic resonance imaging (rtMRI) was used to examine mechanisms of sound production by an American male beatbox artist. rtMRI was found to be a useful modality with which to study this form of sound production, providing a global dynamic view of the midsagittal vocal tract at frame rates sufficient to observe the movement and coordination of critical articulators. The subject's repertoire included percussion elements generated using a wide range of articulatory and airstream mechanisms. Many of the same mechanisms observed in human speech production were exploited for musical effect, including patterns of articulation that do not occur in the phonologies of the artist's native languages: ejectives and clicks. The data offer insights into the paralinguistic use of phonetic primitives and the ways in which they are coordinated in this style of musical performance. A unified formalism for describing both musical and phonetic dimensions of human vocal percussion performance is proposed. Audio and video data illustrating production and orchestration of beatboxing sound effects are provided in a companion annotated corpus.
Collapse
Affiliation(s)
- Michael Proctor
- Viterbi School of Engineering, University of Southern California, 3740 McClintock Avenue, Los Angeles, California 90089-2564, USA.
| | | | | | | | | |
Collapse
|