1
|
Hong Y, Ryun S, Chung CK. Evoking artificial speech perception through invasive brain stimulation for brain-computer interfaces: current challenges and future perspectives. Front Neurosci 2024; 18:1428256. [PMID: 38988764 PMCID: PMC11234843 DOI: 10.3389/fnins.2024.1428256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Accepted: 06/10/2024] [Indexed: 07/12/2024] Open
Abstract
Encoding artificial perceptions through brain stimulation, especially that of higher cognitive functions such as speech perception, is one of the most formidable challenges in brain-computer interfaces (BCI). Brain stimulation has been used for functional mapping in clinical practices for the last 70 years to treat various disorders affecting the nervous system, including epilepsy, Parkinson's disease, essential tremors, and dystonia. Recently, direct electrical stimulation has been used to evoke various forms of perception in humans, ranging from sensorimotor, auditory, and visual to speech cognition. Successfully evoking and fine-tuning artificial perceptions could revolutionize communication for individuals with speech disorders and significantly enhance the capabilities of brain-computer interface technologies. However, despite the extensive literature on encoding various perceptions and the rising popularity of speech BCIs, inducing artificial speech perception is still largely unexplored, and its potential has yet to be determined. In this paper, we examine the various stimulation techniques used to evoke complex percepts and the target brain areas for the input of speech-like information. Finally, we discuss strategies to address the challenges of speech encoding and discuss the prospects of these approaches.
Collapse
Affiliation(s)
- Yirye Hong
- Department of Brain and Cognitive Sciences, College of Natural Sciences, Seoul National University, Seoul, Republic of Korea
| | - Seokyun Ryun
- Neuroscience Research Institute, Seoul National University Medical Research Center, Seoul, Republic of Korea
| | - Chun Kee Chung
- Neuroscience Research Institute, Seoul National University Medical Research Center, Seoul, Republic of Korea
| |
Collapse
|
2
|
Wu J, Nie S, Li C, Wang X, Peng Y, Shang J, Diao L, Ding H, Si Q, Wang S, Tong R, Li Y, Sun L, Zhang J. Sound-localization-related activation and functional connectivity of dorsal auditory pathway in relation to demographic, cognitive, and behavioral characteristics in age-related hearing loss. Front Neurosci 2024; 18:1353413. [PMID: 38562303 PMCID: PMC10982313 DOI: 10.3389/fnins.2024.1353413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Accepted: 02/26/2024] [Indexed: 04/04/2024] Open
Abstract
Background Patients with age-related hearing loss (ARHL) often struggle with tracking and locating sound sources, but the neural signature associated with these impairments remains unclear. Materials and methods Using a passive listening task with stimuli from five different horizontal directions in functional magnetic resonance imaging, we defined functional regions of interest (ROIs) of the auditory "where" pathway based on the data of previous literatures and young normal hearing listeners (n = 20). Then, we investigated associations of the demographic, cognitive, and behavioral features of sound localization with task-based activation and connectivity of the ROIs in ARHL patients (n = 22). Results We found that the increased high-level region activation, such as the premotor cortex and inferior parietal lobule, was associated with increased localization accuracy and cognitive function. Moreover, increased connectivity between the left planum temporale and left superior frontal gyrus was associated with increased localization accuracy in ARHL. Increased connectivity between right primary auditory cortex and right middle temporal gyrus, right premotor cortex and left anterior cingulate cortex, and right planum temporale and left lingual gyrus in ARHL was associated with decreased localization accuracy. Among the ARHL patients, the task-dependent brain activation and connectivity of certain ROIs were associated with education, hearing loss duration, and cognitive function. Conclusion Consistent with the sensory deprivation hypothesis, in ARHL, sound source identification, which requires advanced processing in the high-level cortex, is impaired, whereas the right-left discrimination, which relies on the primary sensory cortex, is compensated with a tendency to recruit more resources concerning cognition and attention to the auditory sensory cortex. Overall, this study expanded our understanding of the neural mechanisms contributing to sound localization deficits associated with ARHL and may serve as a potential imaging biomarker for investigating and predicting anomalous sound localization.
Collapse
Affiliation(s)
- Junzhi Wu
- Department of Otorhinolaryngology Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | - Shuai Nie
- Department of Otorhinolaryngology Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | - Chunlin Li
- School of Biomedical Engineering, Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Xing Wang
- Department of Otorhinolaryngology Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | - Ye Peng
- Department of Otorhinolaryngology Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | - Jiaqi Shang
- Center of Clinical Hearing, Shandong Second Provincial General Hospital, Jinan, Shandong, China
| | - Linan Diao
- Department of Otorhinolaryngology Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | - Hongping Ding
- College of Special Education, Binzhou Medical University, Yantai, Shandong, China
| | - Qian Si
- School of Cyber Science and Technology, Beihang University, Beijing, China
| | - Songjian Wang
- Key Laboratory of Otolaryngology, Head and Neck Surgery, Ministry of Education, Beijing Institute of Otolaryngology, Beijing, China
- Department of Otolaryngology, Head and Neck Surgery, Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Renjie Tong
- School of Biomedical Engineering, Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Yutang Li
- School of Biomedical Engineering, Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Liwei Sun
- School of Biomedical Engineering, Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Juan Zhang
- Department of Otorhinolaryngology Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| |
Collapse
|
3
|
Nourski KV, Steinschneider M, Rhone AE, Berger JI, Dappen ER, Kawasaki H, Howard III MA. Intracranial electrophysiology of spectrally degraded speech in the human cortex. Front Hum Neurosci 2024; 17:1334742. [PMID: 38318272 PMCID: PMC10839784 DOI: 10.3389/fnhum.2023.1334742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 12/28/2023] [Indexed: 02/07/2024] Open
Abstract
Introduction Cochlear implants (CIs) are the treatment of choice for severe to profound hearing loss. Variability in CI outcomes remains despite advances in technology and is attributed in part to differences in cortical processing. Studying these differences in CI users is technically challenging. Spectrally degraded stimuli presented to normal-hearing individuals approximate input to the central auditory system in CI users. This study used intracranial electroencephalography (iEEG) to investigate cortical processing of spectrally degraded speech. Methods Participants were adult neurosurgical epilepsy patients. Stimuli were utterances /aba/ and /ada/, spectrally degraded using a noise vocoder (1-4 bands) or presented without vocoding. The stimuli were presented in a two-alternative forced choice task. Cortical activity was recorded using depth and subdural iEEG electrodes. Electrode coverage included auditory core in posteromedial Heschl's gyrus (HGPM), superior temporal gyrus (STG), ventral and dorsal auditory-related areas, and prefrontal and sensorimotor cortex. Analysis focused on high gamma (70-150 Hz) power augmentation and alpha (8-14 Hz) suppression. Results Chance task performance occurred with 1-2 spectral bands and was near-ceiling for clear stimuli. Performance was variable with 3-4 bands, permitting identification of good and poor performers. There was no relationship between task performance and participants demographic, audiometric, neuropsychological, or clinical profiles. Several response patterns were identified based on magnitude and differences between stimulus conditions. HGPM responded strongly to all stimuli. A preference for clear speech emerged within non-core auditory cortex. Good performers typically had strong responses to all stimuli along the dorsal stream, including posterior STG, supramarginal, and precentral gyrus; a minority of sites in STG and supramarginal gyrus had a preference for vocoded stimuli. In poor performers, responses were typically restricted to clear speech. Alpha suppression was more pronounced in good performers. In contrast, poor performers exhibited a greater involvement of posterior middle temporal gyrus when listening to clear speech. Discussion Responses to noise-vocoded speech provide insights into potential factors underlying CI outcome variability. The results emphasize differences in the balance of neural processing along the dorsal and ventral stream between good and poor performers, identify specific cortical regions that may have diagnostic and prognostic utility, and suggest potential targets for neuromodulation-based CI rehabilitation strategies.
Collapse
Affiliation(s)
- Kirill V. Nourski
- Department of Neurosurgery, The University of Iowa, Iowa City, IA, United States
- Iowa Neuroscience Institute, The University of Iowa, Iowa City, IA, United States
| | - Mitchell Steinschneider
- Department of Neurosurgery, The University of Iowa, Iowa City, IA, United States
- Departments of Neurology and Neuroscience, Albert Einstein College of Medicine, Bronx, NY, United States
| | - Ariane E. Rhone
- Department of Neurosurgery, The University of Iowa, Iowa City, IA, United States
| | - Joel I. Berger
- Department of Neurosurgery, The University of Iowa, Iowa City, IA, United States
| | - Emily R. Dappen
- Department of Neurosurgery, The University of Iowa, Iowa City, IA, United States
- Iowa Neuroscience Institute, The University of Iowa, Iowa City, IA, United States
| | - Hiroto Kawasaki
- Department of Neurosurgery, The University of Iowa, Iowa City, IA, United States
| | - Matthew A. Howard III
- Department of Neurosurgery, The University of Iowa, Iowa City, IA, United States
- Iowa Neuroscience Institute, The University of Iowa, Iowa City, IA, United States
- Pappajohn Biomedical Institute, The University of Iowa, Iowa City, IA, United States
| |
Collapse
|
4
|
Han JH, Kim JH, Park GK, Lee HJ. Preserved Gray Matter Volume in the Left Superior Temporal Gyrus Underpins Speech-in-Noise Processing in Middle-Aged Adults. J Int Adv Otol 2024; 20:62-68. [PMID: 38454291 PMCID: PMC10895841 DOI: 10.5152/iao.2024.231241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Accepted: 09/05/2023] [Indexed: 03/09/2024] Open
Abstract
BACKGROUND Neuroanatomical evidence suggests that behavioral speech-in-noise (SiN) perception and the underlying cortical structural network are altered by aging, and these aging-induced changes could be initiated during middle age. However, the mechanism behind the relationship between auditory performance and neural substrates of speech perception in middle-aged individuals remains unclear. In this study, we measured the structural volumes of selected neuroanatomical regions involved in speech and hearing processing to establish their association with speech perception ability in middle-aged adults. METHODS Sentence perception in quiet and noisy conditions was behaviorally measured in 2 different age groups: young (20-39 years old) and middle-aged (40-59-year-old) adults. Anatomical magnetic resonance images were taken to assess the gray matter volume of specific parcellated brain areas associated with speech perception. The relationships between these and behavioral auditory performance with age were determined. RESULTS The middle-aged adults showed poorer speech perception in both quiet and noisy conditions than the young adults. Neuroanatomical data revealed that the normalized gray matter volume in the left superior temporal gyrus, which is closely related to acoustic and phonological processing, is associated with behavioral SiN perception in the middle-aged group. In addition, the normalized gray matter volumes in multiple cortical areas seem to decrease with age. CONCLUSION The results indicate that SiN perception in middle-aged adults is closely related to the brain region responsible for lower-level speech processing, which involves the detection and phonemic representation of speech. Nonetheless, the higher-order cortex may also contribute to age-induced changes in auditory performance.
Collapse
Affiliation(s)
- Ji-Hye Han
- Laboratory of Brain and Cognitive Sciences for Convergence Medicine, Hallym University, College of Medicine, Anyang, Republic of Korea
- Ear and Interaction Center, Doheun Institute for Digital Innovation in Medicine (D.I.D.I.M.), Hallym University Medical Center, Anyang, South Korea
| | - Ja-Hee Kim
- Department of Otorhinolaryngology-Head and Neck Surgery, Hallym University, College of Medicine, Chuncheon, Republic of Korea
| | - Gin-Kyeong Park
- Laboratory of Brain and Cognitive Sciences for Convergence Medicine, Hallym University, College of Medicine, Anyang, Republic of Korea
| | - Hyo-Jeong Lee
- Laboratory of Brain and Cognitive Sciences for Convergence Medicine, Hallym University, College of Medicine, Anyang, Republic of Korea
- Ear and Interaction Center, Doheun Institute for Digital Innovation in Medicine (D.I.D.I.M.), Hallym University Medical Center, Anyang, South Korea
- Department of Otorhinolaryngology-Head and Neck Surgery, Hallym University, College of Medicine, Chuncheon, Republic of Korea
| |
Collapse
|
5
|
Busby N, Hillis AE, Bunker L, Rorden C, Newman-Norlund R, Bonilha L, Meier E, Goldberg E, Hickok G, Yourganov G, Fridriksson J. Comparing the brain-behaviour relationship in acute and chronic stroke aphasia. Brain Commun 2023; 5:fcad014. [PMID: 37056476 PMCID: PMC10088484 DOI: 10.1093/braincomms/fcad014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 07/23/2022] [Accepted: 03/27/2023] [Indexed: 03/30/2023] Open
Abstract
In stroke aphasia, lesion volume is typically associated with aphasia severity. Although this relationship is likely present throughout recovery, different factors may affect lesion volume and behaviour early into recovery (acute) and in the later stages of recovery (chronic). Therefore, studies typically separate patients into two groups (acute/chronic), and this is often accompanied with arguments for and against using data from acute stroke patients over chronic. However, no comprehensive studies have provided strong evidence of whether the lesion-behaviour relationship early in recovery is comparable to later in the recovery trajectory. To that end, we investigated two aims: (i) whether lesion data from acute and chronic patients yield similar results in region-based lesion-symptom mapping analyses and (ii) if models based on one timepoint accurately predict the other. Lesions and aphasia severity scores from acute (N = 63) and chronic (N = 109) stroke survivors with aphasia were entered into separate univariate region-based lesion-symptom mapping analyses. A support vector regression model was trained on lesion data from either the acute or chronic data set to give an estimate of aphasia severity. Four model-based analyses were conducted: trained on acute/chronic using leave-one-out, tested on left-out behaviour or trained on acute/chronic to predict the other timepoint. Region-based lesion-symptom mapping analyses identified similar but not identical regions in both timepoints. All four models revealed positive correlations between actual and predicted Western Aphasia Battery-Revised aphasia-quotient scores. Lesion-to-behaviour predictions were almost equivalent when comparing within versus across stroke stage, despite differing lesion size/locations and distributions of aphasia severity between stroke timepoints. This suggests that research investigating the brain-behaviour relationship including subsets of patients from only one timepoint may also be applicable at other timepoints, although it is important to note that these comparable findings may only be seen using broad measures such as aphasia severity, rather than those aimed at identifying more specific deficits. Subtle differences found between timepoints may also be useful in understanding the nature of lesion volume and aphasia severity over time. Stronger correlations found when predicting acute behaviour (e.g. predicting acute: r = 0.6888, P < 0.001, predicting chronic r = 0.5014, P < 0.001) suggest that the acute lesion/perfusion patterns more accurately capture the critical changes in underlying vascular territories. Differences in critical brain regions between timepoints may shed light on recovery patterns. Future studies could focus on a longitudinal design to compare acute and chronic patients in a more controlled manner.
Collapse
Affiliation(s)
- Natalie Busby
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, SC 29209, USA
| | - Argye E Hillis
- Department of Neurology, Johns Hopkins School of Medicine, Baltimore, MA 21287, USA
- Department of Cognitive Science, Johns Hopkins University, Baltimore, MA 21218, USA
- Department of Physical Medicine and Rehabilitation, Johns Hopkins University, Baltimore, MA 21287, USA
| | - Lisa Bunker
- Department of Neurology, Johns Hopkins School of Medicine, Baltimore, MA 21287, USA
| | - Chis Rorden
- Department of Psychology, University of South Carolina, Columbia, SC 29208, USA
| | - Roger Newman-Norlund
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, SC 29209, USA
| | - Leo Bonilha
- Department of Neurology, Emory University, Atlanta, GA 30322, USA
| | - Erin Meier
- Department of Neurology, Johns Hopkins School of Medicine, Baltimore, MA 21287, USA
- Department of Communication Sciences and Disorders, Northeastern University, Boston, MA 02115, USA
| | - Emily Goldberg
- Department of Neurology, Johns Hopkins School of Medicine, Baltimore, MA 21287, USA
- Department of Communication Disorders, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Gregory Hickok
- Department of Cognitive Sciences, University of California, Irvine, CA 92697, USA
- Department of Language Science, University of California, Irvine, CA 92697, USA
| | - Grigori Yourganov
- Advanced Computing and Data Science, Cyberinfrastructure and Technology Integration, Clemson University, Clemson, SC 29634, USA
| | - Julius Fridriksson
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, SC 29209, USA
| |
Collapse
|
6
|
Rimmele JM, Sun Y, Michalareas G, Ghitza O, Poeppel D. Dynamics of Functional Networks for Syllable and Word-Level Processing. NEUROBIOLOGY OF LANGUAGE (CAMBRIDGE, MASS.) 2023; 4:120-144. [PMID: 37229144 PMCID: PMC10205074 DOI: 10.1162/nol_a_00089] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/18/2021] [Accepted: 11/07/2022] [Indexed: 05/27/2023]
Abstract
Speech comprehension requires the ability to temporally segment the acoustic input for higher-level linguistic analysis. Oscillation-based approaches suggest that low-frequency auditory cortex oscillations track syllable-sized acoustic information and therefore emphasize the relevance of syllabic-level acoustic processing for speech segmentation. How syllabic processing interacts with higher levels of speech processing, beyond segmentation, including the anatomical and neurophysiological characteristics of the networks involved, is debated. In two MEG experiments, we investigate lexical and sublexical word-level processing and the interactions with (acoustic) syllable processing using a frequency-tagging paradigm. Participants listened to disyllabic words presented at a rate of 4 syllables/s. Lexical content (native language), sublexical syllable-to-syllable transitions (foreign language), or mere syllabic information (pseudo-words) were presented. Two conjectures were evaluated: (i) syllable-to-syllable transitions contribute to word-level processing; and (ii) processing of words activates brain areas that interact with acoustic syllable processing. We show that syllable-to-syllable transition information compared to mere syllable information, activated a bilateral superior, middle temporal and inferior frontal network. Lexical content resulted, additionally, in increased neural activity. Evidence for an interaction of word- and acoustic syllable-level processing was inconclusive. Decreases in syllable tracking (cerebroacoustic coherence) in auditory cortex and increases in cross-frequency coupling between right superior and middle temporal and frontal areas were found when lexical content was present compared to all other conditions; however, not when conditions were compared separately. The data provide experimental insight into how subtle and sensitive syllable-to-syllable transition information for word-level processing is.
Collapse
Affiliation(s)
- Johanna M. Rimmele
- Departments of Neuroscience and Cognitive Neuropsychology, Max-Planck-Institute for Empirical Aesthetics, Frankfurt am Main, Germany
- Max Planck NYU Center for Language, Music and Emotion, Frankfurt am Main, Germany; New York, NY, USA
| | - Yue Sun
- Departments of Neuroscience and Cognitive Neuropsychology, Max-Planck-Institute for Empirical Aesthetics, Frankfurt am Main, Germany
| | - Georgios Michalareas
- Departments of Neuroscience and Cognitive Neuropsychology, Max-Planck-Institute for Empirical Aesthetics, Frankfurt am Main, Germany
| | - Oded Ghitza
- Departments of Neuroscience and Cognitive Neuropsychology, Max-Planck-Institute for Empirical Aesthetics, Frankfurt am Main, Germany
- College of Biomedical Engineering & Hearing Research Center, Boston University, Boston, MA, USA
| | - David Poeppel
- Departments of Neuroscience and Cognitive Neuropsychology, Max-Planck-Institute for Empirical Aesthetics, Frankfurt am Main, Germany
- Department of Psychology and Center for Neural Science, New York University, New York, NY, USA
- Max Planck NYU Center for Language, Music and Emotion, Frankfurt am Main, Germany; New York, NY, USA
- Ernst Strüngmann Institute for Neuroscience, Frankfurt am Main, Germany
| |
Collapse
|
7
|
Farahbod H, Rogalsky C, Keator LM, Cai J, Pillay SB, Turner K, LaCroix A, Fridriksson J, Binder JR, Middlebrooks JC, Hickok G, Saberi K. Informational Masking in Aging and Brain-lesioned Individuals. J Assoc Res Otolaryngol 2023; 24:67-79. [PMID: 36471207 PMCID: PMC9971540 DOI: 10.1007/s10162-022-00877-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2022] [Accepted: 11/01/2022] [Indexed: 12/12/2022] Open
Abstract
Auditory stream segregation and informational masking were investigated in brain-lesioned individuals, age-matched controls with no neurological disease, and young college-age students. A psychophysical paradigm known as rhythmic masking release (RMR) was used to examine the ability of participants to identify a change in the rhythmic sequence of 20-ms Gaussian noise bursts presented through headphones and filtered through generalized head-related transfer functions to produce the percept of an externalized auditory image (i.e., a 3D virtual reality sound). The target rhythm was temporally interleaved with a masker sequence comprising similar noise bursts in a manner that resulted in a uniform sequence with no information remaining about the target rhythm when the target and masker were presented from the same location (an impossible task). Spatially separating the target and masker sequences allowed participants to determine if there was a change in the target rhythm midway during its presentation. RMR thresholds were defined as the minimum spatial separation between target and masker sequences that resulted in 70.7% correct-performance level in a single-interval 2-alternative forced-choice adaptive tracking procedure. The main findings were (1) significantly higher RMR thresholds for individuals with brain lesions (especially those with damage to parietal areas) and (2) a left-right spatial asymmetry in performance for lesion (but not control) participants. These findings contribute to a better understanding of spatiotemporal relations in informational masking and the neural bases of auditory scene analysis.
Collapse
Affiliation(s)
- Haleh Farahbod
- grid.266093.80000 0001 0668 7243Department of Cognitive Sciences, University of California, Irvine, USA
| | - Corianne Rogalsky
- grid.215654.10000 0001 2151 2636College of Health Solutions, Arizona State University, Tempe, USA
| | - Lynsey M. Keator
- grid.254567.70000 0000 9075 106XDepartment of Communication Sciences and Disorders, University of South Carolina, Columbia, USA
| | - Julia Cai
- grid.215654.10000 0001 2151 2636College of Health Solutions, Arizona State University, Tempe, USA
| | - Sara B. Pillay
- grid.30760.320000 0001 2111 8460Department of Neurology, Medical College of Wisconsin, Milwaukee, USA
| | - Katie Turner
- grid.266093.80000 0001 0668 7243Department of Cognitive Sciences, University of California, Irvine, USA
| | - Arianna LaCroix
- grid.260024.20000 0004 0627 4571College of Health Sciences, Midwestern University, Glendale, USA
| | - Julius Fridriksson
- grid.254567.70000 0000 9075 106XDepartment of Communication Sciences and Disorders, University of South Carolina, Columbia, USA
| | - Jeffrey R. Binder
- grid.30760.320000 0001 2111 8460Department of Neurology, Medical College of Wisconsin, Milwaukee, USA
| | - John C. Middlebrooks
- grid.266093.80000 0001 0668 7243Department of Cognitive Sciences, University of California, Irvine, USA ,grid.266093.80000 0001 0668 7243Department of Otolaryngology, University of California, Irvine, USA ,grid.266093.80000 0001 0668 7243Department of Language Science, University of California, Irvine, USA
| | - Gregory Hickok
- grid.266093.80000 0001 0668 7243Department of Cognitive Sciences, University of California, Irvine, USA ,grid.266093.80000 0001 0668 7243Department of Language Science, University of California, Irvine, USA
| | - Kourosh Saberi
- Department of Cognitive Sciences, University of California, Irvine, USA.
| |
Collapse
|
8
|
Franken MK, Liu BC, Ostry DJ. Towards a somatosensory theory of speech perception. J Neurophysiol 2022; 128:1683-1695. [PMID: 36416451 PMCID: PMC9762980 DOI: 10.1152/jn.00381.2022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Revised: 11/19/2022] [Accepted: 11/19/2022] [Indexed: 11/24/2022] Open
Abstract
Speech perception is known to be a multimodal process, relying not only on auditory input but also on the visual system and possibly on the motor system as well. To date there has been little work on the potential involvement of the somatosensory system in speech perception. In the present review, we identify the somatosensory system as another contributor to speech perception. First, we argue that evidence in favor of a motor contribution to speech perception can just as easily be interpreted as showing somatosensory involvement. Second, physiological and neuroanatomical evidence for auditory-somatosensory interactions across the auditory hierarchy indicates the availability of a neural infrastructure that supports somatosensory involvement in auditory processing in general. Third, there is accumulating evidence for somatosensory involvement in the context of speech specifically. In particular, tactile stimulation modifies speech perception, and speech auditory input elicits activity in somatosensory cortical areas. Moreover, speech sounds can be decoded from activity in somatosensory cortex; lesions to this region affect perception, and vowels can be identified based on somatic input alone. We suggest that the somatosensory involvement in speech perception derives from the somatosensory-auditory pairing that occurs during speech production and learning. By bringing together findings from a set of studies that have not been previously linked, the present article identifies the somatosensory system as a presently unrecognized contributor to speech perception.
Collapse
Affiliation(s)
| | | | - David J Ostry
- McGill University, Montreal, Quebec, Canada
- Haskins Laboratories, New Haven, Connecticut
| |
Collapse
|
9
|
Matchin W, den Ouden DB, Hickok G, Hillis AE, Bonilha L, Fridriksson J. The Wernicke conundrum revisited: evidence from connectome-based lesion-symptom mapping. Brain 2022; 145:3916-3930. [PMID: 35727949 DOI: 10.1093/brain/awac219] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Revised: 05/25/2022] [Accepted: 06/06/2022] [Indexed: 11/13/2022] Open
Abstract
Wernicke's area has been assumed since the 1800s to be the primary region supporting word and sentence comprehension. However, in 2015 and 2019, Mesulam and colleagues raised what they termed the 'Wernicke conundrum', noting widespread variability in the anatomical definition of this area and presenting data from primary progressive aphasia that challenged this classical assumption. To resolve the conundrum, they posited a 'double disconnection' hypothesis: that word and sentence comprehension deficits in stroke-based aphasia result from disconnection of anterior temporal and inferior frontal regions from other parts of the brain due to white matter damage, rather than dysfunction of Wernicke's area itself. To test this hypothesis, we performed lesion-deficit correlations, including connectome-based lesion-symptom mapping, in four large, partially overlapping groups of English-speaking chronic left hemisphere stroke survivors. After removing variance due to object recognition and associative semantic processing, the same middle and posterior temporal lobe regions were implicated in both word comprehension deficits and complex noncanonical sentence comprehension deficits. Connectome lesion-symptom mapping revealed similar temporal-occipital white matter disconnections for impaired word and noncanonical sentence comprehension, including the temporal pole. We found an additional significant temporal-parietal disconnection for noncanonical sentence comprehension deficits, which may indicate a role for phonological working memory in processing complex syntax, but no significant frontal disconnections. Moreover, damage to these middle-posterior temporal lobe regions was associated with both word and noncanonical sentence comprehension deficits even when accounting for variance due to the strongest anterior temporal and inferior frontal white matter disconnections, respectively. Our results largely agree with the classical notion that Wernicke's area, defined here as middle superior temporal gyrus and middle-posterior superior temporal sulcus, supports both word and sentence comprehension, suggest a supporting role for temporal pole in both word and sentence comprehension, and speak against the hypothesis that comprehension deficits in Wernicke's aphasia result from double disconnection.
Collapse
Affiliation(s)
- William Matchin
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, SC 29208, USA
| | - Dirk Bart den Ouden
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, SC 29208, USA
| | - Gregory Hickok
- Department of Cognitive Sciences, University of California, Irvine, Irvine, CA 92697, USA.,Department of Language Science, University of California, Irvine, Irvine, CA 92697, USA
| | - Argye E Hillis
- Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD 21218, USA.,Department of Physical Medicine and Rehabilitation, Johns Hopkins University School of Medicine, Baltimore, MD 21218, USA.,Department of Cognitive Science, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Leonardo Bonilha
- Department of Neurology, Emory University School of Medicine, Atlanta, GA 30322, USA
| | - Julius Fridriksson
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, SC 29208, USA
| |
Collapse
|