1
Wyse-Sookoo K, Luo S, Candrea D, Schippers A, Tippett DC, Wester B, Fifer M, Vansteensel MJ, Ramsey NF, Crone NE. Stability of ECoG high gamma signals during speech and implications for a speech BCI system in an individual with ALS: a year-long longitudinal study. J Neural Eng 2024; 21. [PMID: 38925110] [PMCID: PMC11245360] [DOI: 10.1088/1741-2552/ad5c02]
Abstract
Objective. Speech brain-computer interfaces (BCIs) have the potential to augment communication in individuals with impaired speech due to muscle weakness, for example in amyotrophic lateral sclerosis (ALS) and other neurological disorders. However, to achieve long-term, reliable use of a speech BCI, it is essential for speech-related neural signal changes to be stable over long periods of time. Here we study, for the first time, the stability of speech-related electrocorticographic (ECoG) signals recorded from a chronically implanted ECoG BCI over a 12-month period. Approach. ECoG signals were recorded by an ECoG array implanted over the ventral sensorimotor cortex in a clinical trial participant with ALS. Because ECoG-based speech decoding has most often relied on broadband high gamma (HG) signal changes relative to baseline (non-speech) conditions, we studied longitudinal changes of HG band power at baseline and during speech, and we compared these with residual high-frequency noise levels at baseline. Stability was further assessed by longitudinal measurements of signal-to-noise ratio, activation ratio, and peak speech-related HG response magnitude (HG response peaks). Lastly, we analyzed the stability of the event-related HG power changes (HG responses) for individual syllables at each electrode. Main results. We found that speech-related ECoG signal responses were stable over a range of syllables activating different articulators for the first year after implantation. Significance. Together, our results indicate that ECoG can be a stable recording modality for long-term speech BCI systems for those living with severe paralysis. Clinical trial information. ClinicalTrials.gov, registration number NCT03567213.
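The baseline-versus-speech high-gamma measures this study tracks (band power and a signal-to-noise ratio in dB) can be sketched on synthetic data. The sampling rate, the 70-110 Hz band edges, and both signals below are illustrative assumptions, not the study's actual recording parameters:

```python
import numpy as np

def band_power(x, fs, lo=70.0, hi=110.0):
    """Average power of x in the [lo, hi] Hz band via an FFT periodogram."""
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    psd = np.abs(np.fft.rfft(x)) ** 2 / len(x)
    mask = (freqs >= lo) & (freqs <= hi)
    return psd[mask].mean()

rng = np.random.default_rng(0)
fs, n = 1000, 4000
t = np.arange(n) / fs
baseline = rng.normal(scale=1.0, size=n)
# "Speech" trial: the same noise floor plus an added 90 Hz high-gamma component.
speech = rng.normal(scale=1.0, size=n) + 1.5 * np.sin(2 * np.pi * 90 * t)

hg_base = band_power(baseline, fs)
hg_speech = band_power(speech, fs)
snr_db = 10 * np.log10(hg_speech / hg_base)
print(f"speech-vs-baseline high-gamma SNR: {snr_db:.1f} dB")
```

Longitudinal stability could then be assessed by repeating this measurement per session and per electrode and testing for drift.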
Affiliation(s)
- Kimberley Wyse-Sookoo: Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Shiyu Luo: Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Daniel Candrea: Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Anouck Schippers: Department of Neurology and Neurosurgery, Brain Center, University Medical Center Utrecht, Utrecht, The Netherlands
- Donna C Tippett: Departments of Neurology; Otolaryngology-Head and Neck Surgery; and Physical Medicine and Rehabilitation, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Brock Wester: Research and Exploratory Development Department, Johns Hopkins University Applied Physics Laboratory, Laurel, MD, USA
- Matthew Fifer: Research and Exploratory Development Department, Johns Hopkins University Applied Physics Laboratory, Laurel, MD, USA
- Mariska J Vansteensel: Department of Neurology and Neurosurgery, Brain Center, University Medical Center Utrecht, Utrecht, The Netherlands
- Nick F Ramsey: Department of Neurology and Neurosurgery, Brain Center, University Medical Center Utrecht, Utrecht, The Netherlands
- Nathan E Crone: Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
2
Silva AB, Littlejohn KT, Liu JR, Moses DA, Chang EF. The speech neuroprosthesis. Nat Rev Neurosci 2024; 25:473-492. [PMID: 38745103] [DOI: 10.1038/s41583-024-00819-9]
Abstract
Loss of speech after paralysis is devastating, but circumventing motor-pathway injury by directly decoding speech from intact cortical activity has the potential to restore natural communication and self-expression. Recent discoveries have defined how key features of speech production are facilitated by the coordinated activity of vocal-tract articulatory and motor-planning cortical representations. In this Review, we highlight such progress and how it has led to successful speech decoding, first in individuals implanted with intracranial electrodes for clinical epilepsy monitoring and subsequently in individuals with paralysis as part of early feasibility clinical trials to restore speech. We discuss high-spatiotemporal-resolution neural interfaces and the adaptation of state-of-the-art speech computational algorithms that have driven rapid and substantial progress in decoding neural activity into text, audible speech, and facial movements. Although restoring natural speech is a long-term goal, speech neuroprostheses already have performance levels that surpass communication rates offered by current assistive-communication technology. Given this accelerated rate of progress in the field, we propose key evaluation metrics for speed and accuracy, among others, to help standardize across studies. We finish by highlighting several directions to more fully explore the multidimensional feature space of speech and language, which will continue to accelerate progress towards a clinically viable speech neuroprosthesis.
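As one concrete instance of the standardized accuracy metrics this review calls for, text-decoding studies commonly report word error rate (WER). A minimal implementation using a word-level Levenshtein distance is sketched below; the example sentences are invented:

```python
def word_error_rate(ref: str, hyp: str) -> float:
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    r, h = ref.split(), hyp.split()
    # Dynamic-programming edit-distance table over words.
    d = [[0] * (len(h) + 1) for _ in range(len(r) + 1)]
    for i in range(len(r) + 1):
        d[i][0] = i
    for j in range(len(h) + 1):
        d[0][j] = j
    for i in range(1, len(r) + 1):
        for j in range(1, len(h) + 1):
            cost = 0 if r[i - 1] == h[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,        # deletion
                          d[i][j - 1] + 1,        # insertion
                          d[i - 1][j - 1] + cost) # substitution/match
    return d[-1][-1] / len(r)

print(word_error_rate("i want some water please", "i want water please"))
```

Pairing WER with a speed measure such as words per minute gives the two axes the review proposes for cross-study comparison.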
Affiliation(s)
- Alexander B Silva: Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA, USA; Weill Institute for Neuroscience, University of California, San Francisco, San Francisco, CA, USA
- Kaylo T Littlejohn: Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA, USA; Weill Institute for Neuroscience, University of California, San Francisco, San Francisco, CA, USA; Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA, USA
- Jessie R Liu: Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA, USA; Weill Institute for Neuroscience, University of California, San Francisco, San Francisco, CA, USA
- David A Moses: Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA, USA; Weill Institute for Neuroscience, University of California, San Francisco, San Francisco, CA, USA
- Edward F Chang: Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA, USA; Weill Institute for Neuroscience, University of California, San Francisco, San Francisco, CA, USA
3
Wu X, Wellington S, Fu Z, Zhang D. Speech decoding from stereo-electroencephalography (sEEG) signals using advanced deep learning methods. J Neural Eng 2024; 21:036055. [PMID: 38885688] [DOI: 10.1088/1741-2552/ad593a]
Abstract
Objective. Brain-computer interfaces (BCIs) are technologies that bypass damaged or disrupted neural pathways and directly decode brain signals to perform intended actions. BCIs for speech have the potential to restore communication by decoding intended speech directly. Many studies have demonstrated promising results using invasive micro-electrode arrays and electrocorticography, but the use of stereo-electroencephalography (sEEG) for speech decoding has received comparatively little attention. Approach. In this research, recently released sEEG data were used to decode Dutch words spoken by participants with epilepsy. We decoded speech waveforms from sEEG data using advanced deep-learning methods. Three methods were implemented: linear regression, a recurrent neural network (RNN)-based sequence-to-sequence model, and a transformer model. Main results. Our RNN and transformer models significantly outperformed linear regression, while no significant difference was found between the two deep-learning methods. Further investigation of individual electrodes showed that the same decoding result can be obtained using only a few of the electrodes. Significance. This study demonstrated that decoding speech from sEEG signals is possible and that the location of the electrodes is critical to decoding performance.
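The linear-regression baseline and the observation that a few electrodes can carry most of the decodable information can be illustrated with a least-squares sketch on simulated data. The electrode count, noise level, number of informative electrodes, and weights below are all arbitrary assumptions:

```python
import numpy as np

rng = np.random.default_rng(4)
n_t, n_elec = 2000, 64
X = rng.normal(size=(n_t, n_elec))            # simulated sEEG features (time x electrodes)
w_true = np.zeros(n_elec)
w_true[:5] = [1.0, -1.5, 0.8, 1.2, -0.7]      # only 5 electrodes are informative
y = X @ w_true + 0.5 * rng.normal(size=n_t)   # target speech-audio feature

# Fit the full linear decoder, then refit using only the top-weighted electrodes.
w, *_ = np.linalg.lstsq(X, y, rcond=None)
top = np.argsort(-np.abs(w))[:5]
w_top, *_ = np.linalg.lstsq(X[:, top], y, rcond=None)

r_full = np.corrcoef(X @ w, y)[0, 1]
r_top = np.corrcoef(X[:, top] @ w_top, y)[0, 1]
print(f"r(full) = {r_full:.2f}, r(top-5) = {r_top:.2f}")
```

When the informative signal is concentrated in a few channels, the top-5 decoder matches the full decoder almost exactly, mirroring the paper's per-electrode finding.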
Affiliation(s)
- Xiaolong Wu: Department of Electronic and Electrical Engineering, University of Bath, Bath, United Kingdom
- Scott Wellington: Department of Electronic and Electrical Engineering, University of Bath, Bath, United Kingdom
- Zhichun Fu: Department of Electronic and Electrical Engineering, University of Bath, Bath, United Kingdom
- Dingguo Zhang: Department of Electronic and Electrical Engineering, University of Bath, Bath, United Kingdom
4
van der Heijden K, Patel P, Bickel S, Herrero JL, Mehta AD, Mesgarani N. Joint population coding and temporal coherence link an attended talker's voice and location features in naturalistic multi-talker scenes. bioRxiv 2024:2024.05.13.593814. [PMID: 38798551] [PMCID: PMC11118436] [DOI: 10.1101/2024.05.13.593814]
Abstract
Listeners readily extract multi-dimensional auditory objects such as a 'localized talker' from complex acoustic scenes with multiple talkers. Yet, the neural mechanisms underlying simultaneous encoding and linking of different sound features, for example a talker's voice and location, are poorly understood. We analyzed invasive intracranial recordings in neurosurgical patients attending to a localized talker in real-life cocktail party scenarios. We found that sensitivity to an individual talker's voice and location features was distributed throughout auditory cortex and that neural sites exhibited a gradient from sensitivity to a single feature to joint sensitivity to both features. On a population level, cortical response patterns of both dual-feature sensitive sites and single-feature sensitive sites revealed simultaneous encoding of an attended talker's voice and location features. However, for single-feature sensitive sites, the representation of the primary feature was more precise. Further, sites which selectively tracked an attended speech stream concurrently encoded the attended talker's voice and location features, indicating that such sites combine selective tracking of an attended auditory object with encoding of the object's features. Finally, we found that attending to a localized talker selectively enhanced temporal coherence between single-feature voice-sensitive sites and single-feature location-sensitive sites, providing an additional mechanism for linking voice and location in multi-talker scenes. These results demonstrate that a talker's voice and location features are linked during multi-dimensional object formation in naturalistic multi-talker scenes by joint population coding as well as by temporal coherence between neural sites.
Significance statement. Listeners effortlessly extract auditory objects from complex acoustic scenes consisting of multiple sound sources in naturalistic, spatial sound scenes. Yet, how the brain links different sound features to form a multi-dimensional auditory object is poorly understood. We investigated how neural responses encode and integrate an attended talker's voice and location features in spatial multi-talker sound scenes to elucidate which neural mechanisms underlie simultaneous encoding and linking of different auditory features. Our results show that joint population coding as well as temporal coherence mechanisms contribute to distributed multi-dimensional auditory object encoding. These findings shed new light on cortical functional specialization and multi-dimensional auditory object formation in complex, naturalistic listening scenes.
Highlights
- Cortical responses to a single talker exhibit a distributed gradient, ranging from sites that are sensitive to both a talker's voice and location (dual-feature sensitive sites) to sites that are sensitive to either voice or location (single-feature sensitive sites).
- Population response patterns of dual-feature sensitive sites encode voice and location features of the attended talker in multi-talker scenes jointly and with equal precision.
- Despite their sensitivity to a single feature at the level of individual cortical sites, population response patterns of single-feature sensitive sites also encode location and voice features of a talker jointly, but with higher precision for the feature they are primarily sensitive to.
- Neural sites which selectively track an attended speech stream concurrently encode the attended talker's voice and location features.
- Attention selectively enhances temporal coherence between voice-selective and location-selective sites over time.
- Joint population coding as well as temporal coherence mechanisms underlie distributed multi-dimensional auditory object encoding in auditory cortex.
5
Guerreiro Fernandes F, Raemaekers M, Freudenburg Z, Ramsey N. Considerations for implanting speech brain computer interfaces based on functional magnetic resonance imaging. J Neural Eng 2024; 21:036005. [PMID: 38648782] [DOI: 10.1088/1741-2552/ad4178]
Abstract
Objective. Brain-computer interfaces (BCIs) have the potential to reinstate lost communication faculties. Results from speech decoding studies indicate that a usable speech BCI based on activity in the sensorimotor cortex (SMC) can be achieved using subdurally implanted electrodes. However, the optimal characteristics for a successful speech implant are largely unknown. We address this topic in a high-field blood-oxygenation-level-dependent functional magnetic resonance imaging (fMRI) study by assessing the decodability of spoken words as a function of hemisphere, gyrus, sulcal depth, and position along the ventral/dorsal axis. Approach. Twelve subjects conducted a 7T fMRI experiment in which they pronounced 6 different pseudo-words over 6 runs. We divided the SMC by hemisphere, gyrus, sulcal depth, and position along the ventral/dorsal axis. Classification was performed in these SMC areas using a multiclass support vector machine (SVM). Main results. Significant classification was possible from the SMC, but no preference for the left or right hemisphere, nor for the precentral or postcentral gyrus, was detected for optimal word classification. Classification using information from the cortical surface was slightly better than using information from deep in the central sulcus, and was highest within the ventral 50% of the SMC. Confusion matrices were highly similar across the entire SMC. An SVM searchlight analysis revealed significant classification in the superior temporal gyrus and left planum temporale in addition to the SMC. Significance. The current results support a unilateral implant using surface electrodes covering the ventral 50% of the SMC. The added value of depth electrodes is unclear. We did not observe evidence for variations in the qualitative nature of information across the SMC. The current results need to be confirmed in paralyzed patients performing attempted speech.
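The multiclass word-classification setup described above can be sketched on synthetic "voxel" patterns. A nearest-centroid classifier stands in for the study's SVM here, and the trial counts, voxel counts, and noise level are invented:

```python
import numpy as np

rng = np.random.default_rng(1)
n_words, n_trials, n_voxels = 6, 30, 50
# Hypothetical per-word activation prototypes plus per-trial noise.
prototypes = rng.normal(size=(n_words, n_voxels))
X = np.concatenate([p + 0.8 * rng.normal(size=(n_trials, n_voxels)) for p in prototypes])
y = np.repeat(np.arange(n_words), n_trials)

# Split: even trials train, odd trials test.
idx = np.arange(len(y))
train, test = idx % 2 == 0, idx % 2 == 1
centroids = np.stack([X[train & (y == w)].mean(axis=0) for w in range(n_words)])

# Assign each test trial to the nearest word centroid.
dists = np.linalg.norm(X[test, None, :] - centroids[None, :, :], axis=2)
acc = (dists.argmin(axis=1) == y[test]).mean()
print(f"accuracy: {acc:.2f} (chance = {1 / n_words:.2f})")
```

In the actual study, repeating this region by region (per hemisphere, gyrus, and depth bin) is what allows the spatial comparisons reported above.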
Affiliation(s)
- F Guerreiro Fernandes: Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
- M Raemaekers: Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
- Z Freudenburg: Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
- N Ramsey: Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
6
Angrick M, Luo S, Rabbani Q, Candrea DN, Shah S, Milsap GW, Anderson WS, Gordon CR, Rosenblatt KR, Clawson L, Tippett DC, Maragakis N, Tenore FV, Fifer MS, Hermansky H, Ramsey NF, Crone NE. Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS. Sci Rep 2024; 14:9617. [PMID: 38671062] [PMCID: PMC11053081] [DOI: 10.1038/s41598-024-60277-2]
Abstract
Brain-computer interfaces (BCIs) that reconstruct and synthesize speech using brain activity recorded with intracranial electrodes may pave the way toward novel communication interfaces for people who have lost their ability to speak, or who are at high risk of losing this ability, due to neurological disorders. Here, we report online synthesis of intelligible words using a chronically implanted brain-computer interface (BCI) in a man with impaired articulation due to ALS, participating in a clinical trial (ClinicalTrials.gov, NCT03567213) exploring different strategies for BCI communication. The 3-stage approach reported here relies on recurrent neural networks to identify, decode and synthesize speech from electrocorticographic (ECoG) signals acquired across motor, premotor and somatosensory cortices. We demonstrate a reliable BCI that synthesizes commands freely chosen and spoken by the participant from a vocabulary of 6 keywords previously used for decoding commands to control a communication board. Evaluation of the intelligibility of the synthesized speech indicates that 80% of the words can be correctly recognized by human listeners. Our results show that a speech-impaired individual with ALS can use a chronically implanted BCI to reliably produce synthesized words while preserving the participant's voice profile, and provide further evidence for the stability of ECoG for speech-based BCIs.
Affiliation(s)
- Miguel Angrick: Department of Neurology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Shiyu Luo: Department of Biomedical Engineering, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Qinwan Rabbani: Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA
- Daniel N Candrea: Department of Biomedical Engineering, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Samyak Shah: Department of Neurology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Griffin W Milsap: Research and Exploratory Development Department, Johns Hopkins Applied Physics Laboratory, Laurel, MD, USA
- William S Anderson: Department of Neurosurgery, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Chad R Gordon: Department of Neurosurgery, The Johns Hopkins University School of Medicine, Baltimore, MD, USA; Section of Neuroplastic and Reconstructive Surgery, Department of Plastic Surgery, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Kathryn R Rosenblatt: Department of Neurology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Anesthesiology & Critical Care Medicine, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Lora Clawson: Department of Neurology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Donna C Tippett: Departments of Neurology; Otolaryngology-Head and Neck Surgery; and Physical Medicine and Rehabilitation, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Nicholas Maragakis: Department of Neurology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Francesco V Tenore: Research and Exploratory Development Department, Johns Hopkins Applied Physics Laboratory, Laurel, MD, USA
- Matthew S Fifer: Research and Exploratory Development Department, Johns Hopkins Applied Physics Laboratory, Laurel, MD, USA
- Hynek Hermansky: Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD, USA; Human Language Technology Center of Excellence, The Johns Hopkins University, Baltimore, MD, USA
- Nick F Ramsey: UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- Nathan E Crone: Department of Neurology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
7
Anastasopoulou I, Cheyne DO, van Lieshout P, Johnson BW. Decoding kinematic information from beta-band motor rhythms of speech motor cortex: a methodological/analytic approach using concurrent speech movement tracking and magnetoencephalography. Front Hum Neurosci 2024; 18:1305058. [PMID: 38646159] [PMCID: PMC11027130] [DOI: 10.3389/fnhum.2024.1305058]
Abstract
Introduction. Articulography and functional neuroimaging are two major tools for studying the neurobiology of speech production. Until now, however, it has generally not been feasible to use both in the same experimental setup because of technical incompatibilities between the two methodologies. Methods. Here we describe results from a novel articulography system dubbed Magneto-articulography for the Assessment of Speech Kinematics (MASK), which is technically compatible with magnetoencephalography (MEG) brain scanning systems. In the present paper we describe our methodological and analytic approach for extracting brain motor activities related to key kinematic and coordination event parameters derived from time-registered MASK tracking measurements. Data were collected from 10 healthy adults with tracking coils on the tongue, lips, and jaw. Analyses targeted the gestural landmarks of reiterated utterances /ipa/ and /api/, produced at normal and faster rates. Results. The results show that (1) speech sensorimotor cortex can be reliably located in peri-rolandic regions of the left hemisphere; (2) mu (8-12 Hz) and beta band (13-30 Hz) neuromotor oscillations are present in the speech signals and contain information structures that are independent of those present in higher-frequency bands; and (3) hypotheses concerning the information content of speech motor rhythms can be systematically evaluated with multivariate pattern-analytic techniques. Discussion. These results show that MASK makes it possible to derive subject-specific articulatory parameters, based on well-established and robust motor control parameters, in the same experimental setup as the MEG brain recordings and in temporal and spatial co-registration with the brain data. The analytic approach described here provides new capabilities for testing hypotheses concerning the types of kinematic information that are encoded and processed within specific components of the speech neuromotor system.
Affiliation(s)
- Douglas Owen Cheyne: Department of Speech-Language Pathology, University of Toronto, Toronto, ON, Canada; Hospital for Sick Children Research Institute, Toronto, ON, Canada
- Pascal van Lieshout: Department of Speech-Language Pathology, University of Toronto, Toronto, ON, Canada
8
Chen J, Chen X, Wang R, Le C, Khalilian-Gourtani A, Jensen E, Dugan P, Doyle W, Devinsky O, Friedman D, Flinker A, Wang Y. Subject-Agnostic Transformer-Based Neural Speech Decoding from Surface and Depth Electrode Signals. bioRxiv 2024:2024.03.11.584533. [PMID: 38559163] [PMCID: PMC10980022] [DOI: 10.1101/2024.03.11.584533]
Abstract
Objective. This study investigates speech decoding from neural signals captured by intracranial electrodes. Most prior works can only work with electrodes on a 2D grid (i.e., an electrocorticographic or ECoG array) and data from a single patient. We aim to design a deep-learning model architecture that can accommodate both surface (ECoG) and depth (stereotactic EEG, or sEEG) electrodes. The architecture should allow training on data from multiple participants with large variability in electrode placements, and the trained model should perform well on participants unseen during training. Approach. We propose a novel transformer-based model architecture named SwinTW that can work with arbitrarily positioned electrodes by leveraging their 3D locations on the cortex rather than their positions on a 2D grid. We train both subject-specific models using data from a single participant and multi-patient models exploiting data from multiple participants. Main results. The subject-specific models using only low-density 8x8 ECoG data achieved a high decoding Pearson correlation coefficient with the ground-truth spectrogram (PCC = 0.817) over N = 43 participants, outperforming our prior convolutional ResNet model and the 3D Swin transformer model. Incorporating the additional strip, depth, and grid electrodes available in each participant (N = 39) led to further improvement (PCC = 0.838). For participants with only sEEG electrodes (N = 9), subject-specific models still achieved comparable performance, with an average PCC = 0.798. The multi-subject models achieved high performance on unseen participants, with an average PCC = 0.765 in leave-one-out cross-validation. Significance. The proposed SwinTW decoder enables future speech neuroprostheses to utilize any electrode placement that is clinically optimal or feasible for a particular participant, including using only depth electrodes, which are more routinely implanted in chronic neurosurgical procedures. Importantly, the generalizability of the multi-patient models suggests the exciting possibility of developing speech neuroprostheses for people with speech disability without relying on their own neural data for training, which is not always feasible.
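The Pearson correlation coefficient (PCC) used above to score decoded spectrograms against ground truth can be computed by flattening both spectrograms and correlating them. The spectrogram dimensions and noise level below are invented for illustration:

```python
import numpy as np

def spectrogram_pcc(pred, truth):
    """Pearson correlation between flattened predicted and ground-truth spectrograms."""
    p, t = pred.ravel(), truth.ravel()
    p = p - p.mean()
    t = t - t.mean()
    return float(p @ t / (np.linalg.norm(p) * np.linalg.norm(t)))

rng = np.random.default_rng(2)
truth = rng.random((80, 128))                      # 80 frequency bins x 128 time frames
pred = truth + 0.3 * rng.normal(size=truth.shape)  # decoder output = truth + noise
print(f"PCC = {spectrogram_pcc(pred, truth):.3f}")
```

Averaging this per-utterance score over held-out trials gives the participant-level PCC figures the paper reports.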
Affiliation(s)
- Junbo Chen: Electrical and Computer Engineering Department, New York University, 370 Jay Street, Brooklyn, 11201, NY, USA
- Xupeng Chen: Electrical and Computer Engineering Department, New York University, 370 Jay Street, Brooklyn, 11201, NY, USA
- Ran Wang: Electrical and Computer Engineering Department, New York University, 370 Jay Street, Brooklyn, 11201, NY, USA
- Chenqian Le: Electrical and Computer Engineering Department, New York University, 370 Jay Street, Brooklyn, 11201, NY, USA
- Erika Jensen: Neurology Department, New York University, 223 East 34th Street, Manhattan, 10016, NY, USA
- Patricia Dugan: Neurology Department, New York University, 223 East 34th Street, Manhattan, 10016, NY, USA
- Werner Doyle: Neurosurgery Department, New York University, 550 1st Avenue, Manhattan, 10016, NY, USA
- Orrin Devinsky: Neurology Department, New York University, 223 East 34th Street, Manhattan, 10016, NY, USA
- Daniel Friedman: Neurology Department, New York University, 223 East 34th Street, Manhattan, 10016, NY, USA
- Adeen Flinker: Neurology Department, New York University, 223 East 34th Street, Manhattan, 10016, NY, USA; Biomedical Engineering Department, New York University, 370 Jay Street, Brooklyn, 11201, NY, USA
- Yao Wang: Electrical and Computer Engineering Department, New York University, 370 Jay Street, Brooklyn, 11201, NY, USA; Biomedical Engineering Department, New York University, 370 Jay Street, Brooklyn, 11201, NY, USA
9
Vitória MA, Fernandes FG, van den Boom M, Ramsey N, Raemaekers M. Decoding Single and Paired Phonemes Using 7T Functional MRI. Brain Topogr 2024. [PMID: 38261272] [DOI: 10.1007/s10548-024-01034-6]
Abstract
Several studies have shown that mouth movements related to the pronunciation of individual phonemes are represented in the sensorimotor cortex. This would theoretically allow for brain-computer interfaces capable of decoding continuous speech by training classifiers on the activity in the sensorimotor cortex related to the production of individual phonemes. To address this, we investigated the decodability of trials with individual and paired phonemes (pronounced consecutively with a one-second interval) using activity in the sensorimotor cortex. Fifteen participants pronounced 3 different phonemes and 3 combinations of two of the same phonemes in a 7T functional MRI experiment. We confirmed that support vector machine (SVM) classification of single and paired phonemes was possible. Importantly, by combining classifiers trained on single phonemes, we were able to classify paired phonemes with an accuracy of 53% (33% chance level), demonstrating that activity of isolated phonemes is present and distinguishable in combined phonemes. An SVM searchlight analysis showed that the phoneme representations are widely distributed in the ventral sensorimotor cortex. These findings provide insights into the neural representations of single and paired phonemes. Furthermore, they support the notion that a speech BCI may be feasible based on machine learning algorithms trained on individual phonemes using intracranial electrode grids.
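The idea of combining classifiers trained on single phonemes to label paired-phoneme trials can be sketched as follows. A nearest-prototype classifier stands in for the study's SVMs, the response patterns are entirely synthetic, and all pairs of 3 phonemes are enumerated rather than the study's 3 specific combinations:

```python
import numpy as np

rng = np.random.default_rng(3)
n_phon, n_feat = 3, 40
protos = rng.normal(size=(n_phon, n_feat))  # hypothetical single-phoneme activity patterns

def classify_single(x):
    """Nearest single-phoneme prototype (stand-in for an SVM trained on single phonemes)."""
    return int(np.linalg.norm(protos - x, axis=1).argmin())

# A paired trial is modeled as two consecutive noisy single-phoneme responses;
# the pair label is decoded by applying the single-phoneme classifier to each part.
pairs = [(i, j) for i in range(n_phon) for j in range(n_phon)]
n_rep, correct = 20, 0
for _ in range(n_rep):
    for pair in pairs:
        trial = [protos[p] + 0.5 * rng.normal(size=n_feat) for p in pair]
        decoded = tuple(classify_single(part) for part in trial)
        correct += decoded == pair
acc = correct / (n_rep * len(pairs))
print(f"paired-phoneme accuracy from single-phoneme classifiers: {acc:.2f}")
```

Above-chance accuracy here reflects the same logic as the study's finding: information about each isolated phoneme remains distinguishable inside the combined trial.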
Affiliation(s)
- Maria Araújo Vitória
- Brain Center Rudolf Magnus, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Francisco Guerreiro Fernandes
- Brain Center Rudolf Magnus, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Max van den Boom
- Brain Center Rudolf Magnus, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- Department of Physiology and Biomedical Engineering, Mayo Clinic, Rochester, MN, USA
| | - Nick Ramsey
- Brain Center Rudolf Magnus, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Mathijs Raemaekers
- Brain Center Rudolf Magnus, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands.
10. Canny E, Vansteensel MJ, van der Salm SMA, Müller-Putz GR, Berezutskaya J. Boosting brain-computer interfaces with functional electrical stimulation: potential applications in people with locked-in syndrome. J Neuroeng Rehabil 2023; 20:157. PMID: 37980536; PMCID: PMC10656959; DOI: 10.1186/s12984-023-01272-y.
Abstract
Individuals in a locked-in state live with severe whole-body paralysis that limits their ability to communicate with family and loved ones. Recent advances in brain-computer interface (BCI) technology have presented a potential alternative for these people to communicate by detecting neural activity associated with attempted hand or speech movements and translating the decoded intended movements into a control signal for a computer. A technique that could potentially enrich the communication capacity of BCIs is functional electrical stimulation (FES) of paralyzed limbs and face to restore body and facial movements of paralyzed individuals, allowing body language and facial expression to be added to communication BCI utterances. Here, we review the current state of the art of BCI and FES work in people with paralysis of body and face and propose that a combined BCI-FES approach, which has already proved successful in several applications in stroke and spinal cord injury, can provide a promising new mode of communication for locked-in individuals.
Affiliation(s)
- Evan Canny
- Department of Neurology and Neurosurgery, Brain Center, University Medical Center Utrecht, Utrecht, The Netherlands
- Mariska J Vansteensel
- Department of Neurology and Neurosurgery, Brain Center, University Medical Center Utrecht, Utrecht, The Netherlands
- Sandra M A van der Salm
- Department of Neurology and Neurosurgery, Brain Center, University Medical Center Utrecht, Utrecht, The Netherlands
- Gernot R Müller-Putz
- Institute of Neural Engineering, Laboratory of Brain-Computer Interfaces, Graz University of Technology, Graz, Austria
- Julia Berezutskaya
- Department of Neurology and Neurosurgery, Brain Center, University Medical Center Utrecht, Utrecht, The Netherlands
11. Duraivel S, Rahimpour S, Chiang CH, Trumpis M, Wang C, Barth K, Harward SC, Lad SP, Friedman AH, Southwell DG, Sinha SR, Viventi J, Cogan GB. High-resolution neural recordings improve the accuracy of speech decoding. Nat Commun 2023; 14:6938. PMID: 37932250; PMCID: PMC10628285; DOI: 10.1038/s41467-023-42555-1.
Abstract
Patients suffering from debilitating neurodegenerative diseases often lose the ability to communicate, detrimentally affecting their quality of life. One solution to restore communication is to decode signals directly from the brain to enable neural speech prostheses. However, decoding has been limited by coarse neural recordings that inadequately capture the rich spatio-temporal structure of human brain signals. To resolve this limitation, we performed high-resolution micro-electrocorticographic (µECoG) neural recordings during intra-operative speech production. We obtained neural signals with 57× higher spatial resolution and 48% higher signal-to-noise ratio compared to macro-ECoG and SEEG. This increased signal quality improved decoding by 35% compared to standard intracranial signals. Accurate decoding depended on the high spatial resolution of the neural interface. Non-linear decoding models designed to utilize the enhanced spatio-temporal neural information outperformed linear techniques. We show that high-density µECoG can enable high-quality speech decoding for future neural speech prostheses.
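As a toy illustration of how such gains are reported, signal-to-noise ratio can be taken as task-band power over baseline power, with the µECoG advantage expressed as a fractional improvement. The numbers below are invented stand-ins chosen to reproduce a 48% gain; they are not values from the paper.

```python
# Hypothetical sketch: SNR as task power over baseline power, and the relative
# µECoG-vs-macro-ECoG advantage as a ratio of SNRs. All values are toy numbers.
def snr(task_power, baseline_power):
    """Simple power-ratio signal-to-noise estimate."""
    return task_power / baseline_power

snr_micro = snr(task_power=7.4, baseline_power=2.0)  # invented µECoG values
snr_macro = snr(task_power=5.0, baseline_power=2.0)  # invented macro-ECoG values
gain = snr_micro / snr_macro - 1.0                   # fractional SNR improvement
```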
Affiliation(s)
- Shervin Rahimpour
- Department of Neurosurgery, Duke School of Medicine, Durham, NC, USA
- Department of Neurosurgery, Clinical Neuroscience Center, University of Utah, Salt Lake City, UT, USA
- Chia-Han Chiang
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- Michael Trumpis
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- Charles Wang
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- Katrina Barth
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- Stephen C Harward
- Department of Neurosurgery, Duke School of Medicine, Durham, NC, USA
- Duke Comprehensive Epilepsy Center, Duke School of Medicine, Durham, NC, USA
- Shivanand P Lad
- Department of Neurosurgery, Duke School of Medicine, Durham, NC, USA
- Allan H Friedman
- Department of Neurosurgery, Duke School of Medicine, Durham, NC, USA
- Derek G Southwell
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- Department of Neurosurgery, Duke School of Medicine, Durham, NC, USA
- Duke Comprehensive Epilepsy Center, Duke School of Medicine, Durham, NC, USA
- Department of Neurobiology, Duke School of Medicine, Durham, NC, USA
- Saurabh R Sinha
- Penn Epilepsy Center, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Jonathan Viventi
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- Department of Neurosurgery, Duke School of Medicine, Durham, NC, USA
- Duke Comprehensive Epilepsy Center, Duke School of Medicine, Durham, NC, USA
- Department of Neurobiology, Duke School of Medicine, Durham, NC, USA
- Gregory B Cogan
- Department of Biomedical Engineering, Duke University, Durham, NC, USA
- Department of Neurosurgery, Duke School of Medicine, Durham, NC, USA
- Duke Comprehensive Epilepsy Center, Duke School of Medicine, Durham, NC, USA
- Department of Neurology, Duke School of Medicine, Durham, NC, USA
- Department of Psychology and Neuroscience, Duke University, Durham, NC, USA
- Center for Cognitive Neuroscience, Duke University, Durham, NC, USA
12. Wilmskoetter J, Roth R, McDowell K, Munsell B, Fontenot S, Andrews K, Chang A, Johnson LP, Sangtian S, Behroozmand R, van Mierlo P, Fridriksson J, Bonilha L. Semantic categorization of naming responses based on prearticulatory electrical brain activity. J Clin Neurophysiol 2023; 40:608-615. PMID: 37931162; PMCID: PMC10628367; DOI: 10.1097/wnp.0000000000000933.
Abstract
PURPOSE: Object naming requires visual decoding, conceptualization, semantic categorization, and phonological encoding, all within 400 to 600 ms of stimulus presentation and before a word is spoken. In this study, we sought to predict the semantic categories of naming responses from prearticulatory brain activity recorded with scalp EEG in healthy individuals. METHODS: We assessed 19 healthy individuals who completed a naming task while undergoing EEG. The naming task consisted of 120 drawings of animate objects, inanimate objects, or abstract figures. We applied a one-dimensional, two-layer neural network to predict the semantic categories of naming responses from prearticulatory brain activity. RESULTS: Classification of animate, inanimate, and abstract responses had an average accuracy of 80%, sensitivity of 72%, and specificity of 87% across participants. The time points with the highest average weights fell between 470 and 490 ms after stimulus presentation, and the electrodes with the highest weights were located over the left and right frontal brain areas. CONCLUSIONS: Scalp EEG can be successfully used to predict naming responses from prearticulatory brain activity. Interparticipant variability in feature weights suggests that individualized models are necessary for the highest accuracy. Our findings may inform future applications of EEG in reconstructing speech for individuals with and without speech impairments.
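The reported metrics (accuracy, plus sensitivity and specificity for a multi-class problem) can be computed one-vs-rest from true and predicted category labels, roughly as below. The toy labels are invented, and the network itself is omitted.

```python
# Sketch of one-vs-rest accuracy/sensitivity/specificity for a three-class
# (animate / inanimate / abstract) prediction. Labels below are toy data,
# not the study's.
import numpy as np

y_true = np.array([0, 0, 1, 1, 2, 2, 0, 1, 2, 2])
y_pred = np.array([0, 1, 1, 1, 2, 0, 0, 1, 2, 2])

def one_vs_rest_metrics(y_true, y_pred, cls):
    """Sensitivity and specificity for class `cls` against all other classes."""
    pos = y_true == cls
    tp = np.sum(pos & (y_pred == cls))      # correctly detected target class
    tn = np.sum(~pos & (y_pred != cls))     # correctly rejected other classes
    sensitivity = tp / np.sum(pos)
    specificity = tn / np.sum(~pos)
    return sensitivity, specificity

accuracy = float(np.mean(y_true == y_pred))
sens0, spec0 = one_vs_rest_metrics(y_true, y_pred, 0)
```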
Affiliation(s)
- Janina Wilmskoetter
- Department of Rehabilitation Sciences, College of Health Professions, Medical University of South Carolina, Charleston, SC 29425, USA
- Rebecca Roth
- Department of Neurology, College of Medicine, Medical University of South Carolina, Charleston, SC 29425, USA
- Konnor McDowell
- Department of Neurology, College of Medicine, Medical University of South Carolina, Charleston, SC 29425, USA
- Brent Munsell
- Department of Computer Science, College of Arts and Sciences, University of North Carolina-Chapel Hill, Chapel Hill, NC 27599, USA
- Skyler Fontenot
- Department of Neurology, College of Medicine, Medical University of South Carolina, Charleston, SC 29425, USA
- Keeghan Andrews
- Department of Neurology, College of Medicine, Medical University of South Carolina, Charleston, SC 29425, USA
- Allen Chang
- Department of Neurology, College of Medicine, Medical University of South Carolina, Charleston, SC 29425, USA
- Lorelei Phillip Johnson
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, SC 29208, USA
- Stacey Sangtian
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, SC 29208, USA
- Roozbeh Behroozmand
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, SC 29208, USA
- Julius Fridriksson
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, SC 29208, USA
- Leonardo Bonilha
- Department of Neurology, College of Medicine, Medical University of South Carolina, Charleston, SC 29425, USA
13. Sankaran N, Moses D, Chiong W, Chang EF. Recommendations for promoting user agency in the design of speech neuroprostheses. Front Hum Neurosci 2023; 17:1298129. PMID: 37920562; PMCID: PMC10619159; DOI: 10.3389/fnhum.2023.1298129.
Abstract
Brain-computer interfaces (BCIs) that directly decode speech from brain activity aim to restore communication in people with paralysis who cannot speak. Despite recent advances, neural inference of speech remains imperfect, limiting the ability of speech BCIs to enable experiences such as fluent conversation that promote agency, that is, the ability of users to author and transmit messages enacting their intentions. Here, we make recommendations for promoting agency based on existing and emerging strategies in neural engineering. The focus is on achieving fast, accurate, and reliable performance while ensuring volitional control over when a decoder is engaged, what exactly is decoded, and how messages are expressed. Additionally, alongside neuroscientific progress within controlled experimental settings, we argue that a parallel line of research must consider how to translate experimental successes into real-world environments. While such research will ultimately require input from prospective users, here we identify and describe design choices inspired by human-factors work in existing fields of assistive technology, which address practical issues likely to emerge in future real-world speech BCI applications.
Affiliation(s)
- Narayan Sankaran
- Kavli Center for Ethics, Science and the Public, University of California, Berkeley, Berkeley, CA, United States
- Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA, United States
- Weill Institute for Neuroscience, University of California, San Francisco, San Francisco, CA, United States
- David Moses
- Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA, United States
- Weill Institute for Neuroscience, University of California, San Francisco, San Francisco, CA, United States
- Winston Chiong
- Memory and Aging Center, Department of Neurology, University of California, San Francisco, San Francisco, CA, United States
- Edward F. Chang
- Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA, United States
- Weill Institute for Neuroscience, University of California, San Francisco, San Francisco, CA, United States
14. Berezutskaya J, Freudenburg ZV, Vansteensel MJ, Aarnoutse EJ, Ramsey NF, van Gerven MAJ. Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models. J Neural Eng 2023; 20:056010. PMID: 37467739; PMCID: PMC10510111; DOI: 10.1088/1741-2552/ace8be.
Abstract
Objective. Development of brain-computer interface (BCI) technology is key to enabling communication in individuals who have lost the faculty of speech due to severe motor paralysis. A BCI control strategy that is gaining attention employs speech decoding from neural data. Recent studies have shown that a combination of direct neural recordings and advanced computational models can provide promising results. Understanding which decoding strategies deliver the best and most directly applicable results is crucial for advancing the field. Approach. In this paper, we optimized and validated a decoding approach based on speech reconstruction directly from high-density electrocorticography recordings from sensorimotor cortex during a speech production task. Main results. We show that (1) dedicated machine learning optimization of reconstruction models is key to achieving the best reconstruction performance; (2) individual word decoding in reconstructed speech achieves 92%-100% accuracy (chance level is 8%); (3) direct reconstruction from sensorimotor brain activity produces intelligible speech. Significance. These results underline the need for model optimization to achieve the best speech decoding results and highlight the potential that reconstruction-based speech decoding from sensorimotor cortex offers for the development of next-generation BCI technology for communication.
Affiliation(s)
- Julia Berezutskaya
- Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht 3584 CX, The Netherlands
- Donders Center for Brain, Cognition and Behaviour, Nijmegen 6525 GD, The Netherlands
- Zachary V Freudenburg
- Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht 3584 CX, The Netherlands
- Mariska J Vansteensel
- Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht 3584 CX, The Netherlands
- Erik J Aarnoutse
- Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht 3584 CX, The Netherlands
- Nick F Ramsey
- Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht 3584 CX, The Netherlands
- Marcel A J van Gerven
- Donders Center for Brain, Cognition and Behaviour, Nijmegen 6525 GD, The Netherlands
15. Zhao Y, Chen Y, Cheng K, Huang W. Artificial intelligence based multimodal language decoding from brain activity: a review. Brain Res Bull 2023; 201:110713. PMID: 37487829; DOI: 10.1016/j.brainresbull.2023.110713.
Abstract
Decoding brain activity is conducive to breakthroughs in brain-computer interface (BCI) technology. The development of artificial intelligence (AI) continually promotes progress in brain language decoding. Existing research has mainly focused on a single modality and paid insufficient attention to AI methods. Our objective is therefore to provide an overview of relevant decoding research from the perspective of different modalities and methodologies. The modalities involve text, speech, image, and video, whereas the core method is using AI-built decoders to translate brain signals induced by multimodal stimuli into text or vocal language. The semantic information of brain activity can be successfully decoded into language at various levels, ranging from words through sentences to discourses. However, the decoding effect is affected by various factors, such as the decoding model, the vector representation model, and the brain regions involved. Challenges and future directions are also discussed. Advances in brain language decoding and BCI technology could ultimately assist patients with clinical aphasia in regaining the ability to communicate.
Affiliation(s)
- Yuhao Zhao
- College of Language Intelligence, Sichuan International Studies University, Chongqing 400031, PR China
- Yu Chen
- Technical College for the Deaf, Tianjin University of Technology, Tianjin 300384, PR China
- Kaiwen Cheng
- College of Language Intelligence, Sichuan International Studies University, Chongqing 400031, PR China
- Wei Huang
- Sichuan Provincial Key Laboratory for Human Disease Gene Study, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu 611731, PR China
16. Easthope E, Shamei A, Liu Y, Gick B, Fels S. Cortical control of posture in fine motor skills: evidence from inter-utterance rest position. Front Hum Neurosci 2023; 17:1139569. PMID: 37662639; PMCID: PMC10469778; DOI: 10.3389/fnhum.2023.1139569.
Abstract
The vocal tract continuously employs tonic muscle activity in the maintenance of postural configurations. Gamma-band activity in the sensorimotor cortex underlies transient movements during speech production, yet little is known about the neural control of postural states in the vocal tract. Simultaneously, there is evidence that sensorimotor beta-band activations contribute to a system of inhibition and state maintenance that is integral to postural control in the body. Here we use electrocorticography to assess the contribution of sensorimotor beta-band activity during speech articulation and postural maintenance, and demonstrate that beta-band activity corresponds to the inhibition of discrete speech movements and the maintenance of tonic postural states in the vocal tract. Our findings identify consistencies between the neural control of posture in speech and what is previously reported in gross motor contexts, providing support for a unified theory of postural control across gross and fine motor skills.
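The band-power contrast at the core of this comparison (beta-band activity for postural maintenance versus gamma-band activity for transient movements) can be sketched with a simple FFT-based measure on a synthetic signal. The signal, band edges, and sampling rate below are illustrative assumptions, not the study's recordings or pipeline.

```python
# Minimal band-power sketch (synthetic signal): a 1 s trace with a strong
# 20 Hz (beta) component should show far more beta-band than gamma-band power.
import numpy as np

fs = 1000                                    # sampling rate, Hz (assumed)
t = np.arange(fs) / fs
rng = np.random.default_rng(1)
x = np.sin(2 * np.pi * 20 * t) + 0.2 * rng.normal(size=fs)  # beta-dominated toy signal

def band_power(x, fs, lo, hi):
    """Mean spectral power of x between lo and hi Hz (one-sided FFT)."""
    freqs = np.fft.rfftfreq(len(x), d=1 / fs)
    psd = np.abs(np.fft.rfft(x)) ** 2 / len(x)
    band = (freqs >= lo) & (freqs < hi)
    return float(psd[band].mean())

beta = band_power(x, fs, 13, 30)     # beta band
gamma = band_power(x, fs, 70, 170)   # (high-)gamma band
```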
Affiliation(s)
- Eric Easthope
- Human Communication Technologies Lab, Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, BC, Canada
- Arian Shamei
- Integrated Speech Research Lab, Department of Linguistics, University of British Columbia, Vancouver, BC, Canada
- Yadong Liu
- Integrated Speech Research Lab, Department of Linguistics, University of British Columbia, Vancouver, BC, Canada
- Bryan Gick
- Integrated Speech Research Lab, Department of Linguistics, University of British Columbia, Vancouver, BC, Canada
- Haskins Laboratories, New Haven, CT, United States
- Sidney Fels
- Human Communication Technologies Lab, Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, BC, Canada
17. Angrick M, Luo S, Rabbani Q, Candrea DN, Shah S, Milsap GW, Anderson WS, Gordon CR, Rosenblatt KR, Clawson L, Maragakis N, Tenore FV, Fifer MS, Hermansky H, Ramsey NF, Crone NE. Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS. medRxiv 2023:2023.06.30.23291352. PMID: 37425721; PMCID: PMC10327279; DOI: 10.1101/2023.06.30.23291352.
Abstract
Recent studies have shown that speech can be reconstructed and synthesized using only brain activity recorded with intracranial electrodes, but until now this has only been done through retrospective analyses of recordings from able-bodied patients temporarily implanted with electrodes for epilepsy surgery. Here, we report online synthesis of intelligible words using a chronically implanted brain-computer interface (BCI) in a clinical trial participant (ClinicalTrials.gov, NCT03567213) with dysarthria due to amyotrophic lateral sclerosis (ALS). We demonstrate a reliable BCI that synthesizes commands freely chosen and spoken by the user from a vocabulary of 6 keywords originally designed to allow intuitive selection of items on a communication board. Our results show for the first time that a speech-impaired individual with ALS can use a chronically implanted BCI to reliably produce synthesized words that are intelligible to human listeners while preserving the participant's voice profile.
Affiliation(s)
- Miguel Angrick
- Department of Neurology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Shiyu Luo
- Department of Biomedical Engineering, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Qinwan Rabbani
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA
- Daniel N Candrea
- Department of Biomedical Engineering, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Samyak Shah
- Department of Neurology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Griffin W Milsap
- Research and Exploratory Development Department, Johns Hopkins Applied Physics Laboratory, Laurel, MD, USA
- William S Anderson
- Department of Neurosurgery, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Chad R Gordon
- Department of Neurosurgery, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Section of Neuroplastic and Reconstructive Surgery, Department of Plastic Surgery, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Kathryn R Rosenblatt
- Department of Neurology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Anesthesiology & Critical Care Medicine, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Lora Clawson
- Department of Neurology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Nicholas Maragakis
- Department of Neurology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Francesco V Tenore
- Research and Exploratory Development Department, Johns Hopkins Applied Physics Laboratory, Laurel, MD, USA
- Matthew S Fifer
- Research and Exploratory Development Department, Johns Hopkins Applied Physics Laboratory, Laurel, MD, USA
- Hynek Hermansky
- Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD, USA
- Human Language Technology Center of Excellence, The Johns Hopkins University, Baltimore, MD, USA
- Nick F Ramsey
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- Nathan E Crone
- Department of Neurology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
18. Soroush PZ, Herff C, Ries SK, Shih JJ, Schultz T, Krusienski DJ. The nested hierarchy of overt, mouthed, and imagined speech activity evident in intracranial recordings. Neuroimage 2023; 269:119913. PMID: 36731812; DOI: 10.1016/j.neuroimage.2023.119913.
Abstract
Recent studies have demonstrated that it is possible to decode and synthesize various aspects of acoustic speech directly from intracranial measurements of electrophysiological brain activity. To continue progressing toward the development of a practical speech neuroprosthesis for individuals with speech impairments, better understanding and modeling of imagined speech processes are required. The present study uses intracranial brain recordings from participants who performed a speaking task with trials consisting of overt, mouthed, and imagined speech modes, representing decreasing degrees of behavioral output. Speech activity detection models are constructed using spatial, spectral, and temporal brain activity features, and the features and model performances are characterized and compared across the three degrees of behavioral output. The results indicate a hierarchy in which the relevant channels for the lower behavioral output modes form nested subsets of the relevant channels for the higher behavioral output modes. This provides important insights for the elusive goal of developing more effective imagined speech decoding models relative to their better-established overt speech counterparts.
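The nested-hierarchy claim can be made concrete with a small check over per-mode sets of relevant channels: each lower-output mode's set should be contained in the next higher-output mode's set. The channel names below are invented for illustration.

```python
# Sketch of the "nested hierarchy" property: relevant channels for imagined
# speech ⊆ those for mouthed speech ⊆ those for overt speech.
# Channel sets here are toy examples, not the study's data.
relevant = {
    "overt":    {"ch1", "ch2", "ch3", "ch4", "ch5"},
    "mouthed":  {"ch2", "ch3", "ch5"},
    "imagined": {"ch2", "ch5"},
}

def is_nested(relevant, order=("imagined", "mouthed", "overt")):
    """True if each mode's channel set is contained in the next mode's set."""
    return all(relevant[a] <= relevant[b] for a, b in zip(order, order[1:]))

nested = is_nested(relevant)
```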
19. Branco MP, Geukes SH, Aarnoutse EJ, Ramsey NF, Vansteensel MJ. Nine decades of electrocorticography: a comparison between epidural and subdural recordings. Eur J Neurosci 2023; 57:1260-1288. PMID: 36843389; DOI: 10.1111/ejn.15941.
Abstract
In recent years, electrocorticography (ECoG) has arisen as a neural signal recording tool in the development of clinically viable neural interfaces. ECoG electrodes are generally placed below the dura mater (subdural) but can also be placed on top of the dura (epidural). In deciding which of these modalities best suits long-term implants, complications and signal quality are important considerations. Conceptually, epidural placement may present a lower risk of complications, as the dura is left intact, but also lower signal quality, with the dura acting as a signal attenuator. The extent to which complications and signal quality are affected by the dura, however, has been a matter of debate. To improve our understanding of the effects of the dura on complications and signal quality, we conducted a literature review. We inventoried the effect of the dura on signal quality, decodability, and longevity of acute and chronic ECoG recordings in humans and non-human primates. We also compared the incidence and nature of serious complications in studies that employed epidural and subdural ECoG. Overall, we found that, even though epidural recordings exhibit attenuated signal amplitude relative to subdural recordings, particularly for high-density grids, the decodability of epidurally recorded signals does not seem to be markedly affected. Additionally, we found that the nature of serious complications was comparable between epidural and subdural recordings. These results indicate that both epidural and subdural ECoG may be suited for long-term neural signal recordings, at least for current generations of clinical and high-density ECoG grids.
Affiliation(s)
- Mariana P Branco
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
- Simon H Geukes
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
- Erik J Aarnoutse
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
- Nick F Ramsey
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
- Mariska J Vansteensel
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
20. Towards clinical application of implantable brain-computer interfaces for people with late-stage ALS: medical and ethical considerations. J Neurol 2023; 270:1323-1336. PMID: 36450968; PMCID: PMC9971103; DOI: 10.1007/s00415-022-11464-6.
Abstract
Individuals with amyotrophic lateral sclerosis (ALS) frequently develop speech and communication problems in the course of their disease. Currently available augmentative and alternative communication technologies do not present a solution for many people with advanced ALS, because these devices depend on residual and reliable motor activity. Brain-computer interfaces (BCIs) use neural signals for computer control and may allow people with late-stage ALS to communicate even when conventional technology falls short. Recent years have witnessed fast progress in the development and validation of implanted BCIs, which place neural signal recording electrodes in or on the cortex. Eventual widespread clinical application of implanted BCIs as an assistive communication technology for people with ALS will have significant consequences for their daily life, as well as for the clinical management of the disease, not least because of the potential interaction between the BCI and other procedures people with ALS undergo, such as tracheostomy. This article aims to facilitate responsible real-world implementation of implanted BCIs. We review the state of the art of research on implanted BCIs for communication, as well as the medical and ethical implications of the clinical application of this technology. We conclude that the contribution of all BCI stakeholders, including clinicians of the various ALS-related disciplines, will be needed to develop procedures for, and shape the process of, the responsible clinical application of implanted BCIs.
21. Verwoert M, Ottenhoff MC, Goulis S, Colon AJ, Wagner L, Tousseyn S, van Dijk JP, Kubben PL, Herff C. Dataset of speech production in intracranial electroencephalography. Sci Data 2022; 9:434. PMID: 35869138; PMCID: PMC9307753; DOI: 10.1038/s41597-022-01542-9.
Abstract
Speech production is an intricate process involving a large number of muscles and cognitive processes. The neural processes underlying speech production are not completely understood. As speech is a uniquely human ability, it cannot be investigated in animal models. High-fidelity human data can only be obtained in clinical settings and are therefore not easily available to all researchers. Here, we provide a dataset of 10 participants reading out individual words while we measured intracranial EEG from a total of 1103 electrodes. The data, with their high temporal resolution and coverage of a large variety of cortical and sub-cortical brain regions, can help in better understanding the speech production process. At the same time, the data can be used to test speech decoding and synthesis approaches from neural data, toward the development of speech brain-computer interfaces and speech neuroprostheses.
Measurement(s): Brain activity. Technology Type(s): Stereotactic electroencephalography. Sample Characteristic - Organism: Homo sapiens. Sample Characteristic - Environment: Epilepsy monitoring center. Sample Characteristic - Location: The Netherlands.
22. Petrosyan A, Voskoboinikov A, Sukhinin D, Makarova A, Skalnaya A, Arkhipova N, Sinkin M, Ossadtchi A. Speech decoding from a small set of spatially segregated minimally invasive intracranial EEG electrodes with a compact and interpretable neural network. J Neural Eng 2022; 19. PMID: 36356309; DOI: 10.1088/1741-2552/aca1e1.
Abstract
Objective. Speech decoding, one of the most intriguing brain-computer interface applications, opens up plentiful opportunities, from the rehabilitation of patients to direct and seamless communication between humans. Typical solutions rely on invasive recordings with a large number of distributed electrodes implanted through craniotomy. Here we explored the possibility of creating a speech prosthesis in a minimally invasive setting with a small number of spatially segregated intracranial electrodes. Approach. We collected one hour of data (from two sessions) in two patients implanted with invasive electrodes. We then used only the contacts that pertained to a single stereotactic electroencephalographic (sEEG) shaft or a single electrocorticographic (ECoG) strip to decode neural activity into 26 words and one silence class. We employed a compact convolutional network-based architecture whose spatial and temporal filter weights allow for a physiologically plausible interpretation. Main results. We achieved on average 55% accuracy using only six channels of data recorded with a single minimally invasive sEEG electrode in the first patient, and 70% accuracy using only eight channels of data recorded from a single ECoG strip in the second patient, in classifying 26+1 overtly pronounced words. Our compact architecture did not require the use of pre-engineered features, learned fast, and resulted in a stable, interpretable and physiologically meaningful decision rule that successfully operated over a contiguous dataset collected during a different time interval than that used for training. Spatial characteristics of the pivotal neuronal populations corroborate active and passive speech mapping results and exhibit the inverse space-frequency relationship characteristic of neural activity. Compared to other architectures, our compact solution performed on par with or better than those recently featured in the neural speech decoding literature. Significance. We showcase the possibility of building a speech prosthesis with a small number of electrodes, based on a compact, feature-engineering-free decoder derived from a small amount of training data.
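The decoding setup in the entry above (26 words plus a silence class from six to eight channels) can be illustrated with a deliberately simple stand-in classifier. The sketch below substitutes a nearest-centroid rule for the compact convolutional network described in the paper; the channel and class counts mirror the study, but the data and all parameter values are synthetic assumptions.

```python
import numpy as np

class NearestCentroidDecoder:
    """Toy word decoder: assign a trial to the class whose mean
    feature vector (centroid) is closest in Euclidean distance.
    An illustrative stand-in, not the cited architecture."""

    def fit(self, X, y):
        # X: (n_trials, n_features), y: integer class labels.
        self.classes_ = np.unique(y)
        self.centroids_ = np.stack([X[y == c].mean(axis=0) for c in self.classes_])
        return self

    def predict(self, X):
        # Squared distance from every trial to every centroid.
        d = ((X[:, None, :] - self.centroids_[None, :, :]) ** 2).sum(axis=-1)
        return self.classes_[d.argmin(axis=1)]

# Synthetic "per-channel high-gamma" features: 6 channels, 27 classes (26 words + silence).
rng = np.random.default_rng(0)
n_classes, n_channels, trials_per_class = 27, 6, 10
centers = 3.0 * rng.standard_normal((n_classes, n_channels))
y = np.repeat(np.arange(n_classes), trials_per_class)
X = centers[y] + rng.standard_normal((len(y), n_channels))

clf = NearestCentroidDecoder().fit(X, y)
acc = (clf.predict(X) == y).mean()
print(f"training accuracy: {acc:.2f}")
```

A real pipeline would, as the entry stresses, evaluate on data from a separate recording interval rather than on the training set shown here.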
Affiliation(s)
- Artur Petrosyan
- Center for Bioelectric Interfaces, Higher School of Economics, Moscow, Russia
- Dmitrii Sukhinin
- Center for Bioelectric Interfaces, Higher School of Economics, Moscow, Russia
- Anna Makarova
- Center for Bioelectric Interfaces, Higher School of Economics, Moscow, Russia
- Mikhail Sinkin
- Moscow State University of Medicine and Dentistry; N.V. Sklifosovsky Research Institute of Emergency Medicine, Moscow, Russia
- Alexei Ossadtchi
- Center for Bioelectric Interfaces, Higher School of Economics, Moscow, Russia; Artificial Intelligence Research Institute (AIRI), Moscow, Russia
23. Rainey S. Speaker Responsibility for Synthetic Speech Derived from Neural Activity. J Med Philos 2022; 47:503-515. PMID: 36333930; DOI: 10.1093/jmp/jhac011.
Abstract
This article provides analysis of the mechanisms and outputs involved in language-use mediated by a neuroprosthetic device. It is motivated by the thought that users of speech neuroprostheses require sufficient control over what their devices externalize as synthetic speech if they are to be thought of as responsible for it, but that the nature of this control, and so the status of their responsibility, is not clear.
24. Vansteensel MJ, Branco MP, Leinders S, Freudenburg ZF, Schippers A, Geukes SH, Gaytant MA, Gosselaar PH, Aarnoutse EJ, Ramsey NF. Methodological Recommendations for Studies on the Daily Life Implementation of Implantable Communication-Brain-Computer Interfaces for Individuals With Locked-in Syndrome. Neurorehabil Neural Repair 2022; 36:666-677. PMID: 36124975; DOI: 10.1177/15459683221125788.
Abstract
Implantable brain-computer interfaces (BCIs) promise to be a viable means to restore communication in individuals with locked-in syndrome (LIS). In 2016, we presented the world's first fully implantable BCI system that uses subdural electrocorticography electrodes to record brain signals and a subcutaneous amplifier to transmit the signals to the outside world, and that enabled an individual with LIS to communicate via a tablet computer by selecting icons in spelling software. For future clinical implementation of implantable communication-BCIs, however, much work is still needed, for example, to validate these systems in daily life settings with more participants, and to improve the speed of communication. We believe the design and execution of future studies on these and other topics may benefit from the experience we have gained. Therefore, based on relevant literature and our own experiences, we here provide an overview of procedures, as well as recommendations, for recruitment, screening, inclusion, imaging, hospital admission, implantation, training, and support of participants with LIS, for studies on daily life implementation of implantable communication-BCIs. With this article, we not only aim to inform the BCI community about important topics of concern, but also hope to contribute to improved methodological standardization of implantable BCI research.
Affiliation(s)
- Mariska J Vansteensel
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- Mariana P Branco
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- Sacha Leinders
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- Zac F Freudenburg
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- Anouck Schippers
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- Simon H Geukes
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- Michael A Gaytant
- Department of Pulmonary Diseases/Home Mechanical Ventilation, University Medical Center Utrecht, Utrecht, The Netherlands
- Peter H Gosselaar
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- Erik J Aarnoutse
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- Nick F Ramsey
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
25. Kennedy P, Cervantes AJ. Recruitment and Differential Firing Patterns of Single Units During Conditioning to a Tone in a Mute Locked-In Human. Front Hum Neurosci 2022; 16:864983. PMID: 36211127; PMCID: PMC9532552; DOI: 10.3389/fnhum.2022.864983.
Abstract
Single units that are not related to the desired task can become related to the task by conditioning their firing rates. We theorized that, during conditioning of firing rates to a tone, (a) unrelated single units would be recruited to the task; (b) the recruitment would depend on the phase of the task; (c) tones of different frequencies would produce different patterns of single unit recruitment. In our mute locked-in participant, we conditioned single units using tones of different frequencies emitted from a tone generator. The conditioning task had three phases: Listen to the tone for 20 s, then silently sing the tone for 10 s, with a prior control period of resting for 10 s. Twenty single units were recorded simultaneously while feedback of one of the twenty single units was made audible to the mute locked-in participant. The results indicate that (a) some of the non-audible single units were recruited during conditioning, (b) some were recruited differentially depending on the phase of the paradigm (listen, rest, or silent sing), and (c) single unit firing patterns were specific for different tone frequencies such that the tone could be recognized from the pattern of single unit firings. These data are important when conditioning single unit firings in brain-computer interfacing tasks because they provide evidence that increased numbers of previously unrelated single units can be incorporated into the task. This incorporation expands the bandwidth of the recorded single unit population and thus enhances the brain-computer interface. This is the first report of conditioning of single unit firings in a human participant with a brain-computer implant.
Affiliation(s)
- Philip Kennedy
- Neural Signals, Inc., Duluth, GA, United States
26. Mercier MR, Dubarry AS, Tadel F, Avanzini P, Axmacher N, Cellier D, Vecchio MD, Hamilton LS, Hermes D, Kahana MJ, Knight RT, Llorens A, Megevand P, Melloni L, Miller KJ, Piai V, Puce A, Ramsey NF, Schwiedrzik CM, Smith SE, Stolk A, Swann NC, Vansteensel MJ, Voytek B, Wang L, Lachaux JP, Oostenveld R. Advances in human intracranial electroencephalography research, guidelines and good practices. Neuroimage 2022; 260:119438. PMID: 35792291; DOI: 10.1016/j.neuroimage.2022.119438.
Abstract
Since the second half of the twentieth century, intracranial electroencephalography (iEEG), including both electrocorticography (ECoG) and stereo-electroencephalography (sEEG), has provided an intimate view into the human brain. At the interface between fundamental research and the clinic, iEEG provides both high temporal resolution and high spatial specificity, but comes with constraints, such as the sparse, individually tailored electrode sampling. Over the years, researchers in neuroscience developed their practices to make the most of the iEEG approach. Here we offer a critical review of iEEG research practices in a didactic framework for newcomers, as well as addressing issues encountered by proficient researchers. The scope is threefold: (i) review common practices in iEEG research, (ii) suggest potential guidelines for working with iEEG data and answer frequently asked questions based on the most widespread practices, and (iii) based on current neurophysiological knowledge and methodologies, pave the way to good practice standards in iEEG research. The organization of this paper follows the steps of iEEG data processing. The first section contextualizes iEEG data collection. The second section focuses on localization of intracranial electrodes. The third section highlights the main pre-processing steps. The fourth section presents iEEG signal analysis methods. The fifth section discusses statistical approaches. The sixth section draws some unique perspectives on iEEG research. Finally, to ensure a consistent nomenclature throughout the manuscript and to align with other guidelines, e.g., Brain Imaging Data Structure (BIDS) and the OHBM Committee on Best Practices in Data Analysis and Sharing (COBIDAS), we provide a glossary to disambiguate terms related to iEEG research.
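For speech and motor studies of the kind collected in this list, the pre-processing steps surveyed in the entry above often reduce to a short chain: common-average re-referencing, line-noise removal, and broadband high-gamma envelope extraction. The sketch below illustrates such a chain; the filter orders, the 70-170 Hz band edges, and the 50 Hz line frequency are illustrative assumptions, not recommendations taken from the paper.

```python
import numpy as np
from scipy.signal import butter, filtfilt, hilbert, iirnotch

def preprocess_ieeg(data, fs=1000.0):
    """Illustrative iEEG pre-processing chain.
    data: (n_channels, n_samples) raw voltages; fs: sampling rate in Hz.
    Returns the high-gamma analytic amplitude (envelope) per channel."""
    # 1) Common-average reference: subtract the mean across electrodes.
    car = data - data.mean(axis=0, keepdims=True)
    # 2) Notch out 50 Hz line noise and its first harmonic.
    for f0 in (50.0, 100.0):
        b, a = iirnotch(f0, Q=30.0, fs=fs)
        car = filtfilt(b, a, car, axis=-1)
    # 3) Band-pass in a broadband high-gamma range (70-170 Hz).
    b, a = butter(4, [70.0, 170.0], btype="bandpass", fs=fs)
    hg = filtfilt(b, a, car, axis=-1)
    # 4) Envelope via the analytic signal (Hilbert transform).
    return np.abs(hilbert(hg, axis=-1))

rng = np.random.default_rng(0)
env = preprocess_ieeg(rng.standard_normal((16, 2000)))
print(env.shape)  # (16, 2000)
```

Re-referencing schemes and band definitions vary across labs, which is precisely the kind of methodological variability the cited guidelines aim to make explicit.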
27. Favero P, Berezutskaya J, Ramsey NF, Nazarov A, Freudenburg ZV. Mapping Acoustics to Articulatory Gestures in Dutch: Relating Speech Gestures, Acoustics and Neural Data. Annu Int Conf IEEE Eng Med Biol Soc 2022; 2022:802-806. PMID: 36085697; DOI: 10.1109/embc48229.2022.9871909.
Abstract
Completely locked-in patients suffer from paralysis affecting every muscle in their body, reducing their communication means to brain-computer interfaces (BCIs). State-of-the-art BCIs have a slow spelling rate, which inevitably places a burden on patients' quality of life. Novel techniques address this problem by following a bio-mimetic approach, which consists of decoding sensory-motor cortex (SMC) activity that underlies the movements of the vocal tract's articulators. As recording articulatory data in combination with neural recordings is often unfeasible, the goal of this study was to develop an acoustic-to-articulatory inversion (AAI) model, i.e. an algorithm that generates articulatory data (speech gestures) from acoustics. A fully convolutional neural network was trained to solve the AAI mapping, and was tested on an unseen acoustic set, recorded simultaneously with neural data. Representational similarity analysis was then used to assess the relationship between predicted gestures and neural responses. The network's predictions and targets were significantly correlated. Moreover, SMC neural activity was correlated to the vocal tract gestural dynamics. The present AAI model has the potential to further our understanding of the relationship between neural, gestural and acoustic signals and lay the foundations for the development of a bio-mimetic speech BCI. Clinical Relevance: This study investigates the relationship between articulatory gestures during speech and the underlying neural activity. The topic is central for development of brain-computer interfaces for severely paralysed individuals.
28. Berezutskaya J, Ambrogioni L, Ramsey NF, van Gerven MAJ. Towards Naturalistic Speech Decoding from Intracranial Brain Data. Annu Int Conf IEEE Eng Med Biol Soc 2022; 2022:3100-3104. PMID: 36085779; DOI: 10.1109/embc48229.2022.9871301.
Abstract
Speech decoding from brain activity can enable development of brain-computer interfaces (BCIs) to restore naturalistic communication in paralyzed patients. Previous work has focused on development of decoding models from isolated speech data with a clean background and multiple repetitions of the material. In this study, we describe a novel approach to speech decoding that relies on a generative adversarial neural network (GAN) to reconstruct speech from brain data recorded during a naturalistic speech listening task (watching a movie). We compared the GAN-based approach, where reconstruction was done from the compressed latent representation of sound decoded from the brain, with several baseline models that reconstructed the sound spectrogram directly. We show that the novel approach provides more accurate reconstructions compared to the baselines. These results underscore the potential of GAN models for speech decoding in naturalistic noisy environments and the further advancement of BCIs for naturalistic communication. Clinical Relevance: This study presents a novel speech decoding paradigm that combines advances in deep learning, speech synthesis and neural engineering, and has the potential to advance the field of BCI for severely paralyzed individuals.
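The baseline models mentioned in the entry above map neural features directly to sound-spectrogram frames. A minimal linear version of such a direct-mapping baseline is ridge regression from brain features to spectrogram bins; the sketch below uses synthetic data, and the feature count (40) and bin count (32) are arbitrary assumptions, not values from the study.

```python
import numpy as np

def ridge_fit(X, Y, lam=1.0):
    """Closed-form ridge regression: solve (X'X + lam*I) W = X'Y.
    X: (n_samples, n_features) neural features,
    Y: (n_samples, n_bins) spectrogram frames."""
    n_features = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(n_features), X.T @ Y)

rng = np.random.default_rng(1)
X = rng.standard_normal((500, 40))          # e.g. high-gamma features over electrodes
W_true = rng.standard_normal((40, 32))      # ground-truth linear mapping
Y = X @ W_true + 0.1 * rng.standard_normal((500, 32))  # noisy spectrogram frames

W = ridge_fit(X, Y, lam=1e-2)
err = np.abs(W - W_true).mean()
print(f"mean coefficient error: {err:.4f}")
```

The GAN-based model in the entry replaces this direct spectrogram mapping with reconstruction from a compressed latent sound representation, which is what yields the reported accuracy gains.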
29. Mirchi N, Warsi NM, Zhang F, Wong SM, Suresh H, Mithani K, Erdman L, Ibrahim GM. Decoding Intracranial EEG With Machine Learning: A Systematic Review. Front Hum Neurosci 2022; 16:913777. PMID: 35832872; PMCID: PMC9271576; DOI: 10.3389/fnhum.2022.913777.
Abstract
Advances in intracranial electroencephalography (iEEG) and neurophysiology have enabled the study of previously inaccessible brain regions with high-fidelity temporal and spatial resolution. Studies of iEEG have revealed a rich neural code subserving healthy brain function and which fails in disease states. Machine learning (ML), a form of artificial intelligence, is a modern tool that may be able to better decode complex neural signals and enhance interpretation of these data. To date, a number of publications have applied ML to iEEG, but clinician awareness of these techniques, and of their relevance to neurosurgery, has been limited. The present work presents a review of existing applications of ML techniques in iEEG data, discusses the relative merits and limitations of the various approaches, and examines potential avenues for clinical translation in neurosurgery. One hundred seven articles examining artificial intelligence applications to iEEG were identified from three databases. Clinical applications of ML from these articles were categorized into four domains: i) seizure analysis, ii) motor tasks, iii) cognitive assessment, and iv) sleep staging. The review revealed that supervised algorithms were most commonly used across studies and often leveraged publicly available time-series datasets. We conclude with recommendations for future work and potential clinical applications.
Affiliation(s)
- Nykan Mirchi
- Faculty of Medicine, University of Toronto, Toronto, ON, Canada
- Nebras M. Warsi
- Division of Neurosurgery, Hospital for Sick Children, Department of Surgery, University of Toronto, Toronto, ON, Canada
- Institute of Biomedical Engineering, University of Toronto, Toronto, ON, Canada
- Frederick Zhang
- Faculty of Medicine, University of Ottawa, Ottawa, ON, Canada
- Simeon M. Wong
- Institute of Biomedical Engineering, University of Toronto, Toronto, ON, Canada
- Program in Neuroscience and Mental Health, Hospital for Sick Children Research Institute, Toronto, ON, Canada
- Hrishikesh Suresh
- Division of Neurosurgery, Hospital for Sick Children, Department of Surgery, University of Toronto, Toronto, ON, Canada
- Karim Mithani
- Division of Neurosurgery, Hospital for Sick Children, Department of Surgery, University of Toronto, Toronto, ON, Canada
- Institute of Biomedical Engineering, University of Toronto, Toronto, ON, Canada
- Lauren Erdman
- Vector Institute for Artificial Intelligence, MaRS Centre, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Hospital for Sick Children, Toronto, ON, Canada
- George M. Ibrahim
- Division of Neurosurgery, Hospital for Sick Children, Department of Surgery, University of Toronto, Toronto, ON, Canada
- Institute of Biomedical Engineering, University of Toronto, Toronto, ON, Canada
- Program in Neuroscience and Mental Health, Hospital for Sick Children Research Institute, Toronto, ON, Canada
- Institute of Medical Science, University of Toronto, Toronto, ON, Canada
30. Lin Y, Hsieh PJ. Neural decoding of speech with semantic-based classification. Cortex 2022; 154:231-240. DOI: 10.1016/j.cortex.2022.05.018.
31. Michail G, Senkowski D, Holtkamp M, Wächter B, Keil J. Early beta oscillations in multisensory association areas underlie crossmodal performance enhancement. Neuroimage 2022; 257:119307. PMID: 35577024; DOI: 10.1016/j.neuroimage.2022.119307.
Abstract
The combination of signals from different sensory modalities can enhance perception and facilitate behavioral responses. While previous research described crossmodal influences in a wide range of tasks, it remains unclear how such influences drive performance enhancements. In particular, the neural mechanisms underlying performance-relevant crossmodal influences, as well as the latency and spatial profile of such influences, are not well understood. Here, we examined data from high-density electroencephalography recordings (N = 30) to characterize the oscillatory signatures of crossmodal facilitation of response speed, as manifested in the speeding of visual responses by concurrent task-irrelevant auditory information. Using a data-driven analysis approach, we found that individual gains in response speed correlated with a larger beta power difference (13-25 Hz) between the audiovisual and the visual condition, starting within 80 ms after stimulus onset in the secondary visual cortex and in multisensory association areas in the parietal cortex. In addition, we examined data from electrocorticography (ECoG) recordings in four epileptic patients in a comparable paradigm. These ECoG data revealed reduced beta power in audiovisual compared with visual trials in the superior temporal gyrus (STG). Collectively, our data suggest that the crossmodal facilitation of response speed is associated with reduced early beta power in multisensory association and secondary visual areas. The reduced early beta power may reflect an auditory-driven feedback signal to improve visual processing through attentional gating. These findings improve our understanding of the neural mechanisms underlying crossmodal response speed facilitation and highlight the critical role of beta oscillations in mediating behaviorally relevant multisensory processing.
Affiliation(s)
- Georgios Michail
- Department of Psychiatry and Psychotherapy, Charité Campus Mitte (CCM), Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Charitéplatz 1, Berlin 10117, Germany
- Daniel Senkowski
- Department of Psychiatry and Psychotherapy, Charité Campus Mitte (CCM), Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Charitéplatz 1, Berlin 10117, Germany
- Martin Holtkamp
- Epilepsy-Center Berlin-Brandenburg, Institute for Diagnostics of Epilepsy, Berlin 10365, Germany; Department of Neurology, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Charité Campus Mitte (CCM), Charitéplatz 1, Berlin 10117, Germany
- Bettina Wächter
- Epilepsy-Center Berlin-Brandenburg, Institute for Diagnostics of Epilepsy, Berlin 10365, Germany
- Julian Keil
- Biological Psychology, Christian-Albrechts-University Kiel, Kiel 24118, Germany
32. Merk T, Peterson V, Köhler R, Haufe S, Richardson RM, Neumann WJ. Machine learning based brain signal decoding for intelligent adaptive deep brain stimulation. Exp Neurol 2022; 351:113993. PMID: 35104499; PMCID: PMC10521329; DOI: 10.1016/j.expneurol.2022.113993.
Abstract
Sensing enabled implantable devices and next-generation neurotechnology allow real-time adjustments of invasive neuromodulation. The identification of symptom and disease-specific biomarkers in invasive brain signal recordings has inspired the idea of demand dependent adaptive deep brain stimulation (aDBS). Expanding the clinical utility of aDBS with machine learning may hold the potential for the next breakthrough in the therapeutic success of clinical brain computer interfaces. To this end, sophisticated machine learning algorithms optimized for decoding of brain states from neural time-series must be developed. To support this venture, this review summarizes the current state of machine learning studies for invasive neurophysiology. After a brief introduction to the machine learning terminology, the transformation of brain recordings into meaningful features for decoding of symptoms and behavior is described. Commonly used machine learning models are explained and analyzed from the perspective of utility for aDBS. This is followed by a critical review on good practices for training and testing to ensure conceptual and practical generalizability for real-time adaptation in clinical settings. Finally, first studies combining machine learning with aDBS are highlighted. This review takes a glimpse into the promising future of intelligent adaptive DBS (iDBS) and concludes by identifying four key ingredients on the road for successful clinical adoption: i) multidisciplinary research teams, ii) publicly available datasets, iii) open-source algorithmic solutions and iv) strong world-wide research collaborations.
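One of the good practices the review above stresses, testing on data that is temporally separate from the training data so that real-time generalization is not overestimated, can be made concrete with contiguous (block-wise) cross-validation folds instead of shuffled ones. The sketch below is a generic illustration; the fold count and sample count are arbitrary assumptions.

```python
import numpy as np

def contiguous_splits(n_samples, n_folds=5):
    """Yield (train, test) index arrays where each test set is one
    contiguous block of the time series, so test samples never
    interleave with training samples (avoiding temporal leakage)."""
    edges = np.linspace(0, n_samples, n_folds + 1, dtype=int)
    for k in range(n_folds):
        test = np.arange(edges[k], edges[k + 1])
        train = np.concatenate(
            [np.arange(0, edges[k]), np.arange(edges[k + 1], n_samples)]
        )
        yield train, test

folds = list(contiguous_splits(100, n_folds=5))
print(len(folds))  # 5
```

Shuffled k-fold splits on overlapping feature windows would let nearly identical samples appear in both partitions, which is exactly the leakage this scheme is meant to prevent.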
Affiliation(s)
- Timon Merk
- Movement Disorder and Neuromodulation Unit, Department of Neurology, Charité - Universitätsmedizin Berlin, Chariteplatz 1, 10117 Berlin, Germany
- Victoria Peterson
- Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, United States
- Richard Köhler
- Movement Disorder and Neuromodulation Unit, Department of Neurology, Charité - Universitätsmedizin Berlin, Chariteplatz 1, 10117 Berlin, Germany
- Stefan Haufe
- Berlin Center for Advanced Neuroimaging (BCAN), Charité - Universitätsmedizin Berlin, Chariteplatz 1, 10117 Berlin, Germany
- R Mark Richardson
- Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, United States
- Wolf-Julian Neumann
- Movement Disorder and Neuromodulation Unit, Department of Neurology, Charité - Universitätsmedizin Berlin, Chariteplatz 1, 10117 Berlin, Germany
33. Glanz O, Hader M, Schulze-Bonhage A, Auer P, Ball T. A Study of Word Complexity Under Conditions of Non-experimental, Natural Overt Speech Production Using ECoG. Front Hum Neurosci 2022; 15:711886. PMID: 35185491; PMCID: PMC8854223; DOI: 10.3389/fnhum.2021.711886.
Abstract
The linguistic complexity of words has largely been studied on the behavioral level and in experimental settings. Only little is known about the neural processes underlying it in uninstructed, spontaneous conversations. We built up a multimodal neurolinguistic corpus composed of synchronized audio, video, and electrocorticographic (ECoG) recordings from the fronto-temporo-parietal cortex to address this phenomenon based on uninstructed, spontaneous speech production. We performed extensive linguistic annotations of the language material and calculated word complexity using several numeric parameters. We orthogonalized the parameters with the help of a linear regression model. Then, we correlated the spectral components of neural activity with the individual linguistic parameters and with the residuals of the linear regression model, and compared the results. The proportional relation between the number of consonants and vowels, which was the most informative parameter with regard to the neural representation of word complexity, showed effects in two areas: the frontal one was at the junction of the premotor cortex, the prefrontal cortex, and Brodmann area 44. The postcentral one lay directly above the lateral sulcus and comprised the ventral central sulcus, the parietal operculum and the adjacent inferior parietal cortex. Beyond the physiological findings summarized here, our methods may be useful for those interested in ways of studying neural effects related to natural language production and in surmounting the intrinsic problem of collinearity between multiple features of spontaneously spoken material.
Affiliation(s)
- Olga Glanz
- GRK 1624 “Frequency Effects in Language,” University of Freiburg, Freiburg, Germany
- Department of German Linguistics, University of Freiburg, Freiburg, Germany
- The Hermann Paul School of Linguistics, University of Freiburg, Freiburg, Germany
- BrainLinks-BrainTools, University of Freiburg, Freiburg, Germany
- Neurobiology and Biophysics, Faculty of Biology, University of Freiburg, Freiburg, Germany
- Translational Neurotechnology Lab, Department of Neurosurgery, Faculty of Medicine, Medical Center—University of Freiburg, University of Freiburg, Freiburg, Germany
- Marina Hader
- BrainLinks-BrainTools, University of Freiburg, Freiburg, Germany
- Translational Neurotechnology Lab, Department of Neurosurgery, Faculty of Medicine, Medical Center—University of Freiburg, University of Freiburg, Freiburg, Germany
- Andreas Schulze-Bonhage
- Department of Neurosurgery, Faculty of Medicine, Epilepsy Center, Medical Center—University of Freiburg, University of Freiburg, Freiburg, Germany
- Bernstein Center Freiburg, University of Freiburg, Freiburg, Germany
- Peter Auer
- GRK 1624 “Frequency Effects in Language,” University of Freiburg, Freiburg, Germany
- Department of German Linguistics, University of Freiburg, Freiburg, Germany
- The Hermann Paul School of Linguistics, University of Freiburg, Freiburg, Germany
- Tonio Ball
- BrainLinks-BrainTools, University of Freiburg, Freiburg, Germany
- Translational Neurotechnology Lab, Department of Neurosurgery, Faculty of Medicine, Medical Center—University of Freiburg, University of Freiburg, Freiburg, Germany
- Bernstein Center Freiburg, University of Freiburg, Freiburg, Germany
34. Luo S, Rabbani Q, Crone NE. Brain-Computer Interface: Applications to Speech Decoding and Synthesis to Augment Communication. Neurotherapeutics 2022; 19:263-273. PMID: 35099768; PMCID: PMC9130409; DOI: 10.1007/s13311-022-01190-2.
Abstract
Damage or degeneration of motor pathways necessary for speech and other movements, as in brainstem strokes or amyotrophic lateral sclerosis (ALS), can interfere with efficient communication without affecting brain structures responsible for language or cognition. In the worst-case scenario, this can result in locked-in syndrome (LIS), a condition in which individuals cannot initiate communication and can only express themselves by answering yes/no questions with eye blinks or other rudimentary movements. Existing augmentative and alternative communication (AAC) devices that rely on eye tracking can improve the quality of life for people with this condition, but brain-computer interfaces (BCIs) are also increasingly being investigated as AAC devices, particularly when eye tracking is too slow or unreliable. Moreover, with recent and ongoing advances in machine learning and neural recording technologies, BCIs may offer the only means to go beyond cursor control and text generation on a computer, to allow real-time synthesis of speech, which would arguably offer the most efficient and expressive channel for communication. The potential for BCI speech synthesis has only recently been realized because of seminal studies of the neuroanatomical and neurophysiological underpinnings of speech production using intracranial electrocorticographic (ECoG) recordings in patients undergoing epilepsy surgery. These studies have shown that cortical areas responsible for vocalization and articulation are distributed over a large area of ventral sensorimotor cortex, and that it is possible to decode speech and reconstruct its acoustics from ECoG if these areas are recorded with sufficiently dense and comprehensive electrode arrays. In this article, we review these advances, including the latest neural decoding strategies that range from deep learning models to the direct concatenation of speech units.
We also discuss state-of-the-art vocoders that are integral in constructing natural-sounding audio waveforms for speech BCIs. Finally, this review outlines some of the challenges ahead in directly synthesizing speech for patients with LIS.
Affiliation(s)
- Shiyu Luo
- Department of Biomedical Engineering, The Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Qinwan Rabbani
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, MD, USA
- Nathan E Crone
- Department of Neurology, The Johns Hopkins University School of Medicine, Baltimore, MD, USA

35
Dash D, Ferrari P, Babajani-Feremi A, Borna A, Schwindt PDD, Wang J. Magnetometers vs Gradiometers for Neural Speech Decoding. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2021; 2021:6543-6546. [PMID: 34892608 DOI: 10.1109/embc46164.2021.9630489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Neural speech decoding aims at providing natural-rate communication assistance to patients in a locked-in state (e.g., due to amyotrophic lateral sclerosis, ALS), in contrast to traditional brain-computer interface (BCI) spellers, which are slow. Recent studies have shown that magnetoencephalography (MEG) is a suitable neuroimaging modality for studying neural speech decoding, considering its excellent temporal resolution, which can characterize the fast dynamics of speech. Gradiometers have been the preferred choice for sensor-space analysis with MEG, due to their efficacy in noise suppression over magnetometers. However, the recent development of optically pumped magnetometer (OPM)-based wearable MEG devices has shown great potential for future BCI applications; yet no prior study had evaluated the performance of magnetometers in neural speech decoding. In this study, we decoded imagined and spoken speech from the MEG signals of seven healthy participants and compared the performance of magnetometers and gradiometers. Experimental results indicated that magnetometers also have potential for neural speech decoding, although the performance was significantly lower than that obtained with gradiometers. Further, we implemented a wavelet-based denoising strategy that significantly improved the performance of both magnetometers and gradiometers. These findings reconfirm that gradiometers are preferable in MEG-based decoding analysis but also point towards the use of magnetometers (or OPMs) in the development of next-generation speech BCIs.
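The wavelet-based denoising mentioned above can be sketched in a few lines. This is a minimal illustration, assuming a one-level Haar transform with soft thresholding of the detail coefficients; the study's actual wavelet family, decomposition depth, and threshold rule may differ, and the signals here are synthetic.

```python
import numpy as np

def haar_dwt(x):
    """One-level Haar transform: approximation and detail coefficients."""
    even, odd = x[0::2], x[1::2]
    return (even + odd) / np.sqrt(2), (even - odd) / np.sqrt(2)

def haar_idwt(approx, detail):
    """Inverse one-level Haar transform."""
    even = (approx + detail) / np.sqrt(2)
    odd = (approx - detail) / np.sqrt(2)
    x = np.empty(even.size + odd.size)
    x[0::2], x[1::2] = even, odd
    return x

def denoise(signal, threshold):
    """Soft-threshold the detail coefficients to suppress broadband noise
    while keeping the slowly varying component."""
    approx, detail = haar_dwt(signal)
    detail = np.sign(detail) * np.maximum(np.abs(detail) - threshold, 0.0)
    return haar_idwt(approx, detail)

# Synthetic example: a slow sinusoid buried in Gaussian noise.
rng = np.random.default_rng(0)
t = np.linspace(0, 1, 256)
clean = np.sin(2 * np.pi * 3 * t)
noisy = clean + 0.3 * rng.standard_normal(t.size)
denoised = denoise(noisy, threshold=0.3)
print(np.mean((noisy - clean) ** 2), np.mean((denoised - clean) ** 2))
```

Because the Haar transform is orthonormal, thresholding in the wavelet domain removes noise energy from the detail band while the slow signal, whose detail coefficients are small, passes through nearly unchanged.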
36
Verwoert M, Vansteensel MJ, Freudenburg ZV, Aarnoutse EJ, Leijten FS, Ramsey NF, Branco MP. Decoding four hand gestures with a single bipolar pair of electrocorticography electrodes. J Neural Eng 2021; 18:10.1088/1741-2552/ac2c9f. [PMID: 34607318 PMCID: PMC8744490 DOI: 10.1088/1741-2552/ac2c9f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Accepted: 10/04/2021] [Indexed: 11/12/2022]
Abstract
Objective.Electrocorticography (ECoG) based brain-computer interfaces (BCIs) can be used to restore communication in individuals with locked-in syndrome. In motor-based BCIs, the number of degrees-of-freedom, and thus the speed of the BCI, directly depends on the number of classes that can be discriminated from the neural activity in the sensorimotor cortex. When considering minimally invasive BCI implants, the size of the subdural ECoG implant must be minimized without compromising the number of degrees-of-freedom.Approach.Here we investigated if four hand gestures could be decoded using a single ECoG strip of four consecutive electrodes spaced 1 cm apart and compared the performance between a unipolar and a bipolar montage. For that we collected data of seven individuals with intractable epilepsy implanted with ECoG grids, covering the hand region of the sensorimotor cortex. Based on the implanted grids, we generated virtual ECoG strips and compared the decoding accuracy between (a) a single unipolar electrode (Unipolar Electrode), (b) a combination of four unipolar electrodes (Unipolar Strip), (c) a single bipolar pair (Bipolar Pair) and (d) a combination of six bipolar pairs (Bipolar Strip).Main results.We show that four hand gestures can be equally well decoded using 'Unipolar Strips' (mean 67.4 ± 11.7%), 'Bipolar Strips' (mean 66.6 ± 12.1%) and 'Bipolar Pairs' (mean 67.6 ± 9.4%), while 'Unipolar Electrodes' (61.6 ± 5.9%) performed significantly worse compared to 'Unipolar Strips' and 'Bipolar Pairs'.Significance.We conclude that a single bipolar pair is a potential candidate for minimally invasive motor-based BCIs and encourage the use of ECoG as a robust and reliable BCI platform for multi-class movement decoding.
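A bipolar montage of the kind compared here is simply a re-referencing of unipolar channels to pairwise differences. The sketch below, with made-up signals, shows how four unipolar electrodes yield the six bipolar pairs of a 'Bipolar Strip', and why a component common to all electrodes cancels in the bipolar derivations.

```python
import numpy as np
from itertools import combinations

def bipolar_montage(unipolar):
    """Re-reference a (channels x samples) unipolar recording to all pairwise
    bipolar channels: one derived channel per electrode pair (i, j)."""
    pairs = list(combinations(range(unipolar.shape[0]), 2))
    return np.stack([unipolar[i] - unipolar[j] for i, j in pairs]), pairs

# Four unipolar electrodes on a virtual strip -> six bipolar pairs.
# A component shared by all electrodes (e.g. a far-field source) cancels
# in every difference, leaving only electrode-specific activity.
rng = np.random.default_rng(1)
common = rng.standard_normal(1000)              # shared far-field signal
local = 0.1 * rng.standard_normal((4, 1000))    # electrode-specific activity
unipolar = common + local
bipolar, pairs = bipolar_montage(unipolar)
print(bipolar.shape)  # (6, 1000)
```

The variance of the bipolar channels is dominated by the local activity alone, which is one intuition for why a well-placed single bipolar pair can decode as well as a unipolar strip.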
Affiliation(s)
- Maxime Verwoert
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
- Mariska J. Vansteensel
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
- Zachary V. Freudenburg
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
- Erik J. Aarnoutse
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
- Frans S.S. Leijten
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
- Nick F. Ramsey
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands
- Mariana P. Branco
- Department of Neurology and Neurosurgery, University Medical Center Utrecht Brain Center, Utrecht University, Utrecht, The Netherlands

37
Chaudhary U, Chander BS, Ohry A, Jaramillo-Gonzalez A, Lulé D, Birbaumer N. Brain Computer Interfaces for Assisted Communication in Paralysis and Quality of Life. Int J Neural Syst 2021; 31:2130003. [PMID: 34587854 DOI: 10.1142/s0129065721300035] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
The rapid evolution of Brain-Computer Interface (BCI) technology and the exponential growth of BCI literature during the past 20 years is a consequence of increasing computational power and the achievements of statistical learning theory and machine learning since the 1960s. Despite this rapid scientific progress, the range of successful clinical and societal applications remained limited, with some notable exceptions in the rehabilitation of chronic stroke and first steps towards BCI-based assisted verbal communication in paralysis. In this contribution, we focus on the effects of noninvasive and invasive BCI-based verbal communication on the quality of life (QoL) of patients with amyotrophic lateral sclerosis (ALS) in the locked-in state (LIS) and the completely locked-in state (CLIS). Despite a substantial lack of replicated scientific data, this paper complements the existing methodological knowledge and focuses future investigators' attention on (1) Social determinants of QoL and (2) Brain reorganization and behavior. While it is not documented in controlled studies that the good QoL in these patients is a consequence of BCI-based neurorehabilitation, the proposed determinants of QoL might become the theoretical background needed to develop clinically more useful BCI systems and to evaluate the effects of BCI-based communication on QoL for advanced ALS patients and other forms of severe paralysis.
Affiliation(s)
- Ujwal Chaudhary
- Institute of Medical Psychology and Behavioral Neurobiology, University of Tübingen, Tübingen 72076, Germany; ALSVOICE gGmbH, Mössingen 72116, Germany
- Bankim Subhash Chander
- ALSVOICE gGmbH, Mössingen 72116, Germany; Department of Psychiatry and Psychotherapy, Center for Innovative Psychiatric and Psychotherapeutic Research, Central Institute of Mental Health Mannheim, Medical Faculty Mannheim, University of Heidelberg, Mannheim 68159, Germany
- Avi Ohry
- Sackler Faculty of Medicine, Tel Aviv University & Reuth Medical & Rehabilitation Center, Tel Aviv, Israel
- Andres Jaramillo-Gonzalez
- Institute of Medical Psychology and Behavioral Neurobiology, University of Tübingen, Tübingen 72076, Germany
- Niels Birbaumer
- Institute of Medical Psychology and Behavioral Neurobiology, University of Tübingen, Tübingen 72076, Germany; ALSVOICE gGmbH, Mössingen 72116, Germany

38
Wittevrongel B, Holmes N, Boto E, Hill R, Rea M, Libert A, Khachatryan E, Van Hulle MM, Bowtell R, Brookes MJ. Practical real-time MEG-based neural interfacing with optically pumped magnetometers. BMC Biol 2021; 19:158. [PMID: 34376215 PMCID: PMC8356471 DOI: 10.1186/s12915-021-01073-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Accepted: 04/25/2021] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND Brain-computer interfaces decode intentions directly from the human brain with the aim to restore lost functionality, control external devices or augment daily experiences. To combine optimal performance with wide applicability, high-quality brain signals should be captured non-invasively. Magnetoencephalography (MEG) is a potent candidate but currently requires costly and confining recording hardware. The recently developed optically pumped magnetometers (OPMs) promise to overcome this limitation, but are currently untested in the context of neural interfacing. RESULTS In this work, we show that OPM-MEG allows robust single-trial analysis which we exploited in a real-time 'mind-spelling' application yielding an average accuracy of 97.7%. CONCLUSIONS This shows that OPM-MEG can be used to exploit neuro-magnetic brain responses in a practical and flexible manner, and opens up new avenues for a wide range of new neural interface applications in the future.
Affiliation(s)
- Benjamin Wittevrongel
- Laboratory for Neuro- and Psychophysiology, Department of Neurosciences, KU Leuven, Leuven, Belgium; Leuven Institute for Artificial Intelligence (Leuven.AI), Leuven, Belgium; Leuven Brain Institute (LBI), Leuven, Belgium
- Niall Holmes
- Sir Peter Mansfield Imaging Centre, School of Physics and Astronomy, University of Nottingham, Nottingham, UK
- Elena Boto
- Sir Peter Mansfield Imaging Centre, School of Physics and Astronomy, University of Nottingham, Nottingham, UK
- Ryan Hill
- Sir Peter Mansfield Imaging Centre, School of Physics and Astronomy, University of Nottingham, Nottingham, UK
- Molly Rea
- Sir Peter Mansfield Imaging Centre, School of Physics and Astronomy, University of Nottingham, Nottingham, UK
- Arno Libert
- Laboratory for Neuro- and Psychophysiology, Department of Neurosciences, KU Leuven, Leuven, Belgium; Leuven Brain Institute (LBI), Leuven, Belgium
- Elvira Khachatryan
- Laboratory for Neuro- and Psychophysiology, Department of Neurosciences, KU Leuven, Leuven, Belgium; Leuven Brain Institute (LBI), Leuven, Belgium
- Marc M Van Hulle
- Laboratory for Neuro- and Psychophysiology, Department of Neurosciences, KU Leuven, Leuven, Belgium; Leuven Institute for Artificial Intelligence (Leuven.AI), Leuven, Belgium; Leuven Brain Institute (LBI), Leuven, Belgium
- Richard Bowtell
- Sir Peter Mansfield Imaging Centre, School of Physics and Astronomy, University of Nottingham, Nottingham, UK
- Matthew J Brookes
- Sir Peter Mansfield Imaging Centre, School of Physics and Astronomy, University of Nottingham, Nottingham, UK

39
Trumpis M, Chiang CH, Orsborn AL, Bent B, Li J, Rogers JA, Pesaran B, Cogan G, Viventi J. Sufficient sampling for kriging prediction of cortical potential in rat, monkey, and human µECoG. J Neural Eng 2021; 18. [PMID: 33326943 DOI: 10.1088/1741-2552/abd460] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 12/16/2020] [Indexed: 12/22/2022]
Abstract
Objective. Large channel count surface-based electrophysiology arrays (e.g. µECoG) are high-throughput neural interfaces with good chronic stability. Electrode spacing remains ad hoc due to redundancy and nonstationarity of field dynamics. Here, we establish a criterion for electrode spacing based on the expected accuracy of predicting unsampled field potential from sampled sites.Approach. We applied spatial covariance modeling and field prediction techniques based on geospatial kriging to quantify sufficient sampling for thousands of 500 ms µECoG snapshots in human, monkey, and rat. We calculated a probably approximately correct (PAC) spacing based on kriging that would be required to predict µECoG fields at≤10% error for most cases (95% of observations).Main results. Kriging theory accurately explained the competing effects of electrode density and noise on predicting field potential. Across five frequency bands from 4-7 to 75-300 Hz, PAC spacing was sub-millimeter for auditory cortex in anesthetized and awake rats, and posterior superior temporal gyrus in anesthetized human. At 75-300 Hz, sub-millimeter PAC spacing was required in all species and cortical areas.Significance. PAC spacing accounted for the effect of signal-to-noise on prediction quality and was sensitive to the full distribution of non-stationary covariance states. Our results show that µECoG arrays should sample at sub-millimeter resolution for applications in diverse cortical areas and for noise resilience.
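The core kriging step, predicting an unsampled site as a covariance-weighted combination of sampled sites, can be sketched as follows. The exponential covariance model, length scale, and one-dimensional electrode layout are illustrative assumptions, not the fitted parameters of this study.

```python
import numpy as np

def kriging_predict(coords, values, target, length_scale=1.0, nugget=1e-6):
    """Simple kriging: predict the field at `target` from sampled sites,
    assuming an exponential spatial covariance C(d) = exp(-d / length_scale)."""
    diffs = coords[:, None, :] - coords[None, :, :]
    K = np.exp(-np.linalg.norm(diffs, axis=-1) / length_scale)
    K += nugget * np.eye(len(coords))                      # noise / stability term
    k = np.exp(-np.linalg.norm(coords - target, axis=-1) / length_scale)
    weights = np.linalg.solve(K, k)                        # kriging weights
    return weights @ values

# Three electrodes on a line at 0, 1 and 2 mm; predict the potential at 0.5 mm.
coords = np.array([[0.0], [1.0], [2.0]])
values = np.array([1.0, 3.0, 5.0])
estimate = kriging_predict(coords, values, np.array([0.5]))
print(estimate)
```

With a negligible nugget the predictor reproduces the samples at the sampled sites themselves; the prediction error at unsampled sites, as a function of electrode spacing, is the quantity a spacing criterion like the paper's PAC spacing is built on.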
Affiliation(s)
- Michael Trumpis
- Department of Biomedical Engineering, Duke University, Durham, NC 27708, United States of America
- Chia-Han Chiang
- Department of Biomedical Engineering, Duke University, Durham, NC 27708, United States of America
- Amy L Orsborn
- Center for Neural Science, New York University, New York, NY 10003, United States of America; Department of Electrical & Computer Engineering, University of Washington, Seattle, WA 98195, United States of America; Department of Bioengineering, University of Washington, Seattle, WA 98105, United States of America; Washington National Primate Research Center, Seattle, WA 98195, United States of America
- Brinnae Bent
- Department of Biomedical Engineering, Duke University, Durham, NC 27708, United States of America
- Jinghua Li
- Department of Materials Science and Engineering, Northwestern University, Evanston, IL 60208, United States of America; Department of Materials Science and Engineering, The Ohio State University, Columbus, OH 43210, United States of America; Chronic Brain Injury Program, The Ohio State University, Columbus, OH 43210, United States of America
- John A Rogers
- Department of Materials Science and Engineering, Northwestern University, Evanston, IL 60208, United States of America; Simpson Querrey Institute, Northwestern University, Chicago, IL 60611, United States of America; Department of Biomedical Engineering, Northwestern University, Evanston, IL 60208, United States of America; Department of Neurological Surgery, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, United States of America
- Bijan Pesaran
- Center for Neural Science, New York University, New York, NY 10003, United States of America
- Gregory Cogan
- Department of Neurosurgery, Duke School of Medicine, Durham, NC 27710, United States of America; Department of Psychology and Neuroscience, Duke University, Durham, NC 27708, United States of America; Center for Cognitive Neuroscience, Duke University, Durham, NC 27708, United States of America; Duke Comprehensive Epilepsy Center, Duke School of Medicine, Durham, NC 27710, United States of America
- Jonathan Viventi
- Department of Biomedical Engineering, Duke University, Durham, NC 27708, United States of America; Department of Neurosurgery, Duke School of Medicine, Durham, NC 27710, United States of America; Duke Comprehensive Epilepsy Center, Duke School of Medicine, Durham, NC 27710, United States of America; Department of Neurobiology, Duke School of Medicine, Durham, NC 27710, United States of America

40
Wilson GH, Stavisky SD, Willett FR, Avansino DT, Kelemen JN, Hochberg LR, Henderson JM, Druckmann S, Shenoy KV. Decoding spoken English from intracortical electrode arrays in dorsal precentral gyrus. J Neural Eng 2020; 17:066007. [PMID: 33236720 PMCID: PMC8293867 DOI: 10.1088/1741-2552/abbfef] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
OBJECTIVE To evaluate the potential of intracortical electrode array signals for brain-computer interfaces (BCIs) to restore lost speech, we measured the performance of decoders trained to discriminate a comprehensive basis set of 39 English phonemes and to synthesize speech sounds via a neural pattern matching method. We decoded neural correlates of spoken-out-loud words in the 'hand knob' area of precentral gyrus, a step toward the eventual goal of decoding attempted speech from ventral speech areas in patients who are unable to speak. APPROACH Neural and audio data were recorded while two BrainGate2 pilot clinical trial participants, each with two chronically-implanted 96-electrode arrays, spoke 420 different words that broadly sampled English phonemes. Phoneme onsets were identified from audio recordings, and their identities were then classified from neural features consisting of each electrode's binned action potential counts or high-frequency local field potential power. Speech synthesis was performed using the 'Brain-to-Speech' pattern matching method. We also examined two potential confounds specific to decoding overt speech: acoustic contamination of neural signals and systematic differences in labeling different phonemes' onset times. MAIN RESULTS A linear decoder achieved up to 29.3% classification accuracy (chance = 6%) across 39 phonemes, while an RNN classifier achieved 33.9% accuracy. Parameter sweeps indicated that performance did not saturate when adding more electrodes or more training data, and that accuracy improved when utilizing time-varying structure in the data. Microphonic contamination and phoneme onset differences modestly increased decoding accuracy, but could be mitigated by acoustic artifact subtraction and using a neural speech onset marker, respectively. Speech synthesis achieved r = 0.523 correlation between true and reconstructed audio. SIGNIFICANCE The ability to decode speech using intracortical electrode array signals from a nontraditional speech area suggests that placing electrode arrays in ventral speech areas is a promising direction for speech BCIs.
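One of the neural features described here, binned action potential counts following each phoneme onset, can be sketched as below; the spike times, onset, and bin settings are invented for illustration.

```python
import numpy as np

def bin_spike_counts(spike_times, onset, n_bins=10, bin_ms=20):
    """Count spikes in consecutive fixed-width bins after an event onset
    (all times in milliseconds); returns one feature vector per electrode."""
    edges = onset + bin_ms * np.arange(n_bins + 1)
    counts, _ = np.histogram(spike_times, bins=edges)
    return counts

# Hypothetical spike times on one electrode, phoneme onset at 100 ms.
spikes = np.array([105, 112, 131, 158, 159, 240])
features = bin_spike_counts(spikes, onset=100)
print(features)  # [2 1 2 0 0 0 0 1 0 0]
```

Stacking such vectors across electrodes gives the feature matrix a linear decoder or RNN classifier is trained on.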
Affiliation(s)
- Guy H Wilson
- Neurosciences Graduate Program, Stanford University, Stanford, CA, United States of America
- Sergey D Stavisky
- Department of Neurosurgery, Stanford University, Stanford, CA, United States of America
- Wu Tsai Neurosciences Institute and Bio-X Institute, Stanford University, Stanford, CA, United States of America
- Department of Electrical Engineering, Stanford University, Stanford, CA, United States of America
- Francis R Willett
- Department of Neurosurgery, Stanford University, Stanford, CA, United States of America
- Department of Electrical Engineering, Stanford University, Stanford, CA, United States of America
- Howard Hughes Medical Institute at Stanford University, Stanford, CA, United States of America
- Donald T Avansino
- Department of Neurosurgery, Stanford University, Stanford, CA, United States of America
- Jessica N Kelemen
- Department of Neurology, Harvard Medical School, Boston, MA, United States of America
- Leigh R Hochberg
- Department of Neurology, Harvard Medical School, Boston, MA, United States of America
- Center for Neurotechnology and Neurorecovery, Department of Neurology, Massachusetts General Hospital, Boston, MA, United States of America
- VA RR&D Center for Neurorestoration and Neurotechnology, Rehabilitation R&D Service, Providence VA Medical Center, Providence, RI, United States of America
- Carney Institute for Brain Science and School of Engineering, Brown University, Providence, RI, United States of America
- Jaimie M Henderson
- Department of Neurosurgery, Stanford University, Stanford, CA, United States of America
- Wu Tsai Neurosciences Institute and Bio-X Institute, Stanford University, Stanford, CA, United States of America
- Shaul Druckmann
- Wu Tsai Neurosciences Institute and Bio-X Institute, Stanford University, Stanford, CA, United States of America
- Department of Neurobiology, Stanford University, Stanford, CA, United States of America
- Krishna V Shenoy
- Wu Tsai Neurosciences Institute and Bio-X Institute, Stanford University, Stanford, CA, United States of America
- Department of Electrical Engineering, Stanford University, Stanford, CA, United States of America
- Howard Hughes Medical Institute at Stanford University, Stanford, CA, United States of America
- Department of Neurobiology, Stanford University, Stanford, CA, United States of America
- Department of Bioengineering, Stanford University, Stanford, CA, United States of America

41
Dash D, Wisler A, Ferrari P, Davenport EM, Maldjian J, Wang J. MEG Sensor Selection for Neural Speech Decoding. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2020; 8:182320-182337. [PMID: 33204579 PMCID: PMC7668411 DOI: 10.1109/access.2020.3028831] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Direct decoding of speech from the brain is a faster alternative to current electroencephalography (EEG) speller-based brain-computer interfaces (BCI) in providing communication assistance to locked-in patients. Magnetoencephalography (MEG) has recently shown great potential as a non-invasive neuroimaging modality for neural speech decoding, owing in part to its spatial selectivity over other high-temporal resolution devices. Standard MEG systems have a large number of cryogenically cooled channels/sensors (200 - 300) encapsulated within a fixed liquid helium dewar, precluding their use as wearable BCI devices. Fortunately, recently developed optically pumped magnetometers (OPM) do not require cryogens, and have the potential to be wearable and movable making them more suitable for BCI applications. This design is also modular allowing for customized montages to include only the sensors necessary for a particular task. As the number of sensors bears a heavy influence on the cost, size, and weight of MEG systems, minimizing the number of sensors is critical for designing practical MEG-based BCIs in the future. In this study, we sought to identify an optimal set of MEG channels to decode imagined and spoken phrases from the MEG signals. Using a forward selection algorithm with a support vector machine classifier we found that nine optimally located MEG gradiometers provided higher decoding accuracy compared to using all channels. Additionally, the forward selection algorithm achieved similar performance to dimensionality reduction using a stacked-sparse-autoencoder. Analysis of spatial dynamics of speech decoding suggested that both left and right hemisphere sensors contribute to speech decoding. Sensors approximately located near Broca's area were found to be commonly contributing among the higher-ranked sensors across all subjects.
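The forward selection procedure described here, greedily adding the sensor that most improves held-out classification accuracy, can be sketched with a toy classifier. A nearest-centroid classifier stands in for the study's support vector machine, and the data are synthetic, with a single informative channel.

```python
import numpy as np

def nearest_centroid_accuracy(X_train, y_train, X_test, y_test):
    """Held-out accuracy of a nearest-centroid classifier on chosen channels."""
    classes = np.unique(y_train)
    centroids = np.array([X_train[y_train == c].mean(axis=0) for c in classes])
    d = np.linalg.norm(X_test[:, None, :] - centroids[None, :, :], axis=-1)
    return np.mean(classes[d.argmin(axis=1)] == y_test)

def forward_select(X_train, y_train, X_test, y_test, n_keep):
    """Greedily add the channel that most improves held-out accuracy."""
    selected, remaining = [], list(range(X_train.shape[1]))
    for _ in range(n_keep):
        scores = [nearest_centroid_accuracy(X_train[:, selected + [ch]], y_train,
                                            X_test[:, selected + [ch]], y_test)
                  for ch in remaining]
        selected.append(remaining.pop(int(np.argmax(scores))))
    return selected

# Synthetic data: only channel 2 of five carries class information.
rng = np.random.default_rng(0)
y = np.repeat([0, 1], 50)
X = rng.standard_normal((100, 5))
X[:, 2] += 3.0 * y                      # the informative channel
order = forward_select(X[::2], y[::2], X[1::2], y[1::2], n_keep=2)
print(order)
```

The informative channel is selected first; in the study the analogous ranking identified roughly nine gradiometers, often near Broca's area, as sufficient.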
Affiliation(s)
- Debadatta Dash
- Department of Electrical and Computer Engineering, The University of Texas at Austin, Austin, TX 78712, USA
- Department of Neurology, Dell Medical School, The University of Texas at Austin, Austin, TX 78712, USA
- Alan Wisler
- Department of Speech, Language, and Hearing Sciences, University of Texas at Austin, Austin, TX 78712, USA
- Paul Ferrari
- MEG Laboratory, Dell Children's Medical Center, Austin, TX 78723, USA
- Department of Psychology, The University of Texas at Austin, Austin, TX 78712, USA
- Joseph Maldjian
- Department of Radiology, University of Texas at Southwestern, Dallas, TX 75390, USA
- Jun Wang
- Department of Neurology, Dell Medical School, The University of Texas at Austin, Austin, TX 78712, USA
- Department of Speech, Language, and Hearing Sciences, University of Texas at Austin, Austin, TX 78712, USA

42
Gearing M, Kennedy P. Histological Confirmation of Myelinated Neural Filaments Within the Tip of the Neurotrophic Electrode After a Decade of Neural Recordings. Front Hum Neurosci 2020; 14:111. [PMID: 32372930 PMCID: PMC7187752 DOI: 10.3389/fnhum.2020.00111] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Accepted: 03/11/2020] [Indexed: 11/13/2022] Open
Abstract
Aim Electrodes that provide brain-to-machine or brain-to-computer interfacing must survive the lifetime of the person to be considered an acceptable prosthetic. The electrodes may be external, such as electroencephalographic (EEG) electrodes; internal extracortical, such as electrocorticographic (ECoG) electrodes; or intracortical. Methods Most intracortical electrodes are placed close to the neuropil being recorded and do not survive years of recording. However, the Neurotrophic Electrode is placed within the cortex, and the neuropil grows inside and through the hollow tip of the electrode and is thus trapped inside. Highly flexible coiled lead wires minimize the strain on the electrode tip. Histological analysis included immunohistochemical detection of neurofilaments and the absence of gliosis. Results This configuration led to a decade-long recording in a locked-in person. At year nine, the neural activity underwent conditioning experiments, indicating that the neural activity was functional and not noise. This paper presents data on the histological analysis of the tissue inside the electrode tip after 13 years of implantation. Conclusion This paper is a singular example of histological analysis after a decade of recording. The histological analysis laid out herein is strong evidence that the brain can grow neurites into the electrode tip and support recording for a decade. This is profoundly important in the field of brain-to-machine or brain-to-computer interfacing, implying that long-term electrodes should incorporate some means of growing the neuropil into the electrode rather than placing the electrode into the neuropil.
Affiliation(s)
- Marla Gearing
- Laboratory Medicine and Neurology, Department of Pathology, Emory University School of Medicine, Atlanta, GA, United States

43
Dash D, Ferrari P, Wang J. Decoding Imagined and Spoken Phrases From Non-invasive Neural (MEG) Signals. Front Neurosci 2020; 14:290. [PMID: 32317917 PMCID: PMC7154084 DOI: 10.3389/fnins.2020.00290] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2019] [Accepted: 03/13/2020] [Indexed: 11/16/2022] Open
Abstract
Speech production is a hierarchical mechanism involving the synchronization of the brain and the oral articulators, in which intended linguistic concepts are transformed into meaningful sounds. Individuals with locked-in syndrome (fully paralyzed but aware) lose motor ability completely, including articulation and even eye movement. The neural pathway may be the only option to resume a certain level of communication for these patients. Current brain-computer interfaces (BCIs) use patients' visual and attentional correlates to build communication, resulting in a slow communication rate (a few words per minute). Direct decoding of imagined speech from neural signals (and then driving a speech synthesizer) has the potential for a higher communication rate. In this study, we investigated the decoding of five imagined and spoken phrases from single-trial, non-invasive magnetoencephalography (MEG) signals collected from eight adult subjects. Two machine learning algorithms were used. One was an artificial neural network (ANN) with statistical features as the baseline approach. The other was a convolutional neural network (CNN) applied to the spatial, spectral, and temporal features extracted from the MEG signals. Experimental results indicated the possibility of decoding imagined and spoken phrases directly from neuromagnetic signals. The CNN approach was found to be highly effective, with an average decoding accuracy of up to 93% for the imagined and 96% for the spoken phrases.
Affiliation(s)
- Debadatta Dash
- Department of Electrical and Computer Engineering, University of Texas at Austin, Austin, TX, United States
- Department of Neurology, Dell Medical School, University of Texas at Austin, Austin, TX, United States
- Paul Ferrari
- MEG Lab, Dell Children's Medical Center, Austin, TX, United States
- Department of Psychology, University of Texas at Austin, Austin, TX, United States
- Jun Wang
- Department of Neurology, Dell Medical School, University of Texas at Austin, Austin, TX, United States
- Department of Communication Sciences and Disorders, University of Texas at Austin, Austin, TX, United States

44
Annen J, Laureys S, Gosseries O. Brain-computer interfaces for consciousness assessment and communication in severely brain-injured patients. BRAIN-COMPUTER INTERFACES 2020; 168:137-152. [DOI: 10.1016/b978-0-444-63934-9.00011-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
45
Abstract
Human brain function research has evolved dramatically in the last decades. In this chapter, the role of modern methods of recording brain activity in understanding human brain function is explained. Current knowledge of brain function relevant to brain-computer interface (BCI) research is detailed, with an emphasis on the motor system, which provides an exceptional level of detail for decoding intended or attempted movements in paralyzed beneficiaries of BCI technology and translating them into computer-mediated actions. The BCI technologies that stand to benefit most from the detailed organization of the human cortex are, and for the foreseeable future are likely to be, reliant on intracranial electrodes. These evolving technologies are expected to enable severely paralyzed people to regain the faculty of movement and speech in the coming decades.
Affiliation(s)
- Nick F Ramsey
- Brain Center, University Medical Center Utrecht, Utrecht, The Netherlands

46
Abstract
Locked-in syndrome (LIS) is characterized by an inability to move or speak in the presence of intact cognition and can be caused by brainstem trauma or neuromuscular disease. Quality of life (QoL) in LIS is strongly impaired by the inability to communicate, which cannot always be remedied by traditional augmentative and alternative communication (AAC) solutions if residual muscle activity is insufficient to control the AAC device. Brain-computer interfaces (BCIs) may offer a solution by employing the person's neural signals instead of relying on muscle activity. Here, we review the latest communication BCI research using noninvasive signal acquisition approaches (electroencephalography, functional magnetic resonance imaging, functional near-infrared spectroscopy) and subdural and intracortical implanted electrodes, and we discuss current efforts to translate research knowledge into usable BCI-enabled communication solutions that aim to improve the QoL of individuals with LIS.
47
Stavisky SD, Willett FR, Wilson GH, Murphy BA, Rezaii P, Avansino DT, Memberg WD, Miller JP, Kirsch RF, Hochberg LR, Ajiboye AB, Druckmann S, Shenoy KV, Henderson JM. Neural ensemble dynamics in dorsal motor cortex during speech in people with paralysis. eLife 2019; 8:e46015. [PMID: 31820736 PMCID: PMC6954053 DOI: 10.7554/elife.46015] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Accepted: 11/14/2019] [Indexed: 01/20/2023] Open
Abstract
Speaking is a sensorimotor behavior whose neural basis is difficult to study with single neuron resolution due to the scarcity of human intracortical measurements. We used electrode arrays to record from the motor cortex 'hand knob' in two people with tetraplegia, an area not previously implicated in speech. Neurons modulated during speaking and during non-speaking movements of the tongue, lips, and jaw. This challenges whether the conventional model of a 'motor homunculus' division by major body regions extends to the single-neuron scale. Spoken words and syllables could be decoded from single trials, demonstrating the potential of intracortical recordings for brain-computer interfaces to restore speech. Two neural population dynamics features previously reported for arm movements were also present during speaking: a component that was mostly invariant across initiating different words, followed by rotatory dynamics during speaking. This suggests that common neural dynamical motifs may underlie movement of arm and speech articulators.
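The single-trial word decoding reported above can be illustrated with a toy sketch: a leave-one-out nearest-centroid classifier applied to per-trial firing-rate vectors. Everything here (unit and trial counts, the synthetic tuning model, and the classifier choice) is an illustrative assumption, not the authors' pipeline.

```python
import numpy as np

rng = np.random.default_rng(0)
n_units, n_words = 96, 5
n_trials = 40

# Synthetic tuning: each word evokes a distinct mean firing-rate pattern
# across the recorded units.
tuning = rng.gamma(shape=2.0, scale=5.0, size=(n_words, n_units))
labels = np.repeat(np.arange(n_words), n_trials // n_words)
rates = tuning[labels] + rng.normal(0.0, 1.0, size=(n_trials, n_units))

# Leave-one-out nearest-centroid decoding of the spoken word.
correct = 0
for i in range(n_trials):
    mask = np.arange(n_trials) != i
    centroids = np.stack([rates[mask][labels[mask] == w].mean(axis=0)
                          for w in range(n_words)])
    pred = np.argmin(np.linalg.norm(centroids - rates[i], axis=1))
    correct += int(pred == labels[i])
accuracy = correct / n_trials
print(f"single-trial decoding accuracy: {accuracy:.2f}")
```

With well-separated synthetic tuning, the toy decoder is near-perfect; with real intracortical data, cross-validated accuracy depends on tuning strength and trial counts.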
Affiliation(s)
- Sergey D Stavisky
- Department of Neurosurgery, Stanford University, Stanford, United States
- Department of Electrical Engineering, Stanford University, Stanford, United States
- Francis R Willett
- Department of Neurosurgery, Stanford University, Stanford, United States
- Department of Electrical Engineering, Stanford University, Stanford, United States
- Guy H Wilson
- Neurosciences Program, Stanford University, Stanford, United States
- Brian A Murphy
- Department of Biomedical Engineering, Case Western Reserve University, Cleveland, United States
- FES Center, Rehab R&D Service, Louis Stokes Cleveland Department of Veterans Affairs Medical Center, Cleveland, United States
- Paymon Rezaii
- Department of Neurosurgery, Stanford University, Stanford, United States
- William D Memberg
- Department of Biomedical Engineering, Case Western Reserve University, Cleveland, United States
- FES Center, Rehab R&D Service, Louis Stokes Cleveland Department of Veterans Affairs Medical Center, Cleveland, United States
- Jonathan P Miller
- FES Center, Rehab R&D Service, Louis Stokes Cleveland Department of Veterans Affairs Medical Center, Cleveland, United States
- Department of Neurosurgery, University Hospitals Cleveland Medical Center, Cleveland, United States
- Robert F Kirsch
- Department of Biomedical Engineering, Case Western Reserve University, Cleveland, United States
- FES Center, Rehab R&D Service, Louis Stokes Cleveland Department of Veterans Affairs Medical Center, Cleveland, United States
- Leigh R Hochberg
- VA RR&D Center for Neurorestoration and Neurotechnology, Rehabilitation R&D Service, Providence VA Medical Center, Providence, United States
- Center for Neurotechnology and Neurorecovery, Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, United States
- School of Engineering and Robert J. & Nandy D. Carney Institute for Brain Science, Brown University, Providence, United States
- A Bolu Ajiboye
- Department of Biomedical Engineering, Case Western Reserve University, Cleveland, United States
- FES Center, Rehab R&D Service, Louis Stokes Cleveland Department of Veterans Affairs Medical Center, Cleveland, United States
- Shaul Druckmann
- Department of Neurobiology, Stanford University, Stanford, United States
- Krishna V Shenoy
- Department of Electrical Engineering, Stanford University, Stanford, United States
- Department of Neurobiology, Stanford University, Stanford, United States
- Department of Bioengineering, Stanford University, Stanford, United States
- Howard Hughes Medical Institute, Stanford University, Stanford, United States
- Wu Tsai Neurosciences Institute, Stanford University, Stanford, United States
- Bio-X Program, Stanford University, Stanford, United States
- Jaimie M Henderson
- Department of Neurosurgery, Stanford University, Stanford, United States
- Wu Tsai Neurosciences Institute, Stanford University, Stanford, United States
- Bio-X Program, Stanford University, Stanford, United States
48
Loza CA, Reddy CG, Akella S, Príncipe JC. Discrimination of Movement-Related Cortical Potentials Exploiting Unsupervised Learned Representations From ECoGs. Front Neurosci 2019; 13:1248. [PMID: 31824249 PMCID: PMC6882771 DOI: 10.3389/fnins.2019.01248] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Accepted: 11/05/2019] [Indexed: 11/13/2022] Open
Abstract
Brain-computer interfaces (BCIs) aim to bypass the peripheral nervous system by linking the brain to external devices via successful modeling of decoding mechanisms. BCIs based on electrocorticography (ECoG) represent a viable compromise between clinical practicality, spatial resolution, and signal quality when it comes to extracellular electrical potentials from local neuronal assemblies. Classic analysis of ECoG traces usually falls under the umbrella of time-frequency decompositions, with adaptations of Fourier analysis and wavelets as its most prominent variants. However, analyzing such high-dimensional, multivariate time series demands specialized signal processing and neurophysiological principles. We propose a generative model for single-channel ECoG that fully characterizes reoccurring, rhythm-specific neuromodulations as weighted activations of prototypical templates over time. The set of timings, weights, and indexes comprises a temporal marked point process (TMPP) that accesses a set of bases from vector spaces of different dimensions: a dictionary. The shallow nature of the model admits the equivalence between latent variables and representations; learning the model parameters is therefore a case of unsupervised representation learning. We exploit principles of minimum description length (MDL) encoding to yield a data-driven framework in which prototypical neuromodulations (not restricted to a particular duration) can be estimated alongside the timings and features of the TMPP. We validate the proposed methodology on discrimination of movement-related tasks using 32-electrode grids implanted in the frontal cortex of six epileptic subjects. We show that the learned representations from the high-gamma band (85–145 Hz) are not only interpretable but also discriminant in a lower-dimensional space. The results also underscore the practicality of the algorithm (two main hyperparameters that can be readily set via neurophysiology) and emphasize the need for principled, interpretable representation learning to model encoding mechanisms in the brain.
Affiliation(s)
- Carlos A. Loza
- Department of Mathematics, Universidad San Francisco de Quito, Quito, Ecuador
- Instituto de Neurociencias, Universidad San Francisco de Quito, Quito, Ecuador
- Correspondence: Carlos A. Loza
- Chandan G. Reddy
- Department of Neurosurgery, University of Iowa, Iowa City, IA, United States
- Department of Neurosurgery, University of Florida, Gainesville, FL, United States
- Computational NeuroEngineering Lab, Electrical and Computer Engineering Department, University of Florida, Gainesville, FL, United States
- Shailaja Akella
- Computational NeuroEngineering Lab, Electrical and Computer Engineering Department, University of Florida, Gainesville, FL, United States
- José C. Príncipe
- Computational NeuroEngineering Lab, Electrical and Computer Engineering Department, University of Florida, Gainesville, FL, United States
49
Herff C, Diener L, Angrick M, Mugler E, Tate MC, Goldrick MA, Krusienski DJ, Slutzky MW, Schultz T. Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices. Front Neurosci 2019; 13:1267. [PMID: 31824257 PMCID: PMC6882773 DOI: 10.3389/fnins.2019.01267] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2019] [Accepted: 11/07/2019] [Indexed: 12/17/2022] Open
Abstract
Neural interfaces that directly produce intelligible speech from brain activity would allow people with severe impairment from neurological disorders to communicate more naturally. Here, we record neural population activity in motor, premotor and inferior frontal cortices during speech production using electrocorticography (ECoG) and show that ECoG signals alone can be used to generate intelligible speech output that can preserve conversational cues. To produce speech directly from neural data, we adapted a method from the field of speech synthesis called unit selection, in which units of speech are concatenated to form audible output. In our approach, which we call Brain-To-Speech, we chose subsequent units of speech based on the measured ECoG activity to generate audio waveforms directly from the neural recordings. Brain-To-Speech employed the user's own voice to generate speech that sounded very natural and included features such as prosody and accentuation. By investigating the brain areas involved in speech production separately, we found that speech motor cortex provided more information for the reconstruction process than the other cortical areas.
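The unit-selection idea behind Brain-To-Speech can be sketched minimally: for each frame of neural features, pick the training unit whose neural features are closest, then concatenate that unit's stored audio. The feature dimensions, inventory size, and plain nearest-neighbor search below are illustrative assumptions on synthetic data; the published system additionally accounts for unit transitions and draws on the participant's own recorded voice.

```python
import numpy as np

rng = np.random.default_rng(1)
n_units, n_feat, unit_len = 200, 64, 80  # inventory size, neural features, audio samples per unit

# Training inventory: paired (neural feature vector, audio unit) examples.
train_feats = rng.normal(size=(n_units, n_feat))
train_audio = rng.normal(size=(n_units, unit_len))

def brain_to_speech(ecog_frames: np.ndarray) -> np.ndarray:
    """For each neural frame, select the closest training unit and
    concatenate the selected units' audio into one waveform."""
    out = []
    for frame in ecog_frames:
        idx = np.argmin(np.linalg.norm(train_feats - frame, axis=1))
        out.append(train_audio[idx])
    return np.concatenate(out)

# Decode a 10-frame utterance: output is 10 concatenated audio units.
test_frames = rng.normal(size=(10, n_feat))
waveform = brain_to_speech(test_frames)
print(waveform.shape)  # (800,)
```

Because output audio is stitched from real recorded units, this family of methods can preserve speaker identity and prosody, at the cost of audible joins between units.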
Affiliation(s)
- Christian Herff
- School of Mental Health & Neuroscience, Maastricht University, Maastricht, Netherlands
- Cognitive Systems Lab, University of Bremen, Bremen, Germany
- Lorenz Diener
- Cognitive Systems Lab, University of Bremen, Bremen, Germany
- Miguel Angrick
- Cognitive Systems Lab, University of Bremen, Bremen, Germany
- Emily Mugler
- Department of Neurology, Northwestern University, Chicago, IL, United States
- Matthew C. Tate
- Department of Neurosurgery, Northwestern University, Chicago, IL, United States
- Matthew A. Goldrick
- Department of Linguistics, Northwestern University, Chicago, IL, United States
- Dean J. Krusienski
- Biomedical Engineering Department, Virginia Commonwealth University, Richmond, VA, United States
- Marc W. Slutzky
- Department of Neurology, Northwestern University, Chicago, IL, United States
- Department of Physiology, Northwestern University, Chicago, IL, United States
- Department of Physical Medicine & Rehabilitation, Northwestern University, Chicago, IL, United States
- Tanja Schultz
- Cognitive Systems Lab, University of Bremen, Bremen, Germany
50
Salari E, Freudenburg ZV, Branco MP, Aarnoutse EJ, Vansteensel MJ, Ramsey NF. Classification of Articulator Movements and Movement Direction from Sensorimotor Cortex Activity. Sci Rep 2019; 9:14165. [PMID: 31578420 PMCID: PMC6775133 DOI: 10.1038/s41598-019-50834-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Accepted: 09/11/2019] [Indexed: 12/21/2022] Open
Abstract
For people with severe paralysis, communication can be difficult or nearly impossible. Brain-computer interfaces (BCIs) are being developed to assist these people with communication by using their brain activity to control a computer without any muscle activity. To benefit the development of BCIs that employ neural activity related to speech, we investigated whether neural activity patterns related to different articulator movements can be distinguished from each other. Using electrocorticography (ECoG), we recorded the neural activity related to different articulator movements in four epilepsy patients and classified which articulator participants moved based on sensorimotor cortex activity patterns. The same was done for different movement directions of a single articulator, the tongue. In both experiments highly accurate classification was obtained: on average 92% for different articulators and 85% for different tongue directions. Furthermore, the data show that only a small part of the sensorimotor cortex (ca. 1 cm²) is needed for classification. We show that recordings from small parts of the sensorimotor cortex contain information about different articulator movements that might be used for BCI control. Our results are of interest for BCI systems that aim to decode neural activity related to (actual or attempted) movements from a contained cortical area.
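The kind of pipeline this abstract describes (per-electrode high-frequency-band power feeding a classifier) can be sketched on synthetic data. The band edges, sampling rate, class-to-electrode mapping, and nearest-centroid classifier below are assumptions for illustration, not the authors' exact methods.

```python
import numpy as np

fs, n_trials, n_elec, n_samp = 512.0, 60, 16, 1024
rng = np.random.default_rng(2)

def hfb_power(trials: np.ndarray) -> np.ndarray:
    """trials: (n_trials, n_electrodes, n_samples) -> mean 65-95 Hz power per electrode."""
    freqs = np.fft.rfftfreq(n_samp, 1.0 / fs)
    band = (freqs >= 65) & (freqs <= 95)
    spec = np.abs(np.fft.rfft(trials, axis=-1)) ** 2
    return spec[..., band].mean(axis=-1)

# Synthetic trials: three articulator classes (e.g. lips / tongue / jaw),
# each driving an 80 Hz rhythm on a different trio of electrodes.
labels = np.repeat([0, 1, 2], n_trials // 3)
trials = rng.normal(size=(n_trials, n_elec, n_samp))
carrier = np.sin(2 * np.pi * 80.0 * np.arange(n_samp) / fs)
for i, lab in enumerate(labels):
    trials[i, lab * 3:lab * 3 + 3] += 2.0 * carrier

X = hfb_power(trials)

# Leave-one-out nearest-centroid classification on the band-power features.
correct = 0
for i in range(n_trials):
    mask = np.arange(n_trials) != i
    cents = np.stack([X[mask][labels[mask] == c].mean(axis=0) for c in range(3)])
    correct += int(np.argmin(np.linalg.norm(cents - X[i], axis=1)) == labels[i])
acc = correct / n_trials
print(f"leave-one-out accuracy: {acc:.2f}")
```

Note that only the electrodes carrying class-specific power matter for the decision, mirroring the paper's finding that a small cortical patch can suffice.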
Affiliation(s)
- E Salari
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- Z V Freudenburg
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- M P Branco
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- E J Aarnoutse
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- M J Vansteensel
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands
- N F Ramsey
- UMC Utrecht Brain Center, Department of Neurology and Neurosurgery, University Medical Center Utrecht, Utrecht, The Netherlands