1. Chen J, Chen X, Wang R, Le C, Khalilian-Gourtani A, Jensen E, Dugan P, Doyle W, Devinsky O, Friedman D, Flinker A, Wang Y. Subject-Agnostic Transformer-Based Neural Speech Decoding from Surface and Depth Electrode Signals. bioRxiv 2024:2024.03.11.584533. PMID: 38559163; PMCID: PMC10980022; DOI: 10.1101/2024.03.11.584533.
Abstract
Objective This study investigates speech decoding from neural signals captured by intracranial electrodes. Most prior work handles only electrodes arranged on a 2D grid (i.e., an electrocorticographic or ECoG array) and data from a single patient. We aim to design a deep-learning model architecture that can accommodate both surface (ECoG) and depth (stereotactic EEG, or sEEG) electrodes. The architecture should allow training on data from multiple participants with large variability in electrode placements, and the trained model should perform well on participants unseen during training. Approach We propose a novel transformer-based model architecture, SwinTW, that can work with arbitrarily positioned electrodes by leveraging their 3D locations on the cortex rather than their positions on a 2D grid. We train subject-specific models using data from a single participant, as well as multi-patient models that exploit data from multiple participants. Main Results The subject-specific models using only low-density 8×8 ECoG data achieved a high Pearson correlation coefficient (PCC) between the decoded and ground-truth spectrograms (PCC = 0.817) across N = 43 participants, outperforming our prior convolutional ResNet model and the 3D Swin transformer model. Incorporating the additional strip, depth, and grid electrodes available in each participant (N = 39) led to further improvement (PCC = 0.838). For participants with only sEEG electrodes (N = 9), subject-specific models still achieved comparable performance, with an average PCC = 0.798. The multi-subject models achieved high performance on unseen participants, with an average PCC = 0.765 in leave-one-out cross-validation. Significance The proposed SwinTW decoder enables future speech neuroprostheses to utilize any electrode placement that is clinically optimal or feasible for a particular participant, including the use of depth electrodes alone, which are more routinely implanted in chronic neurosurgical procedures. Importantly, the generalizability of the multi-patient models suggests that such a model can be applied to new patients who do not have paired acoustic and neural data, providing an advance in neuroprostheses for people with speech disabilities, for whom collecting acoustic-neural training data is not feasible.
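
The headline metric in this abstract is the Pearson correlation between decoded and ground-truth spectrograms. A minimal sketch of how such a score can be computed is below; the function name, array shapes, and random stand-in data are illustrative assumptions, not the authors' evaluation code.

```python
import numpy as np

def spectrogram_pcc(pred: np.ndarray, target: np.ndarray) -> float:
    """Pearson correlation coefficient between a decoded and a
    ground-truth spectrogram, computed over all time-frequency bins."""
    p = pred.ravel() - pred.mean()
    t = target.ravel() - target.mean()
    return float(np.dot(p, t) / (np.linalg.norm(p) * np.linalg.norm(t)))

# Random data standing in for model output and ground truth.
rng = np.random.default_rng(0)
target = rng.standard_normal((100, 128))               # (time, freq_bins)
pred = target + 0.5 * rng.standard_normal((100, 128))  # noisy "decoding"
print(f"PCC = {spectrogram_pcc(pred, target):.3f}")
```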

2. Maskeliūnas R, Damaševičius R, Kulikajevas A, Pribuišis K, Uloza V. Alaryngeal Speech Enhancement for Noisy Environments Using a Pareto Denoising Gated LSTM. J Voice 2024:S0892-1997(24)00228-5. PMID: 39107213; DOI: 10.1016/j.jvoice.2024.07.016.
Abstract
Loss of the larynx significantly alters natural voice production, requiring alternative communication modalities and rehabilitation methods to restore speech intelligibility and improve the quality of life of affected individuals. This paper explores advances in alaryngeal speech enhancement to improve signal quality and reduce background noise, focusing on individuals who have undergone laryngectomy. In this study, speech samples were obtained from 23 Lithuanian males who had undergone laryngectomy with secondary implantation of a tracheoesophageal prosthesis (TEP). A Pareto-optimized gated long short-term memory (LSTM) network was trained on tracheoesophageal speech data to capture complex temporal dependencies and contextual information in the speech signals. The system was able to distinguish actual speech from various forms of noise and artifacts, resulting in a 25% drop in the mean signal-to-noise ratio compared with other approaches. Acoustic analysis showed that the system significantly decreased the proportion of unvoiced frames from 40% to 10% while maintaining a stable proportion of voiced speech frames and stable average voicing evidence in voiced frames, indicating that the approach selectively attenuates noise and undesired speech artifacts while preserving important speech information.
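
The core model named in this abstract is a gated LSTM that separates speech from noise. Below is a minimal PyTorch sketch of one common formulation, an LSTM that predicts a sigmoid mask over spectrogram bins; the layer sizes, masking objective, and all hyperparameters are illustrative assumptions, not the Pareto-optimized configuration from the paper.

```python
import torch
import torch.nn as nn

class GatedLSTMDenoiser(nn.Module):
    """Sketch of an LSTM speech denoiser: predict a per-bin sigmoid
    gate and apply it to noisy magnitude-spectrogram frames."""

    def __init__(self, n_freq: int = 257, hidden: int = 256):
        super().__init__()
        self.lstm = nn.LSTM(n_freq, hidden, num_layers=2, batch_first=True)
        self.gate = nn.Sequential(nn.Linear(hidden, n_freq), nn.Sigmoid())

    def forward(self, noisy: torch.Tensor) -> torch.Tensor:
        # noisy: (batch, time, n_freq) magnitude spectrogram
        h, _ = self.lstm(noisy)
        mask = self.gate(h)   # gate in [0, 1] per time-frequency bin
        return mask * noisy   # attenuate noise-dominated bins

model = GatedLSTMDenoiser()
frames = torch.rand(4, 100, 257)  # dummy batch of noisy spectrograms
print(model(frames).shape)        # torch.Size([4, 100, 257])
```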
Affiliation(s)
- Rytis Maskeliūnas: Centre of Real Time Computer Systems, Kaunas University of Technology, Kaunas, Lithuania
- Robertas Damaševičius: Centre of Real Time Computer Systems, Kaunas University of Technology, Kaunas, Lithuania
- Audrius Kulikajevas: Centre of Real Time Computer Systems, Kaunas University of Technology, Kaunas, Lithuania
- Kipras Pribuišis: Department of Otolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
- Virgilijus Uloza: Department of Otolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania

3. Wang R, Chen ZS. Large-scale foundation models and generative AI for BigData neuroscience. Neurosci Res 2024:S0168-0102(24)00075-0. PMID: 38897235; DOI: 10.1016/j.neures.2024.06.003.
Abstract
Recent advances in machine learning have led to revolutionary breakthroughs in computer games, image and natural language understanding, and scientific discovery. Foundation models and large language models (LLMs) have recently achieved human-like intelligence thanks to BigData. With the help of self-supervised learning (SSL) and transfer learning, these models may potentially reshape the landscape of neuroscience research and make a significant impact on the future. Here we present a mini-review of recent advances in foundation models and generative AI models as well as their applications in neuroscience, including natural language and speech, semantic memory, brain-machine interfaces (BMIs), and data augmentation. We argue that this paradigm-shifting framework will open new avenues for many neuroscience research directions, and we discuss the accompanying challenges and opportunities.
Affiliation(s)
- Ran Wang: Department of Psychiatry, New York University Grossman School of Medicine, New York, NY 10016, USA
- Zhe Sage Chen: Department of Psychiatry, New York University Grossman School of Medicine, New York, NY 10016, USA; Department of Neuroscience and Physiology, Neuroscience Institute, New York University Grossman School of Medicine, New York, NY 10016, USA; Department of Biomedical Engineering, New York University Tandon School of Engineering, Brooklyn, NY 11201, USA

4. Wu H, Cai C, Ming W, Chen W, Zhu Z, Feng C, Jiang H, Zheng Z, Sawan M, Wang T, Zhu J. Speech decoding using cortical and subcortical electrophysiological signals. Front Neurosci 2024; 18:1345308. PMID: 38486966; PMCID: PMC10937352; DOI: 10.3389/fnins.2024.1345308.
Abstract
Introduction Language impairments often result from severe neurological disorders, driving the development of neural prosthetics that utilize electrophysiological signals to restore comprehensible language. Previous decoding efforts focused primarily on signals from the cerebral cortex, neglecting the potential contributions of subcortical brain structures to speech decoding in brain-computer interfaces. Methods In this study, stereotactic electroencephalography (sEEG) was employed to investigate the role of subcortical structures in speech decoding. Two native Mandarin Chinese speakers undergoing sEEG implantation for epilepsy treatment participated. Participants read Chinese text, and the power of the sEEG signals in the 1-30, 30-70, and 70-150 Hz frequency bands was extracted as the key feature set. A deep learning model based on long short-term memory assessed the contribution of different brain structures to speech decoding, predicting consonant articulatory place, articulatory manner, and tone within single syllables. Results Cortical signals excelled at articulatory place prediction (86.5% accuracy), while cortical and subcortical signals performed similarly for articulatory manner (51.5% vs. 51.7% accuracy). Subcortical signals provided superior tone prediction (58.3% accuracy). The superior temporal gyrus was consistently relevant in speech decoding for both consonants and tone. Combining cortical and subcortical inputs yielded the highest prediction accuracy, especially for tone. Discussion This study underscores the essential roles of both cortical and subcortical structures in different aspects of speech decoding.
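
The feature-extraction step in the Methods, per-band power of the sEEG signal in the 1-30, 30-70, and 70-150 Hz ranges, can be sketched as follows; the Butterworth filter design, filter order, and the use of band-passed signal variance as the power estimate are assumptions for illustration, not the authors' exact pipeline.

```python
import numpy as np
from scipy.signal import butter, sosfiltfilt

BANDS = [(1, 30), (30, 70), (70, 150)]  # Hz, as in the study

def band_powers(seeg: np.ndarray, fs: float = 1000.0) -> np.ndarray:
    """Per-channel power in each frequency band.
    seeg: (channels, samples); returns (channels, len(BANDS))."""
    feats = []
    for lo, hi in BANDS:
        sos = butter(4, [lo, hi], btype="bandpass", fs=fs, output="sos")
        filtered = sosfiltfilt(sos, seeg, axis=-1)
        feats.append(filtered.var(axis=-1))  # variance ~ band power
    return np.stack(feats, axis=-1)

rng = np.random.default_rng(1)
x = rng.standard_normal((64, 2000))  # 64 channels, 2 s at 1 kHz
print(band_powers(x).shape)          # (64, 3)
```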
Affiliation(s)
- Hemmings Wu: Department of Neurosurgery, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China; Clinical Research Center for Neurological Disease of Zhejiang Province, Hangzhou, China
- Chengwei Cai: Department of Neurosurgery, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
- Wenjie Ming: Department of Neurosurgery, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China; Department of Neurology, Epilepsy Center, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
- Wangyu Chen: Department of Neurosurgery, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
- Zhoule Zhu: Department of Neurosurgery, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
- Chen Feng: Department of Neurosurgery, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
- Hongjie Jiang: Department of Neurosurgery, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
- Zhe Zheng: Department of Neurosurgery, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
- Mohamad Sawan: CenBRAIN Lab, School of Engineering, Westlake University, Hangzhou, China
- Ting Wang: School of Foreign Languages, Tongji University, Shanghai, China; Center for Speech and Language Processing, Tongji University, Shanghai, China
- Junming Zhu: Department of Neurosurgery, Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China

5. He Q, Yang Y, Ge P, Li S, Chai X, Luo Z, Zhao J. The brain nebula: minimally invasive brain-computer interface by endovascular neural recording and stimulation. J Neurointerv Surg 2024:jnis-2023-021296. PMID: 38388478; DOI: 10.1136/jnis-2023-021296.
Abstract
A brain-computer interface (BCI) serves as a direct communication channel between brain activity and external devices, typically a computer or robotic limb. Advances in technology have led to the increasing use of intracranial electrical recording or stimulation in the treatment of conditions such as epilepsy, depression, and movement disorders. This indicates that BCIs can offer clinical neurological rehabilitation for patients with disabilities and functional impairments. They also provide a means to restore consciousness and functionality for patients with sequelae from major brain diseases. Whether invasive or non-invasive, the collected cortical or deep signals can be decoded and translated for communication. This review aims to provide an overview of the advantages of endovascular BCIs compared with conventional BCIs, along with insights into the specific anatomical regions under study. Given the rapid progress, we also provide updates on ongoing clinical trials and the prospects for current research involving endovascular electrodes.
Affiliation(s)
- Qiheng He: Department of Neurosurgery, Beijing Tiantan Hospital, Capital Medical University, Beijing, China; Brain Computer Interface Transitional Research Center, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
- Yi Yang: Department of Neurosurgery, Beijing Tiantan Hospital, Capital Medical University, Beijing, China; Brain Computer Interface Transitional Research Center, Beijing Tiantan Hospital, Capital Medical University, Beijing, China; China National Center for Neurological Disorders, Beijing, China; China National Clinical Research Center for Neurological Diseases, Beijing, China; National Research Center for Rehabilitation Technical Aids, Beijing, China; Chinese Institute for Brain Research, Beijing, China; Beijing Institute of Brain Disorders, Beijing, China
- Peicong Ge: Department of Neurosurgery, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
- Sining Li: Tianjin Key Laboratory of Brain Science and Intelligent Rehabilitation, College of Artificial Intelligence, Nankai University, Tianjin, China
- Xiaoke Chai: Brain Computer Interface Transitional Research Center, Beijing Tiantan Hospital, Capital Medical University, Beijing, China
- Zhongqiu Luo: Department of Neurosurgery, Shenzhen Qianhai Shekou Free Trade Zone Hospital, Shenzhen, China
- Jizong Zhao: Department of Neurosurgery, Beijing Tiantan Hospital, Capital Medical University, Beijing, China; China National Center for Neurological Disorders, Beijing, China; China National Clinical Research Center for Neurological Diseases, Beijing, China

6. Khanna AR, Muñoz W, Kim YJ, Kfir Y, Paulk AC, Jamali M, Cai J, Mustroph ML, Caprara I, Hardstone R, Mejdell M, Meszéna D, Zuckerman A, Schweitzer J, Cash S, Williams ZM. Single-neuronal elements of speech production in humans. Nature 2024; 626:603-610. PMID: 38297120; PMCID: PMC10866697; DOI: 10.1038/s41586-023-06982-w.
Abstract
Humans are capable of generating extraordinarily diverse articulatory movement combinations to produce meaningful speech. This ability to orchestrate specific phonetic sequences, together with their syllabification and inflection over subsecond timescales, allows us to produce thousands of word sounds and is a core component of language [1,2]. The fundamental cellular units and constructs by which we plan and produce words during speech, however, remain largely unknown. Here, using acute ultrahigh-density Neuropixels recordings capable of sampling across the cortical column in humans, we discover neurons in the language-dominant prefrontal cortex that encoded detailed information about the phonetic arrangement and composition of planned words during the production of natural speech. These neurons represented the specific order and structure of articulatory events before utterance and reflected the segmentation of phonetic sequences into distinct syllables. They also accurately predicted the phonetic, syllabic and morphological components of upcoming words and showed a temporally ordered dynamic. Collectively, we show how these mixtures of cells are broadly organized along the cortical column and how their activity patterns transition from articulation planning to production. We also demonstrate how these cells reliably track the detailed composition of consonant and vowel sounds during perception and how they distinguish processes specifically related to speaking from those related to listening. Together, these findings reveal a remarkably structured organization and encoding cascade of phonetic representations by prefrontal neurons in humans and demonstrate a cellular process that can support the production of speech.
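
The report that these neurons "accurately predicted the phonetic, syllabic and morphological components of upcoming words" implies a decoding analysis over single-unit activity. The toy sketch below shows the general shape of such an analysis, a cross-validated classifier on trial-by-neuron spike counts; the synthetic data, classifier choice, and class structure are entirely illustrative and are not the authors' method.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Toy stand-in for pre-utterance spike counts: trials x neurons.
rng = np.random.default_rng(2)
n_trials, n_neurons = 200, 50
labels = rng.integers(0, 4, size=n_trials)  # e.g., 4 phonetic classes
counts = rng.poisson(5.0, size=(n_trials, n_neurons)).astype(float)
counts += 0.5 * labels[:, None]             # inject a weak class signal

clf = LogisticRegression(max_iter=1000)
acc = cross_val_score(clf, counts, labels, cv=5).mean()
print(f"cross-validated decoding accuracy: {acc:.2f}")  # chance = 0.25
```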
Affiliation(s)
- Arjun R Khanna: Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- William Muñoz: Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Yoav Kfir: Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Angelique C Paulk: Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Mohsen Jamali: Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Jing Cai: Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Martina L Mustroph: Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Irene Caprara: Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Richard Hardstone: Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Mackenna Mejdell: Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Domokos Meszéna: Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Jeffrey Schweitzer: Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Sydney Cash: Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Ziv M Williams: Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA; Harvard-MIT Division of Health Sciences and Technology, Boston, MA, USA; Harvard Medical School, Program in Neuroscience, Boston, MA, USA

7. Tsunada J, Eliades SJ. Frontal-Auditory Cortical Interactions and Sensory Prediction During Vocal Production in Marmoset Monkeys. bioRxiv 2024:2024.01.28.577656. PMID: 38352422; PMCID: PMC10862695; DOI: 10.1101/2024.01.28.577656.
Abstract
The control of speech and vocal production involves the calculation of error between the intended vocal output and the resulting auditory feedback. Consistent with this model, recent evidence has demonstrated that the auditory cortex is suppressed immediately before and during vocal production, yet is still sensitive to differences between vocal output and altered auditory feedback. This suppression has been suggested to be the result of top-down signals containing information about the intended vocal output, potentially originating from motor or other frontal cortical areas. However, whether such frontal areas are the source of suppressive and predictive signaling to the auditory cortex during vocalization is unknown. Here, we simultaneously recorded neural activity from both the auditory and frontal cortices of marmoset monkeys while they produced self-initiated vocalizations. We found increases in neural activity in both brain areas preceding the onset of vocal production, notably changes in both multi-unit activity and local field potential theta-band power. Connectivity analysis using Granger causality demonstrated that frontal cortex sends directed signaling to the auditory cortex during this pre-vocal period. Importantly, this pre-vocal activity predicted both vocalization-induced suppression of the auditory cortex as well as the acoustics of subsequent vocalizations. These results suggest that frontal cortical areas communicate with the auditory cortex preceding vocal production, with frontal-auditory signals that may reflect the transmission of sensory prediction information. This interaction between frontal and auditory cortices may contribute to mechanisms that calculate errors between intended and actual vocal outputs during vocal communication.
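
The directed frontal-to-auditory signaling reported here was assessed with Granger causality. A minimal sketch of such a test on two synthetic series is below; the synthetic data, preprocessing, and model order are assumptions, and the study's analysis was run on recorded neural signals rather than this toy example.

```python
import numpy as np
from statsmodels.tsa.stattools import grangercausalitytests

# Synthetic stand-ins for band-limited LFP power: frontal leads auditory.
rng = np.random.default_rng(3)
n = 2000
frontal = rng.standard_normal(n)
auditory = 0.6 * np.roll(frontal, 5) + 0.4 * rng.standard_normal(n)

# Columns are [effect, cause]: test whether the frontal series helps
# predict the auditory series beyond the auditory series' own past.
data = np.column_stack([auditory, frontal])
results = grangercausalitytests(data, maxlag=10, verbose=False)
p_value = results[10][0]["ssr_ftest"][1]  # F-test p-value at lag 10
print(f"p-value (frontal -> auditory, lag 10): {p_value:.3g}")
```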
Affiliation(s)
- Joji Tsunada: Chinese Institute for Brain Research, Beijing, China; Department of Veterinary Medicine, Faculty of Agriculture, Iwate University, Morioka, Iwate, Japan
- Steven J. Eliades: Department of Head and Neck Surgery & Communication Sciences, Duke University School of Medicine, Durham, NC 27710, USA