1
|
Roark CL, Paulon G, Rebaudo G, McHaney JR, Sarkar A, Chandrasekaran B. Individual differences in working memory impact the trajectory of non-native speech category learning. PLoS One 2024; 19:e0297917. [PMID: 38857268 PMCID: PMC11164376 DOI: 10.1371/journal.pone.0297917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Accepted: 01/15/2024] [Indexed: 06/12/2024] Open
Abstract
What is the role of working memory over the course of non-native speech category learning? Prior work has predominantly focused on how working memory might influence learning assessed at a single timepoint. Here, we substantially extend this prior work by examining the role of working memory on speech learning performance over time (i.e., over several months) and leverage a multifaceted approach that provides key insights into how working memory influences learning accuracy, maintenance of knowledge over time, generalization ability, and decision processes. We found that the role of working memory in non-native speech learning depends on the timepoint of learning and whether individuals learned the categories at all. Among learners, across all stages of learning, working memory was associated with higher accuracy as well as faster and slightly more cautious decision making. Further, while learners and non-learners did not have substantially different working memory performance, learners had faster evidence accumulation and more cautious decision thresholds throughout all sessions. Working memory may enhance learning by facilitating rapid category acquisition in initial stages and enabling faster and slightly more careful decision-making strategies that may reduce the overall effort needed to learn. Our results have important implications for developing interventions to improve learning in naturalistic language contexts.
Collapse
Affiliation(s)
- Casey L. Roark
- Communication Science & Disorders, University of Pittsburgh, Pittsburgh, PA, United States of America
- Center for the Neural Basis of Cognition, Pittsburgh, PA, United States of America
| | - Giorgio Paulon
- Statistics and Data Sciences, University of Texas at Austin, Austin, TX, United States of America
| | - Giovanni Rebaudo
- Statistics and Data Sciences, University of Texas at Austin, Austin, TX, United States of America
| | - Jacie R. McHaney
- Communication Science & Disorders, University of Pittsburgh, Pittsburgh, PA, United States of America
| | - Abhra Sarkar
- Statistics and Data Sciences, University of Texas at Austin, Austin, TX, United States of America
| | - Bharath Chandrasekaran
- Communication Science & Disorders, University of Pittsburgh, Pittsburgh, PA, United States of America
- Center for the Neural Basis of Cognition, Pittsburgh, PA, United States of America
| |
Collapse
|
2
|
Gan Z, Zheng L, Wang S, Feng G. Distribution-dependent representations in auditory category learning and generalization. Front Psychol 2023; 14:1132570. [PMID: 37829077 PMCID: PMC10566369 DOI: 10.3389/fpsyg.2023.1132570] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Accepted: 08/31/2023] [Indexed: 10/14/2023] Open
Abstract
A fundamental objective in Auditory Sciences is to understand how people learn to generalize auditory category knowledge in new situations. How we generalize to novel scenarios speaks to the nature of acquired category representations and generalization mechanisms in handling perceptual variabilities and novelty. The dual learning system (DLS) framework proposes that auditory category learning involves an explicit, hypothesis-testing learning system, which is optimal for learning rule-based (RB) categories, and an implicit, procedural-based learning system, which is optimal for learning categories requiring pre-decisional information integration (II) across acoustic dimensions. Although DLS describes distinct mechanisms of two types of category learning, it is yet clear the nature of acquired representations and how we transfer them to new contexts. Here, we conducted three experiments to examine differences between II and RB category representations by examining what acoustic and perceptual novelties and variabilities affect learners' generalization success. Learners can successfully categorize different sets of untrained sounds after only eight blocks of training for both II and RB categories. The category structures and novel contexts differentially modulated the generalization success. The II learners significantly decreased generalization performances when categorizing new items derived from an untrained perceptual area and in a context with more distributed samples. In contrast, RB learners' generalizations are resistant to changes in perceptual regions but are sensitive to changes in sound dispersity. Representational similarity modeling revealed that the generalization in the more dispersed sampling context was accomplished differently by II and RB learners. II learners increased representations of perceptual similarity and decision distance to compensate for the decreased transfer of category representations, whereas the RB learners used a more computational cost strategy by default, computing the decision-bound distance to guide generalization decisions. These results suggest that distinct representations emerged after learning the two types of category structures and using different computations and flexible mechanisms in resolving generalization challenges when facing novel perceptual variability in new contexts. These findings provide new evidence for dissociated representations of auditory categories and reveal novel generalization mechanisms in resolving variabilities to maintain perceptual constancy.
Collapse
Affiliation(s)
- Zhenzhong Gan
- Philosophy and Social Science Laboratory of Reading and Development in Children and Adolescents (South China Normal University), Ministry of Education, Guangzhou, Guangdong, China
- Guangdong Provincial Key Laboratory of Mental Health and Cognitive Science, South China Normal University, Guangzhou, Guangdong, China
- School of Psychology, South China Normal University, Guangzhou, Guangdong, China
| | - Lurong Zheng
- Philosophy and Social Science Laboratory of Reading and Development in Children and Adolescents (South China Normal University), Ministry of Education, Guangzhou, Guangdong, China
- Guangdong Provincial Key Laboratory of Mental Health and Cognitive Science, South China Normal University, Guangzhou, Guangdong, China
- School of Psychology, South China Normal University, Guangzhou, Guangdong, China
| | - Suiping Wang
- Philosophy and Social Science Laboratory of Reading and Development in Children and Adolescents (South China Normal University), Ministry of Education, Guangzhou, Guangdong, China
| | - Gangyi Feng
- Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Shatin, Hong Kong SAR, China
- Brain and Mind Institute, The Chinese University of Hong Kong, Shatin, Hong Kong SAR, China
| |
Collapse
|
3
|
Gabay Y, Roark CL, Holt LL. Impaired and Spared Auditory Category Learning in Developmental Dyslexia. Psychol Sci 2023; 34:468-480. [PMID: 36791783 DOI: 10.1177/09567976231151581] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023] Open
Abstract
Categorization has a deep impact on behavior, but whether category learning is served by a single system or multiple systems remains debated. Here, we designed two well-equated nonspeech auditory category learning challenges to draw on putative procedural (information-integration) versus declarative (rule-based) learning systems among adult Hebrew-speaking control participants and individuals with dyslexia, a language disorder that has been linked to a selective disruption in the procedural memory system and in which phonological deficits are ubiquitous. We observed impaired information-integration category learning and spared rule-based category learning in the dyslexia group compared with the neurotypical group. Quantitative model-based analyses revealed reduced use of, and slower shifting to, optimal procedural-based strategies in dyslexia with hypothesis-testing strategy use on par with control participants. The dissociation is consistent with multiple category learning systems and points to the possibility that procedural learning inefficiencies across categories defined by complex, multidimensional exemplars may result in difficulty in phonetic category acquisition in dyslexia.
Collapse
Affiliation(s)
- Yafit Gabay
- Department of Special Education and the Edmond J. Safra Brain Research Center for the Study of Learning Disabilities, University of Haifa
| | - Casey L Roark
- Department of Communication Science and Disorders, Center for the Neural Basis of Cognition, University of Pittsburgh
| | - Lori L Holt
- Department of Psychology, Neuroscience Institute, Center for the Neural Basis of Cognition, Carnegie Mellon University
| |
Collapse
|
4
|
Abstract
The human brain exhibits the remarkable ability to categorize speech sounds into distinct, meaningful percepts, even in challenging tasks like learning non-native speech categories in adulthood and hearing speech in noisy listening conditions. In these scenarios, there is substantial variability in perception and behavior, both across individual listeners and individual trials. While there has been extensive work characterizing stimulus-related and contextual factors that contribute to variability, recent advances in neuroscience are beginning to shed light on another potential source of variability that has not been explored in speech processing. Specifically, there are task-independent, moment-to-moment variations in neural activity in broadly-distributed cortical and subcortical networks that affect how a stimulus is perceived on a trial-by-trial basis. In this review, we discuss factors that affect speech sound learning and moment-to-moment variability in perception, particularly arousal states—neurotransmitter-dependent modulations of cortical activity. We propose that a more complete model of speech perception and learning should incorporate subcortically-mediated arousal states that alter behavior in ways that are distinct from, yet complementary to, top-down cognitive modulations. Finally, we discuss a novel neuromodulation technique, transcutaneous auricular vagus nerve stimulation (taVNS), which is particularly well-suited to investigating causal relationships between arousal mechanisms and performance in a variety of perceptual tasks. Together, these approaches provide novel testable hypotheses for explaining variability in classically challenging tasks, including non-native speech sound learning.
Collapse
|
5
|
McHaney JR, Tessmer R, Roark CL, Chandrasekaran B. Working memory relates to individual differences in speech category learning: Insights from computational modeling and pupillometry. BRAIN AND LANGUAGE 2021; 222:105010. [PMID: 34454285 DOI: 10.1016/j.bandl.2021.105010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 07/26/2021] [Accepted: 08/10/2021] [Indexed: 05/27/2023]
Abstract
Across two experiments, we examine the relationship between individual differences in working memory (WM) and the acquisition of non-native speech categories in adulthood. While WM is associated with individual differences in a variety of learning tasks, successful acquisition of speech categories is argued to be contingent on WM-independent procedural-learning mechanisms. Thus, the role of WM in speech category learning is unclear. In Experiment 1, we show that individuals with higher WM acquire non-native speech categories faster and to a greater extent than those with lower WM. In Experiment 2, we replicate these results and show that individuals with higher WM use more optimal, procedural-based learning strategies and demonstrate more distinct speech-evoked pupillary responses for correct relative to incorrect trials. We propose that higher WM may allow for greater stimulus-related attention, resulting in more robust representations and optimal learning strategies. We discuss implications for neurobiological models of speech category learning.
Collapse
Affiliation(s)
- Jacie R McHaney
- Department of Communication Science and Disorders, University of Pittsburgh, United States
| | - Rachel Tessmer
- Department of Speech, Language, and Hearing Sciences, University of Texas at Austin, United States
| | - Casey L Roark
- Department of Communication Science and Disorders, University of Pittsburgh, United States; Center for the Neural Basis of Cognition, Pittsburgh, PA, United States
| | - Bharath Chandrasekaran
- Department of Communication Science and Disorders, University of Pittsburgh, United States.
| |
Collapse
|
6
|
Feng G, Li Y, Hsu SM, Wong PC, Chou TL, Chandrasekaran B. Emerging native-similar neural representations underlie non-native speech category learning success. NEUROBIOLOGY OF LANGUAGE (CAMBRIDGE, MASS.) 2021; 2:280-307. [PMID: 34368775 PMCID: PMC8345815 DOI: 10.1162/nol_a_00035] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
Learning non-native phonetic categories in adulthood is an exceptionally challenging task, characterized by large inter-individual differences in learning speed and outcomes. The neurobiological mechanisms underlying the inter-individual differences in the learning efficacy are not fully understood. Here we examined the extent to which training-induced neural representations of non-native Mandarin tone categories in English listeners (n = 53) are increasingly similar to those of the native listeners (n = 33) who acquired these categories early in infancy. We particularly assessed whether the neural similarities in representational structure between non-native learners and native listeners are robust neuromarkers of inter-individual differences in learning success. Using inter-subject neural representational similarity (IS-NRS) analysis and predictive modeling on two functional magnetic resonance imaging (fMRI) datasets, we examined the neural representational mechanisms underlying speech category learning success. Learners' neural representations that were significantly similar to the native listeners emerged in brain regions mediating speech perception following training; the extent of the emerging neural similarities with native listeners significantly predicted the learning speed and outcome in learners. The predictive power of IS-NRS outperformed models with other neural representational measures. Furthermore, neural representations underlying successful learning are multidimensional but cost-efficient in nature. The degree of the emergent native-similar neural representations was closely related to the robust neural sensitivity to feedback in the frontostriatal network. These findings provide important insights on experience-dependent representational neuroplasticity underlying successful speech learning in adulthood and could be leveraged in designing individualized feedback-based training paradigms that maximize learning efficiency.
Collapse
Affiliation(s)
- Gangyi Feng
- Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China
- Brain and Mind Institute, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China
- Corresponding authors: Gangyi Feng, Ph.D., Brain and Mind Institute, Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China, +852-3943 3190, , Bharath Chandrasekaran, Ph.D., Department of Communication Science and Disorders, University of Pittsburgh 6074 Forbes Tower, Pittsburgh, PA 15260, (412) 383-6565,
| | - Yu Li
- Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China
- Brain and Mind Institute, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China
| | - Shen-Mou Hsu
- Imaging Center for Integrated Body, Mind and Culture Research, National Taiwan University, Taipei 10617, Taiwan
| | - Patrick C.M. Wong
- Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China
- Brain and Mind Institute, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China
| | - Tai-Li Chou
- Imaging Center for Integrated Body, Mind and Culture Research, National Taiwan University, Taipei 10617, Taiwan
- Department of Psychology, National Taiwan University, Taipei 10617, Taiwan
| | - Bharath Chandrasekaran
- Department of Communication Sciences and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
- Corresponding authors: Gangyi Feng, Ph.D., Brain and Mind Institute, Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China, +852-3943 3190, , Bharath Chandrasekaran, Ph.D., Department of Communication Science and Disorders, University of Pittsburgh 6074 Forbes Tower, Pittsburgh, PA 15260, (412) 383-6565,
| |
Collapse
|
7
|
Explicit and implicit memory representations in cross-situational word learning. Cognition 2020; 205:104444. [PMID: 33075677 DOI: 10.1016/j.cognition.2020.104444] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2019] [Revised: 05/13/2020] [Accepted: 08/19/2020] [Indexed: 11/23/2022]
Abstract
What kind of memory representations do word learners use when they learn the meaning of words cross-situationally? This study leverages the measure of the relationship between confidence and performance to explore the nature of memory representations in word learning. In the recognition memory literature, studies have shown that explicit memory can be used when subjects can semantically encode the study material. However, when the study material is chosen to be unverbalizable, implicit memory is used but is presumed to be only detectable under certain experimental conditions. In the current paper, five cross-situational word learning experiments manipulated the type of word referents with varying experimental paradigms that were designed to probe different types of memory under an implicit learning paradigm. When word referents were line drawings of familiar concepts, memory in cross situational learning was explicit. Implicit memory was found where referents were objects that cannot be encoded semantically (e.g., unverbalizable images). These findings have implications for different theoretical perspectives on early word learning, which differ in the extent to which existing semantic category information, as opposed to perceptual information, contributes to the word meaning process.
Collapse
|
8
|
Feng G, Yi HG, Chandrasekaran B. The Role of the Human Auditory Corticostriatal Network in Speech Learning. Cereb Cortex 2020; 29:4077-4089. [PMID: 30535138 DOI: 10.1093/cercor/bhy289] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2018] [Revised: 08/30/2018] [Indexed: 01/26/2023] Open
Abstract
We establish a mechanistic account of how the mature human brain functionally reorganizes to acquire and represent new speech sounds. Native speakers of English learned to categorize Mandarin lexical tone categories produced by multiple talkers using trial-by-trial feedback. We hypothesized that the corticostriatal system is a key intermediary in mediating temporal lobe plasticity and the acquisition of new speech categories in adulthood. We conducted a functional magnetic resonance imaging experiment in which participants underwent a sound-to-category mapping task. Diffusion tensor imaging data were collected, and probabilistic fiber tracking analysis was employed to assay the auditory corticostriatal pathways. Multivariate pattern analysis showed that talker-invariant novel tone category representations emerged in the left superior temporal gyrus (LSTG) within a few hundred training trials. Univariate analysis showed that the putamen, a subregion of the striatum, was sensitive to positive feedback in correctly categorized trials. With learning, functional coupling between the putamen and LSTG increased during error processing. Furthermore, fiber tractography demonstrated robust structural connectivity between the feedback-sensitive striatal regions and the LSTG regions that represent the newly learned tone categories. Our convergent findings highlight a critical role for the auditory corticostriatal circuitry in mediating the acquisition of new speech categories.
Collapse
Affiliation(s)
- Gangyi Feng
- Department of Linguistics and Modern Languages, The Chinese University of Hong Kong, Hong Kong SAR, China.,Brain and Mind Institute, The Chinese University of Hong Kong, Hong Kong SAR, China
| | - Han Gyol Yi
- Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Bharath Chandrasekaran
- Department of Communication Science and Disorders, School of Health and Rehabilitation Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| |
Collapse
|
9
|
Paulon G, Llanos F, Chandrasekaran B, Sarkar A. Bayesian Semiparametric Longitudinal Drift-Diffusion Mixed Models for Tone Learning in Adults. J Am Stat Assoc 2020; 116:1114-1127. [PMID: 34650315 PMCID: PMC8513775 DOI: 10.1080/01621459.2020.1801448] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Revised: 06/10/2020] [Accepted: 07/22/2020] [Indexed: 02/07/2023]
Abstract
Understanding how adult humans learn nonnative speech categories such as tone information has shed novel insights into the mechanisms underlying experience-dependent brain plasticity. Scientists have traditionally examined these questions using longitudinal learning experiments under a multi-category decision making paradigm. Drift-diffusion processes are popular in such contexts for their ability to mimic underlying neural mechanisms. Motivated by these problems, we develop a novel Bayesian semiparametric inverse Gaussian drift-diffusion mixed model for multi-alternative decision making in longitudinal settings. We design a Markov chain Monte Carlo algorithm for posterior computation. We evaluate the method's empirical performances through synthetic experiments. Applied to our motivating longitudinal tone learning study, the method provides novel insights into how the biologically interpretable model parameters evolve with learning, differ between input-response tone combinations, and differ between well and poorly performing adults. supplementary materials for this article, including a standardized description of the materials available for reproducing the work, are available as an online supplement.
Collapse
Affiliation(s)
- Giorgio Paulon
- Department of Statistics and Data Sciences, University of Texas at Austin, Austin, TX
| | - Fernando Llanos
- Department of Linguistics, University of Texas at Austin, Austin, TX
- Department of Communication Science and Disorders, University of Pittsburgh, Pittsburgh, PA
| | - Bharath Chandrasekaran
- Department of Communication Science and Disorders, University of Pittsburgh, Pittsburgh, PA
| | - Abhra Sarkar
- Department of Statistics and Data Sciences, University of Texas at Austin, Austin, TX
| |
Collapse
|
10
|
Olasagasti I, Giraud AL. Integrating prediction errors at two time scales permits rapid recalibration of speech sound categories. eLife 2020; 9:44516. [PMID: 32223894 PMCID: PMC7217692 DOI: 10.7554/elife.44516] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2019] [Accepted: 03/17/2020] [Indexed: 01/01/2023] Open
Abstract
Speech perception presumably arises from internal models of how specific sensory features are associated with speech sounds. These features change constantly (e.g. different speakers, articulation modes etc.), and listeners need to recalibrate their internal models by appropriately weighing new versus old evidence. Models of speech recalibration classically ignore this volatility. The effect of volatility in tasks where sensory cues were associated with arbitrary experimenter-defined categories were well described by models that continuously adapt the learning rate while keeping a single representation of the category. Using neurocomputational modelling we show that recalibration of natural speech sound categories is better described by representing the latter at different time scales. We illustrate our proposal by modeling fast recalibration of speech sounds after experiencing the McGurk effect. We propose that working representations of speech categories are driven both by their current environment and their long-term memory representations. People can distinguish words or syllables even though they may sound different with every speaker. This striking ability reflects the fact that our brain is continually modifying the way we recognise and interpret the spoken word based on what we have heard before, by comparing past experience with the most recent one to update expectations. This phenomenon also occurs in the McGurk effect: an auditory illusion in which someone hears one syllable but sees a person saying another syllable and ends up perceiving a third distinct sound. Abstract models, which provide a functional rather than a mechanistic description of what the brain does, can test how humans use expectations and prior knowledge to interpret the information delivered by the senses at any given moment. Olasagasti and Giraud have now built an abstract model of how brains recalibrate perception of natural speech sounds. By fitting the model with existing experimental data using the McGurk effect, the results suggest that, rather than using a single sound representation that is adjusted with each sensory experience, the brain recalibrates sounds at two different timescales. Over and above slow “procedural” learning, the findings show that there is also rapid recalibration of how different sounds are interpreted. This working representation of speech enables adaptation to changing or noisy environments and illustrates that the process is far more dynamic and flexible than previously thought.
Collapse
Affiliation(s)
- Itsaso Olasagasti
- Department of Basic Neuroscience, University of Geneva, Geneva, Switzerland
| | - Anne-Lise Giraud
- Department of Basic Neuroscience, University of Geneva, Geneva, Switzerland
| |
Collapse
|
11
|
Abstract
Human category learning appears to be supported by dual learning systems. Previous research indicates the engagement of distinct neural systems in learning categories that require selective attention to dimensions versus those that require integration across dimensions. This evidence has largely come from studies of learning across perceptually separable visual dimensions, but recent research has applied dual system models to understanding auditory and speech categorization. Since differential engagement of the dual learning systems is closely related to selective attention to input dimensions, it may be important that acoustic dimensions are quite often perceptually integral and difficult to attend to selectively. We investigated this issue across artificial auditory categories defined by center frequency and modulation frequency acoustic dimensions. Learners demonstrated a bias to integrate across the dimensions, rather than to selectively attend, and the bias specifically reflected a positive correlation between the dimensions. Further, we found that the acoustic dimensions did not equivalently contribute to categorization decisions. These results demonstrate the need to reconsider the assumption that the orthogonal input dimensions used in designing an experiment are indeed orthogonal in perceptual space as there are important implications for category learning.
Collapse
|
12
|
Constraints on learning disjunctive, unidimensional auditory and phonetic categories. Atten Percept Psychophys 2019; 81:958-980. [PMID: 30761500 DOI: 10.3758/s13414-019-01683-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Phonetic categories must be learned, but the processes that allow that learning to unfold are still under debate. The current study investigates constraints on the structure of categories that can be learned and whether these constraints are speech-specific. Category structure constraints are a key difference between theories of category learning, which can roughly be divided into instance-based learning (i.e., exemplar only) and abstractionist learning (i.e., at least partly rule-based or prototype-based) theories. Abstractionist theories can relatively easily accommodate constraints on the structure of categories that can be learned, whereas instance-based theories cannot easily include such constraints. The current study included three groups to investigate these possible constraints as well as their speech specificity: English speakers learning German speech categories, German speakers learning German speech categories, and English speakers learning musical instrument categories, with each group including participants who learned different sets of categories. Both speech groups had greater difficulty learning disjunctive categories (ones that require an "or" statement) than nondisjunctive categories, which suggests that instance-based learning alone is insufficient to explain the learning of the participants learning phonetic categories. This fact was true for both novices (English speakers) and experts (German speakers), which implies that expertise with the materials used cannot explain the patterns observed. However, the same was not true for the musical instrument categories, suggesting a degree of domain-specificity in these constraints that cannot be explained through recourse to expertise alone.
Collapse
|
13
|
Roark CL, Holt LL. Auditory information-integration category learning in young children and adults. J Exp Child Psychol 2019; 188:104673. [PMID: 31430573 DOI: 10.1016/j.jecp.2019.104673] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2019] [Revised: 07/19/2019] [Accepted: 07/22/2019] [Indexed: 10/26/2022]
Abstract
Adults outperform children on category learning that requires selective attention to individual dimensions (rule-based categories) due to their more highly developed working memory abilities, but much less is known about developmental differences in learning categories that require integration across multiple dimensions (information-integration categories). The current study investigated auditory information-integration category learning in 5- to 7-year-old children (n = 34) and 18- to 25-year-old adults (n = 35). Adults generally outperformed children during learning. However, some children learned the categories well and used strategies similar to those of adults, as assessed through decision-bound computational models. The results demonstrate that information-integration learning ability continues to develop throughout at least middle childhood. These results have implications for the development of mechanisms that contribute to speech category learning.
Collapse
Affiliation(s)
- Casey L Roark
- Department of Psychology, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Center for the Neural Basis of Cognition, University of Pittsburgh-Carnegie Mellon University, Pittsburgh, PA 15213, USA.
| | - Lori L Holt
- Department of Psychology, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Center for the Neural Basis of Cognition, University of Pittsburgh-Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
14
|
Abstract
There is substantial evidence that two distinct learning systems are engaged in category learning. One is principally engaged when learning requires selective attention to a single dimension (rule-based), and the other is drawn online by categories requiring integration across two or more dimensions (information-integration). This distinction has largely been drawn from studies of visual categories learned via overt category decisions and explicit feedback. Recent research has extended this model to auditory categories, the nature of which introduces new questions for research. With the present experiment, we addressed the influences of incidental versus overt training and category distribution sampling on learning information-integration and rule-based auditory categories. The results demonstrate that the training task influences category learning, with overt feedback generally outperforming incidental feedback. Additionally, distribution sampling (probabilistic or deterministic) and category type (information-integration or rule-based) both affect how well participants are able to learn. Specifically, rule-based categories are learned equivalently, regardless of distribution sampling, whereas information-integration categories are learned better with deterministic than with probabilistic sampling. The interactions of distribution sampling, category type, and kind of feedback impacted category-learning performance, but these interactions have not yet been integrated into existing category-learning models. These results suggest new dimensions for understanding category learning, inspired by the real-world properties of auditory categories.
Collapse
|
15
|
Quam C, Wang A, Maddox WT, Golisch K, Lotto A. Procedural-Memory, Working-Memory, and Declarative-Memory Skills Are Each Associated With Dimensional Integration in Sound-Category Learning. Front Psychol 2018; 9:1828. [PMID: 30333772 PMCID: PMC6175975 DOI: 10.3389/fpsyg.2018.01828] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2017] [Accepted: 09/07/2018] [Indexed: 11/25/2022] Open
Abstract
This paper investigates relationships between procedural-memory, declarative-memory, and working-memory skills and adult native English speakers' novel sound-category learning. Participants completed a sound-categorization task that required integrating two dimensions: one native (vowel quality), one non-native (pitch). Similar information-integration category structures in the visual and auditory domains have been shown to be best learned implicitly (e.g., Maddox et al., 2006). Thus, we predicted that individuals with greater procedural-memory capacity would better learn sound categories, because procedural memory appears to support implicit learning of new information and integration of dimensions. Seventy undergraduates were tested across two experiments. Procedural memory was assessed using a linguistic adaptation of the serial-reaction-time task (Misyak et al., 2010a,b). Declarative memory was assessed using the logical-memory subtest of the Wechsler Memory Scale-4th edition (WMS-IV; Wechsler, 2009). Working memory was assessed using an auditory version of the reading-span task (Kane et al., 2004). Experiment 1 revealed contributions of only declarative memory to dimensional integration, which might indicate not enough time or motivation to shift over to a procedural/integrative strategy. Experiment 2 gave twice the speech-sound training, distributed over 2 days, and also attempted to train at the category boundary. As predicted, effects of declarative memory were removed and effects of procedural memory emerged, but, unexpectedly, new effects of working memory surfaced. The results may be compatible with a multiple-systems account in which declarative and working memory facilitate transfer of control to the procedural system.
Collapse
Affiliation(s)
- Carolyn Quam
- Department of Speech and Hearing Sciences, Portland State University, Portland, OR, United States
- Department of Speech, Language, and Hearing Sciences, University of Arizona, Tucson, AZ, United States
- Department of Psychology, University of Arizona, Tucson, AZ, United States
| | - Alisa Wang
- Department of Speech, Language, and Hearing Sciences, University of Arizona, Tucson, AZ, United States
| | - W. Todd Maddox
- Cognitive Design and Statistical Consulting, LLC., Austin, TX, United States
| | - Kimberly Golisch
- Department of Psychology, University of Arizona, Tucson, AZ, United States
- College of Medicine–Tucson, University of Arizona, Tucson, AZ, United States
| | - Andrew Lotto
- Department of Speech, Language, and Hearing Sciences, University of Arizona, Tucson, AZ, United States
- Department of Speech, Language, and Hearing Sciences, University of Florida, Gainesville, FL, United States
| |
Collapse
|
16
|
Earle FS, Arthur DT. Native phonological processing abilities predict post-consolidation nonnative contrast learning in adults. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:EL525. [PMID: 29289078 PMCID: PMC5724740 DOI: 10.1121/1.5013141] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/10/2017] [Revised: 10/06/2017] [Accepted: 11/08/2017] [Indexed: 06/07/2023]
Abstract
This study examined the relationship between native phonological processing ability and the learning outcome of a trained nonnative (Hindi /ɖ/ - / d̪/) contrast. Participants were perceptually trained and assessed in the evening, and reassessed early the next morning. Native phonological processing ability did not predict the learning of the nonnative contrasts on Day 1. However, after a period of post-training sleep, Blending ability predicted nonnative Discrimination performance, and Nonword Repetition predicted nonnative Identification. These findings may point to similarities between processes involved in maintaining native phonological representations and that in the retention of nonnative acoustic-phonetic features in adulthood.
Collapse
Affiliation(s)
- F Sayako Earle
- Communication Sciences and Disorders, University of Delaware, STAR Health Sciences Complex, 540 South College Avenue, Suite 210BB, Newark, Delaware 19716, USA
| | - Dana T Arthur
- Communication Sciences and Disorders, SUNY-New Paltz, 1 Hawk Drive, New Paltz, New York 12561, USA ,
| |
Collapse
|
17
|
Yang X, Jiang M, Zhao Y. Effects of Noise on English Listening Comprehension among Chinese College Students with Different Learning Styles. Front Psychol 2017; 8:1764. [PMID: 29085317 PMCID: PMC5650695 DOI: 10.3389/fpsyg.2017.01764] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2017] [Accepted: 09/25/2017] [Indexed: 11/13/2022] Open
Abstract
This study was intended to determine whether the effects of noise on English listening comprehension would vary among Chinese college students with different learning styles. A total of 89 participants with different learning styles measured using Kolb’s (1985) Learning Style Inventory finished English listening comprehension tests in quiet and in white noise, Chinese two-talker babble, and English two-talker babble respectively. The results showed that the participants in general had significantly poorer performance in the two babble conditions than in quiet and white noise. However, the participants with assimilative and divergent learning styles performed relatively better in Chinese babble, and exhibited stable performance across the three noisy conditions, while the participants with convergent and accommodative learning styles had more impaired performance in both Chinese babble and English babble than in white noise. Moreover, of Kolb’s four learning modes, reflective observation had a facilitative effect on listening performance in Chinese babble and English babble. These findings suggest that differences in learning style might lead to differential performance in foreign language listening comprehension in noise.
Collapse
Affiliation(s)
- Xiaohu Yang
- Speech-Language-Hearing Center, School of Foreign Languages, Shanghai Jiao Tong University, Shanghai, China
| | - Meng Jiang
- Language & Brain Research Center, Sichuan International Studies University, Chongqing, China
| | - Yong Zhao
- Department of Translation and Interpreting, School of Foreign Languages, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|
18
|
Xie Z, Reetzke R, Chandrasekaran B. Stability and plasticity in neural encoding of linguistically relevant pitch patterns. J Neurophysiol 2017; 117:1407-1422. [PMID: 28077662 DOI: 10.1152/jn.00445.2016] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2016] [Revised: 01/09/2017] [Accepted: 01/09/2017] [Indexed: 12/15/2022] Open
Abstract
While lifelong language experience modulates subcortical encoding of pitch patterns, there is emerging evidence that short-term training introduced in adulthood also shapes subcortical pitch encoding. Here we use a cross-language design to examine the stability of language experience-dependent subcortical plasticity over multiple days. We then examine the extent to which behavioral relevance induced by sound-to-category training leads to plastic changes in subcortical pitch encoding in adulthood relative to adolescence, a period of ongoing maturation of subcortical and cortical auditory processing. Frequency-following responses (FFRs), which reflect phase-locked activity from subcortical neural ensembles, were elicited while participants passively listened to pitch patterns reflective of Mandarin tones. In experiment 1, FFRs were recorded across three consecutive days from native Chinese-speaking (n = 10) and English-speaking (n = 10) adults. In experiment 2, FFRs were recorded from native English-speaking adolescents (n = 20) and adults (n = 15) before, during, and immediately after a session of sound-to-category training, as well as a day after training ceased. Experiment 1 demonstrated the stability of language experience-dependent subcortical plasticity in pitch encoding across multiple days of passive exposure to linguistic pitch patterns. In contrast, experiment 2 revealed an enhancement in subcortical pitch encoding that emerged a day after the sound-to-category training, with some developmental differences observed. Taken together, these findings suggest that behavioral relevance is a critical component for the observation of plasticity in the subcortical encoding of pitch.NEW & NOTEWORTHY We examine the timescale of experience-dependent auditory plasticity to linguistically relevant pitch patterns. We find extreme stability in lifelong experience-dependent plasticity. We further demonstrate that subcortical function in adolescents and adults is modulated by a single session of sound-to-category training. Our results suggest that behavioral relevance is a necessary ingredient for neural changes in pitch encoding to be observed throughout human development. These findings contribute to the neurophysiological understanding of long- and short-term experience-dependent modulation of pitch.
Collapse
Affiliation(s)
- Zilong Xie
- Department of Communication Sciences and Disorders, The University of Texas at Austin, Austin, Texas
| | - Rachel Reetzke
- Department of Communication Sciences and Disorders, The University of Texas at Austin, Austin, Texas
| | - Bharath Chandrasekaran
- Department of Communication Sciences and Disorders, The University of Texas at Austin, Austin, Texas; .,Department of Psychology, The University of Texas at Austin, Austin, Texas.,Department of Linguistics, The University of Texas at Austin, Austin, Texas.,Institute for Neuroscience, The University of Texas at Austin, Austin, Texas; and.,Institute for Mental Health Research, The University of Texas at Austin, Austin, Texas
| |
Collapse
|
19
|
Maddox WT, Koslov S, Yi HG, Chandrasekaran B. Performance Pressure Enhances Speech Learning. APPLIED PSYCHOLINGUISTICS 2016; 37:1369-1396. [PMID: 28077883 PMCID: PMC5222599 DOI: 10.1017/s0142716415000600] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Real-world speech learning often occurs in high pressure situations such as trying to communicate in a foreign country. However, the impact of pressure on speech learning success is largely unexplored. In this study, adult, native speakers of English learned non-native speech categories under pressure or no-pressure conditions. In the pressure conditions, participants were informed that they were paired with a (fictitious) partner, and that each had to independently exceed a performance criterion for both to receive a monetary bonus. They were then informed that their partner had exceeded the bonus and the fate of both bonuses depended upon the participant's performance. Our results demonstrate that pressure significantly enhanced speech learning success. In addition, neurobiologically-inspired computational modeling revealed that the performance advantage was due to faster and more frequent use of procedural learning strategies. These results integrate two well-studied research domains and suggest a facilitatory role of motivational factors in speech learning performance that may not be captured in traditional training paradigms.
Collapse
Affiliation(s)
- W Todd Maddox
- Department of Psychology, 1 University Station A8000, Austin, TX, USA, 78712
| | - Seth Koslov
- Department of Psychology, 1 University Station A8000, Austin, TX, USA, 78712
| | - Han-Gyol Yi
- Department of Communication Sciences and Disorders, 1 University Station A1100, Austin, TX, USA, 78712
| | - Bharath Chandrasekaran
- Department of Psychology, 1 University Station A8000, Austin, TX, USA, 78712; Department of Communication Sciences and Disorders, 1 University Station A1100, Austin, TX, USA, 78712
| |
Collapse
|
20
|
Yi HG, Chandrasekaran B. Auditory categories with separable decision boundaries are learned faster with full feedback than with minimal feedback. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016; 140:1332. [PMID: 27586759 PMCID: PMC5001972 DOI: 10.1121/1.4961163] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2015] [Revised: 08/01/2016] [Accepted: 08/02/2016] [Indexed: 05/27/2023]
Abstract
During visual category learning, full feedback (e.g., "Wrong, that was a category 4."), relative to minimal feedback (e.g., "Wrong."), enhances performance when the relevant dimensions are separable. This pattern is reversed with inseparable dimensions. Here, the interaction between trial-by-trial feedback and separability of dimensions in the auditory domain is examined. Participants were trained to categorize auditory stimuli along separable or inseparable dimensions. One group received full feedback, while the other group received minimal feedback. In the separable-dimensions condition, the full-feedback group achieved higher accuracy than did the minimal-feedback group. In the inseparable-dimensions condition, performance was equivalent across the feedback groups. These results altogether suggest that trial-by-trial feedback affects auditory category learning performance differentially for separable and inseparable categories.
Collapse
Affiliation(s)
- Han Gyol Yi
- Department of Communication Sciences and Disorders, 2504A Whitis Avenue (A1100), The University of Texas at Austin, Austin, Texas 78712, USA
| | - Bharath Chandrasekaran
- Department of Communication Sciences and Disorders, 2504A Whitis Avenue (A1100), The University of Texas at Austin, Austin, Texas 78712, USA
| |
Collapse
|
21
|
Chandrasekaran B, Yi HG, Smayda KE, Maddox WT. Effect of explicit dimensional instruction on speech category learning. Atten Percept Psychophys 2016; 78:566-82. [PMID: 26542400 PMCID: PMC4744489 DOI: 10.3758/s13414-015-0999-x] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Learning nonnative speech categories is often considered a challenging task in adulthood. This difficulty is driven by cross-language differences in weighting critical auditory dimensions that differentiate speech categories. For example, previous studies have shown that differentiating Mandarin tonal categories requires attending to dimensions related to pitch height and direction. Relative to native speakers of Mandarin, the pitch direction dimension is underweighted by native English speakers. In the current study, we examined the effect of explicit instructions (dimension instruction) on native English speakers' Mandarin tone category learning within the framework of a dual-learning systems (DLS) model. This model predicts that successful speech category learning is initially mediated by an explicit, reflective learning system that frequently utilizes unidimensional rules, with an eventual switch to a more implicit, reflexive learning system that utilizes multidimensional rules. Participants were explicitly instructed to focus and/or ignore the pitch height dimension, the pitch direction dimension, or were given no explicit prime. Our results show that instruction instructing participants to focus on pitch direction, and instruction diverting attention away from pitch height, resulted in enhanced tone categorization. Computational modeling of participant responses suggested that instruction related to pitch direction led to faster and more frequent use of multidimensional reflexive strategies and enhanced perceptual selectivity along the previously underweighted pitch direction dimension.
Collapse
Affiliation(s)
- Bharath Chandrasekaran
- Department of Communication Sciences and Disorders, The University of Texas at Austin, 2504A Whitis Ave., Austin, TX, 78712, USA.
- Department of Psychology, The University of Texas at Austin, 2504A Whitis Ave., Austin, TX, 78712, USA.
| | - Han-Gyol Yi
- Department of Communication Sciences and Disorders, The University of Texas at Austin, 2504A Whitis Ave., Austin, TX, 78712, USA
| | - Kirsten E Smayda
- Department of Psychology, The University of Texas at Austin, 2504A Whitis Ave., Austin, TX, 78712, USA
| | - W Todd Maddox
- Department of Psychology, The University of Texas at Austin, 2504A Whitis Ave., Austin, TX, 78712, USA
| |
Collapse
|
22
|
The role of age and executive function in auditory category learning. J Exp Child Psychol 2015; 142:48-65. [PMID: 26491987 DOI: 10.1016/j.jecp.2015.09.018] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2015] [Revised: 08/01/2015] [Accepted: 09/21/2015] [Indexed: 11/20/2022]
Abstract
Auditory categorization is a natural and adaptive process that allows for the organization of high-dimensional, continuous acoustic information into discrete representations. Studies in the visual domain have identified a rule-based learning system that learns and reasons via a hypothesis-testing process that requires working memory and executive attention. The rule-based learning system in vision shows a protracted development, reflecting the influence of maturing prefrontal function on visual categorization. The aim of the current study was twofold: (a) to examine the developmental trajectory of rule-based auditory category learning from childhood through adolescence and into early adulthood and (b) to examine the extent to which individual differences in rule-based category learning relate to individual differences in executive function. A sample of 60 participants with normal hearing-20 children (age range=7-12years), 21 adolescents (age range=13-19years), and 19 young adults (age range=20-23years)-learned to categorize novel dynamic "ripple" sounds using trial-by-trial feedback. The spectrotemporally modulated ripple sounds are considered the auditory equivalent of the well-studied "Gabor" patches in the visual domain. Results reveal that auditory categorization accuracy improved with age, with young adults outperforming children and adolescents. Computational modeling analyses indicated that the use of the task-optimal strategy (i.e., a conjunctive rule-based learning strategy) improved with age. Notably, individual differences in executive flexibility significantly predicted auditory category learning success. The current findings demonstrate a protracted development of rule-based auditory categorization. The results further suggest that executive flexibility coupled with perceptual processes play important roles in successful rule-based auditory category learning.
Collapse
|
23
|
Abstract
A mutation of the forkhead box protein P2 (FOXP2) gene is associated with severe deficits in human speech and language acquisition. In rodents, the humanized form of FOXP2 promotes faster switching from declarative to procedural learning strategies when the two learning systems compete. Here, we examined a polymorphism of FOXP2 (rs6980093) in humans (214 adults; 111 females) for associations with non-native speech category learning success. Neurocomputational modeling results showed that individuals with the GG genotype shifted faster to procedural learning strategies, which are optimal for the task. These findings support an adaptive role for the FOXP2 gene in modulating the function of neural learning systems that have a direct bearing on human speech category learning.
Collapse
|
24
|
Smayda KE, Chandrasekaran B, Maddox WT. Enhanced cognitive and perceptual processing: a computational basis for the musician advantage in speech learning. Front Psychol 2015; 6:682. [PMID: 26052304 PMCID: PMC4439769 DOI: 10.3389/fpsyg.2015.00682] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2014] [Accepted: 05/10/2015] [Indexed: 01/05/2023] Open
Abstract
Long-term music training can positively impact speech processing. A recent framework developed to explain such cross-domain plasticity posits that music training-related advantages in speech processing are due to shared cognitive and perceptual processes between music and speech. Although perceptual and cognitive processing advantages due to music training have been independently demonstrated, to date no study has examined perceptual and cognitive processing within the context of a single task. The present study examines the impact of long-term music training on speech learning from a rigorous, computational perspective derived from signal detection theory. Our computational models provide independent estimates of cognitive and perceptual processing in native English-speaking musicians (n = 15, mean age = 25 years) and non-musicians (n = 15, mean age = 23 years) learning to categorize non-native lexical pitch patterns (Mandarin tones). Musicians outperformed non-musicians in this task. Model-based analyses suggested that musicians shifted from simple unidimensional decision strategies to more optimal multidimensional (MD) decision strategies sooner than non-musicians. In addition, musicians used optimal decisional strategies more often than non-musicians. However, musicians and non-musicians who used MD strategies showed no difference in performance. We estimated parameters that quantify the magnitude of perceptual variability along two dimensions that are critical for tone categorization: pitch height and pitch direction. Both musicians and non-musicians showed a decrease in perceptual variability along the pitch height dimension, but only musicians showed a significant reduction in perceptual variability along the pitch direction dimension. Notably, these advantages persisted during a generalization phase, when no feedback was provided. These results provide an insight into the mechanisms underlying the musician advantage observed in non-native speech learning.
Collapse
Affiliation(s)
- Kirsten E Smayda
- Department of Psychology, The University of Texas at Austin Austin, TX USA
| | - Bharath Chandrasekaran
- Department of Psychology, The University of Texas at Austin Austin, TX USA ; Department of Communication Sciences and Disorders, The University of Texas at Austin Austin, TX USA
| | - W Todd Maddox
- Department of Psychology, The University of Texas at Austin Austin, TX USA
| |
Collapse
|
25
|
Xie Z, Maddox WT, McGeary JE, Chandrasekaran B. The C957T polymorphism in the dopamine receptor D₂ gene modulates domain-general category learning. J Neurophysiol 2015; 113:3281-90. [PMID: 25761959 DOI: 10.1152/jn.01005.2014] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2014] [Accepted: 03/10/2015] [Indexed: 11/22/2022] Open
Abstract
Adaptive learning from reward and punishment is vital for human survival. Striatal and frontal dopaminergic activities are associated with adaptive learning. For example, the C957T single nucleotide polymorphism of the dopamine receptor D2 (DRD2) gene alters striatal D2 receptor availability and affects individuals' adaptive learning ability. Specifically, individuals with the T/T genotype, which is associated with higher striatal D2 availability, show enhanced learning from negative outcomes. Prior work examining DRD2 genetic variability has focused primarily on frontally mediated reflective learning that is under effortful, conscious control. However, less is known about a more automatic, striatally mediated reflexive learning. Here we examined the extent to which this polymorphism differentially influences reflective and reflexive learning across visual and auditory modalities. We employed rule-based (RB) and information-integration (II) category learning paradigms that target reflective and reflexive learning, respectively. Results revealed an advantage in II category learning but poorer RB category learning in T/T homozygotes. The pattern of results was consistent across sensory modalities. These findings suggest that this DRD2 polymorphism exerts opposite influences on domain-general frontally mediated reflective learning and striatally mediated reflexive learning.
Collapse
Affiliation(s)
- Zilong Xie
- Department of Communication Sciences & Disorders, The University of Texas at Austin, Austin, Texas
| | - W Todd Maddox
- Department of Psychology, The University of Texas at Austin, Austin, Texas
| | - John E McGeary
- Division of Behavioral Genetics, Rhode Island Hospital, Providence, Rhode Island; Brown University, Providence, Rhode Island; and Psychologist, Providence Veterans Affairs Medical Center, Providence, Rhode Island
| | - Bharath Chandrasekaran
- Department of Communication Sciences & Disorders, The University of Texas at Austin, Austin, Texas;
| |
Collapse
|
26
|
Levi SV. Individual differences in learning talker categories: the role of working memory. PHONETICA 2015; 71:201-26. [PMID: 25721393 PMCID: PMC4861173 DOI: 10.1159/000370160] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2014] [Accepted: 11/25/2014] [Indexed: 06/04/2023]
Abstract
The current study explores the question of how an auditory category is learned by having school-age listeners learn to categorize speech not in terms of linguistic categories, but instead in terms of talker categories (i.e., who is talking). Findings from visual-category learning indicate that working memory skills affect learning, but the literature is equivocal: sometimes better working memory is advantageous, and sometimes not. The current study examined the role of different components of working memory to test which component skills benefit, and which hinder, learning talker categories. Results revealed that the short-term storage component positively predicted learning, but that the Central Executive and Episodic Buffer negatively predicted learning. As with visual categories, better working memory is not always an advantage.
Collapse
Affiliation(s)
- Susannah V Levi
- Department of Communicative Sciences and Disorders, New York University, New York, N.Y., USA
| |
Collapse
|
27
|
Yi HG, Maddox WT, Mumford JA, Chandrasekaran B. The Role of Corticostriatal Systems in Speech Category Learning. Cereb Cortex 2014; 26:1409-1420. [PMID: 25331600 DOI: 10.1093/cercor/bhu236] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
One of the most difficult category learning problems for humans is learning nonnative speech categories. While feedback-based category training can enhance speech learning, the mechanisms underlying these benefits are unclear. In this functional magnetic resonance imaging study, we investigated neural and computational mechanisms underlying feedback-dependent speech category learning in adults. Positive feedback activated a large corticostriatal network including the dorsolateral prefrontal cortex, inferior parietal lobule, middle temporal gyrus, caudate, putamen, and the ventral striatum. Successful learning was contingent upon the activity of domain-general category learning systems: the fast-learning reflective system, involving the dorsolateral prefrontal cortex that develops and tests explicit rules based on the feedback content, and the slow-learning reflexive system, involving the putamen in which the stimuli are implicitly associated with category responses based on the reward value in feedback. Computational modeling of response strategies revealed significant use of reflective strategies early in training and greater use of reflexive strategies later in training. Reflexive strategy use was associated with increased activation in the putamen. Our results demonstrate a critical role for the reflexive corticostriatal learning system as a function of response strategy and proficiency during speech category learning.
Collapse
Affiliation(s)
- Han-Gyol Yi
- Department of Communication Sciences & Disorders, Moody College of Communication, The University of Texas at Austin, Austin, TX, USA
| | - W Todd Maddox
- Department of Psychology, College of Liberal Arts, The University of Texas at Austin, Austin, TX, USA.,Institute for Mental Health Research, College of Liberal Arts, The University of Texas at Austin, Austin, TX, USA.,The Institute for Neuroscience, The University of Texas at Austin, Austin, TX, USA.,Center for Perceptual Systems, College of Liberal Arts, The University of Texas at Austin, Austin, TX, USA
| | - Jeanette A Mumford
- Department of Psychology, College of Liberal Arts, The University of Texas at Austin, Austin, TX, USA
| | - Bharath Chandrasekaran
- Department of Communication Sciences & Disorders, Moody College of Communication, The University of Texas at Austin, Austin, TX, USA.,Institute for Mental Health Research, College of Liberal Arts, The University of Texas at Austin, Austin, TX, USA.,The Institute for Neuroscience, The University of Texas at Austin, Austin, TX, USA
| |
Collapse
|
28
|
Maddox WT, Chandrasekaran B, Smayda K, Yi HG, Koslov S, Beevers CG. Elevated depressive symptoms enhance reflexive but not reflective auditory category learning. Cortex 2014; 58:186-98. [PMID: 25041936 PMCID: PMC4130789 DOI: 10.1016/j.cortex.2014.06.013] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2014] [Revised: 05/09/2014] [Accepted: 06/12/2014] [Indexed: 11/22/2022]
Abstract
In vision an extensive literature supports the existence of competitive dual-processing systems of category learning that are grounded in neuroscience and are partially-dissociable. The reflective system is prefrontally-mediated and uses working memory and executive attention to develop and test rules for classifying in an explicit fashion. The reflexive system is striatally-mediated and operates by implicitly associating perception with actions that lead to reinforcement. Although categorization is fundamental to auditory processing, little is known about the learning systems that mediate auditory categorization and even less is known about the effects of individual difference in the relative efficiency of the two learning systems. Previous studies have shown that individuals with elevated depressive symptoms show deficits in reflective processing. We exploit this finding to test critical predictions of the dual-learning systems model in audition. Specifically, we examine the extent to which the two systems are dissociable and competitive. We predicted that elevated depressive symptoms would lead to reflective-optimal learning deficits but reflexive-optimal learning advantages. Because natural speech category learning is reflexive in nature, we made the prediction that elevated depressive symptoms would lead to superior speech learning. In support of our predictions, individuals with elevated depressive symptoms showed a deficit in reflective-optimal auditory category learning, but an advantage in reflexive-optimal auditory category learning. In addition, individuals with elevated depressive symptoms showed an advantage in learning a non-native speech category structure. Computational modeling suggested that the elevated depressive symptom advantage was due to faster, more accurate, and more frequent use of reflexive category learning strategies in individuals with elevated depressive symptoms. The implications of this work for dual-process approach to auditory learning and depression are discussed.
Collapse
Affiliation(s)
| | | | | | - Han-Gyol Yi
- Department of Communication Sciences and Disorders, Austin, TX, 78712, USA.
| | - Seth Koslov
- Department of Psychology, Austin, TX, 78712, USA.
| | | |
Collapse
|
29
|
Chandrasekaran B, Koslov SR, Maddox WT. Toward a dual-learning systems model of speech category learning. Front Psychol 2014; 5:825. [PMID: 25132827 PMCID: PMC4116788 DOI: 10.3389/fpsyg.2014.00825] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2014] [Accepted: 07/10/2014] [Indexed: 11/15/2022] Open
Abstract
More than two decades of work in vision posits the existence of dual-learning systems of category learning. The reflective system uses working memory to develop and test rules for classifying in an explicit fashion, while the reflexive system operates by implicitly associating perception with actions that lead to reinforcement. Dual-learning systems models hypothesize that in learning natural categories, learners initially use the reflective system and, with practice, transfer control to the reflexive system. The role of reflective and reflexive systems in auditory category learning and more specifically in speech category learning has not been systematically examined. In this article, we describe a neurobiologically constrained dual-learning systems theoretical framework that is currently being developed in speech category learning and review recent applications of this framework. Using behavioral and computational modeling approaches, we provide evidence that speech category learning is predominantly mediated by the reflexive learning system. In one application, we explore the effects of normal aging on non-speech and speech category learning. Prominently, we find a large age-related deficit in speech learning. The computational modeling suggests that older adults are less likely to transition from simple, reflective, unidimensional rules to more complex, reflexive, multi-dimensional rules. In a second application, we summarize a recent study examining auditory category learning in individuals with elevated depressive symptoms. We find a deficit in reflective-optimal and an enhancement in reflexive-optimal auditory category learning. Interestingly, individuals with elevated depressive symptoms also show an advantage in learning speech categories. We end with a brief summary and description of a number of future directions.
Collapse
Affiliation(s)
- Bharath Chandrasekaran
- SoundBrain Lab, Department of Communication Sciences and Disorders, The University of Texas at AustinAustin, TX, USA
- Institute for Mental Health Research, The University of Texas at AustinAustin, TX, USA
- Institute for Neuroscience, The University of Texas at AustinAustin, TX, USA
- Department of Psychology, The University of Texas at AustinAustin, TX, USA
| | - Seth R. Koslov
- Department of Psychology, The University of Texas at AustinAustin, TX, USA
| | - W. T. Maddox
- Institute for Mental Health Research, The University of Texas at AustinAustin, TX, USA
- Institute for Neuroscience, The University of Texas at AustinAustin, TX, USA
- Department of Psychology, The University of Texas at AustinAustin, TX, USA
| |
Collapse
|