1
Funk R, Weirich M, Simpson AP. The Effect of Fundamental Frequency on Gender Perception in Prepubertal Children: Insights from the LoKiS Database. J Voice 2024:S0892-1997(24)00129-2. [PMID: 38704276 DOI: 10.1016/j.jvoice.2024.04.010] [Received: 01/26/2024] [Revised: 04/05/2024] [Accepted: 04/05/2024] [Indexed: 05/06/2024]
Abstract
This study examines the impact of fundamental frequency on gender perception in prepubertal children in the LoKiS database, a longitudinal project collecting and analyzing recordings of approximately 60 German primary school children aged 6 to 10 years. Spontaneous and content-controlled audio recordings were collected in two German primary schools. Three distinct listening experiments with over 100 listeners were conducted. In the first experiment, listeners judged the gender of the voices on a seven-point scale. The second experiment explored the relationships between perceptual attribute ratings and corresponding acoustic parameters associated with fundamental frequency. The third experiment utilized voice morphing techniques to investigate the influence of fundamental frequency on gender perception while controlling for other acoustic parameters. About one-third of the children receive unambiguous gender attributions. The perceived gender difference between children assigned female at birth (AFAB) and assigned male at birth (AMAB) increases from first to third grade. The feminine-sounding children were perceived as significantly higher-pitched and more melodious. A strong correlation between perceived pitch and measured fundamental frequency was found. While the acoustic analysis revealed only a few significant differences between AFAB and AMAB children in general, the feminine-sounding children exhibited markedly higher values than the masculine-sounding ones. Stronger differences of fundamental frequency and semitone range occur as AFAB and AMAB children get older. Linear mixed models confirm a significant influence of fundamental frequency and semitone range on gender perception. Other interacting factors include the speech material used, as well as the gender of the listener. The influence of fundamental frequency was even more pronounced when controlling for other acoustic parameters.
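The abstract above reports a "semitone range" alongside fundamental frequency but does not say how it was computed. As a minimal sketch, assuming the common definition (the distance between the lowest and highest F0 on a logarithmic scale with 12 semitones per octave), it could be calculated as follows. The function name and the example values are hypothetical and are not taken from the study.

```python
import math


def semitone_range(f0_min_hz: float, f0_max_hz: float) -> float:
    """Span between two F0 values in semitones (12 semitones per octave)."""
    if f0_min_hz <= 0 or f0_max_hz <= 0:
        raise ValueError("F0 values must be positive")
    return 12.0 * math.log2(f0_max_hz / f0_min_hz)


# Hypothetical F0 extremes of one utterance, not data from the study.
print(round(semitone_range(220.0, 330.0), 2))  # a ratio of 3:2 is about 7.02 semitones
```

Expressing the range in semitones rather than hertz makes speakers with different average pitch comparable, because equal frequency ratios map to equal distances.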
2
Yücel Ekici N, Demet Akbaş E, Kadir Arslan A. Voice aspects in children with precocious puberty. Int J Pediatr Otorhinolaryngol 2024; 180:111962. [PMID: 38657429 DOI: 10.1016/j.ijporl.2024.111962] [Received: 12/30/2023] [Revised: 04/18/2024] [Accepted: 04/22/2024] [Indexed: 04/26/2024]
Abstract
PURPOSE In this prospective study, we aimed to investigate the difference in voice acoustic parameters between girls with idiopathic central precocious puberty (ICPP) and those who developed normally during prepuberty. MATERIALS AND METHODS Our study recruited 54 girls diagnosed with ICPP and randomly sampled 51 healthy prepubertal girls as the control. Tanner stages, circulating hormone levels and bone ages of the girls with ICPP and the age and body mass index (BMI) of all participants were recorded. Acoustic analyses were performed using PRAAT computer-based voice analysis software, and the mean pitch (F0), jitter, shimmer, noise-to-harmonic ratio (NHR) and harmonic-to-noise ratio (HNR) values were compared between the patient and control groups. RESULTS The two groups did not significantly differ in age or BMI. The F0 and jitter values were lower in the control group than in the patient group, but the differences were not statistically significant. The mean shimmer values of the patient group were significantly higher than those of the control group. In addition, a statistically significant difference was noted for the mean HNR and NHR values (P < 0.001). A moderate negative correlation was found between shimmer and hormone levels in the patient group. CONCLUSIONS Voice acoustic parameters are one of the defining features of girls with ICPP. Changes in voice acoustic parameters could reflect hormonal changes during puberty. Clinicians should suspect ICPP when there is a change in the voice.
Affiliation(s)
- Nur Yücel Ekici
  - Department of Otorhinolaryngology, University of Health Sciences Adana City Training and Research Hospital, Adana, Turkey.
- Emine Demet Akbaş
  - Department of Pediatric Endocrinology, University of Health Sciences Adana City Training and Research Hospital, Adana, Turkey
- Ahmet Kadir Arslan
  - Department of Biostatistics and Medical Informatics, Faculty of Medicine, Inönü University, Malatya, Turkey
3
Fan C, Xue J, Tao J, Yi J, Wang C, Zheng C, Lv Z. Spatial reconstructed local attention Res2Net with F0 subband for fake speech detection. Neural Netw 2024; 175:106320. [PMID: 38640696 DOI: 10.1016/j.neunet.2024.106320] [Received: 07/18/2023] [Revised: 03/05/2024] [Accepted: 04/13/2024] [Indexed: 04/21/2024]
Abstract
The rhythm of bonafide speech is often difficult to replicate, which causes the fundamental frequency (F0) of synthetic speech to differ significantly from that of real speech. The F0 feature is therefore expected to contain discriminative information for the fake speech detection (FSD) task. In this paper, we propose a novel F0 subband for FSD. In addition, to effectively model the F0 subband so as to improve the performance of FSD, the spatial reconstructed local attention Res2Net (SR-LA Res2Net) is proposed. Specifically, Res2Net is used as a backbone network to obtain multiscale information, and enhanced with a spatial reconstruction mechanism to avoid losing important information when the channel group is constantly superimposed. In addition, local attention is designed to make the model focus on the local information of the F0 subband. Experimental results on the ASVspoof 2019 LA dataset show that our proposed method obtains an equal error rate (EER) of 0.47% and a minimum tandem detection cost function (min t-DCF) of 0.0159, achieving state-of-the-art performance among single systems.
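The abstract above reports its headline result as an equal error rate (EER), the operating point at which the false acceptance rate and the false rejection rate coincide. The following is a minimal sketch of how an EER can be approximated from detector scores by sweeping thresholds. It is not the evaluation code used in the paper or in ASVspoof; the function name, the score convention (higher means more likely bonafide) and the example scores are all assumptions made for illustration.

```python
def equal_error_rate(bonafide_scores, spoof_scores):
    """Approximate the EER by sweeping a threshold over all observed scores.

    Assumed convention: a higher score means "more likely bonafide".
    """
    thresholds = sorted(set(bonafide_scores) | set(spoof_scores))
    best_gap, eer = float("inf"), 1.0
    for t in thresholds:
        # False rejection: bonafide trials scored below the threshold.
        frr = sum(s < t for s in bonafide_scores) / len(bonafide_scores)
        # False acceptance: spoofed trials scored at or above the threshold.
        far = sum(s >= t for s in spoof_scores) / len(spoof_scores)
        # Keep the threshold where the two error rates are closest.
        if abs(far - frr) < best_gap:
            best_gap, eer = abs(far - frr), (far + frr) / 2
    return eer


# Hypothetical scores, not results from the paper.
print(equal_error_rate([0.9, 0.8, 0.7, 0.4], [0.6, 0.3, 0.2, 0.1]))  # 0.25
```

With finite score lists the two error curves rarely cross exactly, so this sketch returns the average of the two rates at the closest threshold; production toolkits typically interpolate instead.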
Affiliation(s)
- Cunhang Fan
  - Anhui Province Key Laboratory of Multimodal Cognitive Computation, School of Computer Science and Technology, Anhui University, Hefei, 230601, China.
- Jun Xue
  - Anhui Province Key Laboratory of Multimodal Cognitive Computation, School of Computer Science and Technology, Anhui University, Hefei, 230601, China
- Jianhua Tao
  - Department of Automation, Tsinghua University, Beijing, 100190, China.
- Jiangyan Yi
  - National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China
- Chenglong Wang
  - National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China
- Chengshi Zheng
  - Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences, Beijing, 100190, China
- Zhao Lv
  - Anhui Province Key Laboratory of Multimodal Cognitive Computation, School of Computer Science and Technology, Anhui University, Hefei, 230601, China.
4
Radic R, Lukacova K, Baciak L, Hodova V, Kubikova L. The role of cerebellum in learned vocal communication in adult songbirds. Sci Rep 2024; 14:8168. [PMID: 38589482 PMCID: PMC11001874 DOI: 10.1038/s41598-024-58569-8] [Received: 11/21/2023] [Accepted: 04/01/2024] [Indexed: 04/10/2024]
Abstract
Injury, tumors, ischemia, and lesions in the cerebellum show the involvement of this region in human speech. The association of the cerebellum with learned birdsong has only been identified recently. Cerebellar dysfunction in young songbirds causes learning disabilities, but its role in adult songbirds has not been established. The aim of this study was to investigate the role of the deep cerebellar nuclei (DCN) in adult birdsong. We created bilateral excitotoxic lesions in the DCN of adult male zebra finches (Taeniopygia guttata) and recorded their songs for up to 4 months. Using magnetic resonance imaging (MRI) and immunohistochemistry, we validated the lesion efficacy. We found that the song duration significantly increased from 14 weeks post-op; the increase in duration was caused by a greater number of introductory notes as well as a greater number of syllables sung after the introductory notes. On the other hand, the motif duration decreased from 8 weeks after DCN lesions were induced, which was due to faster singing of syllables, not changes in inter-syllable interval length. DCN lesions also caused a decrease in the fundamental frequency of syllables. In summary, we showed that DCN lesions influence the temporal and acoustic features of birdsong. These results suggest that the cerebellum influences singing in adult songbirds.
Affiliation(s)
- Rebecca Radic
  - Institute of Animal Biochemistry and Genetics, Centre of Biosciences, Slovak Academy of Sciences, 840 05, Bratislava, Slovakia
- Kristina Lukacova
  - Institute of Animal Biochemistry and Genetics, Centre of Biosciences, Slovak Academy of Sciences, 840 05, Bratislava, Slovakia
- Ladislav Baciak
  - Central Laboratories, Faculty of Chemical and Food Technology, Slovak University of Technology, 812 37, Bratislava, Slovakia
- Vladimira Hodova
  - Institute of Animal Biochemistry and Genetics, Centre of Biosciences, Slovak Academy of Sciences, 840 05, Bratislava, Slovakia
- Lubica Kubikova
  - Institute of Animal Biochemistry and Genetics, Centre of Biosciences, Slovak Academy of Sciences, 840 05, Bratislava, Slovakia.
5
Jacxsens L, Biot L, Escera C, Gilles A, Cardon E, Van Rompaey V, De Hertogh W, Lammers MJW. Frequency-Following Responses in Sensorineural Hearing Loss: A Systematic Review. J Assoc Res Otolaryngol 2024; 25:131-147. [PMID: 38334887 PMCID: PMC11018579 DOI: 10.1007/s10162-024-00932-7] [Received: 08/01/2023] [Accepted: 01/18/2024] [Indexed: 02/10/2024]
Abstract
PURPOSE This systematic review aims to assess the impact of sensorineural hearing loss (SNHL) on various frequency-following response (FFR) parameters. METHODS Following PRISMA guidelines, a systematic review was conducted using PubMed, Web of Science, and Scopus databases up to January 2023. Studies evaluating FFRs in patients with SNHL and normal hearing controls were included. RESULTS Sixteen case-control studies were included, revealing variability in acquisition parameters. In the time domain, patients with SNHL exhibited prolonged latencies. The specific waves that were prolonged differed across studies. There was no consensus regarding wave amplitude in the time domain. In the frequency domain, focusing on studies that elicited FFRs with stimuli of 170 ms or longer, participants with SNHL displayed a significantly smaller fundamental frequency (F0). Results regarding changes in the temporal fine structure (TFS) were inconsistent. CONCLUSION Patients with SNHL may require more time for processing (speech) stimuli, reflected in prolonged latencies. However, the exact timing of this delay remains unclear. Additionally, when presenting longer stimuli (≥ 170 ms), patients with SNHL show difficulties tracking the F0 of (speech) stimuli. No definite conclusions could be drawn on changes in wave amplitude in the time domain and the TFS in the frequency domain. Patient characteristics, acquisition parameters, and FFR outcome parameters differed greatly across studies. Future studies should be performed in larger and carefully matched subject groups, using longer stimuli presented at the same intensity in dB HL for both groups, or at a carefully determined maximum comfortable loudness level.
Affiliation(s)
- Laura Jacxsens
  - Department of Otorhinolaryngology, Head and Neck Surgery, Antwerp University Hospital (UZA), Drie Eikenstraat 655, 2650, Edegem, Belgium.
  - Resonant Labs Antwerp, Department of Translational Neurosciences, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium.
  - Department of Rehabilitation Sciences and Physiotherapy, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium.
- Lana Biot
  - Department of Otorhinolaryngology, Head and Neck Surgery, Antwerp University Hospital (UZA), Drie Eikenstraat 655, 2650, Edegem, Belgium
  - Resonant Labs Antwerp, Department of Translational Neurosciences, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
- Carles Escera
  - Neuroscience Research Group, Department of Clinical Psychology and Psychobiology, Brainlab - Cognitive, University of Barcelona, Catalonia, Spain
  - Institute of Neurosciences, University of Barcelona, Catalonia, Spain
  - Institut de Recerca Sant Joan de Déu, Santa Rosa 39-57, 08950, Esplugues de Llobregat, Catalonia, Spain
- Annick Gilles
  - Department of Otorhinolaryngology, Head and Neck Surgery, Antwerp University Hospital (UZA), Drie Eikenstraat 655, 2650, Edegem, Belgium
  - Resonant Labs Antwerp, Department of Translational Neurosciences, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
  - Department of Education, Health and Social Work, University College Ghent, Ghent, Belgium
- Emilie Cardon
  - Department of Otorhinolaryngology, Head and Neck Surgery, Antwerp University Hospital (UZA), Drie Eikenstraat 655, 2650, Edegem, Belgium
  - Resonant Labs Antwerp, Department of Translational Neurosciences, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
- Vincent Van Rompaey
  - Department of Otorhinolaryngology, Head and Neck Surgery, Antwerp University Hospital (UZA), Drie Eikenstraat 655, 2650, Edegem, Belgium
  - Resonant Labs Antwerp, Department of Translational Neurosciences, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
- Willem De Hertogh
  - Department of Rehabilitation Sciences and Physiotherapy, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
- Marc J W Lammers
  - Department of Otorhinolaryngology, Head and Neck Surgery, Antwerp University Hospital (UZA), Drie Eikenstraat 655, 2650, Edegem, Belgium
  - Resonant Labs Antwerp, Department of Translational Neurosciences, Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
6
Södersten M, Oates J, Sand A, Granqvist S, Quinn S, Dacakis G, Nygren U. Gender-Affirming Voice Training for Trans Women: Acoustic Outcomes and Their Associations With Listener Perceptions Related to Gender. J Voice 2024:S0892-1997(24)00023-7. [PMID: 38503674 DOI: 10.1016/j.jvoice.2024.02.003] [Received: 11/29/2023] [Revised: 02/01/2024] [Accepted: 02/02/2024] [Indexed: 03/21/2024]
Abstract
OBJECTIVES To investigate acoustic outcomes of gender-affirming voice training for trans women wanting to develop a female sounding voice and to describe what happens acoustically when male sounding voices become more female sounding. STUDY DESIGN Prospective treatment study with repeated measures. METHODS N = 74 trans women completed a voice training program of 8-12 sessions and had their voices audio recorded twice before and twice after training. Reference data were obtained from N = 40 cisgender speakers. Fundamental frequency (fo), formant frequencies (F1-F4), sound pressure level (Leq), and level difference between first and second harmonic (L1-L2) were extracted from a reading passage and spontaneous speech. N = 79 naive listeners provided gender-related ratings of participants' audio recordings. A linear mixed-effects model was used to estimate average training effects. Individual level analyses determined how changes in acoustic data were related to listeners' ratings. RESULTS Group data showed substantial training effects on fo (average, minimum, and maximum) and formant frequencies. Individual data demonstrated that many participants also increased Leq and some increased L1-L2. Measures that most strongly predicted listener ratings of a female sounding voice were: fo, average formant frequency, and Leq. CONCLUSIONS This is the largest prospective study reporting on acoustic outcomes of gender-affirming voice training for trans women. We confirm findings from previous smaller scale studies by demonstrating that listener perceptions of male and female sounding voices are related to acoustic voice features, and that voice training for trans women wanting to sound female is associated with desirable acoustic changes, indicating training effectiveness. 
Although acoustic measures can be a valuable indicator of training effectiveness, particularly from the perspective of clinicians and researchers, we contend that a combination of outcome measures, including client perspectives, are needed to provide comprehensive evaluation of gender-affirming voice training that is relevant for all stakeholders.
Affiliation(s)
- Maria Södersten
  - Division of Speech and Language Pathology, Department of Clinical Science, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden; Speech and Language Pathology, Medical Unit, Karolinska University Hospital, Stockholm, Sweden.
- Jennifer Oates
  - Discipline of Speech Pathology, School of Allied Health, Human Services and Sport, La Trobe University, Melbourne, Australia
- Anders Sand
  - Division of Speech and Language Pathology, Department of Clinical Science, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden
- Svante Granqvist
  - Division of Speech and Language Pathology, Department of Clinical Science, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden
- Sterling Quinn
  - Discipline of Speech Pathology, School of Allied Health, Human Services and Sport, La Trobe University, Melbourne, Australia
- Georgia Dacakis
  - Discipline of Speech Pathology, School of Allied Health, Human Services and Sport, La Trobe University, Melbourne, Australia
- Ulrika Nygren
  - Division of Speech and Language Pathology, Department of Clinical Science, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden; Speech and Language Pathology, Medical Unit, Karolinska University Hospital, Stockholm, Sweden
7
Busquet F, Efthymiou F, Hildebrand C. Voice analytics in the wild: Validity and predictive accuracy of common audio-recording devices. Behav Res Methods 2024; 56:2114-2134. [PMID: 37253958 PMCID: PMC10228884 DOI: 10.3758/s13428-023-02139-9] [Accepted: 04/27/2023] [Indexed: 06/01/2023]
Abstract
The use of voice recordings in both research and industry practice has increased dramatically in recent years, from diagnosing a COVID-19 infection based on patients' self-recorded voice samples to predicting customer emotions during a service center call. Crowdsourced audio data collection in participants' natural environment using their own recording device has opened up new avenues for researchers and practitioners to conduct research at scale across a broad range of disciplines. The current research examines whether fundamental properties of the human voice are reliably and validly captured through common consumer-grade audio-recording devices in current medical, behavioral science, business, and computer science research. Specifically, this work provides evidence from a tightly controlled laboratory experiment analyzing 1800 voice samples and subsequent simulations that recording devices with high proximity to a speaker (such as a headset or a lavalier microphone) lead to inflated measures of amplitude compared to a benchmark studio-quality microphone, while recording devices with lower proximity to a speaker (such as a laptop or a smartphone in front of the speaker) systematically reduce measures of amplitude and can lead to biased measures of the speaker's true fundamental frequency. We further demonstrate through simulation studies that these differences can lead to biased and ultimately invalid conclusions in, for example, an emotion detection task. Finally, we outline a set of recording guidelines to ensure reliable and valid voice recordings and offer initial evidence for a machine-learning approach to bias correction in the case of distorted speech signals.
Affiliation(s)
- Francesc Busquet
  - Institute of Behavioral Science and Technology, University of St. Gallen, Torstrasse 25, St. Gallen, 9000, Switzerland.
- Fotis Efthymiou
  - Institute of Behavioral Science and Technology, University of St. Gallen, Torstrasse 25, St. Gallen, 9000, Switzerland
- Christian Hildebrand
  - Institute of Behavioral Science and Technology, University of St. Gallen, Torstrasse 25, St. Gallen, 9000, Switzerland.
8
Rogalska M, Zielinski M, Antkowiak L, Kasperczuk A, Misiolek M. Impact of Wendler glottoplasty on acoustic measures and quality of voice in transgender women: a systematic review and meta-analysis. Eur Arch Otorhinolaryngol 2024:10.1007/s00405-024-08486-3. [PMID: 38315177 DOI: 10.1007/s00405-024-08486-3] [Received: 11/17/2023] [Accepted: 01/15/2024] [Indexed: 02/07/2024]
Abstract
PURPOSE Since new evidence regarding the impact of Wendler glottoplasty (WG) on the voice in transgender women became available in the literature in recent years, we aimed to perform an updated systematic review and meta-analysis to determine the actual safety and efficacy of WG in the process of vocal feminization. METHODS PubMed, Embase, and Cochrane were searched for English-language articles published until July 4, 2023. Studies were found eligible if they evaluated the impact of WG on the acoustic-aerodynamic measures and quality of voice in transgender women. RESULTS Twenty-three studies were identified. After exclusion of three studies due to incomplete data, 20 studies including 656 patients were included in the meta-analysis. After WG, there was a significant increase of fundamental frequency, speaking fundamental frequency, and lower limit of the frequency range (p < 0.001). Concurrently, a significant reduction of frequency range and maximum phonation time was observed (p < 0.001). No significant differences were found between the pre- and postoperative values regarding the Grade, Roughness, Breathiness, Asthenia, and Strain scale score (p = 0.339). The overall score in the Trans Woman Voice Questionnaire (TWVQ) significantly improved after WG (p < 0.001). CONCLUSIONS WG is an effective voice feminization method in transgender women, associated with a high procedural success and low risk of postoperative complications. Significantly improved TWVQ score after surgery suggests its positive impact on the voice-related quality of life. Postoperative decrease of maximum phonation time and frequency range does not seem to significantly impact the effectiveness of voice production.
Affiliation(s)
- Marta Rogalska
  - Department of Otorhinolaryngology and Oncological Laryngology, Faculty of Medical Sciences in Zabrze, Medical University of Silesia in Katowice, Zabrze, Poland.
- Maciej Zielinski
  - Department of Otorhinolaryngology and Oncological Laryngology, Faculty of Medical Sciences in Zabrze, Medical University of Silesia in Katowice, Zabrze, Poland
- Lukasz Antkowiak
  - Department of Pediatric Neurosurgery, Medical University of Silesia in Katowice, Katowice, Poland
- Anna Kasperczuk
  - Faculty of Mechanical Engineering, Institute of Biomedical Engineering, Bialystok University of Technology, Bialystok, Poland
- Maciej Misiolek
  - Department of Otorhinolaryngology and Oncological Laryngology, Faculty of Medical Sciences in Zabrze, Medical University of Silesia in Katowice, Zabrze, Poland
9
Howard DM. Influence of vowels on note pitching in a cappella vocal quartet performance. J Voice 2023:S0892-1997(23)00369-7. [PMID: 38036379 DOI: 10.1016/j.jvoice.2023.11.008] [Received: 09/07/2023] [Revised: 11/07/2023] [Accepted: 11/07/2023] [Indexed: 12/02/2023]
Abstract
This paper notes that accurate tuning in a cappella (unaccompanied) choral music is a two-step process requiring (1) making pitch shifts on individual notes to sing intervals in just intonation, thereby providing beat-free tuning for the consonant musical intervals that underpin the underlying harmony, and (2) tuning different vowels to account for any additional pitch changes that are made in the context of the intrinsic pitch relating to the specific vowel being produced. This paper explores the second of these through part-by-part vowel variation against a chord sustained by the other three parts of a quartet of professional singers. OBJECTIVE This study aimed to investigate the relationship between the fundamental frequencies employed when singing different vowels on the same pitch for each member in turn of a professional soprano, alto, tenor, bass (SATB) a cappella vocal quartet in terms of the measured fundamental frequency during the production of a set of individual vowels within a four-part carrier phrase. STUDY DESIGN This is an experimental singing production study on the tuning of vowels in the context of a cappella quartet singing. To facilitate this, one singer at a time in a professional a cappella vocal quartet changed the vowel they were singing on a single note while the other three singers sustained a consonant a cappella triad on a fixed vowel. METHODS/DESIGN The first few bars of the a cappella anthem "If ye love me" by Thomas Tallis provided a carrier phrase that ended with a held chord on the word "me" sung by three singers, against which the remaining singer produced a set of different vowels on their note of the chord. This was carried out for each of the four singers. The hypothesis being investigated is that, to remain accurately in tune, fundamental frequency should be varied depending on the vowel being sung.
Fundamental frequencies were measured using electrolaryngographs to ensure that there was no acoustic interference that could affect the accuracy of the fundamental frequency measurements if they were obtained from the audio output. RESULTS The results provide clear evidence of changes being made to the fundamental frequencies of different vowels in an a cappella quartet context, and these changes confirm variations found elsewhere relating to the perceived pitches of different vowels. CONCLUSIONS Measurable and consistent fundamental frequency variations occur when one part tunes different vowels against a reference chord sung by the other three singers in a choral quartet context on a fixed vowel. This has a direct consequence for tuning in a cappella choral music. The importance of carefully tuning individual notes for different vowels in a cappella choral singing requires focused listening to the pitches of the sounds being produced by the other singers in the choir. Usually, all parts are singing common vowels where it is important that the vowels are matched by being carefully blended together and tuning is aimed at being beat-free in just intonation. When compositions require parts to sing different vowels, both intrinsic (production related) and auditory (hearing related) pitch variations become relevant and challenge beat-free tuning.
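The abstract above contrasts beat-free tuning in just intonation with the pitch shifts singers must make on individual notes. As a minimal illustration of the size of such shifts, the sketch below compares two just-intonation intervals with their equal-tempered counterparts in cents (1200 cents per octave). The interval ratios are standard music-theory values; the dictionary and function names are hypothetical, and none of the numbers are measurements from the study.

```python
import math

# Standard just-intonation frequency ratios for two consonant intervals.
JUST_RATIOS = {"major third": 5 / 4, "perfect fifth": 3 / 2}
# The same intervals in 12-tone equal temperament, in semitones.
EQUAL_TEMPERED_SEMITONES = {"major third": 4, "perfect fifth": 7}


def cents(ratio: float) -> float:
    """Convert a frequency ratio to cents (1200 cents per octave)."""
    return 1200.0 * math.log2(ratio)


def just_minus_equal_cents(interval: str) -> float:
    """Offset of the just interval from the equal-tempered one, in cents."""
    return cents(JUST_RATIOS[interval]) - 100.0 * EQUAL_TEMPERED_SEMITONES[interval]


for name in JUST_RATIOS:
    print(f"{name}: {just_minus_equal_cents(name):+.2f} cents")
```

A just major third sits roughly 14 cents below its equal-tempered version, while a just perfect fifth is about 2 cents above, which gives a sense of scale for the note-level adjustments described in step (1) of the abstract.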
Affiliation(s)
- David M Howard
  - Biosignals and Intelligent Systems Research Group, Department of Electronic Engineering, Royal Holloway, University of London, Egham, UK.
10
Toppo R, Sinha S. The Acoustics of Gender in Indian English: Toward Forensic Profiling in a Multilingual Context. J Voice 2023:S0892-1997(23)00239-4. [PMID: 37748969 DOI: 10.1016/j.jvoice.2023.07.030] [Received: 06/04/2023] [Revised: 07/31/2023] [Accepted: 07/31/2023] [Indexed: 09/27/2023]
Abstract
The present study is an acoustic analysis of Indian English, specifically examining the speech patterns and characteristics of three different groups with different native languages. This study investigates fundamental frequency (fo), fo range, fo variance, formant frequencies, and vowel space size in 42 native male and female speakers of Odia, Bangla, and Hindi. Furthermore, it investigates the potential correlation between fundamental frequency and vowel space, examining whether variations in vowel space size could be influenced by gender-specific perceptual factors. The paper emphasizes that in a multilingual context, gender identification can be efficiently correlated with both fo and formant frequencies. To measure a range of acoustic characteristics, speech samples were collected from the recording task. Analysis was performed in PRAAT. The study revealed significant differences between genders for the examined acoustic characteristics. Results indicate differences in the size of gender-specific variations among the language groups, with females exhibiting more significant differences in fo, formant frequencies, and vowel space than males. The findings show no significant correlation between fo and vowel space area, indicating that other features are responsible for the larger vowel space of female speakers. These findings display significant potential toward creating a robust empirical framework for gender profiling that can be utilized in a wide range of forensic linguistics investigations.
Affiliation(s)
- Ravina Toppo
  - Department of Humanities and Social Sciences, Indian Institute of Technology Patna, Bihar, India.
- Sweta Sinha
  - Department of Humanities and Social Sciences, Indian Institute of Technology Patna, Bihar, India
11
Dong L. Loudness and Pitch of Emotional Stage Speech in Kunqu Opera. J Voice 2023; 37:806.e1-806.e7. [PMID: 34090741 DOI: 10.1016/j.jvoice.2021.04.021] [Received: 02/11/2021] [Revised: 04/24/2021] [Accepted: 04/27/2021] [Indexed: 11/26/2022]
Abstract
Equivalent sound level (Leq), sound pressure level (SPL) and fundamental frequency (F0) were analyzed in the stage speech of six Kunqu Opera roles: Young woman, Old woman, Young man, Old man, Colorful face and Clown. The roles differ in gender, age, personality and phonation types. Differences among emotions (neutral, sad, angry and happy), singers and roles were examined. For most roles, more similarities were observed between neutral and sad stage speech and between angry and happy stage speech. In most cases, the latter group showed higher Leq, mean SPL and mean F0 and larger standard deviation (SD) of SPL difference than the former. Some parameters, such as SD of SPL, the mean of SPL difference and the difference between Leq and mean SPL, also showed intra-group differences. The Young woman, Young man and Old woman roles were similar in some parameters. The Colorful face and Old man roles showed many similarities. The Clown role showed the fewest similarities with the other roles. With regard to gender and age, young roles showed smaller SPL difference, larger correlation coefficient between F0 and SPL and larger mean SPL differences between emotions than old roles; female roles had greater parameter consistency and larger correlation coefficient between F0 and SPL than male roles. The personality and phonation types also affected the characteristics of loudness and pitch. This study showed the importance of speakers' characteristics in emotional speech analysis.
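The abstract above compares Leq with mean SPL. As a rough illustration of why the two differ, the following sketch computes both from a list of frame-level SPL values. It assumes equal-duration frames and uses hypothetical input values; the function names are illustrative and this is not the procedure used in the study.

```python
import math

def equivalent_level(spl_db):
    """Energy average (Leq) of SPL values in dB, assuming equal-duration frames."""
    mean_energy = sum(10 ** (level / 10) for level in spl_db) / len(spl_db)
    return 10 * math.log10(mean_energy)

def mean_level(spl_db):
    """Arithmetic mean of the SPL values in dB."""
    return sum(spl_db) / len(spl_db)

# Hypothetical frame levels in dB, chosen only for illustration.
frames = [60.0, 70.0, 80.0]
leq = equivalent_level(frames)
mean_spl = mean_level(frames)
print(f"Leq = {leq:.2f} dB, mean SPL = {mean_spl:.2f} dB, difference = {leq - mean_spl:.2f} dB")
```

Because Leq averages energy rather than decibel values, it is dominated by the loudest frames, so the difference between Leq and mean SPL grows as the level varies more.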
Affiliation(s)
- Li Dong
- School of Humanities, Shenzhen University, Shenzhen, China.
| |
|
12
|
Haddou N, Idrissi N, Ben Jebara S. Analysis of Voice Quality After Thyroid Surgery. J Voice 2023:S0892-1997(23)00208-4. [PMID: 37612171 DOI: 10.1016/j.jvoice.2023.06.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/17/2023] [Revised: 06/29/2023] [Accepted: 06/29/2023] [Indexed: 08/25/2023]
Abstract
OBJECTIVES Voice changes are a common complication after a thyroidectomy, which is a surgical procedure involving partial or total removal of the thyroid gland. The main objective of this work is to examine the possible voice disorders after thyroid surgery. More precisely, it is an investigation of partial and total thyroidectomy, as well as the effects that cancerous and noncancerous thyroid glands can have on postsurgical vocal changes and their association with age and gender. METHODS Patients were evaluated using acoustic voice parameters, including harmonics-to-noise ratio (HNR), fundamental frequency (F0), jitter, speaker phonation frequency (SPF) range, cepstral peak prominence (CPP), maximum phonational frequency range (MPFR), and shimmer at the preoperative stage and postoperatively at 1 day and 1 month. RESULTS Results demonstrated a significant change in F0 parameters, SPF range, and CPP feature 1 month after surgery, depending on the type of thyroidectomy and thyroid pathology. No significant changes were observed in the HNR, shimmer, and jitter features. Age was associated with the CPP parameter in the entire sample. In contrast, the MPFR parameter was also related to the type of thyroidectomy in the entire sample. However, maximum F0 was significantly associated with the type of thyroidectomy, specifically in the female sample. CONCLUSIONS Results indicated that a thyroidectomy can have a negative impact on voice quality. The age and type of thyroidectomy performed are not responsible for this change. Potentially this change can be due to factors such as nerve damage or the subjects' experience, such as job, anxiety, and their physical condition, as well as treatments they may have undergone before thyroidectomy. Further efforts are needed to fully understand the background of voice changes after thyroidectomy.
Affiliation(s)
- Noura Haddou
- Data4Earth Laboratory, Faculty of Science and Technology, Sultan Moulay Slimane University, Beni-Mellal, Morocco.
| | - Najlae Idrissi
- Data4Earth Laboratory, Faculty of Science and Technology, Sultan Moulay Slimane University, Beni-Mellal, Morocco
| | - Sofia Ben Jebara
- Higher School of Communication of Tunis, Research Lab COSIM, Carthage University, Tunis, Tunisia
| |
|
13
|
Serry MA, Alzamendi GA, Zañartu M, Peterson SD. Modeling the influence of the extrinsic musculature on phonation. Biomech Model Mechanobiol 2023; 22:1365-1378. [PMID: 37169957 PMCID: PMC10529543 DOI: 10.1007/s10237-023-01724-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/02/2022] [Accepted: 04/11/2023] [Indexed: 05/13/2023]
Abstract
Neck muscles play important roles in various physiological tasks, including swallowing, head stabilization, and phonation. The mechanisms by which neck muscles influence phonation are not well understood, with conflicting reports on the change in fundamental frequency for ostensibly the same neck muscle activation scenarios. In this work, we introduce a reduced-order muscle-controlled vocal fold model, comprising both intrinsic muscle control and extrinsic muscle effects. The model predicts that when the neck muscles pull the thyroid cartilage in the superior-anterior direction (with a sufficiently large anterior component), inferior direction, or inferior-anterior direction, tension in the vocal folds increases, leading to fundamental frequency rise during sustained phonation. On the other hand, pulling in the superior direction, superior-posterior direction, or inferior-posterior direction (with a sufficiently large posterior component) tends to decrease vocal fold tension and phonation fundamental frequency. Varying the pulling force location alters the posture and phonation biomechanics, depending on the force direction. These findings suggest potential roles of particular neck muscles in modulating phonation fundamental frequency, with implications for vocal hyperfunction.
Affiliation(s)
- Mohamed A Serry
- Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario, N2L 3G1, Canada
| | - Gabriel A Alzamendi
- Institute for Research and Development on Bioengineering and Bioinformatics (IBB), CONICET-UNER, Oro Verde, Entre Ríos, 3100, Argentina
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Valparaíso, Chile
| | - Sean D Peterson
- Mechanical and Mechatronics Engineering, University of Waterloo, Waterloo, Ontario, N2L 3G1, Canada.
| |
|
14
|
Friedman L, Lauber M, Behroozmand R, Fogerty D, Kunecki D, Berry-Kravis E, Klusek J. Atypical vocal quality in women with the FMR1 premutation: an indicator of impaired sensorimotor control. Exp Brain Res 2023; 241:1975-1987. [PMID: 37347418 PMCID: PMC10863608 DOI: 10.1007/s00221-023-06653-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/07/2023] [Accepted: 06/13/2023] [Indexed: 06/23/2023]
Abstract
Women with the FMR1 premutation are susceptible to motor involvement related to atypical cerebellar function, including risk for developing fragile X tremor ataxia syndrome. Vocal quality analyses are sensitive to subtle differences in motor skills but have not yet been applied to the FMR1 premutation. This study examined whether women with the FMR1 premutation demonstrate differences in vocal quality, and whether such differences relate to FMR1 genetic, executive, motor, or health features of the FMR1 premutation. Participants included 35 women with the FMR1 premutation and 45 age-matched women without the FMR1 premutation who served as a comparison group. Three sustained /a/ vowels were analyzed for pitch (mean F0), variability of pitch (standard deviation of F0), and overall vocal quality (jitter, shimmer, and harmonics-to-noise ratio). Executive, motor, and health indices were obtained from direct and self-report measures and genetic samples were analyzed for FMR1 CGG repeat length and activation ratio. Women with the FMR1 premutation had a lower pitch, larger pitch variability, and poorer vocal quality than the comparison group. Working memory was related to harmonics-to-noise ratio and shimmer in women with the FMR1 premutation. Vocal quality abnormalities differentiated women with the FMR1 premutation from the comparison group and were evident even in the absence of other clinically evident motor deficits. This study supports vocal quality analyses as a tool that may prove useful in the detection of early signs of motor involvement in this population.
Affiliation(s)
- Laura Friedman
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, USA
| | - Meagan Lauber
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, USA
| | - Roozbeh Behroozmand
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, USA
| | - Daniel Fogerty
- Department of Speech and Hearing Science, University of Illinois Urbana-Champaign, Champaign, USA
| | - Dariusz Kunecki
- Department of Pediatrics, Rush University Medical Center, Chicago, USA
| | | | - Jessica Klusek
- Department of Communication Sciences and Disorders, University of South Carolina, Columbia, USA.
| |
|
15
|
Bulling LJ, Hilpert P, Bertschi IC, Ivic A, Bodenmann G. Associations Between Vocal Arousal and Dyadic Coping During Couple Interactions After a Stress Induction. Int J Appl Posit Psychol 2023; 8:187-204. [PMID: 38162834 PMCID: PMC10754724 DOI: 10.1007/s41042-023-00087-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 01/14/2023] [Indexed: 01/03/2024]
Abstract
It is well known that although relationship external stressors can harm couples, dyadic coping behavior can buffer the negative effects of stress. Thus far, however, less is known about how vocally encoded stress (i.e., f0) might affect the stress-coping process in couples during an interaction. Therefore, the goal of the current study was to compare two different stress hypotheses (i.e., paraverbal communication stress hypothesis and emotional resonance hypothesis). We observed 187 mixed-gender couples (N = 374 participants) interacting naturally after an experimental stress induction (Trier Social Stress Test), for which couples were randomly allocated into three groups (women stressed, men stressed, and both stressed). Results of a multi-group actor-partner interdependence mediation model (APIMeM) show that either the paraverbal communication stress hypothesis or the emotional resonance hypothesis could be confirmed, depending on whether the man, the woman, or both partners were stressed.
Affiliation(s)
- Lisanne J. Bulling
- Department of Psychology, Clinical Psychology for Children/Adolescents & Couples/Families, University of Zurich, Binzmühlestrasse 14/23, Zürich, CH – 8050 Switzerland
| | - Peter Hilpert
- Department of Psychology, Université de Lausanne, Lausanne, Switzerland
| | - Isabella C. Bertschi
- Department of Psychology, Clinical Psychology for Children/Adolescents & Couples/Families, University of Zurich, Binzmühlestrasse 14/23, Zürich, CH – 8050 Switzerland
| | - Ana Ivic
- Family Translational Research Group, New York University, New York, United States
| | - Guy Bodenmann
- Department of Psychology, Clinical Psychology for Children/Adolescents & Couples/Families, University of Zurich, Binzmühlestrasse 14/23, Zürich, CH – 8050 Switzerland
| |
|
16
|
Tseng WH, Chang CC, Chiu HL, Hsiao TY, Yang TL. Effects of surgery on the relationship between subglottic pressure and fundamental frequency in vocal fold dynamics in patients with benign laryngeal diseases. Eur Arch Otorhinolaryngol 2023; 280:1283-1290. [PMID: 36136150 DOI: 10.1007/s00405-022-07662-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/09/2022] [Accepted: 09/14/2022] [Indexed: 02/07/2023]
Abstract
PURPOSE Subglottic pressure (Ps) and fundamental frequency (F0) play important roles in governing vocal fold (VF) dynamics. Theoretical description, model simulation, excised larynx and animal models have been used in previous studies, yet clinically applicable measurements are still lacking. This study aimed to evaluate the effects of surgery for benign laryngeal lesions by investigating the relationship between F0 and Ps. METHODS Patients with benign laryngeal lesions who underwent phonosurgery were prospectively recruited. Participants were instructed to sustain voicing the vowel /o/ at three incremental frequencies four semitones apart in the modal register (F01, F02, F03). F0 was estimated by VF vibration on the accelerometer. Ps change was achieved and measured using the airflow interruption method. RESULTS Thirteen patients with a mean age (SD) of 43.5 (12.4) years were included. The change in F0 per unit change of Ps, which is the slope (Hz/kPa) of the regression line of the frequency-pressure data pairs, decreased as the tension of the VF increased. The slopes significantly increased after the operation for F01 and F02 (36.43 ± 14.68 preoperatively, 53.91 ± 30.71 postoperatively, p = 0.011 and 26.02 ± 10.71; 34.85 ± 17.92, p = 0.046, respectively). In addition, there was a significant decrease in phonation threshold pressure and improvements in the grade, roughness, breathiness, asthenia, strain scale, and the voice handicap inventory-10. CONCLUSIONS The relationship between F0 and Ps may serve as an objective assessment of the outcomes in the treatment of benign laryngeal diseases with clinical relevance.
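The slope described above (change in F0 per unit change of Ps, in Hz/kPa) is the slope of an ordinary least-squares line through frequency-pressure pairs. A minimal sketch of that calculation follows; the function name and the sample values are hypothetical and are not data from the study.

```python
def frequency_pressure_slope(pressures_kpa, frequencies_hz):
    """Least-squares slope (Hz/kPa) of F0 regressed on subglottic pressure."""
    if len(pressures_kpa) != len(frequencies_hz) or len(pressures_kpa) < 2:
        raise ValueError("need at least two matched (Ps, F0) pairs")
    n = len(pressures_kpa)
    mean_p = sum(pressures_kpa) / n
    mean_f = sum(frequencies_hz) / n
    # Sum of squared deviations of pressure, and cross-deviations with frequency.
    sxx = sum((p - mean_p) ** 2 for p in pressures_kpa)
    sxy = sum((p - mean_p) * (f - mean_f)
              for p, f in zip(pressures_kpa, frequencies_hz))
    if sxx == 0:
        raise ValueError("pressure values must not all be identical")
    return sxy / sxx

# Hypothetical data pairs lying exactly on F0 = 180 + 36 * Ps.
ps = [0.5, 0.7, 0.9, 1.1]
f0 = [198.0, 205.2, 212.4, 219.6]
print(f"slope = {frequency_pressure_slope(ps, f0):.2f} Hz/kPa")
```

A steeper slope means F0 responds more strongly to a given pressure change, which is how the pre- and postoperative values reported above can be compared.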
Affiliation(s)
- Wen-Hsuan Tseng
- Department of Otolaryngology, National Taiwan University Hospital and National Taiwan University College of Medicine, #1, Sec. 1, Jen-Ai Road, Taipei, 100, Taiwan; Graduate Institute of Clinical Medicine, National Taiwan University College of Medicine, Taipei, Taiwan
| | - Chi-Chin Chang
- Department of Speech Language Pathology and Audiology, National Taipei University of Nursing and Health Sciences, Taipei, Taiwan
| | - Hsiang-Ling Chiu
- Department of Otolaryngology, National Taiwan University Hospital and National Taiwan University College of Medicine, #1, Sec. 1, Jen-Ai Road, Taipei, 100, Taiwan
| | - Tzu-Yu Hsiao
- Department of Otolaryngology, National Taiwan University Hospital and National Taiwan University College of Medicine, #1, Sec. 1, Jen-Ai Road, Taipei, 100, Taiwan
| | - Tsung-Lin Yang
- Department of Otolaryngology, National Taiwan University Hospital and National Taiwan University College of Medicine, #1, Sec. 1, Jen-Ai Road, Taipei, 100, Taiwan; Graduate Institute of Clinical Medicine, National Taiwan University College of Medicine, Taipei, Taiwan; Research Center for Developmental Biology and Regenerative Medicine, National Taiwan University, Taipei, Taiwan.
| |
|
17
|
Li Q, Millard K, Tetnowski J, Narayana S, Cannito M. Acoustic Analysis of Intonation in Persons With Parkinson's Disease Receiving Transcranial Magnetic Stimulation and Intensive Voice Treatment. J Voice 2023; 37:203-214. [PMID: 33461880 DOI: 10.1016/j.jvoice.2020.12.019] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 08/20/2020] [Revised: 12/14/2020] [Accepted: 12/15/2020] [Indexed: 11/16/2022]
Abstract
Intonation is one of the prosodic features manifested acoustically in the fundamental frequency (f0). Intonation abnormality is common and prominent in the speech of persons with Parkinson's disease (PD). The current research investigated acoustically five intonational features including f0 declination, f0 resetting, sentence stress, terminal fall, and syntactic prejunctural fall in 20 PD participants, receiving Lee Silverman Voice Treatment (LSVT)-LOUD alone, or combined with transcranial magnetic stimulation delivered to the left or right primary laryngeal motor cortex. The results revealed that f0 declination, sentence stress, and terminal fall changed significantly from pre- to post-treatment, and the changes of declination and terminal fall were maintained at the follow-up evaluations. The observed changes in intonation were attributed to LSVT alone, which caused large changes of f0 magnitude. f0 resetting and syntactic prejunctural fall did not change significantly following treatment, probably because these intonational features need very precise fine motor control of the intrinsic laryngeal muscles to make small-range, rapid f0 adjustments, which were not improved by LSVT in the present PD participants. Difficulties with syntactic processing previously reported in PD may have also contributed to the lack of improvement in resetting and prejunctural fall, since these f0 features are used to mark syntactic boundaries within utterances.
Affiliation(s)
- Qiang Li
- Department of Communication Sciences and Disorders, Fort Hays State University, Hays, Kansas.
| | - Kelly Millard
- Department of Communicative Disorders, University of Louisiana at Lafayette, Lafayette, Louisiana
| | - John Tetnowski
- Department of Communicative Disorders, University of Louisiana at Lafayette, Lafayette, Louisiana
| | - Shalini Narayana
- Department of Pediatrics, Department of Anatomy and Neurology, University of Tennessee Health Science Center, Memphis, Tennessee
| | - Michael Cannito
- Department of Communicative Disorders, University of Louisiana at Lafayette, Lafayette, Louisiana
| |
|
18
|
Frosolini A, Fantin F, Tundo I, Pessot N, Badin G, Bartolotta P, Vedovelli L, Marioni G, de Filippis C. Voice Parameters in Children With Cochlear Implants: A Systematic Review and Meta-Analysis. J Voice 2023:S0892-1997(23)00021-8. [PMID: 36868956 DOI: 10.1016/j.jvoice.2023.01.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/27/2022] [Revised: 01/15/2023] [Accepted: 01/17/2023] [Indexed: 03/05/2023]
Abstract
INTRODUCTION An intact auditory system is essential for the development and maintenance of voice quality and speech prosody. Conversely, hearing loss affects the adjustment and appropriate use of the organs involved in speech and voice production. Spectro-acoustic voice parameters have been evaluated in cochlear implant (CI) users, and the authors of previous systematic reviews on the topic concluded that fundamental frequency (F0) seemed preliminarily the most reliable parameter to evaluate voice alterations in adult CI users. The main aim of this systematic review and meta-analysis was to clarify the vocal parameters and prosodic alterations of speech in pediatric CI users. MATERIALS AND METHODS The protocol of the systematic review was registered on the PROSPERO database, International prospective register of systematic reviews. We conducted a search of the English literature published in the period between January 1, 2005 and April 1, 2022 on the PubMed and Scopus databases. A meta-analysis was conducted to compare the values of voice acoustic parameters in CI users and non-hearing-impaired controls. The analysis was conducted using the standardized mean difference as the outcome measure. A random-effects model was fitted to the data. RESULTS A total of 1334 articles were initially evaluated using title and abstract screening. After applying inclusion/exclusion criteria, 20 articles were considered suitable for this review. The age of the cases ranged between 25 and 132 months at examination. The most studied parameters were F0, jitter, shimmer and harmonics-to-noise ratio (HNR); other parameters were seldom reported. A total of 11 studies were included in the meta-analysis of F0, with the majority of estimates being positive (75%); the estimated average standardized mean difference based on the random-effects model was 0.3033 (95% CI: 0.0605 to 0.5462; P = 0.0144). For jitter (0.2229; 95% CI: -0.1862 to 0.7986; P = 0.2229) and shimmer (0.2540; 95% CI: -0.1404 to 0.6485; P = 0.2068) there was a trend toward positive values without reaching statistical significance. DISCUSSION AND CONCLUSIONS This meta-analysis confirmed that higher F0 values have been observed in the pediatric population of CI users compared to age-matched normal hearing volunteers, whereas the parameters of voice noise were not significantly different between cases and controls. Prosodic aspects of language need further investigation. In longitudinal contexts, prolonged auditory experience with CI has brought voice parameters closer to the norm. Following the available evidence, we stress the utility of including vocal acoustic analysis in the clinical evaluation and follow-up of CI patients to optimize the rehabilitation process of pediatric patients with hearing loss.
Affiliation(s)
- Andrea Frosolini
- Department of Neuroscience DNS, Audiology Unit at Treviso Hospital, University of Padova, Treviso, Italy; Department of Maxillo-Facial Surgery, Policlinico Le Scotte, University of Siena, Siena, Italy.
| | - Francesco Fantin
- Department of Neuroscience DNS, Audiology Unit at Treviso Hospital, University of Padova, Treviso, Italy
| | - Isabella Tundo
- Department of Neuroscience DNS, Audiology Unit at Treviso Hospital, University of Padova, Treviso, Italy; Department of Medical and Surgical Sciences and Advanced Technologies "G.F. Ingrassia", ENT Section, A.O.U. Policlinico "G.Rodolico-San Marco", University of Catania, Catania, Italy
| | - Nicholas Pessot
- Department of Neuroscience DNS, Audiology Unit at Treviso Hospital, University of Padova, Treviso, Italy
| | - Giulio Badin
- Department of Neuroscience DNS, Audiology Unit at Treviso Hospital, University of Padova, Treviso, Italy; Department of Neuroscience DNS, Otolaryngology Section, University of Padova, Padova, Italy
| | - Patrizia Bartolotta
- Epidemiology, and Public Health, Department of Cardiac, Thoracic, Vascular Sciences, and Public Health, Unit of Biostatistics, University of Padova, Padova, Italy
| | - Luca Vedovelli
- Epidemiology, and Public Health, Department of Cardiac, Thoracic, Vascular Sciences, and Public Health, Unit of Biostatistics, University of Padova, Padova, Italy
| | - Gino Marioni
- Department of Neuroscience DNS, Otolaryngology Section, University of Padova, Padova, Italy
| | - Cosimo de Filippis
- Department of Neuroscience DNS, Audiology Unit at Treviso Hospital, University of Padova, Treviso, Italy
| |
|
19
|
Lester-Smith RA, Kim JH, Hilger A, Chan CL, Larson CR. Auditory-Motor Control of Fundamental Frequency in Vocal Vibrato. J Voice 2023; 37:296.e9-296.e19. [PMID: 33461882 PMCID: PMC8282806 DOI: 10.1016/j.jvoice.2020.12.049] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 09/25/2020] [Revised: 12/16/2020] [Accepted: 12/21/2020] [Indexed: 11/21/2022]
Abstract
PURPOSE The purpose of this study was to investigate how classically trained singers use their auditory feedback to control fundamental frequency (fo) during production of vocal vibrato. Two main questions were addressed: (1) Do singers produce reflexive fo responses to sudden perturbation of the fo of their auditory feedback during production of vibrato indicative of feedback control? (2) Do singers produce adaptive fo responses to repeated perturbation of the fo of their auditory feedback during production of vibrato indicative of feedback and feedforward control? In addition, one methodological question was addressed to determine if adaptive fo responses were more precisely assessed with or without an auditory cue for fo during the repeated fo perturbation paradigm. METHOD Ten classically trained singers produced sustained vowels with vibrato while the fo and harmonics of their auditory feedback were suddenly perturbed by 100 cents to assess reflexive control or repeatedly perturbed by 100 cents to assess adaptive control. Half of the participants completed the repeated perturbation experiment with an auditory cue for fo, and the other half completed the experiment without an auditory cue for fo. Acoustical analyses measured changes in mean fo in response to the auditory feedback perturbations. RESULTS On average, participants produced compensatory responses to both sudden and repeated perturbation of the fo of their auditory feedback. The magnitude of the responses to repeated perturbations was larger than the responses to sudden perturbations. Responses were also larger in the cued, repeated fo perturbation experiment than in the uncued, repeated fo perturbation experiment. CONCLUSIONS These findings indicate that classically trained singers use both feedforward and feedback mechanisms to control their average fo during production of vibrato. When compared to prior studies of singers producing a steady voice, the reflexive fo responses were larger in the current study, which may indicate that the feedback control system is engaged more during production of vibrato.
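The perturbations above are specified in cents, a logarithmic unit in which 100 cents equals one equal-tempered semitone and 1200 cents equals one octave. The sketch below converts between cents and frequency ratios to show what a 100-cent shift means in hertz; the function names and the 220 Hz example are illustrative assumptions, not details from the study.

```python
import math

def cents_between(f_hz, ref_hz):
    """Interval from ref_hz to f_hz in cents (1200 cents per octave)."""
    return 1200 * math.log2(f_hz / ref_hz)

def shift_by_cents(f_hz, cents):
    """Frequency obtained by shifting f_hz by the given number of cents."""
    return f_hz * 2 ** (cents / 1200)

# Hypothetical sung fo of 220 Hz shifted upward by 100 cents.
original = 220.0
shifted = shift_by_cents(original, 100)
print(f"{original:.1f} Hz shifted by 100 cents is {shifted:.2f} Hz")
print(f"interval back to the original: {cents_between(original, shifted):.1f} cents")
```

Because the unit is logarithmic, the same 100-cent perturbation corresponds to a larger change in hertz for a singer with a higher fo, which is why responses are usually compared in cents rather than hertz.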
Affiliation(s)
- Rosemary A Lester-Smith
- Department of Physical Medicine & Rehabilitation, Feinberg School of Medicine, Northwestern University, Chicago, Illinois.
| | - Jason H Kim
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois
| | - Allison Hilger
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois
| | - Chun-Liang Chan
- Department of Linguistics, Northwestern University, Evanston, Illinois
| | - Charles R Larson
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois
| |
|
20
|
Santausa FM, Nusdwinuringtyas N, Ulianna Tambunan TF, Friska D. The correlation between six-minute walking distance and maximum phonation time in healthy adults. Turk J Phys Med Rehabil 2023; 69:40-45. [PMID: 37201010 PMCID: PMC10186023 DOI: 10.5606/tftrd.2023.10944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/12/2022] [Accepted: 08/08/2022] [Indexed: 05/20/2023] Open
Abstract
Objectives This study aimed to determine the association between six-minute walking distance (6MWD) and maximum phonation time (MPT) in healthy adults. Patients and methods The cross-sectional study was conducted with 50 sedentary nonsingers (32 females, 18 males; mean age: 33.5±8.3 years; range, 18 to 50 years) between February 2021 and April 2021. Subjects with a history of smoking, respiratory symptoms in the last two weeks, and heart, lung, musculoskeletal, and balance problems were excluded. The measurements of MPT and 6MWD were carried out by two different assessors blinded to each other. Results The mean MPT was higher in male subjects (27.4±7.4 sec vs. 20.6±5.1 sec, p<0.001). In the bivariate analysis, there was a significant correlation between MPT and 6MWD (r=0.621, p<0.001), as well as body height (r=0.421, p=0.002) and the mean fundamental frequency (r=-0.429, p=0.002); however, no association was found with age, body weight, and the mean sound pressure level. After multiple regression, 6MWD was the only factor associated with MPT (p=0.002). Conclusion There is a significant association between 6MWD and MPT in healthy adults, and the results suggest that aerobic capacity might have a role in improving the ability to sustain phonation.
Affiliation(s)
- Febrian Mulya Santausa
- Department of Medical Rehabilitation, Faculty of Medicine, Universitas Indonesia, Central Jakarta, Indonesia
| | - Nury Nusdwinuringtyas
- Department of Medical Rehabilitation, Faculty of Medicine, Universitas Indonesia, Central Jakarta, Indonesia
| | | | - Dewi Friska
- Department of Community Medicine, Faculty of Medicine, Universitas Indonesia, Central Jakarta, Indonesia
| |
|
21
|
Edmonds TJ, Howard DM. An Investigation in the Measurable Differences between Pitch Perception in the Voice and Pitch Perception of External Sound Sources. J Voice 2023:S0892-1997(22)00378-2. [PMID: 36710198 DOI: 10.1016/j.jvoice.2022.11.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/29/2022] [Revised: 11/17/2022] [Accepted: 11/18/2022] [Indexed: 01/31/2023]
Abstract
OBJECTIVES Pitch perception is an important part of accurate singing. Therefore, accurate singing requires the ability to accurately assess the pitch of one's own voice. This study had two objectives: the first was to investigate whether there is a measurable difference between the pitch perceived in one's own voice and the pitch perceived from an external sound source; the second was to measure the effects of occlusion on pitch accuracy over a melodic phrase. STUDY DESIGN We recruited 16 participants for this study. The study designed to investigate the perceptual difference was split into two parts. The first was a one-to-one pitch-matching test in which participants recreated a pitch by singing and by matching external pitches. The second was singing the familiar song 'Happy Birthday', which was used to measure pitch accuracy over a melodic phrase and to measure the effects of occlusion on pitch accuracy while singing. METHODS The one-to-one study involved singing back a series of 5 notes on a set vowel; the same 5 notes were used when matching them to a series of possible pitches in the button test. The melodic test was to sing 'Happy Birthday' 3 times: the first normally, the second wearing headphones to occlude the ears and reduce air-conducted hearing, and the third with white noise to mask all hearing. RESULTS The results showed higher accuracy of pitch matching with external sounds than with the voice, and the versions sung with some form of occlusion (wearing headphones or headphones with white noise) showed higher pitch accuracy. CONCLUSIONS The results of this study showed that pitch accuracy was better when comparing two external sounds and that, when singing, occlusion of some form improved pitch accuracy. This could suggest a difference between recreating pitch with the voice and matching external sound sources. Furthermore, the improvements shown from occluding the ears could further suggest a difference in pitch perception abilities between the voice and external sound sources. This could have implications for improving pitch accuracy in a studio environment.
|
22
|
Ou W, Su J, Yan P, Zhang S, Kang J, Ge P. Change of Phonation in Canines Without Vocal Fold Paralysis After Vocal Processes Resection. J Voice 2023:S0892-1997(22)00416-7. [PMID: 36599714 DOI: 10.1016/j.jvoice.2022.12.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/07/2022] [Revised: 12/16/2022] [Accepted: 12/16/2022] [Indexed: 01/04/2023]
Abstract
The effects of vocal process resection on phonation in an animal without vocal fold paralysis have not been clarified. The present study used an in vivo animal model of vocal process resection and an excised-larynx phonation model to investigate these effects. Six months after resection of the bilateral vocal processes, glottal airflow, subglottal air pressure, acoustic signals, and ultra-high-speed video images were recorded in the canine excised-larynx phonation model. Glottal aerodynamic parameters were estimated from subglottal pressure and glottal flow. Histological analyses of the scarred tissue were performed to assess completion of wound healing. In the vocal process resection group, fundamental frequency (F0) and vocal intensity decreased, and jitter and shimmer increased significantly. The phonation threshold power (PTW) of the vocal process resection group was significantly higher than that of controls. The vibratory amplitude of the posterior vocal fold and the visible vibrating length of the vocal fold increased. Expression of collagen I-III in scarred tissue samples from the resection group was similar to that in control soft tissue specimens around the vocal process cartilage, and collagen fibers formed mature thick bundles. The results suggest that F0, voice quality, and vocal intensity decreased significantly after complete wound healing following vocal process resection in canines without vocal fold paralysis. The higher PTW and the vibration of the scarred posterior vocal fold may be the dynamic reasons.
Affiliation(s)
- Weijian Ou
- Department of Otolaryngology, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Department of Otolaryngology, The First People's Hospital of Zhaoqing, 526020, Zhaoqing city, China
- Jinglin Su
- Department of Otolaryngology, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China
- Peikai Yan
- Department of Otolaryngology, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China
- Siyi Zhang
- Department of Otolaryngology, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; The First School of Clinical Medicine, South China University of Technology, Guangzhou, 510080, China
- Jing Kang
- Department of Otolaryngology, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; Department of Otolaryngology, The First People's Hospital of Zhaoqing, 526020, Zhaoqing city, China.
- Pingjiang Ge
- Department of Otolaryngology, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, China; The First School of Clinical Medicine, South China University of Technology, Guangzhou, 510080, China.
23
Kilshaw RE, Kerig PK, Baucom BRW. Vocally-Encoded Emotional Arousal as a Marker of Callous-Unemotional Traits in a Sample of Justice-Involved Adolescents. Child Psychiatry Hum Dev 2022:10.1007/s10578-022-01486-6. [PMID: 36572843 DOI: 10.1007/s10578-022-01486-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/16/2021] [Revised: 10/26/2022] [Accepted: 12/16/2022] [Indexed: 12/28/2022]
Abstract
Callous-unemotional (CU) traits are associated with severe and persistent juvenile offending. CU traits are also associated with dampened emotional arousal, which suggests that fundamental frequency (f0), a measure of vocally-encoded emotional arousal, may serve as an accessible psychophysiological marker of CU traits in youth. This study investigated the associations between f0 range measured during an emotionally evocative task, CU traits, and emotion dysregulation in a mixed-gender sample of 168 justice-involved youth. For boys, after controlling for covariates, wider f0 range (indicating greater emotional arousal) was negatively associated with CU traits and positively associated with emotion dysregulation. For girls, no significant associations with f0 range emerged; however, CU traits were positively associated with emotion dysregulation. Findings suggest that f0 range may serve as a valid indicator of CU traits in justice-involved boys, and that detained boys and girls with high CU traits are characterized by different profiles of emotion dysregulation.
Affiliation(s)
- Robyn E Kilshaw
- Department of Psychology, University of Utah, 380 S 1530 E BEH S 502, UT, 84112, Salt Lake City, USA.
- Patricia K Kerig
- Department of Psychology, University of Utah, Salt Lake City, USA
- Brian R W Baucom
- Department of Psychology, University of Utah, Salt Lake City, USA
24
Gera S, Murthy Pusuluri SN, Tripuraneni SC. Office Based Voice Evaluation in Assessing Outcome of Treatment in Laryngeal Disorders. Indian J Otolaryngol Head Neck Surg 2022; 74:5111-5116. [PMID: 36742848 PMCID: PMC9895327 DOI: 10.1007/s12070-021-02887-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/28/2021] [Accepted: 09/22/2021] [Indexed: 02/07/2023] Open
Abstract
Rapidly evolving global economic conditions have changed professional behavior. Of late, there has been a drastic increase in professional voice users and thereby in patients with voice disorders. Clinically, changes in the voice can only be perceived; they cannot be quantified and documented. A simple tool is therefore needed to document and quantify voice change objectively. This is a prospective case-control study with 50 normal control subjects (25 male and 25 female) who had no voice complaints and 50 patients (cases) who presented with a change in voice due to various voice pathologies. Voices were recorded from the control group and from the case group both before and after treatment, and were analyzed. The disease-specific frequency in various voice pathologies before and after treatment was documented. The degree of change in voice frequency before and after treatment was compared between the two time points and with the average frequency obtained from the control group. Office-based acoustic analysis can definitely be used as a simple tool for documenting voice changes on an outpatient basis, and it provides dependable documented evidence against which results following treatment can be compared.
Affiliation(s)
- Sameera Gera
- Department of ENT, Dr. Pinnamanenini Siddhartha Institute of Medical Sciences & Research Foundation, Chinnaoutpalli, Andhra Pradesh India
- S. N. Murthy Pusuluri
- Department of ENT, Dr. Pinnamanenini Siddhartha Institute of Medical Sciences & Research Foundation, Chinnaoutpalli, Andhra Pradesh India
- Satish Chandra Tripuraneni
- Department of ENT, Dr. Pinnamanenini Siddhartha Institute of Medical Sciences & Research Foundation, Chinnaoutpalli, Andhra Pradesh India
25
Duymaz YK, Surmeli M, Erzincan E, Deveci I, Çanakçı H, Sahın Yılmaz AA, Oysu C. Effect of Education and Social Anxiety on Acoustic, Perceptual and Aerodynamic Parameters of Adult Females. Indian J Otolaryngol Head Neck Surg 2022; 74:5731-5738. [PMID: 36742596 PMCID: PMC9895204 DOI: 10.1007/s12070-022-03105-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/17/2021] [Accepted: 11/28/2021] [Indexed: 02/07/2023] Open
Abstract
In this study, we hypothesized that social anxiety disorder would be more common in women with lower education levels and that this could have a negative effect on acoustic parameters. A total of 88 healthy female volunteers were enrolled in the study. These volunteers were divided into two groups: those who had graduated from elementary school or below (Group A, n = 42) and those who had graduated from high school or above (Group B, n = 46). Personal anxiety and avoidance were evaluated for all participants using the Liebowitz Social Anxiety Scale. All subjects also underwent acoustic and aerodynamic voice analysis to evaluate their objective voice quality and function. Additionally, the GRBAS scale was used for perceptual analysis. Social anxiety scores were higher in Group A than in Group B (p < 0.05). In aerodynamic sound analysis, maximal phonation time was lower in Group A than in Group B (p < 0.05). In perceptual sound analysis, the mean values of the GRBAS parameters were lower for Group B than for Group A (p < 0.05). In acoustic analysis, mean F0 was higher in Group B than in Group A (p < 0.05). However, there was no statistical difference in jitter, shimmer, NHR and HNR between the groups (p > 0.05). In this study, social anxiety and education have been shown to have an effect on voice in women. As the level of education increases, social anxiety decreases, and both maximum phonation time and fundamental frequency increase.
Affiliation(s)
- Yasar Kemal Duymaz
- Department of Otolaryngology, Umraniye Training and Research Hospital, University of Health Sciences, Istanbul, Turkey
- Department of Otorhinolaryngology, Head and Neck Surgery Department, Umraniye Training and Research Hospital, University of Health Sciences, Istanbul, Turkey
- Mehmet Surmeli
- Department of Otolaryngology, Umraniye Training and Research Hospital, University of Health Sciences, Istanbul, Turkey
- Erkal Erzincan
- Department of Psychology, Istanbul Gelisim University, Istanbul, Turkey
- Ildem Deveci
- Department of Otolaryngology, Umraniye Training and Research Hospital, University of Health Sciences, Istanbul, Turkey
- Hasan Çanakçı
- Department of Otorhinolaryngology, Balıkesir University School of Medicine, Balıkesir, Turkey
- Ayse Asli Sahın Yılmaz
- Department of Otolaryngology, Umraniye Training and Research Hospital, University of Health Sciences, Istanbul, Turkey
- Cagatay Oysu
- Department of Otolaryngology, School of Medicine, Marmara University, Istanbul, Turkey
26
LeAnn B, Claire PL. Bright Voice Quality and Fundamental Frequency Variation in Non-binary Speakers. J Voice 2022:S0892-1997(22)00234-X. [PMID: 36210223 DOI: 10.1016/j.jvoice.2022.08.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 04/20/2022] [Revised: 07/29/2022] [Accepted: 08/02/2022] [Indexed: 03/17/2023]
Abstract
OBJECTIVES 1) To investigate if vocal variation produced by assigned-female-at-birth (AFAB) non-binary people differed from vocal variation produced by cisgender (cis) participants. Cue values produced by non-binary participants were predicted to differ from those values produced by cisgender participants. 2) To determine if previous subjective assessments of bright voice quality in AFAB non-binary participants were quantifiable, and if so, if non-binary and cisgender participants differed in their voice quality production. STUDY DESIGN A quantitative comparative research design. METHODS Phonetic and statistical analyses of continuous speech samples produced by AFAB non-binary and cisgender participants. Vocal cues were mean fundamental frequency (F0) and bright voice quality, measured by cepstral peak prominence-smoothed and spectral slope, with speaker gender as the predictor. RESULTS At the group level, non-binary participants produced intermediate F0 values - significantly lower than the cis women's and significantly higher than the cis men's. Individually, the majority of non-binary participants produced mean F0 in this intermediate range. Non-binary participants produced significantly less negative spectral slope and higher cepstral peak prominence-smoothed, indicative of a brighter, more resonant voice quality. Individual-level results indicated that vocal training and vocal tract physiology did not fully account for the results found. CONCLUSION Participants' agency, particularly their motivation to alter vocal output to avoid being misgendered, has an effect on the AFAB non-binary participants' F0 production and potentially their voice quality. The majority of AFAB non-binary participants uniquely produced the cue combination of intermediate F0 and bright voice quality.
Affiliation(s)
- Brown LeAnn
- Laboratoire Parole et Langage (LPL) UMR 7309/CNRS, Aix-Marseille Université / CLESTHIA EA 7345, Sorbonne-Nouvelle Université, Paris, France.
- Pillot-Loiseau Claire
- Sorbonne-Nouvelle Université and Laboratoire de Phonétique et Phonologie (LPP) UMR 7018/CNRS, Paris, France
27
Dubey P, Fernandes JB, Bhat M. Acoustic Analysis of Voice in Laryngopharyngeal Cancers Pre and Post Radiotherapy. Indian J Otolaryngol Head Neck Surg 2022; 74:1973-1978. [PMID: 36452640 PMCID: PMC9701928 DOI: 10.1007/s12070-020-01934-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/15/2020] [Accepted: 06/23/2020] [Indexed: 12/01/2022] Open
Abstract
Laryngopharyngeal cancers are among the most commonly diagnosed head and neck malignancies and frequently present primarily with a change in voice. Radiotherapy, the main treatment modality for early cancers, continues to affect voice. Hence, acoustic analysis of the voice offers quantifiable values of several parameters that delineate the effect of the therapy. A total of 60 patients diagnosed with laryngopharyngeal cancers and undergoing radiotherapy underwent acoustic voice assessment using Dr. Speech software before treatment and at 1 and 3 months after radiotherapy. Data were analysed using the Mann-Whitney test and the Wilcoxon signed-rank test, and a significant p value was obtained. The results showed fundamental frequency (F0) and noise-to-harmonic ratio (NHR) to be the most affected, in comparison with jitter and shimmer. The F0 and NHR values across baseline evaluation, first-month and third-month follow-up showed a steady deterioration, which was significant. The deterioration noted from the first to the third month was not statistically significant. Across genders, deterioration in both F0 and NHR was greater in males than in females. Radiotherapy causes definitive alterations in some acoustic measures of voice, which make the voice disharmonic and hoarse, with contributions of harshness and breathiness. The effect is more pronounced on vocal parameters that are structure and projection based, as evidenced by the deterioration in F0 and NHR values. Acoustic parameters are likely to remain deteriorated for a longer duration, which emphasizes the need for early voice rehabilitation.
Affiliation(s)
- Priya Dubey
- Department of ENT, Father Muller Medical College, Mangalore, India
- Bhopal, India
- Jacqueline B. Fernandes
- Department of Speech and Language Pathology, Father Muller Medical College, Mangalore, India
- Mahesh Bhat
- Department of ENT, Father Muller Medical College, Mangalore, India
28
Loutrari A, Georgiadou I. Adapted melodic intonation therapy can help raise trans women's singing and speaking fundamental frequencies. Logoped Phoniatr Vocol 2022:1-9. [PMID: 36098962 DOI: 10.1080/14015439.2022.2121985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/13/2021] [Revised: 07/05/2022] [Accepted: 09/01/2022] [Indexed: 10/14/2022]
Abstract
AIM Voice interventions for transgender individuals can address several speech and voice parameters - fundamental frequency, resonance, intonation, rhythm, and intensity. In this study, we focus on fundamental voice frequency and build on existing research to test one technique that has been shown in a preliminary study to effectively adjust fundamental voice frequency in line with clients' goals. METHOD More specifically, we employed an adaptation of melodic intonation therapy (MIT) to assess whether it can raise trans women's average fundamental frequency to a significant degree. Eleven trans women participated in two one-to-one therapy sessions, four weeks apart. RESULTS Results pointed to a statistically significant rise in both their singing and speaking fundamental frequencies following the adapted MIT therapy sessions. Participants were also successful in imitating upward fundamental frequency contours when singing and in producing them independently in the speech modality. FUTURE DIRECTIONS Longitudinal studies are warranted to determine whether the observed positive results translate into long-term benefits.
Affiliation(s)
- Ariadne Loutrari
- School of Psychology & Clinical Language Sciences, University of Reading, Reading, UK
29
Ambreen S, Hamdani SZ, Anwar M. Speaking Fundamental Frequency of Native Urdu Speakers-A Pilot Study. J Voice 2022; 36:140.e23-8. [PMID: 32359806 DOI: 10.1016/j.jvoice.2020.03.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/22/2019] [Revised: 03/16/2020] [Accepted: 03/22/2020] [Indexed: 11/23/2022]
Abstract
OBJECTIVES This study aimed to find the normative fundamental frequency values for adult native Urdu speakers from Pakistan. Age and gender differences were also examined. METHOD A total of 71 participants aged 21 to 50 years (Men = 34, Women = 37) were recruited from University of the Punjab, Lahore, Pakistan. The participants were further divided into subgroups based on age: Group 1 (21-30 years), Group 2 (31-40 years), Group 3 (41-50 years). At least three voice samples were recorded for each participant in his/her habitual voice, using Praat in a sound-proof lab. An average speaking fundamental frequency (SF0) value was computed for each participant. RESULTS Two-way ANOVA was employed to investigate age and gender differences in the SF0 values. The findings revealed significant gender differences, with females having significantly higher SF0 values than males. The findings were further compared with similar data from other populations with different ethnic backgrounds. CONCLUSION This study provides baseline fundamental frequency data for native Urdu speakers. The clinical implications of the findings are also discussed.
30
Leongómez JD, Sánchez OR, Vásquez-Amézquita M, Roberts SC. Contextualising courtship: Exploring male body odour effects on vocal modulation. Behav Processes 2021; 193:104531. [PMID: 34655713 DOI: 10.1016/j.beproc.2021.104531] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 02/18/2020] [Revised: 06/26/2020] [Accepted: 10/07/2021] [Indexed: 12/21/2022]
Abstract
Voice characteristics are important to communicate socially relevant information. Recent research has shown that individuals alter their voices depending on the context of social interactions and perceived characteristics of the audience, and this affects how they are perceived. Numerous studies have also shown that the presence of bodily odours can elicit psychological changes in people. Here, we tested whether the presence of male axillary odour would influence vocal modulations in courtship contexts. We analysed differences in vocal parameters and attractiveness ratings across 950 recordings from 80 participants as they responded to opposite-sex target stimuli. Using these, we tested whether men's and women's vocal parameters and perceived stimuli attractiveness differed in the presence or absence of the odour. We expected women to speak with increased voice F0, and men to lower their pitch, when exposed to male body odour, especially if it were of high quality. However, neither the presence of male odour, its quality, nor the addition of androstadienone produced any consistent changes in vocal parameters. Nevertheless, rated stimulus attractiveness was predicted by F0 and especially F0 variability, suggesting that this is a key parameter in signalling attraction during human courtship, and supporting the idea that vocal modulations are context-sensitive.
Affiliation(s)
- Juan David Leongómez
- Human Behaviour Lab, Faculty of Psychology, Universidad El Bosque, Bogota 110121, Colombia; Division of Psychology, University of Stirling, Stirling FK9 4LA, UK.
- Oscar R Sánchez
- Human Behaviour Lab, Faculty of Psychology, Universidad El Bosque, Bogota 110121, Colombia.
- Milena Vásquez-Amézquita
- Experimental Psychology Lab, Faculty of Psychology, Universidad El Bosque, Bogota 110121, Colombia.
- S Craig Roberts
- Division of Psychology, University of Stirling, Stirling FK9 4LA, UK.
31
Sondhi S, Salhan A, Santoso CA, Doucoure M, Dharmawan DM, Sureka A, Natasha BN, Danusaputro AD, Dowson NS, Yap MSL, Hadiwidjaja MA, Veeraraghavan SG, Hatta AZR, Lee C, Megantara RA, Wihardja AN, Sharma M, Lardizabal EL, Sondhi LJ, Raina R, Vashisth S, Hedwig R. Voice processing for COVID-19 scanning and prognostic indicator. Heliyon 2021; 7:e08134. [PMID: 34632133 DOI: 10.1016/j.heliyon.2021.e08134] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 05/17/2021] [Revised: 07/25/2021] [Accepted: 10/01/2021] [Indexed: 12/24/2022] Open
Abstract
The COVID-19 pandemic has posed a serious risk of contagion to humans. There is a need to find reliable non-contact tests, such as vocal correlates of COVID-19 infection. Thirty-six volunteers of Asian ethnicity, 16 infected subjects (8M & 8F) and 20 non-infected controls (10M & 10F), participated in this study by vocalizing the vowels /a/, /e/, /i/, /o/, /u/. Voice correlates of the 16 COVID-19 positive patients, during infection and after recovery, were compared with those of the 20 non-infected controls. Compared to non-infected controls, significantly higher values of energy intensity for /o/ (p = 0.048); formant F1 for /o/ (p = 0.014); and formant F3 for /u/ (p = 0.032) were observed in male patients, while higher values of Jitter (local, abs) for /o/ (p = 0.021) and Jitter (ppq5) for /a/ (p = 0.014) were observed in female patients. However, formant F2 for /u/ (p = 0.018) and mean pitch F0 for /e/, /i/ and /o/ (p = 0.033; 0.036; 0.047) decreased for female patients under infection. Compared to recovered conditions, HNR for /e/ (p = 0.014) was higher in male patients under infection, while Jitter (rap) for /a/ (p = 0.041); Jitter (ppq5) for /a/ (p = 0.032); Shimmer (local, dB) for /i/ (p = 0.024); Shimmer (apq5) for /u/ (p = 0.019); and formant F4 for vowel /o/ (p = 0.022) were higher in female patients under infection. However, HNR for /e/ (p = 0.041) and formant F1 for /o/ (p = 0.002) were lower in female patients compared to their recovered conditions. The results support the hypothesis, since changes in voice parameters were observed in the infected patients which can be correlated to a combination of acoustic measures like fundamental frequency, formant characteristics, HNR, and voice perturbations like jitter and shimmer for different vowels. Thus, voice analysis can be used for scanning and prognosis of COVID-19 infection. Based on the findings of this study, a mobile application can be developed to analyze human voice in real time to detect COVID-19 symptoms for remedial measures and necessary action.
32
Ersoz Unlu C, Karacayli C, Ocal FCA. Acoustic and perceptual voice parameters in subclinical and overt primary hypothyroidism. Auris Nasus Larynx 2021:S0385-8146(21)00239-X. [PMID: 34615613 DOI: 10.1016/j.anl.2021.09.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 06/11/2021] [Revised: 09/02/2021] [Accepted: 09/18/2021] [Indexed: 12/18/2022]
Abstract
OBJECTIVE Hypothyroidism is a common endocrine disorder affecting various systems of the body. Only a few studies have focused on the effect of primary hypothyroidism on voice with objective parameters, and none of them compared the effects of subclinical and overt hypothyroidism on voice. The aim of the present study is to evaluate and compare the acoustic and perceptual parameters of voice in patients newly diagnosed with subclinical and overt hypothyroidism. METHODS The study included 26 patients with subclinical hypothyroidism, 26 patients with overt hypothyroidism, and 30 euthyroid control participants. Perceptual evaluation of voice with the GRBAS (grade, roughness, breathiness, asthenia, strain) scale, the voice handicap index (VHI)-10, and acoustic voice analysis using the Multi-Dimensional Voice Program were performed for all participants. RESULTS The voice parameters that showed a statistically significant difference between the groups were the frequency parameters (Fo, p = 0.003; Fhi, p = 0.010; Flo, p = 0.002) and VHI-10 (p = 0.047). A statistically significant decrease in frequency parameters and a statistically significant increase in VHI-10 were found in the overt hypothyroidism group compared with the control group (Fo, p = 0.002; Fhi, p = 0.009; Flo, p = 0.001; VHI-10, p = 0.046). Voice parameters in the subclinical hypothyroidism group did not differ significantly from those of either the control group or the overt hypothyroidism group. CONCLUSION In the present study, overt hypothyroidism was found to affect frequency parameters and patients' own subjective assessment of voice. Primary hypothyroidism does not seem to have a significant effect on voice parameters until thyroxine levels are affected.
33
Kappen M, Hoorelbeke K, Madhu N, Demuynck K, Vanderhasselt MA. Speech as an indicator for psychosocial stress: A network analytic approach. Behav Res Methods 2021. [PMID: 34357541 DOI: 10.3758/s13428-021-01670-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Accepted: 07/09/2021] [Indexed: 11/30/2022]
Abstract
Recently, the possibilities of detecting psychosocial stress from speech have been discussed. Yet, there are mixed effects and a current lack of clarity in relations and directions for parameters derived from stressed speech. The aim of the current study is – in a controlled psychosocial stress induction experiment – to apply network modeling to (1) look into the unique associations between specific speech parameters, comparing speech networks containing fundamental frequency (F0), jitter, mean voiced segment length, and Harmonics-to-Noise Ratio (HNR) pre- and post-stress induction, and (2) examine how changes pre- versus post-stress induction (i.e., change network) in each of the parameters are related to changes in self-reported negative affect. Results show that the network of speech parameters is similar after versus before the stress induction, with a central role of HNR, which shows that the complex interplay and unique associations between each of the used speech parameters is not impacted by psychosocial stress (aim 1). Moreover, we found a change network (consisting of pre-post stress difference values) with changes in jitter being positively related to changes in self-reported negative affect (aim 2). These findings illustrate – for the first time in a well-controlled but ecologically valid setting – the complex relations between different speech parameters in the context of psychosocial stress. Longitudinal and experimental studies are required to further investigate these relationships and to test whether the identified paths in the networks are indicative of causal relationships.
34
Choi W. Cantonese advantage on English stress perception: Constraints and neural underpinnings. Neuropsychologia 2021; 158:107888. [PMID: 33991562 DOI: 10.1016/j.neuropsychologia.2021.107888] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 12/31/2019] [Revised: 04/25/2021] [Accepted: 05/10/2021] [Indexed: 11/22/2022]
Abstract
A prevailing conception of cross-linguistic transfer is that first language experience poses perceptual interference, or at best null effect, on second language speech perception. Surprisingly, a recent study found that Cantonese listeners outperformed English listeners on English stress perception. The present study further evaluated whether segmental variations would constrain the Cantonese advantage on English stress perception. Cantonese and English listeners were tested with both active and passive oddball paradigms in which ERP responses to English stress deviations were elicited. Behaviorally, the Cantonese listeners exhibited a perceptual advantage relative to the English listeners, but this advantage disappeared upon the introduction of segmental variations. Neurophysiologically, segmental variations diminished the P3b amplitudes of the Cantonese but not the English listeners. Collectively, results suggest that segmental variations constrain the Cantonese advantage on English stress perception.
35
Wermke K, Cebulla M, Salinger V, Ross V, Wirbelauer J, Shehata-Dieler W. Cry features of healthy neonates who passed their newborn hearing screening vs. those who did not. Int J Pediatr Otorhinolaryngol 2021; 144:110689. [PMID: 33799102 DOI: 10.1016/j.ijporl.2021.110689] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 12/10/2020] [Revised: 02/09/2021] [Accepted: 03/18/2021] [Indexed: 10/21/2022]
Abstract
OBJECTIVES Temporal and fundamental frequency (fo) variations in infant cries provide critical insights into the maturity of vocal control and hearing performance. Earlier research has examined the use of vocalisation properties (in addition to hearing tests) to identify infants at risk of hearing impairment. The aim of this study was to determine whether such an approach could be suitable for neonates. METHODS To investigate this, we recruited 74 healthy neonates within their first week of life as our participants, assigning them to either a group that passed the ABR-based newborn hearing screening (NHS) (PG, N = 36) or a group that did not but was diagnosed as normally hearing at a follow-up check at 3 months of life, a so-called false-positive group (NPG, N = 36). Spontaneously uttered cries (N = 2330) were recorded and analysed quantitatively. The duration, minimum, maximum and mean fo, as well as two variability measures (fo range, fo sigma), were calculated for each cry utterance, averaged for individual neonates, and compared between the groups. RESULTS A multivariate analysis of variance (MANOVA) revealed no significant effects. This confirms that cry features reflecting vocal control do not differ between healthy neonates with normal hearing, irrespective of the outcome of their initial NHS. CONCLUSIONS Healthy neonates who do not pass the NHS but have normal hearing at follow-up (false-positive cases) have the same cry properties as those with normal hearing who do pass. This is an essential prerequisite to justify the research strategy of incorporating vocal analysis into NHS to complement ABR measures in identifying hearing-impaired newborns.
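The per-cry measures named in this abstract (duration, minimum, maximum and mean fo, fo range, fo sigma) and their averaging per neonate can be sketched as below. This is only an illustrative sketch, not the authors' pipeline: the function names, the frame-based fo track with 0 marking unvoiced frames, and the reading of "fo sigma" as a population standard deviation are all assumptions.

```python
from statistics import mean, pstdev


def cry_features(fo_track_hz, frame_step_s):
    """Summarise one cry utterance (hypothetical helper).

    fo_track_hz: fo estimate per analysis frame in Hz, 0 for unvoiced frames
                 (assumed representation; needs at least one voiced frame).
    frame_step_s: time between frames in seconds (assumed).
    """
    voiced = [f for f in fo_track_hz if f > 0]
    return {
        "duration_s": len(fo_track_hz) * frame_step_s,
        "fo_min": min(voiced),
        "fo_max": max(voiced),
        "fo_mean": mean(voiced),
        "fo_range": max(voiced) - min(voiced),
        # "fo sigma" is taken here to mean the standard deviation (assumption).
        "fo_sigma": pstdev(voiced),
    }


def neonate_average(cries):
    """Average each per-cry feature over all cries of one neonate."""
    return {key: mean(cry[key] for cry in cries) for key in cries[0]}


# Usage with made-up fo tracks (not data from the study):
example_cries = [
    cry_features([400, 0, 500, 450], frame_step_s=0.01),
    cry_features([300, 350], frame_step_s=0.01),
]
example_average = neonate_average(example_cries)
```

Averaging within each neonate first, as the abstract describes, gives one value per feature per infant, so infants who cried more often do not dominate the group comparison.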
Affiliation(s)
- Kathleen Wermke
  - Center for Pre-Speech Development and Developmental Disorders, Department of Orthodontics, University of Würzburg, Germany.
- Mario Cebulla
  - Comprehensive Hearing Center (CHC), Department of Otorhinolaryngology, Plastic, Aesthetic and Reconstructive Head and Neck Surgery, University Hospital, University of Würzburg, Germany
- Vivien Salinger
  - Center for Pre-Speech Development and Developmental Disorders, Department of Orthodontics, University of Würzburg, Germany
- Veronique Ross
  - Center for Pre-Speech Development and Developmental Disorders, Department of Orthodontics, University of Würzburg, Germany
- Johannes Wirbelauer
  - University Children's Hospital, Department of Neonatology, University of Würzburg, Germany
- Wafaa Shehata-Dieler
  - Comprehensive Hearing Center (CHC), Department of Otorhinolaryngology, Plastic, Aesthetic and Reconstructive Head and Neck Surgery, University Hospital, University of Würzburg, Germany
|
36
|
Brockmann-Bauser M, Van Stan JH, Carvalho Sampaio M, Bohlender JE, Hillman RE, Mehta DD. Effects of Vocal Intensity and Fundamental Frequency on Cepstral Peak Prominence in Patients with Voice Disorders and Vocally Healthy Controls. J Voice 2021; 35:411-417. [PMID: 31859213 PMCID: PMC7295673 DOI: 10.1016/j.jvoice.2019.11.015] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Received: 09/26/2019] [Revised: 11/21/2019] [Accepted: 11/21/2019] [Indexed: 11/22/2022]
Abstract
OBJECTIVE Cepstrum-based voice measures, such as smoothed cepstral peak prominence (CPPS), are influenced by voice sound pressure level (SPL) in vocally healthy adults. Since it is unclear if similar effects hold in voice disordered adults and how these interact with natural fundamental frequency (fo) changes, this study examines voice SPL and fo effects on CPPS in women with vocal hyperfunction and vocally healthy controls. STUDY DESIGN Retrospective matched case-control study. METHODS Fifty-eight women with vocal hyperfunction were individually matched with 58 vocally healthy women for occupation and approximate age. The patient group comprised women exhibiting phonotraumatic vocal hyperfunction associated with vocal fold nodules (n = 39) or polyps (n = 5), and nonphonotraumatic vocal hyperfunction associated with primary muscle tension dysphonia (n = 14). All participants sustained the vowel /a/ at soft, comfortable, and loud loudness conditions. Voice SPL, fo, and CPPS (dB) were computed from acoustic voice recordings using Praat. The effects of loudness condition, measured voice SPL, and fo on CPPS were assessed with linear mixed models. Pairwise correlations among voice SPL, fo, and CPPS were assessed using multiple regression analysis. RESULTS Increasing voice SPL correlated significantly (P < 0.001) with higher CPPS in both patient (r² = 0.53) and normative groups (r² = 0.45). fo had statistically significant effects on CPPS (P < 0.001), but with a weak relation for the patient (r² = 0.02) and control groups (r² = 0.05). CONCLUSIONS In women with and without voice disorder, CPPS is highly affected by the individual's voice SPL in vowel phonation. Future studies could investigate how these effects should be controlled for to improve the diagnostic value of acoustic-based cepstral measures.
Affiliation(s)
- Meike Brockmann-Bauser
  - Department of Phoniatrics and Speech Pathology, Clinic for Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, University of Zurich, Zurich, Switzerland; University of Zurich, Zurich, Switzerland.
- Jarrad H Van Stan
  - Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts; Department of Surgery, Harvard Medical School; MGH Institute of Health Professions, Boston, Massachusetts
- Marilia Carvalho Sampaio
  - Department of Phoniatrics and Speech Pathology, Clinic for Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, University of Zurich, Zurich, Switzerland; University of Zurich, Zurich, Switzerland; Federal University of Bahia, Institute of Health Sciences, Department of Speech, Language and Hearing Sciences, Salvador, Brazil
- Joerg E Bohlender
  - Department of Phoniatrics and Speech Pathology, Clinic for Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, University of Zurich, Zurich, Switzerland; University of Zurich, Zurich, Switzerland
- Robert E Hillman
  - Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts
- Daryush D Mehta
  - Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts
|
37
|
Mohammed AA, Nagy A. Fundamental Frequency and Jitter Percent in MDVP and PRAAT. J Voice 2021:S0892-1997(21)00107-7. [PMID: 33926765 DOI: 10.1016/j.jvoice.2021.03.008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 01/04/2021] [Revised: 02/12/2021] [Accepted: 03/01/2021] [Indexed: 11/26/2022]
Abstract
PURPOSE This study initially investigated the relationship between Fundamental Frequency and Jitter Percent across and within MDVP and PRAAT. Subsequently, it explored whether the length of the measured acoustic signal or the selection of the temporal segment for analysis affects the correlation between the tools' measures. METHODS We collected forty-two Maximum Phonation Time acoustic signals from 10 participants with healthy voices in a standardized setting. We excluded from enrollment any potential participants with a history of voice disorders or an abnormality in a pre-study assessment. RESULTS There is no correlation between Jitter Percent values and Fundamental Frequency within either tool in our healthy voice samples. The length of the acoustic signal and the temporal analysis selection affect the correlation between Jitter Percent measurements across the two tools; the correlation between Fundamental Frequency measurements across the devices was not affected. Means of Fundamental Frequency did not differ across the two devices but show a persistent pattern of greater values in MDVP. Jitter Percent measurements were significantly higher in MDVP. CONCLUSIONS There is a potential for clinicians using PRAAT assessments in the clinic to make inferences from research using MDVP as an analysis tool. Further work is needed in patients with voice disorders to explore that possibility.
Affiliation(s)
- Ahmed A Mohammed
  - Department of ENT, Ain Shams University; Assistant professor of Phoniatrics, Cairo, Egypt.
- Ahmed Nagy
  - Communicative Disorders and Sciences Department, University at Buffalo, Buffalo, NY, USA; Faculty of Medicine - Fayoum University, Fayoum, Egypt.
|
38
|
Sahin Onder S, Savran F, Karabulut B, Surmeli M, Cetemen A. Impact of Allergic Rhinitis on Voice in Children. ORL J Otorhinolaryngol Relat Spec 2021; 83:335-340. [PMID: 33652440 DOI: 10.1159/000514120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/12/2020] [Accepted: 12/22/2020] [Indexed: 11/19/2022]
Abstract
INTRODUCTION The purpose of the present study was to determine the possible effect of allergic rhinitis (AR) on the voice in children using acoustic analysis and the Turkish children's voice handicap index-10 (TR-CVHI-10). METHODS This is a case-control study. Forty-one children with AR and a positive skin prick test, as well as 39 control children who had a negative skin prick test and no history of allergic disease, were selected for the study. Each assessment included recordings for acoustic voice analysis (fundamental frequency [f0], jitter %, shimmer %, and harmonics-to-noise ratio [HNR]) and aerodynamic analysis (maximum phonation time [MPT] and s/z ratio). All participants completed the TR-CVHI-10. RESULTS The mean TR-CVHI-10 score of the AR group was significantly higher than that of the control group (p = 0.013). No difference was observed between the AR and control groups in terms of jitter, shimmer, HNR, and MPT values and s/z ratio (p > 0.05). In contrast, the f0 value was higher in controls (270.9 ± 60.3 Hz) than in the AR group (237.7 ± 54.3 Hz) (p = 0.012). CONCLUSION The study's results revealed that AR can have an effect on fundamental frequency and voice quality in children. The diagnostic process should include AR as a potential cause of voice disorders in children.
Affiliation(s)
- Serap Sahin Onder
  - Department of Otolaryngology, University of Health Sciences Umraniye Research and Education Hospital, Istanbul, Turkey
- Fatih Savran
  - Department of Otolaryngology, University of Health Sciences Umraniye Research and Education Hospital, Istanbul, Turkey
- Burak Karabulut
  - Department of Otolaryngology, University of Health Sciences, Kartal Research and Education Hospital, Istanbul, Turkey
- Mehmet Surmeli
  - Department of Otolaryngology, University of Health Sciences Umraniye Research and Education Hospital, Istanbul, Turkey
- Aysen Cetemen
  - Department of Pediatric Allergy, University of Health Sciences Umraniye Research and Education Hospital, Istanbul, Turkey
|
39
|
Anand S, Gutierrez D, Bottalico P. Acoustic-Perceptual Correlates of Voice Among Steam Train Engineers: Effects of Noise and Hearing Protection. J Voice 2021; 37:366-373. [PMID: 33637357 DOI: 10.1016/j.jvoice.2021.01.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/13/2020] [Revised: 12/31/2020] [Accepted: 01/07/2021] [Indexed: 10/22/2022]
Abstract
Occupational voice users are at a higher risk of developing voice disorders due to their vocal demands, such as prolonged periods of work-related voice use, and nonideal environmental factors, such as speaking above background noise. The current study focused on the effects of background noise and hearing protection on acoustic-perceptual correlates of voice among steam train engineers. Fourteen participants phonated the vowel /a/, read a phrase, and described a map under different noise and hearing protection conditions. Relative sound pressure level, relative fundamental frequency, and perceived vocal effort and disturbance decreased in the presence of hearing protection for all noise conditions. In contrast, these acoustic measures increased in the absence of hearing protection, supporting the Lombard effect. Overall, the results of the current study provide insight into possible risks to vocal health in workers who are exposed to high levels of background noise and use hearing protection.
Affiliation(s)
- Supraja Anand
  - Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida.
- Desi Gutierrez
  - Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida
- Pasquale Bottalico
  - Department of Speech and Hearing Science, University of Illinois Urbana-Champaign, Champaign, Illinois
|
40
|
Gorina-Careta N, Kurkela JLO, Hämäläinen J, Astikainen P, Escera C. Neural generators of the frequency-following response elicited to stimuli of low and high frequency: A magnetoencephalographic (MEG) study. Neuroimage 2021; 231:117866. [PMID: 33592244 DOI: 10.1016/j.neuroimage.2021.117866] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Received: 07/07/2020] [Revised: 02/08/2021] [Accepted: 02/09/2021] [Indexed: 01/03/2023]
Abstract
The frequency-following response (FFR) to periodic complex sounds has gained recent interest in auditory cognitive neuroscience as it captures with great fidelity the tracking accuracy of the periodic sound features in the ascending auditory system. Seminal studies suggested the FFR as a correlate of subcortical sound encoding, yet recent studies aiming to locate its sources challenged this assumption, demonstrating that FFR receives some contribution from the auditory cortex. Based on frequency-specific phase-locking capabilities along the auditory hierarchy, we hypothesized that FFRs to higher frequencies would receive less cortical contribution than those to lower frequencies, hence supporting a major subcortical involvement for these high frequency sounds. Here, we used a magnetoencephalographic (MEG) approach to trace the neural sources of the FFR elicited in healthy adults (N = 19) to low (89 Hz) and high (333 Hz) frequency sounds. FFRs elicited to the high and low frequency sounds were clearly observable on MEG and comparable to those obtained in simultaneous electroencephalographic recordings. Distributed source modeling analyses revealed midbrain, thalamic, and cortical contributions to FFR, arranged in frequency-specific configurations. Our results showed that the main contribution to the high-frequency sound FFR originated in the inferior colliculus and the medial geniculate body of the thalamus, with no significant cortical contribution. In contrast, the low-frequency sound FFR had a major contribution located in the auditory cortices, and also received contributions originating in the midbrain and thalamic structures. These findings support the multiple generator hypothesis of the FFR and are relevant for our understanding of the neural encoding of sounds along the auditory hierarchy, suggesting a hierarchical organization of periodicity encoding.
|
41
|
Herbst CT, Nishimura T, Garcia M, Migimatsu K, Tokuda IT. Effect of Ventricular Folds on Vocalization Fundamental Frequency in Domestic Pigs (Sus scrofa domesticus). J Voice 2020; 35:805.e1-805.e15. [PMID: 33388229 DOI: 10.1016/j.jvoice.2020.01.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/04/2019] [Revised: 01/10/2020] [Accepted: 01/16/2020] [Indexed: 10/22/2022]
Abstract
This study investigates the effect of the ventricular folds on fundamental frequency (fo) in the voice production of domestic pigs (Sus scrofa domesticus). The excised larynges of six subadult pigs were phonated in two preparation stages, with the ventricular folds present (PS1) and removed (PS2). Vocal fold resonances were tested with a laser vibrometer, and a four-mass computational model was created. Highly significant fo differences were found between PS1 and PS2 (means at 93.7 and 409.3 Hz, respectively). Two tissue resonances were found at 115 Hz and 250-290 Hz. The computational model had unique solutions for abducted and adducted ventricular folds at about 150 and 400 Hz, roughly matching the fo measured ex vivo for PS1 and PS2. The differing fo encountered across preparation stages PS1 and PS2 is explained by distinct activation of either a high or a low eigenfrequency mode, depending on the engagement of the ventricular folds. The inability of the investigated larynges to vibrate at frequencies below 250 Hz in PS2 suggests that in vivo low-frequency calls of domestic pigs (pre-eminently grunts) are likely produced with engaged ventricular folds. Allometric comparison suggests that the special, mechanically coupled "double oscillator" has evolved to prevent signaling disadvantages. Given these traits, the porcine larynx might - apart from special applications relating to the involvement of ventricular folds - not be an ideal candidate for emulating human voice production in excised larynx experimentation.
Affiliation(s)
- Christian T Herbst
  - Antonio Salieri Department of Vocal Studies and Vocal Research in Music Education, University of Music and Performing Arts Vienna, Vienna, Austria.
- Maxime Garcia
  - ENES Lab, Université Lyon/Saint-Etienne, Neuro-PSI, CNRS UMR 9197, Saint-Etienne, France; Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
- Kishin Migimatsu
  - Department of Mechanical Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan
- Isao T Tokuda
  - Department of Mechanical Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan
|
42
|
Pawelec ŁP, Graja K, Lipowicz A. Vocal Indicators of Size, Shape and Body Composition in Polish Men. J Voice 2020; 36:878.e9-878.e22. [PMID: 33069508 DOI: 10.1016/j.jvoice.2020.09.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 07/11/2020] [Revised: 09/13/2020] [Accepted: 09/14/2020] [Indexed: 11/25/2022]
Abstract
OBJECTIVES From a human evolution perspective, identifying a link between physique and vocal quality could demonstrate dual signaling in terms of the health and biological condition of an individual. In this regard, this study investigates the relationship between men's body size, shape, and composition, and their vocal characteristics. MATERIALS AND METHODS Eleven anthropometric measurements, using seven indices, were carried out with 80 adult Polish male participants, while the speech analysis adopted a voice recording procedure that involved recording the vowels /ɑː/, /ɛː/, /iː/, /ɔː/, /uː/ to define the voice acoustic components, which were analyzed in Praat software. RESULTS A relationship between voice parameters and body size/shape/composition was found. The analysis indicated that the formants and their derivatives were useful parameters for prediction of height, weight, and neck, shoulder, waist, and hip circumferences. Fundamental frequency (F0) was negatively correlated with neck circumference at Adam's apple level and with body height. Moreover, an association between neck circumference and F0 was observed for the first time in this paper. The association between waist circumference and formant component showed a net effect. In addition, the formant parameters showed significant correlations with body shape, indicating a lower vocal timbre in men with a larger relative waist circumference. DISCUSSION Men with lower vocal pitch had wider necks, probably a result of larynx size. Furthermore, a greater waist circumference, presumably resulting from abdominal fat distribution in men, correlated with a lower vocal timbre. While these results are inconclusive, they highlight new directions for further research.
Affiliation(s)
- Łukasz Piotr Pawelec
  - Department of Anthropology, Wroclaw University of Environmental and Life Sciences, Wroclaw, Poland.
- Katarzyna Graja
  - Department of Anthropology, Wroclaw University of Environmental and Life Sciences, Wroclaw, Poland
- Anna Lipowicz
  - Department of Anthropology, Wroclaw University of Environmental and Life Sciences, Wroclaw, Poland; Institute of Physiotherapy and Health Sciences, Academy of Physical Education, Katowice, Poland
|
43
|
Baur J, Krohmer K, Naumann E, Tuschen-Caffier B, Svaldi J. Vocal arousal: a physiological correlate of body distress in women with overweight and obesity. Eat Weight Disord 2020; 25:1161-1169. [PMID: 31338792 DOI: 10.1007/s40519-019-00744-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 02/22/2019] [Accepted: 07/01/2019] [Indexed: 11/26/2022]
Abstract
PURPOSE Numerous studies highlight the relevance of body image in the development and maintenance of overweight and obesity mostly using self-reported data. Given the importance of physiological assessment methods, the present study aimed at investigating vocally encoded emotional arousal as a correlate of body distress in women with overweight and obesity. METHODS Cognitions of women with overweight and obesity (OW; n = 22) as well as normal weight controls (NW; n = 22) were assessed by means of a thought-sampling procedure during a mirror exposure and a control condition. Fundamental frequency (f0) as a marker of vocally encoded emotional arousal as well as verbalized body-related cognitions were analyzed during this experimental task. RESULTS A stronger increase in f0 between the control and the mirror exposure condition was found in OW compared to NW. Furthermore, there were significant positive correlations between vocally encoded emotional arousal and various measures of body image. CONCLUSION The findings support the utility of vocally encoded emotional arousal as an objective physiological correlate of the evaluative dimension of body image in women with overweight and obesity. LEVEL OF EVIDENCE Level I, experimental study.
Affiliation(s)
- Julia Baur
  - Department of Clinical Psychology and Psychotherapy, University of Tuebingen, Schleichstraße 4, 72076, Tübingen, Germany
- Kerstin Krohmer
  - Department of Clinical Psychology and Psychotherapy, University of Tuebingen, Schleichstraße 4, 72076, Tübingen, Germany
- Eva Naumann
  - Department of Clinical Psychology and Psychotherapy, University of Tuebingen, Schleichstraße 4, 72076, Tübingen, Germany
- Brunna Tuschen-Caffier
  - Department of Clinical Psychology and Psychotherapy, University of Freiburg, Engelbergerstraße 41, 79106, Freiburg, Germany
- Jennifer Svaldi
  - Department of Clinical Psychology and Psychotherapy, University of Tuebingen, Schleichstraße 4, 72076, Tübingen, Germany.
|
44
|
Wieder G, Fischer MS, Einsle F, Baucom DH, Hahlweg K, Wittchen HU, Weusthoff S. Fundamental frequency during cognitive preparation and its impact on therapy outcome for panic disorder with Agoraphobia. Behav Res Ther 2020; 135:103728. [PMID: 32987282 DOI: 10.1016/j.brat.2020.103728] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/18/2016] [Revised: 08/04/2020] [Accepted: 09/14/2020] [Indexed: 01/31/2023]
Abstract
BACKGROUND Cognitive preparation plays a crucial role in CBT with exposure for panic disorder and agoraphobia. High emotional arousal while developing the exposure rationale might impair patients' cognitive capacities for processing information about treatment and impede therapeutic outcome. OBJECTIVE This study investigates whether patients' vocally encoded emotional arousal, assessed by fundamental frequency (f0), during rationale development is associated with premature treatment dropout, insight into the rationale, and symptom reduction. METHODS Patients' (N = 197, mean age 36.1 years, 79.2% female) f0 during rationale development was measured based on treatment videos from a randomized controlled trial of CBT for panic disorder and agoraphobia. Insight was rater assessed. Symptom severity was self- and rater assessed at the beginning and end of therapy. RESULTS Higher f0 mean during rationale development was associated with lower probability of insight and less reduction in avoidance behavior. f0 was not associated with dropout. Insight was associated with lower probability of dropout and partially mediated the association between f0 and avoidance reduction. DISCUSSION This study highlights the importance of emotional arousal during cognitive preparation for exposure. Therapists should ensure that patients are not too highly aroused while learning about the exposure rationale as an important step in treatment.
|
45
|
Melton J, Bradford Z, Lee J. Acoustic Characteristics of Vocal Sounds Used by Professional Actors Performing Classical Material Without Microphones in Outdoor Theatre. J Voice 2020; 36:733.e23-733.e29. [PMID: 33008725 DOI: 10.1016/j.jvoice.2020.08.036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/20/2020] [Revised: 08/25/2020] [Accepted: 08/26/2020] [Indexed: 11/19/2022]
Abstract
OBJECTIVE Theatre actors use voice in virtually any physical position, moving or still, and perform in a wide range of venues. The present study investigated acoustic qualities required to perform classical material without electronic amplification in outdoor spaces. DESIGN Eight professional actors, four female and four male, from NY Classical Theatre performed one-minute monologues, first stationary, then moving, for audio recording in Central Park. Four subjects recorded two monologues each, from productions in which they played both male and female characters. Data were analyzed for fundamental frequency (F0), sound pressure level (SPL), and long-term average spectrum (LTAS). RESULTS Overall, F0 ranged between 75.38 and 530.33 Hz. Average F0 was 326 Hz stationary and 335.78 Hz moving for females, and 248.54 Hz stationary and 252.82 Hz moving for males. SPL ranged from 28.54 to 110.51 dB for females, and 56.69 to 124.44 dB for males. Average SPL was 82 dB for females and 96.98 dB for males. On LTAS, females had a peak between 3 and 4 kHz ranging from 1.5 to 4.5 dB and another between 4 and 5 kHz ranging from 2 to 4.5 dB, while males had a peak between 3 and 4 kHz ranging from 1 to 8.5 dB. CONCLUSION Actors appear to use a similar F0 range across gender and performing conditions. Average F0 increased from stationary to moving. Males had greater SPL values than females, and the amplitude of peaks in the region of the Actor's Formant of LTAS curves was higher in male than female voices.
Affiliation(s)
- Joan Melton
  - California State University Fullerton, 253 W 72nd Street, # 902, New York, NY 10023.
- Zachary Bradford
  - New York University, 7 Davenport Parade, North Lakes, QLD, 4509 Australia
- Jessica Lee
  - New York University, 35 W 33rd Street, # 26C, New York, NY 10001
|
46
|
Vedurmudi AP, Young BA, van Hemmen JL. Active tympanic tuning facilitates sound localization in animals with internally coupled ears. Hear Res 2020; 387:107861. [PMID: 31911335 DOI: 10.1016/j.heares.2019.107861] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 07/19/2019] [Revised: 11/12/2019] [Accepted: 12/02/2019] [Indexed: 10/25/2022]
Abstract
Earlier studies have reported that numerous vertebrate taxa have skeletal muscle(s) attaching directly, or indirectly, onto the tympanic membrane. The present study links these prior studies by quantitatively modeling the influence of skeletal muscle contraction on tympanic tension, tympanic dampening, and, ultimately, the fundamental frequency. In this way, the efficacy of these tympanic muscles to dynamically alter the sensory response of the vertebrate ear is quantified. Changing the tension modifies the eardrum's fundamental frequency, a key notion in understanding hearing through internally coupled ears (ICE) as used by the majority of terrestrial vertebrates. Tympanic tension can also be modulated by altering the pressure acting on the deep (medial) surface of the tympanum. Herein we use the monitor lizard Varanus as an example to demonstrate how active modulation of the pharyngeal volume permits tuning of an ICE auditory system. The present contribution offers a behaviorally and biologically realistic perspective on the ICE system, by demonstrating how an organism can dynamically alter its morphology to tune the auditory response. Through quantification of the relationships between tympanic surface tension, damping, membrane fundamental frequency, and auditory cavity volume, it can be shown that an ICE system affords a biologically relevant range of tuning.
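The statement that changing tension shifts the eardrum's fundamental frequency can be illustrated with the textbook formula for an ideal, undamped circular membrane clamped at its rim, f = (j01 / (2πa)) · sqrt(T / σ). This is a simplified sketch only: the paper's own model additionally covers damping and coupling to the auditory cavity, and the function name and numerical values below are hypothetical, not taken from the study.

```python
import math

# First zero of the Bessel function J0; sets the lowest mode of a clamped
# circular membrane.
J01 = 2.404825557695773


def membrane_fundamental_hz(tension_n_per_m, surface_density_kg_per_m2, radius_m):
    """Fundamental frequency of an ideal clamped circular membrane (no damping).

    tension_n_per_m: membrane tension per unit length, T.
    surface_density_kg_per_m2: mass per unit area, sigma.
    radius_m: membrane radius, a.
    """
    wave_speed = math.sqrt(tension_n_per_m / surface_density_kg_per_m2)
    return J01 * wave_speed / (2 * math.pi * radius_m)


# Illustrative values only (assumed, not measurements from the paper):
relaxed = membrane_fundamental_hz(1.0, 0.01, 0.003)
tensed = membrane_fundamental_hz(4.0, 0.01, 0.003)
```

Because the frequency scales with the square root of tension, quadrupling the tension in this idealised model doubles the fundamental frequency.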
Affiliation(s)
- Anupam P Vedurmudi
  - Forschungs-Neutronenquelle Heinz Maier-Leibnitz, Technische Universität München, 85748, Garching bei München, Germany.
- Bruce A Young
  - Kirksville College of Osteopathic Medicine, A.T. Still University, Kirksville, MO, 63501, USA.
- J Leo van Hemmen
  - Physik Department, Technische Universität München, 85747, Garching bei München, Germany.
|
47
|
Philip AS, Rajashekhar B, Shekaraiah S, N R. Analysis of Phonatory and Respiratory Functions in Physical Fitness Instructors in Comparison to Controls. J Voice 2020; 35:746-752. [PMID: 32057610 DOI: 10.1016/j.jvoice.2020.01.016] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 07/01/2019] [Revised: 01/16/2020] [Accepted: 01/17/2020] [Indexed: 10/25/2022]
Abstract
OBJECTIVE Professional voice users form an increasing part of our population. Physical fitness instructors, who are counted among professional voice users, are at risk for voice problems as they perform vigorous exercises, instruct at the same time, and compete vocally with loud background music. Hence, there is a pressing need to study voice problems in this population. The current study aimed to investigate the phonatory and respiratory functions in physical fitness instructors in comparison to a control group. STUDY DESIGN Cross-sectional study. MATERIALS AND METHODS Fifty physical fitness instructors (48 males and two females) and 50 controls (45 males and five females) aged 18-40 years were initially enrolled in the study. The phonatory function measures (perceptual, physiological, and acoustic) and respiratory function measures (aerodynamics) were obtained. RESULTS In the analysis, which was restricted to male participants, the majority of the perceptual parameters, such as grade, roughness, breathiness, and strain, showed a significant increase in the clinical group compared to controls (P < 0.05). Similarly, a few acoustic parameters related to fundamental frequency (F0), such as average F0, mean F0, highest fundamental frequency (Fhi), and lowest fundamental frequency (Flo), showed a significant decrease in the clinical group compared to controls (P < 0.05). CONCLUSIONS A few perceptual and acoustic parameters are sensitive in differentiating the voices of physical fitness instructors from those of controls. The significant perceptual and acoustic parameters implied that the instructors are at risk for developing voice problems. Hence, early identification and prevention of voice problems in this population is warranted.
Affiliation(s)
- Anju Susan Philip
- Department of Speech and Hearing, Manipal College of Health Professions (MCHP), Manipal Academy of Higher Education (MAHE), Manipal, Karnataka, India
- Bellur Rajashekhar
- Department of Speech and Hearing, Manipal College of Health Professions (MCHP), Manipal Academy of Higher Education (MAHE), Manipal, Karnataka, India
- Sheela Shekaraiah
- Department of Speech and Hearing, Manipal College of Health Professions (MCHP), Manipal Academy of Higher Education (MAHE), Manipal, Karnataka, India
- Ravishankar N
- Department of Statistics, MAHE, Manipal, Karnataka, India
|
48
|
Cartei V, Banerjee R, Garnham A, Oakhill J, Roberts L, Anns S, Bond R, Reby D. Physiological and perceptual correlates of masculinity in children's voices. Horm Behav 2020; 117:104616. [PMID: 31644889 DOI: 10.1016/j.yhbeh.2019.104616] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 07/15/2019] [Revised: 10/06/2019] [Accepted: 10/12/2019] [Indexed: 11/19/2022]
Abstract
Low frequency components (i.e. a low pitch (F0) and low formant spacing (ΔF)) signal high salivary testosterone and height in adult male voices and are associated with high masculinity attributions by unfamiliar listeners (in both men and women). However, the relation between the physiological, acoustic and perceptual dimensions of speakers' masculinity prior to puberty remains unknown. In this study, 110 pre-pubertal children (58 girls), aged 3 to 10, were recorded as they described a cartoon picture. 315 adults (182 women) rated children's perceived masculinity from the voice only after listening to the speakers' audio recordings. On the basis of their voices alone, boys who had higher salivary testosterone levels were rated as more masculine, and the relation between testosterone and perceived masculinity was partially mediated by F0. The voices of taller boys were also rated as more masculine, but the relation between height and perceived masculinity was not mediated by the considered acoustic parameters, indicating that acoustic cues other than F0 and ΔF may signal stature. Both boys and girls who had lower F0 were also rated as more masculine, while ΔF did not affect ratings. These findings highlight the interdependence of physiological, acoustic and perceptual dimensions, and suggest that inter-individual variation in male voices, particularly F0, may advertise hormonal masculinity from a very early age.
Affiliation(s)
- Robin Banerjee
- School of Psychology, University of Sussex, Brighton, UK
- Alan Garnham
- School of Psychology, University of Sussex, Brighton, UK
- Jane Oakhill
- Equipe Neuro-Ethologie Sensorielle, ENES/CRNL, CNRS UMR5292, INSERM UMR_S 1028, University of Lyon, Saint-Etienne, France
- Lucy Roberts
- School of Psychology, University of Sussex, Brighton, UK
- Sophie Anns
- School of Psychology, University of Sussex, Brighton, UK
- Rod Bond
- School of Psychology, University of Sussex, Brighton, UK
- David Reby
- School of Psychology, University of Sussex, Brighton, UK; Equipe Neuro-Ethologie Sensorielle, ENES/CRNL, CNRS UMR5292, INSERM UMR_S 1028, University of Lyon, Saint-Etienne, France
|
49
|
Rosenthal MA. A systematic review of the voice-tagging hypothesis of speech-in-noise perception. Neuropsychologia 2019; 136:107256. [PMID: 31715197 DOI: 10.1016/j.neuropsychologia.2019.107256] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 08/02/2019] [Revised: 11/03/2019] [Accepted: 11/06/2019] [Indexed: 01/05/2023]
Abstract
The voice-tagging hypothesis claims that individuals who better represent pitch information in a speaker's voice, as measured with the frequency following response (FFR), will be better at speech-in-noise perception. The hypothesis has been provided to explain how music training might improve speech-in-noise perception. This paper reviews studies that are relevant to the voice-tagging hypothesis, including studies on musicians and nonmusicians. Most studies on musicians show greater f0 amplitude compared to controls. Most studies on nonmusicians do not show group differences in f0 amplitude. Across all studies reviewed, f0 amplitude does not consistently predict accuracy in speech-in-noise perception. The evidence suggests that music training does not improve speech-in-noise perception via enhanced subcortical representation of the f0.
Affiliation(s)
- Matthew A Rosenthal
- Department of Psychology, University of Kansas, 1450 Jayhawk Blvd, Lawrence, KS 66045, United States.
|
50
|
Ladich F, Maiditsch IP. Temperature affects sound production in fish with two sets of sonic organs: The Pictus cat. Comp Biochem Physiol A Mol Integr Physiol 2019; 240:110589. [PMID: 31648065 DOI: 10.1016/j.cbpa.2019.110589] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 08/29/2019] [Revised: 10/09/2019] [Accepted: 10/10/2019] [Indexed: 11/18/2022]
Abstract
Sound communication is affected by ambient temperature in ectothermic animals including fishes. The present study examines the effects of temperature on acoustic signaling in a fish species possessing two different sound-generating mechanisms. The Amazonian Pictus catfish Pimelodus pictus produces low-frequency harmonic sounds (swimbladder drumming muscles) and high-frequency stridulation sounds (rubbing pectoral fin spines in the pectoral girdle). Sounds of 15 juveniles were recorded when hand-held after three weeks of acclimation at 30 °C, 22 °C and again 30 °C. The following sound characteristics were investigated: calling activity, sound duration, fundamental frequency of drumming sounds and dominant frequency of stridulation sounds. The number of both sound types produced within the first minute of experiments did not change with temperature. In contrast, sound duration was significantly shorter at 30 °C than at 22 °C (drumming: 78-560 ms; stridulation: 23-96 ms). The fundamental frequency of drumming sounds and thus the drumming muscle contraction rate varied from 127 Hz to 242 Hz and increased with temperature. The dominant frequency of broadband stridulation sounds ranged from 1.67 kHz to 3.39 kHz and was unaffected by temperature changes. Our data demonstrate that temperature affects acoustic signaling in P. pictus, although the changes differed between sound characteristics and sound type. The effects vary from no change in calling activity and dominant frequency, to an increase in fundamental frequency and shortened duration of both sound types. Together with the known effects of temperature on hearing in the Pictus cat, the present results indicate that global warming may affect acoustic communication in fishes.
Affiliation(s)
- Friedrich Ladich
- Department of Behavioural Biology, University of Vienna, Althanstraße 14, 1090 Wien, Austria.
- Isabelle Pia Maiditsch
- Department of Behavioural Biology, University of Vienna, Althanstraße 14, 1090 Wien, Austria.
|