1
|
Bromham L, Yaxley KJ, Cardillo M. Islands are engines of language diversity. Nat Ecol Evol 2024:10.1038/s41559-024-02488-4. [PMID: 39237760 DOI: 10.1038/s41559-024-02488-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 07/01/2024] [Indexed: 09/07/2024]
Abstract
Islands have played a prominent role in evolutionary and ecological theory, centring the theoretical framework for understanding biodiversity in terms of isolation and area and providing 'laboratories' of evolutionary change and adaptive radiation. However, a similar role for islands in understanding global language diversity has not been established, even though one-sixth of the world's languages are spoken on islands which account for <1% of the inhabited land area. The striking diversity of island languages remains largely unexplained. We construct a global database which reveals that 10% of the world's languages are endemic to islands (landmasses <11,000 km2) and we test several key theories of language evolution and diversity. We show that language diversity on islands increases with area but does not show a steady decrease with isolation, nor are island languages at elevated risk of loss. However, number of endemic languages per island increases with both area and isolation. We demonstrate that islands shape language evolution, with fewer phonemes (distinct sounds) in island endemic languages with increasing isolation. Our results suggest that islands generate language diversity by accelerating both language change and diversification.
Collapse
Affiliation(s)
- Lindell Bromham
- Macroevolution and Macroecology, Research School of Biology, Australian National University, Canberra, Australian Capital Territory, Australia.
| | - Keaghan J Yaxley
- Macroevolution and Macroecology, Research School of Biology, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Marcel Cardillo
- Macroevolution and Macroecology, Research School of Biology, Australian National University, Canberra, Australian Capital Territory, Australia
| |
Collapse
|
2
|
Shcherbakova O, Blasi DE, Gast V, Skirgård H, Gray RD, Greenhill SJ. The evolutionary dynamics of how languages signal who does what to whom. Sci Rep 2024; 14:7259. [PMID: 38538665 PMCID: PMC10973346 DOI: 10.1038/s41598-024-51542-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 01/06/2024] [Indexed: 08/02/2024] Open
Abstract
Languages vary in how they signal "who does what to whom". Three main strategies to indicate the participant roles of "who" and "whom" are case, verbal indexing, and rigid word order. Languages that disambiguate these roles with case tend to have either verb-final or flexible word order. Most previous studies that found these patterns used limited language samples and overlooked the causal mechanisms that could jointly explain the association between all three features. Here we analyze grammatical data from a Grambank sample of 1705 languages with phylogenetic causal graph methods. Our results corroborate the claims that verb-final word order generally gives rise to case and, strikingly, establish that case tends to lead to the development of flexible word order. The combination of novel statistical methods and the Grambank database provides a model for the rigorous testing of causal claims about the factors that shape patterns of linguistic diversity.
Collapse
Affiliation(s)
- Olena Shcherbakova
- Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, 04103, Leipzig, Germany.
| | - Damián E Blasi
- Catalan Institute for Research and Advanced Studies (ICREA), Barcelona, 08010, Spain
- Center for Brain and Cognition, Pompeu Fabra University, Barcelona, 08018, Spain
| | - Volker Gast
- Department of English and American Studies, Friedrich-Schiller University of Jena, 07745, Jena, Germany
| | - Hedvig Skirgård
- Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, 04103, Leipzig, Germany
| | - Russell D Gray
- Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, 04103, Leipzig, Germany
- School of Psychology, University of Auckland, 1010, Auckland, New Zealand
| | - Simon J Greenhill
- Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, 04103, Leipzig, Germany
- School of Biological Sciences, University of Auckland, 1010, Auckland, New Zealand
| |
Collapse
|
3
|
Aceves P, Evans JA. Human languages with greater information density have higher communication speed but lower conversation breadth. Nat Hum Behav 2024:10.1038/s41562-024-01815-w. [PMID: 38366103 DOI: 10.1038/s41562-024-01815-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Accepted: 01/03/2024] [Indexed: 02/18/2024]
Abstract
Human languages vary widely in how they encode information within circumscribed semantic domains (for example, time, space, colour, human body parts and activities), but little is known about the global structure of semantic information and nothing about its relation to human communication. We first show that across a sample of ~1,000 languages, there is broad variation in how densely languages encode information into words. Second, we show that this language information density is associated with a denser configuration of semantic information. Finally, we trace the relationship between language information density and patterns of communication, showing that informationally denser languages tend towards faster communication but conceptually narrower conversations or expositions within which topics are discussed at greater depth. These results highlight an important source of variation across the human communicative channel, revealing that the structure of language shapes the nature and texture of human engagement, with consequences for human behaviour across levels of society.
Collapse
Affiliation(s)
- Pedro Aceves
- Department of Management and Organization, Carey Business School, Johns Hopkins University, Baltimore, MD, USA.
| | - James A Evans
- Department of Sociology & Knowledge Lab, University of Chicago, Chicago, IL, USA
- Santa Fe Institute, Santa Fe, NM, USA
| |
Collapse
|
4
|
Blum F, Barrientos C, Ingunza A, Blasi DE, Zariquiey R. Grammars Across Time Analyzed (GATA): a dataset of 52 languages. Sci Data 2023; 10:835. [PMID: 38017079 PMCID: PMC10684564 DOI: 10.1038/s41597-023-02659-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 10/18/2023] [Indexed: 11/30/2023] Open
Abstract
Grammars Across Time Analyzed (GATA) is a resource capturing two snapshots of the grammatical structure of a diverse range of languages separated in time, aimed at furthering research on historical linguistics, language evolution, and cultural change. GATA comprises grammatical information on 52 diverse languages across all continents, featuring morphological, syntactic, and phonological information based on published grammars of the same language at two different time points. Here we introduce the coding scheme and design features of GATA, and we describe some salient patterns related to language change and the coverage of grammatical descriptions over time.
Collapse
Affiliation(s)
- Frederic Blum
- Department for Linguistic and Cultural Evolution, Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Carlos Barrientos
- Department for Linguistic and Cultural Evolution, Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Institut für Linguistik, Universität Leipzig, Leipzig, Germany
| | | | - Damián E Blasi
- Department for Linguistic and Cultural Evolution, Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany.
- Department of Human Evolutionary Biology, Harvard University, Cambridge, USA.
- Harvard Data Science Initiative, Harvard University, Cambridge, USA.
- Center for Brain and Cognition, Pompeu Fabra University, Barcelona, Spain.
- Catalan Institute for Research and Advanced Studies (ICREA), Barcelona, Spain.
| | | |
Collapse
|
5
|
Thorp HH. Editorial expression of concern. SCIENCE ADVANCES 2023; 9:eadm8238. [PMID: 37939184 PMCID: PMC10846899 DOI: 10.1126/sciadv.adm8238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2023]
|
6
|
Koplenig A, Wolfer S. Languages with more speakers tend to be harder to (machine-)learn. Sci Rep 2023; 13:18521. [PMID: 37898699 PMCID: PMC10613286 DOI: 10.1038/s41598-023-45373-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 10/18/2023] [Indexed: 10/30/2023] Open
Abstract
Computational language models (LMs), most notably exemplified by the widespread success of OpenAI's ChatGPT chatbot, show impressive performance on a wide range of linguistic tasks, thus providing cognitive science and linguistics with a computational working model to empirically study different aspects of human language. Here, we use LMs to test the hypothesis that languages with more speakers tend to be easier to learn. In two experiments, we train several LMs-ranging from very simple n-gram models to state-of-the-art deep neural networks-on written cross-linguistic corpus data covering 1293 different languages and statistically estimate learning difficulty. Using a variety of quantitative methods and machine learning techniques to account for phylogenetic relatedness and geographical proximity of languages, we show that there is robust evidence for a relationship between learning difficulty and speaker population size. However, contrary to expectations derived from previous research, our results suggest that languages with more speakers tend to be harder to learn.
Collapse
Affiliation(s)
| | - Sascha Wolfer
- Leibniz Institute for the German Language (IDS), Mannheim, Germany
| |
Collapse
|