Periodic Reporting for period 1 - CAUSALITY (Modeling causes of language change and conservatism)
Période du rapport: 2022-09-01 au 2025-02-28
This investigation involves the retrieval of actual historical patterns of change from annotated databases, or treebanks, of Dutch, English, and Low German, as well as modeling, using Game Theory, the pragmatic reasoning the speakers engage in when evaluating their communicative choices. Modeling such reasoning is, by hypothesis, the key to uncovering causal relations between various grammatical changes. It is the communicative pressure that ensures that a decline of one feature (such as the informativeness of verbal endings) is "compensated" by an increase of an alternative feature (such as the frequency of pronominal subjects). By comparing the results of the game-theoretic simulations with the data retrieved from historical treebanks we can validate the communicative assumptions the models are based on. Finally, game-theoretic models will be incorporated into simulations involving multiple “generations” of artificial agents communicating with each other, to simulate a timeline and to pitch communicative efficiency against factors of group conservatism.
An important sub-project of CAUSALITY is creating a large treebank of historical Dutch. This will enable deeper insights into the evolution of Dutch and the way it gave rise to the modern dialectal landscape.
Language speakers make use of various grammatical signals to relate what they are saying to the background information. Since such signalling strategies can change with time, we did a quantitative comparison based on historical corpora of English, Middle Low and High German and Icelandic, focusing on the evolution of the "prefield", a preverbal position hypothesized to host elements relating the interpretation to what has just been said. Even though Icelandic, in line with the traditional view, comes out as the most conservative system, there too do we find statistically significant shifts across time in the same direction as the other languages in the study.
An important subgroup of elements relating the meaning to the background information are determiners. We used statistical methods to examine their use in historical treebanks of English, coming to a novel conclusion that while the morphophonological profile of the determiner system has significantly changed over time, the semantic profile remained largely intact. This finding reveals a greater than previously assumed semantic stability of determiner systems.
An often discussed but so far unproven cause of word order changes are changes in the systems of morphological marking of the syntactic roles of verbal arguments, that is, changes in case marking. We performed the first large-scale quantitative investigation of the decline of morphological case marking in the documented history of English and French, as well as changes in the direct object placement in the two languages. We showed that the two processes are strongly correlated in both cases. Moreover, the empirical frequencies correlate with the frequencies obtained from game-theoretic simulations which are based on the assumption that morphological case and argument placement are two strategies of disambiguating syntactic structure and that historical phonological changes make the case marking option an increasingly “costlier” choice for the speakers.
Project results will be appearing on this page: https://research.flw.ugent.be/en/projects/causality(s’ouvre dans une nouvelle fenêtre)
The statistical approach to the Old English article semantics led to a breakthrough in the understanding of the article system of historical English. For the first time there is strong quantitative evidence that Old English had a definite article whose quantitative signature and, by hypothesis, semantics, is identical to that of its Present-Day-English counterpart. These results point to a much greater than previously assumed stability of article systems across time.
The project is also groundbreaking in identifying strong correlations between changes in morphological case expression and word orders, finally lending statistical support to an age-old intuition.