Periodic Reporting for period 1 - BraveNewWord (The acquisition of new meanings through novel word learning)
Reporting period: 2023-06-01 to 2025-11-30
BraveNewWord posits three main cognitive mechanisms of semantic enrichment via novel words. First, the novel meaning can be induced from the sentence context in which the new word appears. Indeed, most novel words are not encountered in isolation, and the surrounding linguistic environment is informative about their meaning: when we hear “The small, hairy wug was sleeping below a tree” we have an intuition that the unfamiliar “wug” might denote some kind of animal. Second, substantial information can be obtained from the word's morphological structure; in fact, most novel words are composed of familiar sublexical units, i.e. morphemes: the meaning of “quickify” is immediately evident since we are familiar with both “quick” and “-ify”. Third, natural languages present a certain degree of systematicity, i.e. subtle associations between form and meaning; such reliable statistical patterns can be exploited to form an intuition about a novel word, even if it does not include any particularly salient or familiar sublexical element (e.g. futmaw).
In BraveNewWord, the three described mechanisms are integrated in a single computational framework to provide a new understanding of the relation between language and meaning, and of the cognitive underpinnings on which this relation is built.
The models developed by BraveNewWord naturally produce quantitative, empirically testable predictions about behaviour and neural activity. The project is testing such predictions with methodologies ranging from response times to neuroimaging to electrophysiology.
Concerning the impact of minimal linguistic context, we have observed a modulation of the N400 for novel words in context after as few as two occurrences. The N400 is an electrophysiological response, measured via EEG, that indexes how surprising a word is within a given sentence or, from a different perspective, how difficult it is to integrate the encountered element into the preceding context. This evidence indicates that novel words are rapidly assigned a meaning, which is routinely integrated with the previously presented familiar information. Crucially, the BraveNewWord computational approach can predict the N400 magnitude, and hence how well the novel word integrates with the preceding context.
Concerning morphology-induced meanings, we relied on and extended CAOSS, a model previously proposed for novel compound words (e.g. rivercat). We adapted this architecture to other types of morphologically complex elements, such as prefixed (e.g. respeak) and suffixed words (e.g. quickify), and showed that the model's predictions align with human responses in behavioral tasks across different languages. Furthermore, in a neuroimaging study, we observed distinct neural signatures for novel meanings induced by morphologically complex words.
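As an illustration of the general idea behind this kind of compositional model, the sketch below represents a novel compound as a weighted combination of its constituents' vectors, c = M·u + H·v, with the weight matrices M and H estimated by least squares from known compounds. It is a minimal sketch and not the project's implementation: the function names, the toy dimensionality and the random vectors are all assumptions made for the example, and the adaptation to prefixed and suffixed words is not covered.

```python
import numpy as np

def fit_composition(U, V, C):
    """Estimate M and H such that C ≈ U @ M.T + V @ H.T (least squares).

    U, V: (n, d) arrays of first- and second-constituent vectors.
    C:    (n, d) array of observed compound vectors.
    """
    d = U.shape[1]
    X = np.hstack([U, V])                      # (n, 2d) design matrix
    W, *_ = np.linalg.lstsq(X, C, rcond=None)  # (2d, d) stacked weights
    M = W[:d].T                                # weights for the first constituent
    H = W[d:].T                                # weights for the second constituent
    return M, H

def compose(M, H, u, v):
    """Predict the vector of a novel compound from its constituent vectors."""
    return M @ u + H @ v

# Toy data (hypothetical): random vectors stand in for real word embeddings.
rng = np.random.default_rng(0)
d, n = 4, 50
M_true, H_true = rng.normal(size=(d, d)), rng.normal(size=(d, d))
U, V = rng.normal(size=(n, d)), rng.normal(size=(n, d))
C = U @ M_true.T + V @ H_true.T

M, H = fit_composition(U, V, C)
river, cat = rng.normal(size=d), rng.normal(size=d)  # placeholder constituents
rivercat = compose(M, H, river, cat)
```

In a setting like the one described above, the predicted vector could then be compared with the vectors of familiar words (for instance by cosine similarity) to derive testable predictions about how people interpret the novel form.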
Concerning form-meaning mapping, across a number of studies we investigated human intuitions about the possible meaning of completely unfamiliar linguistic strings (e.g. futmaw). Participants produced consistent responses across a range of different semantic dimensions, up to actual definitions. Such responses were significantly predicted by the BraveNewWord computational approach. Moreover, unlike the rest of the BraveNewWord endeavour, which typically moves from language to semantics, we designed a system that moves in the opposite direction, producing a new word on the basis of a desired meaning.
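One simple way to picture how form alone can suggest meaning is to associate character n-grams with the average meaning of the familiar words that contain them, and then average the n-gram vectors of an unfamiliar string. The sketch below follows that idea; it is an assumption chosen for illustration, not a description of the model used in the project, and the function names, the two-dimensional toy vectors and the tiny lexicon are all hypothetical.

```python
from collections import defaultdict
import numpy as np

def char_ngrams(word, n=3):
    """Character n-grams of a word, with '<' and '>' marking its boundaries."""
    padded = f"<{word}>"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]

def learn_ngram_vectors(lexicon, n=3):
    """Give each n-gram the mean vector of the known words that contain it."""
    buckets = defaultdict(list)
    for word, vec in lexicon.items():
        for gram in set(char_ngrams(word, n)):
            buckets[gram].append(np.asarray(vec, dtype=float))
    return {gram: np.mean(vecs, axis=0) for gram, vecs in buckets.items()}

def guess_vector(string, ngram_vectors, n=3):
    """Average the vectors of the string's known n-grams (None if none known)."""
    known = [ngram_vectors[g] for g in char_ngrams(string, n) if g in ngram_vectors]
    if not known:
        return None
    return np.mean(known, axis=0)

# Toy lexicon (hypothetical): two familiar words with made-up 2-d meanings.
lexicon = {"bug": [1.0, 0.0], "mug": [0.0, 1.0]}
ngram_vectors = learn_ngram_vectors(lexicon)
wug = guess_vector("wug", ngram_vectors)  # shares only "ug>" with the lexicon
```

Running the mapping in the opposite direction, from a desired meaning to a new word form, would require a generative component that this sketch does not attempt.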
Given its computationally driven approach, BraveNewWord will further pave the way for new applications and research avenues. It will allow us to understand which novel elements are easier to integrate in memory, supporting the data-driven design of educational programs in second language acquisition and of interventions in the rehabilitation field. Moreover, BraveNewWord can tell us which words can be more impactful in shaping ideas, and hence in nudging behaviors. This has applications in marketing (e.g. brand generation), but also in promoting desirable societal change, such as in the domain of inclusive language.