Final Report Summary - BOOTPHON (A computational approach to early language bootstrapping)
The ERC Bootphon has found new evidence that these two achievements are not independent but interrelated. Indeed, it has shown using machine learning techniques run on speech corpora, that consonants and vowels cannot be directly extracted from the speech signals. Instead, it is important to know at least some words to help distinguishing the important versus unimportant contrasts. Yet, the ERC team has also found that is very difficult to learn the words without knowing which are the consonants and vowels of the language. The way it proposes to solve this chicken-and-egg problem is through the join and gradual learning of approximations of consonants and vowels (proto-phonemes) and words (proto words). The algorithms proposed and analyzed by the team centers on the discovery of linguistic units from raw speech, which open up applications in the documentation of endangered languages and the construction of automatic speech recognition tools for languages without stable orthographies.
The ERC has also demonstrated that infant's input is much more noisy than thought previously. Indeed, it was thought that parents who adopt 'baby talk', which is a particular, exaggerated way of talking to infants, were simplifying the work for their infants. The ERC Bootphon, in collaboration with several teams who record infant directed inputs in their homes, showed that baby talk is actually more confusing to a naive learner, because it is much more variable than adult directed speech. The ERC Bootphon constructed a set of instruments enabling the quantitative measure of the language input available to infant learner, which is now deployed in a large international collaboration to study how language directed to infants affects language learning across cultures.