Skip to main content
European Commission logo print header

A computational approach to early language bootstrapping

Cel

During their first year of life, infants become attuned to the phonemes, words and phonological rules of their language, with little or no adult supervision. After 30 years of accumulated experimental results, we are still lacking an account for the puzzling fact that these 3 interdependent components of language are acquired not sequentially, but in parallel. Drawing tools from Machine Learning and Automatic Speech Recognition, we construct a model of this early process, test it on 2 large spontaneous speech databases (Japanese, French and Dutch) and test its predictions in infants using behavioral, EEGs and fNIRS techniques.
1. Coding. We study different ways of defining coding features for speech, from fine-grained to coarse grained, in view of the automatic discovery of a hierarchy of linguistic units. We compare this with a systematic study of the units of speech coding as they unfold in 6, 9 and 12 month old infants..
2. Lexicon. Infants recognize some words before they know the phonemes of their language; we modify existing word segmentation algorithms so they can work on raw speech. We test the unique prediction that infants start with a large lexicon that’s quite different from the adult one.
3. Rules. Phonemes are produced as overlapping, coarticulated gestures. To untangle these context effects, we use a predictive model of coarticulation in auditory space and invert it. We test when and how infants perform reverse coarticulation.
4. Integration. The above subprojects provide only an initial bootstrapping into approximate phonemes, words, and contextual rules. We show how to iteratively integrate these approximate representations to derive better ones. The outcome will be numerically assessed on an adult directed and infant directed speech database, and compared to those of to state-of-the-art supervized phoneme recognizers. The predictions will be tested in infants learning artificial languages and in a longitudinal study.

Zaproszenie do składania wniosków

ERC-2011-ADG_20110406
Zobacz inne projekty w ramach tego zaproszenia

System finansowania

ERC-AG - ERC Advanced Grant

Instytucja przyjmująca

ECOLE DES HAUTES ETUDES EN SCIENCES SOCIALES
Wkład UE
€ 2 194 557,00
Adres
54 BD RASPAIL
75270 Paris
Francja

Zobacz na mapie

Region
Ile-de-France Ile-de-France Paris
Rodzaj działalności
Higher or Secondary Education Establishments
Kierownik naukowy
Emmanuel Dupoux (Prof.)
Kontakt administracyjny
Alexandre Burger (Mr.)
Linki
Koszt całkowity
Brak danych

Beneficjenci (1)