Skip to main content
European Commission logo print header

A computational approach to early language bootstrapping

Ziel

During their first year of life, infants become attuned to the phonemes, words and phonological rules of their language, with little or no adult supervision. After 30 years of accumulated experimental results, we are still lacking an account for the puzzling fact that these 3 interdependent components of language are acquired not sequentially, but in parallel. Drawing tools from Machine Learning and Automatic Speech Recognition, we construct a model of this early process, test it on 2 large spontaneous speech databases (Japanese, French and Dutch) and test its predictions in infants using behavioral, EEGs and fNIRS techniques.
1. Coding. We study different ways of defining coding features for speech, from fine-grained to coarse grained, in view of the automatic discovery of a hierarchy of linguistic units. We compare this with a systematic study of the units of speech coding as they unfold in 6, 9 and 12 month old infants..
2. Lexicon. Infants recognize some words before they know the phonemes of their language; we modify existing word segmentation algorithms so they can work on raw speech. We test the unique prediction that infants start with a large lexicon that’s quite different from the adult one.
3. Rules. Phonemes are produced as overlapping, coarticulated gestures. To untangle these context effects, we use a predictive model of coarticulation in auditory space and invert it. We test when and how infants perform reverse coarticulation.
4. Integration. The above subprojects provide only an initial bootstrapping into approximate phonemes, words, and contextual rules. We show how to iteratively integrate these approximate representations to derive better ones. The outcome will be numerically assessed on an adult directed and infant directed speech database, and compared to those of to state-of-the-art supervized phoneme recognizers. The predictions will be tested in infants learning artificial languages and in a longitudinal study.

Aufforderung zur Vorschlagseinreichung

ERC-2011-ADG_20110406
Andere Projekte für diesen Aufruf anzeigen

Gastgebende Einrichtung

ECOLE DES HAUTES ETUDES EN SCIENCES SOCIALES
EU-Beitrag
€ 2 194 557,00
Adresse
54 BD RASPAIL
75270 Paris
Frankreich

Auf der Karte ansehen

Region
Ile-de-France Ile-de-France Paris
Aktivitätstyp
Higher or Secondary Education Establishments
Hauptforscher
Emmanuel Dupoux (Prof.)
Kontakt Verwaltung
Alexandre Burger (Mr.)
Links
Gesamtkosten
Keine Daten

Begünstigte (1)