Skip to main content
European Commission logo print header

A computational approach to early language bootstrapping

Objetivo

During their first year of life, infants become attuned to the phonemes, words and phonological rules of their language, with little or no adult supervision. After 30 years of accumulated experimental results, we are still lacking an account for the puzzling fact that these 3 interdependent components of language are acquired not sequentially, but in parallel. Drawing tools from Machine Learning and Automatic Speech Recognition, we construct a model of this early process, test it on 2 large spontaneous speech databases (Japanese, French and Dutch) and test its predictions in infants using behavioral, EEGs and fNIRS techniques.
1. Coding. We study different ways of defining coding features for speech, from fine-grained to coarse grained, in view of the automatic discovery of a hierarchy of linguistic units. We compare this with a systematic study of the units of speech coding as they unfold in 6, 9 and 12 month old infants..
2. Lexicon. Infants recognize some words before they know the phonemes of their language; we modify existing word segmentation algorithms so they can work on raw speech. We test the unique prediction that infants start with a large lexicon that’s quite different from the adult one.
3. Rules. Phonemes are produced as overlapping, coarticulated gestures. To untangle these context effects, we use a predictive model of coarticulation in auditory space and invert it. We test when and how infants perform reverse coarticulation.
4. Integration. The above subprojects provide only an initial bootstrapping into approximate phonemes, words, and contextual rules. We show how to iteratively integrate these approximate representations to derive better ones. The outcome will be numerically assessed on an adult directed and infant directed speech database, and compared to those of to state-of-the-art supervized phoneme recognizers. The predictions will be tested in infants learning artificial languages and in a longitudinal study.

Convocatoria de propuestas

ERC-2011-ADG_20110406
Consulte otros proyectos de esta convocatoria

Régimen de financiación

ERC-AG - ERC Advanced Grant

Institución de acogida

ECOLE DES HAUTES ETUDES EN SCIENCES SOCIALES
Aportación de la UE
€ 2 194 557,00
Dirección
54 BD RASPAIL
75270 Paris
Francia

Ver en el mapa

Región
Ile-de-France Ile-de-France Paris
Tipo de actividad
Higher or Secondary Education Establishments
Investigador principal
Emmanuel Dupoux (Prof.)
Contacto administrativo
Alexandre Burger (Mr.)
Enlaces
Coste total
Sin datos

Beneficiarios (1)