Skip to main content
Vai all'homepage della Commissione europea (si apre in una nuova finestra)
italiano it
CORDIS - Risultati della ricerca dell’UE
CORDIS
Contenuto archiviato il 2024-04-16

High-Resolution Speech Recognition : Auditory Connectionist Technologies for Speech

Obiettivo

The key research questions explored were:
-How can auditory processing performed in the cochlea be simulated?
-How can the transient information concerning the onsets and offsets of parts of speech be extracted from an early stage of the auditory processing chain?
-How can existing connectionist recognisers be developed to perform vowel recognition using the cochlea simulation as a front-end ?
-How can training techniques for connectionist recognisers be improved?
-How do humans perform on a phoneme recognition task in the absence of lexical and contextual constraints?
The computer modelling of auditory processing and connectionist systems for speech recognition was addressed. Based on research into peripheral auditory processing and cognitive research on speech perception and memory, the primary goal was to develop a recognition system that benefits from the processing embodied in current models of the human auditory system.

At the present time, developed software and published research on the project include:
an efficient computer simulation of cochlea processing, documentation of the software for users in speech recognition and hearing research and published papers describing experiments on octave perception and phoneme recognition;
efficient implementation of a connectionist recognizer producing performance comparable to the best fast Fourier transform (FET) filterbank and improved training and pattern recognition algorithms;
experimental research on the structure of human phonological memory.
APPROACH AND METHODS
The goal was to construct a recognition system involving a high-resolution front-end processor based on current models of the human auditory system, and a word-recognition system based on abstract phonological representations and a phonological buffer store.
The three parallel research streams aimed to:
-Develop a detailed but efficient functional model of human hearing in five stages: spectral analysis, neural transduction, phase allignment, pitch extraction and timbre stabilisation. Feature extraction mechanisms used in the model were to be developedin line with psychological research.
-Compare connectionist recogniser performance using auditory preprocessing with conventional spectrographic preprocessing.
-Develop new recogniser architectures and training algorithms in line with psycholinguistic research.
Work was also directed towards developing user documentation for the cochlea simulation software, establishing base-line performance for phoneme recognition using both traditional and auditory front-ends, feature extraction software, and an efficient parallel implementation of a connectionist recogniser. Phonological memory research, experimentation concerning human vowel recognition and human psycho-acoustical research are continuing parts of the Action.
PROGRESS AND RESULTS
At the present time, developed software and published research on the Action include:
-An efficient computer simulation of cochlea processing; documentation of the software for users in speech recognition and hearing research; published papers describing experiments on octave perception and phoneme recognition.
-Efficient implementation of a connectionist recogniser producing performance comparable to the best HMM recognisers; comparisons of the auditory model front-end with an FFT filterbank; improved training and pattern-recognition algorithms.
-Experimental research on the structure of human phonological memory.
POTENTIAL
This Action is laying the foundations for the construction of balanced and efficient auditory connectionist speech recognition systems that are founded on cognitive research in the areas of speech perception and memory.

Campo scientifico (EuroSciVoc)

CORDIS classifica i progetti con EuroSciVoc, una tassonomia multilingue dei campi scientifici, attraverso un processo semi-automatico basato su tecniche NLP. Cfr.: Il Vocabolario Scientifico Europeo.

È necessario effettuare l’accesso o registrarsi per utilizzare questa funzione

Programma(i)

Programmi di finanziamento pluriennali che definiscono le priorità dell’UE in materia di ricerca e innovazione.

Argomento(i)

Gli inviti a presentare proposte sono suddivisi per argomenti. Un argomento definisce un’area o un tema specifico per il quale i candidati possono presentare proposte. La descrizione di un argomento comprende il suo ambito specifico e l’impatto previsto del progetto finanziato.

Dati non disponibili

Invito a presentare proposte

Procedura per invitare i candidati a presentare proposte di progetti, con l’obiettivo di ricevere finanziamenti dall’UE.

Dati non disponibili

Meccanismo di finanziamento

Meccanismo di finanziamento (o «Tipo di azione») all’interno di un programma con caratteristiche comuni. Specifica: l’ambito di ciò che viene finanziato; il tasso di rimborso; i criteri di valutazione specifici per qualificarsi per il finanziamento; l’uso di forme semplificate di costi come gli importi forfettari.

Dati non disponibili

Coordinatore

MRC Applied Psychology Unit
Contributo UE
Nessun dato
Costo totale

I costi totali sostenuti dall’organizzazione per partecipare al progetto, compresi i costi diretti e indiretti. Questo importo è un sottoinsieme del bilancio complessivo del progetto.

Nessun dato

Partecipanti (4)

Il mio fascicolo 0 0