Robust End-To-End SPEAKER recognition based on deep learning and attention models

Project Information

ETE SPEAKER

Grant agreement ID: 843627

DOI

10.3030/843627

Project closed

EC signature date 12 April 2019

Start date 1 June 2019

End date 31 January 2021

Funded under

EXCELLENT SCIENCE - Marie Skłodowska-Curie Actions

Total cost

€ 120 817,20

EU contribution

€ 120 817,20

Coordinated by

VYSOKE UCENI TECHNICKE V BRNE
Czechia

Project description

An optimised automatic speaker recognition technology

Speech recognition is central to a broad spectrum of applications. The growing development of data exploitation and analysis techniques offers solutions for continuous improvement in the speech-processing industry. The EU-funded ETE SPEAKER project aims to develop an innovative tool based on automatic speaker recognition (SID) that isolates the information needed to determine the identity of the speaker in a speech recording. ETE SPEAKER will focus on fully investigating and utilising the potential of deep neural networks to disentangle speaker-specific information from nuisance variability. Its main aim is to introduce an end-to-end SID system that conforms to the latest Speaker Recognition Evaluation standards.

Objective

This project focuses on automatic speaker recognition (SID), the task of determining the identity of the speaker in a speech recording. Disentangling the speaker-specific information from the remaining nuisance variability requires complex models. Deep neural networks (DNNs) have recently shown their potential for this, as with the popular x-vector learnt by a DNN.
Here, we aim for end-to-end SID, where the system is optimized as a whole for the target task. Despite several attempts in this line of research, many aspects remain unexplored or only partially explored.
We also propose to explore recurrent approaches, suitable for dealing with temporal signals, as well as different pooling methods to obtain a fixed-length representation from a variable-length input sequence of speech features.
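As a rough illustration of the pooling step described above, the sketch below shows statistics pooling (per-dimension mean and standard deviation over time), one common way to turn a variable-length sequence of frame vectors into a fixed-length vector. It is not the project's implementation: the function name `stats_pool` and the toy inputs are assumptions made purely for illustration.

```python
import math


def stats_pool(frames):
    """Pool a variable-length list of equal-sized frame vectors into one
    fixed-length vector: per-dimension mean followed by standard deviation."""
    if not frames:
        raise ValueError("need at least one frame")
    n = len(frames)
    dim = len(frames[0])
    # Mean of each feature dimension across all frames.
    mean = [sum(f[d] for f in frames) / n for d in range(dim)]
    # Population variance of each dimension, then its square root.
    var = [sum((f[d] - mean[d]) ** 2 for f in frames) / n for d in range(dim)]
    std = [math.sqrt(v) for v in var]
    return mean + std  # length is always 2 * dim, whatever the input length


# Two toy "recordings" of different lengths (hypothetical 2-dimensional features).
short = [[1.0, 2.0], [3.0, 4.0]]
long = [[0.0, 1.0]] * 5
print(len(stats_pool(short)), len(stats_pool(long)))  # prints: 4 4
```

Both inputs produce a vector of the same size, which is what allows a later classifier or scoring stage to operate on recordings of arbitrary duration.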
Next, we want to explore different flavors of attention mechanisms, which make the DNN focus on the relevant parts of the input. They also provide a way to quantify how much evidence has been collected about the speaker identity and how uncertain the obtained representation is, which is a critical issue when making (Bayesian) decisions in SID.
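To make the idea of attention over input frames concrete, here is a minimal sketch of attention-weighted pooling: each frame receives a score, the scores are normalised with a softmax, and the pooled vector is the weighted average of the frames. This is only one simple flavor and is not claimed to be the method used in the project. The dot-product scoring, the `query` vector (which would be learnt in a real system) and the function names are all assumptions for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attentive_pool(frames, query):
    """Return (pooled_vector, weights) for a list of frame vectors.

    `query` is a hypothetical scoring vector; frames that align with it
    receive larger attention weights.
    """
    # One scalar relevance score per frame (dot product with the query).
    scores = [sum(x * q for x, q in zip(f, query)) for f in frames]
    weights = softmax(scores)
    dim = len(frames[0])
    # Weighted average of the frames, one value per feature dimension.
    pooled = [sum(w * f[d] for w, f in zip(weights, frames)) for d in range(dim)]
    return pooled, weights


frames = [[1.0, 0.0], [0.0, 1.0]]
pooled, weights = attentive_pool(frames, query=[5.0, 0.0])
print(weights[0] > weights[1])  # prints: True
```

With an all-zero query the weights become uniform and the result reduces to plain mean pooling, so the attention weights can be read as how much each frame contributes to the final representation.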
Finally, other approaches, such as using the raw signal (instead of features), and other advances that might arise will also be explored for SID and related tasks.
To achieve our goals, we will start from theory, implement the proposed approaches and test them on public SID benchmarks such as the NIST SREs. The outcomes are intended to benefit both the scientific community and the speech-processing industry.
The applicant, Dr. Alicia Lozano-Diez, is an excellent female researcher who completed her Ph.D. at Audias (Universidad Autonoma de Madrid, Spain), a respected research lab. The host group, Speech@FIT at Brno University of Technology (Czechia), has a top-class track record in speech processing research. Thus, we expect the combination of the researcher and the host to boost the researcher's career and benefit the host group (and its European industrial partners).

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

H2020-EU.1.3. - EXCELLENT SCIENCE - Marie Skłodowska-Curie Actions MAIN PROGRAMME
H2020-EU.1.3.2. - Nurturing excellence by means of cross-border and cross-sector mobility

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

MSCA-IF-2018 - Individual Fellowships

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

MSCA-IF-EF-ST - Standard EF

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

H2020-MSCA-IF-2018

Coordinator

VYSOKE UCENI TECHNICKE V BRNE

Net EU contribution

€ 120 817,20

Address

ANTONINSKA 548/1
602 00 BRNO STRED
Czechia

Region

Česko Jihovýchod Jihomoravský kraj

Activity type

Higher or Secondary Education Establishments
