BIg Speech data analytics for cONtact centres

Información del proyecto

BISON

Identificador del acuerdo de subvención: 645323

DOI

10.3030/645323

Proyecto cerrado

Fecha de la firma de la CE 12 Diciembre 2014

Fecha de inicio 1 Enero 2015

Fecha de finalización 31 Diciembre 2017

Financiado con arreglo a

INDUSTRIAL LEADERSHIP - Leadership in enabling and industrial technologies - Information and Communication Technologies (ICT)

Coste total

€ 4 097 952,50

Aportación de la UE

€ 3 090 824,50

3 090 824,50

1 007 128,00

Coordinado por

PHONEXIA SRO
Czechia

CORDIS proporciona enlaces a los documentos públicos y las publicaciones de los proyectos de los programas marco HORIZONTE.

Los enlaces a los documentos y las publicaciones de los proyectos del Séptimo Programa Marco, así como los enlaces a algunos tipos de resultados específicos, como conjuntos de datos y «software», se obtienen dinámicamente de OpenAIRE .

Resultado final

bigBison

The final prototype - an open environment, allowing to be used in stand-alone configuration or in integration with third-party infrastructure (Result of all WP6 Tasks). The system will demonstrate full capabilities of the technology. All speech technologies will come in noise-robust and optimized versions, transcription and keyword spotting will be available all languages covering the needs of end-user in the consortium (see also Table 3), and methodologies for rapid and cost-effective development of new ones (leveraging on customer data) will be provided. The system will be fully integrated with one large CC hardware and software infrastructure and generation of real business outputs will be demonstrated on real data.

smallBison

Initial prototype demonstrating the BISON technologies. A software system contain a full range of speech mining technologies (speech recognition, speaker verification, language identification and voice activity detection) in 9 languages (see Table 3) and a simple presentation of results. Although working on off-line data set and with rudimentary UI, it will be deployed with the CCs in the project to gather initial user feedback. It is based on intermediate results of T6.2 through T6.6.

Legal, ethical and societal issues of BISON - The BISON ethical and societal code

Starting from the outcome of the previous deliverables (D8.[234]) and with the support of the feedback by project partners during the development and deployment of BISON, D8.5 will set the rules and procedures for BISON as concerns ethics. It will be addressed both to BISON partners and to CCs, while a dedicated schematic section will be addressed to the wider public for awareness building and information. The deliverable will also provide ethical approvals for the planned collection and analyses of personal data, if updates or new approvals as compared to the approvals submitted in M3 are needed.

Optimizing speech data mining for CC operation

A progress report on advancing speech data mining for the dynamic CC environment. Will include notes on scalability and real-time (T4.3), fast bootstrapping of recognizers for new languages (T4.4), and component evaluations (T4.6).

Indexing and database access to big speech data

software for fast database access to speech and mined data.

Initial speech mining technologies

A set of SW consolidating existing or slightly adapted speech data miners to provide fast start of the project. Mainly based on the results of T4.1 and T4.2, includes the results of component evaluation T4.6.

Final set of speech technologies adapted for Contact Centers

Software modules and associated report describing the final version of CC-adapted speech mining technologies, including innovation during BISON lifetime. Includes the results of T4.5 and all preceding Tasks.

Publicaciones

Three ways to adapt a CTS recognizer to unseen reverberated speech in BUT system for the ASpIRE challenge

Autores: KARAFIÁT Martin, GRÉZL František, BURGET Lukáš, SZŐKE Igor and ČERNOCKÝ Jan
Publicado en: Proceedings of Interspeech, 2015, Página(s) 2454-2458, ISSN 1990-9772
Editor: International Speech Communication Association

Voiceprint transformation for migration between automatic speaker identification systems .

Autores: GLEMBEK Ondřej, MATĚJKA Pavel, BURGET Lukáš, SCHWARZ Petr, PEŠÁN Jan and PLCHOT Oldřich
Publicado en: A bstract book of the 7th European Academy of Forensic Science Conference, 2015, ISBN 978-80-260-8659-8
Editor: Criminal Police Department Prague

Effect of gender and call duration on customer satisfaction in call center big data

Autores: Llimona, Quim / Luque, Jordi / Anguera, Xavier / Hidalgo, Zoraida / Park, Souneil / Oliver, Nuria
Publicado en: Proc. INTERSPEECH 2015, 2015, Página(s) 1825-1829, ISSN 1990-9772
Editor: International Speech Communication Association

Using voice quality measurements with prosodic and spectral features for speaker diarization

Autores: Woubie, Abraham / Luque, Jordi / Hernando, Javier
Publicado en: Proc. Interspeech 2015, 2015, Página(s) 3100-3104, ISSN 1990-9772
Editor: International Speech Communication Association

Residual memory networks: Feed-forward approach to learn long-term temporal dependencies

Autores: Murali Karthick Baskar, Martin Karafiat, Lukas Burget, Karel Vesely, Frantisek Grezl, Jan Cernocky
Publicado en: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017, Página(s) 4810-4814, ISBN 978-1-5090-4117-6
Editor: IEEE
DOI: 10.1109/ICASSP.2017.7953070

Residual Memory Networks in Language Modeling: Improving the Reputation of Feed-Forward Networks

Autores: Karel Beneš, Murali Karthick Baskar, Lukáš Burget
Publicado en: Interspeech 2017, 2017, Página(s) 284-288
Editor: ISCA
DOI: 10.21437/Interspeech.2017-1442

2016 BUT Babel System: Multilingual BLSTM Acoustic Model with i-Vector Based Adaptation

Autores: Martin Karafiát, Murali Karthick Baskar, Pavel Matějka, Karel Veselý, František Grézl, Lukáš Burget, Jan Černocký
Publicado en: Interspeech 2017, 2017, Página(s) 719-723
Editor: ISCA
DOI: 10.21437/Interspeech.2017-1775

Analysis of Score Normalization in Multilingual Speaker Recognition

Autores: Pavel Matějka, Ondřej Novotný, Oldřich Plchot, Lukáš Burget, Mireia Diez Sánchez, Jan Černocký
Publicado en: Interspeech 2017, 2017, Página(s) 1567-1571
Editor: ISCA
DOI: 10.21437/Interspeech.2017-803

Bayesian phonotactic Language Model for Acoustic Unit Discovery

Autores: Lucas Ondel, Lukas Burget, Jan Cernocky, Santosh Kesiraju
Publicado en: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017, Página(s) 5750-5754, ISBN 978-1-5090-4117-6
Editor: IEEE
DOI: 10.1109/ICASSP.2017.7953258

Analysis and Description of ABC Submission to NIST SRE 2016

Autores: Oldřich Plchot, Pavel Matějka, Anna Silnova, Ondřej Novotný, Mireia Diez Sánchez, Johan Rohdin, Ondřej Glembek, Niko Brümmer, Albert Swart, Jesús Jorrín-Prieto, Paola García, Luis Buera, Patrick Kenny, Jahangir Alam, Gautam Bhattacharya
Publicado en: Interspeech 2017, 2017, Página(s) 1348-1352
Editor: ISCA
DOI: 10.21437/Interspeech.2017-1498

Alternative Approaches to Neural Network Based Speaker Verification

Autores: Anna Silnova, Lukáš Burget, Jan Černocký
Publicado en: Interspeech 2017, 2017, Página(s) 1572-1575
Editor: ISCA
DOI: 10.21437/Interspeech.2017-1062

MGB-3 but system: Low-resource ASR on Egyptian YouTube data

Autores: Karel Vesely, Baskar Karthick Murali, Mireia Diez, Karel Benes
Publicado en: 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 2017, Página(s) 368-373, ISBN 978-1-5090-4788-8
Editor: IEEE
DOI: 10.1109/ASRU.2017.8268959

Semi-Supervised DNN Training with Word Selection for ASR

Autores: Karel Veselý, Lukáš Burget, Jan Černocký
Publicado en: Interspeech 2017, 2017, Página(s) 3687-3691
Editor: ISCA
DOI: 10.21437/Interspeech.2017-1385

ABC NIST SRE 2016 SYSTEM DESCRIPTION

Autores: BRUMMER Niko, SWART Albert du Preez, PRIETO Jesús J., GARCIA Perera Leibny Paola, MATĚJKA Pavel, PLCHOT Oldřich, DIEZ Sánchez Mireia, SILNOVA Anna, JIANG Xiaowei, NOVOTNÝ Ondřej, ROHDIN Johan A., GLEMBEK Ondřej, GRÉZL František, BURGET Lukáš, ONDEL Lucas, PEŠÁN Jan, ČERNOCKÝ Jan, KENNY Patrick, ALAM Jahangir, BHATTACHARYA Gautam and ZEINALI Hossein et al.
Publicado en: Proceedings of the NIST SRE Workshop, 2016
Editor: National Institute of Standards and Technology

Sequence Summarizing Neural Networks for Spoken Language Recognition

Autores: Jan Pešán, Lukáš Burget, Jan Černocký
Publicado en: Interspeech 2016, 2016, Página(s) 3285-3288
Editor: ISCA
DOI: 10.21437/Interspeech.2016-764

Analysis of DNN approaches to speaker identification

Autores: Pavel Matejka, Ondrej Glembek, Ondrej Novotny, Oldrich Plchot, Frantisek Grezl, Lukas Burget, Jan Honza Cernocky
Publicado en: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016, Página(s) 5100-5104, ISBN 978-1-4799-9988-0
Editor: IEEE Signal Processing Society
DOI: 10.1109/ICASSP.2016.7472649

Improving i-Vector and PLDA Based Speaker Clustering with Long-Term Features

Autores: Abraham Woubie, Jordi Luque, Javier Hernando
Publicado en: Interspeech 2016, 2016, Página(s) 372-376
Editor: ISCA
DOI: 10.21437/Interspeech.2016-339

Audio enhancing with DNN autoencoder for speaker recognition

Autores: Oldrich Plchot, Lukas Burget, Hagai Aronowitz, Pavel Matejka
Publicado en: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016, Página(s) 5090-5094, ISBN 978-1-4799-9988-0
Editor: IEEE Signal Processing Society
DOI: 10.1109/ICASSP.2016.7472647

Analysis of the DNN-Based SRE Systems in Multi-language Conditions

Autores: NOVOTNÝ Ondřej, MATĚJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, GRÉZL František, BURGET Lukáš and ČERNOCKÝ Jan
Publicado en: Proceedings of the 2016 IEEE Workshop on Spoken Language Technology (SLT 2016), 2016, Página(s) 199-204, ISBN 978-1-5090-4903-5
Editor: IEEE Signal Processing Society

Short- and Long-Term Speech Features for Hybrid HMM-i-Vector based Speaker Diarization System

Autores: Abraham Woubie Zewoudie, Jordi Luque, Javier Hernando
Publicado en: Odyssey 2016, 2016, Página(s) 400-406
Editor: ISCA
DOI: 10.21437/Odyssey.2016-58

HMM-Based Phrase-Independent i-Vector Extractor for Text-Dependent Speaker Verification

Autores: Hossein Zeinali, Hossein Sameti, Lukas Burget
Publicado en: IEEE/ACM Transactions on Audio, Speech, and Language Processing, Edición 25/7, 2017, Página(s) 1421-1435, ISSN 2329-9290
Editor: IEEE Advancing Technology for Humanity
DOI: 10.1109/TASLP.2017.2694708

Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models

Autores: Hossein Zeinali, Hossein Sameti, Lukáš Burget, Jan “Honza” Černocký
Publicado en: Computer Speech & Language, Edición 46, 2017, Página(s) 53-71, ISSN 0885-2308
Editor: Academic Press
DOI: 10.1016/j.csl.2017.04.005

Variational Inference for Acoustic Unit Discovery

Autores: Lucas Ondel, Lukaš Burget, Jan Černocký
Publicado en: Procedia Computer Science, Edición 81, 2016, Página(s) 80-86, ISSN 1877-0509
Editor: Procedia Computer Science
DOI: 10.1016/j.procs.2016.04.033

Study of Large Data Resources for Multilingual Training and System Porting

Autores: František Grézl, Ekaterina Egorova, Martin Karafiát
Publicado en: Procedia Computer Science, Edición 81, 2016, Página(s) 15-22, ISSN 1877-0509
Editor: Procedia Computer Science, managed by Elsevier Science
DOI: 10.1016/j.procs.2016.04.024

Bottle-Neck Feature Extraction Structures for Multilingual Training and Porting

Autores: František Grézl, Martin Karafiát
Publicado en: Procedia Computer Science, Edición 81, 2016, Página(s) 144-151, ISSN 1877-0509
Editor: Procedia Computer Science, Volume 81, managed by Elsevier Science
DOI: 10.1016/j.procs.2016.04.042

Semi-Supervised Training of Language Model on Spanish Conversational Telephone Speech Data

Autores: Ekaterina Egorova, Jordi Luque Serrano
Publicado en: Procedia Computer Science, Edición 81, 2016, Página(s) 114-120, ISSN 1877-0509
Editor: Procedia Computer Science, Volume 81, managed by Elsevier Science
DOI: 10.1016/j.procs.2016.04.038

Automatic Speech Feature Learning for Continuous Prediction of Customer Satisfaction in Contact Center Phone Calls

Autores: Carlos Segura, Daniel Balcells, Martí Umbert, Javier Arias, Jordi Luque
Publicado en: Advances in Speech and Language Technologies for Iberian Languages, 2016, Página(s) 255-265, ISBN 978-3-319-49169-1
Editor: Springer International Publishing
DOI: 10.1007/978-3-319-49169-1_25

Privacy Through Anonymisation in Large-Scale Socio-Technical Systems: Multi-lingual Contact Centres Across the EU

Autores: Claudia Cevenini, Enrico Denti, Andrea Omicini, Italo Cerno
Publicado en: Internet Science, 2016, Página(s) 291-305, ISBN 978-3-319-45982-0
Editor: Springer International Publishing
DOI: 10.1007/978-3-319-45982-0_25

Buscando datos de OpenAIRE...

Resultado final

Publicaciones

Descargar Descargar el contenido de la página