Skip to main content

"Dereverberation and Reverberation of Audio, Music, and Speech"

Publicaciones

Reverberant speech recognition exploiting clarity index estimation

Autores: Patrick A. Naylor; Toon van Waterschoot; Pablo Peso Parada; Dushyant Sharma
Publicado en: Springer Nature 12 2015
Identificador permanente: Digital Object Identifier:10.1186/s13634-015-0237-7

Distributed Remote Vector Gaussian Source Coding for Wireless Acoustic Sensor Networks

Autores: Søren Holdt Jensen; Jan Ostergaard; Søren Bech; Adel Zahedi; Patrick A. Naylor
Publicado en: IEEE Press Zahedi , A , Østergaard , J , Jensen , S H , Naylor , P & Bech , S 2014 , Distributed Remote Vector Gaussian Source Coding for Wireless Acoustic Sensor Networks . in Data Compression Conference (DCC), 2014 . IEEE Press , Data Compression Conference. Proceedings , pp. 263-272 , 2014 Data Compression Conference , Snowbird, Utah , United States , 26/03/2014 . https://doi.org/10.1109/DCC.2014.27 2014
Identificador permanente: arXiv:1401.3945; Digital Object Identifier:10.1109/dcc.2014.27; Digital Object Identifier:10.48550/arxiv.1401.3945

A single-channel non-intrusive C50 estimator correlated with speech recognition performance

Autores: Dushyant Sharma; Daniel A. Barreda; Patrick A. Naylor; Jose Lainez; Toon van Waterschoot; Pablo Peso Parada
Publicado en: Institute of Electrical and Electronics Engineers (IEEE) 719 2016
Identificador permanente: Digital Object Identifier:10.1109/taslp.2016.2521486

Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech

Autores: Benjamin Cauchi; Timo Gerkmann; Stefan Goetze; Simon Doclo; Stephan Gerlach; Ante Jukic; Robert Rehr; Ina Kodrasi
Identificador permanente: Digital Object Identifier:10.1186/s13634-015-0242-x

Single-Channel Online Enhancement of Speech Corrupted by Reverberation and Noise

Autores:
Identificador permanente: Digital Object Identifier:10.1109/taslp.2016.2641904

Distributed Remote Vector Gaussian Source Coding with Covariance Distortion Constraints

Autores: Søren Holdt Jensen; Patrick A. Naylor; Adel Zahedi; Søren Bech; Jan Ostergaard
Identificador permanente: arXiv:1401.6136; Digital Object Identifier:10.48550/arxiv.1401.6136; Digital Object Identifier:10.1109/isit.2014.6874900

Source Coding in Networks with Covariance Distortion Constraints

Autores: Søren Holdt Jensen; Adel Zahedi; Søren Bech; Jan Ostergaard; Patrick A. Naylor
Publicado en: arXiv Zahedi , A , Østergaard , J , Jensen , S H , Naylor , P & Bech , S 2016 , ' Source Coding in Networks with Covariance Distortion Constraints ' , I E E E Transactions on Signal Processing , vol. 64 , no. 22 , pp. 5943-5958 . https://doi.org/10.1109/TSP.2016.2603973 2015
Identificador permanente: Digital Object Identifier:10.48550/arxiv.1504.01090; Digital Object Identifier:10.1109/tsp.2016.2603973; arXiv:1504.01090

Front-end technologies for robust ASR in reverberant environments—spectral enhancement-based dereverberation and auditory modulation filterbank features

Autores: Simon Doclo; Simon Doclo; Feifei Xiong; Feifei Xiong; Robert Rehr; Jörn Anemüller; Timo Gerkmann; Stefan Goetze; Stefan Goetze; Bernd Meyer; Niko Moritz; Niko Moritz
Publicado en: Springer Science and Business Media LLC 2015
Identificador permanente: Digital Object Identifier:10.1186/s13634-015-0256-4

Maximum likelihood PSD estimation for speech enhancement in reverberation and noise

Autores: Søren Holdt Jensen; Jesper Jensen; Simon Doclo; Adam Kuklasinski
Identificador permanente: Digital Object Identifier:10.1109/taslp.2016.2573591

Restoration of images corrupted by mixed Gaussian-impulse noise by iterative soft-hard thresholding

Autores: Filipovic, Marko; Jukić, Ante
Publicado en: EURASIP Array 2014
Identificador permanente: Digital Object Identifier:10.5281/zenodo.43956

Proximal Gradient Algorithms: Applications in Signal Processing

Autores: Antonello, Niccol��; Stella, Lorenzo; Patrinos, Panagiotis; van Waterschoot, Toon
Identificador permanente: arXiv:1803.01621; Digital Object Identifier:10.48550/arxiv.1803.01621

Tap-length optimization of adaptive filters used in stereophonic acoustic echo cancellation

Autores: Asutosh Kar; Mallappa Kumara Swamy
Publicado en: Elsevier BV Kar , A & Swamy , M N S 2017 , ' Tap-length optimization of adaptive filters used in stereophonic acoustic echo cancellation ' , Signal Processing , vol. 131 , pp. 422-433 . https://doi.org/10.1016/j.sigpro.2016.09.003 2017
Identificador permanente: Digital Object Identifier:10.1016/j.sigpro.2016.09.003

Efficient multichannel acoustic echo cancellation using constrained tap selection schemes in the subband domain

Autores:
Identificador permanente: Digital Object Identifier:10.1186/s13634-017-0497-5

Audio coding in wireless acoustic sensor networks

Autores: Søren Bech; Søren Holdt Jensen; Patrick A. Naylor; Jan Ostergaard; Adel Zahedi
Publicado en: Elsevier BV 2015
Identificador permanente: Digital Object Identifier:10.1016/j.sigpro.2014.07.021

Low-Complexity Steered Response Power Mapping Based on Nyquist-Shannon Sampling

Autores: Dietzen, Thomas; De Sena, Enzo; van Waterschoot, Toon
Publicado en: IEEE Crossref 2020
Identificador permanente: Digital Object Identifier:10.1109/waspaa52581.2021.9632774; Digital Object Identifier:10.48550/arxiv.2012.09499; arXiv:2012.09499

Speech enhancement for robust automatic speech recognition: Evaluation using a baseline system and instrumental measures

Autores:
Identificador permanente: Digital Object Identifier:10.1016/j.csl.2016.11.003

Instantaneous PSD Estimation for Speech Enhancement based on Generalized Principal Components

Autores: Marc Moonen; Thomas Dietzen; Toon van Waterschoot
Publicado en: IEEE EUSIPCO 2020
Identificador permanente: arXiv:2007.00542; Digital Object Identifier:10.48550/arxiv.2007.00542; Digital Object Identifier:10.23919/eusipco47968.2020.9287839

Integrated Sidelobe Cancellation and Linear Prediction Kalman Filter for Joint Multi-Microphone Speech Dereverberation, Interfering Speech Cancellation, and Noise Reduction

Autores: Simon Doclo; Marc Moonen; Toon van Waterschoot; Thomas Dietzen
Publicado en: Institute of Electrical and Electronics Engineers 2019
Identificador permanente: Digital Object Identifier:10.48550/arxiv.1906.07512; Digital Object Identifier:10.1109/taslp.2020.2966869; arXiv:1906.07512

Efficient synthesis of room acoustics via scattering delay networks

Autores: Julius O. Smith; Zoran Cvetkovic; Enzo De Sena; Hüseyin Hacιhabiboğlu
Publicado en: arXiv De Sena , E , Hacihabiboglu , H , Cvetkovic , Z & Smith , J 2015 , ' Efficient Synthesis of Room Acoustics via Scattering Delay Networks ' , Ieee Transactions On Audio Speech And Language Processing , vol. 23 , no. 9 , pp. 1478-1492 . https://doi.org/10.1109/TASLP.2015.2438547 2015
Identificador permanente: Digital Object Identifier:10.48550/arxiv.1502.05751; arXiv:1502.05751; Digital Object Identifier:10.1109/taslp.2015.2438547

Square Root-Based Multi-Source Early PSD Estimation and Recursive RETF Update in Reverberant Environments by Means of the Orthogonal Procrustes Problem

Autores: Toon van Waterschoot; Thomas Dietzen; Simon Doclo; Marc Moonen
Identificador permanente: arXiv:1906.07493; Digital Object Identifier:10.1109/taslp.2020.2966891; Digital Object Identifier:10.48550/arxiv.1906.07493

Joint Acoustic Localization and Dereverberation Through Plane Wave Decomposition and Sparse Regularization

Autores: Toon van Waterschoot; Niccolo Antonello; Marc Moonen; Patrick A. Naylor; Enzo De Sena
Publicado en: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC IEEE/ACM Transactions on Audio, Speech, and Language Processing 2019
Identificador permanente: Digital Object Identifier:10.1109/taslp.2019.2933047

Localization Uncertainty in Time-Amplitude Stereophonic Reproduction

Autores: De Sena, Enzo; Cvetkovic, Zoran; Hachabiboglu, Huseyin; Moonen, Marc; van Waterschoot, Toon
Publicado en: Institute of Electrical and Electronics Engineers (IEEE) De Sena , E , Cvetkovic , Z , Hacihabiboglu , H , Moonen , M & van Waterschoot , T 2020 , ' Localization Uncertainty in Time-Amplitude Stereophonic Reproduction ' , Ieee Transactions On Audio Speech And Language Processing , vol. 28 , 9004547 , pp. 1000-1015 . https://doi.org/10.1109/TASLP.2020.2975419 2020
Identificador permanente: Digital Object Identifier:10.1109/taslp.2020.2975419; arXiv:1907.11425; Digital Object Identifier:10.48550/arxiv.1907.11425