Skip to main content

Training Network on Automatic Processing of PAthological Speech

Deliverables

Training reports M24 (D4.3)

Training reports M24 (D4.3)

Training reports M12 (D4.3)

Training reports M12 (D4.3)

Network wide training and TAPAS workshop M21 (D4.1)

Network wide training and TAPAS workshop M21 (D4.1)

Network wide training and TAPAS workshop M27 (D4.1)

Network wide training and TAPAS workshop M27 (D4.1)

Dissemination and public engagement (D5.2)

Dissemination and public engagement (D5.2)

Network wide training and TAPAS workshop M15 (D4.1)

Network wide training and TAPAS workshop M15 (D4.1)

Network wide training and TAPAS workshop M9 (D4.1)

Network wide training and TAPAS workshop M9 (D4.1)

Publications

A Multitask Learning Approach to Assess the Dysarthria Severity in Patients with Parkinson's Disease

Author(s): Juan Camilo Vásquez Correa, Tomas Arias, Juan Rafael Orozco-Arroyave, Elmar Nöth
Published in: Interspeech 2018, 2018, Page(s) 456-460
Publisher: ISCA
DOI: 10.21437/interspeech.2018-1988

Multimodal I-vectors to Detect and Evaluate Parkinson's Disease

Author(s): Nicanor Garcia, Juan Camilo Vásquez Correa, Juan Rafael Orozco-Arroyave, Elmar Nöth
Published in: Interspeech 2018, 2018, Page(s) 2349-2353
Publisher: ISCA
DOI: 10.21437/interspeech.2018-2295

Acoustic correlates of speech intelligibility: the usability of the eGeMAPS feature set for atypical speech

Author(s): Wei Xue, Catia Cucchiarini, Roeland van Hout, Helmer Strik
Published in: SLaTE 2019: 8th ISCA Workshop on Speech and Language Technology in Education, 2019, Page(s) 48-52
Publisher: ISCA
DOI: 10.21437/slate.2019-9

Speech differences between CI users with pre- and postlingual onset of deafness detected by speech processing methods on voiceless to voice transitions

Author(s): T Arias Vergara, S Gollwitzer, JR Orozco-Arroyave, JC Vasquez-Correa, E Nöth, C Högerle, M Schuster
Published in: Abstract- und Posterband – 90. Jahresversammlung der Deutschen Gesellschaft für HNO-Heilkunde, Kopf- und Hals-Chirurgie e.V., Bonn – Digitalisierung in der HNO-Heilkunde, 2019
Publisher: Georg Thieme Verlag KG
DOI: 10.1055/s-0039-1686328

Feature Space Visualization with Spatial Similarity Maps for Pathological Speech Data

Author(s): Philipp Klumpp, J.C. Vásquez-Correa, Tino Haderlein, Elmar Nöth
Published in: Interspeech 2019, 2019, Page(s) 3068-3072
Publisher: ISCA
DOI: 10.21437/interspeech.2019-2080

Phonet: A Tool Based on Gated Recurrent Neural Networks to Extract Phonological Posteriors from Speech

Author(s): J.C. Vásquez-Correa, Philipp Klumpp, Juan Rafael Orozco-Arroyave, Elmar Nöth
Published in: Interspeech 2019, 2019, Page(s) 549-553
Publisher: ISCA
DOI: 10.21437/interspeech.2019-1405

Feature Representation of Pathophysiology of Parkinsonian Dysarthria

Author(s): Alice Rueda, J.C. Vásquez-Correa, Cristian David Rios-Urrego, Juan Rafael Orozco-Arroyave, Sridhar Krishnan, Elmar Nöth
Published in: Interspeech 2019, 2019, Page(s) 3048-3052
Publisher: ISCA
DOI: 10.21437/interspeech.2019-2490

Apkinson — A Mobile Monitoring Solution for Parkinson’s Disease

Author(s): Philipp Klumpp, Thomas Janu, Tomás Arias-Vergara, J.C. Vásquez-Correa, Juan Rafael Orozco-Arroyave, Elmar Nöth
Published in: Interspeech 2017, 2017, Page(s) 1839-1843
Publisher: ISCA
DOI: 10.21437/interspeech.2017-416

Automatic Diagnosis of Alzheimer’s Disease Using Neural Network Language Models

Author(s): Julian Fritsch, Sebastian Wankerl, Elmar Noth
Published in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019, Page(s) 5841-5845, ISBN 978-1-4799-8131-1
Publisher: IEEE
DOI: 10.1109/icassp.2019.8682690

Automatic Hierarchical Attention Neural Network for Detecting AD

Author(s): Yilin Pan, Bahman Mirheidari, Markus Reuber, Annalena Venneri, Daniel Blackburn, Heidi Christensen
Published in: Interspeech 2019, 2019, Page(s) 4105-4109
Publisher: ISCA
DOI: 10.21437/interspeech.2019-1799

Phone-Attribute Posteriors to Evaluate the Speech of Cochlear Implant Users

Author(s): T. Arias-Vergara, Juan Rafael Orozco-Arroyave, Milos Cernak, S. Gollwitzer, M. Schuster, Elmar Nöth
Published in: Interspeech 2019, 2019, Page(s) 3108-3112
Publisher: ISCA
DOI: 10.21437/interspeech.2019-2144

Attention-based convolutional neural networks for acoustic scene classification

Author(s): Z. Ren , Q. Kong, K. Qian, M. D. Plumbley, and B. W. Schuller
Published in: Detection and Classification of Acoustic Scenes and Events 2018, 2018
Publisher: DCASE 2018

Implicit Fusion by Joint Audiovisual Training for Emotion Recognition in Mono Modality

Author(s): Jing Han, Zixing Zhang, Zhao Ren, Bjorn Schuller
Published in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019, Page(s) 5861-5865, ISBN 978-1-4799-8131-1
Publisher: IEEE
DOI: 10.1109/icassp.2019.8682773

Automatic Detection of Major Depressive Disorder via a Bag-of-Behaviour-Words Approach

Author(s): Kun Qian, Hiroyuki Kuromiya, Zhao Ren, Maximilian Schmitt, Zixing Zhang, Toru Nakamura, Kazuhiro Yoshiuchi, Björn W. Schuller, Yoshiharu Yamamoto
Published in: Proceedings of the Third International Symposium on Image Computing and Digital Medicine - ISICDM 2019, 2019, Page(s) 71-75, ISBN 9781-450372626
Publisher: ACM Press
DOI: 10.1145/3364836.3364851

Evaluation of the Pain Level from Speech: Introducing a Novel Pain Database and Benchmarks

Author(s): Z. Ren , N. Cummins, J. Han, S. Schnieder, J. Krajewski, and B. Schuller
Published in: Speech Communication; 13th ITG-Symposium, 2018, ISBN 978-3-8007-4767-2
Publisher: VDE

Multi-instance Learning for Bipolar Disorder Diagnosis using Weakly Labelled Speech Data

Author(s): Zhao Ren, Jing Han, Nicholas Cummins, Qiuqiang Kong, Mark D. Plumbley, Björn W. Schuller
Published in: Proceedings of the 9th International Conference on Digital Public Health - DPH2019, 2019, Page(s) 79-83, ISBN 9781-450372084
Publisher: ACM Press
DOI: 10.1145/3357729.3357743

Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acoustic Scenes

Author(s): Zhao Ren, Qiuqiang Kong, Jing Han, Mark D. Plumbley, Bjorn W. Schuller
Published in: ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019, Page(s) 56-60, ISBN 978-1-4799-8131-1
Publisher: IEEE
DOI: 10.1109/icassp.2019.8683434

Deep Sensing of Breathing Signal During Conversational Speech

Author(s): Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik
Published in: Interspeech 2019, 2019, Page(s) 4110-4114
Publisher: ISCA
DOI: 10.21437/interspeech.2019-1796

Unobtrusive Monitoring of Speech Impairments of Parkinson'S Disease Patients Through Mobile Devices

Author(s): T. Arias-Vergara, J.C. Vasquez-Correa, J.R. Orozco-Arroyave, P. Klumpp, E. Noth
Published in: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, Page(s) 6004-6008, ISBN 978-1-5386-4658-8
Publisher: IEEE
DOI: 10.1109/icassp.2018.8462332

Dysarthric Speech Recognition with Lattice-Free MMI

Author(s): Enno Hermann, Mathew Magimai.-Doss
Published in: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, Page(s) 6109-6113, ISBN 978-1-5090-6631-5
Publisher: IEEE
DOI: 10.1109/icassp40776.2020.9053549

Analysis and evaluation of handwriting in patients with Parkinson’s disease using kinematic, geometrical, and non-linear features

Author(s): C.D. Rios-Urrego, J.C. Vásquez-Correa, J.F. Vargas-Bonilla, E. Nöth, F. Lopera, J.R. Orozco-Arroyave
Published in: Computer Methods and Programs in Biomedicine, 173, 2019, Page(s) 43-52, ISSN 0169-2607
Publisher: Elsevier BV
DOI: 10.1016/j.cmpb.2019.03.005

Speaker models for monitoring Parkinson’s disease progression considering different communication channels and acoustic conditions

Author(s): T. Arias-Vergara, J.C. Vásquez-Correa, J.R. Orozco-Arroyave, E. Nöth
Published in: Speech Communication, 101, 2018, Page(s) 11-25, ISSN 0167-6393
Publisher: Elsevier BV
DOI: 10.1016/j.specom.2018.05.007

Towards an automatic evaluation of the dysarthria level of patients with Parkinson's disease

Author(s): J.C. Vásquez-Correa, J.R. Orozco-Arroyave, T. Bocklet, E. Nöth
Published in: Journal of Communication Disorders, 76, 2018, Page(s) 21-36, ISSN 0021-9924
Publisher: Elsevier BV
DOI: 10.1016/j.jcomdis.2018.08.002

Effects of oral and oropharyngeal cancer on speech intelligibility using acoustic analysis: Systematic review

Author(s): Mathieu Balaguer, Timothy Pommée, Jérôme Farinas, Julien Pinquier, Virginie Woisard, Renée Speyer
Published in: Head & Neck, 42/1, 2019, Page(s) 111-130, ISSN 1043-3074
Publisher: John Wiley & Sons Inc.
DOI: 10.1002/hed.25949

Acoustic features to characterize sentence accent production in dysarthric speech

Author(s): Viviana Mendoza Ramos, Hector A. Kairuz Hernandez-Diaz, Maria E. Hernandez-Diaz Huici, Heidi Martens, Gwen Van Nuffelen, Marc De Bodt
Published in: Biomedical Signal Processing and Control, 57, 2020, Page(s) 101750, ISSN 1746-8094
Publisher: Elsevier BV
DOI: 10.1016/j.bspc.2019.101750

Multimodal Assessment of Parkinson's Disease: A Deep Learning Approach

Author(s): Juan Camilo Vasquez-Correa, Tomas Arias-Vergara, J. R. Orozco-Arroyave, Bjorn Eskofier, Jochen Klucken, Elmar Noth
Published in: IEEE Journal of Biomedical and Health Informatics, 23/4, 2019, Page(s) 1618-1630, ISSN 2168-2194
Publisher: Institute of Electrical and Electronics Engineers Inc.
DOI: 10.1109/JBHI.2018.2866873

EmoBed: Strengthening Monomodal Emotion Recognition via Training with Crossmodal Emotion Embeddings

Author(s): Jing Han, Zixing Zhang, Zhao Ren, Bjoern W. Schuller
Published in: IEEE Transactions on Affective Computing, 2019, Page(s) 1-1, ISSN 1949-3045
Publisher: Institute of Electrical and Electronics Engineers
DOI: 10.1109/TAFFC.2019.2928297

Phonological i-Vectors to Detect Parkinson’s Disease

Author(s): N. Garcia-Ospina, T. Arias-Vergara, J. C. Vásquez-Correa, J. R. Orozco-Arroyave, M. Cernak, E. Nöth
Published in: Text, Speech, and Dialogue - 21st International Conference, TSD 2018, Brno, Czech Republic, September 11-14, 2018, Proceedings, 11107, 2018, Page(s) 462-470, ISBN 978-3-030-00793-5
Publisher: Springer International Publishing
DOI: 10.1007/978-3-030-00794-2_50

Evaluation of the Pain Level from Speech: Introducing a Novel Pain Database and Benchmarks

Author(s): Zhao Ren, Nicholas Cummins, Jing Han, Sebastian Schnieder, Jarek Krajewski, Björn Schuller
Published in: ITG-Fb. 282: Speech Communication, Konferenz: Speech Communication - 13. ITG-Fachtagung Sprachkommunikation 10.10.2018 - 12.10.2018 in Oldenburg, Deutschland, 2018, ISBN 978-3-8007-4767-2
Publisher: VDE-Verlag

Natural Language Analysis to Detect Parkinson’s Disease

Author(s): P. A. Pérez-Toro, J. C. Vásquez-Correa, M. Strauss, J. R. Orozco-Arroyave, E. Nöth
Published in: Text, Speech, and Dialogue - 22nd International Conference, TSD 2019, Ljubljana, Slovenia, September 11–13, 2019, Proceedings, 11697, 2019, Page(s) 82-90, ISBN 978-3-030-27946-2
Publisher: Springer International Publishing
DOI: 10.1007/978-3-030-27947-9_7

Convolutional Neural Networks and a Transfer Learning Strategy to Classify Parkinson’s Disease from Speech in Three Different Languages

Author(s): Juan Camilo Vásquez-Correa, Tomas Arias-Vergara, Cristian D. Rios-Urrego, Maria Schuster, Jan Rusz, Juan Rafael Orozco-Arroyave, Elmar Nöth
Published in: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications - 24th Iberoamerican Congress, CIARP 2019, Havana, Cuba, October 28-31, 2019, Proceedings, 11896, 2019, Page(s) 697-706, ISBN 978-3-030-33903-6
Publisher: Springer International Publishing
DOI: 10.1007/978-3-030-33904-3_66

Articulation and Empirical Mode Decomposition Features in Diadochokinetic Exercises for the Speech Assessment of Parkinson’s Disease Patients

Author(s): Juan Camilo Vásquez-Correa, Cristian D. Rios-Urrego, Alice Rueda, Juan Rafael Orozco-Arroyave, Sri Krishnan, Elmar Nöth
Published in: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications - 24th Iberoamerican Congress, CIARP 2019, Havana, Cuba, October 28-31, 2019, Proceedings, 11896, 2019, Page(s) 688-696, ISBN 978-3-030-33903-6
Publisher: Springer International Publishing
DOI: 10.1007/978-3-030-33904-3_65

Consonant-to-Vowel/Vowel-to-Consonant Transitions to Analyze the Speech of Cochlear Implant Users

Author(s): T. Arias-Vergara, J. R. Orozco-Arroyave, S. Gollwitzer, M. Schuster, E. Nöth
Published in: Text, Speech, and Dialogue - 22nd International Conference, TSD 2019, Ljubljana, Slovenia, September 11–13, 2019, Proceedings, 11697, 2019, Page(s) 299-306, ISBN 978-3-030-27946-2
Publisher: Springer International Publishing
DOI: 10.1007/978-3-030-27947-9_25

Multi-channel Convolutional Neural Networks for Automatic Detection of Speech Deficits in Cochlear Implant Users

Author(s): Tomas Arias-Vergara, Juan Camilo Vasquez-Correa, Sandra Gollwitzer, Juan Rafael Orozco-Arroyave, Maria Schuster, Elmar Nöth
Published in: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications - 24th Iberoamerican Congress, CIARP 2019, Havana, Cuba, October 28-31, 2019, Proceedings, 11896, 2019, Page(s) 679-687, ISBN 978-3-030-33903-6
Publisher: Springer International Publishing
DOI: 10.1007/978-3-030-33904-3_64

Phonological Posteriors and GRU Recurrent Units to Assess Speech Impairments of Patients with Parkinson’s Disease

Author(s): Juan Camilo Vásquez-Correa, Nicanor Garcia-Ospina, Juan Rafael Orozco-Arroyave, Milos Cernak, Elmar Nöth
Published in: Text, Speech, and Dialogue - 21st International Conference, TSD 2018, Brno, Czech Republic, September 11-14, 2018, Proceedings, 11107, 2018, Page(s) 453-461, ISBN 978-3-030-00793-5
Publisher: Springer International Publishing
DOI: 10.1007/978-3-030-00794-2_49

Automatic Intelligibility Assessment of Parkinson’s Disease with Diadochokinetic Exercises

Author(s): L. Felipe Parra-Gallego, Tomás Arias-Vergara, Juan Camilo Vásquez-Correa, Nicanor Garcia-Ospina, Juan Rafael Orozco-Arroyave, Elmar Nöth
Published in: Applied Computer Sciences in Engineering - 5th Workshop on Engineering Applications, WEA 2018, Medellín, Colombia, October 17-19, 2018, Proceedings, Part II, 916, 2018, Page(s) 223-230, ISBN 978-3-030-00352-4
Publisher: Springer International Publishing
DOI: 10.1007/978-3-030-00353-1_20

A Non-linear Dynamics Approach to Classify Gait Signals of Patients with Parkinson’s Disease

Author(s): Paula Andrea Pérez-Toro, Juan Camilo Vásquez-Correa, Tomas Arias-Vergara, Nicanor Garcia-Ospina, Juan Rafael Orozco-Arroyave, Elmar Nöth
Published in: Applied Computer Sciences in Engineering - 5th Workshop on Engineering Applications, WEA 2018, Medellín, Colombia, October 17-19, 2018, Proceedings, Part II, 916, 2018, Page(s) 268-278, ISBN 978-3-030-00352-4
Publisher: Springer International Publishing
DOI: 10.1007/978-3-030-00353-1_24

Datasets

Oral cancer speech corpus

Author(s): Halpern, Bence Mark; Son, Rob Van; Brekel, Michiel Van Den; Scharenborg, Odette
Published in: Zenodo

Oral cancer speech corpus for paper "Detecting and analysing spontaneous oral cancer speech in the wild"

Author(s): Bence Mark Halpern; Rob van Son; Michiel van den Brekel; Odette Scharenborg
Published in: Zenodo

Software

idiap/torgo_asr: Torgo ASR 1.0.0

Author(s): Enno Hermann; Magimai.-Doss, Mathew
DOI: 10.5281/zenodo.4073245; 10.5281/zenodo.4073246
Publisher: Zenodo

karkirowle/relative_phoneme_analysis: First release

Author(s): Bence Halpern
DOI: 10.5281/zenodo.4679550; 10.5281/zenodo.4679551
Publisher: Zenodo