Project description
Dynamic modelling of task-oriented dialogue systems
Spoken dialogue systems are seeing a surge in popularity and acceptance due to the rise of personal assistants. The dialogue systems currently deployed both in academia and in industry are typically built for a fixed scope of functions and do not allow easy expansion to new topics. However, as the world continues to change and knowledge expands, a truly intelligent system must be dynamic, being able to understand and reason about new topics seamlessly. The EU-funded DYMO project aims to tackle the most substantial obstacles on the way to intelligent conversational systems that can expand dynamically, spanning from the problem of operating with dynamic knowledge, dynamic policies, rich user models to sophisticated measures of quality.
Objective
With the prevalence of information technology in our daily lives, our ability to interact with machines in increasingly simplified and more human-like ways has become paramount. Information is becoming ever more abundant but our access to it is limited not least by technological restraints. Spoken dialogue systems address this issue by providing an intelligent speech interface that facilitates swift, human-like acquisition of information.
The advantages of speech interfaces are already evident from the rise of personal assistants such as Siri, Google Assistant, Cortana or Amazon Alexa. In these systems, however, the user is limited to a simple query, and the systems attempt to provide an answer within one or two turns of dialogue. To date, significant parts of these systems are rule-based and do not readily scale to changes in the domain of operation. Furthermore, rule-based systems can be brittle when speech recognition errors occur.
The vision of this project is to develop novel dialogue models that provide natural human-computer interaction beyond simple information-seeking dialogues and that continuously evolve as they are being used by exploiting both dialogue and non-dialogue data. Building such robust and intelligent spoken dialogue systems poses serious challenges in artificial intelligence and machine learning. The project will tackle four bottleneck areas that require fundamental research: automated knowledge acquisition, optimisation of complex behaviour, realistic user models and sentiment awareness. Taken together, the proposed solutions have the potential to transform the way we access information in areas as diverse as e-commerce, government, healthcare and education.
Fields of science
Keywords
Programme(s)
Topic(s)
Funding Scheme
ERC-STG - Starting GrantHost institution
40225 Dusseldorf
Germany