Next-Generation Natural Language Generation

Project description

A neurosymbolic approach to language generation systems

AI programming makes it possible to produce text from data. This process is called natural language generation (NLG): complex data is converted into natural language that should read as though a person had written it. The EU-funded NG-NLG project will further explore neural approaches to language generation, which are currently used only at the experimental level. The motivation is that current neural systems generate very natural language, yet their behavior is neither transparent nor reliable. The project will develop innovative approaches that combine neural methods with explicit symbolic semantic representations. This allows greater control over the outputs and explicit logical inferences over the data. The project will test these approaches on data-to-text generation, summarization, and dialogue response generation.

Objective

This project aims to overcome the major hurdles that prevent current state-of-the-art models for natural language generation (NLG) from real-world deployment.

While deep learning and neural networks have brought considerable progress in many areas of natural language processing, neural approaches to NLG remain confined to experimental use, and production NLG systems are handcrafted. The reason is that, despite the very natural and fluent outputs of recent neural systems, neural NLG still has major drawbacks: (1) the behavior of the systems is not transparent and is hard to control (the internal representation is implicit), which leads to incorrect or even harmful outputs; (2) the models require a lot of training data and processing power, do not generalize well, and are mostly English-only. On the other hand, handcrafted models are safe, transparent and fast, but produce less fluent outputs and are expensive to adapt to new languages and domains (topics). As a result, the usefulness of NLG models in general is limited. In addition, current methods for automatic evaluation of NLG outputs are unreliable, which hampers system development.

The main aims of this project, directly addressing the above drawbacks, are:
1) Develop new approaches for NLG that combine neural approaches with explicit symbolic semantic representations, thus allowing greater control over the outputs and explicit logical inferences over the data.
2) Introduce approaches to model compression and adaptation to make models easily portable across domains and languages.
3) Develop reliable neural-symbolic approaches for evaluation of NLG systems.

We will test our approaches on multiple NLG applications: data-to-text generation (e.g. weather or sports reports), summarization, and dialogue response generation. For example, our approach will make it possible to deploy a new data reporting system for a given domain based on a few dozen example input-output pairs, compared to the thousands needed by current methods.

Funding scheme

HORIZON-ERC - HORIZON ERC Grants

Host institution

UNIVERZITA KARLOVA

Net EU contribution

€ 1 420 375,00

Address

OVOCNY TRH 560/5
116 36 Praha 1
Czechia

Region

Česko Praha Hlavní město Praha

Activity type

Higher or Secondary Education Establishments

Total cost

€ 1 420 375,00

Beneficiaries (1)

UNIVERZITA KARLOVA

Czechia

Net EU contribution

€ 1 420 375,00
