Voice driven interaction in XR spaces

Projektbeschreibung

XR-Erlebnisse, die Bild und Ton kombinieren

Technologien der Extended Reality (XR) stehen kurz davor, den Bereich Mensch-Computer-Interaktion zu dominieren, indem sie die traditionellen Ansätze überholen. Zwei weitere Fachgebiete, die einen ähnlichen Aufschwung erleben, sind die Verarbeitung natürlicher Sprache und das maschinelle Sehen, vor allem aufgrund des Aufkommens datengesteuerter Methoden in den Bereichen maschinelles Lernen und künstliche Intelligenz (KI). VOXreality strebt danach, diese parallelen Bereiche zu verschmelzen und KI-Modelle zu entwerfen und zu entwickeln, die Sprache als zentrales Interaktionsmedium zusammen mit visuellem Verständnis integrieren. Der Schwerpunkt liegt auf der Erstellung von vortrainierten XR-Modellen, die das räumliche und semantische Wissen von XR- und Sprachverarbeitungssystemen miteinander verknüpfen. Daraus könnte eine neue Ära von Anwendungen entstehen, die auf dem ganzheitlichen Verständnis der Ziele der Nutzenden aufbauen, weg von Geräten und Controllern.

Ziel

VOXReality is an ambitious project whose goal will be to facilitate and exploit the convergence of two important technologies, natural language processing (NLP) and computer vision (CV). Both technologies are experiencing a huge performance increase due to the emergence of data-driven methods, specifically machine learning (ML) and artificial intelligence (AI). On the one hand, CV/ML are driving the extended reality (XR) revolution beyond what was possible up to now, and, on the other, speech-based interfaces and text-based content understanding are revolutionising human-machine and human-human interaction. VOXReality will employ an economical approach to combine these two. VOXReality will pursue the integration of language- and vision-based AI models with either unidirectional or bidirectional exchanges between the two modalities. Vision systems drive both AR and VR, while language understanding adds a natural way for humans to interact with the back-ends of XR systems or create multimodal XR experiences combining vision and sound. The results of the project will be twofold: 1) a set of pretrained next-generation XR models combining, in various levels and ways, language and vision AI and enabling richer, more natural immersive experiences that are expected to boost XR adoption, and 2) a set of applications using these models to demonstrate innovations in various sectors. The above technologies will be validated through three use cases: 1) Personal Assistants that are an emerging type of digital technology that seeks to support humans in their daily tasks, with their core functionalities related to human-to-machine interaction; 2) Virtual Conferences that are completely hosted and run online, typically using a virtual conferencing platform that sets up a shared virtual environment, allowing their attendees to view or participate from anywhere in the world; 3) Theaters where VOXReality will combine language translation, audiovisual user associations and AR VFX triggered by predetermined speech.

Wissenschaftliches Gebiet

Schlüsselbegriffe

Koordinator

MAGGIOLI SPA

Netto-EU-Beitrag

€ 1 483 750,00

Adresse

VIA DEL CARPINO 8
47822 Santarcangelo Di Romagna
Italien

Region

Nord-Est Emilia-Romagna Rimini

Aktivitätstyp

Private for-profit entities (excluding Higher or Secondary Education Establishments)

Links

Die Organisation kontaktieren

Teilnahme an EU-FuI-Programmen

HORIZON-Kooperationsnetzwerk

Gesamtkosten

€ 1 483 750,00

Beteiligte (9)

ETHNIKO KENTRO EREVNAS KAI TECHNOLOGIKIS ANAPTYXIS

Griechenland

Netto-EU-Beitrag

€ 451 875,00

UNIVERSITEIT MAASTRICHT

Niederlande

Netto-EU-Beitrag

€ 703 250,00

STICHTING NEDERLANDSE WETENSCHAPPELIJK ONDERZOEK INSTITUTEN

Niederlande

Netto-EU-Beitrag

€ 450 125,00

SYNELIXIS LYSEIS PLIROFORIKIS AUTOMATISMOU & TILEPIKOINONION ANONIMI ETAIRIA

Griechenland

Netto-EU-Beitrag

€ 561 187,50

STICHTING DUTCH VIRTUAL REALITY DAYS

Niederlande

Netto-EU-Beitrag

€ 252 125,00

ADAPT IT AE

Griechenland

Netto-EU-Beitrag

€ 151 125,00

F6S NETWORK IRELAND LIMITED

Irland

Netto-EU-Beitrag

€ 223 437,50

HOLO-INDUSTRIE 4.0 SOFTWARE GMBH

Deutschland

Netto-EU-Beitrag

€ 331 875,00

ELLINIKO FESTIVAL ANONYMOS ETAIREIA

Griechenland

Netto-EU-Beitrag

€ 178 125,00

Projektbeschreibung

XR-Erlebnisse, die Bild und Ton kombinieren

Ziel

Wissenschaftliches Gebiet

Schlüsselbegriffe

Programm/Programme

Thema/Themen

Aufforderung zur Vorschlagseinreichung

Finanzierungsplan

Koordinator

Beteiligte (9)

Diese Seite teilen

Herunterladen