Voice driven interaction in XR spaces

Descripción del proyecto

Experiencias de realidad extendida que combinan visión y sonido

Las tecnologías de realidad extendida (RX) están a punto de dominar la escena de la interacción entre personas y ordenadores al superar los métodos tradicionales. Otros dos campos que están experimentando un crecimiento similar son el procesamiento del lenguaje natural (PLN) y la visión artificial, sobre todo debido a la aparición de métodos basados en datos en los ámbitos del aprendizaje automático y la inteligencia artificial (IA). El equipo del proyecto VOXreality pretende fusionar estos campos paralelos para diseñar y desarrollar modelos de IA que integren el lenguaje como un medio de interacción central, junto con la comprensión visual. Su objetivo es crear modelos de RX entrenados previamente que combinen el conocimiento espacial y semántico de los sistemas de RX y PNL. Esto podría iniciar una nueva era de aplicaciones creadas en torno a la comprensión holística de los objetivos de los usuarios, lejos de los dispositivos y controladores.

Objetivo

VOXReality is an ambitious project whose goal will be to facilitate and exploit the convergence of two important technologies, natural language processing (NLP) and computer vision (CV). Both technologies are experiencing a huge performance increase due to the emergence of data-driven methods, specifically machine learning (ML) and artificial intelligence (AI). On the one hand, CV/ML are driving the extended reality (XR) revolution beyond what was possible up to now, and, on the other, speech-based interfaces and text-based content understanding are revolutionising human-machine and human-human interaction. VOXReality will employ an economical approach to combine these two. VOXReality will pursue the integration of language- and vision-based AI models with either unidirectional or bidirectional exchanges between the two modalities. Vision systems drive both AR and VR, while language understanding adds a natural way for humans to interact with the back-ends of XR systems or create multimodal XR experiences combining vision and sound. The results of the project will be twofold: 1) a set of pretrained next-generation XR models combining, in various levels and ways, language and vision AI and enabling richer, more natural immersive experiences that are expected to boost XR adoption, and 2) a set of applications using these models to demonstrate innovations in various sectors. The above technologies will be validated through three use cases: 1) Personal Assistants that are an emerging type of digital technology that seeks to support humans in their daily tasks, with their core functionalities related to human-to-machine interaction; 2) Virtual Conferences that are completely hosted and run online, typically using a virtual conferencing platform that sets up a shared virtual environment, allowing their attendees to view or participate from anywhere in the world; 3) Theaters where VOXReality will combine language translation, audiovisual user associations and AR VFX triggered by predetermined speech.

Ámbito científico

Palabras clave

Coordinador

MAGGIOLI SPA

Aportación neta de la UEn

€ 1 483 750,00

Dirección

VIA DEL CARPINO 8
47822 Santarcangelo Di Romagna
Italia

Región

Nord-Est Emilia-Romagna Rimini

Tipo de actividad

Private for-profit entities (excluding Higher or Secondary Education Establishments)

Enlaces

Contactar con la organización

Participación en los programas de I+D de la UE

Red de colaboración de HORIZON

Coste total

€ 1 483 750,00

Participantes (9)

ETHNIKO KENTRO EREVNAS KAI TECHNOLOGIKIS ANAPTYXIS

Grecia

Aportación neta de la UEn

€ 451 875,00

UNIVERSITEIT MAASTRICHT

Países Bajos

Aportación neta de la UEn

€ 703 250,00

STICHTING NEDERLANDSE WETENSCHAPPELIJK ONDERZOEK INSTITUTEN

Países Bajos

Aportación neta de la UEn

€ 450 125,00

SYNELIXIS LYSEIS PLIROFORIKIS AUTOMATISMOU & TILEPIKOINONION ANONIMI ETAIRIA

Grecia

Aportación neta de la UEn

€ 561 187,50

STICHTING DUTCH VIRTUAL REALITY DAYS

Países Bajos

Aportación neta de la UEn

€ 252 125,00

ADAPT IT AE

Grecia

Aportación neta de la UEn

€ 151 125,00

F6S NETWORK IRELAND LIMITED

Irlanda

Aportación neta de la UEn

€ 223 437,50

HOLO-INDUSTRIE 4.0 SOFTWARE GMBH

Alemania

Aportación neta de la UEn

€ 331 875,00

ELLINIKO FESTIVAL ANONYMOS ETAIREIA

Grecia

Aportación neta de la UEn

€ 178 125,00

Descripción del proyecto

Experiencias de realidad extendida que combinan visión y sonido

Objetivo

Ámbito científico

Palabras clave

Programa(s)

Tema(s)

Convocatoria de propuestas

Régimen de financiación

Coordinador

Participantes (9)

Compartir esta página

Descargar