Spatial 3D Semantic Understanding for Perception in the Wild

Description du projet

Révolutionner le domaine de la perception visuelle 3D

À l’ère du numérique, la compréhension de la sémantique spatiale 3D de notre monde constitue un défi majeur. Les environnements du monde réel, riches en complexité, exigent une compréhension dans leur véritable contexte tridimensionnel, même lorsqu’ils sont observés sur des images 2D. La réalisation d’un raisonnement sémantique 3D robuste à partir de données visuelles, telles que les observations RVB ou RVB-D, n’en est toutefois qu’à ses débuts, entravée par le nombre limité de données 3D réelles et par la nature complexe et hautement dimensionnelle du problème. Dans cette optique, le projet SpatialSem, financé par le CER, se propose d’exploiter la puissance de la perception 3D et jeter les bases d’avancées révolutionnaires dans les domaines de la perception machine, de la communication immersive, de la réalité mixte, ainsi que de la modélisation architecturale et industrielle. Plus précisément, le projet entend passer d’un raisonnement basé sur l’image à une représentation 3D cohérente dans l’espace.

Objectif

Understanding the 3D spatial semantics of the world around us is core to visual perception and digitization -- real-world environments are spatially three-dimensional, and must be understood in its 3D context, even from 2D image observations. This will lead to spatially-grounded reasoning and higher-level perception of the world around us. Such 3D perception will provide the foundation for transformative, next-generation technology across machine perception, immersive communications, mixed reality, architectural or industrial modeling, and more. This will enable a new paradigm in semantic understanding that derives primarily from a spatially-consistent, 3D representation rather than relying on image-based reasoning that captures only projections of the world. However, 3D semantic reasoning from visual data such as RGB or RGB-D observations remains in its infancy, due to challenges in learning from limited amounts of real-world 3D data, and moreover, the complex, high-dimensional nature of the problem. In this proposal, we will develop new algorithmic approaches to effectively learn robust visual 3D perception, with new learning paradigms for features, representations, and operators, to encompass 3D semantic understanding.

Régime de financement

HORIZON-ERC - HORIZON ERC Grants

Institution d’accueil

TECHNISCHE UNIVERSITAET MUENCHEN

Contribution nette de l'UE

€ 1 500 000,00

Adresse

Arcisstrasse 21
80333 Muenchen
Allemagne

Région

Bayern Oberbayern München, Kreisfreie Stadt

Type d’activité

Higher or Secondary Education Establishments

Liens

Contacter l’organisation Site web

Participation aux programmes de R&I de l'UE

Réseau de collaboration HORIZON

Coût total

€ 1 500 000,00

Bénéficiaires (1)

TECHNISCHE UNIVERSITAET MUENCHEN

Allemagne

Contribution nette de l'UE

€ 1 500 000,00

Description du projet

Révolutionner le domaine de la perception visuelle 3D

Objectif

Champ scientifique

Mots‑clés

Programme(s)

Thème(s)

Appel à propositions

Régime de financement

Institution d’accueil

Bénéficiaires (1)

Partager cette page

Télécharger