Skip to main content
European Commission logo
español español
CORDIS - Resultados de investigaciones de la UE
CORDIS
CORDIS Web 30th anniversary CORDIS Web 30th anniversary

Learning Pixel-Perfect 3D Vision and Generative Modeling

Periodic Reporting for period 2 - PIPE (Learning Pixel-Perfect 3D Vision and Generative Modeling)

Período documentado: 2022-03-01 hasta 2023-08-31

PIPE seeks to advance computers' capability to reason about the physical world by looking at nothing but pictures. Moreover, PIPE seeks to enable this reasoning to stand on solid, physically meaningful quantities. Concretely, PIPE studies both the problem of understanding what shapes, materials, lights, objects, etc., appear in pictures of real-world scenes, as well as the problem of generative modeling, i.e. learning "a recipe" for the world.

Solutions to these fundamental problems in computer vision, computer graphics, and machine learning are relevant for a large number of applications from computer graphics, augmented and virtual reality to robotics; clearly, truly understanding the messy complexity of the real world must be necessary, e.g. for reliable, safe autonomous machines to emerge.
PIPE seeks to advance the state of the art on three axes: (1) understanding the shapes in the world; (2) understanding the visual complexity of the surface materials in the world; and (3) learning generative computational recipes for things in the world. The most significant achievement since the beginning of the project is a result in the final direction that combines intuitive, controllable computational components with modern machine learning techniques. This enables photorealistic generative modeling in terms of physically meaningful concepts and quantities while requiring no human supervision.
PIPE is expected to provide breakthrough results in how well computer vision and generative algorithms can understand our three-dimensional world.