Content archived on 2024-06-18

Visual Recognition

Final Report Summary - VISREC (Visual Recognition)

The goal of this project was to develop the fundamental knowledge needed to design a visual system that can learn, recognize and retrieve, quickly and accurately, thousands of visual categories, including objects, scenes, human actions and activities - in effect, a "visual Google" for images and videos, able to search for the "nouns" (objects, scenes), "verbs" (actions/activities) and "adjectives" (materials, patterns) of visual content.

Progress has been made on a number of fronts, including: (i) learning visual models on-the-fly to retrieve semantic entities in large-scale image and video collections starting from a text query - this has enabled visual retrieval of people (from faces), object categories (such as vehicles and animals) and object instances (such as particular buildings or paintings); (ii) automatic identification of flower species and sculptures; (iii) methods and models for detecting and localizing object categories in images, in particular reducing the level of supervision required to train such models; and (iv) deep learning methods for recognizing object categories, text, and human actions and interactions (such as handshakes) in images and videos.
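
To make the on-the-fly retrieval idea in (i) concrete, the sketch below shows one common recipe under stated assumptions: images returned for the text query (e.g. by a web image search, obtained outside this sketch) serve as positives, a fixed pool of generic images serves as negatives, a pre-trained CNN (a torchvision ResNet-50 is used here purely as an illustrative choice) provides fixed feature descriptors, and a linear SVM trained on the fly ranks the target collection. All function names and parameters are illustrative, not the project's actual code.

import numpy as np
import torch
import torchvision.models as models
import torchvision.transforms as T
from PIL import Image
from sklearn.svm import LinearSVC

# Pre-trained backbone used as a fixed feature extractor (illustrative choice).
backbone = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
backbone.fc = torch.nn.Identity()   # drop the classification head
backbone.eval()

preprocess = T.Compose([
    T.Resize(256), T.CenterCrop(224), T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

def extract_features(image_paths):
    """Return L2-normalised CNN descriptors, one row per image."""
    feats = []
    with torch.no_grad():
        for path in image_paths:
            x = preprocess(Image.open(path).convert("RGB")).unsqueeze(0)
            f = backbone(x).squeeze(0).numpy()
            feats.append(f / (np.linalg.norm(f) + 1e-8))
    return np.vstack(feats)

def retrieve_on_the_fly(query_image_paths, negative_pool_feats, collection_feats):
    """Train a linear classifier for the query on the fly and rank the collection.

    query_image_paths   -- positive images fetched for the text query
    negative_pool_feats -- features of a fixed pool of generic images,
                           reused as negatives for every query
    collection_feats    -- features of the target image/video-frame collection
    """
    pos = extract_features(query_image_paths)
    X = np.vstack([pos, negative_pool_feats])
    y = np.concatenate([np.ones(len(pos)), -np.ones(len(negative_pool_feats))])
    clf = LinearSVC(C=1.0).fit(X, y)
    scores = clf.decision_function(collection_feats)
    return np.argsort(-scores)   # collection indices, best match first

In this recipe the expensive parts (feature extraction for the collection and the negative pool) are done once offline, so only a small linear classifier is trained at query time, which is what makes per-query model learning fast enough for interactive retrieval.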

The outcomes of this research will impact any application where visual recognition is useful, and will enable entirely new applications: effortlessly searching and annotating home image and video collections based on their visual content; searching and annotating large commercial image and video archives (e.g. YouTube); and extending the class of images that can be used to search the web (in the manner of Google Goggles) by identifying their visual content.