Skip to main content
Go to the home page of the European Commission (opens in new window)
English English
CORDIS - EU research results
CORDIS

Scalable Knowledge-Aware Image Caption Generation

Project description

Cutting-edge tool to sharpen up image captioning

Image captioning systems are limited because they depend heavily on visual content. This explains why generated captions are usually only descriptive and overlook key information required to understand the image. The EU-funded ROCAP project aims to introduce a captioning tool to benefit fields such as geography, radiology and art history, where captions must include information that cannot be obtained solely from images. ROCAP will study the feasibility of applying the captioning method developed in a previous project to captioning in medical imaging and art history. By engaging with experts in these domains, the project will specify a practical captioning system, apply it as an open-source tool and test it in real-life conditions.

Objective

Image captioning is the process of mapping a visual scene to a short textual description. Automating this process is vital for many computer applications, including information retrieval from visual data, computerized assistance to visually impaired people, and automatic tour guiding. State-of-the-art captioning systems are limited by their heavy reliance on visual contents. As a result, generated captions are often purely descriptive and miss important information that is needed in order to understand the image. This PoC project develops a captioning tool that will be useful for knowledge-intensive areas like Geography, Radiology or Art History, where captions need to include information that cannot be extracted from images alone. It builds on results of the ROCKY ERC AdG project, whose innovative captioning system integrates external knowledge into the captioning process. This allowed the ROCKY project to employ standard methods of image captioning, with a deep convolutional neural network (CNN) for image understanding and a Transformer network for language generation. Thanks to the external knowledge integration, the ROCKY captioning prototype gets substantially closer to human-generated captions than standard captioning systems that do not take external knowledge into account. This PoC project will use this result by implementing a knowledge-aware captioning system that is scalable for practical purposes. The project examines the feasibility of the ROCKY captioning method for Medical Imaging and Art History and implements it for one of these domain as a use case. The project will engage with experts in these domains, specify a practical captioning system, implement it as an open-source tool and test it in realistic situations. The anticipated value of this effort is in the development of a general method that would allow one open-source platform to be multi-purpose, thereby cost-effectively adjustable to needs of different domains.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

You need to log in or register to use this function

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

HORIZON-ERC-POC - HORIZON ERC Proof of Concept Grants

See all projects funded under this funding scheme

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

(opens in new window) ERC-2022-POC2

See all projects funded under this call

Host institution

UNIVERSITEIT UTRECHT
Net EU contribution

Net EU financial contribution. The sum of money that the participant receives, deducted by the EU contribution to its linked third party. It considers the distribution of the EU financial contribution between direct beneficiaries of the project and other types of participants, like third-party participants.

€ 150 000,00
Address
HEIDELBERGLAAN 8
3584 CS Utrecht
Netherlands

See on map

Activity type
Higher or Secondary Education Establishments
Links
Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

No data

Beneficiaries (1)

My booklet 0 0