CORDIS - EU research results

Semantics-Based Machine Translation

Project description

Machine translation based on semantics

Machine translation systems could be a cost-effective alternative to human translators in a variety of situations, but existing systems have serious flaws. Some take semantics only partially into account, while others tend to focus on the fluency of the translated text rather than its adequacy. The EU-funded SEBAMAT project aims to provide state-of-the-art translation that considers word senses rather than words alone. The project also aims to use role labelling to identify the semantic roles of the words in a sentence.

Objective

Most current machine translation systems are either rule-based or corpus-based. They typically take the semantics of a text into account only insofar as it is implicit in the underlying text corpora or dictionaries. This is also true of recent neural machine translation systems, which, compared with standard phrase-based systems, tend to focus even more on fluency than on adequacy. However, it has been pointed out that machine translation quality is unlikely to reach the next level as long as systems do not make better use of semantic knowledge. For example, according to Kevin Knight, future machine translation systems should use information of the type "who is doing what to whom and when", which involves identifying the semantic roles of the items occurring in a sentence. To move forward in this direction, we propose to implement and evaluate three different approaches.

The first approach is based on state-of-the-art machine translation but considers word senses rather than words. That is, a word sense disambiguation system is used to identify the word senses in large parallel text corpora. Then, in analogy to standard word alignment, the word senses are aligned across languages, and the resulting multilingual sense dictionaries are used in conjunction with the word sense disambiguation systems to translate new texts.

The second approach uses role labelling to identify the semantic roles of the words in a sentence. The roles are aligned across languages, and this information is then used to improve the translation process.

The third approach is based on an algorithm that computes the semantic similarity between phrases. It treats the translation task as one of finding semantically similar phrases across languages.
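To make the first approach more concrete, the following is a minimal sketch of sense alignment over a sense-tagged parallel corpus. It is not the project's implementation. It assumes that word sense disambiguation has already been run, so each sentence is a list of sense labels, and it uses a simple Dice co-occurrence score as a stand-in for a proper statistical alignment model. The function names, the sense labels such as `bank#finance`, and the toy English-German corpus are all hypothetical.

```python
from collections import Counter
from itertools import product


def align_senses(parallel_corpus):
    """Build a source-sense -> target-sense dictionary from co-occurrence.

    parallel_corpus: list of (source_senses, target_senses) sentence pairs,
    where each side is a list of sense labels produced by a (hypothetical)
    word sense disambiguation step.
    """
    src_counts, tgt_counts, pair_counts = Counter(), Counter(), Counter()
    for src_senses, tgt_senses in parallel_corpus:
        src_set, tgt_set = set(src_senses), set(tgt_senses)
        src_counts.update(src_set)
        tgt_counts.update(tgt_set)
        # Every source sense co-occurs with every target sense in the pair.
        pair_counts.update(product(src_set, tgt_set))

    best = {}
    for (src, tgt), joint in pair_counts.items():
        # Dice coefficient: an assumed, simple association score.
        dice = 2 * joint / (src_counts[src] + tgt_counts[tgt])
        if src not in best or dice > best[src][1]:
            best[src] = (tgt, dice)
    return {src: tgt for src, (tgt, _) in best.items()}


def translate_senses(senses, sense_dict):
    """Map disambiguated source senses to target senses; keep unknowns as-is."""
    return [sense_dict.get(sense, sense) for sense in senses]


# Toy sense-tagged corpus (invented for illustration only).
corpus = [
    (["bank#finance", "money#1"], ["Bank#1", "Geld#1"]),
    (["bank#river", "steep#1"], ["Ufer#1", "steil#1"]),
    (["bank#finance", "loan#1"], ["Bank#1", "Kredit#1"]),
    (["bank#river", "muddy#1"], ["Ufer#1", "schlammig#1"]),
]

sense_dict = align_senses(corpus)
translated = translate_senses(["bank#river", "steep#1"], sense_dict)
print(translated)
```

Because the two senses of "bank" are separate dictionary entries, each maps to its own target sense, which a word-level dictionary could not guarantee. A real system would also need to handle word order, morphology and senses missing from the dictionary.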

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

MSCA-IF - Marie Skłodowska-Curie Individual Fellowships (IF)

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

H2020-MSCA-IF-2018

Coordinator

ATHINA-EREVNITIKO KENTRO KAINOTOMIAS STIS TECHNOLOGIES TIS PLIROFORIAS, TON EPIKOINONION KAI TIS GNOSIS
Net EU contribution

Net EU financial contribution: the sum of money that the participant receives, minus the EU contribution to its linked third parties. It reflects how the EU financial contribution is distributed between the direct beneficiaries of the project and other types of participants, such as third-party participants.

€ 165 085,44
Address
ARTEMIDOS 6 KAI EPIDAVROU
151 25 MAROUSSI
Greece

Region
Attica (Αττική), North Athens sector (Βόρειος Τομέας Αθηνών)
Activity type
Research Organisations
Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

€ 165 085,44