CORDIS - EU research results
Content archived on 2024-06-18

Heterogeneous Learning for Natural Language Processing

Objective

A major challenge in machine learning and artificial intelligence is to reduce the dependency on full direct supervision and to learn from various undirected resources as well. Most successful machine-learning systems require some amount of human supervision. Currently, a dominant paradigm for building a statistical parser, for example, is to first have human annotators manually parse a large number of sentences, and then use the parsed sentences to learn the parameters of the parsing system. A parser built in this way from the Penn Treebank, a large corpus of parsed sentences from the Wall Street Journal, is expected to parse newswire text fragments well, but not e-mails, which are different in nature. Yet one would like to employ all the data available from various resources, genres and types to build either a general system or a system that is adapted to a particular task. The goal of the proposed project is to design new paradigms for large-scale learning of natural language problems in various languages from heterogeneous data sources that vary in size, quality, amount of supervision and type. Our primary objective is to develop theory, design and analyze algorithms, and build systems for processing written and spoken natural language. Furthermore, the World Wide Web and similar available resources contain a huge number of heterogeneous data collections. I propose to make use of this heterogeneous data and, based on the tools I will develop, to build statistical automated systems for various natural language processing tasks, with applications ranging from automatic document classification, through a full range of information extraction tasks, to speech analysis and recognition.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

FP7-PEOPLE-2009-RG

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

MC-IRG - International Re-integration Grants (IRG)

Coordinator

TECHNION - ISRAEL INSTITUTE OF TECHNOLOGY
EU contribution
€ 100 000,00
Address
SENATE BUILDING TECHNION CITY
32000 Haifa
Israel

Activity type
Higher or Secondary Education Establishments
Total cost

The total costs incurred by this organisation to participate in the project, including direct and indirect costs. This amount is a subset of the overall project budget.

No data