Unlocking topicality in text - foreground and background information in written language | LLAVES | Project | Fact Sheet | FP5 | CORDIS

Project Information

LLAVES

Grant agreement ID: IST-1999-12219

Project website

Project closed

Start date 1 January 2000

End date 31 December 2000

Funded under

Programme for research, technological development and demonstration on a "User-friendly information society, 1998-2002"

Total cost

€ 142 000,00

EU contribution

€ 71 000,00

71 000,00

Coordinated by

SWEDISH INSTITUTE OF COMPUTER SCIENCE
Sweden

Objective

This language technology project aims to bridge the gap from clausal syntax to text, and show how the syntactic mechanisms of the language indicate topical themes in text. The project will investigate a large number of texts using both human assessments of foreground and background statements and state-of-the art syntactic analysis tools to chart known and newly found systematic differences between how foreground and background themes are presented.
This language technology project aims to bridge the gap from clausal syntax to text, and show how the syntactic mechanisms of the language indicate topical themes in text. The project will investigate a large number of texts using both human assessments of foreground and background statements and state-of-the art syntactic analysis tools to chart known and newly found systematic differences between how foreground and background themes are presented.

OBJECTIVES
A bottleneck for improving today's information management systems is that we know little of texts as text. Systems view texts as simple sets of words or terms, discarding information such as clause style and argument structure as noise. This project aims to bridge the gap from syntax to text, and show how syntactic mechanisms of language, which primarily concern clause-internal structure, carry text-level information as well. Once we are able to chart some features of the topical progression in a text we will give a road map for algorithms for further processing: indexing and search, summarisation, report generation, and optical text recognition are all application areas which would benefit from better knowledge of what makes texts.

DESCRIPTION OF WORK
We will take a large number of texts in several languages and partition the clauses in them into a number of graded categories according to foregroundedness. These clause categories can then be used in different ways for indexing, multi-document summarization, and text item similarity calculation. This first assessment project takes the form of an experiment on text. If the experiment is successful, it opens up an entire research field, which we will continue examining in a future project.
1. Assemble corpus. If possible we will use the multilingual TREC corpus.
2. Define prototypical clause types based on our theory of foregroundedness.
3. Use human test subjects to partition clauses according to prototypical type.
4. Find and explain formal differences between types of clause as shown by test subjects, based on theory of transitivity.
5. Build tools to automatically identify clause types.
6. Index large number of texts using tools, and run test sets of information retrieval queries.
7. Result dissemination.
8. Plan for continued and refined experimentation.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

This project has not yet been classified with EuroSciVoc.
Be the first one to suggest relevant scientific fields and help us improve our classification service

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

FP5-IST - Programme for research, technological development and demonstration on a "User-friendly information society, 1998-2002"

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

1.1.2.-6.1.1 - FET O: Open domain

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Data not available

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

CSC - Cost-sharing contracts

Coordinator

SWEDISH INSTITUTE OF COMPUTER SCIENCE

EU contribution

No data

Address

ISAFJORDSGATAN 22
164 29 KISTA
Sweden

Total cost

No data

Participants (1)

CONEXOR OY

Finland

EU contribution

No data

Address

PORRASSALMENKATU 19 A 15
50100 MIKKELI

Total cost

No data

Unlocking topicality in text - foreground and background information in written language

Objective

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

Coordinator

Participants (1)

Share this page Share this page on social networks

Download Download the content of the page

Unlocking topicality in text - foreground and background information in written language

Objective

Fields of science (EuroSciVoc) CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s) Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s) Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Funding Scheme Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

Coordinator

Participants (1)

Share this page Share this page on social networks

Download Download the content of the page

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.