Descriptive lexical specifications and tools for corpus-based lexicon-building

Objective

DELIS is a multidisciplinary project with three broad objectives: to contribute to a methodology of dictionary development based on corpus evidence; to produce parallel dictionary fragments in five languages; and to produce software tools supporting this kind of lexicographic work.

Its methodological goal is to use syntactic phenomena found in corpus evidence to define properties of lexical semantic classes, of individual lexemes belonging to these classes, and of the readings of such items. Its descriptive goal is to produce a set of parallel dictionary fragments for English, French, Italian, Danish and Dutch, covering selected lexical semantic classes. In parallel with this work, software tools will be specified, implemented and integrated in a common user environment, providing computational support for the lexicographic work and the underlying methodology. These will include tools for corpus exploration, for the manual acquisition and management of lexical knowledge, for populating previously defined typed-feature-based models, and for the eventual (SGML-based) export and presentation of that knowledge in dictionary articles.
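The tool chain described above (corpus exploration, population of a typed model, markup export) can be illustrated with a minimal Python sketch. It is not the DELIS software: the type names, the feature name `subcat`, the element names `entry`, `lemma`, `feat` and `eg`, and every function below are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from xml.sax.saxutils import escape

# Hypothetical type hierarchy: each type maps to its parent (None for the root).
TYPE_PARENTS = {
    "lexical-class": None,
    "perception-verb": "lexical-class",
}


@dataclass
class LexicalEntry:
    """One reading of a lexeme, typed by a lexical semantic class (illustrative)."""
    lemma: str
    sem_type: str
    features: dict = field(default_factory=dict)
    evidence: list = field(default_factory=list)


def is_subtype(type_name, ancestor):
    """Walk up the hierarchy and report whether type_name inherits from ancestor."""
    while type_name is not None:
        if type_name == ancestor:
            return True
        type_name = TYPE_PARENTS.get(type_name)
    return False


def concordance(corpus, word):
    """Return the sentences that contain word as a whitespace-separated token."""
    return [s for s in corpus if word in s.lower().split()]


def to_markup(entry):
    """Serialise an entry as SGML-like markup (element names are assumptions)."""
    feats = "".join(
        f'<feat name="{escape(name)}">{escape(value)}</feat>'
        for name, value in sorted(entry.features.items())
    )
    examples = "".join(f"<eg>{escape(s)}</eg>" for s in entry.evidence)
    return (
        f'<entry type="{escape(entry.sem_type)}">'
        f"<lemma>{escape(entry.lemma)}</lemma>{feats}{examples}</entry>"
    )


if __name__ == "__main__":
    corpus = ["She saw the bird .", "They see that it works ."]
    entry = LexicalEntry(
        lemma="see",
        sem_type="perception-verb",
        features={"subcat": "NP_that-clause"},  # recorded by hand from the evidence
        evidence=concordance(corpus, "see"),
    )
    print(to_markup(entry))
```

Keeping the entry as typed data and generating the markup only at export time mirrors the separation between linguistic description and editorial presentation that the project aims for.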

DELIS is a concrete, albeit incomplete, example of the corpus-based design of multifunctional dictionaries as developed and discussed in Eurotra-7. It is based on the assumptions that:

the criteria according to which lexical items are classified must be made as explicit, communicable and thus reproducible as possible by binding them to pieces of observable linguistic phenomena;
a single representation formalism, adequately supported by computational tools, leading to a consistent descriptive specification is required to use corpus evidence as a raw material for the linguistic description of lexical items (TFL, as an emerging standard, will be used for this purpose);
tools designed for the handling of descriptive linguistic specifications need to be generic with respect to the linguistic container (ie, independent of the contents), but they must also accommodate initial user requirements and subsequently be tailored according to the results of live testing.

The project software will be produced with the assistance of professional users from a dictionary publishing house and a translation/documentation company, in the form of requirements definition, feedback on specifications and field tests of early prototypes.

DELIS is an interdisciplinary technology-transfer project, making technologies that have been developed and are now beginning to be used in NLP available for lexicographic work in translation/documentation and publishing. It will also make a significant contribution to the research areas of linguistic (particularly semantic) description and the integration of typed feature systems and user interfaces.

In particular, DELIS will contribute to a methodology of structuring semantic and syntactic information so that it is independent of editorial tools used to manage formal, typographical and other characteristics of lexical information. Ultimately the creation of product-independent lexical databases that can be used for more than just traditional paper dictionaries is envisaged.

The DELIS prototype will be parameterizable and thus adaptable to the systems and databases used by the various project participants.

Fields of science (EuroSciVoc)

CORDIS classifies projects with EuroSciVoc, a multilingual taxonomy of fields of science, through a semi-automatic process based on NLP techniques. See: The European Science Vocabulary.

Programme(s)

Multi-annual funding programmes that define the EU’s priorities for research and innovation.

FP3-LRE - Specific programme of research and technological development (EEC) in the field of telematic systems in areas of general interest - Linguistic research and engineering -, 1990-1994

Topic(s)

Calls for proposals are divided into topics. A topic defines a specific subject or area for which applicants can submit proposals. The description of a topic comprises its specific scope and the expected impact of the funded project.

Data not available

Call for proposal

Procedure for inviting applicants to submit project proposals, with the aim of receiving EU funding.

Data not available

Funding Scheme

Funding scheme (or “Type of Action”) inside a programme with common features. It specifies: the scope of what is funded; the reimbursement rate; specific evaluation criteria to qualify for funding; and the use of simplified forms of costs like lump sums.

Data not available

Coordinator

Universität Stuttgart

EU contribution

No data

Address

Azenbergstraße 12
7000 Stuttgart
Germany

Total cost

No data

Participants (7)

Centre for Sprogteknologi (CST), Copenhagen

Denmark

EU contribution

No data

Address

80 Njalsgade
2300 København S

Total cost

No data

Consiglio Nazionale delle Ricerche (CNR)

Italy

EU contribution

No data

Address

Via della Faggiola 32
56100 Pisa

Total cost

No data

Lingsoft Inc

Finland

EU contribution

No data

Address

Total cost

No data

Linguacubun Ltd

United Kingdom

EU contribution

No data

Address

17 Oakley Road
N1 3LL London

Total cost

No data

Sonovision ITEP Technologies

France

EU contribution

No data

Address

12 rue de Reims
94701 Maisons-Alfort

Total cost

No data

Van Dale Lexicografie BV

Netherlands

EU contribution

No data

Address

21c Mariaplaats PO Box 19232
3511 LK Utrecht

Total cost

No data

Vrije Universiteit Amsterdam

Netherlands

EU contribution

No data

Address

1105 De Boelelaan
1081 HV Amsterdam

Total cost

No data
