Skip to main content
European Commission logo print header

Conceptual Modelling of Networking of Centres for High-Quality Research in Slavic Lexicography and Their Digital Resources

Final Report Summary - MONDILEX (Conceptual modelling of networking of centres for high-quality research in Slavic Lexicography and their digital resources)

This report presents a conceptual scheme of networking of centres for high-quality research in Slavic lexicography and their language digital resources. The report, produced by the MONDILEX consortium, summarises the studies and analysis, presented at the five open workshops, organised in the framework of the project. The experts' recommendations are also discussed and summarised. The results will be very useful for a future project proposal for implementation of such research infrastructure for the coming years.

MONDILEX consortium is composed of six members appointed as centres for high-quality research in area of NLP of Slavic languages: the partners are research organisations from European countries, whose national languages belong to the Slavic group - Bulgaria, Poland, Russia, Slovakia, Slovenia and Ukraine:
1. Institute of Mathematics and Informatics, Bulgarian Academy of Sciences (coordinator)
2. Institute of Slavic Studies, Polish Academy of Sciences
3.L. Stur Institute of Linguistics, Slovak Academy of Sciences
4. Jozef Stefan Institute, Ljubljana, Slovenia
5. Institute for Information Transmission Problems, Russian Academy of Sciences
6. Ukrainian Lingua-Information Fund of the National Academy of Sciences of Ukraine

Contact information:
Dr Ludmila Dimitrova, IMI-BAS, Sofia, Bulgaria, coordinator of MONDILEX, ludmila@cc.bas.bg
Project website: http://www.mondilex.org

The main objective of the MONDILEX project was to design a conceptual scheme of a research infrastructure supporting the networking of centres for high-quality research in Slavic lexicography. This infrastructure is necessary in view of the obvious mismatch between the importance of Slavic languages, spoken by a substantial part of Europe's population, and the insufficient number and inadequate quality of digital resources for these languages.

Other main objectives of the MONDILEX were to study problems of the development, management, and reuse of lexical resources in a multilingual context. The increased EU participation of countries, whose national language belongs to the Slavic group, the intensified communication with non-EU Slavic countries, brings up the problem for standardisation of digital multilingual resources to facilitate exchange and serve in education, business, and research.

The project examined strategies for the coordination, unification, integration and extension of existing digital lexical resources and the creation of new ones, in accordance with recent advances in the field and international standards. These resources include those developed along the lines of best practices and recommendations: corpora (multilingual, parallel, comparable, and annotated), dictionaries (mono- and bilingual, online), lexicons, ontologies etc.

The series of five MONDILEX open workshops investigated the following problems. The first workshop analysed the needs of the partners for a common infrastructure supporting scientific and applied activities in digital lexicography. The second workshop studied the state of the art in digital lexical resources and requirements for their integration. The third workshop tackled innovative solutions for lexical entry design in digital Slavic lexicography. The representation of semantics, phraseology, etymology and related matters were discussed in the fourth workshop. The last workshop concentrated on the research infrastructure for Slavic lexicography.

MONDILEX presented a conceptual scheme of a research infrastructure supporting the networking of centres for high-quality research in Slavic lexicography. MONDILEX investigated the features of knowledge grid as a technological platform for implementation of a network of centres for research in Slavic lexicography and their digital linguistic resources according to the specific requirements of its functionalities.