CORDIS - EU research results
Content archived on 2023-11-13

Generalized Analysis of Logs for Automatic Translation and Episodic Analysis of Searches

Project description


Multilingual Web: Machine translation for the multilingual web
GALATEAS offers content providers an innovative approach to understanding users' behaviour by analysing search engine transaction logs and facilitates the development of multilingual content access.

With the growth of digital libraries and digital library federations (as well as partially unstructured collections of documents such as web sites), many vendors now offer engines that retrieve content and metadata in response to end-user search requests (queries). In most cases these queries are just unstructured fragments of text in a specific language.
The first service offered by GALATEAS (LangLog) focuses on extracting meaning from these lists of queries and is addressed to library, federation, and site managers. Contrary to mainstream services in this field, GALATEAS services will not consider the standard structured information of web logs (e.g. click rate, visited pages, users' paths inside the document tree) but the information contained in queries from the point of view of language interpretation. By subscribing to LangLog, federation administrators and managers will be able to answer questions such as: "Which topics are most commonly searched in my collection, in a given language?"; "How do these topics relate to my catalogue?"; "Which named entities (people, places) are most popular among my users?"
The second problem addressed by GALATEAS is Cross-Language Information Retrieval (CLIR), i.e. the capability of typing a query in one language and retrieving documents available in different languages. The CACAO consortium is already successfully providing services for indexing and searching over digital libraries and metadata repositories. During commercial exploration for marketing CACAO, it emerged that certain institutions prefer to keep indexing and searching on their own premises (using their own favourite search engine) and would be perfectly satisfied with a plain query translation service.
The second service offered by GALATEAS (QueryTrans) has the ambitious and innovative goal of providing the first web translation service specially tailored to query translation.

Call for proposal

CIP-ICT-PSP-2009-3

Funding scheme

PB - Pilot Type B

Coordinator contact details

Mr. Nikolaos Lagos

Coordinator

XEROX
EU contribution
€ 342 540,00
Address
33 RUE DES VANESSES IMMEUBLE EXELMANS
93420 Villepinte
France


Region
Ile-de-France Ile-de-France Seine-Saint-Denis
Administrative contact
Michel Gastaldo
Total cost
No data

Participants (7)