Skip to main content
Vai all'homepage della Commissione europea (si apre in una nuova finestra)
italiano italiano
CORDIS - Risultati della ricerca dell’UE
CORDIS
Contenuto archiviato il 2023-11-13

Platform for Online Sharing of Training Data and Building User Tailored MT

Descrizione del progetto


Multilingual Web: Machine translation for the multilingual web
Exploiting the potential of SMT technologies

In recent years, statistical machine translation (SMT) has become the leading paradigm for machine translation. SMT systems are built by analyzing huge volumes of parallel corpus and learning translation models from this data. The quality of SMT systems largely depends on the size of training data. Since the majority of parallel data is in major languages, SMT systems for larger languages are of much better quality compared to systems for smaller languages. This quality gap is further deepened due to the complex linguistic structure of many smaller languages. Languages like Latvian, Lithuanian and Croatian (to name just a few) have complex morphological structure and free word order. To learn this complexity from corpus data, much larger volumes of training data are needed. Current systems are built on the data accessible on the web, but it is just a fraction of all parallel texts. Most of them still reside in the local systems of different corporations, public and private institutions, and desktops of individual users. The cost and the know-how required for building custom MT solutions deter many small-to-medium companies from utilizing the power of MT technologies.To fully exploit the huge potential of existing open SMT technologies we propose to build an innovative online collaborative platform for data sharing and MT building. This platform will support upload of public as well as proprietary MT training data and building of multiple MT systems, public or proprietary, by combining and prioritizing this data. The project will extend the use of existing state-of-the-art SMT methods that will be applied to data supplied by users to increase quality, scope and language coverage of machine translation.LetsMT! services will be focused on two application scenarios – the free online translation of business and financial news and the application in the localisation and translation industry. At the same time, it will be of interest for a variety of users: web users in general, speakers of less-covered languages, academia, etc.For the localisation and translation industry, LetsMT! will provide facilities for training of SMT systems on their data and generating custom SMT solutions to be used by localisation service providers as well as enterprises and organizations with multilingual translation needs. Integration of SMT solutions in professional productivity environments will be provided.For readers of business and financial news, LetsMT! will provide free and instant MT services with emphasis on less-covered languages. Their quality will be ensured by application of a large pool of domain-specific resources and subsequent evaluation cycles.LetsMT! services will be accessible through the Web portal for free translation of texts, through a translation widget provided for inclusion in a web-page, through browser plug-ins for quick access to translation, and through integration in professional translation tools.

Programma(i)

Programmi di finanziamento pluriennali che definiscono le priorità dell’UE in materia di ricerca e innovazione.

Argomento(i)

Gli inviti a presentare proposte sono suddivisi per argomenti. Un argomento definisce un’area o un tema specifico per il quale i candidati possono presentare proposte. La descrizione di un argomento comprende il suo ambito specifico e l’impatto previsto del progetto finanziato.

Invito a presentare proposte

Procedura per invitare i candidati a presentare proposte di progetti, con l’obiettivo di ricevere finanziamenti dall’UE.

CIP-ICT-PSP-2009-3
Vedi altri progetti per questo bando

Meccanismo di finanziamento

Meccanismo di finanziamento (o «Tipo di azione») all’interno di un programma con caratteristiche comuni. Specifica: l’ambito di ciò che viene finanziato; il tasso di rimborso; i criteri di valutazione specifici per qualificarsi per il finanziamento; l’uso di forme semplificate di costi come gli importi forfettari.

PB - Pilot Type B

Coordinatore

TILDE SIA
Contributo UE
€ 409 447,00
Indirizzo
VIENIBAS GATVE 75 A
LV-1004 RIGA
Lettonia

Mostra sulla mappa

Tipo di attività
Private for-profit entities (excluding Higher or Secondary Education Establishments)
Collegamenti
Costo totale

I costi totali sostenuti dall’organizzazione per partecipare al progetto, compresi i costi diretti e indiretti. Questo importo è un sottoinsieme del bilancio complessivo del progetto.

Nessun dato

Partecipanti (6)

Il mio fascicolo 0 0