The goal of MMT is to deliver a language independent commercial online translation service based on a new open-source machine translation distributed architecture.
MMT does not require any initial training phase. Once fed with training data MMT will be ready to translate. MMT de-facto will merge translation memory and machine translation technology into one single product. Quality of translations will increase as soon as new training data are added.
MMT manages context automatically so that it will not require building domain specific systems. MMT will provide best translation quality for any topic/domain by storing training segments together with context linking information.
MMT enables scalability of data and users so that no more expensive ad-hoc hardware installations are needed. The MMT architecture will support high performance and linear scalability up to thousands of nodes. The same software will work to set-up a personal translation system or to create a web-based service on a cluster of commodity nodes able to handle terabytes of data and millions of users.
MMT will create a data collection infrastructure that accelerates the process of filling the data gap between large IT companies and the MT industry. MMT will leverage the data crawled on the web by Common Crawl, TAUS, Translated’s MyMemory and Matecat data and facilities to set up a processing pipeline that will create unprecedented amounts of clean parallel and monolingual data to develop machine translation systems.
Fields of science
Call for proposal
See other projects for this call