Resource: EAC-TM

Reference EAC Translation Memory
Date of Submission Oct. 20, 2014, 12:01 p.m.
Status accepted
ISLRN 589-927-543-547-4
Resource Type Primary Text
Media Type Text
Source
Language Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Modern (1453-), Hungarian, Icelandic, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Castilian, Swedish, Turkish
Format/MIME Type text/xml
Size 32,000 Translation Units, 320,000 words
Access Medium files for download
Description

EAC-TM is a Translation Memory in 25 languages focusing on the subject domain of education, training, culture and youth.

Translation memories are collections of small pieces of text and their manually produced translations. Translation memories are typically used to support human translators, but they can also be used to train statistical machine translation systems. It is distributed in the widely used TMX format.

The parallel corpus was provided by the European Commission’s Directorate General for Education and Culture (EAC) and the data has been processed further by the JRC. The EAC-TM is smaller compared to the other parallel corpora distributed by the JRC, but it has the advantage that it focuses on a very different domain. EAC-TM consists of a total of over 32,000 translation units (320,000 words).

Version 1.0 (January 2013)
Creator European Commission – Directorate General for Education and Culture (EAC)
Distributor Ralf Steinberger - European Commission - Joint Research Centre (JRC)
Rights Holder European Union (EU)