Resource: ECDC-TM

Reference ECDC Translation Memory
Date of Submission Oct. 20, 2014, 11:51 a.m.
Status accepted
ISLRN 476-596-396-497-8
Resource Type Primary Text
Media Type Text
Source
Language Bulgarian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Modern (1453-), Hungarian, Icelandic, Irish, Italian, Latvian, Lithuanian, Maltese, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Castilian, Swedish
Format/MIME Type text/xml
Size 64,000 translation units, 1.35 million words
Access Medium files for download
Description

ECDC-TM is a Translation Memory of the web pages of the European Centre for Disease Prevention and Control (ECDC). It covers 25 languages and all 300 language pairs.

Translation memories are collections of small pieces of text and their manually produced translations. Translation memories are typically used to support human translators, but they can also be used to train statistical machine translation systems. DGT-TM consists of over 2 million units per language. It is distributed in the widely used TMX format.

The major part of the documents talks about health-related topics (anthrax, botulism, cholera, dengue fever, hepatitis, etc.), but some of the web pages also describe the organisation ECDC (e.g. its organisation, job opportunities) and its activities (e.g. epidemic intelligence, surveillance). ECDC-TM consists of up to 2500 translation units per language. It is distributed in the widely used TMX format.

Version 1.0 (October 2012)
Creator European Centre for Disease Prevention and Control (ECDC)
Distributor Ralf Steinberger - European Commission - Joint Research Centre (JRC)
Rights Holder European Centre for Disease Prevention and Control (ECDC)