Resource: TRAD Chinese-English Email Parallel corpus - Test Set

Reference TRAD Chinese-English Email Parallel corpus - Test Set
Date of Submission Nov. 21, 2016, 5:09 p.m.
Status accepted
ISLRN 985-956-234-357-3
Resource Type Primary Text
Media Type Text
Source
Language Chinese, English
Description

This is a parallel corpus of 15,000 characters in Chinese (equivalent to 10,000 words) and 2 reference translations in English. The source texts are a selection of emails from the Speechocean King-NLP-001 corpus, a corpus of private emails collected from the daily life and business domains. The translation has been conducted by two different translation teams following a strict protocol aimed at producing high quality translations.

The content has also been translated into French (see ELRA-W0116).

This corpus was produced by ELDA within the PEA TRAD project supported by the French Ministry of Defence (DGA). It was used as a test set for an internal MT evaluation campaign.

Version 1.0
Distributor ELRA