Resource: TRAD Arabic-French Parallel Text -- Newsgroup

Reference TRAD Arabic-French Parallel Text -- Newsgroup
Date of Submission April 16, 2018, 6:02 p.m.
Status accepted
ISLRN 582-339-053-329-9
Resource Type Primary Text
Media Type Text
Language Arabic, French, Standard Arabic
Format/MIME Type application/xml
Size 1136 KB
Access Medium Web Download


TRAD Arabic-French Parallel Text -- Newsgroup was developed by ELDA as part of the PEA-TRAD project. It contains French translations of a subset of approximately 10,000 Arabic words from GALE Phase 1 Arabic Newsgroup Parallel Text - Part 1 (LDC2009T03).

The PEA-TRAD project (Translation as a Support for Document Analysis) was supported by the French Ministry of Defense (DGA). Its purpose was to develop speech-to-speech translation technology for multiple languages (e.g., Arabic, Chinese, Pashto) across a variety of domains. ELDA developed several corpora for this effort.

LDC has also released TRAD Chinese-French Parallel Text -- Blog (LDC2018T02).


This release consists of 398 segments (translation units) from 17 documents. The source data is Arabic newsgroup text collected and translated into English by the Linguistic Data Consortium for the DARPA GALE (Global Autonomous Language Exploitation) program. Information about the ELDA translation team, translation guidelines and validation results is contained in the documentation accompanying this release.

The Arabic source file contains 10,706 words and the French reference translation contains 15,843 words. The data is presented in two Unicode-encoded XML files along with an associated DTD.
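
The word counts above can be put in perspective with a short Python sketch. It makes no assumption about the element names defined by the release's DTD: it counts whitespace-delimited tokens across all text nodes of an XML document. The sample markup and the `count_words` helper are hypothetical, and the tokenization used to produce the published counts may differ, so results on the real files need not match exactly.

```python
import xml.etree.ElementTree as ET


def count_words(xml_text: str) -> int:
    """Count whitespace-delimited tokens in every text node of an XML string."""
    root = ET.fromstring(xml_text)
    # itertext() yields the text and tail of each element in document order.
    return sum(len(chunk.split()) for chunk in root.itertext())


# Hypothetical miniature document; the tag and attribute names are invented
# for illustration and are not taken from the release's DTD.
sample = (
    "<doc>"
    "<seg id='1'>Bonjour tout le monde</seg>"
    "<seg id='2'>Merci beaucoup</seg>"
    "</doc>"
)
sample_count = count_words(sample)

# French-to-Arabic length ratio computed from the counts quoted above.
ratio = 15843 / 10706
print(f"sample words: {sample_count}, French/Arabic ratio: {ratio:.2f}")
```

For a file on disk, `ET.parse(path).getroot()` can stand in for `ET.fromstring`. The ratio of roughly 1.48 French words per Arabic word follows directly from the two published totals.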

Version 1.0
Creator Linguistic Data Consortium, ELDA
Distributor Linguistic Data Consortium
Rights Holder Portions © 2018 ELDA, © 2005-2007, 2009, 2018 Trustees of the University of Pennsylvania