Romanian - English news corpus (Processed)

Full Official Name: Romanian - English news corpus (Processed)
Submission date: March 9, 2020, 12:27 p.m.

This dataset was created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: Bilingual Romanian – English news corpus built from the SouthEast European Times (2008 dump). The texts are positionally aligned, i.e. the sentence on line i of the English text is aligned with the sentence on line i of the Romanian text. The alignment was manually validated.
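
The positional alignment described above can be consumed by reading the two texts in lockstep. The following is a minimal sketch only: it assumes the corpus is distributed as two plain-text, UTF-8 files with one sentence per line, which this record does not state. The function names `pair_lines` and `read_aligned_pairs` and the demo sentences are hypothetical and are not taken from the corpus.

```python
from itertools import zip_longest
from pathlib import Path


def pair_lines(en_lines, ro_lines):
    """Pair line i of the English text with line i of the Romanian text.

    Raises ValueError if the two inputs differ in length, since positional
    alignment only holds when both sides have the same number of lines.
    """
    pairs = []
    for i, (en, ro) in enumerate(zip_longest(en_lines, ro_lines), start=1):
        if en is None or ro is None:
            raise ValueError(f"line count mismatch at line {i}")
        pairs.append((en.rstrip("\n"), ro.rstrip("\n")))
    return pairs


def read_aligned_pairs(en_path, ro_path, encoding="utf-8"):
    """Read two line-aligned files (hypothetical paths; encoding is an assumption)."""
    with Path(en_path).open(encoding=encoding) as en_f, \
         Path(ro_path).open(encoding=encoding) as ro_f:
        return pair_lines(en_f, ro_f)


if __name__ == "__main__":
    # Placeholder sentences for illustration, not drawn from the corpus.
    demo = pair_lines(["Good morning.\n", "Thank you.\n"],
                      ["Bună dimineața.\n", "Mulțumesc.\n"])
    for en, ro in demo:
        print(f"{en}\t{ro}")
```

Failing on a length mismatch, rather than silently truncating to the shorter side, is a deliberate choice here: a missing line would shift every later pair out of alignment.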

Right Holder(s)