English Punjabi Parallel Corpus

Full Official Name: English Punjabi Parallel Corpus
Submission date: April 15, 2021, 9:39 a.m.

The English-Punjabi Parallel corpus consists of 14 lakh parallel sentences each of English and Punjabi languages. These sentences are extracted from Comparable corpus of varying degree collected from Wikipedia and other authentic language resources. The comparable corpus obtained from Wikipedia dump is used to extract parallel sentences by using similarity measures through an integrated approach.

Creator(s)
Distributor(s)
Right Holder(s)