Bitext Lexical Dataset - Language Variants - English

Full Official Name: Bitext Lexical Dataset - Language Variants - English
Submission date: July 17, 2023, 5:25 p.m.

As a complement to the generic vocabulary provided in ELRA-L0140, language variants of English are provided with the following features: Tense, Person, Number, Gender, Degree, Contraction. Variants are distributed as follows: - English US: 63,000 lemmas / 188,000 forms - English UK: 63,000 lemmas / 190,000 forms - English India: 65,000 lemmas / 193,000 forms

Creator(s)
Distributor(s)
Right Holder(s)