The Bitext Lexical Datasets series covers lemmas, part-of-speech (POS) tags, frequency, named entities and offensive-language marking. Depending on the dataset and language, additional syntactic and morphological features are also provided. The Bitext Lexical Dataset - Indonesian contains 35,000 lemmas (150,000 forms) along with the following additional features: Voice, Aspect, Number, Degree and Pronominal Clitics.