ISLRN

Bitext Lexical Dataset - Indonesian

Full Official Name: Bitext Lexical Dataset - Indonesian

Submission date: July 17, 2023, 5:26 p.m.

The Bitext Lexical Datasets series provides the following features: Lemmas, POS tagging, Frequency, Named Entities and Offensive. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Indonesian contains 35,000 lemmas (150,000 forms) and these additional features: Voice, Aspect, Number, Degree and Pronominal Clitics.

Creator(s)

Distributor(s)

ELRA

Right Holder(s)

Status: Accepted

ISLRN:

533-126-629-015-2

Version

1.0

Source

http://catalog.elra.info/en-us/repository/browse/ELRA-L0144

Resource Type

Lexicon

Media Type

Text

Language(s)

Indonesian

Access Medium