Karl May Korpus (KMK)

Full Official Name: Karl May Korpus (KMK)
Submission date: Jan. 24, 2014, 4:30 p.m.

The "Karl-May-Korpus" is a monolingual German corpus, available in an SGML-tagged ASCII text format. It contains the works of the German author Karl May (1842-1912) and consists of around 1.6 million words (divided into 9 subcorpora of about 180,000 words each). The corpus was created between 1993 and 1997. Each word form is tagged with a word class (1 out of 43 classes) and appropriate lemma. File format: Text Standard in use: SGML Character set: 8-bit ASCII

Creator(s)
Distributor(s)
Right Holder(s)