Japanese Phonetic (Romaji) Error Dataset

Full Official Name: Japanese Phonetic (Romaji) Error Dataset
Submission date: March 13, 2026, 7:59 p.m.

A corpus of Japanese sentences with programmatically introduced phonetic Romaji errors (Missing, Addition, NearbyKey Typo, TwoKey Misorder), designed for evaluating Input Method Editor (IME) systems. Derived from the Snow Simplified Japanese Corpus.

Creator(s)
Distributor(s)
Right Holder(s)