Corpus of Conversational Persian Transcripts

Full Official Name: Corpus of Conversational Persian Transcripts
Submission date: Aug. 19, 2019, 5:36 p.m.

*Introduction* The Corpus of Conversational Persian Transcripts consists of transcripts of approximately 20 hours of naturally occurring informal conversations in the Tehrani dialect of Iranian Persian. The corresponding speech is not included in this release.

*Data* The corpus is drawn from 1,201 minutes of conversations among 22 participants, 12 male and 10 female. The participants recorded their daily phone calls and face-to-face interactions in a variety of informal settings. The conversations cover a range of interaction types, settings, relationship types, and communicative goals. The transcripts are annotated for gender, age, recording method, and setting. See the included documentation for more information about the annotations and transcription methodology. Each conversation is presented as a UTF-8 encoded XML file.
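
Because each conversation is a UTF-8 encoded XML file, a first look at the release can be done with a standard XML parser. The Python sketch below is only an illustration: this record does not describe the XML schema, so the code assumes no element or attribute names and simply counts whichever tags appear. The function names and the directory path in the usage note are hypothetical; the included documentation remains the authority on the actual structure.

```python
import xml.etree.ElementTree as ET
from collections import Counter
from pathlib import Path


def tag_counts(xml_bytes: bytes) -> Counter:
    """Count element tag names in one XML document.

    The input is bytes so the parser can honor the file's own
    encoding declaration (UTF-8 for this corpus). No schema is assumed.
    """
    root = ET.fromstring(xml_bytes)
    return Counter(elem.tag for elem in root.iter())


def survey_corpus(corpus_dir: str) -> Counter:
    """Sum tag counts over every .xml file under corpus_dir (hypothetical layout)."""
    total = Counter()
    for path in sorted(Path(corpus_dir).glob("**/*.xml")):
        total += tag_counts(path.read_bytes())
    return total
```

Calling survey_corpus("path/to/corpus") on an unpacked copy of the release (the path is a placeholder) returns a tally of element names, which can then be checked against the annotation scheme described in the documentation before writing any schema-specific extraction code.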

Creator(s)
Distributor(s)
Right Holder(s)