ISLRN

Corpus of Conversational Persian Transcripts

Full Official Name: Corpus of Conversational Persian Transcripts

Submission date: Aug. 19, 2019, 5:36 p.m.

*Introduction* Corpus of Conversational Persian Transcripts consists of transcripts from approximately 20 hours of naturally occurring informal conversations in the Tehrani dialect of Iranian Persian. The corresponding speech is not included in this release. *Data* This corpus is extracted from 1,201 minutes of conversations among 22 participants, 12 male and 10 female. The participants recorded their daily phone calls and face-to-face interactions in a variety of informal settings. The conversations represent various interaction types, settings, types of relationship, and communicative goals. The transcripts were annotated for gender, age, and recording method and setting. See the included documentation for more information about the annotations and transcription methodology. Each conversation is presented as a UTF-8 encoded XML file.

Creator(s)

Ariana Negar Mohammadi

Distributor(s)

Linguistic Data Consortium

Right Holder(s)

Status : Accepted

ISLRN :

187-041-892-174-7

Version

1.0

Source

https://catalog.ldc.upenn.edu/LDC2019T11

Resource Type

Primary Text

Media Type

Text

Language(s)

Persian

Access Medium

Web Download