Resource: MDT Mandarin Chinese Conversational Recognition Corpus – Complete set

Reference MDT Mandarin Chinese Conversational Recognition Corpus – Complete set
Date of Submission May 20, 2020, 12:48 p.m.
Status accepted
ISLRN 559-956-475-937-1
Resource Type Primary Text
Media Type Audio
Source
Language Chinese
Format/MIME Type audio/wav
Description

This dataset consists of 4.98 hours of transcribed conversational speech in Mandarin Chinese, comprising 30 conversations among 32 speakers (16 male and 16 female). The audio is sampled at 16 kHz with 16-bit quantization.
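
As an illustration of the stated audio parameters, the sketch below shows one way to confirm that a WAV file is sampled at 16 kHz with 16-bit samples, using Python's standard `wave` module. The function names `describe_wav` and `matches_corpus_format` are hypothetical, and the sketch assumes the files are uncompressed PCM WAV, which this record does not confirm beyond the audio/wav MIME type.

```python
import wave


def describe_wav(path):
    """Return (sample_rate_hz, bits_per_sample, n_channels, duration_s).

    Assumes an uncompressed PCM WAV file; the wave module raises
    wave.Error for formats it cannot read.
    """
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        bits = wav.getsampwidth() * 8  # sample width is reported in bytes
        channels = wav.getnchannels()
        duration = wav.getnframes() / rate
    return rate, bits, channels, duration


def matches_corpus_format(path, expected_rate=16000, expected_bits=16):
    """Check a file against the 16 kHz / 16-bit format stated in the description."""
    rate, bits, _, _ = describe_wav(path)
    return rate == expected_rate and bits == expected_bits
```

Calling `matches_corpus_format` on each file of a delivery would flag any file that deviates from the documented format before further processing.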

For each conversation, there are two close-talking channels, one per speaker, recorded with microphones, as well as three far-field channels recorded with an iPhone, an Android phone, and a recorder, respectively.
This corpus may be obtained as a complete set or by selecting specific channels (the two close-talking channels count as a single channel):
- MDT Mandarin Chinese Conversational Recognition Corpus - complete set (ELRA-S0409-01)
- MDT Mandarin Chinese Conversational Recognition Corpus - 1 channel (ELRA-S0409-02)
- MDT Mandarin Chinese Conversational Recognition Corpus - 2 channels (ELRA-S0409-03)
- MDT Mandarin Chinese Conversational Recognition Corpus - 3 channels (ELRA-S0409-04)

Version 1.0