ISLRN

Mixed Speech with Chinese and English Data by Mobile Phone - 1,535 Hours

Full Official Name: Mixed Speech with Chinese and English Data by Mobile Phone - 1,535 Hours

Submission date: Oct. 7, 2022, 4:43 p.m.

The data was recorded by 3,972 native Chinese speakers, with accents covering seven major dialect areas. The recorded text is a mixture of Chinese and English sentences covering general scenes and human-computer interaction scenes. The content is varied and the transcriptions are accurate. The data can be used to improve speech recognition performance on read speech that mixes Chinese and English.

Format: 16 kHz, 16-bit, uncompressed WAV, mono channel

Recording environment: quiet indoor environment, without echo

Recording content (read speech): general category; human-machine interaction category

Demographics: 3,972 speakers in total, 43% male and 57% female; 68% of speakers are aged 12-25, 31% are aged 26-45, and 1% are aged 46-60

Device: Android mobile phone, iPhone

Language: Mandarin; English

Application scenarios: speech recognition; voiceprint recognition
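The audio format stated in the description (16 kHz, 16-bit, uncompressed WAV, mono) can be checked file by file before the data is used. The following is a minimal sketch using only Python's standard `wave` module. The function names `matches_spec` and `duration_seconds` are hypothetical helpers introduced for illustration, not part of the dataset or of any ELRA tooling, and the layout of the files on disk is not assumed.

```python
import wave

# Expected values taken from the format stated in this record.
EXPECTED_RATE_HZ = 16000          # 16 kHz
EXPECTED_SAMPLE_WIDTH_BYTES = 2   # 16-bit samples
EXPECTED_CHANNELS = 1             # mono


def matches_spec(path):
    """Return True if the WAV file is 16 kHz, 16-bit, mono, uncompressed PCM.

    Hypothetical helper: the name and return convention are assumptions.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getcomptype() == "NONE"  # "NONE" means uncompressed PCM
        )


def duration_seconds(path):
    """Return the duration of a WAV file in seconds (frames / sample rate)."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()
```

Summing `duration_seconds` over all files and dividing by 3600 would give a total that can be compared against the 1,535 hours stated in the title.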

Creator(s)

Distributor(s)

ELRA

Right Holder(s)

Status: Accepted

ISLRN:

451-966-049-653-3

Version

1.0

Source

http://catalog.elra.info/en-us/repository/browse/ELRA-S0457

Resource Type

Primary Text

Media Type

Audio

Language(s)

Chinese

English

Access Medium