ISLRN

Italian Speech Data Collected by Mobile Phone - 347 Hours

Full Official Name: Italian Speech Data Collected by Mobile Phone - 347 Hours

Submission date: Oct. 7, 2022, 4:45 p.m.

Italian-language audio data captured by mobile phone, with a total duration of 347 hours. It was recorded by 800 native Italian speakers, balanced by gender; the recording environment is quiet, and all texts are manually transcribed with high accuracy. This data set can be applied to automatic speech recognition, machine translation, and sound pattern recognition.

Format: 16 kHz, 16-bit, uncompressed WAV, mono channel
Recording environment: quiet indoor environment, without echo
Recording content (read speech): common sentences
Speaker: 800 people from Italy, 53% of whom are female
Device: Android mobile phone, iPhone
Language: Italian
Transcription content: text, time point of speech data, 2 noise symbols, 5 special identifiers
Accuracy rate: 95% (the accuracy rate of noise symbols and other identifiers is not included)
Application scenarios: speech recognition, voiceprint recognition
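The stated audio format (16 kHz, 16-bit, mono, uncompressed WAV) can be checked against a file's header before use. The following is a minimal sketch using Python's standard `wave` module. The function name `matches_record_format` and the constant names are hypothetical, chosen for illustration, and nothing here reflects tooling shipped with the data set.

```python
import wave

# Expected values, taken from the format line of this record.
EXPECTED_RATE_HZ = 16000          # 16 kHz
EXPECTED_SAMPLE_WIDTH_BYTES = 2   # 16-bit samples
EXPECTED_CHANNELS = 1             # mono


def matches_record_format(path):
    """Return True if the WAV header at `path` matches the stated format.

    Hypothetical helper: it inspects only the header, not the audio content.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getcomptype() == "NONE"  # uncompressed PCM
        )
```

Note that `wave.open` raises `wave.Error` for files that are not PCM WAV, so a caller scanning a whole directory may want to catch that exception and treat it as a mismatch.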

Creator(s)

Distributor(s)

ELRA

Right Holder(s)

Status : Accepted

ISLRN :

382-599-484-763-7

Version

1.0

Source

http://catalog.elra.info/en-us/repository/browse/ELRA-S0461

Resource Type

Primary Text

Media Type

Audio

Language(s)

Italian

Access Medium