Italian Speech Data Collected by Mobile Phone - 347 Hours

Full Official Name: Italian Speech Data Collected by Mobile Phone - 347 Hours
Submission date: Oct. 7, 2022, 4:45 p.m.

Italian audio data captured by mobile phone, with a total duration of 347 hours. It was recorded by 800 native Italian speakers, balanced in gender, in a quiet environment; all texts are manually transcribed with high accuracy. This data set can be applied to automatic speech recognition, machine translation, and sound pattern recognition.

Format: 16 kHz, 16-bit, uncompressed WAV, mono channel
Recording environment: quiet indoor environment, without echo
Recording content (read speech): common sentences
Speaker: 800 people from Italy, 53% of whom are female
Device: Android mobile phone, iPhone
Language: Italian
Transcription content: text, time point of speech data, 2 noise symbols, 5 special identifiers
Accuracy rate: 95% (the accuracy rate of noise symbols and other identifiers is not included)
Application scenarios: speech recognition, voiceprint recognition
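A user who receives the audio may want to confirm that each file matches the stated format (16 kHz, 16-bit, uncompressed WAV, mono). The following is a minimal sketch using Python's standard `wave` module. The function name `check_wav_format` and the constants are illustrative assumptions and are not part of the data set or any accompanying tooling.

```python
import wave

# Expected values taken from the Format field of the description above.
EXPECTED_RATE_HZ = 16000          # 16 kHz
EXPECTED_SAMPLE_WIDTH_BYTES = 2   # 16-bit samples
EXPECTED_CHANNELS = 1             # mono


def check_wav_format(path):
    """Return a list of mismatches between a WAV file and the stated format.

    Hypothetical helper for illustration; an empty list means the file
    header agrees with the description.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE_HZ:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(f"sample width is {8 * wav.getsampwidth()} bits")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"channel count is {wav.getnchannels()}")
    return problems
```

Calling `check_wav_format` on a file path returns an empty list when the header agrees with the description, and a list of human-readable mismatches otherwise. The check reads only the WAV header, so it says nothing about recording quality or transcription accuracy.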

Right Holder(s)