Indonesian Speech Data by Mobile Phone - 639 Hours

Full Official Name: Indonesian Speech Data by Mobile Phone - 639 Hours
Submission date: Oct. 7, 2022, 4:36 p.m.

1,285 native Indonesian speakers with authentic accents participated in the recording. The recording script was designed by linguists and covers a wide range of topics, including generic, interactive, in-car, and home scenarios. The text was manually proofread for high accuracy. The data matches mainstream Android and Apple phones. The data set can be applied to automatic speech recognition and machine translation scenarios.

Format: 16 kHz, 16-bit, uncompressed WAV, mono channel
Recording Environment: quiet indoor environment, low background noise, no echo
Recording Content: oral category; human-machine interaction category; smart home command and in-car command category; numbers; news category
Population: 1,285 speakers in total, 47% male and 53% female; 77.3% of speakers are aged 18-25, 22.3% are aged 26-45, and 0.4% are aged 46-60
Device: Android mobile phone, iPhone
Language: Indonesian
Application Scene: speech recognition, voiceprint recognition
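As a rough illustration of the listed audio format (16 kHz, 16-bit, mono, uncompressed WAV), the sketch below checks a file header against those values using Python's standard wave module. The function name check_wav_format and the EXPECTED dictionary are hypothetical names chosen for this example; they are not part of the data set or of any tooling shipped with it.

```python
import wave

# Values taken from the "Format" field of this entry:
# 16 kHz sample rate, 16-bit samples (2 bytes), mono channel.
EXPECTED = {"sample_rate": 16000, "sample_width_bytes": 2, "channels": 1}


def check_wav_format(path):
    """Return a list of mismatches between a WAV header and the listed format.

    Hypothetical helper for illustration only. An empty list means the
    header agrees with the listed format.
    """
    with wave.open(path, "rb") as wav:
        actual = {
            "sample_rate": wav.getframerate(),
            "sample_width_bytes": wav.getsampwidth(),
            "channels": wav.getnchannels(),
        }
    return [
        f"{key}: expected {EXPECTED[key]}, got {actual[key]}"
        for key in EXPECTED
        if actual[key] != EXPECTED[key]
    ]
```

Calling check_wav_format on each file before training or evaluation is one simple way to catch resampled or stereo files early. The check only reads the header and does not assess recording quality or background noise.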

Creator(s)
Distributor(s)
Right Holder(s)