Malay Speech Data by Mobile Phone - 370 Hours

Full Official Name: Malay Speech Data by Mobile Phone - 370 Hours
Submission date: Oct. 7, 2022, 4:35 p.m.

675 native Malaysian speakers with authentic accents participated in the recording. The recording script was designed by linguists and covers a wide range of topics, including generic, interactive, on-board (in-car) and home scenarios. The text was manually proofread to a high accuracy. The data matches mainstream Android and Apple phones. The data set can be applied to automatic speech recognition and machine translation scenarios.

Format: 16 kHz, 16-bit, uncompressed WAV, mono channel
Recording environment: quiet indoor environment, low background noise, no echo
Recording content (read speech): oral category; human-machine interaction category; smart home command and in-car command category; numbers; news category
Demographics: 675 speakers in total, 44% male and 56% female; 66% of speakers are in the 18-25 age group, 32% in the 26-45 age group and 5% in the 46-60 age group, with a floating rate of 2%
Device: Android mobile phone, iPhone
Language: Malay
Application scenarios: speech recognition; voiceprint recognition
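As an illustration of the stated audio format, the following is a minimal sketch of how a user might check that a file is 16 kHz, 16-bit, mono, uncompressed WAV. It uses only Python's standard `wave` module; the function name `matches_stated_format` and the constants are hypothetical and are not part of the data set or its tooling.

```python
import wave

# Expected parameters, taken from the format stated in the description.
EXPECTED_SAMPLE_RATE = 16000  # Hz
EXPECTED_SAMPLE_WIDTH = 2     # bytes per sample, i.e. 16 bit
EXPECTED_CHANNELS = 1         # mono


def matches_stated_format(path):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono PCM.

    Hypothetical helper for illustration only.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_SAMPLE_RATE
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH
            and wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getcomptype() == "NONE"  # uncompressed PCM
        )
```

The `wave` module reads only uncompressed PCM files, so a compressed file would typically raise `wave.Error` rather than return `False`; a caller that scans a whole directory may want to catch that exception.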

Creator(s)
Distributor(s)
Right Holder(s)