Crowdsourced high-quality UK and Ireland English dialect speech data set by Google.

Full Official Name: Crowdsourced high-quality UK and Ireland English dialect speech data set by Google.
Submission date: Nov. 14, 2019, 12:17 p.m.

This data set contains transcribed high-quality audio of English sentences recorded by volunteers speaking different dialects of the language. The data set consists of wave files, and a TSV file (line_index.tsv). The file line_index.csv contains a line id, an anonymized FileID and the transcription of audio in the file. The recordings from the Welsh English speakers were collected in collaboration with Cardiff University. The data set contains the following number of lines: Irish English male: 450 Midlands English female: 246 Midlands English male: 450 Northern English female: 750 Northern English male: 2097 Scottish English female: 894 Scottish English male: 1649 Southern English female: 4161 Southern English male: 4331 Welsh English female: 1199 Welsh English male: 1650 The data set has been manually quality checked, but there might still be errors. Please report any issues in the following issue tracker on GitHub. https://github.com/googlei18n/language-resources/issues See LICENSE file for license information. Copyright 2018, 2019 Google, Inc.

Creator(s)
Distributor(s)
Right Holder(s)