Spoken Digits in Hindi and Indian English

Full Official Name: Spoken Digits in Hindi and Indian English
Submission date: Feb. 15, 2022, 7:12 p.m.

<h3>Introduction</h3> <p>Spoken Digits in Hindi and Indian English was developed by the <a href="https://www.bits-pilani.ac.in/goa/">Birla Institute of Technology and Science Pilani</a>. It contains approximately two hours of speech consisting of spoken digits from one to ten in Hindi and English, with regional accents from across India.</p> <h3>Data</h3> <p>The speech data was collected in three ways: in person, using a recorder app on a mobile handset; through one-to-one online communication over social apps; and from social media sites. Each audio file represents a single spoken digit in either Hindi or Indian English. Background noise was mostly retained, although some data was recorded in a noise-free environment or cleaned after recording to avoid abrupt noises such as car horns.</p> <p>The audio data is organized by number, language, and gender. The gender breakdown for speakers is 17% female, 27% male, and 56% unspecified.</p> <p>A Google Colab notebook that can be used for basic operations, such as removing noise or unwanted spaces, is also included in this release.</p> <p>All audio data is presented as single-channel, 16-bit, 16 kHz FLAC-compressed linear PCM.</p> <h3>Samples</h3> <p>Please listen to these samples:</p> <ul> <li><a href="desc/addenda/LDC2022S03-hin-f.flac">Hindi Female (FLAC)</a></li> <li><a href="desc/addenda/LDC2022S03-eng-u.flac">English Unspecified (FLAC)</a></li> <li><a href="desc/addenda/LDC2022S03-eng-m.flac">English Male (FLAC)</a></li> </ul> <h3>Updates</h3> <p>None at this time.</p>
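<p>The stated audio format (single channel, 16-bit, 16 kHz FLAC) can be checked on any file by reading the STREAMINFO block at the start of the FLAC stream. The following is a minimal sketch using only the Python standard library, not part of the release: the function names <code>read_flac_streaminfo</code> and <code>matches_release_format</code> are hypothetical, and the sketch assumes the file begins directly with the <code>fLaC</code> marker (no prepended tags).</p>

```python
def read_flac_streaminfo(data: bytes) -> dict:
    """Parse the mandatory STREAMINFO block from the first bytes of a FLAC stream.

    Hypothetical helper for illustration; expects at least the first 42 bytes.
    """
    if data[:4] != b"fLaC":
        raise ValueError("not a FLAC stream (missing 'fLaC' marker)")
    # Metadata block header: 1-bit last-block flag, 7-bit type, 24-bit length.
    if data[4] & 0x7F != 0:
        raise ValueError("first metadata block is not STREAMINFO")
    body = data[8:42]  # STREAMINFO is always 34 bytes long
    if len(body) < 34:
        raise ValueError("truncated STREAMINFO block")
    # Bytes 10..17 pack four fields into 64 bits:
    # 20-bit sample rate, 3-bit (channels - 1),
    # 5-bit (bits per sample - 1), 36-bit total sample count.
    packed = int.from_bytes(body[10:18], "big")
    sample_rate = packed >> 44
    channels = ((packed >> 41) & 0x7) + 1
    bits_per_sample = ((packed >> 36) & 0x1F) + 1
    total_samples = packed & ((1 << 36) - 1)
    duration = total_samples / sample_rate if sample_rate else None
    return {
        "sample_rate": sample_rate,
        "channels": channels,
        "bits_per_sample": bits_per_sample,
        "total_samples": total_samples,
        "duration_seconds": duration,
    }


def matches_release_format(info: dict) -> bool:
    """True if the stream is single-channel, 16-bit, 16 kHz as described above."""
    return (
        info["sample_rate"] == 16000
        and info["channels"] == 1
        and info["bits_per_sample"] == 16
    )
```

<p>To use it, read the first 42 bytes of a file in binary mode, for example <code>read_flac_streaminfo(open(path, "rb").read(42))</code>, where <code>path</code> points to any FLAC file from the release. Summing <code>duration_seconds</code> over all files would give a rough check against the stated total of approximately two hours.</p>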

