Spanish TTS Speech Corpus (Appen)

Full Official Name: Spanish TTS Speech Corpus (Appen)
Submission date: Jan. 24, 2014, 4:31 p.m.

The Spanish TTS Speech Corpus contains the recordings of 1 native Spanish speaker (male, 28 years old) recorded in a studio over 1 channel (Shure SM15 unidirectional professional head-word condenser microphone). The data collection and transcription were performed by Appen (Australia). Speech samples are stored as sequences of 16-bit 22.05 kHz PCM in uncompressed WAV files. The speaker read 1,787 prompted sentences covering all legal triphones and diphones. The database is provided with orthographic transcriptions in SAMPA, including canonical and alternative pronunciation, and syllable, stress and acoustic events markings. All transcriptions were segmented at the utterance (sentence/command word) level, annotated at the word level and checked manually. A pronunciation lexicon including 3,748 headwords (plus variants) is also available. This database is aimed to be used within text-to-speech and speech synthesis applications.

Right Holder(s)