Resource: ONOMASTICA-COPERNICUS DATABASE
|Date of Submission||Jan. 24, 2014, 4:30 p.m.|
|Media Type||Text, Audio|
|Language||Czech, Estonian, Latvian, Polish, Slovak, Slovenian, Ukrainian|
|Format/MIME Type||Plain text|
The ONOMASTICA project was a European-wide research initiative within the scope of the Linguistic Research and Engineering Programme, the aim of which was the construction of a multi-language pronunciation lexicon of proper names. That project covered eleven European languages: Danish, Dutch, English, French, German, Greek, Italian, Norwegian, Portuguese, Spanish and Swedish.
Although the ONOMASTICA project ended in June 1995, the work continued with the introduction of new partners, addressing names in Eastern and Central European languages: Czech, Estonian, Latvian, Polish, Romanian, Slovakian, Slovenian and Ukrainian, in a new project funded by the European Commission?s Copernicus Programme.
The corpus consists of a collection of 1,783,390 transcriptions of 1,705,653 names, broken down as follows:
The databases are presented in Microsoft Access format and in ASCII text format, together with a database browser software prepared by Keith Edwards of the Centre for Communication Interface Research, The University of Edinburgh.