Catalog Reference : ELRA-S0388
GlobalPhone Bulgarian Pronunciation Dictionary 260k entries (extended version)
This extended version of the Bulgarian Pronunciation Dictionary, called Bulgarian-Dict260k, contains pronunciations of more than 260,000 word forms. The dictionary matches the original GlobalPhone Bulgarian Pronunciation Dictionary of 20,000 word forms (see ELRA-S0351) in phone set and format. Bulgarian-Dict260k was built from an extension of the Bulgarian GlobalPhone text database, with the aim of improving language modeling and reducing the high Out-Of-Vocabulary rate that results from the rich morphology of the Bulgarian language. For this purpose, roughly 9 million word tokens were collected from internet sources of national, international, and economic news available from the online newspapers "Banker", "Kesh", and "Sega". After text cleaning and normalization, all word forms were extracted. Pronunciations were created automatically using hand-crafted grapheme-to-phoneme rules. The generated pronunciations were then manually cross-checked by native speakers to correct potential errors of the automatic generation.
Distribution medium :
Member Prices
Academic - Commercial 9000.00 EUR
Academic - Research 1800.00 EUR
Commercial - Commercial 9000.00 EUR
Commercial - Research 9000.00 EUR
Non Member Prices
Academic - Commercial 10800.00 EUR
Academic - Research 2100.00 EUR
Commercial - Commercial 10800.00 EUR
Commercial - Research 10800.00 EUR
Copyright © 2008