Catalog Reference : ELRA-L0087
This is a Persian (Farsi) lexicon of more than 40,000 entries, each a non-inflected word form. Each word is transliterated following the framework proposed by MBROLA (a text-to-speech synthesizer). The database provides a wide range of descriptors for each entry (plural, homograph, ...).
The lexicon was built from a corpus of articles collected over a period of six months from the Shargh newspaper, a publication covering diverse topics: art, culture, politics, society, sport, etc. Because of this coverage, the lexicon is of particular interest for Persian TTS systems: the pronunciation of a Persian word cannot be derived directly from its written form, because short vowels are omitted in the Persian writing system.
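To illustrate why a TTS front end needs a lexicon lookup rather than a spelling-to-sound rule, here is a minimal sketch in Python. The sample entries are illustrative only: they are not taken from this lexicon, the romanizations are informal rather than the MBROLA-based transliteration, and the function names `build_index` and `lookup` are hypothetical. The sketch shows one written form mapping to several candidate pronunciations, which is the situation the homograph descriptor is meant to flag.

```python
# Illustrative data only: these entries are NOT taken from the ELRA-L0087
# lexicon, and the romanizations are informal, not the MBROLA-based scheme.
from collections import defaultdict

# Each tuple: (written form, informal romanization, English gloss)
SAMPLE_ENTRIES = [
    ("کرم", "kerm", "worm"),
    ("کرم", "karam", "generosity"),
    ("کرم", "kerem", "cream"),
    ("کتاب", "ketab", "book"),
]


def build_index(entries):
    """Group candidate pronunciations by written form."""
    index = defaultdict(list)
    for written, pron, gloss in entries:
        index[written].append((pron, gloss))
    return dict(index)


def lookup(index, written):
    """Return every candidate pronunciation for a written form (may be empty)."""
    return index.get(written, [])


index = build_index(SAMPLE_ENTRIES)
for pron, gloss in lookup(index, "کرم"):
    print(pron, gloss)
```

A written form with more than one candidate cannot be resolved from spelling alone, so a real system would need context or a part-of-speech tag to choose among them.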
The number of records is distributed as follows:
Verb-Past Stem: 455
Verb-Present Stem: 435
The lexicon is provided as an MS Access database.
Distribution medium :
Number of languages
Number of tokens :
more than 40,000 entries
Member Prices
Academic - Commercial 5000.00 EUR
Academic - Research 500.00 EUR
Commercial - Commercial 5000.00 EUR
Commercial - Research 5000.00 EUR
Non Member Prices
Academic - Commercial 7000.00 EUR
Academic - Research 700.00 EUR
Commercial - Commercial 7000.00 EUR
Commercial - Research 7000.00 EUR
Copyright © 2008