  • Catalogue of Language Resources

    ELRA releases free Language Resources.


    The ELRA Catalogue of Language Resources offers a repository of Language Resources (LRs) made available through ELRA.


    An increasing number of LRs in the various fields of Human Language Technology (see image on the left-hand side) are distributed on behalf of ELRA via its operational body, ELDA, thanks to contributions from various players in the HLT community.

    Our aim in providing this repository is to spare researchers and developers the effort of rebuilding resources that already exist, and to help them identify and access those resources.

    Other resources that have been identified but are not available through ELRA can be viewed in the Universal Catalogue.

    If you have any suggestions or comments, or need further details about ELRA and its Catalogue of Language Resources, please refer to the Contact Us section.

    ELRA is a partner of OLAC (Open Language Archives Community). The catalogue can be viewed as an OLAC repository.

    New Resources
  • ELRA-S0375 : GlobalPhone Swahili
    The GlobalPhone Swahili corpus contains
    7,728 utterances spoken by 70 speakers.
    Native speakers of Swahili were asked to
    read prompted sentences from newspaper
    articles. The entire collection took
    place in Nairobi, Kenya.

  • ELRA-S0376 : GlobalPhone Swahili Pronunciation Dictionary
    The GlobalPhone pronunciation
    dictionaries contain the pronunciations
    of all word forms found in the
    transcription data of the GlobalPhone
    speech & text database. The Swahili
    dictionary contains 10,664 entries.

  • ELRA-S0377 : GlobalPhone Ukrainian
    The GlobalPhone Ukrainian corpus
    contains 12,814 utterances spoken by 119
    speakers. Native speakers of Ukrainian
    were asked to read prompted sentences from
    newspaper articles. The entire
    collection took place in Donetsk,
    Ukraine.

  • ELRA-S0378 : GlobalPhone Ukrainian Pronunciation Dictionary
    The GlobalPhone pronunciation
    dictionaries contain the pronunciations
    of all word forms found in the
    transcription data of the GlobalPhone
    speech & text database. The Ukrainian
    dictionary contains 7,748 entries (7,740
    words).

  • ELRA-L0096 : MCL - Multifunctional Computational Lexicon of Contemporary Portuguese
    MCL is a 26,443-lemma frequency lexicon
    with 140,315 tokens extracted from
    CORLEX, a contemporary Portuguese corpus
    (16,210,438 words). In order to extract
    the lexicon, all the different lexical
    forms occurring in the corpus were
    indexed and subsequently tagged
    morphosyntactically and lemmatised by
    PALAVROSO. Each lemma in MCL is followed
    by morphosyntactic and quantitative
    information.

  • (last update: February 2016)

    Copyright © 2008 ELRA