ELRA
  • Catalogue of Language Resources

    The ELRA Catalogue of Language Resources offers a repository of Language Resources (LRs) made available through ELRA.


    A growing number of LRs across the various fields of Human Language Technology (HLT) (see image on the left-hand side) are distributed on behalf of ELRA by its operational body, ELDA, thanks to contributions from many players in the HLT community.

    Our aim, through this repository, is to spare researchers and developers the effort of rebuilding resources that already exist, and to help them identify and access those resources.

    Resources that have been identified but are not available through ELRA can be viewed in the Universal Catalogue.

    If you have suggestions or comments, or need further details about ELRA and its Catalogue of Language Resources, please refer to the Contact Us section.

    ELRA is a partner of OLAC (Open Language Archives Community). The catalogue can be viewed as an OLAC repository.

    New Resources
  • S0302 : TC-STAR female baseline voice: Laura
    Laura contains the recordings of one
    female English (British) speaker
    recorded in a noise-reduced room through
    a headset microphone. It consists of the
    recordings and annotations of read text
    material of approximately 10 hours of
    speech for baseline applications
    (Text-to-Speech systems). The TC-STAR
    male baseline voice: Ian is also
    available via ELRA under reference
    ELRA-S0303.

  • S0303 : TC-STAR male baseline voice: Ian
    Ian contains the recordings of one male
    English (British) speaker recorded in a
    noise-reduced room through a headset
    microphone. It consists of the
    recordings and annotations of read text
    material of approximately 10 hours of
    speech for baseline applications
    (Text-to-Speech systems). The TC-STAR
    female baseline voice: Laura is also
    available via ELRA under reference
    ELRA-S0302.

  • S0304 : SpeechDat(M) Italian Mobile Network Speech Database
    This speech database contains recordings
    of 342 Italian speakers made over the
    Italian mobile telephone network. Each
    speaker uttered around 40 read and
    spontaneous items.

  • S0301 : Norwegian EUROM1
    EUROM1 is the first truly multilingual
    speech database produced in Europe. Over
    60 speakers per language pronounced
    numbers, sentences, and isolated words
    using a close-talking microphone.

  • T0373 : BioLexicon
    BioLexicon is a large-scale English
    terminological resource which has been
    developed to address the needs emerging
    in text mining efforts in the biomedical
    domain. It contains over 2.2M lexical
    entries (over 3.3M semantic relations),
    and information on over 1.8M variants
    and on over 2M synonymy relations.
    BioLexicon is available in a relational
    database format (MySQL dump format) and
    it adheres to the EAGLES/ISO standards
    for lexical resources.

  • (last update: February 2010)

    Copyright © 2008 ELRA
    ELRACatalogue 0.8.0