ELRA
  • Catalogue of Language Resources

    ELRA releases free Language Resources. (last update: January 24, 2013)


    The ELRA Catalogue of Language Resources offers a repository of Language Resources (LRs) made available through ELRA.



    An increasing number of LRs in the various fields of Human Language Technology (see image on the left-hand side) are distributed on behalf of ELRA by its operational body, ELDA, thanks to contributions from various players in the HLT community.

    Our aim in providing this repository is to spare researchers and developers the effort of rebuilding resources that already exist, and to help them identify and access those resources.

    Other resources identified, but not available through ELRA, can be viewed in the Universal Catalogue.

    If you have any suggestions or comments, or need further details about ELRA and its Catalogue of Language Resources, please refer to the Contact Us section.

    ELRA is a partner of OLAC (Open Language Archives Community). The catalogue can be viewed as an OLAC repository.

    New Resources
  • ELRA-E0041 : CHIL 2007+ Evaluation Package
    The CHIL Seminars are scientific
    presentations given by students, faculty
    members or invited speakers in the field
    of multimodal interfaces and speech
    processing. The language is European
    English spoken by non-native speakers.
    The recordings comprise the following:
    videos of the speaker and the audience
    from 4 fixed cameras, frontal close-ups
    of the speaker, close-talking and
    far-field microphone data of the
    speaker’s voice and background sounds.
    The CHIL 2007+ Evaluation Package
    includes: 1) CHIL 2007 Evaluation
    Package (see ELRA-E0033) and 2)
    additional annotations which have been
    created within the scope of the
    Metanet4u Project (ICT PSP No 270893),
    sponsored by the European Commission.

  • ELRA-S0349 : Quaero Broadcast News Extended Named Entity corpus
    This corpus consists of the manual
    annotation of (i) the ESTER 2 (see also
    ELRA-S0338) manual transcription corpus
    and (ii) the Quaero Speech Recognition
    Evaluation corpus (manual and automatic
    transcriptions coming from 3 different
    ASR systems). The corpus is fully
    manually annotated according to the
    Quaero extended and structured named
    entity definition.

  • ELRA-W0073 : Quaero Old Press Extended Named Entity corpus
    This corpus consists of the manual
    annotation of 76 newspaper issues
    published in 1890-1891 and provided by
    the French National Library
    (Bibliothèque Nationale de France).
    Three different titles are used (Le
    Temps, La Croix and Le Figaro) for a
    total of 295 pages. The corpus is fully
    manually annotated according to the
    Quaero extended and structured named
    entity definition.

  • ELRA-W0057 : PANACEA English-French and English-Greek parallel corpus acquired for Environment domain
    This package consists of an
    English-French and English-Greek
    sentence-aligned parallel corpus from
    the Environment domain automatically
    acquired from the web during 2010 and
    2011. It was acquired in the framework
    of the PANACEA project. Data and
    language pairs are split into training,
    test and development test sets.

  • ELRA-W0058 : PANACEA English-French and English-Greek parallel corpus acquired for Labour Legislation domain
    This package consists of an
    English-French and English-Greek
    sentence-aligned parallel corpus from
    the Labour Legislation domain
    automatically acquired from the web
    during 2010 and 2011. It was acquired
    in the framework of the PANACEA project.
    Data and language pairs are split into
    training, test and development test
    sets.

  • (last update: May 2013)

    Copyright © 2008 ELRA