Home Catalogue
Language Resources
Bug reports
Send us your bug reports.
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Anglais Français
  • Purchase procedure & Conditions

  • Pricing & user licences

  • How to promote your resources ?

  • Contact Us
  • Catalog Reference : ELRA-E0024
    MEDIA Evaluation Package
    The MEDIA Evaluation Package was produced within the French national project MEDIA (Automatic evaluation of man-machine dialogue systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The MEDIA project enabled to carry out a campaign for the evaluation of man-machine dialogue systems for French.

    This package includes the material that was used for the MEDIA evaluation campaign. It includes resources, protocols, scoring tools, results of the campaign, etc., that were used or produced during the campaign. The aim of these evaluation packages is to enable external players to evaluate their own system and compare their results with those obtained during the campaign itself.

    The campaign is distributed over two actions:
    1) Evaluation taking into account the dialogue context: it consists in producing semantic annotation outside dialogue context for each of the 3,000 test prompts.
    2) Evaluation not taking into account the dialogue context: it consists in evaluating the capacity of understanding systems a) from orthographic transcriptions only and b) from transcriptions and reference annotations outside dialogue context.

    The MEDIA evaluation package contains the following data and tools:
    1) Corpus of 1,258 dialogues (WoZ) recorded for a tourist information task, with transcriptions and annotations in dialogue acts and in semantic segments within or outside dialogue context. The annotations follow the XML formalism. This corpus consists of 18,831 user prompts used for the adaptation of systems and for evaluation.
    2) The text corpus (200 dialogues) is annotated with meta-annotations in order to proceed with a diagnosis study of the systems output. These meta-annotations consist of annotations of speech-oriented phenomena such as repetitions, auto-corrections, incises, etc.
    3) The Semantizer annotation tool.
    4) The Mediaval HC (outside context) evaluation tool.
    5) The Mediaval EC (within context) evaluation tool.

    A description of the project is available at the following address:
    http://www.technolangue.net/article.php3?id_article=62 (in French language)

    ISLRN : 699-856-029-354-6
    Project : EVALDA
    Technical Information
    Distribution medium : Downloadable
    Contents Click on the arrow to display content.
     speech corpus 
    Members Prices
    Academic - Commercial 20000.00 EUR
    Academic - Research 300.00 EUR
    Commercial - Commercial 20000.00 EUR
    Commercial - Evaluation 1000.00 EUR
    Commercial - Research 5000.00 EUR
    Non Member Prices
    Academic - Commercial 25000.00 EUR
    Academic - Research 2000.00 EUR
    Commercial - Commercial 25000.00 EUR
    Commercial - Evaluation 6500.00 EUR
    Commercial - Research 7500.00 EUR

    Resource available in the bundle(s) referenced below.

    Copyright © 2008 ELRA
    ELRACatalogue 0.8.0