Catalog Reference : E0034
EASy Evaluation Package
The EASy Evaluation Package was produced within the French national project EASy (Evaluation of syntactic parsers of French), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The project made it possible to carry out an evaluation campaign for syntactic parsers of French.
This package contains the material used for the EASy evaluation campaign: the resources, protocols, scoring tools, campaign results, etc., that were used or produced during the campaign. The aim of the package is to enable external parties to evaluate their own systems and compare their results with those obtained during the campaign itself.
The campaign is divided into two actions:
1) Evaluation of constituent annotation: this assesses how well parsers perform according to the type of corpus (e.g. literature, conversation transcription, parliamentary speech, questions for information retrieval tools).
2) Evaluation of dependency relation annotation: this assesses how well parsers identify the relations between constituents or words.
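As an illustration of what scoring constituent annotation can involve, the sketch below compares a parser's constituents against a reference annotation using precision, recall and F-measure. It is a minimal sketch only: the choice of metrics, the `Span` type, the `score` function and the labels in the example are assumptions made here for illustration, and do not describe the scoring tools shipped in the package.

```python
from typing import Iterable, NamedTuple, Tuple


class Span(NamedTuple):
    """Hypothetical constituent: token offsets plus a label."""
    start: int
    end: int
    label: str


def score(reference: Iterable[Span], hypothesis: Iterable[Span]) -> Tuple[float, float, float]:
    """Return (precision, recall, F-measure) using exact span-and-label matching."""
    ref, hyp = set(reference), set(hypothesis)
    correct = len(ref & hyp)  # constituents found in both annotations
    precision = correct / len(hyp) if hyp else 0.0
    recall = correct / len(ref) if ref else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


# Illustrative data only: labels and offsets are made up for this example.
reference = [Span(0, 2, "GN"), Span(2, 3, "NV"), Span(3, 5, "GP")]
hypothesis = [Span(0, 2, "GN"), Span(2, 3, "NV"), Span(3, 4, "GN")]
precision, recall, f_measure = score(reference, hypothesis)
```

Dependency relations could be scored the same way by replacing `Span` with a tuple such as (source, target, relation type), again as an assumption rather than the campaign's actual protocol.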
The EASy evaluation package contains the following data and tools:
1) A collection of syntactically annotated French texts drawn from 6 domains (about one million words):
- medicine: 100,000 words, including 5,000 annotated words,
- literature: 150,000 words, including 15,000 annotated words,
- emails: 2,250 anonymised personal emails (121,000 words),
- general: 250,000 words, including 24,000 annotated words, extracted from Le Monde newspaper, reports from the French Senate and the European Assembly (MLCC, MultiLingual Corpora for Co-operation, catalogue ref: ELRA-W0023),
- speech: 10 passages of transcribed dialogues from the Spoken French corpus (8,000 annotated words),
- questions: corpus of 137,000 words, extracted from the TREC and AMARYLLIS campaigns, including 5,000 annotated words.
2) PASTK++: a set of evaluation tools for constituents and relations. It includes a version of the EASy campaign tools as modified during the PASSAGE campaign (which followed the EASy campaign).
3) Visualization tools for constituents and relations
A description of the project (in French) is available at the following address:
http://www.technolangue.net/article.php3?id_article=198
Production
Project: EVALDA
Contents
Written corpus
Number of languages: Monolingual
Language(s): French
Number of tokens: About 1 million words
Text annotation level: Syntactic
Member Prices
Academic - Evaluation 150.00 EUR
Commercial - Evaluation 500.00 EUR
Non Member Prices
Academic - Evaluation 300.00 EUR
Commercial - Evaluation 1000.00 EUR
Saturday 31 July, 2010
Copyright © 2008
ELRA