Catalog Reference: ELRA-E0034
EASy Evaluation Package
The EASy Evaluation Package was produced within the French national project EASy (Evaluation of syntactic parsers of French), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The project made it possible to carry out an evaluation campaign for syntactic parsers of French.
This package contains the material used or produced during the EASy evaluation campaign: resources, protocols, scoring tools, campaign results, etc. Its aim is to enable external players to evaluate their own systems and compare their results with those obtained during the campaign itself.
The campaign was divided into two tasks:
1) Evaluation of constituent annotation: assessing parser performance according to the type of corpus (e.g. literature, conversation transcription, parliamentary speech, questions for information retrieval tools).
2) Evaluation of dependency relation annotation: assessing parser performance on the relations between constituents or words.
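The two tasks above can be illustrated with a minimal scoring sketch. It is not the PASTK++ tool and does not reproduce the campaign protocol: it assumes, purely for illustration, that annotations are compared as sets of exact-match tuples and scored with precision, recall and F-measure. The function name, the tuple layouts and the labels (NP, VP, PP, SUBJ, OBJ) are hypothetical.

```python
from typing import Hashable, Iterable, Tuple


def precision_recall_f1(
    reference: Iterable[Hashable], hypothesis: Iterable[Hashable]
) -> Tuple[float, float, float]:
    """Score a parser output against a reference by exact set match.

    Assumption: an item counts as correct only if it is identical
    in the reference and in the hypothesis.
    """
    ref, hyp = set(reference), set(hypothesis)
    correct = len(ref & hyp)
    precision = correct / len(hyp) if hyp else 0.0
    recall = correct / len(ref) if ref else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


# Hypothetical constituents: (label, first_token, last_token)
ref_constituents = [("NP", 0, 1), ("VP", 2, 2), ("PP", 3, 5)]
hyp_constituents = [("NP", 0, 1), ("VP", 2, 3), ("PP", 3, 5)]
constituent_scores = precision_recall_f1(ref_constituents, hyp_constituents)

# Hypothetical relations: (relation, head_token, dependent_token)
ref_relations = [("SUBJ", 2, 1), ("OBJ", 2, 5)]
hyp_relations = [("SUBJ", 2, 1)]
relation_scores = precision_recall_f1(ref_relations, hyp_relations)

print("constituents (P, R, F):", constituent_scores)
print("relations    (P, R, F):", relation_scores)
```

Scoring both tasks with one function keeps the comparison uniform: only the tuple layout changes between constituents and relations. Scoring each corpus type separately would only require calling the function once per sub-corpus.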
The EASy evaluation package contains the following data and tools:
1) A collection of syntactically tagged French texts drawn from 6 domains (about one million words):
- medicine: 100,000 words, including 5,000 annotated words,
- literature: 150,000 words, including 15,000 annotated words,
- emails: 2,250 anonymised personal emails (121,000 words),
- general: 250,000 words, including 24,000 annotated words, extracted from Le Monde newspaper, reports from the French Senate and the European Assembly (MLCC, MultiLingual Corpora for Co-operation, catalogue ref: ELRA-W0023),
- speech: 10 passages of transcribed dialogues from the Spoken French corpus (8,000 annotated words),
- questions: corpus of 137,000 words, extracted from the TREC and AMARYLLIS campaigns, including 5,000 annotated words.
2) PASTK++: a set of evaluation tools for constituents and relations. It includes a version of the EASy campaign tools as modified during the PASSAGE campaign (which followed EASy).
3) Visualization tools for constituents and relations
A description of the project (in French) is available at the following address:
Number of languages
Number of tokens: about 1 million words
Member Prices
Academic - Evaluation 150.00 EUR
Commercial - Evaluation 500.00 EUR
Non Member Prices
Academic - Evaluation 300.00 EUR
Commercial - Evaluation 1000.00 EUR
Copyright © 2008