Resource Type:

Corpus:
Lexical/Conceptual:
Tool/Service:
Language Description:

Media Type:

Text:
Audio:
Image:
Video:
Text Numerical:
Text N-Gram:

19 Language Resources

Order by:

 88milSMS. A corpus of authentic text messages in French    
  • French

ID: ELRA-W0082

ISLRN: 024-713-187-947-8

A pluridisciplinary team of linguists and computer scientists (Rachel Panckhurst, Catherine Détrie, Cédric Lopez, Claudine Moïse, Mathieu Roche, Bertrand Verine (Praxiling, Lirmm, Lidilem, Tetis, Viseo) collected more than 88,000 French authentic text messages in Montpellier (2011), as part of th...

MEMBERacademiccommercial
Licence: Non Commercial Use - Non Standard Licence Terms
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - Non Standard Licence Terms
0.00 € submit
0.00 € submit
 Amaryllis Corpus - Evaluation Package    
  • French

ID: ELRA-W0029

ISLRN: 786-395-313-491-8

Launched at the end of 1995, the AMARYLLIS project aimed at evaluating information retrieval software for French text corpora in order to provide a methodology for the evaluation of other similar tools. AMARYLLIS was organised by the Institut de l'Information Scientifique et Technique (INIST) wit...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
45.00 € submit
100.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
45.00 € submit
100.00 € submit
 A "scientific" corpus of modern French ("La Recherche" magazine) - Complete version    
  • French

ID: ELRA-W0025-02

ISLRN: 798-363-116-656-4

This "scientific" corpus of modern French was produced by the University of Nantes (France) within the European Commission funded project LRsP&P (Language Resources Production & Packaging - LE4-8335). The corpus contains all articles published in La Recherche magazine in 1998, including issues 30...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
400.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
500.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
 A "scientific" corpus of modern French ("La Recherche" magazine) - Raw data    
  • French

ID: ELRA-W0025-01

ISLRN: 508-941-013-339-7

This "scientific" corpus of modern French was produced by the University of Nantes (France) through a funding from ELRA in the framework of the European Commission project LRsP&P (Language Resources Production & Packaging - LE4-8335). The corpus contains all articles published in La Recherche mag...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
240.00 € submit
1200.00 € submit
Licence: Commercial Use - ELRA VAR
1200.00 € submit
1200.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
310.00 € submit
1500.00 € submit
Licence: Commercial Use - ELRA VAR
1500.00 € submit
1500.00 € submit
 CESART Evaluation Package    
  • French

ID: ELRA-E0019

ISLRN: 154-799-255-123-0

The CESART Evaluation Package was produced within the French national project CESART (Evaluation of terminology extraction tools), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The CESART project enabled to carry out a campaign for th...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
150.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
300.00 € submit
1000.00 € submit
 CLEF QAST (2007-2009) – Evaluation Package    
  • English
  • French
  • Spanish; Castilian

ID: ELRA-E0039

ISLRN: 460-370-870-489-0

The Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) c...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
150.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
300.00 € submit
1000.00 € submit

Special offers are also available. Check here for details.

 DEFT'08 Evaluation Package    
  • French

ID: ELRA-E0035

ISLRN: 161-881-080-899-5

DEFT (DEfi Fouille de Texte – Text Mining Challenge) organizes evaluation campaigns in the field of text mining. The topic of DEFT 2008 edition is related to the classification of texts by topics and genres. Automatic classification has multiple applications in text mining. Many application fiel...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
240.91 € submit
240.91 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
313.18 € submit
313.18 € submit
 EASy Evaluation Package    
  • French

ID: ELRA-E0034

ISLRN: 238-723-334-894-5

The EASy Evaluation Package was produced within the French national project EASy (Evaluation of syntactic parsers of French), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The project enabled to carry out a campaign for the evaluation...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
150.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
300.00 € submit
1000.00 € submit
 EQueR Evaluation Package    
  • French

ID: ELRA-E0022

ISLRN: 725-358-759-122-3

The EQueR Evaluation Package was produced within the French national project EQueR (Evaluation campaign for Question-Answering systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The EQueR project enabled to carry out a campaign f...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
150.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
300.00 € submit
1000.00 € submit
 EvaSy Evaluation Package      
  • French

ID: ELRA-E0023

ISLRN: 340-228-754-954-9

The EvaSy Evaluation Package was produced within the French national project EvaSy (Evaluation of speech synthesis systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The EvaSy project enabled to carry out a campaign for the evalu...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
150.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
300.00 € submit
1000.00 € submit
 "Le Monde Diplomatique" Text corpus in French - archives 1980-1998    
  • French

ID: ELRA-W0036-01

ISLRN: 232-619-161-765-9

Electronic archiving of "Le Monde Diplomatique" articles in French from 1980 to 1998. The corpus is available in HTML. Each HTML file contains one article. Number of articles available per year : • 1980: 575 articles (500,088 words) • 1981: 560 articles (462,611 words) • 1982: 604 articles (49...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
171.00 € submit
171.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
295.00 € submit
295.00 € submit
 "Le Monde Diplomatique" Text corpus in French - archives from 1999    
  • French

ID: ELRA-W0036-02

ISLRN: 588-072-916-734-0

Electronic archiving of "Le Monde Diplomatique" articles in French from 1999. The corpus is available in HTML. Each HTML file contains one article. Number of articles available per year : • 1999: 820 articles (393,813 words) • 2000: 765 articles (376,027 words) • 2001: 743 articles (368,739...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
46.00 € submit
46.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
69.00 € submit
69.00 € submit
 Modern French Corpus including Anaphors Tagging    
  • French

ID: ELRA-W0032

ISLRN: 488-420-763-510-8

The corpus that includes the tagging of the anaphors was created by the CRISTAL-GRESEC (Stendhal-Grenoble 3 University, France) team and XRCE (Xerox Research Centre Europe, France) in the framework of the call launched by the DGLF-LF (national institution for the French language and the languages...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
250.00 € submit
250.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1000.00 € submit
1000.00 € submit
 PANACEA Environment French monolingual corpus    
  • French

ID: ELRA-W0065

ISLRN: 400-316-779-360-9

The PANACEA Environment French monolingual corpus was acquired in the framework of the PANACEA project (Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies), under the European Commission's Seventh Framework Programme....

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
 PANACEA Labour French monolingual corpus    
  • French

ID: ELRA-W0066

ISLRN: 349-917-944-285-0

The PANACEA Labour French monolingual corpus was acquired in the framework of the PANACEA project (Platform for Automatic, Normalized Annotation and Cost-Effective Acquisition of Language Resources for Human Language Technologies), under the European Commission's Seventh Framework Programme. ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
 PAROLE French Corpus    
  • French

ID: ELRA-W0020

ISLRN: 270-087-727-987-7

The PAROLE French corpus contains the following data: Miscellaneous: Data provided by ELRA (CRATER, MLCC Multilingual and Parallel Corpora) 2 025 964 words Books: CNRS Editions 3 267 409 words Periodicals: CNRS Info, Hermès 942 963 words Newspapers: Le Monde, provided by ELRA 13 856 763 words Tot...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1540.00 € submit
1540.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
4300.00 € submit
4300.00 € submit
 Quaero Old Press Extended Named Entity corpus    
  • French

ID: ELRA-W0073

ISLRN: 864-217-681-552-4

The Quaero Old Press Extended Named Entity corpus consists of the manual annotation of 76 newspaper issues published in 1890-1891 and provided by the French National Library (Bibliothèque Nationale de France). Three different titles are used (Le Temps, La Croix and Le Figaro) for a total of 295 p...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
 Tagged text in French (MEMODATA) with rules of morphological disambiguation    
  • French

ID: ELRA-W0012

ISLRN: 054-030-339-261-8

More than 170 books (classical novels, legal texts...) are tagged with rules of morphological disambiguation. A tagged corpus of 50 books is available for research. It consists of several authors of the 19th century (Balzac, Hugo, Stendhal). See also W0011.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2461.00 € submit
3077.00 € submit
Licence: Commercial Use - ELRA VAR
3077.00 € submit
3077.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3077.00 € submit
3846.00 € submit
Licence: Commercial Use - ELRA VAR
3846.00 € submit
3846.00 € submit
 Tagged text in French (MEMODATA) with typographic tags    
  • French

ID: ELRA-W0011

ISLRN: 147-607-533-294-8

More than 170 books (classical novels, legal texts...) are tagged with typographic tags. A tagged corpus of 50 books is available for research. It consists of several authors of the 19th century (Balzac, Hugo, Stendhal). See also W0012.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1723.00 € submit
2154.00 € submit
Licence: Commercial Use - ELRA VAR
2154.00 € submit
2154.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2154.00 € submit
2692.00 € submit
Licence: Commercial Use - ELRA VAR
2692.00 € submit
2692.00 € submit