213 Language Resources (Page 1 of 11)


- French
ID: ELRA-W0082
ISLRN: 024-713-187-947-8
A multidisciplinary team of linguists and computer scientists (Rachel Panckhurst, Catherine Détrie, Cédric Lopez, Claudine Moïse, Mathieu Roche, Bertrand Verine; Praxiling, Lirmm, Lidilem, Tetis, Viseo) collected more than 88,000 authentic French text messages in Montpellier (2011), as part of th...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - Non Standard Licence Terms | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - Non Standard Licence Terms | 0.00 € | 0.00 € |


- French
ID: ELRA-W0029
ISLRN: 786-395-313-491-8
Launched at the end of 1995, the AMARYLLIS project aimed at evaluating information retrieval software for French text corpora in order to provide a methodology for the evaluation of other similar tools. AMARYLLIS was organised by the Institut de l'Information Scientifique et Technique (INIST) wit...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 45.00 € | 100.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 45.00 € | 100.00 € |


- Arabic
- English
- French
ID: ELRA-W0323
ISLRN: 482-848-308-105-6
The annotated tweet corpus in Arabizi, French and English was built by ELDA on behalf of INSA Rouen Normandie (Normandie Université, LITIS team), in the framework of the SAPhIRS project (System for the Analysis of Information Propagation in Social Networks), funded by the DGE (Direction Générale ...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 7000.00 € |
Licence: Commercial Use - ELRA VAR | 7000.00 € | 7000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 10000.00 € |
Licence: Commercial Use - ELRA VAR | 10000.00 € | 10000.00 € |


- Arabic
- Chinese
- English
- French
- German
- Italian
- Japanese
- Modern Greek (1453-)
- Persian
- Russian
- Spanish; Castilian
ID: ELRA-E0018
ISLRN: 875-865-064-331-9
The ARCADE II Evaluation Package was produced within the French national project ARCADE II (Evaluation of parallel text alignment systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The ARCADE II project made it possible to carry out a cam...

MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 150.00 € | 500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 300.00 € | 1000.00 € |


- English
- French
- Italian
ID: ELRA-W0018
ISLRN: 681-769-134-114-2
The ARCADE/ROMANSEVAL corpus was used as a reference corpus in two international competitions: ARCADE, an exercise on multilingual text alignment financed by AUPELF-UREF; and ROMANSEVAL, part of the SENSEVAL exercise sponsored by ACL-SIGLEX and EURALEX, on word sense disambiguation. The corpus ...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 2000.00 € |
Licence: Commercial Use - ELRA VAR | 2000.00 € | 2000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 5000.00 € |
Licence: Commercial Use - ELRA VAR | 5000.00 € | 5000.00 € |


- French
ID: ELRA-W0025-02
ISLRN: 798-363-116-656-4
This "scientific" corpus of modern French was produced by the University of Nantes (France) within the European Commission-funded project LRsP&P (Language Resources Production & Packaging - LE4-8335). The corpus contains all articles published in La Recherche magazine in 1998, including issues 30...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 400.00 € | 3000.00 € |
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 500.00 € | 5000.00 € |
Licence: Commercial Use - ELRA VAR | 5000.00 € | 5000.00 € |


- French
ID: ELRA-W0025-01
ISLRN: 508-941-013-339-7
This "scientific" corpus of modern French was produced by the University of Nantes (France) with funding from ELRA in the framework of the European Commission project LRsP&P (Language Resources Production & Packaging - LE4-8335). The corpus contains all articles published in La Recherche mag...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 240.00 € | 1200.00 € |
Licence: Commercial Use - ELRA VAR | 1200.00 € | 1200.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 310.00 € | 1500.00 € |
Licence: Commercial Use - ELRA VAR | 1500.00 € | 1500.00 € |


- English
- French
- German
- Spanish; Castilian
ID: ELRA-T0097
ISLRN: 536-306-764-088-7
Cards available: 1420
Languages: German, English, French, Spanish
Card description: Each card in this terminological database contains a definition, relations between concepts, graphics, abbreviations, notes, sub-domains, sources and grammatical labels.

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 1746.60 € | 1746.60 € |
Licence: Commercial Use - ELRA VAR | 1746.60 € | 1746.60 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 2911.00 € | 2911.00 € |
Licence: Commercial Use - ELRA VAR | 2911.00 € | 2911.00 € |


- English
- French
- German
- Italian
- Spanish; Castilian
ID: ELRA-M0001
ISLRN: 874-922-751-076-4
Entries: 30 000 per language
Languages: French, English, Italian, German, Spanish
Format: ASCII or ANSI with separators between entries
Medium: CD-ROM
The words are linked by meaning. The lexical categories are: nouns (5 * 18 000), verbs (5 * 8 000), adjectives (5 * 6 000), adverbs (5 * ...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 8861.00 € | 11077.00 € |
Licence: Commercial Use - ELRA VAR | 11077.00 € | 11077.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 11077.00 € | 13846.00 € |
Licence: Commercial Use - ELRA VAR | 13846.00 € | 13846.00 € |



- French
ID: ELRA-S0004
ISLRN: 613-587-811-827-8
BDLEX is a lexical database developed within the French GDR-PRC CHM at IRIT (IMH-PT team), Paul Sabatier University, Toulouse. The data cover lexical, phonological, and morphological information. BDLEX consists of about 440,000 inflected forms (generated from about 50,000 c...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 1000.00 € | 3000.00 € |
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 2000.00 € | 5000.00 € |
Licence: Commercial Use - ELRA VAR | 5000.00 € | 5000.00 € |


- French
ID: ELRA-L0142
ISLRN: 729-083-390-616-0
The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - French consists of 60,000 lemmas (1,400,000 forms...

MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR | 67000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR | 67000.00 € |


- French
ID: ELRA-L0156
ISLRN: 484-371-030-999-7
As a complement to the generic vocabulary provided in ELRA-L0142, language variants of French are provided with the following features: Tense, Mood, Person, Number, Gender, Contraction, Pronominal Clitics. Variants are distributed as follows: - French (France): 76,000 lemmas / 1,450,000 forms ...

MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR | 88000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR | 88000.00 € |


- French
ID: ELRA-E0019
ISLRN: 154-799-255-123-0
The CESART Evaluation Package was produced within the French national project CESART (Evaluation of terminology extraction tools), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The CESART project made it possible to carry out a campaign for th...

MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 150.00 € | 500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 300.00 € | 1000.00 € |


- Arabic
- English
- French
ID: ELRA-E0020
ISLRN: 809-316-046-724-8
The CESTA Evaluation Package was produced within the French national project CESTA (Evaluation of MT systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The CESTA project made it possible to carry out a campaign for the evaluation of machi...

MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 150.00 € | 500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 300.00 € | 1000.00 € |


- Bulgarian
- Czech
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hungarian
- Italian
- Persian
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
ID: ELRA-E0036
ISLRN: 378-279-085-589-0
The Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) c...

MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 150.00 € | 500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 300.00 € | 1000.00 € |

Special offers are also available. Check here for details.


- English
- French
- Spanish; Castilian
ID: ELRA-E0039
ISLRN: 460-370-870-489-0
The Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) c...

MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 150.00 € | 500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 300.00 € | 1000.00 € |

Special offers are also available. Check here for details.


- Bulgarian
- Dutch; Flemish
- English
- Finnish
- French
- German
- Italian
- Portuguese
- Romanian; Moldavian; Moldovan
- Spanish; Castilian
ID: ELRA-E0038
ISLRN: 394-993-527-034-7
The Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) c...

MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 150.00 € | 500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 300.00 € | 1000.00 € |

Special offers are also available. Check here for details.


- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hindi
- Italian
- Japanese
- Korean
- Modern Greek (1453-)
- Norwegian
- Persian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Thai
- Turkish
- Vietnamese
ID: ELRA-T0377
ISLRN: 452-383-219-228-0
The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, distributed separately under reference ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank). The PhraseBank consists of 2,000 p...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 1680.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 2240.00 € |


- Arabic
- Bengali
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hindi
- Italian
- Japanese
- Korean
- Malayalam
- Modern Greek (1453-)
- Norwegian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Spanish; Castilian
- Swedish
- Tamil
- Thai
- Turkish
- Ukrainian
- Vietnamese
ID: ELRA-T0376
ISLRN: 990-814-402-335-7
The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank) and a multilingual set of sentences in 28 languages (the PhraseBank, distributed separately under reference ELRA-T0377). The WordBank contains 10,000 words...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 2400.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 3600.00 € |


- English
- French
- Modern Greek (1453-)
ID: ELRA-W0309
ISLRN: 290-988-825-874-8
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English text of the Convention against Torture and Other...

MEMBER | academic | commercial |
---|---|---|
Licence: Other - Public Domain | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Public Domain | 0.00 € | 0.00 € |