101 Language Resources (Page 1 of 6)

« Previous | Next »Order by:

 88milSMS. A corpus of authentic text messages in French    
  • French

ID: ELRA-W0082

ISLRN: 024-713-187-947-8

A pluridisciplinary team of linguists and computer scientists (Rachel Panckhurst, Catherine Détrie, Cédric Lopez, Claudine Moïse, Mathieu Roche, Bertrand Verine (Praxiling, Lirmm, Lidilem, Tetis, Viseo) collected more than 88,000 French authentic text messages in Montpellier (2011), as part of th...

MEMBERacademiccommercial
Licence: Non Commercial Use - Non Standard Licence Terms
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - Non Standard Licence Terms
0.00 € submit
0.00 € submit
 Amaryllis Corpus - Evaluation Package    
  • French

ID: ELRA-W0029

ISLRN: 786-395-313-491-8

Launched at the end of 1995, the AMARYLLIS project aimed at evaluating information retrieval software for French text corpora in order to provide a methodology for the evaluation of other similar tools. AMARYLLIS was organised by the Institut de l'Information Scientifique et Technique (INIST) wit...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
45.00 € submit
100.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
45.00 € submit
100.00 € submit
 Annotated tweet corpus in Arabizi, French and English    
  • Arabic
  • English
  • French

ID: ELRA-W0323

ISLRN: 482-848-308-105-6

The annotated tweet corpus in Arabizi, French and English was built by ELDA on behalf of INSA Rouen Normandie (Normandie Université, LITIS team), in the framework of the SAPhIRS project (System for the Analysis of Information Propagation in Social Networks), funded by the DGE (Direction Générale ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
7000.00 € submit
Licence: Commercial Use - ELRA VAR
7000.00 € submit
7000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
10000.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit
 ARCADE/ROMANSEVAL corpus    
  • English
  • French
  • Italian

ID: ELRA-W0018

ISLRN: 681-769-134-114-2

The ARCADE/ROMANSEVAL corpus was used as a reference corpus in two international competitions: · ARCADE, an exercise on multilingual text alignment financed by AUPELF-UREF · ROMANSEVAL, part of the SENSEVAL exercise sponsored by ACL-SIGLEX and EURALEX, on word sense disambiguation. The corpus ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
 A "scientific" corpus of modern French ("La Recherche" magazine) - Complete version    
  • French

ID: ELRA-W0025-02

ISLRN: 798-363-116-656-4

This "scientific" corpus of modern French was produced by the University of Nantes (France) within the European Commission funded project LRsP&P (Language Resources Production & Packaging - LE4-8335). The corpus contains all articles published in La Recherche magazine in 1998, including issues 30...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
400.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
500.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
 A "scientific" corpus of modern French ("La Recherche" magazine) - Raw data    
  • French

ID: ELRA-W0025-01

ISLRN: 508-941-013-339-7

This "scientific" corpus of modern French was produced by the University of Nantes (France) through a funding from ELRA in the framework of the European Commission project LRsP&P (Language Resources Production & Packaging - LE4-8335). The corpus contains all articles published in La Recherche mag...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
240.00 € submit
1200.00 € submit
Licence: Commercial Use - ELRA VAR
1200.00 € submit
1200.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
310.00 € submit
1500.00 € submit
Licence: Commercial Use - ELRA VAR
1500.00 € submit
1500.00 € submit
 Automobile Engineering    
  • English
  • French
  • German
  • Spanish; Castilian

ID: ELRA-T0097

ISLRN: 536-306-764-088-7

Cards available: 1420 Languages: German, English, French, Spanish Card Description: Each card in this terminological database contains a definition, relation between concepts, graphics, abbreviations, notes, sub-domains, sources, grammatical labels.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1746.60 € submit
1746.60 € submit
Licence: Commercial Use - ELRA VAR
1746.60 € submit
1746.60 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2911.00 € submit
2911.00 € submit
Licence: Commercial Use - ELRA VAR
2911.00 € submit
2911.00 € submit
 Basic multilingual lexicon (MEMODATA)    
  • English
  • French
  • German
  • Italian
  • Spanish; Castilian

ID: ELRA-M0001

ISLRN: 874-922-751-076-4

Entries: 30 000 each language Languages: French, English, Italian, German, Spanish Format: ASCII or ANSI with separators between entries Medium: CD-ROM The words are associated by the meaning. The lexical categories are: nouns (5 * 18 000), verbs (5 * 8 000), adjectives (5 * 6 000), adverbs (5 * ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
8861.00 € submit
11077.00 € submit
Licence: Commercial Use - ELRA VAR
11077.00 € submit
11077.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
11077.00 € submit
13846.00 € submit
Licence: Commercial Use - ELRA VAR
13846.00 € submit
13846.00 € submit
 BDLEX      
  • French

ID: ELRA-S0004

ISLRN: 613-587-811-827-8

BDLEX consists of a lexical database developed within the French GDR-PRC CHM at IRIT (IMH-PT team), Paul Sabatier University, Toulouse. The data cover lexical, phonological, and morphological information. The database BDLEX consists of about 440,000 inflected forms (generated from about 50,000 c...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1000.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
 Collins Multilingual database (MLD) - PhraseBank    
  • Arabic
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Finnish
  • French
  • German
  • Hindi
  • Italian
  • Japanese
  • Korean
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swedish
  • Thai
  • Turkish
  • Vietnamese

ID: ELRA-T0377

ISLRN: 452-383-219-228-0

The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, distributed separately under reference ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank). The PhraseBank consists of 2,000 p...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1680.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2240.00 € submit
 Collins Multilingual database (MLD) - WordBank    
  • Arabic
  • Bengali
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Finnish
  • French
  • German
  • Hindi
  • Italian
  • Japanese
  • Korean
  • Malayalam
  • Modern Greek (1453-)
  • Norwegian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Spanish; Castilian
  • Swedish
  • Tamil
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese

ID: ELRA-T0376

ISLRN: 990-814-402-335-7

The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank) and a multilingual set of sentences in 28 languages (the PhraseBank, distributed separately under reference ELRA-T0377). The WordBank contains 10,000 words...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2400.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3600.00 € submit
 Corpus of Interactions between Seniors and an Empathic Virtual Coach in Spanish, French and Norwegian      
  • English
  • French
  • Norwegian
  • Spanish; Castilian

ID: ELRA-S0414

ISLRN: 631-345-309-445-9

The Corpus of Interactions between Seniors and an Empathic Virtual Coach in Spanish, French and Norwegian was built within the EMPATHIC project (Empathic, Expressive, Advanced Virtual Coach to Improve Independent Healthy-Life-Years of the Elderly), funded within the European Union’s Horizon 2020 ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
500.00 € submit
25000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
500.00 € submit
25000.00 € submit

Special offers are also available. Check here for details.

 CRATER 2 Corpus    
  • English
  • French
  • Spanish; Castilian

ID: ELRA-W0033

ISLRN: 052-466-219-226-4

The CRATER corpus was built upon the foundations of an earlier project, ET10/63, which was funded in the final phase of the Eurotra programme. The Corpus Resources and Terminology Extraction project (MLAP-93 20) extended the bilingual annotated English-French International Telecommunications Unio...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
25.00 € submit
Licence: Commercial Use - ELRA VAR
25.00 € submit
25.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
125.00 € submit
Licence: Commercial Use - ELRA VAR
125.00 € submit
125.00 € submit
 CRATER corpus    
  • English
  • French
  • Spanish; Castilian

ID: ELRA-W0003

ISLRN: 645-721-607-031-5

The Corpus Resources and Terminology Extraction project (MLAP-93 20) has extended the bilingual annotated English-French International Telecommunications Union corpus to include Spanish, and has also debugged the existing corpus. The offer consists of a multi-lingual aligned corpus of 1,000,000 t...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
20.00 € submit
Licence: Commercial Use - ELRA VAR
20.00 € submit
20.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
100.00 € submit
Licence: Commercial Use - ELRA VAR
100.00 € submit
100.00 € submit
 DICO-MORPH_Collocation    
  • French

ID: ELRA-L0002

ISLRN: 316-221-997-443-5

Entries: up to 35000 Language: French Format: ASCII Medium: Floppy disk This is an adding for the French lexicon for morphological works (referenced herein as ELRA-L0001 DICO-MORPH_Lemme MEMODATA).

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6992.00 € submit
8740.00 € submit
Licence: Commercial Use - ELRA VAR
8740.00 € submit
8740.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
8740.00 € submit
10925.00 € submit
Licence: Commercial Use - ELRA VAR
10925.00 € submit
10925.00 € submit
 DICO-MORPH_Lemme    
  • French

ID: ELRA-L0001

ISLRN: 701-907-618-139-4

Entries: more than 400 000 Language: French Format: ASCII with separators Medium: CD-ROM French reusable lexicon for morphological works which produces the canonical form from the inflexional form. This lexicon is divided into the following lexical categories: nouns (55,000), verbs (8,000), a...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
12090.00 € submit
15112.00 € submit
Licence: Commercial Use - ELRA VAR
15112.00 € submit
15112.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
15112.00 € submit
18890.00 € submit
Licence: Commercial Use - ELRA VAR
18890.00 € submit
18890.00 € submit
 DICO-SYNT    
  • French

ID: ELRA-L0003

ISLRN: 823-800-747-350-2

Entries: 90 000 inflectional forms Language: French Format: ASCII Medium: Floppy disk This resource gives the morpho-syntactical information for DICO-MORPH_lemme: proper noun, transitive verb, ... There are around 800 categories of verbs. The lexical categories are: nouns (25,000), verbs (8.000 t...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
8861.00 € submit
11077.00 € submit
Licence: Commercial Use - ELRA VAR
11077.00 € submit
11077.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
11077.00 € submit
13846.00 € submit
Licence: Commercial Use - ELRA VAR
13846.00 € submit
13846.00 € submit
 DixAF (Bilingual Dictionary French Arabic, Arabic French)    
  • Arabic
  • French

ID: ELRA-M0040

ISLRN: 941-284-040-958-7

DixAF (Dictionnaire bilingue français arabe, arabe français - Bilingual Dictionary French Arabic, Arabic French) is a joint ownership of CNRS/ENS lettres et sciences humaines. It was developed by Mr Fathi Debili, a CNRS officer, and it consists of around 125,000 binary links between ca. 43,800 Fr...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
15000.00 € submit
35000.00 € submit
Licence: Commercial Use - ELRA VAR
35000.00 € submit
35000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
18000.00 € submit
41000.00 € submit
Licence: Commercial Use - ELRA VAR
41000.00 € submit
41000.00 € submit
 Dutch-French Lexicon (LanTmark) - Administrative domain    
  • Dutch; Flemish
  • French

ID: ELRA-M0004-02

ISLRN: 380-050-603-364-8

Specialised vocabulary: Administrative Entries: 32000 Format: ASCII format with ISO 8859-1 character set. A lexicon file contains entries with feature value pairs on each line and separators between entries. Medium: Floppy Disk, QIC 150 MB Cartridge Tape Administrative vocabulary is divided i...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3840.00 € submit
30720.00 € submit
Licence: Commercial Use - ELRA VAR
30720.00 € submit
30720.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6400.00 € submit
51200.00 € submit
Licence: Commercial Use - ELRA VAR
51200.00 € submit
51200.00 € submit
 Dutch-French Lexicon (LanTmark) - Data Processing domain    
  • Dutch; Flemish
  • French

ID: ELRA-M0004-03

ISLRN: 370-704-271-586-6

Specialised vocabulary: Data processing Entries: 10000 Format: ASCII format with ISO 8859-1 character set. A lexicon file contains entries with feature value pairs on each line and separators between entries. Medium: Floppy Disk, QIC 150 MB Cartridge Tape Data processing vocabulary has 10 000...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1200.00 € submit
9600.00 € submit
Licence: Commercial Use - ELRA VAR
9600.00 € submit
9600.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
16000.00 € submit
Licence: Commercial Use - ELRA VAR
16000.00 € submit
16000.00 € submit

« Previous | Next »