Text (1052)
Audio (679)
Video (23)
True (226)
TEI (10)
TMX (6)

Resource Type:

Corpus:
Lexical/Conceptual:
Tool/Service:
Language Description:

Media Type:

Text:
Audio:
Image:
Video:
Text Numerical:
Text N-Gram:

1681 Language Resources (Page 65 of 85)

« Previous | Next »Order by:

 THAMUS Generic Italian Dictionary - canonical forms    
  • Italian

ID: ELRA-L0013-01

ISLRN: 273-356-234-601-1

A Generic monolingual Italian dictionary of 87,000 canonical forms. Multi-word terms contain morphological coding for the headword.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
19140.00 € submit
47850.00 € submit
Licence: Commercial Use - ELRA VAR
47850.00 € submit
47850.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
20880.00 € submit
52200.00 € submit
Licence: Commercial Use - ELRA VAR
52200.00 € submit
52200.00 € submit
 THAMUS. Generic Italian Dictionary - canonical forms - technical domain    
  • Italian

ID: ELRA-L0013-03

ISLRN: 974-347-456-124-9

A Generic monolingual Italian dictionary of 48,000 canonical forms (Technical). Multi-word terms contain morphological coding for the headword.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
10560.00 € submit
26400.00 € submit
Licence: Commercial Use - ELRA VAR
26400.00 € submit
26400.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
11520.00 € submit
28800.00 € submit
Licence: Commercial Use - ELRA VAR
28800.00 € submit
28800.00 € submit
 THAMUS. Generic Italian Dictionary - inflected forms    
  • Italian

ID: ELRA-L0013-02

ISLRN: 832-758-690-732-9

A Generic monolingual Italian dictionary of 612,000 inflected forms. Multi-word terms contain morphological coding for the headword.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
135080.00 € submit
336600.00 € submit
Licence: Commercial Use - ELRA VAR
336600.00 € submit
336600.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
147360.00 € submit
367200.00 € submit
Licence: Commercial Use - ELRA VAR
367200.00 € submit
367200.00 € submit
 THAMUS. Generic Italian Dictionary - inflected forms - technical domain    
  • Italian

ID: ELRA-L0013-04

ISLRN: 824-935-665-423-8

A Generic monolingual Italian dictionary of 96,000 inflected forms (Technical). Multi-word terms contain morphological coding for the headword.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
21120.00 € submit
52800.00 € submit
Licence: Commercial Use - ELRA VAR
52800.00 € submit
52800.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
23040.00 € submit
57600.00 € submit
Licence: Commercial Use - ELRA VAR
57600.00 € submit
57600.00 € submit
 The CINTIL Corpus – International Corpus of Portuguese    
  • Portuguese

ID: ELRA-W0050

ISLRN: 176-775-844-396-0

CINTIL-Corpus Internacional do Português is a linguistically interpreted written and spoken corpus of European Portuguese. It is composed of one million annotated tokens, each one of which verified by human expert annotators. The annotation comprises information on part-of-speech, open class lemm...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
10000.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
15000.00 € submit
Licence: Commercial Use - ELRA VAR
15000.00 € submit
15000.00 € submit
 The CLEF Test Suite for the CLEF 2000-2003 Campaigns – Evaluation Package    
  • Dutch; Flemish
  • English
  • Finnish
  • French
  • German
  • Italian
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swedish

ID: ELRA-E0008

ISLRN: 317-005-302-361-6

The CLEF Test Suite contains the data used for the main tracks of the CLEF campaigns carried out from 2000 to 2003: Multilingual text retrieval, Bilingual text retrieval, Monolingual text retrieval, and Domain-specific text retrieval. The CLEF Test Suite is composed of: • The multilingual docum...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
150.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
300.00 € submit
1000.00 € submit

Special offers are also available. Check here for details.

 The Coimisineir Teanga Bilingual Corpus of Reference Documents (Processed)    
  • English
  • Irish

ID: ELRA-W0224

ISLRN: 067-439-806-269-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. General Reference content from the Language Commissioner...

MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
 The Coimisineir Teanga Bilingual Corpus of Reports and Press Releases (Processed)    
  • English
  • Irish

ID: ELRA-W0230

ISLRN: 440-182-838-797-0

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Reports and Press Release data from the Language Commiss...

MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
 The Croatian-English corpus with the nature protection strategy of Croatia (Processed)    
  • Croatian
  • English

ID: ELRA-W0296

ISLRN: 250-662-686-256-3

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. The Croatian-English corpus with The Nature Protection S...

MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
 The EMILLE/CIIL Corpus    
  • Assamese
  • Bengali
  • English
  • Gujarati
  • Hindi
  • Kannada
  • Kashmiri
  • Malayalam
  • Marathi
  • Oriya (macrolanguage)
  • Panjabi; Punjabi
  • Sinhala; Sinhalese
  • Tamil
  • Telugu
  • Urdu

ID: ELRA-W0037

ISLRN: 039-846-040-604-0

The EMILLE/CIIL Corpus consists of three components: monolingual, parallel and annotated corpora. There are fourteen monolingual corpora, including both written and (for some languages) spoken data for fourteen South Asian languages: Assamese, Bengali, Gujarati, Hindi, Kannada, Kashmiri, Malayala...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
 The EMILLE Lancaster Corpus    
  • Bengali
  • English
  • Gujarati
  • Hindi
  • Panjabi; Punjabi
  • Sinhala; Sinhalese
  • Tamil
  • Urdu

ID: ELRA-W0038

ISLRN: 438-045-014-925-0

The EMILLE Lancaster Corpus consists of three components: monolingual, parallel and annotated corpora. There are monolingual corpora for seven South Asian languages: Bengali, Gujarati, Hindi, Punjabi, Sinhala, Tamil, Urdu. The EMILLE monolingual corpora contain approximately 58,880,000 words (i...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
7500.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
12000.00 € submit
 The FAME! Speech Corpus    
  • Dutch; Flemish
  • Western Frisian

ID: ELRA-S0391

ISLRN: 340-994-352-616-4

The components of the Frisian data collection are speech and language resources gathered for building a large vocabulary ASR system for the Frisian language. Firstly, a new broadcast database is created by collecting recordings from the archives of the regional broadcaster Omrop Fryslân, and ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
1500.00 € submit
Licence: Commercial Use - ELRA VAR
1500.00 € submit
1500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
3500.00 € submit
Licence: Commercial Use - ELRA VAR
3500.00 € submit
3500.00 € submit
 The Gaois bilingual corpus of English-Irish legislation (Processed)    
  • English
  • Irish

ID: ELRA-W0223

ISLRN: 881-570-220-966-0

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual corpus of English-Irish legislation provided b...

MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
 The HIWIRE database, a noisy and non-native English speech corpus for cockpit communication    
  • English

ID: ELRA-S0293

ISLRN: 934-733-835-065-0

This database has been collected and packaged under the auspices of the IST-EU STREP project HIWIRE (Human Input that Works In Real Environments). The database was designed to be used as a tool for development and test of speech processing and recognition techniques dealing with robust non-native...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
50.00 € submit
50.00 € submit
Licence: Commercial Use - ELRA VAR
2500.00 € submit
2500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
50.00 € submit
50.00 € submit
Licence: Commercial Use - ELRA VAR
3500.00 € submit
3500.00 € submit
 The Lancaster Corpus of Mandarin Chinese (LCMC)    
  • Chinese

ID: ELRA-W0039

ISLRN: 990-638-120-277-2

The Lancaster Corpus of Mandarin Chinese (LCMC) is designed as a Chinese match for the FLOB and FROWN corpora for modern British and American English. The corpus is suitable for use in both monolingual research into modern Mandarin Chinese and cross-linguistic contrast of Chinese and British/Am...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
7500.00 € submit
Licence: Commercial Use - ELRA VAR
7500.00 € submit
7500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
12000.00 € submit
Licence: Commercial Use - ELRA VAR
12000.00 € submit
12000.00 € submit
 The MWN.PT - MultiWordnet of Portuguese    
  • English
  • Hebrew
  • Italian
  • Latin
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Spanish; Castilian

ID: ELRA-M0050

ISLRN: 431-556-012-743-8

MWN.PT - MultiWordnet of Portuguese (version 1) spans over 17,200 manually validated concepts/synsets, linked under the semantic relations of hyponymy and hypernymy. These concepts are made of over 21,000 word senses/word forms and 16,000 lemmas from both European and American variants of Portugu...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
100.00 € submit
1000.00 € submit
Licence: Commercial Use - ELRA VAR
1500.00 € submit
1500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
200.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
 The Oxford Spanish Dictionary    
  • Spanish; Castilian

ID: ELRA-L0061

ISLRN: 846-924-011-331-6

The highly-acclaimed Oxford Spanish Dictionary, described by John Butt in the TLS (Times Literary Supplement) as “indispensable for all serious Hispanists”, provides an authoritative, up-to-date guide to world Spanish. It is the only Spanish dictionary to present the full wealth of Spanish from b...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6125.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
8750.00 € submit
 The "SIVA" Speech Database for Speaker Verification and Identification    
  • Italian

ID: ELRA-S0028

ISLRN: 405-085-829-227-5

The Italian speech database SIVA (?Speaker Identification and Verification Archives: SIVA?), is a database comprising more than two thousands calls, collected over the public switched telephone network, and available very soon via ELRA. The SIVA database consists of four speaker categories: male...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1000.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3000.00 € submit
4500.00 € submit
Licence: Commercial Use - ELRA VAR
4500.00 € submit
4500.00 € submit
 Toponymic Geography    
  • French

ID: ELRA-T0100

ISLRN: 750-292-888-530-1

Entries: 70 000 Languages: French Format: ASCII Medium: Floppy disk or CD-ROM Card Description: About 70,000 structured toponyms with two main sets of information: nature (cities, towns, rivers, plains...) and localisation. It is possible to extract terms associated with toponyms ("japonais" - "J...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6794.00 € submit
8492.00 € submit
Licence: Commercial Use - ELRA VAR
8492.00 € submit
8492.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
8492.00 € submit
10615.00 € submit
Licence: Commercial Use - ELRA VAR
10615.00 € submit
10615.00 € submit
 T-PAS    
  • Croatian
  • English

ID: ELRA-M0109

ISLRN: 432-666-503-743-8

T-PAS (Typed Predicate Argument Structures) is a digital lexicon consisting of a corpus-derived collection of Italian verb argument structures, whose arguments have been manually annotated with a set of hierarchically organised semantic labels called Semantic Types. T-PAS is primarily tailored f...

MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
0.00 € submit
0.00 € submit

« Previous | Next »