1681 Language Resources (Page 32 of 85)


 Hellenic Ministry of Foreign Affairs Greek-English announcements corpus (Processed)    
  • English
  • Modern Greek (1453-)

ID: ELRA-W0271

ISLRN: 243-990-404-547-2

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. The Hellenic Ministry of Foreign Affairs Greek-English a...

MEMBER (academic | commercial)
Licence: Other - Open Under-PSI: 0.00 € | 0.00 €
NON MEMBER (academic | commercial)
Licence: Other - Open Under-PSI: 0.00 € | 0.00 €
 Helsinki Corpus of Swahili    
  • Swahili (macrolanguage)

ID: ELRA-W0119

ISLRN: 941-187-059-145-7

This is a 25-million-word text corpus of the Swahili language, annotated for part of speech, morphology and syntax. The corpus contains prose text from the fiction, news media and government document domains, from the period between 1953 and 2016. This package contains: - the Helsinki Corpus of Swa...

MEMBER (academic | commercial)
Licence: Commercial Use - ELRA VAR: 7500.00 € | 7500.00 €
NON MEMBER (academic | commercial)
Licence: Commercial Use - ELRA VAR: 15000.00 € | 15000.00 €
 Hempel    
  • German

ID: ELRA-S0162

ISLRN: 683-410-635-177-8

This corpus contains 3,909 recordings via public phone lines (fixed network only) of 3,909 German speakers with a total of 184,240 spoken words. The contents are free monologues answering the question: "Was haben Sie in der letzten Stunde gemacht?" (What did you do within the last hour?). 25.5 ho...

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 755.00 € | 4755.00 €
Licence: Commercial Use - ELRA VAR: 4755.00 € | 4755.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 1010.00 € | 5010.00 €
Licence: Commercial Use - ELRA VAR: 5010.00 € | 5010.00 €
 Hindi Speech Data by Mobile Phone - 759 Hours    
  • Hindi

ID: ELRA-S0452

ISLRN: 942-490-066-841-8

The data is 759 hours long and was recorded by 1,425 Indian native speakers. The accent is authentic. The recording text is designed by language experts and covers general, interactive, car, home and other categories. The text is manually proofread, and the accuracy is high. Recording devices are...

MEMBER (academic | commercial)
Licence: Commercial Use - ELRA VAR: 115368.00 € | 115368.00 €
NON MEMBER (academic | commercial)
Licence: Commercial Use - ELRA VAR: 115368.00 € | 115368.00 €

Special offers are also available. Check here for details.

 Hindi Speech Data by Mobile Phone_R - 240 Hours    
  • Hindi

ID: ELRA-S0463

ISLRN: 037-729-898-638-1

The data is 240 hours long and was recorded by 401 Indian speakers. It was recorded in both quiet and noisy environments, which better matches actual application scenarios. The recording content is rich, covering economics, entertainment, news, spoken language, etc. All texts are manually transcribed, with...

MEMBER (academic | commercial)
Licence: Commercial Use - ELRA VAR: 34200.00 € | 34200.00 €
NON MEMBER (academic | commercial)
Licence: Commercial Use - ELRA VAR: 34200.00 € | 34200.00 €

Special offers are also available. Check here for details.

 Hindi Speech Recognition Corpus (Desktop)    
  • Hindi

ID: ELRA-S0228-114

ISLRN: 198-341-627-529-5

This corpus was recorded in a quiet office environment over 4 channels and collected from a total of 196 speakers, including 95 males and 101 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news and da...

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 18000.00 € | 18000.00 €
Licence: Commercial Use - ELRA VAR: 18000.00 € | 18000.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 18000.00 € | 18000.00 €
Licence: Commercial Use - ELRA VAR: 18000.00 € | 18000.00 €
 Hindi Speech Recognition Corpus (Mobile)    
  • Hindi

ID: ELRA-S0228-125

ISLRN: 078-014-181-343-9

This corpus was recorded in both quiet and noisy environments over 3 channels and collected from a total of 180 speakers, including 99 males and 81 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news....

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 16200.00 € | 16200.00 €
Licence: Commercial Use - ELRA VAR: 16200.00 € | 16200.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 16200.00 € | 16200.00 €
Licence: Commercial Use - ELRA VAR: 16200.00 € | 16200.00 €
 Hong Kong Cantonese Speech Recognition Corpus (Desktop)    
  • Chinese

ID: ELRA-S0228-75

ISLRN: 083-033-068-532-0

This corpus comprises 101,964 entries uttered by 51 speakers, recorded over 4 channels (desktop). Speech samples are stored as 16-bit, 44.1 kHz sequences, for a total of 24.18 hours of speech per channel.

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 5400.00 € | 5400.00 €
Licence: Commercial Use - ELRA VAR: 5400.00 € | 5400.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 5400.00 € | 5400.00 €
Licence: Commercial Use - ELRA VAR: 5400.00 € | 5400.00 €
 Hong Kong English Speech Recognition Corpus (Mobile)    
  • English

ID: ELRA-S0228-127

ISLRN: 611-015-326-388-0

This corpus was recorded in a quiet office/home environment over 3 channels and collected from a total of 200 speakers, including 99 males and 101 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news, f...

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 18000.00 € | 18000.00 €
Licence: Commercial Use - ELRA VAR: 18000.00 € | 18000.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 18000.00 € | 18000.00 €
Licence: Commercial Use - ELRA VAR: 18000.00 € | 18000.00 €
 How2Sign Dataset      
  • American Sign Language
  • English

ID: ELRA-S0416

ISLRN: 583-408-694-292-6

The How2Sign dataset consists of a parallel corpus of speech and transcriptions of instructional videos and their corresponding American Sign Language (ASL) translation videos and annotations. It has been produced by recording 11 persons (6 males and 5 females) with various hearing status (5 self...

MEMBER (academic | commercial)
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
NON MEMBER (academic | commercial)
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0
 Hungarian Speecon database    
  • Hungarian

ID: ELRA-S0297

ISLRN: 517-412-534-623-4

The Hungarian Speecon database is divided into 2 sets: 1) The first set comprises the recordings of 555 adult Hungarian speakers (280 males, 275 females), recorded over 4 microphone channels in 4 recording environments (office, entertainment, car, public place). 2) The second set comprises th...

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER
Licence: Commercial Use - ELRA VAR
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER
Licence: Commercial Use - ELRA VAR
 Hydrogeology database    
  • English
  • French

ID: ELRA-T0363

ISLRN: 010-078-480-514-3

400 terms, 275 definitions in French, 297 definitions in English. French-English hydrogeology terminology extracted from "Le forage d'eau - réalisation, entretien, réhabilitation", Michel DETAY, pp. 379, Masson, Paris, 1993, compiled following translation into English by Dr M.S.N. CARPENTER (“Water ...

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 850.00 € | 1190.00 €
Licence: Commercial Use - ELRA VAR: 1190.00 € | 1190.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 1190.00 € | 1700.00 €
Licence: Commercial Use - ELRA VAR: 1700.00 € | 1700.00 €
 IBNC - An Italian Broadcast News Corpus    
  • Italian

ID: ELRA-S0093

ISLRN: 133-155-327-792-1

The Italian Broadcast News Corpus (IBNC) was produced by ITC-IRST (Italy) with funding from ELRA in the framework of the European Commission project LRsP&P (Language Resources Production & Packaging - LE4-8335). RAI, the major Italian broadcasting company, supplied studio quality recordings...

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 5000.00 € | 15000.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 8000.00 € | 25000.00 €
 ICE-GB (British English component of the International Corpus of English)    
  • English

ID: ELRA-W0021

ISLRN: 046-881-902-857-2

ICE-GB is the British component of the International Corpus of English (ICE). ICE began in 1990 with the primary aim of providing material for comparative studies of varieties of English throughout the world. Twenty centres around the world are preparing corpora of their own national or regional ...

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 780.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 1500.00 €
 IDIOLOGOS 1 “Bootstrap” (NEOLOGOS Project)    
  • French

ID: ELRA-S0226-01

ISLRN: 777-240-321-917-2

The IDIOLOGOS 1 “Bootstrap” database was produced within the French national project NEOLOGOS, as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The databases produced in the framework of the NEOLOGOS project are designed for the developm...

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 1000.00 € | 10000.00 €
Licence: Commercial Use - ELRA VAR: 10000.00 € | 10000.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 1000.00 € | 16000.00 €
Licence: Commercial Use - ELRA VAR: 16000.00 € | 16000.00 €

This resource is also available in a bundle. Check here for bundled pricing.

Special offers are also available. Check here for details.

 IDIOLOGOS 2 “Eingenspeakers” (NEOLOGOS Project)    
  • French

ID: ELRA-S0226-02

ISLRN: 377-605-098-134-9

The IDIOLOGOS 2 “Eingenspeakers” database was produced within the French national project NEOLOGOS, as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The databases produced in the framework of the NEOLOGOS project are designed for the dev...

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 1000.00 € | 15000.00 €
Licence: Commercial Use - ELRA VAR: 15000.00 € | 15000.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 1000.00 € | 24000.00 €
Licence: Commercial Use - ELRA VAR: 24000.00 € | 24000.00 €

This resource is also available in a bundle. Check here for bundled pricing.

Special offers are also available. Check here for details.

 Idioms French-Vietnamese Dictionary    
  • French
  • Vietnamese

ID: ELRA-T0383

ISLRN: 167-512-984-991-8

A French-Vietnamese idioms dictionary of 448 entries in XML format, with French terms translated into Vietnamese and one idiomatic sentence per Vietnamese word.

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 0.00 € | 0.00 €
Licence: Commercial Use - ELRA VAR: 0.00 € | 0.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 0.00 € | 0.00 €
Licence: Commercial Use - ELRA VAR: 0.00 € | 0.00 €
 ILC Italian Morphological Lexicon    
  • Italian

ID: ELRA-L0006

ISLRN: 965-829-467-456-4

The ILC Italian Morphological Lexicon consists of a set of lemmas/lexical entries (about 60,000) with the corresponding inflected word-forms, and a morphological engine for morphological analysis and generation. Lemmas and word-forms are encoded with grammatical codes compatible with the EAGLES r...

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 4000.00 € | 12000.00 €
Licence: Commercial Use - ELRA VAR: 12000.00 € | 12000.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 8000.00 € | 20000.00 €
Licence: Commercial Use - ELRA VAR: 20000.00 € | 20000.00 €
 ILE: Italian LExicon      
  • Italian

ID: ELRA-S0059

ISLRN: 052-156-999-928-3

ILE is an Italian lexicon of 588,000 entries transcribed in SAMPA notation. It was generated, mainly for speech recognition purposes, by means of a morphological analyzer handling more than 100,000 morphemes, each of them transcribed and manually checked. Each stem was combined with all its possibl...

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 3000.00 € | 12000.00 €
Licence: Commercial Use - ELRA VAR: 12000.00 € | 12000.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 6000.00 € | 18000.00 €
Licence: Commercial Use - ELRA VAR: 18000.00 € | 18000.00 €
 ILPho phonetic lexicon      
  • French

ID: ELRA-S0163

ISLRN: 779-878-863-649-8

The ILPho database is a phonetic lexicon which contains 39,000 lemmas (319,318 entries). It is distributed in two formats. The first format is compact and corresponds to an easy extension of the text format in which the Multext lexicons (ref. ELRA-L0010) (Ide and Veronis, 1994) are distributed, by...

MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 100.00 € | 2500.00 €
Licence: Commercial Use - ELRA VAR: 2500.00 € | 2500.00 €
NON MEMBER (academic | commercial)
Licence: Non Commercial Use - ELRA END USER: 100.00 € | 2500.00 €
Licence: Commercial Use - ELRA VAR: 2500.00 € | 2500.00 €
