93 Language Resources (Page 2 of 5)

 German SpeechDat-Car    
  • German

ID: ELRA-S0122

ISLRN: 090-326-240-961-9

The German SpeechDat-Car database comprises 338 German speakers recorded over the mobile telephone network. This database is partitioned into 17 DVDs and 1 CD. The speech databases made within the SpeechDat-Car project were validated by SPEX, the Netherlands, to assess their compliance with the S...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  90000.00 € / 90000.00 €
Licence: Commercial Use - ELRA VAR
  90000.00 € / 90000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  120000.00 € / 120000.00 €
Licence: Commercial Use - ELRA VAR
  120000.00 € / 120000.00 €
 German SpeechDat(II) MDB-1000    
  • German

ID: ELRA-S0096

ISLRN: 222-030-211-822-7

The German SpeechDat(II) MDB-1000 comprises 1295 German speakers (663 males, 610 females, 22 speakers with gender not specified) recorded over the German mobile telephone network. The MDB-1000 database is partitioned into 8 CDs in ISO 9660 format. The speech databases made within the SpeechDat(I...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  20000.00 € / 28000.00 €
Licence: Commercial Use - ELRA VAR
  28000.00 € / 28000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  25000.00 € / 35000.00 €
Licence: Commercial Use - ELRA VAR
  35000.00 € / 35000.00 €
 German Speech Recognition Corpus (Desktop)    
  • German

ID: ELRA-S0228-81

ISLRN: 498-659-051-445-2

This corpus comprises 51,912 entries uttered by 52 speakers (26 males and 26 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as 16-bit, 48 kHz sequences, for a total of 23.9 hours of speech per channel.

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  6000.00 € / 6000.00 €
Licence: Commercial Use - ELRA VAR
  6000.00 € / 6000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  6000.00 € / 6000.00 €
Licence: Commercial Use - ELRA VAR
  6000.00 € / 6000.00 €
 German Speecon database    
  • German

ID: ELRA-S0216

ISLRN: 578-285-282-393-9

The German Speecon database is divided into 2 sets: 1) The first set comprises the recordings of 562 adult German speakers (272 males, 290 females), recorded over 4 microphone channels in 4 recording environments (office, entertainment, car, public place). 2) The second set comprises the record...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  50000.00 € / 67000.00 €
Licence: Commercial Use - ELRA VAR
  67000.00 € / 67000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  60000.00 € / 75000.00 €
Licence: Commercial Use - ELRA VAR
  75000.00 € / 75000.00 €
 GlobalPhone 2000 Speaker Package    
  • Arabic
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • French
  • German
  • Hausa
  • Japanese
  • Korean
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tamil
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese

ID: ELRA-S0400

ISLRN: 331-592-378-424-7

The GlobalPhone 2000 Speaker Package contains transcribed read speech spoken by 2000 native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Mandarin (ELRA-S0193), C...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  1200.00 € / 6000.00 €
Licence: Commercial Use - ELRA VAR
  6000.00 € / 6000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  1400.00 € / 7200.00 €
Licence: Commercial Use - ELRA VAR
  7200.00 € / 7200.00 €
 GlobalPhone German    
  • German

ID: ELRA-S0198

ISLRN: 937-733-002-847-8

The GlobalPhone corpus developed in collaboration with the Karlsruhe Institute of Technology (KIT) was designed to provide read speech data for the development and evaluation of large continuous speech recognition systems in the most widespread languages of the world, and to provide a uniform, mu...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  600.00 € / 3000.00 €
Licence: Commercial Use - ELRA VAR
  3000.00 € / 3000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  700.00 € / 3600.00 €
Licence: Commercial Use - ELRA VAR
  3600.00 € / 3600.00 €

Special offers are also available.

 GlobalPhone Multilingual Model Package    
  • Arabic
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • French
  • German
  • Hausa
  • Japanese
  • Korean
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tamil
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese

ID: ELRA-S0399

ISLRN: 204-945-263-927-6

The GlobalPhone Multilingual Model Package contains about 22 hours of transcribed read speech spoken by native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Manda...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  1200.00 € / 6000.00 €
Licence: Commercial Use - ELRA VAR
  6000.00 € / 6000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  1400.00 € / 7200.00 €
Licence: Commercial Use - ELRA VAR
  7200.00 € / 7200.00 €
 Hempel    
  • German

ID: ELRA-S0162

ISLRN: 683-410-635-177-8

This corpus contains 3,909 recordings via public phone lines (fixed network only) of 3,909 German speakers with a total of 184,240 spoken words. The contents are free monologues answering the question: "Was haben Sie in der letzten Stunde gemacht?" (What did you do within the last hour?). 25.5 ho...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  755.00 € / 4755.00 €
Licence: Commercial Use - ELRA VAR
  4755.00 € / 4755.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  1010.00 € / 5010.00 €
Licence: Commercial Use - ELRA VAR
  5010.00 € / 5010.00 €
 Karl May Korpus (KMK)    
  • German

ID: ELRA-W0016

ISLRN: 628-817-117-400-1

The "Karl-May-Korpus" is a monolingual German corpus, available in an SGML-tagged ASCII text format. It contains the works of the German author Karl May (1842-1912) and consists of around 1.6 million words (divided into 9 subcorpora of about 180,000 words each). The corpus was created between 199...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  400.00 € / 2500.00 €
Licence: Commercial Use - ELRA VAR
  2500.00 € / 2500.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  800.00 € / 3500.00 €
Licence: Commercial Use - ELRA VAR
  3500.00 € / 3500.00 €
 MIST Multi-lingual Interoperability in Speech Technology database    
  • Dutch; Flemish
  • English
  • French
  • German

ID: ELRA-S0238

ISLRN: 189-835-264-931-4

In 1996, some 75 Dutch people participated in recording a multi-purpose continuous speech database. Most of them were recruited from the TNO Human Factors Research Institute, where the recordings were made. The main part of the database consisted of Dutch sentences. However, most speakers partici...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  0.00 € / 400.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  0.00 € / 500.00 €
 MLCC Multilingual and Parallel Corpora    
  • Danish
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Portuguese
  • Spanish; Castilian

ID: ELRA-W0023

ISLRN: 963-635-729-341-8

The MLCC text corpus has two main components: one set to allow comparable studies to be carried out in different languages and one set as the basis for translation studies. The first set is referred to as the Polylingual Document Collection, a collection of newspaper articles from financial new...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  0.00 € / 1600.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  0.00 € / 3600.00 €
 MTP Annotated German corpus - tagged version    
  • German

ID: ELRA-W0008-02

ISLRN: 173-651-658-528-0

This morphosyntactically annotated 500,000 word German corpus was developed as part of the Münster Tagging Project (MTP). It comprises a collection of SGML-formatted texts from two German newspapers, "Die Frankfurter Allgemeine Zeitung" and "Die Zeit", for the years 1990 to 1992. The articles ref...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  8000.00 € / 8000.00 €
Licence: Commercial Use - ELRA VAR
  8000.00 € / 8000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  12000.00 € / 12000.00 €
Licence: Commercial Use - ELRA VAR
  12000.00 € / 12000.00 €
 MTP Annotated German corpus - untagged version    
  • German

ID: ELRA-W0008-01

ISLRN: 417-827-623-669-9

This morphosyntactically annotated 500,000 word German corpus was developed as part of the Münster Tagging Project (MTP). It comprises a collection of SGML-formatted texts from two German newspapers, "Die Frankfurter Allgemeine Zeitung" and "Die Zeit", for the years 1990 to 1992. The articles ref...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  2000.00 € / 2000.00 €
Licence: Commercial Use - ELRA VAR
  2000.00 € / 2000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  3500.00 € / 3500.00 €
Licence: Commercial Use - ELRA VAR
  3500.00 € / 3500.00 €
 MULTEXT JOC Corpus    
  • English
  • French
  • German
  • Italian
  • Spanish; Castilian

ID: ELRA-W0017

ISLRN: 900-482-746-635-0

This CD-ROM contains a part of the corpus developed in the MULTEXT project financed by the European Commission (LRE 62-050). This part contains raw, tagged and aligned data from the Written Questions and Answers of the Official Journal of the European Community. The corpus contains approx. 5 mill...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  0.00 € / 2000.00 €
Licence: Commercial Use - ELRA VAR
  2000.00 € / 2000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  0.00 € / 5000.00 €
Licence: Commercial Use - ELRA VAR
  5000.00 € / 5000.00 €
 MULTEXT Prosodic database    
  • English
  • French
  • German
  • Italian
  • Spanish; Castilian

ID: ELRA-S0060

ISLRN: 098-719-242-965-4

This database comprises one CD-ROM for each of five languages (French, English, Italian, German and Spanish), totalling 4 hours and 20 minutes of speech and involving 50 different speakers (5 male and 5 female per language). The recordings on which the corpus is based consist of passages of about f...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  45.00 € / 2000.00 €
Licence: Commercial Use - ELRA VAR
  2000.00 € / 2000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  100.00 € / 5000.00 €
Licence: Commercial Use - ELRA VAR
  5000.00 € / 5000.00 €
 Nautilus Speaker Characterization (NSC) Corpus    
  • German

ID: ELRA-S0395

ISLRN: 157-037-166-491-1

The Nautilus Speaker Characterization (NSC) Corpus comprises clean microphone recordings of conversational speech from 300 German speakers (126 males and 174 females) aged 18 to 35 years, with no marked dialect/accent. The recordings were performed in the acoustically-isolated room "Nautilus" (wh...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  0.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  0.00 €
 PHONDAT 1 - PD1 (2nd edition)    
  • German

ID: ELRA-S0023

ISLRN: 776-688-402-560-8

The corpus contains read speech of 201 different speakers. Each speaker has read a subcorpus of 450 different sentences (including alphanumericals and two short passages of prose text); 8 speakers have read the whole sentence corpus. The speakers were recorded at four different sites in Germany (...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  511.29 € / 7669.38 €
Licence: Commercial Use - ELRA VAR
  7669.38 € / 7669.38 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  1022.58 € / 9203.25 €
Licence: Commercial Use - ELRA VAR
  9203.25 € / 9203.25 €
 PHONDAT 2 - PD2 (2nd edition)    
  • German

ID: ELRA-S0024

ISLRN: 937-744-173-899-5

The corpus contains read speech of 16 different speakers. Each speaker has read a corpus of 200 different sentences from a train inquiry task. The speakers were recorded at three different sites in Germany (University of Kiel, University of Bonn, University of Munich). The language is German. The...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  127.82 € / 127.82 €
Licence: Commercial Use - ELRA VAR
  127.82 € / 127.82 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  255.65 € / 255.65 €
Licence: Commercial Use - ELRA VAR
  255.65 € / 255.65 €
 RVG1 (Regional Variants of German 1, Part 1)    
  • German

ID: ELRA-S0058

ISLRN: 109-219-623-443-2

The corpus consists of single digits, connected digits, phone numbers, phonetically balanced sentences, computer command phrases and spontaneous speech. Each speaker has read a subcorpus of 85 items:
  * 11 single digits (0-9, with the two pronunciations of 2 (`zwei', `zwo')),
  * 19 connect...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  7669.38 € / 11759.71 €
Licence: Commercial Use - ELRA VAR
  11759.71 € / 11759.71 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
  11759.71 € / 16872.63 €
Licence: Commercial Use - ELRA VAR
  16872.63 € / 16872.63 €
