237 Language Resources (Page 1 of 12)

 2006 CoNLL Shared Task - Ten Languages    
  • Bulgarian
  • Danish
  • Dutch; Flemish
  • German
  • Japanese
  • Portuguese
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • Turkish

ID: ELRA-W0086

ISLRN: 578-227-532-044-0

2006 CoNLL Shared Task - Ten Languages consists of dependency treebanks in ten languages used as part of the CoNLL 2006 shared task on multi-lingual dependency parsing. The languages covered in this release are: Bulgarian, Danish, Dutch, German, Japanese, Portuguese, Slovene, Spanish, Swedish and...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 0.00 € / 0.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 0.00 € / 0.00 €
 aGender    
  • German

ID: ELRA-S0365

ISLRN: 038-476-412-610-4

aGender contains speech sample recordings over public telephone lines with read and (semi-)spontaneous speech. Native German speakers called a voice portal from their private phone, read text, and answered some open questions. The purpose of the corpus is the automatic detection of gender and/or...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 327.00 € / 8127.00 €
Licence: Commercial Use - ELRA VAR: 8127.00 € / 8127.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 455.00 € / 8255.00 €
Licence: Commercial Use - ELRA VAR: 8255.00 € / 8255.00 €
 Alcohol Language Corpus (BAS ALC)    
  • German

ID: ELRA-S0299

ISLRN: 780-368-852-139-3

ALC contains recordings of German speakers who are either intoxicated or sober. The type of speech ranges from read single digits to full conversation style. Recordings were made during a drinking test in which speakers drank beer or wine to reach a self-chosen level of alcoholic intoxication. The ac...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 510.00 € / 510.00 €
Licence: Commercial Use - ELRA VAR: 510.00 € / 510.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 1020.00 € / 1020.00 €
Licence: Commercial Use - ELRA VAR: 1020.00 € / 1020.00 €
 ANITA (Audio eNhancement In Telecom Applications)    
  • English
  • French
  • German
  • Spanish; Castilian

ID: ELRA-S0156

ISLRN: 537-894-870-719-4

ANITA (Audio eNhancement In secured Telecommunication Applications) is a European project launched on the initiative of EADS TELECOM with the objective of reducing audio acoustics noise in secured communications in adverse environments (sirens, alarms, engines, water pumps, stress situations, etc...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 1000.00 € / 2000.00 €
Licence: Commercial Use - ELRA VAR: 2000.00 € / 2000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 1500.00 € / 2500.00 €
Licence: Commercial Use - ELRA VAR: 2500.00 € / 2500.00 €
 ARCADE II Evaluation Package    
  • Arabic
  • Chinese
  • English
  • French
  • German
  • Italian
  • Japanese
  • Modern Greek (1453-)
  • Persian
  • Russian
  • Spanish; Castilian

ID: ELRA-E0018

ISLRN: 875-865-064-331-9

The ARCADE II Evaluation Package was produced within the French national project ARCADE II (Evaluation of parallel text alignment systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The ARCADE II project made it possible to carry out a cam...

MEMBER (academic / commercial)
Licence: Evaluation Use - ELRA EVALUATION: 150.00 € / 500.00 €
NON MEMBER (academic / commercial)
Licence: Evaluation Use - ELRA EVALUATION: 300.00 € / 1000.00 €
 AURORA Project database - Subset of SpeechDat-Car - German database - Evaluation Package    
  • German

ID: ELRA-AURORA-CD0003-03

ISLRN: 613-751-084-730-0

The Aurora project was originally set up to establish a world wide standard for the feature extraction software which forms the core of the front-end of a DSR (Distributed Speech Recognition) system. ETSI formally adopted this activity as work items 007 and 008. The two work items within ETSI ar...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 200.00 € / 1000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 200.00 € / 1000.00 €
 Austrian SpeechDat(AT) FDB-1000 database    
  • German

ID: ELRA-S0142

ISLRN: 989-950-794-642-6

The SpeechDat(AT) FDB-1000 database contains the recordings of 1,000 Austrian speakers (544 males, 456 females) recorded over the Austrian fixed telephone network. The database is partitioned into 5 CD-ROMs, in ISO 9660 format. Speech samples are stored as sequences of 8-bit 8 kHz A-law, uncompr...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 16000.00 € / 20000.00 €
Licence: Commercial Use - ELRA VAR: 20000.00 € / 20000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 20000.00 € / 25000.00 €
Licence: Commercial Use - ELRA VAR: 25000.00 € / 25000.00 €

This resource is also available in a bundle.

 Austrian SpeechDat(AT) MDB-1000 database    
  • German

ID: ELRA-S0143

ISLRN: 294-112-593-120-6

The Austrian SpeechDat(AT) MDB-1000 database contains the recordings of 1,000 Austrian speakers (543 males, 457 females) recorded over the Austrian mobile telephone network. The database is partitioned into 5 CD-ROMs, in ISO 9660 format. Speech samples are stored as sequences of 8-bit 8 kHz A-la...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 24000.00 € / 30000.00 €
Licence: Commercial Use - ELRA VAR: 30000.00 € / 30000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 30000.00 € / 35000.00 €
Licence: Commercial Use - ELRA VAR: 35000.00 € / 35000.00 €

This resource is also available in a bundle.

 Automobile Engineering    
  • English
  • French
  • German
  • Spanish; Castilian

ID: ELRA-T0097

ISLRN: 536-306-764-088-7

Cards available: 1420
Languages: German, English, French, Spanish
Card Description: Each card in this terminological database contains a definition, relation between concepts, graphics, abbreviations, notes, sub-domains, sources, grammatical labels.

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 1746.60 € / 1746.60 €
Licence: Commercial Use - ELRA VAR: 1746.60 € / 1746.60 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 2911.00 € / 2911.00 €
Licence: Commercial Use - ELRA VAR: 2911.00 € / 2911.00 €
 BAS GEO1      
  • German

ID: ELRA-S0164

ISLRN: 853-731-110-167-7

BAS GEO1 is a simple database about the most important location names of Germany, Austria and Switzerland together with their canonical pronunciation coded in SAMPA. BAS GEO1 may be used as a basis for automatic speech recognition of German postal addresses or to feed a speech synthesis algorithm...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 172.82 € / 1400.00 €
Licence: Commercial Use - ELRA VAR: 1400.00 € / 1400.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 255.65 € / 2800.00 €
Licence: Commercial Use - ELRA VAR: 2800.00 € / 2800.00 €
 Basic multilingual lexicon (MEMODATA)    
  • English
  • French
  • German
  • Italian
  • Spanish; Castilian

ID: ELRA-M0001

ISLRN: 874-922-751-076-4

Entries: 30 000 each language
Languages: French, English, Italian, German, Spanish
Format: ASCII or ANSI with separators between entries
Medium: CD-ROM
The words are associated by the meaning. The lexical categories are: nouns (5 * 18 000), verbs (5 * 8 000), adjectives (5 * 6 000), adverbs (5 * ...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 8861.00 € / 11077.00 €
Licence: Commercial Use - ELRA VAR: 11077.00 € / 11077.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 11077.00 € / 13846.00 €
Licence: Commercial Use - ELRA VAR: 13846.00 € / 13846.00 €
 BAS PHATT 1.0.X (sub-set)    
  • German

ID: ELRA-S0282-01

ISLRN: 704-844-083-488-7

The Ph@ttSessionz speech database, funded by the German Ministry of Science and Education (BMBF), contains recordings of 864 adolescent speakers of German (age range 12-20). The recordings were performed via the WWW in public schools (Gymnasium) in 41 locations in Germany. The speech material rec...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 512.00 € / 2512.00 €
Licence: Commercial Use - ELRA VAR: 2512.00 € / 2512.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 512.00 € / 3512.00 €
Licence: Commercial Use - ELRA VAR: 3512.00 € / 3512.00 €
 BAS PHATT 1.1.X (complete corpus)    
  • German

ID: ELRA-S0282-02

ISLRN: 847-046-185-654-8

The Ph@ttSessionz speech database, funded by the German Ministry of Science and Education (BMBF), contains recordings of 864 adolescent speakers of German (age range 12-20). The recordings were performed via the WWW in public schools (Gymnasium) in 41 locations in Germany. The speech material rec...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 1917.30 € / 3917.30 €
Licence: Commercial Use - ELRA VAR: 3917.30 € / 3917.30 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 3834.75 € / 6834.75 €
Licence: Commercial Use - ELRA VAR: 6834.75 € / 6834.75 €
 Bilingual Collocational Dictionary (Horst Bogatz)    
  • English
  • German

ID: ELRA-M0013

ISLRN: 770-934-073-250-5

The bilingual English-German collocational dictionary consists of around 69,000 English headwords, including concepts expressed by more than one word (e.g. "environmental awareness" (German:"Umweltbewusstsein") or "maximum level of possible taxation" (German:"steuerliche Belastbarkeit")) and comp...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 210.00 € / 210.00 €
Licence: Commercial Use - ELRA VAR: 210.00 € / 210.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 300.00 € / 300.00 €
Licence: Commercial Use - ELRA VAR: 300.00 € / 300.00 €
 Bitext Lexical Dataset - German    
  • German

ID: ELRA-L0143

ISLRN: 384-783-627-727-2

The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - German consists of 100,000 lemmas (2,500,000 form...

MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR: 67000.00 €
NON MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR: 67000.00 €
 Bitext Lexical Dataset - Language Variants - German    
  • German

ID: ELRA-L0157

ISLRN: 423-414-945-503-9

As a complement to the generic vocabulary provided in ELRA-L0143, language variants of German are provided with the following features: Tense, Mood, Person, Number, Gender, Case, Degree, Contraction. Variants are distributed as follows:
- German (Germany): 101,000 lemmas / 2,600,000 forms
- G...

MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR: 74000.00 €
NON MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR: 74000.00 €
 BITS Logatome Synthesis Corpus – BITS-LG    
  • German

ID: ELRA-S0217

ISLRN: 887-235-135-658-7

BITS stands for "BAS Infrastructures for Technical Speech Processing" and was funded by the German Ministry of Science and Education during 2003-2005. The BITS synthesis corpus consists of two parts: a set of logatome recordings for controlled diphone synthesis (ELRA-S0217) and a set of sentence ...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 627.17 € / 4627.17 €
Licence: Commercial Use - ELRA VAR: 4627.17 € / 4627.17 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 754.35 € / 9000.00 €
Licence: Commercial Use - ELRA VAR: 9000.00 € / 9000.00 €
 BITS Unit Selection Synthesis Corpus    
  • German

ID: ELRA-S0224

ISLRN: 553-776-339-039-5

BITS stands for "BAS Infrastructures for Technical Speech Processing" and was funded by the German Ministry of Science and Education during 2003-2005. The BITS synthesis corpus consists of two parts: a set of logatome recordings for controlled diphone synthesis (ELRA-S0217) and a set of sentenc...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 627.17 € / 4627.17 €
Licence: Commercial Use - ELRA VAR: 4627.17 € / 4627.17 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER: 754.35 € / 9000.00 €
Licence: Commercial Use - ELRA VAR: 9000.00 € / 9000.00 €
 BMI Brochures 2011-2015 (Processed)    
  • English
  • German

ID: ELRA-W0200

ISLRN: 886-938-216-393-3

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English translations of German BMI brochures from the la...

MEMBER (academic / commercial)
Licence: Other - Open Under-PSI: 0.00 € / 0.00 €
NON MEMBER (academic / commercial)
Licence: Other - Open Under-PSI: 0.00 € / 0.00 €
 BMI Brochures and Website 2016 (Processed)    
  • English
  • German

ID: ELRA-W0199

ISLRN: 416-672-686-637-0

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual tmx file of German to English translations of ...

MEMBER (academic / commercial)
Licence: Other - Open Under-PSI: 0.00 € / 0.00 €
NON MEMBER (academic / commercial)
Licence: Other - Open Under-PSI: 0.00 € / 0.00 €