8 Language Resources

Order by:

 Bilingual (Spanish-English) Speech synthesis HTS models    
  • English
  • Spanish; Castilian

ID: ELRA-S0335

ISLRN: 277-380-359-561-3

This database contains Bilingual (English and Spanish) Festival HTS models. Models were trained with 9h of speech from 2 female bilingual speakers and 2 male bilingual speakers. Each speaker recorded 2h 15 min per language. The speech data can be found in the TC-STAR Bilingual Voice-Conversion S...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
 Chinese-Vietnamese - PhraseBank with audio files    
  • Chinese
  • Vietnamese

ID: ELRA-S0485

ISLRN: 428-557-564-826-7

Chinese-Vietnamese - PhraseBank with audio files of daily conversations spoken by native speakers containing 4002 sentence pairs. Scripts with Pinyin, Topic, Cat, Vietnamese translation with corresponding audio in Chinese and Vietnamese. Corpus in XML and WAV formats.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
400.00 € submit
500.00 € submit
Licence: Commercial Use - ELRA VAR
900.00 € submit
900.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
600.00 € submit
750.00 € submit
Licence: Commercial Use - ELRA VAR
1350.00 € submit
1350.00 € submit
 Mbochi speech corpus    
  • Bantu languages
  • French

ID: ELRA-S0396

ISLRN: 747-055-093-447-8

The Mbochi speech corpus was developed in the framework of ANR-DFG BULB project. This project aims to provide field linguists (eg working on morphology) with tools for less or not written languages. The provided corpus is a subset from the corpus developed in this framework. The provided corpu...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
 Mixed Speech with Chinese and English Data by Mobile Phone - 1,535 Hours    
  • Chinese
  • English

ID: ELRA-S0457

ISLRN: 451-966-049-653-3

The data is recorded by 3972 Chinese native speakers with accents covering seven major dialect areas. The recorded text is a mixture of Chinese and English sentences, covering general scenes and human-computer interaction scenes. It is rich in content and accurate in transcription. It can be used...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
145825.00 € submit
145825.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
145825.00 € submit
145825.00 € submit

Special offers are also available. Check here for details.

 TAXI - Multilingual telephone dialog database    
  • English
  • German

ID: ELRA-S0137

ISLRN: 734-992-877-270-4

TAXI was produced by BAS, in collaboration with the German research centre for artificial intelligence, DFKI. This speech database contains recordings which consist of dialogues, 94 on the whole (spontaneous speech), between a German speaking cab dispatcher and his clients, who always answered in...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
127.82 € submit
383.03 € submit
Licence: Commercial Use - ELRA VAR
383.03 € submit
383.03 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
255.65 € submit
511.29 € submit
Licence: Commercial Use - ELRA VAR
511.29 € submit
511.29 € submit
 TC-STAR Bilingual Expressive Speech Database    
  • English
  • Spanish; Castilian

ID: ELRA-S0313

ISLRN: 088-656-828-489-3

8 hours of speech as spoken by 2 female speakers and 2 male speakers for each language (English and Spanish).

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
1500.00 € submit
Licence: Commercial Use - ELRA VAR
1500.00 € submit
1500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
600.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
 The FAME! Speech Corpus    
  • Dutch; Flemish
  • Western Frisian

ID: ELRA-S0391

ISLRN: 340-994-352-616-4

The components of the Frisian data collection are speech and language resources gathered for building a large vocabulary ASR system for the Frisian language. Firstly, a new broadcast database is created by collecting recordings from the archives of the regional broadcaster Omrop Fryslân, and ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
1500.00 € submit
Licence: Commercial Use - ELRA VAR
1500.00 € submit
1500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
3500.00 € submit
Licence: Commercial Use - ELRA VAR
3500.00 € submit
3500.00 € submit
 Wake-up Words Speech Data by Microphone - 1,027 People    
  • Chinese
  • English

ID: ELRA-S0418

ISLRN: 615-126-948-152-3

More than 1,000 recorders read the specified wake-up words, covering slow, normal, and fast three speeds. Audios are recorded in the professional recording studio using the microphone. Format:48kHz, 16bit, uncompressed wav, mono channel Recording environment:professional recording studio Rec...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
9756.50 € submit
9756.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
9756.50 € submit
9756.50 € submit

Special offers are also available. Check here for details.