112 Language Resources (Page 2 of 6)

« Previous | Next »Order by:

 Comprehensive Word Lists for Chinese, Japanese, Korean and Arabic    
  • Arabic
  • Chinese
  • Japanese
  • Korean

ID: ELRA-M0071

ISLRN: 476-146-877-598-3

Comprehensive monolingual word lists for both Simplified and Traditional Chinese, Japanese, Korean and Arabic, including a full-form Arabic word list. For Simplified and Traditional Chinese, Japanese and Korean, we provide readings as well, making them ideal for speech-related applications such...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
30000.00 € submit
50000.00 € submit
Licence: Commercial Use - ELRA VAR
60000.00 € submit
100000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
37500.00 € submit
62500.00 € submit
Licence: Commercial Use - ELRA VAR
75000.00 € submit
125000.00 € submit
 Database of Chinese Full Names    
  • Chinese

ID: ELRA-L0106

ISLRN: 356-835-468-182-0

Covers Chinese full names of real people, including celebrities. Includes pinyin readings.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
4500.00 € submit
7500.00 € submit
Licence: Commercial Use - ELRA VAR
9000.00 € submit
15000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5625.00 € submit
9375.00 € submit
Licence: Commercial Use - ELRA VAR
11250.00 € submit
18750.00 € submit
 Database of Chinese Names    
  • Chinese

ID: ELRA-L0129

ISLRN: 792-499-131-789-4

Chinese name components, accompanied by accurate pinyin readings, gender codes, and flags denoting whether name is a given name, surname, or both.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6000.00 € submit
10000.00 € submit
Licence: Commercial Use - ELRA VAR
12000.00 € submit
20000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
7500.00 € submit
12500.00 € submit
Licence: Commercial Use - ELRA VAR
15000.00 € submit
25000.00 € submit
 Database of Chinese Name Variants    
  • Chinese

ID: ELRA-L0105

ISLRN: 379-237-021-386-4

Provides comprehensive coverage for the major Chinese romanization systems and their variants, and if needed can be expanded considerably with dialectical variants (Cantonese, Hakka, Hokkien, etc.).

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
12000.00 € submit
20000.00 € submit
Licence: Commercial Use - ELRA VAR
24000.00 € submit
40000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
15000.00 € submit
25000.00 € submit
Licence: Commercial Use - ELRA VAR
30000.00 € submit
50000.00 € submit
 English-Chinese-Vietnamese Trilingual Parallel Corpus    
  • Chinese
  • English
  • Vietnamese

ID: ELRA-W0314

ISLRN: 637-630-726-817-9

The English-Chinese-Vietnamese Trilingual Parallel Corpus consists of 20,046 trilingual sets of sentence pairs. The corpus is provided in XML format and is annotated according to TEI-encoding guidelines.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
150.00 € submit
500.00 € submit
Licence: Commercial Use - ELRA VAR
1000.00 € submit
1000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
225.00 € submit
750.00 € submit
Licence: Commercial Use - ELRA VAR
1500.00 € submit
1500.00 € submit
 English-to-Simplified Chinese Dictionary    
  • Chinese
  • English

ID: ELRA-M0055

ISLRN: 407-348-028-638-3

80,000 headwords, expandable to 100,000, of general vocabulary and important proper names.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2250.00 € submit
3750.00 € submit
Licence: Commercial Use - ELRA VAR
4500.00 € submit
7500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2813.00 € submit
4688.00 € submit
Licence: Commercial Use - ELRA VAR
5625.00 € submit
9375.00 € submit
 GlobalPhone 2000 Speaker Package    
  • Arabic
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • French
  • German
  • Hausa
  • Japanese
  • Korean
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tamil
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese

ID: ELRA-S0400

ISLRN: 331-592-378-424-7

The GlobalPhone 2000 Speaker Package contains transcribed read speech spoken by 2000 native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Mandarin (ELRA-S0193), C...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1200.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1400.00 € submit
7200.00 € submit
Licence: Commercial Use - ELRA VAR
7200.00 € submit
7200.00 € submit
 GlobalPhone Chinese-Mandarin    
  • Chinese

ID: ELRA-S0193

ISLRN: 976-318-571-969-1

The GlobalPhone corpus developed in collaboration with the Karlsruhe Institute of Technology (KIT) was designed to provide read speech data for the development and evaluation of large continuous speech recognition systems in the most widespread languages of the world, and to provide a uniform, mu...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
600.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
700.00 € submit
3600.00 € submit
Licence: Commercial Use - ELRA VAR
3600.00 € submit
3600.00 € submit

Special offers are also available. Check here for details.

 GlobalPhone Chinese-Mandarin Pronunciation Dictionary      
  • Chinese

ID: ELRA-S0363

ISLRN: 457-511-870-286-9

The GlobalPhone pronunciation dictionaries, created within the framework of the multilingual speech and language corpus GlobalPhone, were developed in collaboration with the Karlsruhe Institute of Technology (KIT). The GlobalPhone pronunciation dictionaries contain the pronunciations of all wo...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
600.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
700.00 € submit
3600.00 € submit
Licence: Commercial Use - ELRA VAR
3600.00 € submit
3600.00 € submit

Special offers are also available. Check here for details.

 GlobalPhone Chinese-Shanghai    
  • Chinese

ID: ELRA-S0194

ISLRN: 879-999-559-792-7

The GlobalPhone corpus developed in collaboration with the Karlsruhe Institute of Technology (KIT) was designed to provide read speech data for the development and evaluation of large continuous speech recognition systems in the most widespread languages of the world, and to provide a uniform, mu...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
300.00 € submit
1800.00 € submit
Licence: Commercial Use - ELRA VAR
1800.00 € submit
1800.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
355.00 € submit
2125.00 € submit
Licence: Commercial Use - ELRA VAR
2125.00 € submit
2125.00 € submit

Special offers are also available. Check here for details.

 GlobalPhone Multilingual Model Package    
  • Arabic
  • Bulgarian
  • Chinese
  • Croatian
  • Czech
  • French
  • German
  • Hausa
  • Japanese
  • Korean
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swahili (macrolanguage)
  • Swedish
  • Tamil
  • Thai
  • Turkish
  • Ukrainian
  • Vietnamese

ID: ELRA-S0399

ISLRN: 204-945-263-927-6

The GlobalPhone Multilingual Model Package contains about 22 hours of transcribed read speech spoken by native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Manda...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1200.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1400.00 € submit
7200.00 € submit
Licence: Commercial Use - ELRA VAR
7200.00 € submit
7200.00 € submit
 Hanzi Pinyin Database for Simplified Chinese    
  • Chinese

ID: ELRA-L0104

ISLRN: 292-895-602-975-4

Covers entries of general vocabulary, along with high-frequency technical terms and proper nouns. In addition to large coverage and high level of accuracy, the database has several special features including explicit codes to indicate headword type and part-of speech, coverage of all polyphones, ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
9000.00 € submit
15000.00 € submit
Licence: Commercial Use - ELRA VAR
18000.00 € submit
30000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
11250.00 € submit
18750.00 € submit
Licence: Commercial Use - ELRA VAR
22500.00 € submit
37500.00 € submit
 Hong Kong Cantonese Speech Recognition Corpus (Desktop)    
  • Chinese

ID: ELRA-S0228-75

ISLRN: 083-033-068-532-0

This corpus comprises 101,964 entries uttered by 51 speakers, recorded over 4 channels (desktop). Speech samples are stored as a sequence of 16-bit 44.1kHz for a total of 24.18 hours of speech per channel.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6000.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6000.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
 Korean-Chinese Database of Proper Nouns    
  • Chinese
  • Korean

ID: ELRA-M0070

ISLRN: 207-127-841-003-9

A large comprehensive database of Korean-Chinese personal and place names, with coverage of not only native Korean proper nouns, but also Japanese, Chinese and Western proper nouns as well.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
4500.00 € submit
7500.00 € submit
Licence: Commercial Use - ELRA VAR
9000.00 € submit
15000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5625.00 € submit
9375.00 € submit
Licence: Commercial Use - ELRA VAR
11250.00 € submit
18750.00 € submit
 LC-STAR Mandarin Chinese Phonetic lexicon      
  • Chinese

ID: ELRA-S0256

ISLRN: 103-062-804-789-9

The LC-STAR Mandarin Chinese Phonetic lexicon was created within the scope of the LC-STAR project (IST 2001-32216) which was sponsored by the European Commission. The lexicon comprises 104,368 entries, distributed over three categories: - a set of 38,098 common word entries. This set is extract...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
27000.00 € submit
40000.00 € submit
Licence: Commercial Use - ELRA VAR
40000.00 € submit
40000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
38000.00 € submit
50000.00 € submit
Licence: Commercial Use - ELRA VAR
50000.00 € submit
50000.00 € submit
 Mandarin Chinese Desktop Speech Recognition Corpus - Digit String (120 people)    
  • Chinese

ID: ELRA-S0228-16

ISLRN: 490-058-387-834-9

This corpus comprises 1,500 entries uttered by 120 speakers of different dialects, ages and various educational levels (59 males and 61 females), recorded through head-mounted noise-canceling microphone. The database comprises 3,600 digit strings. Speech samples are stored as a sequence of 16-bit...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
 Mandarin Chinese Desktop Speech Recognition Corpus - Digit String (200 people)    
  • Chinese

ID: ELRA-S0228-13

ISLRN: 356-016-188-560-2

This corpus comprises 1,500 entries uttered by 200 speakers of different dialects, ages and various educational levels (87 males and 113 females), recorded over 4 channels (Mic1: SHURE SM58; Mic2: ANC-700 Head-mounted; Mic3: TELEX M-60; Mic4: ACOUSTIC MAGIC). The database comprises 6,000 digit st...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
 Mandarin Chinese Desktop Speech Recognition Corpus - Digit String (849 people)    
  • Chinese

ID: ELRA-S0228-22

ISLRN: 886-056-542-487-1

This corpus comprises 750 entries uttered by 849 speakers of different dialects, ages and various educational levels (420 males and 429 females), recorded over 2 channels (Mic1: SHURE SM58; Mic2: Labtec Axis-002). The database comprises 12,750 digit strings per channel. Speech samples are stored...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5000.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5000.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
 Mandarin Chinese Desktop Speech Recognition Corpus - Digit String (98 people)    
  • Chinese

ID: ELRA-S0228-32

ISLRN: 817-471-874-316-6

This corpus comprises 1,500 entries uttered by 98 speakers of different dialects, ages and various educational levels (46 males and 52 females), recorded over 4 channels (Mic 1: SHURE SM58; Mic 2: Labtec Axis-002; Mic 3: KOSS; Mic 4: ATR 60C). The database comprises digit strings. Speech samples ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
 Mandarin Chinese Desktop Speech Recognition Corpus - Monosyllabic (94 people)    
  • Chinese

ID: ELRA-S0228-31

ISLRN: 296-670-975-155-3

This corpus comprises 1,267 entries uttered by 94 speakers of different dialects, ages and various educational levels (45 males and 49 females), recorded over 3 channels (Mic 1: SHURE SM58; Mic 2: Labtec Axis-002; Mic 3: ATR 60C). The database comprises monosyllables. Speech samples are stored as...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
10000.00 € submit
10000.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
10000.00 € submit
10000.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit

« Previous | Next »