157 Language Resources (Page 1 of 8)


 ARCADE II Evaluation Package    
  • Arabic
  • Chinese
  • English
  • French
  • German
  • Italian
  • Japanese
  • Modern Greek (1453-)
  • Persian
  • Russian
  • Spanish; Castilian

ID: ELRA-E0018

ISLRN: 875-865-064-331-9

The ARCADE II Evaluation Package was produced within the French national project ARCADE II (Evaluation of parallel text alignment systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The ARCADE II project made it possible to carry out a cam...

MEMBER (academic / commercial)
Licence: Evaluation Use - ELRA EVALUATION
150.00 € / 500.00 €
NON MEMBER (academic / commercial)
Licence: Evaluation Use - ELRA EVALUATION
300.00 € / 1000.00 €
 AUDIO Human Voice Pronunciations - Chinese (Simplified)    
  • Chinese

ID: ELRA-S0490-03

ISLRN: 569-723-482-271-6

Human voice recordings of single-word lemmas and multiword expressions, besides IPA (International Phonetic Alphabet) and alternative scripts (Japanese – Romaji/Kanji/Hiragana; Chinese – Pinyin; Arabic and Hebrew – w/out diacritics), distributed as distinct sets (from ELRA-S0490-01 to ELRA-S0490-...

MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
471.90 € / 471.90 €
NON MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
495.50 € / 495.50 €

Special offers are also available. Check here for details.

 Bitext Lexical Dataset - Chinese (Simplified)    
  • Chinese

ID: ELRA-L0137

ISLRN: 803-896-567-451-2

The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Chinese (Simplified) consists of 75,000 lemmas (f...

MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
85000.00 €
NON MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
85000.00 €
 Bitext Lexical Dataset - Chinese (Traditional)    
  • Chinese

ID: ELRA-L0138

ISLRN: 934-287-681-414-0

The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Chinese (Traditional) consists of 75,000 lemmas (...

MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
85000.00 €
NON MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
85000.00 €
 Bitext Lexical Dataset - Language Variants - Chinese    
  • Chinese

ID: ELRA-L0152

ISLRN: 345-861-801-718-8

As a complement to the generic vocabulary provided in ELRA-L0137 and ELRA-L0138, the following language variants of Chinese are provided:
  • Chinese Simplified: 74,000 lemmas (forms)
  • Chinese Traditional: 74,000 lemmas (forms)

MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
78000.00 €
NON MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
78000.00 €
 Cantonese Conversational Speech Data by Mobile Phone and Voice Recorder - 607 Hours    
  • Chinese

ID: ELRA-S0427

ISLRN: 722-447-977-629-5

A total of 995 local Cantonese speakers took part in the recording, conversing face to face in a natural way. They discussed a number of given topics freely, covering a wide range of fields; the speech is natural and fluent, in line with real dialogue scenes. Text is transcribed m...

MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
98030.50 € / 98030.50 €
NON MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
98030.50 € / 98030.50 €

Special offers are also available. Check here for details.

 Cantonese Dialect Speech Data by Mobile Phone - 1,652 Hours    
  • Chinese

ID: ELRA-S0478

ISLRN: 049-624-028-135-7

It contains recordings of 4,888 speakers from Guangdong Province, made in a quiet indoor environment. The recorded content covers 500,000 commonly used spoken sentences, including high-frequency words in weico and daily used expressions. The average number of repetitions is 1.5 and the average sentence...

MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
141246.00 € / 141246.00 €
NON MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
141246.00 € / 141246.00 €

Special offers are also available. Check here for details.

 Cantonese Readings Database    
  • Chinese

ID: ELRA-L0101

ISLRN: 634-690-317-631-5

This database is not only comprehensive but also linguistically accurate. It is based on solid principles of Cantonese phonology and semantics, and takes into account the phenomena of polyphony as well as tone change, which is unpredictable and requires manual proofreading. It covers 300,000 entr...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
9000.00 € / 15000.00 €
Licence: Commercial Use - ELRA VAR
18000.00 € / 30000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
11250.00 € / 18750.00 €
Licence: Commercial Use - ELRA VAR
22500.00 € / 37500.00 €
 Cantonese Speecon database    
  • Chinese

ID: ELRA-S0287

ISLRN: 537-563-219-913-3

The Cantonese Speecon database is divided into 2 sets: 1) The first set comprises the recordings of 550 adult Cantonese speakers (273 males, 277 females), recorded over 4 microphone channels in 4 recording environments (office, entertainment, car, public place). 2) The second set comprises the ...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
50000.00 € / 67000.00 €
Licence: Commercial Use - ELRA VAR
67000.00 € / 67000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
60000.00 € / 75000.00 €
Licence: Commercial Use - ELRA VAR
75000.00 € / 75000.00 €
 Changsha Dialect Speech Data by Mobile Phone - 997 Hours    
  • Chinese

ID: ELRA-S0453

ISLRN: 520-610-210-012-3

2,000 Changsha natives participated in the recording, covering multiple age groups, with a balanced gender distribution and authentic accent. The recorded text is rich in content, covering general, interactive, car, home and other categories. Local people in Changsha check and proofread. The ac...

MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
94715.00 € / 94715.00 €
NON MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
94715.00 € / 94715.00 €

Special offers are also available. Check here for details.

 Chinese Children Speech data by Mobile phone - 3,255 Hours    
  • Chinese

ID: ELRA-S0458

ISLRN: 607-995-858-759-4

Audio data of Chinese children captured by mobile phone, with a total duration of 3,255 hours. The 9,780 speakers are children aged 6 to 12, with accents covering seven dialect areas; the recorded text contains common children's language such as essay stories, numbers, and their interactions on cars, at hom...

MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
247380.00 € / 247380.00 €
NON MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
247380.00 € / 247380.00 €

Special offers are also available. Check here for details.

 Chinese Digital Speech Data by Mobile Phone - 11,010 People    
  • Chinese

ID: ELRA-S0419

ISLRN: 434-094-443-871-0

11,010 native Chinese speakers participated in the recording, with an equal gender split. Each speaker reads 30 sentences, each a 4- to 8-digit number. Format: 16 kHz, 16-bit, uncompressed WAV, mono channel. Recording environment: quiet indoor environment, without echo. Recording content (read speech): four to eight...

MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
41838.00 € / 41838.00 €
NON MEMBER (academic / commercial)
Licence: Commercial Use - ELRA VAR
41838.00 € / 41838.00 €

Special offers are also available. Check here for details.

 Chinese-English Database of Proper Nouns    
  • Chinese
  • English

ID: ELRA-M0058

ISLRN: 638-295-493-483-2

A large, comprehensive database of Chinese-English personal and place names, covering not only native Chinese proper nouns but also Japanese, Korean, and Western proper nouns.

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
15000.00 € / 25000.00 €
Licence: Commercial Use - ELRA VAR
30000.00 € / 50000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
18750.00 € / 31250.00 €
Licence: Commercial Use - ELRA VAR
37500.00 € / 62500.00 €
 Chinese-English Database of Proverbs and Idioms (Chengyu)    
  • Chinese
  • English

ID: ELRA-M0056

ISLRN: 506-728-933-717-0

This database is important for translating 成語 chengyu (Chinese proverbs and idioms), which cannot be translated literally since they are often based on classical Chinese. For example, 臨陣磨槍, literally 'face battle sharpen spear', which means "do something at the last moment," cannot be correctly ...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
1050.00 € / 1750.00 €
Licence: Commercial Use - ELRA VAR
2100.00 € / 3500.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
1313.00 € / 2188.00 €
Licence: Commercial Use - ELRA VAR
2625.00 € / 4375.00 €
 Chinese-Japanese Database of Proper Nouns    
  • Chinese
  • Japanese

ID: ELRA-M0059

ISLRN: 951-838-928-664-9

A large, comprehensive database of Chinese-Japanese personal and place names, covering not only native Chinese proper nouns but also Korean and Western proper nouns.

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
15000.00 € / 25000.00 €
Licence: Commercial Use - ELRA VAR
30000.00 € / 50000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
18750.00 € / 31250.00 €
Licence: Commercial Use - ELRA VAR
37500.00 € / 62500.00 €
 Chinese-Japanese Technical Terms Dictionary    
  • Chinese
  • Japanese

ID: ELRA-M0057

ISLRN: 079-503-057-574-0

Covers over 800,000 terms from over 20 science and technology domains, including computers/IT, mechanical engineering, biotechnology, chemistry, and medicine.

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
6450.00 € / 10750.00 €
Licence: Commercial Use - ELRA VAR
12900.00 € / 21500.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
8063.00 € / 13438.00 €
Licence: Commercial Use - ELRA VAR
16125.00 € / 26875.00 €
 Chinese Lexical Database    
  • Chinese

ID: ELRA-L0107

ISLRN: 500-068-723-953-8

A comprehensive monolingual lexical database of Chinese consisting of Simplified and Traditional Chinese modules, covering general vocabulary and important technical terms. Each entry is accompanied by various attributes, such as phonological, grammatical, and morphological information, as well a...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
4500.00 € / 7500.00 €
Licence: Commercial Use - ELRA VAR
9000.00 € / 15000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
5625.00 € / 9375.00 €
Licence: Commercial Use - ELRA VAR
11250.00 € / 18750.00 €
 Chinese Mandarin (North) database    
  • Chinese

ID: ELRA-S0398

ISLRN: 353-548-770-894-7

This database contains the recordings of 500 Chinese Mandarin speakers from Northern China (250 males and 250 females), aged 18 to 60, recorded in quiet studios located in Shenzhen and in Hong Kong Special Administrative Region, People’s Republic of China. Demographics of native sp...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
5200.00 € / 7400.00 €
Licence: Commercial Use - ELRA VAR
7400.00 € / 7400.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
7400.00 € / 7400.00 €
Licence: Commercial Use - ELRA VAR
7400.00 € / 7400.00 €
 Chinese Mandarin (South) database    
  • Chinese

ID: ELRA-S0397

ISLRN: 503-886-852-083-2

This database contains the recordings of 1000 Chinese Mandarin speakers from Southern China (500 males and 500 females), aged 18 to 60, recorded in quiet studios located in Shenzhen and in Hong Kong Special Administrative Region, People’s Republic of China. Demographics of native s...

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
10400.00 € / 14800.00 €
Licence: Commercial Use - ELRA VAR
14800.00 € / 14800.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
14800.00 € / 14800.00 €
Licence: Commercial Use - ELRA VAR
14800.00 € / 14800.00 €
 Chinese Mandarin Speech Recognition Corpus (Mobile) - 204.2 hours    
  • Chinese

ID: ELRA-S0228-67

ISLRN: 509-044-363-238-7

This corpus comprises 120,144 entries uttered by 400 speakers (199 males and 201 females), recorded over the mobile telephone network. Speech samples are stored as 16-bit, 16 kHz audio, for a total of 204.2 hours of speech.

MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
24000.00 € / 24000.00 €
Licence: Commercial Use - ELRA VAR
24000.00 € / 24000.00 €
NON MEMBER (academic / commercial)
Licence: Non Commercial Use - ELRA END USER
24000.00 € / 24000.00 €
Licence: Commercial Use - ELRA VAR
24000.00 € / 24000.00 €
