Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
28 Language Resources (Page 1 of 2)
« Previous | Next »Order by:
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hindi
- Italian
- Japanese
- Korean
- Modern Greek (1453-)
- Norwegian
- Persian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Thai
- Turkish
- Vietnamese
ID: ELRA-S0383
ISLRN: 398-655-047-044-5The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, see ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank, see ELRA-T0377). This version includes the audio files corresponding t...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3360.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
4480.00 €
|
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Finnish
- French
- German
- Italian
- Japanese
- Korean
- Modern Greek (1453-)
- Norwegian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Thai
- Turkish
- Vietnamese
ID: ELRA-S0382
ISLRN: 309-438-781-042-2The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, see ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank, see ELRA-T0377). This version includes the corresponding audio files c...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3640.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5200.00 €
|
- Arabic
- Bulgarian
- Chinese
- Croatian
- Czech
- French
- German
- Hausa
- Japanese
- Korean
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swahili (macrolanguage)
- Swedish
- Tamil
- Thai
- Turkish
- Ukrainian
- Vietnamese
ID: ELRA-S0400
ISLRN: 331-592-378-424-7The GlobalPhone 2000 Speaker Package contains transcribed read speech spoken by 2000 native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Mandarin (ELRA-S0193), C...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1200.00 €
|
6000.00 €
|
Licence: Commercial Use - ELRA VAR |
6000.00 €
|
6000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1400.00 €
|
7200.00 €
|
Licence: Commercial Use - ELRA VAR |
7200.00 €
|
7200.00 €
|
- Korean
ID: ELRA-S0200
ISLRN: 520-329-707-787-0The GlobalPhone corpus developed in collaboration with the Karlsruhe Institute of Technology (KIT) was designed to provide read speech data for the development and evaluation of large continuous speech recognition systems in the most widespread languages of the world, and to provide a uniform, mu...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
600.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
700.00 €
|
3600.00 €
|
Licence: Commercial Use - ELRA VAR |
3600.00 €
|
3600.00 €
|
Special offers are also available. Check here for details.
- Arabic
- Bulgarian
- Chinese
- Croatian
- Czech
- French
- German
- Hausa
- Japanese
- Korean
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swahili (macrolanguage)
- Swedish
- Tamil
- Thai
- Turkish
- Ukrainian
- Vietnamese
ID: ELRA-S0399
ISLRN: 204-945-263-927-6The GlobalPhone Multilingual Model Package contains about 22 hours of transcribed read speech spoken by native speakers in 22 languages. The data are sampled from the GlobalPhone Speech and Text Data available in the ELRA Catalogue, i.e.: Arabic (ELRA-S0192), Bulgarian (ELRA-S0319), Chinese-Manda...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1200.00 €
|
6000.00 €
|
Licence: Commercial Use - ELRA VAR |
6000.00 €
|
6000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1400.00 €
|
7200.00 €
|
Licence: Commercial Use - ELRA VAR |
7200.00 €
|
7200.00 €
|
- Korean
ID: ELRA-S0460
ISLRN: 452-991-963-242-3357 hours of Korean speech data collected by cellphone, recorded by 999 Korean in quiet environment and rich in content. All texts are transtribed by professional annotator. The accuracy rate of sentence is 95%. It can be used for speech recognition, machine translation and voiceprint recognition...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
57655.50 €
|
57655.50 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
57655.50 €
|
57655.50 €
|
Special offers are also available. Check here for details.
- Korean
ID: ELRA-S0431
ISLRN: 988-295-249-804-1Korean audio data with duration of 516 hours. Recorded texts include: daily language, various interactive sentences, home commands, on-board commands, etc. Among 1,077 speakers, male and female speakers are 49% and 51%. The duration of each speaker is around half an hour. Format:16kHz, 16bit, ...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
83334.00 €
|
83334.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
83334.00 €
|
83334.00 €
|
Special offers are also available. Check here for details.
- Korean
ID: ELRA-S0475
ISLRN: 330-652-829-242-5It collects 291 Korean locals and is recorded in quiet indoor environment. The recordings include economics, entertainment, news, oral, figure, letter. 400 sentences for each speaker. Recording devices are mainstream Android phones and iPhones. Format:16kHz, 16bit, uncompressed wav, mono chann...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
31815.50 €
|
31815.50 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
31815.50 €
|
31815.50 €
|
Special offers are also available. Check here for details.
- Korean
ID: ELRA-S0228-52
ISLRN: 331-652-936-814-0This corpus comprises 13,200 Korean digit strings uttered by 110 speakers of different dialects, ages and various educational levels, recorded over 4 channels. Speech samples are stored as a sequence of 16-bit 48kHz WAV for 18.89 hours of speech per channel. The total capacity of the data is 24.2...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3600.00 €
|
3600.00 €
|
Licence: Commercial Use - ELRA VAR |
3600.00 €
|
3600.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3600.00 €
|
3600.00 €
|
Licence: Commercial Use - ELRA VAR |
3600.00 €
|
3600.00 €
|
- Korean
ID: ELRA-S0228-103
ISLRN: 852-908-669-816-4This corpus comprises 32,247 entries uttered by 52 speakers (26 males and 26 females), recorded over 3 channels (desktop and mobile in quiet office). Speech samples are stored as a sequence of 16-bit 48kHz for a total of 15.76 hours of speech per channel.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
4500.00 €
|
4500.00 €
|
Licence: Commercial Use - ELRA VAR |
4500.00 €
|
4500.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
4500.00 €
|
4500.00 €
|
Licence: Commercial Use - ELRA VAR |
4500.00 €
|
4500.00 €
|
- Korean
ID: ELRA-S0228-62
ISLRN: 832-286-766-358-8This corpus comprises 83,756 entries uttered by 150 speakers (66 males and 84 females), recorded over 4 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit 48kHz for a total of 29.65 hours of speech per channel. This set combines ELRA-S0228-50, ELRA-S0228-51, ELR...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
13500.00 €
|
13500.00 €
|
Licence: Commercial Use - ELRA VAR |
13500.00 €
|
13500.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
13500.00 €
|
13500.00 €
|
Licence: Commercial Use - ELRA VAR |
13500.00 €
|
13500.00 €
|
- Korean
ID: ELRA-S0228-50
ISLRN: 146-478-758-877-6This corpus comprises 1,500 Korean person names uttered by 150 speakers of different dialects, ages and various educational levels, recorded over 4 channels. Speech samples are stored as a sequence of 16-bit 48kHz WAV for 1.69 hours of speech per channel. The total capacity of the data is 2 Gb. ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3600.00 €
|
3600.00 €
|
Licence: Commercial Use - ELRA VAR |
3600.00 €
|
3600.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3600.00 €
|
3600.00 €
|
Licence: Commercial Use - ELRA VAR |
3600.00 €
|
3600.00 €
|
- Korean
ID: ELRA-S0228-51
ISLRN: 227-476-631-391-1This corpus comprises 1,500 Korean place names uttered by 150 speakers of different dialects, ages and various educational levels, recorded over 4 channels. Speech samples are stored as a sequence of 16-bit 48kHz WAV for 1.65 hours of speech per channel. The total capacity of the data is 2 Gb. E...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3600.00 €
|
3600.00 €
|
Licence: Commercial Use - ELRA VAR |
3600.00 €
|
3600.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3600.00 €
|
3600.00 €
|
Licence: Commercial Use - ELRA VAR |
3600.00 €
|
3600.00 €
|
- Korean
ID: ELRA-S0228-53
ISLRN: 892-259-683-285-7This corpus comprises 4,800 Korean sentences uttered by 40 speakers of different dialects, ages and various educational levels, recorded over 4 channels. Speech samples are stored as a sequence of 16-bit 48kHz WAV for 7.43 hours of speech per channel. The total capacity of the data is 9.82 Gb. E...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
2700.00 €
|
2700.00 €
|
Licence: Commercial Use - ELRA VAR |
2700.00 €
|
2700.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
2700.00 €
|
2700.00 €
|
Licence: Commercial Use - ELRA VAR |
2700.00 €
|
2700.00 €
|
- Korean
ID: ELRA-S0228-118
ISLRN: 777-298-567-915-1This corpus was recorded in a quiet office/home environment and collected from a total of 500 speakers, including 246 males and 254 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news. Speech samples ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
45000.00 €
|
45000.00 €
|
Licence: Commercial Use - ELRA VAR |
45000.00 €
|
45000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
45000.00 €
|
45000.00 €
|
Licence: Commercial Use - ELRA VAR |
45000.00 €
|
45000.00 €
|
- Korean
ID: ELRA-S0177
ISLRN: 429-596-342-929-3The Korean Speecon database is divided into 2 sets: 1) The first set comprises the recordings of 568 adult Korean speakers (259 males, 309 females), recorded over 4 microphone channels in 4 recording environments (office, entertainment, car, public place). 2) The second set comprises the record...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
50000.00 €
|
67000.00 €
|
Licence: Commercial Use - ELRA VAR |
67000.00 €
|
67000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
60000.00 €
|
75000.00 €
|
Licence: Commercial Use - ELRA VAR |
75000.00 €
|
75000.00 €
|
- Korean
- Vietnamese
ID: ELRA-W0313
ISLRN: 365-128-449-700-7The Korean-Vietnamese Parallel Corpus consists of 200,000 sentence pairs, with an average length of 15 words per sentence. The corpus is provided in XML format and is annotated according to TEI-encoding guidelines.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
200.00 €
|
400.00 €
|
Licence: Commercial Use - ELRA VAR |
1400.00 €
|
1400.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
300.00 €
|
600.00 €
|
Licence: Commercial Use - ELRA VAR |
2100.00 €
|
2100.00 €
|
- Korean
ID: ELRA-S0295
ISLRN: 391-771-784-796-1The LILA Korean database collected in South Korea was recorded within the scope of the LILA project. It contains the recordings of 1,000 Korean speakers (500 males and 500 females) recorded over the Korean mobile telephone network. The following acoustic conditions were selected as representativ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
43125.00 €
|
47500.00 €
|
Licence: Commercial Use - ELRA VAR |
47500.00 €
|
47500.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
46405.00 €
|
51875.00 €
|
Licence: Commercial Use - ELRA VAR |
51875.00 €
|
51875.00 €
|
- Chinese
- English
- Korean
ID: ELRA-W0035
ISLRN: 731-151-596-869-3Multilingual parallel corpus produced by Kaist Korterm containing 60 000 expressions in Korean, Chinese and English.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
750.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1500.00 €
|
6000.00 €
|
Licence: Commercial Use - ELRA VAR |
6000.00 €
|
6000.00 €
|
- Arabic
- Chinese
- Danish
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hebrew
- Italian
- Japanese
- Korean
- Modern Greek (1453-)
- Northern Sami
- Norwegian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Turkish
ID: ELRA-W0336
ISLRN: 471-919-856-164-1Parallel corpora for nearly 400 language pairs and numerous multilingual combinations, including 10 million bilingual segments and 90 million tokens in 20 languages: Arabic, Chinese (Simplified), Danish, Dutch, English, Finnish, French, German, Greek, Hebrew, Italian, Japanese, Korean, North Sami...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
0.10 €
|
0.10 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
0.11 €
|
0.11 €
|
Special offers are also available. Check here for details.
« Previous | Next »