Resource Type:
Corpus: | ![]() |
Lexical/Conceptual: | ![]() |
Tool/Service: | ![]() |
Language Description: | ![]() |
Media Type:
Text: | ![]() |
Audio: | ![]() |
Image: | ![]() |
Video: | ![]() |
Text Numerical: | ![]() |
Text N-Gram: | ![]() |
1681 Language Resources (Page 33 of 85)
« Previous | Next »Order by:


- Modern Greek (1453-)
ID: ELRA-W0022
ISLRN: 002-552-644-443-1The ILSP/ELEFTHEROTYPIA Corpus contains approximately 3 million words classified and annotated according to the common core PAROLE encoding standard. Thus, each file is classified according to the parameters of Medium, Topic and Genre, and structurally annotated at paragraph level (CES Level 1). ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
850.00 €
![]() |
850.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1275.00 €
![]() |
1275.00 €
![]() |


- English
ID: ELRA-S0456
ISLRN: 001-453-575-915-4Indian English audio data captured by mobile phones, 1,012 hours in total, recorded by 2,100 Indian native speakers. The recorded text is designed by linguistic experts, covering generic, interactive, on-board, home and other categories. The text has been proofread manually with high accuracy; ...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
153824.00 €
![]() |
153824.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
153824.00 €
![]() |
153824.00 €
![]() |
Special offers are also available. Check here for details.


- Indonesian
ID: ELRA-S0439
ISLRN: 394-545-170-456-21285 Indonesian native speakers participated in the recording with authentic accent. The recorded script is designed by linguists and cover a wide range of topics including generic, interactive, on-board and home. The text is manually proofread with high accuracy. It matches with mainstream Andro...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
103198.50 €
![]() |
103198.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
103198.50 €
![]() |
103198.50 €
![]() |
Special offers are also available. Check here for details.


- Indonesian
ID: ELRA-S0470
ISLRN: 311-413-414-907-0Indonesia speech data (reading) is collected from 496 Indonesian native speakers and is recorded in quiet environment. The recording is rich in content, covering multiple categories such as econimics, entertainment, news, figure, letter, and oral. Around 400 sentences for each speaker. The valid ...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
57978.50 €
![]() |
57978.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
57978.50 €
![]() |
57978.50 €
![]() |
Special offers are also available. Check here for details.


- Indonesian
ID: ELRA-S0228-115
ISLRN: 238-085-521-885-2This corpus was recorded in a quiet office environment over 4 channels and collected from a total of 200 speakers, including 97 males and 103 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news and da...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
21600.00 €
![]() |
21600.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
21600.00 €
![]() |
21600.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
21600.00 €
![]() |
21600.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
21600.00 €
![]() |
21600.00 €
![]() |


- Catalan; Valencian
- English
- Spanish; Castilian
ID: ELRA-T0094
ISLRN: 723-632-688-733-6Insurance contracts, private and public insurance, resource terminology used within European Union institutions. Cards available: 1000 Languages: Catalan, Spanish, English Format: ASCII Medium: floppy disk Card Description: Each card in this terminological database contains a definition, abb...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | ||
Licence: Commercial Use - ELRA VAR |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | ||
Licence: Commercial Use - ELRA VAR |


- English
- Latvian
ID: ELRA-W0158
ISLRN: 810-722-062-476-6This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. International Agreements have been translated into natio...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 |
0.00 €
![]() |
0.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 |
0.00 €
![]() |
0.00 €
![]() |


- Yoruba
ID: ELRA-S0492
ISLRN: 012-405-700-001-6A modern, high-fidelity, multi-speaker, Yorùbá read speech corpus suitable for Speech Synthesis, Automatic Speech Recognition and Computational Linguistics research. The subject matter is drawn from the Broadcast News domain as well as fictional texts, delivering a multi-purpose, contemporary spe...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
11200.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
11200.00 €
![]() |
11200.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
12000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
12000.00 €
![]() |
12000.00 €
![]() |


- English
ID: ELRA-S0083
ISLRN: 723-960-059-948-7Approx. 20 minutes of speech (per speaker) from 23 German and 23 Italian intermediate learners of English. Each speaker recorded sentences from several blocks of differing types (reading simple sentences, using minimal pairs, giving answers to multiple choice questions). The prompts were of varyi...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
500.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
1500.00 €
![]() |


- English
ID: ELRA-S0228-108
ISLRN: 554-977-743-197-5This corpus was recorded in a quiet office/home environment over 3 channels and collected from a total of 213 speakers, including 103 males and 110 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
25200.00 €
![]() |
25200.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
25200.00 €
![]() |
25200.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
25200.00 €
![]() |
25200.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
25200.00 €
![]() |
25200.00 €
![]() |


- Italian
ID: ELRA-S0228-98
ISLRN: 501-616-216-038-6This corpus comprises 19,788 entries uttered by 31 speakers (15 males and 16 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit 44.1kHz for a total of 4.9 hours of speech per channel.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5400.00 €
![]() |
5400.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
5400.00 €
![]() |
5400.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5400.00 €
![]() |
5400.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
5400.00 €
![]() |
5400.00 €
![]() |


- Italian
ID: ELRA-L0069
ISLRN: 840-625-201-574-7This Italian lexicon is made up of 862,500 inflected forms corresponding to 112,000 simple word lemmas. It contains: - 66,340 nouns, with type, gender, number and inflected forms (including irregular forms) - 12,030 verbs, with mood, tense, person, gender, number and inflected forms (including ir...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5500.00 €
![]() |
6500.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
8000.00 €
![]() |
8000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
7000.00 €
![]() |
8500.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
10000.00 €
![]() |
10000.00 €
![]() |


- Italian
ID: ELRA-L0070
ISLRN: 565-957-248-233-5This Italian lexicon is the same as the one described in ELRA-L0069, but with the addition of clitic verbs, which increases the number of inflected forms to 1,800,000 (still corresponding to 112,000 simple words lemmas). Half the lexicon is made up of clitic verbs. It contains: - 66,340 nouns, wi...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
6500.00 €
![]() |
8000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
10000.00 €
![]() |
10000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
8500.00 €
![]() |
10000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
12500.00 €
![]() |
12500.00 €
![]() |


- English
ID: ELRA-S0429
ISLRN: 703-740-233-998-1497 Italians recorded in a relatively quiet environment in authentic English. The recorded script is designed by linguists and covers a wide range of topics including generic, interactive, on-board and home. The text is manually proofread with high accuracy. It matches with mainstream Android an...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
53912.50 €
![]() |
53912.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
53912.50 €
![]() |
53912.50 €
![]() |
Special offers are also available. Check here for details.


- Italian
ID: ELRA-S0147
ISLRN: 458-657-455-735-5The Italian Speech Corpus 1 contains the recordings of 202 native Italian speakers (112 males, 90 females) recorded in an office and a closed public place, over 4 channels, in a range of low to medium background noise environments (Plantronics Audio 10 (computer/desk mic), Shure SM58 (desk mounte...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1200.00 €
![]() |
9500.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
9500.00 €
![]() |
9500.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1500.00 €
![]() |
15000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
15000.00 €
![]() |
15000.00 €
![]() |


- Italian
ID: ELRA-S0450
ISLRN: 217-750-727-467-7The data were recorded by 3,109 native Italian speakers with authentic Italian accents. The recorded content covers a wide range of categories such as general purpose, interactive, in car commands, home commands, etc. The recorded text is designed by a language expert, and the text is manually pr...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
342237.50 €
![]() |
342237.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
342237.50 €
![]() |
342237.50 €
![]() |
Special offers are also available. Check here for details.


- Italian
ID: ELRA-S0472
ISLRN: 341-812-724-006-1Italian speech data (reading) is collected from 325 Italian native speakers and is recorded in quiet environment. The recording is rich in content, covering multiple categories such as econimics, entertainment, news, and oral. Each sentence contains 9.2 words in average. Each sentence is repeated...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
38807.50 €
![]() |
38807.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
38807.50 €
![]() |
38807.50 €
![]() |
Special offers are also available. Check here for details.


- Italian
ID: ELRA-S0461
ISLRN: 382-599-484-763-7Italian languageaudio data captured by mobile phone , with total duration of 347 hours. It is recorded by 800 Italian native speakers, balanced in gender is balanced; the recording environment is quiet; all texts are manually transcribed with high accuracy. This data set can be applied on automat...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
62633.50 €
![]() |
62633.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
62633.50 €
![]() |
62633.50 €
![]() |
Special offers are also available. Check here for details.


- Italian
ID: ELRA-S0144
ISLRN: 513-325-829-468-0The Italian SpeechDat-Car database contains the recordings of 300 Italian speakers (149 females, 151 males) recorded over the GSM telephone network, in a car. This database is partitioned into 14 DVDs. The speech data files are in two formats. Four of the 5 microphones were recorded on the comput...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
90000.00 €
![]() |
90000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
90000.00 €
![]() |
90000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
120000.00 €
![]() |
120000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
120000.00 €
![]() |
120000.00 €
![]() |


- Italian
ID: ELRA-S0228-80
ISLRN: 789-295-563-911-2This corpus comprises 49,994 entries uttered by 50 speakers (23 males and 27 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit 48kHz for a total of 24.21hours of speech per channel.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5400.00 €
![]() |
5400.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
5400.00 €
![]() |
5400.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5400.00 €
![]() |
5400.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
5400.00 €
![]() |
5400.00 €
![]() |