Resource Type:
Corpus: | ![]() |
Lexical/Conceptual: | ![]() |
Tool/Service: | ![]() |
Language Description: | ![]() |
Media Type:
Text: | ![]() |
Audio: | ![]() |
Image: | ![]() |
Video: | ![]() |
Text Numerical: | ![]() |
Text N-Gram: | ![]() |
1681 Language Resources (Page 11 of 85)
« Previous | Next »Order by:


- Basque
ID: ELRA-S0153
ISLRN: 941-344-942-204-3Bizkaifon contains sound archives and associated information of dialectal varieties of spoken Basque. The database was collected by the Department of Electronics and Telecommunications, University of the Basque Country, with the financial help of the Diputación Foral de Bizkaia. It consists of 21...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1000.00 €
![]() |
1000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
1000.00 €
![]() |
1000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1000.00 €
![]() |
1000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
1000.00 €
![]() |
1000.00 €
![]() |


- English
- German
ID: ELRA-W0200
ISLRN: 886-938-216-393-3This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English translations of German BMI brochures from the la...
MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |


- English
- German
ID: ELRA-W0199
ISLRN: 416-672-686-637-0This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual tmx file of German to English translations of ...
MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |


- English
- German
ID: ELRA-W0197
ISLRN: 492-102-548-814-7This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. TMX file with 11555 TUs, bilingual German/English, publi...
MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |


- English
- German
ID: ELRA-W0198
ISLRN: 391-726-618-848-6This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. tmx file, 2718 TUs, bilingual German/English, texts from...
MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |


- Portuguese
ID: ELRA-L0034
ISLRN: 654-505-941-943-8BrasiLEX is a multifunctional monolingual lexicon of the Brazilian variety of Portuguese, developed by the Natural Language Group of INESC. It has about 65,000 entries (lemmas) and 1,600 correspondent inflexion paradigms. The set of entries includes compound words and the inflexion paradigms incl...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3000.00 €
![]() |
25000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
25000.00 €
![]() |
25000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5000.00 €
![]() |
30000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
30000.00 €
![]() |
30000.00 €
![]() |
This resource is also available in a bundle. Check here for bundled pricing.


- Portuguese
ID: ELRA-S0445
ISLRN: 767-329-448-534-2The data volumn is 1044 hours and is recorded by 2038 Brazilian native speakers. The recording text is designed by linguistic experts, which covers general interactive, in-car and home category. The texts are manually proofread with high accuracy. Recording devices are mainstream Android phones a...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
247950.00 €
![]() |
247950.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
247950.00 €
![]() |
247950.00 €
![]() |
Special offers are also available. Check here for details.


- Portuguese
ID: ELRA-S0228-74
ISLRN: 403-396-918-176-7This corpus comprises 99,804 entries uttered by 50 speakers (25 males and 25 females), recorded over 4 channels (desktop in quiet office/home). Speech samples are stored as a sequence of 16-bit 44.1kHz for a total of 37.3 hours of speech per channel.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5400.00 €
![]() |
5400.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
5400.00 €
![]() |
5400.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5400.00 €
![]() |
5400.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
5400.00 €
![]() |
5400.00 €
![]() |


- French
ID: ELRA-S0067
ISLRN: 843-228-642-422-1BREF-120 resulted from the efforts of LIMSI-CNRS researchers under sponsorship from the GDR-PRC CHM, the ACCT (OFIL), the EEC (ESPRIT Polyglot project), and the Aupelf-Uref. A sub-set of BREF-120 is BREF-80 (ELRA-S0006), which consists of about 50-60 sentences per speaker and recordings conducted...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
2500.00 €
![]() |
10000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
10000.00 €
![]() |
10000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
4000.00 €
![]() |
15000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
15000.00 €
![]() |
15000.00 €
![]() |


- French
ID: ELRA-S0006
ISLRN: 310-036-258-354-7The BREF corpus was designed to provide enough read speech data for the development and evaluation of continuous speech recognition systems (both speaker-dependent and speaker-independent), and to provide a large corpus of continuous speech for the acquisition of acoustic-phonetic knowledge of sp...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
400.00 €
![]() |
3000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
3000.00 €
![]() |
3000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
800.00 €
![]() |
6000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
6000.00 €
![]() |
6000.00 €
![]() |


- French
ID: ELRA-S0007
ISLRN: 382-431-956-363-1The BREF-Polyglot is a sub-corpus of the BREF corpus (1 ISO9660 CDROM); it contains speaker-dependent training data from 6 speakers. There are a total of 3193 sentences (2 signal files for each sentence), on average 530 per speaker. While this data represents only a small portion of the entire BR...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
400.00 €
![]() |
3000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
3000.00 €
![]() |
3000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
800.00 €
![]() |
6000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
6000.00 €
![]() |
6000.00 €
![]() |


- English
ID: ELRA-S0474
ISLRN: 604-288-560-387-1It collects 201 British children. The recordings are mainly children textbooks, storybooks. The average sentence length is 4.68 words and the average sentence repetition rate is 6.6 times. This data is recorded by high fidelity microphone. The text is manually transcribed with high accuracy. ...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
39187.50 €
![]() |
39187.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
39187.50 €
![]() |
39187.50 €
![]() |
Special offers are also available. Check here for details.


- English
ID: ELRA-S0228-96
ISLRN: 732-482-893-782-4This corpus comprises 19,196 entries uttered by 30 speakers (15 males and 15 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit 44.1kHz for a total of 3.65 hours of speech per channel.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5400.00 €
![]() |
5400.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
5400.00 €
![]() |
5400.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5400.00 €
![]() |
5400.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
5400.00 €
![]() |
5400.00 €
![]() |


- English
ID: ELRA-L0058
ISLRN: 875-872-158-794-8BESL is a complete database of the English lexicon. It consists of over 230,000 lemmas, over 350,000 word forms, 60,000 proper nouns, 3,000 abbreviations, and 58,000 multi-word compound nouns. Each headword is provided with a full listing of all inflected forms and other morphological variation. ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
7000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
10000.00 €
![]() |


- English
ID: ELRA-S0448
ISLRN: 542-952-231-001-2831 Hours–Mobile Telephony British English Speech Data, which is recorded by 1651 native British speakers. The recording contents cover many categories such as generic, interactive, in-car and smart home. The texts are manually proofreaded to ensure a high accuracy rate. The database matchs the A...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
213151.50 €
![]() |
213151.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
213151.50 €
![]() |
213151.50 €
![]() |
Special offers are also available. Check here for details.


- English
ID: ELRA-S0466
ISLRN: 825-851-392-960-9The data set contains 346 British English speakers' speech data, all of whom are English locals. Around 392 sentences of each speaker. The valid data is 199 hours. Recording environment is quiet. Recording contents contain various categories like economics, news, entertainment, commonly used spok...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
47262.50 €
![]() |
47262.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
47262.50 €
![]() |
47262.50 €
![]() |
Special offers are also available. Check here for details.


- English
ID: ELRA-S0131
ISLRN: 804-196-753-996-4The British English SpeechDat-Car database contains the recordings of 300 British English speakers from 6 different regions (170 males, 130 females), recorded over the GSM telephone network, in a car. This database is partitioned into 115 CDs (DVDs are also available). The speech data files are ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
90000.00 €
![]() |
90000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
90000.00 €
![]() |
90000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
120000.00 €
![]() |
120000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
120000.00 €
![]() |
120000.00 €
![]() |


- English
ID: ELRA-S0097
ISLRN: 575-262-304-348-7The British English SpeechDat(II) FDB-4000 database contains the recordings of 4,000 British English speakers (1,968 males, 2,032 females) recorded over the British fixed telephone network. This database is partitioned into 20 CDs. Speech samples are stored as sequences of 8-bit 8 kHz A-law. Eac...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
35000.00 €
![]() |
45000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
45000.00 €
![]() |
45000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
45000.00 €
![]() |
55000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
55000.00 €
![]() |
55000.00 €
![]() |


- English
ID: ELRA-S0074
ISLRN: 424-526-381-046-7The British English SpeechDat(II) MDB-1000 database contains the recordings of 1,000 British speakers recorded over the GSM digital mobile network. The MDB-1000 database is partitioned into 5 CDs in ISO 9660 format. Speech samples are stored as sequences of 8-bit 8 kHz A-law. Each prompted utter...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
10000.00 €
![]() |
19000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
19000.00 €
![]() |
19000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
12000.00 €
![]() |
24000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
24000.00 €
![]() |
24000.00 €
![]() |


- English
ID: ELRA-S0098
ISLRN: 007-575-120-102-1The British English SpeechDat(II) SDB-2400 database is designed for development and assessment of speaker verification and identification systems. It contains the recordings of 120 speakers who uttered 22 items 20 times, and was collected over the fixed and mobile telephone networks in quiet and ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
32000.00 €
![]() |
39000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
39000.00 €
![]() |
39000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
39000.00 €
![]() |
47000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
47000.00 €
![]() |
47000.00 €
![]() |