1681 Language Resources (Page 17 of 85)


- English
- Modern Greek (1453-)
ID: ELRA-W0196
ISLRN: 114-726-489-848-3

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Convention, additional protocol on the convention, recom...

MEMBER | academic | commercial |
---|---|---|
Licence: Attribution - CC-BY-4.0 | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution - CC-BY-4.0 | 0.00 € | 0.00 € |


- Portuguese
ID: ELRA-S0367
ISLRN: 499-311-025-331-2

The CORAL corpus was collected in the framework of a national project sponsored by the PRAXIS XXI program, by a consortium formed by INESC, CLUL, FLUL (Faculdade de Letras da Universidade de Lisboa), and FCSH-UNL (Faculdade de Ciências Sociais e Humanas da Universidade Nova de Lisboa). The purpos...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 200.00 € | 750.00 € |
Licence: Commercial Use - ELRA VAR | 750.00 € | 750.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 400.00 € | 1500.00 € |
Licence: Commercial Use - ELRA VAR | 1500.00 € | 1500.00 € |


- French
- Italian
- Portuguese
- Spanish; Castilian
ID: ELRA-S0172
ISLRN: 318-977-046-077-4

The C-ORAL-ROM resource is a multilingual corpus of spontaneous speech for the main Romance languages of around 1,200,000 words (IST 2000-26228). The resource comprises three components: a) Multimedia corpus; b) Speech software; c) Appendix. The corpus consists of four comparable recor...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 1500.00 € | 10000.00 € |
Licence: Commercial Use - ELRA VAR | 10000.00 € | 10000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 3000.00 € | 20000.00 € |
Licence: Commercial Use - ELRA VAR | 20000.00 € | 20000.00 € |


- English
ID: ELRA-W0337
ISLRN: 478-366-550-085-8

The Corpus for fine-grained analysis and automatic detection of irony on Twitter was carefully annotated by trained annotators (Master’s students in Linguistics) using a detailed annotation scheme for irony categorization, which describes four labels: ‘ironic by means of a polarity contrast’, ‘si...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 100.00 € |
Licence: Commercial Use - ELRA VAR | 100.00 € | 100.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 200.00 € |
Licence: Commercial Use - ELRA VAR | 200.00 € | 200.00 € |


- Spanish; Castilian
ID: ELRA-W0041
ISLRN: 837-873-214-287-0

This corpus consists of 11 novels written in Castilian Spanish by Inmaculada Ferrer-Vidal Turull, a contemporary author. The list of novels consists of: - La búsqueda: 113,639 words - Tristeza: 41,125 words - Cuarto menguante: 42,419 words - Recuerdos: 55,694 words - Sucedió en Abril: 46,...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 400.00 € | 800.00 € |
Licence: Commercial Use - ELRA VAR | 800.00 € | 800.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 500.00 € | 1000.00 € |
Licence: Commercial Use - ELRA VAR | 1000.00 € | 1000.00 € |


- Icelandic
ID: ELRA-W0298
ISLRN: 420-670-865-427-1

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Corpus of Icelandic texts from the Central Bank of Icela...

MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI | 0.00 € | 0.00 € |




- English
- French
- Norwegian
- Spanish; Castilian
ID: ELRA-S0414
ISLRN: 631-345-309-445-9

The Corpus of Interactions between Seniors and an Empathic Virtual Coach in Spanish, French and Norwegian was built within the EMPATHIC project (Empathic, Expressive, Advanced Virtual Coach to Improve Independent Healthy-Life-Years of the Elderly), funded within the European Union’s Horizon 2020 ...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 500.00 € | 25000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 500.00 € | 25000.00 € |

Special offers are also available. Check here for details.


- Japanese
ID: ELRA-S0488
ISLRN: 280-594-494-328-0

The "Corpus of Spontaneous Japanese" (or CSJ) is a database containing a large collection of Japanese spoken language data and information for use in linguistic research; jointly developed by NINJAL, NICT and the Tokyo Institute of Technology, the CSJ is world-class in both the quantity and quali...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 345.00 € | 34500.00 € |
Licence: Commercial Use - ELRA VAR | 34500.00 € | 34500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 431.25 € | 43125.00 € |
Licence: Commercial Use - ELRA VAR | 43125.00 € | 43125.00 € |


- English
- Latvian
ID: ELRA-W0169
ISLRN: 636-211-843-827-4

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Latvian Web, home pages of ministries and state public s...

MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 | 0.00 € | 0.00 € |


- English
- Latvian
ID: ELRA-W0216
ISLRN: 389-271-130-137-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Contents of web site https://makroekonomika.lv/ -- Latv...

MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 | 0.00 € | 0.00 € |


- English
ID: ELRA-S0009
ISLRN: 615-316-959-146-3

The COST232 consortium collected a "Multi-English" speech database over the telephone in Europe. Originally, it had been planned to collect data only at FUB (Fondazione Ugo Bordoni) in Rome, but in the event it was also possible to make a collection at BT labs in the UK. A total of 797 "successfu...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 200.00 € |
Licence: Commercial Use - ELRA VAR | 200.00 € | 200.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 300.00 € |
Licence: Commercial Use - ELRA VAR | 300.00 € | 300.00 € |


- English
- French
- Spanish; Castilian
ID: ELRA-W0033
ISLRN: 052-466-219-226-4

The CRATER corpus was built upon the foundations of an earlier project, ET10/63, which was funded in the final phase of the Eurotra programme. The Corpus Resources and Terminology Extraction project (MLAP-93 20) extended the bilingual annotated English-French International Telecommunications Unio...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 25.00 € |
Licence: Commercial Use - ELRA VAR | 25.00 € | 25.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 125.00 € |
Licence: Commercial Use - ELRA VAR | 125.00 € | 125.00 € |


- English
- French
- Spanish; Castilian
ID: ELRA-W0003
ISLRN: 645-721-607-031-5

The Corpus Resources and Terminology Extraction project (MLAP-93 20) has extended the bilingual annotated English-French International Telecommunications Union corpus to include Spanish, and has also debugged the existing corpus. The offer consists of a multi-lingual aligned corpus of 1,000,000 t...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 20.00 € |
Licence: Commercial Use - ELRA VAR | 20.00 € | 20.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 100.00 € |
Licence: Commercial Use - ELRA VAR | 100.00 € | 100.00 € |


- Croatian
- English
ID: ELRA-W0142
ISLRN: 095-764-087-898-1

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English corpus with Acts on Biological and Land...

MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI | 0.00 € | 0.00 € |


- Croatian
- English
ID: ELRA-W0264
ISLRN: 378-424-378-060-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English corpus with statistical reports and stu...

MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI | 0.00 € | 0.00 € |


- Croatian
- English
ID: ELRA-W0266
ISLRN: 389-289-275-352-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English corpus with studies on the challenges t...

MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI | 0.00 € | 0.00 € |


- Croatian
- English
ID: ELRA-W0295
ISLRN: 994-259-799-079-0

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English corpus with the Rural Development Progr...

MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI | 0.00 € | 0.00 € |


- Croatian
- English
ID: ELRA-W0294
ISLRN: 403-261-656-321-0

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English parallel corpus from the website of the...

MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, No Derivatives - CC-BY-ND-3.0 | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, No Derivatives - CC-BY-ND-3.0 | 0.00 € | 0.00 € |


- Croatian
- English
ID: ELRA-W0292
ISLRN: 938-270-896-641-7

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English parallel corpus from the website of the...

MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI | 0.00 € | 0.00 € |


- Croatian
- English
ID: ELRA-W0291
ISLRN: 288-712-830-695-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English parallel corpus from the website of the...

MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI | 0.00 € | 0.00 € |