Resource Type:
Corpus: | ![]() |
Lexical/Conceptual: | ![]() |
Tool/Service: | ![]() |
Language Description: | ![]() |
Media Type:
Text: | ![]() |
Audio: | ![]() |
Image: | ![]() |
Video: | ![]() |
Text Numerical: | ![]() |
Text N-Gram: | ![]() |
671 Language Resources (Page 1 of 34)
« Previous | Next »Order by:


- Arabic
- English
ID: ELRA-W0123
ISLRN: 505-782-255-628-82007 CoNLL Shared Task - Arabic & English consists of dependency treebanks in two languages used as part of the CoNLL 2007 shared task on multi-lingual dependency parsing and domain adaptation. The languages covered in this release are: Arabic and English. The Conference on Computational Natur...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - Non Standard Licence Terms |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - Non Standard Licence Terms |


- English
- Ukrainian
ID: ELRA-M0104
ISLRN: 110-617-195-245-4The bilingual English-Ukrainian lexicon of named entities uses Wikipedia metadata as a source. The extracted named entity pairs are classified into five classes: PERSON, ORGANIZATION, LOCATION, PRODUCT, and MISC (miscellaneous). The lexicon consists of 624,168 pairs and comes in two formats: csv ...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0 |
0.00 €
![]() |
0.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0 |
0.00 €
![]() |
0.00 €
![]() |


- English
ID: ELRA-T0375
ISLRN: 699-305-362-089-6Automatic Term Recognition (ATR) is a research task that deals with the identification of domain-specific terms. Terms, in simple words, are textual realization of significant concepts in an expertise domain. Additionally, domain-specific terms may be classified into a number of categories, in wh...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
1000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
1000.00 €
![]() |
1000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
3000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
3000.00 €
![]() |
3000.00 €
![]() |


- Amharic
- English
ID: ELRA-W0074
ISLRN: 590-255-335-719-0The Amharic-English bilingual corpus contains parallel text from legal and news domains in Amharic script, in transliterated form and in English. The size of the corpus is of 232,653 words in Amharic and 291,701 in English. This parallel corpus contains documents from two domains, namely legal...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
2000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
2000.00 €
![]() |
2000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
4000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
4000.00 €
![]() |
4000.00 €
![]() |


- Arabic
- English
- French
ID: ELRA-W0323
ISLRN: 482-848-308-105-6The annotated tweet corpus in Arabizi, French and English was built by ELDA on behalf of INSA Rouen Normandie (Normandie Université, LITIS team), in the framework of the SAPhIRS project (System for the Analysis of Information Propagation in Social Networks), funded by the DGE (Direction Générale ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
7000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
7000.00 €
![]() |
7000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
10000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
10000.00 €
![]() |
10000.00 €
![]() |


- Arabic
- English
ID: ELRA-M0105
ISLRN: 161-842-321-771-2This database is part of the ArabLEX set of data which consists of the Database of Arabic General Vocabulary (DAG), Database of Arabic Place Names (DAP), Database of Foreign Names in Arabic (DAF) and Database of Arab Names (DAN) available from ELRA under references, respectively, ELRA-L0131, ELRA...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5000.00 €
![]() |
15000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
15000.00 €
![]() |
15000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
7000.00 €
![]() |
22000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
22000.00 €
![]() |
22000.00 €
![]() |
Special offers are also available. Check here for details.


- Arabic
- English
ID: ELRA-M0107
ISLRN: 773-974-582-139-4This database is part of the ArabLEX set of data which consists of the Database of Arabic General Vocabulary (DAG), Database of Arabic Place Names (DAP), Database of Foreign Names in Arabic (DAF) and Database of Arab Names (DAN) available from ELRA under references, respectively, ELRA-L0131, ELRA...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
15000.00 €
![]() |
45000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
45000.00 €
![]() |
45000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
24000.00 €
![]() |
71000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
71000.00 €
![]() |
71000.00 €
![]() |
Special offers are also available. Check here for details.


- Arabic
- English
ID: ELRA-M0106
ISLRN: 943-592-129-040-2This database is part of the ArabLEX set of data which consists of the Database of Arabic General Vocabulary (DAG), Database of Arabic Place Names (DAP), Database of Foreign Names in Arabic (DAF) and Database of Arab Names (DAN) available from ELRA under references, respectively, ELRA-L0131, ELRA...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
9000.00 €
![]() |
27000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
27000.00 €
![]() |
27000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
16000.00 €
![]() |
49000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
49000.00 €
![]() |
49000.00 €
![]() |
Special offers are also available. Check here for details.


- Arabic
- Chinese
- English
- French
- German
- Italian
- Japanese
- Modern Greek (1453-)
- Persian
- Russian
- Spanish; Castilian
ID: ELRA-E0018
ISLRN: 875-865-064-331-9The ARCADE II Evaluation Package was produced within the French national project ARCADE II (Evaluation of parallel text alignment systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The ARCADE II project enabled to carry out a cam...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
150.00 €
![]() |
500.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
300.00 €
![]() |
1000.00 €
![]() |


- English
- French
- Italian
ID: ELRA-W0018
ISLRN: 681-769-134-114-2The ARCADE/ROMANSEVAL corpus was used as a reference corpus in two international competitions: · ARCADE, an exercise on multilingual text alignment financed by AUPELF-UREF · ROMANSEVAL, part of the SENSEVAL exercise sponsored by ACL-SIGLEX and EURALEX, on word sense disambiguation. The corpus ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
2000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
2000.00 €
![]() |
2000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
5000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
5000.00 €
![]() |
5000.00 €
![]() |


- English
- French
- German
- Spanish; Castilian
ID: ELRA-T0097
ISLRN: 536-306-764-088-7Cards available: 1420 Languages: German, English, French, Spanish Card Description: Each card in this terminological database contains a definition, relation between concepts, graphics, abbreviations, notes, sub-domains, sources, grammatical labels.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1746.60 €
![]() |
1746.60 €
![]() |
Licence: Commercial Use - ELRA VAR |
1746.60 €
![]() |
1746.60 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
2911.00 €
![]() |
2911.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
2911.00 €
![]() |
2911.00 €
![]() |


- English
- French
- German
- Italian
- Spanish; Castilian
ID: ELRA-M0001
ISLRN: 874-922-751-076-4Entries: 30 000 each language Languages: French, English, Italian, German, Spanish Format: ASCII or ANSI with separators between entries Medium: CD-ROM The words are associated by the meaning. The lexical categories are: nouns (5 * 18 000), verbs (5 * 8 000), adjectives (5 * 6 000), adverbs (5 * ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
8861.00 €
![]() |
11077.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
11077.00 €
![]() |
11077.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
11077.00 €
![]() |
13846.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
13846.00 €
![]() |
13846.00 €
![]() |


- Basque
- English
ID: ELRA-M0049
ISLRN: 699-845-639-511-8The Basque WordNet is a lexical database including information about Basque words. It is an extension of WordNet 1.6, a lexical database for English developed at the Princeton University. The Basque WordNet is tightly aligned to the English WordNet. The Basque WordNet models nouns, verbs and ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
300.00 €
![]() |
3000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
4500.00 €
![]() |
4500.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
600.00 €
![]() |
6000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
9000.00 €
![]() |
9000.00 €
![]() |


- Bulgarian
- English
ID: ELRA-W0263
ISLRN: 182-772-814-980-2This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual Bulgarian-English corpus from the 2018 Proposa...
MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |


- Bulgarian
- English
ID: ELRA-W0173
ISLRN: 039-753-902-920-1This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual Bulgarian-English corpus of administrative doc...
MEMBER | academic | commercial |
---|---|---|
Licence: Other - Public Domain |
0.00 €
![]() |
0.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Public Domain |
0.00 €
![]() |
0.00 €
![]() |


- English
- Modern Greek (1453-)
ID: ELRA-W0132
ISLRN: 391-837-431-539-1This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A parallel corpus(Greek-English) regarding the Cyprus P...
MEMBER | academic | commercial |
---|---|---|
Licence: Other - Public Domain |
0.00 €
![]() |
0.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Public Domain |
0.00 €
![]() |
0.00 €
![]() |


- English
- Modern Greek (1453-)
ID: ELRA-W0244
ISLRN: 456-799-985-207-6This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A bilingual collection of translation units extracted fr...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 |
0.00 €
![]() |
0.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 |
0.00 €
![]() |
0.00 €
![]() |


- English
- German
ID: ELRA-M0013
ISLRN: 770-934-073-250-5The bilingual English-German collocational dictionary consists of around 69,000 English headwords, including concepts expressed by more than one word (e.g. "environmental awareness" (German:"Umweltbewusstsein") or "maximum level of possible taxation" (German:"steuerliche Belastbarkeit")) and comp...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
210.00 €
![]() |
210.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
210.00 €
![]() |
210.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
300.00 €
![]() |
300.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
300.00 €
![]() |
300.00 €
![]() |


- Croatian
- English
ID: ELRA-W0204
ISLRN: 789-854-428-995-7This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual Croatian-English Parallel Corpus of 21340 tran...
MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |


- Bulgarian
- English
ID: ELRA-W0133
ISLRN: 942-857-416-126-2This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual collection of documents in the field of Intern...
MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Open Under-PSI |
0.00 €
![]() |
0.00 €
![]() |
« Previous | Next »