Search and Browse – ELRA Catalogue

A Bilingual English-Ukrainian Lexicon of Named Entities Extracted from Wikipedia text

English
Ukrainian

ID: ELRA-M0104

The bilingual English-Ukrainian lexicon of named entities uses Wikipedia metadata as a source. The extracted named entity pairs are classified into five classes: PERSON, ORGANIZATION, LOCATION, PRODUCT, and MISC (miscellaneous). The lexicon consists of 624,168 pairs and comes in two formats: csv ...

MEMBER	academic	commercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0	0.00 €	0.00 €

Amharic-English bilingual corpus text

Amharic
English

ID: ELRA-W0074

ISLRN: 590-255-335-719-0

The Amharic-English bilingual corpus contains parallel text from legal and news domains in Amharic script, in transliterated form and in English. The size of the corpus is of 232,653 words in Amharic and 291,701 in English. This parallel corpus contains documents from two domains, namely legal...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	2000.00 €
Licence: Commercial Use - ELRA VAR	2000.00 €	2000.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	0.00 €	4000.00 €
Licence: Commercial Use - ELRA VAR	4000.00 €	4000.00 €

Basque WordNet text

Basque
English

ID: ELRA-M0049

ISLRN: 699-845-639-511-8

The Basque WordNet is a lexical database including information about Basque words. It is an extension of WordNet 1.6, a lexical database for English developed at the Princeton University. The Basque WordNet is tightly aligned to the English WordNet. The Basque WordNet models nouns, verbs and ...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	300.00 €	3000.00 €
Licence: Commercial Use - ELRA VAR	4500.00 €	4500.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	600.00 €	6000.00 €
Licence: Commercial Use - ELRA VAR	9000.00 €	9000.00 €

Bilingual Bulgarian-English corpus from the 2018 Proposal for a National Climate Change Adaptation Strategy and Action Plan from the website of the Bulgarian Ministry of Environment and Water (Processed) text

Bulgarian
English

ID: ELRA-W0263

ISLRN: 182-772-814-980-2

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual Bulgarian-English corpus from the 2018 Proposa...

MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

Bilingual Bulgarian-English corpus from the National Revenue Agency (BG) (Processed) text

Bulgarian
English

ID: ELRA-W0173

ISLRN: 039-753-902-920-1

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual Bulgarian-English corpus of administrative doc...

MEMBER	academic	commercial
Licence: Other - Public Domain	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Public Domain	0.00 €	0.00 €

Bilingual collection of reports of the Greek Public Power Corporation (Processed) text

English
Modern Greek (1453-)

ID: ELRA-W0244

ISLRN: 456-799-985-207-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A bilingual collection of translation units extracted fr...

MEMBER	academic	commercial
Licence: Attribution, Share Alike - CC-BY-SA-4.0	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Attribution, Share Alike - CC-BY-SA-4.0	0.00 €	0.00 €

Bilingual Croatian-English Parallel Corpus (Processed) text

Croatian
English

ID: ELRA-W0204

ISLRN: 789-854-428-995-7

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual Croatian-English Parallel Corpus of 21340 tran...

MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

Bilingual documents Bulgarian-English in the field of ICT and Transport (Processed) text

Bulgarian
English

ID: ELRA-W0133

ISLRN: 942-857-416-126-2

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual collection of documents in the field of Intern...

MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

Bilingual documents Bulgarian-English in the field of open data, broadband and information society (Processed) text

Bulgarian
English

ID: ELRA-W0134

ISLRN: 812-088-133-045-4

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English-Bulgarian collection in the field of open data, ...

MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

Bilingual hr-en parallel corpus from Croatian Mine Action website (Processed) text

Croatian
English

ID: ELRA-W0131

ISLRN: 789-620-257-493-3

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Contents of http://www.hcr.hr website downloaded, aligne...

MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

Bilingual hr-en parallel corpus from Croatian National Bank website (Processed) text

Croatian
English

ID: ELRA-W0226

ISLRN: 248-991-649-363-5

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Contents of http://www.hnb.hr were crawled, aligned on d...

MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

Bilingual hr-en parallel corpus from the Journal of the Croatian Association of Civil Engineers website (Processed) text

Croatian
English

ID: ELRA-W0273

ISLRN: 732-156-538-451-4

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Contents of http://casopis-gradjevinar.hr were crawled, ...

MEMBER	academic	commercial
Licence: Attribution - CC-BY-4.0	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Attribution - CC-BY-4.0	0.00 €	0.00 €

Bilingual hr-en parallel corpus from the National and University Library in Zagreb website (Processed) text

Croatian
English

ID: ELRA-W0135

ISLRN: 196-404-604-094-3

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Contents of http://www.nsk.hr were crawled, aligned on d...

MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

BMI Brochures 2011-2015 (Processed) text

English
German

ID: ELRA-W0200

ISLRN: 886-938-216-393-3

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English translations of German BMI brochures from the la...

MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

BMI Brochures and Website 2016 (Processed) text

English
German

ID: ELRA-W0199

ISLRN: 416-672-686-637-0

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual tmx file of German to English translations of ...

MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

BMVI Publications (Processed) text

English
German

ID: ELRA-W0197

ISLRN: 492-102-548-814-7

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. TMX file with 11555 TUs, bilingual German/English, publi...

MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

BMVI Website (Processed) text

English
German

ID: ELRA-W0198

ISLRN: 391-726-618-848-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. tmx file, 2718 TUs, bilingual German/English, texts from...

MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Open Under-PSI	0.00 €	0.00 €

Bulgarian WordNet text

Bulgarian
English

ID: ELRA-M0041

ISLRN: 941-120-951-927-7

The Bulgarian WordNet is a network of lexical-semantic relations, an electronic thesaurus with a structure modelled on that of the Princeton WordNet and those constructed in the EuroWordNet and BalkaNet project. Bulgarian WordNet describes meaning of a lexical unit by placing it within a network ...

MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	300.00 €	3000.00 €
Licence: Commercial Use - ELRA VAR	4500.00 €	4500.00 €

NON MEMBER	academic	commercial
Licence: Non Commercial Use - ELRA END USER	600.00 €	6000.00 €
Licence: Commercial Use - ELRA VAR	9000.00 €	9000.00 €

Central Statistical Office Dataset (Processed) text

English
Polish

ID: ELRA-W0174

ISLRN: 268-175-960-200-0

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Two Polish-English publications of the Polish Central St...

MEMBER	academic	commercial
Licence: Other - Public Domain	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Public Domain	0.00 €	0.00 €

Civil Aviation Regulations (Processed) text

English
Polish

ID: ELRA-W0186

ISLRN: 792-786-685-848-5

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A collection of parallel Polish-English texts published ...

MEMBER	academic	commercial
Licence: Other - Public Domain	0.00 €	0.00 €

NON MEMBER	academic	commercial
Licence: Other - Public Domain	0.00 €	0.00 €

Corpus:
Lexical/Conceptual:
Tool/Service:
Language Description:

Text:
Audio:
Image:
Video:
Text Numerical:
Text N-Gram:

Resource Type:

Media Type:

186 Language Resources (Page 1 of 10)