Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
112 Language Resources (Page 1 of 6)
« Previous | Next »Order by:
- English
- Ukrainian
ID: ELRA-M0104
ISLRN: 110-617-195-245-4The bilingual English-Ukrainian lexicon of named entities uses Wikipedia metadata as a source. The extracted named entity pairs are classified into five classes: PERSON, ORGANIZATION, LOCATION, PRODUCT, and MISC (miscellaneous). The lexicon consists of 624,168 pairs and comes in two formats: csv ...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0 |
0.00 €
|
0.00 €
|
- Catalan; Valencian
ID: ELRA-W0327
ISLRN: 186-654-762-852-8The AnCora Catalan Corpus 2.0.0 is a corpus of 500,000 words annotated at different levels: - Lemma and Part of Speech, - Syntactic constituents and functions, - Argument structure and thematic roles, - Semantic classes of the verb, - Denotative type of deverbal nouns, - Nouns related to W...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Commercial Use - GPL |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Commercial Use - GPL |
0.00 €
|
0.00 €
|
- Spanish; Castilian
ID: ELRA-W0326
ISLRN: 252-495-813-736-1The AnCora Spanish Corpus 2.0.0 is a corpus of 500,000 words annotated at different levels: - Lemma and Part of Speech, - Syntactic constituents and functions, - Argument structure and thematic roles, - Semantic classes of the verb, - Denotative type of deverbal nouns, - Nouns related to W...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Commercial Use - GPL |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Commercial Use - GPL |
0.00 €
|
0.00 €
|
- Arabic
ID: ELRA-S0384
ISLRN: 866-568-447-697-8This speech corpus has been developed as part of a PhD work carried out by Nawar Halabi at the University of Southampton. The corpus was recorded through a Neumann TLM 103 Studio Microphone by one male speaker in South Levantine Arabic (Damascian accent) in a professional studio. The transcript w...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
9000.00 €
| |
Licence: Attribution - CC-BY |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
11200.00 €
| |
Licence: Attribution - CC-BY |
0.00 €
|
0.00 €
|
- English
- Modern Greek (1453-)
ID: ELRA-W0244
ISLRN: 456-799-985-207-6This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A bilingual collection of translation units extracted fr...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 |
0.00 €
|
0.00 €
|
- Croatian
- English
ID: ELRA-W0273
ISLRN: 732-156-538-451-4This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Contents of http://casopis-gradjevinar.hr were crawled, ...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution - CC-BY-4.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution - CC-BY-4.0 |
0.00 €
|
0.00 €
|
- Bulgarian
ID: ELRA-W0329
ISLRN: 832-960-876-604-2The Bulgarian Event Corpus is composed 324,905 tokens appropriate for training Named Entity Recognition (NER), Named Entity Linking (NEL) and Event Recognition models for Bulgarian in a multidomain context within Humanities. The texts are domain related. They include documents from the area of So...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-3.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: ? - CC-BY-SA-3.0 |
0.00 €
| |
Licence: Attribution, Share Alike - CC-BY-SA-3.0 |
0.00 €
|
- Bulgarian
ID: ELRA-W0328
ISLRN: 761-430-854-533-2The Bulgarian Treebank Corpus is composed of 156,149 tokens (11,138 sentences) coming from three main sources in the domain of Grammar Notebooks (1,391 sentences), News (6,698 sentences), Other (3,049 sentences). It is available with syntactical and morphological annotation on a sentence basis in...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-3.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-3.0 |
0.00 €
|
0.00 €
|
- Bulgarian
ID: ELRA-L0132
ISLRN: 188-702-981-369-5The Bulgarian Valency Frame Lexicon is composed of 9547 lexical entries organized by frames with 960 mappings to Princeton WordNet available in XML format. It is a treebank-driven resource of extracted valency frames from BulTreeBank. The frames were manually curated. The frames followed the surf...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-3.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-3.0 |
0.00 €
|
0.00 €
|
- Arabic
ID: ELRA-L0133
ISLRN: 462-532-124-988-8Comprehensive Arabic LEMmas is a lexicon covering a large list of Arabic lemmas and their corresponding inflected word forms (stems) with details (POS + Root). Each lexical entry represents a lemma followed by all its possible stems and each stem is enriched by its morphological features, especia...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use, No Derivatives - CC-BY-NC-ND |
0.00 €
|
0.00 €
|
Licence: Commercial Use - ELRA VAR |
5000.00 €
|
5000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use, No Derivatives - CC-BY-NC-ND |
0.00 €
|
0.00 €
|
Licence: Commercial Use - ELRA VAR |
7500.00 €
|
7500.00 €
|
- English
- Modern Greek (1453-)
ID: ELRA-W0196
ISLRN: 114-726-489-848-3This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Convention, additional protocol on the convention, recom...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution - CC-BY-4.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution - CC-BY-4.0 |
0.00 €
|
0.00 €
|
- Icelandic
ID: ELRA-W0298
ISLRN: 420-670-865-427-1This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Corpus of Icelandic texts from the Central Bank of Icela...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI |
0.00 €
|
0.00 €
|
- English
- Latvian
ID: ELRA-W0169
ISLRN: 636-211-843-827-4This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Latvian Web, home pages of ministries and state public s...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 |
0.00 €
|
0.00 €
|
- English
- Latvian
ID: ELRA-W0216
ISLRN: 389-271-130-137-6This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Contents of web site https://makroekonomika.lv/ -- Latv...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 |
0.00 €
|
0.00 €
|
- Croatian
- English
ID: ELRA-W0294
ISLRN: 403-261-656-321-0This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English parallel corpus from the website of the...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, No Derivatives - CC-BY-ND-3.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, No Derivatives - CC-BY-ND-3.0 |
0.00 €
|
0.00 €
|
- Croatian
- English
ID: ELRA-W0292
ISLRN: 938-270-896-641-7This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English parallel corpus from the website of the...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI |
0.00 €
|
0.00 €
|
- Croatian
- English
ID: ELRA-W0291
ISLRN: 288-712-830-695-6This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English parallel corpus from the website of the...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI |
0.00 €
|
0.00 €
|
- Croatian
- English
ID: ELRA-W0293
ISLRN: 677-053-215-388-2This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Croatian-English parallel corpus from the website of the...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI |
0.00 €
|
0.00 €
|
- Croatian
- English
ID: ELRA-M0108
ISLRN: 649-554-159-147-9CroaTPAS (Croatian Typed Predicate Argument Structures) is a bi-lingual lexicon in Croatian and English. It was created by manual annotation from the Croatian Web as Corpus and pattern creation using the Skema editor on the Sketch Engine platform. CroaTPAS is tailor-made to represent verb polysem...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0 |
0.00 €
|
0.00 €
|
- Danish
- English
ID: ELRA-W0157
ISLRN: 026-863-463-067-1This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Parallel texts Danish-English from the Danish Ministry o...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0 |
0.00 €
|
0.00 €
|
« Previous | Next »