Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
59 Language Resources (Page 1 of 3)
« Previous | Next »Order by:
- Arabic
- Czech
ID: ELRA-W0087
ISLRN: 798-485-294-792-12006 CoNLL Shared Task – Arabic & Czech consists of dependency treebanks used as part of the CoNLL 2006 shared task on multi-lingual dependency parsing. The Conference on Computational Natural Language Learning (CoNLL) is accompanied every year by a shared task intended to promote natural lan...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - Non Standard Licence Terms |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - Non Standard Licence Terms |
- Arabic
ID: ELRA-W0030
ISLRN: 365-777-769-398-7The corpus was developed in the course of a research project at the University of Essex, in collaboration with the Open University. The corpus contains Al-Hayat newspaper articles with value added for Language Engineering and Information Retrieval applications development purposes. The data have ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
480.00 €
|
960.00 €
|
Licence: Commercial Use - ELRA VAR |
960.00 €
|
960.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
720.00 €
|
1440.00 €
|
Licence: Commercial Use - ELRA VAR |
1440.00 €
|
1440.00 €
|
- Arabic
ID: ELRA-W0027
ISLRN: 083-457-618-309-8The An-Nahar Lebanon Newspaper Text Corpus comprises articles in standard Arabic from 1995 to 2000 (6 years) stored as HTML files on CDRom media. Each year contains 45 000 articles and 24 million words. Each article includes information such as title, newspaper's name, date, country, type, page, ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
2016.00 €
|
3192.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3024.00 €
|
4788.00 €
|
Special offers are also available. Check here for details.
- Arabic
- English
- French
ID: ELRA-W0323
ISLRN: 482-848-308-105-6The annotated tweet corpus in Arabizi, French and English was built by ELDA on behalf of INSA Rouen Normandie (Normandie Université, LITIS team), in the framework of the SAPhIRS project (System for the Analysis of Information Propagation in Social Networks), funded by the DGE (Direction Générale ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
7000.00 €
|
Licence: Commercial Use - ELRA VAR |
7000.00 €
|
7000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
10000.00 €
|
Licence: Commercial Use - ELRA VAR |
10000.00 €
|
10000.00 €
|
- Arabic
ID: ELRA-L0098
ISLRN: 049-623-948-389-2The Arabic dictionary of inflected words consists of a list of 6 million inflected forms, fully vowelized, generated in compliance with the grammatical rules of Arabic and tagged with grammatical information which includes POS and grammatical features, including number, gender, case, definiteness...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3000.00 €
|
10000.00 €
|
Licence: Commercial Use - ELRA VAR |
10000.00 €
|
10000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
4500.00 €
|
15000.00 €
|
Licence: Commercial Use - ELRA VAR |
15000.00 €
|
15000.00 €
|
- Arabic
ID: ELRA-L0099
ISLRN: 963-860-792-289-9This dictionary consists of 6 million inflected forms, fully vowelized, generated in compliance with the grammatical rules of Arabic and tagged with grammatical information which includes POS and grammatical features, including number, gender, case, definiteness, tense, mood and compatibility wit...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
25000.00 €
|
25000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
37000.00 €
|
37000.00 €
|
- Arabic
ID: ELRA-L0088
ISLRN: 472-591-121-577-5The Arabic Morphological Dictionary contains 4,912,749 entries, including: - 3,374,852 nouns, - 1,537,699 verbs, - 198 grammatical words. The dictionary is stored on 1 CD. All files are provided as plain text in UTF8 character encoding, which represents about 154 Mb of data. The dictionary form...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
250.00 €
|
6000.00 €
|
Licence: Commercial Use - ELRA VAR |
6000.00 €
|
6000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
450.00 €
|
12000.00 €
|
Licence: Commercial Use - ELRA VAR |
12000.00 €
|
12000.00 €
|
- Arabic
ID: ELRA-L0131
ISLRN: 879-334-992-724-8This database is part of the ArabLEX set of data which consists of the Database of Arabic General Vocabulary (DAG), Database of Arabic Place Names (DAP), Database of Foreign Names in Arabic (DAF) and Database of Arab Names (DAN) available from ELRA under references, respectively, ELRA-L0131, ELRA...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
22000.00 €
|
66000.00 €
|
Licence: Commercial Use - ELRA VAR |
66000.00 €
|
66000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
42000.00 €
|
125000.00 €
|
Licence: Commercial Use - ELRA VAR |
125000.00 €
|
125000.00 €
|
Special offers are also available. Check here for details.
- Arabic
- English
ID: ELRA-M0105
ISLRN: 161-842-321-771-2This database is part of the ArabLEX set of data which consists of the Database of Arabic General Vocabulary (DAG), Database of Arabic Place Names (DAP), Database of Foreign Names in Arabic (DAF) and Database of Arab Names (DAN) available from ELRA under references, respectively, ELRA-L0131, ELRA...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5000.00 €
|
15000.00 €
|
Licence: Commercial Use - ELRA VAR |
15000.00 €
|
15000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
7000.00 €
|
22000.00 €
|
Licence: Commercial Use - ELRA VAR |
22000.00 €
|
22000.00 €
|
Special offers are also available. Check here for details.
- Arabic
- English
ID: ELRA-M0107
ISLRN: 773-974-582-139-4This database is part of the ArabLEX set of data which consists of the Database of Arabic General Vocabulary (DAG), Database of Arabic Place Names (DAP), Database of Foreign Names in Arabic (DAF) and Database of Arab Names (DAN) available from ELRA under references, respectively, ELRA-L0131, ELRA...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
15000.00 €
|
45000.00 €
|
Licence: Commercial Use - ELRA VAR |
45000.00 €
|
45000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
24000.00 €
|
71000.00 €
|
Licence: Commercial Use - ELRA VAR |
71000.00 €
|
71000.00 €
|
Special offers are also available. Check here for details.
- Arabic
- English
ID: ELRA-M0106
ISLRN: 943-592-129-040-2This database is part of the ArabLEX set of data which consists of the Database of Arabic General Vocabulary (DAG), Database of Arabic Place Names (DAP), Database of Foreign Names in Arabic (DAF) and Database of Arab Names (DAN) available from ELRA under references, respectively, ELRA-L0131, ELRA...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
9000.00 €
|
27000.00 €
|
Licence: Commercial Use - ELRA VAR |
27000.00 €
|
27000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
16000.00 €
|
49000.00 €
|
Licence: Commercial Use - ELRA VAR |
49000.00 €
|
49000.00 €
|
Special offers are also available. Check here for details.
- Arabic
- Chinese
- English
- French
- German
- Italian
- Japanese
- Modern Greek (1453-)
- Persian
- Russian
- Spanish; Castilian
ID: ELRA-E0018
ISLRN: 875-865-064-331-9The ARCADE II Evaluation Package was produced within the French national project ARCADE II (Evaluation of parallel text alignment systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The ARCADE II project enabled to carry out a cam...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
150.00 €
|
500.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
300.00 €
|
1000.00 €
|
- Arabic
ID: ELRA-L0136
ISLRN: 034-867-750-463-4The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Arabic (MSA) consists of 22,000 lemmas (17,000,00...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
85000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
85000.00 €
|
- Arabic
ID: ELRA-L0151
ISLRN: 898-259-529-174-3As a complement to the generic vocabulary provided in ELRA-L0136, language variants of Arabic are provided with the following features: Voice, Tense, Mood, Person, Number, Gender, Case, Definiteness, Pronominal Clitics, Category (except for Arabic MSA). Variants are distributed as follows: - A...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
185000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
185000.00 €
|
- Arabic
ID: ELRA-L0133
ISLRN: 462-532-124-988-8Comprehensive Arabic LEMmas is a lexicon covering a large list of Arabic lemmas and their corresponding inflected word forms (stems) with details (POS + Root). Each lexical entry represents a lemma followed by all its possible stems and each stem is enriched by its morphological features, especia...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use, No Derivatives - CC-BY-NC-ND |
0.00 €
|
0.00 €
|
Licence: Commercial Use - ELRA VAR |
5000.00 €
|
5000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use, No Derivatives - CC-BY-NC-ND |
0.00 €
|
0.00 €
|
Licence: Commercial Use - ELRA VAR |
7500.00 €
|
7500.00 €
|
- Arabic
- English
- French
ID: ELRA-E0020
ISLRN: 809-316-046-724-8The CESTA Evaluation Package was produced within the French national project CESTA (Evaluation of MT systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The CESTA project enabled to carry out a campaign for the evaluation of machi...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
150.00 €
|
500.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
300.00 €
|
1000.00 €
|
- Arabic
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hindi
- Italian
- Japanese
- Korean
- Modern Greek (1453-)
- Norwegian
- Persian
- Polish
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
- Thai
- Turkish
- Vietnamese
ID: ELRA-T0377
ISLRN: 452-383-219-228-0The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank, distributed separately under reference ELRA-T0376) and a multilingual set of sentences in 28 languages (the PhraseBank). The PhraseBank consists of 2,000 p...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1680.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
2240.00 €
|
- Arabic
- Bengali
- Chinese
- Croatian
- Czech
- Danish
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hindi
- Italian
- Japanese
- Korean
- Malayalam
- Modern Greek (1453-)
- Norwegian
- Polish
- Portuguese
- Romanian; Moldavian; Moldovan
- Russian
- Spanish; Castilian
- Swedish
- Tamil
- Thai
- Turkish
- Ukrainian
- Vietnamese
ID: ELRA-T0376
ISLRN: 990-814-402-335-7The Collins Multilingual database covers Real Life Daily vocabulary. It is composed of a multilingual lexicon in 32 languages (the WordBank) and a multilingual set of sentences in 28 languages (the PhraseBank, distributed separately under reference ELRA-T0377). The WordBank contains 10,000 words...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
2400.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3600.00 €
|
- Arabic
- Chinese
- Japanese
- Korean
ID: ELRA-M0071
ISLRN: 476-146-877-598-3Comprehensive monolingual word lists for both Simplified and Traditional Chinese, Japanese, Korean and Arabic, including a full-form Arabic word list. For Simplified and Traditional Chinese, Japanese and Korean, we provide readings as well, making them ideal for speech-related applications such...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
30000.00 €
|
50000.00 €
|
Licence: Commercial Use - ELRA VAR |
60000.00 €
|
100000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
37500.00 €
|
62500.00 €
|
Licence: Commercial Use - ELRA VAR |
75000.00 €
|
125000.00 €
|
« Previous | Next »