Resource Type:
Corpus: | |
Lexical/Conceptual: | |
Tool/Service: | |
Language Description: |
Media Type:
Text: | |
Audio: | |
Image: | |
Video: | |
Text Numerical: | |
Text N-Gram: |
113 Language Resources (Page 2 of 6)
« Previous | Next »Order by:
- Portuguese
ID: ELRA-W0061
ISLRN: 133-035-138-613-6The CINTIL-DependencyBank (Silva and Branco, 2012) is a corpus of sentences annotated with their syntactic dependency graphs and grammatical function tags composed of 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), novels (399 ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
- Portuguese
ID: ELRA-W0056
ISLRN: 723-486-478-286-6The CINTIL-PropBank is a corpus of sentences annotated with their constituency structure and semantic role tags, composed of 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), and novels (399 sentences; 3,082 tokens). In addition,...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
- Portuguese
ID: ELRA-W0055
ISLRN: 411-691-515-701-9The CINTIL-TreeBank is a corpus of syntactic constituency trees of Portuguese texts composed of 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), novels (399 sentences; 3,082 tokens). In addition, there are 779 sentences (5,654 t...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
3000.00 €
|
3000.00 €
|
- English
ID: ELRA-E0042
ISLRN: 749-772-201-451-6The CLEF Initiative (Conference and Labs of the Evaluation Forum) promotes the systematic evaluation of information access systems through experimentation on shared tasks, with an emphasis on multilingual and multimodal information. The CLEFeHealth 2013 Task 3 Evaluation Package contains data ...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
0.00 €
|
0.00 €
|
- English
ID: ELRA-E0043
ISLRN: 725-020-897-275-7The CLEF Initiative (Conference and Labs of the Evaluation Forum) promotes the systematic evaluation of information access systems through experimentation on shared tasks, with an emphasis on multilingual and multimodal information. It is structured in two main parts: a series of Evaluation Labs ...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
0.00 €
|
0.00 €
|
- English
- French
- Spanish; Castilian
ID: ELRA-E0039
ISLRN: 460-370-870-489-0The Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) c...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
150.00 €
|
500.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
300.00 €
|
1000.00 €
|
Special offers are also available. Check here for details.
- English
ID: ELRA-W0337
ISLRN: 478-366-550-085-8The Corpus for fine-grained analysis and automatic detection of irony on Twitter was carefully annotated by trained annotators (Master’s students in Linguistics) using a detailed annotation scheme for irony categorization, which describes four labels: ‘ironic by means of a polarity contrast’, ‘si...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
100.00 €
|
Licence: Commercial Use - ELRA VAR |
100.00 €
|
100.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
200.00 €
|
Licence: Commercial Use - ELRA VAR |
200.00 €
|
200.00 €
|
- Spanish; Castilian
ID: ELRA-W0041
ISLRN: 837-873-214-287-0This corpus consists of 11 novels written in Castilian Spanish by Inmaculada Ferrer-Vidal Turull, a contemporaneous author. The list of novels consists of: - La búsqueda: 113,639 words - Tristeza: 41,125 words - Cuarto menguante: 42,419 words - Recuerdos: 55,694 words - Sucedió en Abril: 46,040 w...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
400.00 €
|
800.00 €
|
Licence: Commercial Use - ELRA VAR |
800.00 €
|
800.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
500.00 €
|
1000.00 €
|
Licence: Commercial Use - ELRA VAR |
1000.00 €
|
1000.00 €
|
- Icelandic
ID: ELRA-W0298
ISLRN: 420-670-865-427-1This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Corpus of Icelandic texts from the Central Bank of Icela...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Other - Open Under-PSI |
0.00 €
|
0.00 €
|
- Danish
ID: ELRA-W0318
ISLRN: 024-504-318-388-3The Danish Gigaword Project (DAGW) maintains a corpus for Danish with over a billion words. The general goals are to create a dataset that is: 1. representative; 2. accessible; 3. a suitable common starting point for Danish NLP models. The present version 1.0 was collected from various webs...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution - CC-BY-4.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution - CC-BY-4.0 |
0.00 €
|
0.00 €
|
- Danish
ID: ELRA-W0117
ISLRN: 213-212-351-142-5The Danish Propbank (DPB) is a multi-layer treebank, annotated not only with morphosyntactic, but also with semantic information, in particular propositions/frames with VerbNet classes and semantic roles for both arguments and satellites. In addition, the corpus has been annotated with 20 Named E...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
150.00 €
|
3000.00 €
|
Licence: Commercial Use - ELRA VAR |
5000.00 €
|
5000.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
800.00 €
|
7000.00 €
|
Licence: Commercial Use - ELRA VAR |
7000.00 €
|
7000.00 €
|
- French
ID: ELRA-E0035
ISLRN: 161-881-080-899-5DEFT (DEfi Fouille de Texte – Text Mining Challenge) organizes evaluation campaigns in the field of text mining. The topic of DEFT 2008 edition is related to the classification of texts by topics and genres. Automatic classification has multiple applications in text mining. Many application fiel...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
240.91 €
|
240.91 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
313.18 €
|
313.18 €
|
- German
ID: ELRA-W0083
ISLRN: 339-799-085-669-8The deL1L2IM corpus, created between May and August 2012 and last updated in August 2014, has been collected within the framework of a PhD project on the development of a learning method implying conversations with an artificial companion. This PhD work is presented as a qualitative investigation...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
0.00 €
|
Licence: Commercial Use - ELRA VAR |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
|
0.00 €
|
Licence: Commercial Use - ELRA VAR |
0.00 €
|
0.00 €
|
- Dutch; Flemish
ID: ELRA-W0019
ISLRN: 440-290-917-102-7The Dutch PAROLE Distributable Corpus is a 3 million words selection from the 20 million words Dutch PAROLE Reference corpus. The Dutch corpus annotation and checking was made accordingly to the common core PAROLE tagset. The Dutch data were also checked for type. The Dutch PAROLE Distributable...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
270.00 €
|
800.00 €
|
Licence: Commercial Use - ELRA VAR |
1600.00 €
|
1600.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
300.00 €
|
1300.00 €
|
Licence: Commercial Use - ELRA VAR |
2500.00 €
|
2500.00 €
|
Special offers are also available. Check here for details.
- French
ID: ELRA-E0034
ISLRN: 238-723-334-894-5The EASy Evaluation Package was produced within the French national project EASy (Evaluation of syntactic parsers of French), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The project enabled to carry out a campaign for the evaluation...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
150.00 €
|
500.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
300.00 €
|
1000.00 €
|
- German
- Italian
ID: ELRA-W0005
ISLRN: 869-857-775-378-7The objective is to provide a small but fine grained morphosyntactically tagged corpus, 50.000 running words for each of the two languages (Italian and German) to be used in research work on tagging methods and models. The text for German comes from the Frankfurter Rundschau extracted from the EC...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
20.00 €
|
20.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
45.00 €
|
45.00 €
|
- English
- Manipuri
ID: ELRA-W0316
ISLRN: 588-170-827-016-7The Ema-lon Manipuri Corpus consists of a set of resources for Manipuri language (locally known as Meiteilon) for the purpose of machine translation. The main source for these resources is the Sangai Express news website. The resources that constitute the present corpus are listed below: 1. EM C...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0 |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Non Commercial Use - CC-BY-NC-4.0 |
0.00 €
|
0.00 €
|
- French
ID: ELRA-E0022
ISLRN: 725-358-759-122-3The EQueR Evaluation Package was produced within the French national project EQueR (Evaluation campaign for Question-Answering systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The EQueR project enabled to carry out a campaign f...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
150.00 €
|
500.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
300.00 €
|
1000.00 €
|
- French
ID: ELRA-E0023
ISLRN: 340-228-754-954-9The EvaSy Evaluation Package was produced within the French national project EvaSy (Evaluation of speech synthesis systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The EvaSy project enabled to carry out a campaign for the evalu...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
150.00 €
|
500.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION |
300.00 €
|
1000.00 €
|
- German
ID: ELRA-W0330
ISLRN: 381-445-879-769-5This corpus consists of a collection of political speeches in German crawled from the online archive of the German presidency (Bundespraësident) and the Chancellery (Bundesregierung). For the German Presidency the speeches are available from July 1, 1984 to February 17, 2012 and the corpus con...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA |
0.00 €
|
0.00 €
|
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA |
0.00 €
|
0.00 €
|