Text (1052)
Audio (679)
Video (23)
True (226)
TEI (10)
TMX (6)

Resource Type:

Corpus:
Lexical/Conceptual:
Tool/Service:
Language Description:

Media Type:

Text:
Audio:
Image:
Video:
Text Numerical:
Text N-Gram:

1681 Language Resources (Page 33 of 85)

« Previous | Next »Order by:

 ILSP/ELEFTHEROTYPIA Corpus (Greek corpus)    
  • Modern Greek (1453-)

ID: ELRA-W0022

ISLRN: 002-552-644-443-1

The ILSP/ELEFTHEROTYPIA Corpus contains approximately 3 million words classified and annotated according to the common core PAROLE encoding standard. Thus, each file is classified according to the parameters of Medium, Topic and Genre, and structurally annotated at paragraph level (CES Level 1). ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
850.00 € submit
850.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1275.00 € submit
1275.00 € submit
 Indian English Speech Data by Mobile Phone - 1,012 Hours    
  • English

ID: ELRA-S0456

ISLRN: 001-453-575-915-4

Indian English audio data captured by mobile phones, 1,012 hours in total, recorded by 2,100 Indian native speakers. The recorded text is designed by linguistic experts, covering generic, interactive, on-board, home and other categories. The text has been proofread manually with high accuracy; ...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
153824.00 € submit
153824.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
153824.00 € submit
153824.00 € submit

Special offers are also available. Check here for details.

 Indonesian Speech Data by Mobile Phone - 639 Hours    
  • Indonesian

ID: ELRA-S0439

ISLRN: 394-545-170-456-2

1285 Indonesian native speakers participated in the recording with authentic accent. The recorded script is designed by linguists and cover a wide range of topics including generic, interactive, on-board and home. The text is manually proofread with high accuracy. It matches with mainstream Andro...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
103198.50 € submit
103198.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
103198.50 € submit
103198.50 € submit

Special offers are also available. Check here for details.

 Indonesian Speech Data by Mobile Phone_R - 359 Hours    
  • Indonesian

ID: ELRA-S0470

ISLRN: 311-413-414-907-0

Indonesia speech data (reading) is collected from 496 Indonesian native speakers and is recorded in quiet environment. The recording is rich in content, covering multiple categories such as econimics, entertainment, news, figure, letter, and oral. Around 400 sentences for each speaker. The valid ...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
57978.50 € submit
57978.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
57978.50 € submit
57978.50 € submit

Special offers are also available. Check here for details.

 Indonesian Speech Recognition Corpus (Desktop)    
  • Indonesian

ID: ELRA-S0228-115

ISLRN: 238-085-521-885-2

This corpus was recorded in a quiet office environment over 4 channels and collected from a total of 200 speakers, including 97 males and 103 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news and da...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
21600.00 € submit
21600.00 € submit
Licence: Commercial Use - ELRA VAR
21600.00 € submit
21600.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
21600.00 € submit
21600.00 € submit
Licence: Commercial Use - ELRA VAR
21600.00 € submit
21600.00 € submit
 Insurance (Termcat)    
  • Catalan; Valencian
  • English
  • Spanish; Castilian

ID: ELRA-T0094

ISLRN: 723-632-688-733-6

Insurance contracts, private and public insurance, resource terminology used within European Union institutions. Cards available: 1000 Languages: Catalan, Spanish, English Format: ASCII Medium: floppy disk Card Description: Each card in this terminological database contains a definition, abb...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
Licence: Commercial Use - ELRA VAR
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
Licence: Commercial Use - ELRA VAR
 International Agreements (Processed)    
  • English
  • Latvian

ID: ELRA-W0158

ISLRN: 810-722-062-476-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. International Agreements have been translated into natio...

MEMBERacademiccommercial
Licence: Attribution, Share Alike - CC-BY-SA-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution, Share Alike - CC-BY-SA-4.0
0.00 € submit
0.00 € submit
 ÌròyìnSpeech    
  • Yoruba

ID: ELRA-S0492

ISLRN: 012-405-700-001-6

A modern, high-fidelity, multi-speaker, Yorùbá read speech corpus suitable for Speech Synthesis, Automatic Speech Recognition and Computational Linguistics research. The subject matter is drawn from the Broadcast News domain as well as fictional texts, delivering a multi-purpose, contemporary spe...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
11200.00 € submit
Licence: Commercial Use - ELRA VAR
11200.00 € submit
11200.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
12000.00 € submit
Licence: Commercial Use - ELRA VAR
12000.00 € submit
12000.00 € submit
 ISLE Speech Corpus    
  • English

ID: ELRA-S0083

ISLRN: 723-960-059-948-7

Approx. 20 minutes of speech (per speaker) from 23 German and 23 Italian intermediate learners of English. Each speaker recorded sentences from several blocks of differing types (reading simple sentences, using minimal pairs, giving answers to multiple choice questions). The prompts were of varyi...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
1500.00 € submit
 Italian English Speech Recognition Corpus (Mobile)    
  • English

ID: ELRA-S0228-108

ISLRN: 554-977-743-197-5

This corpus was recorded in a quiet office/home environment over 3 channels and collected from a total of 213 speakers, including 103 males and 110 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
25200.00 € submit
25200.00 € submit
Licence: Commercial Use - ELRA VAR
25200.00 € submit
25200.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
25200.00 € submit
25200.00 € submit
Licence: Commercial Use - ELRA VAR
25200.00 € submit
25200.00 € submit
 Italian Kids Speech Recognition Corpus (Desktop)    
  • Italian

ID: ELRA-S0228-98

ISLRN: 501-616-216-038-6

This corpus comprises 19,788 entries uttered by 31 speakers (15 males and 16 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit 44.1kHz for a total of 4.9 hours of speech per channel.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
 Italian lexicon with morphological information    
  • Italian

ID: ELRA-L0069

ISLRN: 840-625-201-574-7

This Italian lexicon is made up of 862,500 inflected forms corresponding to 112,000 simple word lemmas. It contains: - 66,340 nouns, with type, gender, number and inflected forms (including irregular forms) - 12,030 verbs, with mood, tense, person, gender, number and inflected forms (including ir...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5500.00 € submit
6500.00 € submit
Licence: Commercial Use - ELRA VAR
8000.00 € submit
8000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
7000.00 € submit
8500.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit
 Italian lexicon with morphological information and clitic verbs    
  • Italian

ID: ELRA-L0070

ISLRN: 565-957-248-233-5

This Italian lexicon is the same as the one described in ELRA-L0069, but with the addition of clitic verbs, which increases the number of inflected forms to 1,800,000 (still corresponding to 112,000 simple words lemmas). Half the lexicon is made up of clitic verbs. It contains: - 66,340 nouns, wi...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6500.00 € submit
8000.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
8500.00 € submit
10000.00 € submit
Licence: Commercial Use - ELRA VAR
12500.00 € submit
12500.00 € submit
 Italian Speaking English Speech Data by Mobile Phone - 227 Hours    
  • English

ID: ELRA-S0429

ISLRN: 703-740-233-998-1

497 Italians recorded in a relatively quiet environment in authentic English. The recorded script is designed by linguists and covers a wide range of topics including generic, interactive, on-board and home. The text is manually proofread with high accuracy. It matches with mainstream Android an...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
53912.50 € submit
53912.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
53912.50 € submit
53912.50 € submit

Special offers are also available. Check here for details.

 Italian Speech Corpus 1 (Appen)    
  • Italian

ID: ELRA-S0147

ISLRN: 458-657-455-735-5

The Italian Speech Corpus 1 contains the recordings of 202 native Italian speakers (112 males, 90 females) recorded in an office and a closed public place, over 4 channels, in a range of low to medium background noise environments (Plantronics Audio 10 (computer/desk mic), Shure SM58 (desk mounte...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1200.00 € submit
9500.00 € submit
Licence: Commercial Use - ELRA VAR
9500.00 € submit
9500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1500.00 € submit
15000.00 € submit
Licence: Commercial Use - ELRA VAR
15000.00 € submit
15000.00 € submit
 Italian Speech Data by Mobile Phone - 1,441 Hours    
  • Italian

ID: ELRA-S0450

ISLRN: 217-750-727-467-7

The data were recorded by 3,109 native Italian speakers with authentic Italian accents. The recorded content covers a wide range of categories such as general purpose, interactive, in car commands, home commands, etc. The recorded text is designed by a language expert, and the text is manually pr...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
342237.50 € submit
342237.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
342237.50 € submit
342237.50 € submit

Special offers are also available. Check here for details.

 Italian Speech Data by Mobile Phone_Reading - 215 Hours    
  • Italian

ID: ELRA-S0472

ISLRN: 341-812-724-006-1

Italian speech data (reading) is collected from 325 Italian native speakers and is recorded in quiet environment. The recording is rich in content, covering multiple categories such as econimics, entertainment, news, and oral. Each sentence contains 9.2 words in average. Each sentence is repeated...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
38807.50 € submit
38807.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
38807.50 € submit
38807.50 € submit

Special offers are also available. Check here for details.

 Italian Speech Data Collected by Mobile Phone - 347 Hours    
  • Italian

ID: ELRA-S0461

ISLRN: 382-599-484-763-7

Italian languageaudio data captured by mobile phone , with total duration of 347 hours. It is recorded by 800 Italian native speakers, balanced in gender is balanced; the recording environment is quiet; all texts are manually transcribed with high accuracy. This data set can be applied on automat...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
62633.50 € submit
62633.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
62633.50 € submit
62633.50 € submit

Special offers are also available. Check here for details.

 Italian SpeechDat-Car database    
  • Italian

ID: ELRA-S0144

ISLRN: 513-325-829-468-0

The Italian SpeechDat-Car database contains the recordings of 300 Italian speakers (149 females, 151 males) recorded over the GSM telephone network, in a car. This database is partitioned into 14 DVDs. The speech data files are in two formats. Four of the 5 microphones were recorded on the comput...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
90000.00 € submit
90000.00 € submit
Licence: Commercial Use - ELRA VAR
90000.00 € submit
90000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
120000.00 € submit
120000.00 € submit
Licence: Commercial Use - ELRA VAR
120000.00 € submit
120000.00 € submit
 Italian Speech Recognition Corpus (Desktop)    
  • Italian

ID: ELRA-S0228-80

ISLRN: 789-295-563-911-2

This corpus comprises 49,994 entries uttered by 50 speakers (23 males and 27 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit 48kHz for a total of 24.21hours of speech per channel.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit

« Previous | Next »