20 Language Resources

Order by:

 APASCI    
  • Italian

ID: ELRA-S0039

ISLRN: 501-292-014-931-9

APASCI is an Italian speech database recorded in an insulated room with a Sennheiser MKH 416 T microphone. It includes 5,290 phonetically rich sentences and 10,800 isolated digits, for a total of 58,924 word occurrences (2,191 different words) and 641 minutes of speech. The speech material was re...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
800.00 € submit
20000.00 € submit
Licence: Commercial Use - ELRA VAR
20000.00 € submit
20000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1600.00 € submit
25000.00 € submit
Licence: Commercial Use - ELRA VAR
25000.00 € submit
25000.00 € submit
 AUDIO Human Voice Pronunciations - Italian    
  • Italian

ID: ELRA-S0490-10

ISLRN: 204-114-156-037-4

Human voice recordings of single-word lemmas and multiword expressions, besides IPA (International Phonetic Alphabet) and alternative scripts (Japanese – Romaji/Kanji/Hiragana; Chinese – Pinyin; Arabic and Hebrew – w/out diacritics), distributed as distinct sets (from ELRA-S0490-01 to ELRA-S0490-...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
1679.80 € submit
1679.80 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
1763.79 € submit
1763.79 € submit

Special offers are also available. Check here for details.

 CLIPS_MT_MANUAL    
  • Italian

ID: ELRA-S0369

ISLRN: 865-893-448-970-5

CLIPS_MT_MANUAL is a sub-corpus of the original Italian CLIPS corpus (Corpora e Lessici dell'Italiano Parlato e Scritto). This corpus contains 3228 inspected and partially repaired WAV signal files, each containing one dialogue turn (*.wav), 3228 corrected original CLIPS annotation files (*.acs, ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
127.82 € submit
255.64 € submit
Licence: Commercial Use - ELRA VAR
255.64 € submit
255.64 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
255.65 € submit
511.30 € submit
Licence: Commercial Use - ELRA VAR
511.30 € submit
511.30 € submit
 C-ORAL-ROM - Integrated reference corpora for spoken romance languages. Multi-media edition; tools of analysis; standard linguistic measurements for validation in HLT    
  • French
  • Italian
  • Portuguese
  • Spanish; Castilian

ID: ELRA-S0172

ISLRN: 318-977-046-077-4

Description The C-ORAL-ROM resource is a multilingual corpus of spontaneous1 speech for the main romance languages of around 1,200,000 words (IST 2000-26228). The resource comprises three components: a)Multimedia corpus; b)Speech software; c)Appendix. The corpus consists of four comparable recor...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1500.00 € submit
10000.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3000.00 € submit
20000.00 € submit
Licence: Commercial Use - ELRA VAR
20000.00 € submit
20000.00 € submit
 ILE: Italian LExicon      
  • Italian

ID: ELRA-S0059

ISLRN: 052-156-999-928-3

ILE is a 588,000 entries Italian lexicon transcribed with SAMPA notation. It was generated, mainly for speech recognition purposes, by means of a morphological analyzer handling more than 100,000 morphemes, each of them transcribed and manually checked. Each stem was combined with all its possibl...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3000.00 € submit
12000.00 € submit
Licence: Commercial Use - ELRA VAR
12000.00 € submit
12000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6000.00 € submit
18000.00 € submit
Licence: Commercial Use - ELRA VAR
18000.00 € submit
18000.00 € submit
 Italian Kids Speech Recognition Corpus (Desktop)    
  • Italian

ID: ELRA-S0228-98

ISLRN: 501-616-216-038-6

This corpus comprises 19,788 entries uttered by 31 speakers (15 males and 16 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit 44.1kHz for a total of 4.9 hours of speech per channel.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6000.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6000.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
 Italian Speech Corpus 1 (Appen)    
  • Italian

ID: ELRA-S0147

ISLRN: 458-657-455-735-5

The Italian Speech Corpus 1 contains the recordings of 202 native Italian speakers (112 males, 90 females) recorded in an office and a closed public place, over 4 channels, in a range of low to medium background noise environments (Plantronics Audio 10 (computer/desk mic), Shure SM58 (desk mounte...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1200.00 € submit
9500.00 € submit
Licence: Commercial Use - ELRA VAR
9500.00 € submit
9500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1500.00 € submit
15000.00 € submit
Licence: Commercial Use - ELRA VAR
15000.00 € submit
15000.00 € submit
 Italian Speech Data by Mobile Phone - 1,441 Hours    
  • Italian

ID: ELRA-S0450

ISLRN: 217-750-727-467-7

The data were recorded by 3,109 native Italian speakers with authentic Italian accents. The recorded content covers a wide range of categories such as general purpose, interactive, in car commands, home commands, etc. The recorded text is designed by a language expert, and the text is manually pr...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
342237.50 € submit
342237.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
342237.50 € submit
342237.50 € submit

Special offers are also available. Check here for details.

 Italian Speech Data by Mobile Phone_Reading - 215 Hours    
  • Italian

ID: ELRA-S0472

ISLRN: 341-812-724-006-1

Italian speech data (reading) is collected from 325 Italian native speakers and is recorded in quiet environment. The recording is rich in content, covering multiple categories such as econimics, entertainment, news, and oral. Each sentence contains 9.2 words in average. Each sentence is repeated...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
38807.50 € submit
38807.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
38807.50 € submit
38807.50 € submit

Special offers are also available. Check here for details.

 Italian Speech Data Collected by Mobile Phone - 347 Hours    
  • Italian

ID: ELRA-S0461

ISLRN: 382-599-484-763-7

Italian languageaudio data captured by mobile phone , with total duration of 347 hours. It is recorded by 800 Italian native speakers, balanced in gender is balanced; the recording environment is quiet; all texts are manually transcribed with high accuracy. This data set can be applied on automat...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
62633.50 € submit
62633.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
62633.50 € submit
62633.50 € submit

Special offers are also available. Check here for details.

 Italian SpeechDat-Car database    
  • Italian

ID: ELRA-S0144

ISLRN: 513-325-829-468-0

The Italian SpeechDat-Car database contains the recordings of 300 Italian speakers (149 females, 151 males) recorded over the GSM telephone network, in a car. This database is partitioned into 14 DVDs. The speech data files are in two formats. Four of the 5 microphones were recorded on the comput...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
90000.00 € submit
90000.00 € submit
Licence: Commercial Use - ELRA VAR
90000.00 € submit
90000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
120000.00 € submit
120000.00 € submit
Licence: Commercial Use - ELRA VAR
120000.00 € submit
120000.00 € submit
 Italian Speech Recognition Corpus (Desktop)    
  • Italian

ID: ELRA-S0228-80

ISLRN: 789-295-563-911-2

This corpus comprises 49,994 entries uttered by 50 speakers (23 males and 27 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit 48kHz for a total of 24.21hours of speech per channel.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6000.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6000.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
 Italian Speecon database    
  • Italian

ID: ELRA-S0213

ISLRN: 239-555-046-548-2

The Italian Speecon database is divided into 2 sets: 1) The first set comprises the recordings of 550 adult Italian speakers (273 males, 277 females), recorded over 4 microphone channels in 4 recording environments (office, entertainment, car, public place). 2) The second set comprises the reco...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
50000.00 € submit
67000.00 € submit
Licence: Commercial Use - ELRA VAR
67000.00 € submit
67000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
60000.00 € submit
75000.00 € submit
Licence: Commercial Use - ELRA VAR
75000.00 € submit
75000.00 € submit
 Italian TTS Speech Corpus (Appen)    
  • Italian

ID: ELRA-S0148

ISLRN: 976-246-706-503-6

The Italian TTS Speech Corpus contains the recordings of 1 native Italian speaker (male, 50 years old) recorded in a studio over 1 channel (Shure SM15 unidirectional professional head-word condenser microphone). The data collection and transcription were performed by Appen (Australia). Speech sam...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
9000.00 € submit
Licence: Commercial Use - ELRA VAR
9000.00 € submit
9000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3500.00 € submit
11000.00 € submit
Licence: Commercial Use - ELRA VAR
11000.00 € submit
11000.00 € submit
 LC-STAR English-Italian Bilingual Aligned Phrasal lexicon      
  • English
  • Italian

ID: ELRA-S0271

ISLRN: 678-773-434-361-5

The LC-STAR English-Italian Bilingual Aligned Phrasal lexicon was created within the scope of the LC-STAR project (IST 2001-32216) which was sponsored by the European Commission. It was designed for SST (Speech-to-Speech Translation). The lexicon comprises 10,466 phrases from the tourist domain....

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3750.00 € submit
5500.00 € submit
Licence: Commercial Use - ELRA VAR
5500.00 € submit
5500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
4875.00 € submit
7150.00 € submit
Licence: Commercial Use - ELRA VAR
7150.00 € submit
7150.00 € submit
 LC-STAR Italian Phonetic lexicon      
  • Italian

ID: ELRA-S0270

ISLRN: 593-069-994-396-4

The LC-STAR Italian Phonetic lexicon was created within the scope of the LC-STAR project (IST 2001-32216) which was sponsored by the European Commission. The lexicon comprises 109,712 entries, distributed over three categories: - a set of 56,420 common word entries. This set is extracted from a...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
14250.00 € submit
22000.00 € submit
Licence: Commercial Use - ELRA VAR
22000.00 € submit
22000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
21500.00 € submit
29250.00 € submit
Licence: Commercial Use - ELRA VAR
29250.00 € submit
29250.00 € submit
 MULTEXT Prosodic database    
  • English
  • French
  • German
  • Italian
  • Spanish; Castilian

ID: ELRA-S0060

ISLRN: 098-719-242-965-4

This database comprises one CD-ROM for each five languages (French, English, Italian, German and Spanish), totalling 4 hours and 20 minutes of speech and involving 50 different speakers (5 male and 5 female per language). The recordings on which the corpus is based consist of passages of about f...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
45.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
100.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
 PortMedia French and Italian corpus    
  • French
  • Italian

ID: ELRA-S0371

ISLRN: 135-793-959-390-8

The PortMedia French and Italian corpus was produced by ELDA, with the same paradigm and specifications as the MEDIA speech database (ELRA-S0272) but on a different domain. The method chosen for the corpus construction process is that of a ‘Wizard of Oz’ (WoZ) system. This consists of simulati...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
20000.00 € submit
Licence: Evaluation Use - ELRA EVALUATION
0.00 € submit
1000.00 € submit
Licence: Commercial Use - ELRA VAR
20000.00 € submit
20000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
25000.00 € submit
Licence: Evaluation Use - ELRA EVALUATION
0.00 € submit
6500.00 € submit
Licence: Commercial Use - ELRA VAR
25000.00 € submit
25000.00 € submit

This resource is also available in a bundle. Check here for bundled pricing.

 SPK    
  • Italian

ID: ELRA-S0049

ISLRN: 604-522-044-889-5

SPK is an Italian speech database of isolated and connected digits. It was designed and collected at the Istituto per la Ricerca Scientifica e Tecnologica (ITC/IRST), Trento, Italy. SPK was conceived for speaker recognition and verification purposes.With this CD-ROM, speech material corresponding...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
400.00 € submit
800.00 € submit
Licence: Commercial Use - ELRA VAR
800.00 € submit
800.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
800.00 € submit
1600.00 € submit
Licence: Commercial Use - ELRA VAR
1600.00 € submit
1600.00 € submit
 The "SIVA" Speech Database for Speaker Verification and Identification    
  • Italian

ID: ELRA-S0028

ISLRN: 405-085-829-227-5

The Italian speech database SIVA (?Speaker Identification and Verification Archives: SIVA?), is a database comprising more than two thousands calls, collected over the public switched telephone network, and available very soon via ELRA. The SIVA database consists of four speaker categories: male...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1000.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3000.00 € submit
4500.00 € submit
Licence: Commercial Use - ELRA VAR
4500.00 € submit
4500.00 € submit