Send us your bug reports.
Use keywords to find the product you are looking for.
Purchase procedure & Conditions
Pricing & user licences
How to promote your resources ?
Catalog Reference : ELRA-S0252
TC-STAR Spanish Training Corpora for ASR: Recordings of EPPS Speech
TC-STAR is a European integrated project focusing on all core technologies for Speech-to-Speech Translation (SST): Automatic Speech Recognition (ASR), Spoken Language Translation (SLT), and Text to Speech Synthesis (TTS).
This corpus consists of the recordings of around 283 hours from EPPS (European Parliament Plenary Sessions) speeches held or interpreted in European Spanish (a mixture of native and non-native Spanish), 62 hours of which were annotated (transcribed) within the project (the transcriptions are not provided in the present package but will be made available soon). These recordings were obtained from Europe by Satellite (
) from May 2004 until May 2006.
The speech signals were submitted by EbS via internet in Real Media format and via satellite in MPEG1-layer2 format. The signals were decoded, resampled and are stored in WAVE RIFF (Resource Interchange File Format). Each file contains a single channel with 16-bit resolution at a sample rate of 16kHz.
Applications existing :
Automatic speech recognition
Distribution medium :
Click on the arrow to display content.
TEXT_DURATION283 hours, 62 hours of which are annotated
Source Channel :
Academic - Commercial 600.00 EUR
Academic - Research 400.00 EUR
Commercial - Commercial 600.00 EUR
Commercial - Research 600.00 EUR
Non Member Prices
Academic - Commercial 800.00 EUR
Academic - Research 520.00 EUR
Commercial - Commercial 800.00 EUR
Commercial - Research 800.00 EUR
Saturday 25 November, 2017
24183116 requests since Monday 27 September, 2004
Copyright © 2008