Catalog Reference : ELRA-S0251
TC-STAR English Training Corpora for ASR: Recordings of EPPS Speech
TC-STAR is a European integrated project focusing on all core technologies for Speech-to-Speech Translation (SST): Automatic Speech Recognition (ASR), Spoken Language Translation (SLT), and Text to Speech Synthesis (TTS).
This corpus consists of around 290 hours of recordings of EPPS (European Parliament Plenary Sessions) speeches held or interpreted in European English (a mixture of native and non-native English). Of these, 92 hours were annotated (transcribed); the transcriptions are not included in the present package. The recordings were obtained from Europe by Satellite (EbS) from May 2004 until May 2006.
The speech signals were delivered by EbS via the internet in Real Media format and via satellite in MPEG-1 Layer 2 format. The signals were decoded, resampled, and stored in WAVE RIFF (Resource Interchange File Format). Each file contains a single channel with 16-bit resolution at a sample rate of 16 kHz.
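As a quick sanity check on the stated audio format, the following minimal sketch uses Python's standard `wave` module to test whether a file is mono, 16-bit, 16 kHz. The function names `matches_stated_format` and `make_silent_wav` are hypothetical helpers introduced here for illustration, and the sketch assumes the files are plain PCM WAVE, which the catalogue entry does not state explicitly.

```python
import io
import wave

# Format stated in the catalogue entry: single channel, 16-bit, 16 kHz.
EXPECTED_CHANNELS = 1
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bits per sample
EXPECTED_SAMPLE_RATE_HZ = 16000


def matches_stated_format(source):
    """Return True if the WAV at `source` (a path or a binary file object)
    is mono, 16-bit, 16 kHz. Hypothetical helper, not part of the corpus."""
    with wave.open(source, "rb") as wav:
        return (
            wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getframerate() == EXPECTED_SAMPLE_RATE_HZ
        )


def make_silent_wav(channels, sample_width, sample_rate, n_frames=160):
    """Build an in-memory WAV of silence, used only as stand-in test data."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width)
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00" * (channels * sample_width * n_frames))
    buffer.seek(0)  # rewind so the buffer can be read back
    return buffer


if __name__ == "__main__":
    # Synthetic data stands in for a corpus file; pass a real path instead.
    print(matches_stated_format(make_silent_wav(1, 2, 16000)))
    print(matches_stated_format(make_silent_wav(2, 2, 44100)))
```

In practice one would call `matches_stated_format` on each file path in the delivered package and flag any file for which it returns `False`.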
The speech databases created within the TC-STAR project were validated by SPEX, in the Netherlands, to assess their compliance with the TC-STAR format and content specifications.
For corresponding transcriptions, see ELRA-S0249.
Applications existing :
Automatic speech recognition
Distribution medium :
Source Channel :
Member Prices
Academic - Commercial 600.00 EUR
Academic - Research 400.00 EUR
Commercial - Commercial 600.00 EUR
Commercial - Research 600.00 EUR
Non Member Prices
Academic - Commercial 800.00 EUR
Academic - Research 520.00 EUR
Commercial - Commercial 800.00 EUR
Commercial - Research 800.00 EUR
Resource available in the bundle(s) referenced below.