Catalog Reference : ELRA-S0250
TC-STAR English-Spanish Training Corpora for Machine Translation: Aligned Final Text Editions of EPPS
TC-STAR is a European integrated project focusing on all core technologies for Speech-to-Speech Translation (SST): Automatic Speech Recognition (ASR), Spoken Language Translation (SLT), and Text to Speech Synthesis (TTS).
This corpus consists of 34 million (English) and 38 million (Spanish) running words of sentence-segmented and aligned bilingual text in English and Spanish, obtained from the Final Text Editions provided by the European Parliament and covering April 1996 to September 2004, December 2004 to May 2005, and December 2005 to May 2006. The data is accompanied by tools for further preprocessing.
Distribution medium :
Member Prices
Academic - Commercial 4250.00 EUR
Academic - Research 3000.00 EUR
Commercial - Commercial 4250.00 EUR
Commercial - Research 4250.00 EUR
Non Member Prices
Academic - Commercial 5600.00 EUR
Academic - Research 3925.00 EUR
Commercial - Commercial 5600.00 EUR
Commercial - Research 5600.00 EUR
Copyright © 2008