Catalog Reference : ELRA-S0226-01
IDIOLOGOS 1 “Bootstrap” (NEOLOGOS Project)
The IDIOLOGOS 1 “Bootstrap” database was produced within the French national project NEOLOGOS, as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The databases produced in the framework of the NEOLOGOS project are designed for the development and the assessment of French speech or speaker recognizers and speech synthesizers. They consist of:
1) the IDIOLOGOS databases, made of adult voices and available in 2 subsets:
- the “Bootstrap” database (catalogue ref. ELRA-S0226-01),
- the “Eigenspeakers” database (catalogue ref. ELRA-S0226-02);
2) the PAIDIALOGOS database (catalogue ref. ELRA-S0227), made of children’s and teenagers’ voices.
The IDIOLOGOS 1 “Bootstrap” database contains the recordings of 1000 adult French speakers (470 males and 530 females) recorded over the French fixed telephone network. The speakers uttered 45 phonetically rich sentences. The 45 sentences are the same for all speakers.
This database is distributed on 1 DVD-ROM. The speech files are stored as uncompressed 8-bit, 8 kHz A-law samples, in accordance with the NEOLOGOS specifications. Each prompted utterance is stored in a separate file and has an accompanying ASCII SAM label file.
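As an illustration of the storage format described above, the following is a minimal sketch of how 8-bit A-law samples could be expanded to linear PCM. It uses the standard G.711 A-law expansion and is not taken from the NEOLOGOS specifications. The function names are hypothetical, and the sketch assumes the speech files are headerless (one byte per sample), which the description does not state explicitly.

```python
# Sketch only: standard G.711 A-law expansion, not code from the NEOLOGOS project.
# Assumption: speech files are raw, headerless streams of 8-bit A-law codes.

SAMPLE_RATE_HZ = 8000  # 8 kHz, as stated in the database description


def alaw_byte_to_pcm16(code):
    """Expand one 8-bit A-law code to a signed linear sample (16-bit range)."""
    code ^= 0x55                      # undo the even-bit inversion used by G.711
    magnitude = (code & 0x0F) << 4    # 4-bit mantissa
    segment = (code & 0x70) >> 4      # 3-bit segment (exponent)
    if segment == 0:
        magnitude += 8
    else:
        magnitude += 0x108
        magnitude <<= segment - 1     # no shift for segment 1
    # In A-law, a set sign bit marks a positive sample.
    return magnitude if code & 0x80 else -magnitude


def decode_alaw(data):
    """Decode a bytes object of A-law codes into a list of linear samples."""
    return [alaw_byte_to_pcm16(b) for b in data]


def duration_seconds(data):
    """Duration of a headerless A-law recording: one byte per sample at 8 kHz."""
    return len(data) / SAMPLE_RATE_HZ
```

Under the same assumption, the bytes of one speech file read in binary mode could be passed directly to `decode_alaw` and `duration_seconds`.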
This speech database was validated by SPEX (the Netherlands) to assess its compliance with the NEOLOGOS format and content specifications.
Each speaker uttered the following items:
- 1 digit sequence (5+ digits)
- 1 telephone number (10 digits)
- 1 credit card number (16 digits)
- 1 spelling of directory assistance city name
- 1 real/artificial for coverage
- 45 phonetically rich sentences
The following age distribution has been obtained: 288 speakers are between 18 and 30, 264 speakers are between 31 and 45, 247 speakers are between 46 and 61, and 201 speakers are over 61.
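The speaker counts quoted in this description can be cross-checked against the stated total of 1000 speakers. The short sketch below does this and derives percentage shares; the variable and function names are illustrative only.

```python
# Counts copied from the catalogue description; names are illustrative.
TOTAL_SPEAKERS = 1000
speakers_by_gender = {"male": 470, "female": 530}
speakers_by_age = {"18-30": 288, "31-45": 264, "46-61": 247, "over 61": 201}


def percentage_shares(counts):
    """Return each group's share of the group total, in percent."""
    total = sum(counts.values())
    return {group: 100 * n / total for group, n in counts.items()}


# Both breakdowns should account for every speaker.
gender_total = sum(speakers_by_gender.values())
age_total = sum(speakers_by_age.values())

for group, share in percentage_shares(speakers_by_age).items():
    print(f"{group}: {share:.1f}%")
```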
A pronunciation lexicon with a phonemic transcription in SAMPA is also included.
Production
Project :
NEOLOGOS
Applications
Applications existing :
Speech recognition, Speech synthesis
Technical Information
Distribution medium :
DVD
Contents
speech corpus
Language(s) :
French
Quantisation :
8-bit
Signal encoding :
A-law
Sampling rate :
8 kHz
Source Channel :
Telephone
Annotation level :
Orthographic
Resource files
Validation report
Members Prices
Academic - Commercial 10000.00 EUR
Academic - Research 1000.00 EUR
Commercial - Commercial 10000.00 EUR
Commercial - Research 10000.00 EUR
Non Member Prices
Academic - Commercial 16000.00 EUR
Academic - Research 1000.00 EUR
Commercial - Commercial 16000.00 EUR
Commercial - Research 16000.00 EUR
Special Prices
Special prices available for linguistic or humanities studies upon request.
Bundle(s)
Resource available in the bundle(s) referenced below.
ELRA-B0007
Sunday 19 May, 2013
Copyright © 2008
ELRA