Home Catalogue
Language Resources
Bug reports
Send us your bug reports.
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Anglais Français
  • Purchase procedure & Conditions

  • Pricing & user licences

  • How to promote your resources ?

  • Contact Us
  • Catalog Reference : ELRA-S0226-02
    IDIOLOGOS 2 “Eingenspeakers” (NEOLOGOS Project)
    The IDIOLOGOS 2 “Eingenspeakers” database was produced within the French national project NEOLOGOS, as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The databases produced in the framework of the NEOLOGOS project are designed for the development and the assessment of French speech or speaker recognizers and speech synthesizers. They consist in:
    1) the IDIOLOGOS databases are made of adults voices and are available in 2 subsets:
    - the “Bootstrap” database (catalogue ref. ELRA-S0226-01),
    - the “Eingenspeakers” database (catalogue ref. ELRA-S0226-02)
    2) the PAIDIALOGOS database (catalogue ref. ELRA-S0227) is made of children’s and teenagers’ voices.

    The IDIOLOGOS 2 “Eingenspeakers” database contains the recordings of 200 adult French speakers (97 males and 103 females) recorded over the French fixed telephone network. The speakers uttered 45 sentences per call with 10 calls per speaker. The 450 sentences per speaker are common to all speakers. Speakers were selected from the IDIOLOGOS 1 “Bootstrap” (ELRA-S0226-01) database.

    This database is distributed as 1 DVD-ROM. The speech files are stored as sequences of 8-bit, 8kHz A-law speech files and are not compressed, according to the specifications of NEOLOGOS. Each prompt utterance is stored within a separate file and has an accompanying ASCII SAM label file.

    This speech database was validated by SPEX (the Netherlands) to assess its compliance with the NEOLOGOS format and content specifications.

    Each speaker uttered the following items:
    - 1 digit sequence (6 digits)
    - 1 telephone number (10 digits)
    - 1 credit card number (16 digits)
    - 1 spelling of directory assistance city name
    - 1 real/artificial for coverage
    - 45 phonetically rich sentences

    The following age distribution has been obtained: 42 speakers are between 18 and 30, 50 speakers are between 31 and 45, 62 speakers are between 46 and 61, and 46 speakers are over 61.

    A pronunciation lexicon with a phonemic transcription in SAMPA is also included.

    ISLRN : 377-605-098-134-9
    Project : NEOLOGOS
    Applications existing : Speech recognition#Speech synthesis
    Technical Information
    Distribution medium : Downloadable
    Contents Click on the arrow to display content.
     speech corpus 
    Resource files
  • ICON_FILE_DOWNLOAD Validation report
    Members Prices
    Academic - Commercial 15000.00 EUR
    Academic - Research 1000.00 EUR
    Commercial - Commercial 15000.00 EUR
    Commercial - Research 15000.00 EUR
    Non Member Prices
    Academic - Commercial 24000.00 EUR
    Academic - Research 1000.00 EUR
    Commercial - Commercial 24000.00 EUR
    Commercial - Research 24000.00 EUR

    Special Prices

    Special prices available for linguistic or humanities studies upon request.

    Resource available in the bundle(s) referenced below.

    Copyright © 2008 ELRA
    ELRACatalogue 0.8.0