Home Catalogue
Language Resources
Bug reports
Send us your bug reports.
Search Catalogue
Use keywords to find the product you are looking for.
Advanced Search
Anglais Français
  • Purchase procedure & Conditions

  • Pricing & user licences

  • How to promote your resources ?

  • Contact Us
  • Catalog Reference : ELRA-S0278
    SmartWeb Handheld Corpus (SHC)
    The SMARTWEB UMTS data collection was created within the publicly funded German SmartWeb project in the years 2004-2006. It comprises a collection of user queries to a naturally spoken Web interface with the main focus on the soccer world series in 2006. The recordings include field recordings using a hand-held UMTS device (one person, SmartWeb Handheld Corpus SHC, ref. ELRA-S0278), field recordings with video capture of the primary speaker and a secondary speaker (SmartWeb Video Corpus SVC, ref. ELRA-S0279), as well as mobile recordings performed on a BMW motorbike (one speaker, SmartWeb Motorbike Corpus SMC, ref. ELRA-S0280).

    This corpus corresponds to the hand-held UMTS device (SmartWeb Handheld Corpus) and contains recordings spoken by 156 speakers in a human-machine query situation. Users were asked to solve several tasks with a spoken query system to the WWW using a smart phone as portable device in natural environments (office, hall, restaurant, street). Recorded channels are the Bluetooth headset over UMTS (telephone quality), the Bluetooth headset and an additional collar microphone in high quality.

    The corpus contains:
    - Total number of recorded queries: 10,966
    - Total duration segmented speech: 1835 minutes
    - Formats: WAV 44,1kHz, 16 bit, ALAW 8kHz 8bit, Verbmobil transliteration, BAS Partitur Format (BPF)
    - Segmentation: automatic segmentation into queries by the recording server
    - Distribution: 15 DVD-R

    See also ELRA-S0279 and ELRA-S0280.

    ISLRN : 335-792-173-200-7
    Project : SmartWeb Creation date : 2004-2006
    Applications existing : Spoken dialogue systems
    Technical Information
    Distribution medium : Downloadable
    Fileformat : wav
    Contents Click on the arrow to display content.
     speech corpus 
    Members Prices
    Academic - Commercial 2912.50 EUR
    Academic - Research 1912.50 EUR
    Commercial - Commercial 2912.50 EUR
    Commercial - Research 2912.50 EUR
    Non Member Prices
    Academic - Commercial 4825.00 EUR
    Academic - Research 3825.00 EUR
    Commercial - Commercial 4825.00 EUR
    Commercial - Research 4825.00 EUR

    Copyright © 2008 ELRA
    ELRACatalogue 0.8.0