ELRA ELRA
  Home Catalogue » Multimodal/Multimedia Resources
Language Resources
Bug reports
Send us your bug reports.
Search Catalogue
 
Use keywords to find the product you are looking for.
Advanced Search
Languages
Anglais Français
Informations
  • Purchase procedure & Conditions

  • Pricing & user licences

  • How to promote your resources ?

  • Contact Us
  • Multimodal/Multimedia Resources
    Displaying 1 to 17 (of 17 products) Result Pages:  1 

    B0012
    CHIL 2004 Evaluation Package (Available since 22/11/2006)
    CHIL 2005 Evaluation Package (Available since 22/11/2006)
    CHIL 2006 Evaluation Package (Available since 14/11/2008)


    E0009 : The CHIL Seminars are scientific presentations given by students, faculty members or invited speakers in the field of multimodal interfaces and speech processing. The language is European English spoken by non native speakers. The recordings comprise the following: videos of the speaker and the audience from 4 fixed cameras, frontal close ups of the speaker, close talking and far-field microphone data of the speaker’s voice and background sounds.

    The database consists of:
    1) Audio and Video Recordings of 10 seminars
    2) Video annotations done displaying 1 over 10 pictures in sequence, for the 4 cameras.
    3) Transcriptions using both TRS and STMUID formats.
    Language(s) : English
    E0010 : The CHIL Seminars are scientific presentations given by students, faculty members or invited speakers in the field of multimodal interfaces and speech processing. The language is European English spoken by non native speakers. The recordings comprise the following: videos of the speaker and the audience from 4 fixed cameras, frontal close ups of the speaker, close talking and far-field microphone data of the speaker’s voice and background sounds.

    The database consists of:
    1) Contents of the CHIL 2004 Evaluation Package (see catalogue reference ELRA-E0009 for description).
    2) Audio and Video Recordings: 5 seminars recorded in November 2004).
    3) Stereo Video Recordings of 10 subjects that move in the camera’s field of view while performing pointing gestures.
    2) Video annotations.
    3) Transcriptions.
    Language(s) : English
    E0017 : The CHIL Seminars are scientific presentations given by students, faculty members or invited speakers in the field of multimodal interfaces and speech processing. The language is European English spoken by non native speakers. The recordings comprise the following: videos of the speaker and the audience from 4 fixed cameras, frontal close ups of the speaker, close talking and far-field microphone data of the speaker’s voice and background sounds.

    The CHIL 2006 Evaluation Package consists of:
    1) A set of audiovisual recordings of seminars, called non-interactive seminars and of highly-interactive small working groups’ seminars, called interactive seminars. The recordings were done between 2004 and 2005 according to the “CHIL Room Setup” specification.
    2) Video annotations.
    3) Orthographic transcriptions.
    Language(s) : English

    Special prices for a combined purchase of CHIL 2004 (ELRA-E0009), CHIL 2005 (ELRA-E0010) and CHIL 2006 (ELRA-E0017)

    Membres Academic org. Commercial org.
    Research Use 1700.00 EUR 7000.00 EUR
    Commercial Use 7000.00 EUR 7000.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 2050.00 EUR 9000.00 EUR
    Commercial Use 9000.00 EUR 9000.00 EUR


    B0012
    CHIL 2004 Evaluation Package (Available since 22/11/2006)
    CHIL 2005 Evaluation Package (Available since 22/11/2006)
    CHIL 2006 Evaluation Package (Available since 14/11/2008)


    E0009 : The CHIL Seminars are scientific presentations given by students, faculty members or invited speakers in the field of multimodal interfaces and speech processing. The language is European English spoken by non native speakers. The recordings comprise the following: videos of the speaker and the audience from 4 fixed cameras, frontal close ups of the speaker, close talking and far-field microphone data of the speaker’s voice and background sounds.

    The database consists of:
    1) Audio and Video Recordings of 10 seminars
    2) Video annotations done displaying 1 over 10 pictures in sequence, for the 4 cameras.
    3) Transcriptions using both TRS and STMUID formats.
    Language(s) : English
    E0010 : The CHIL Seminars are scientific presentations given by students, faculty members or invited speakers in the field of multimodal interfaces and speech processing. The language is European English spoken by non native speakers. The recordings comprise the following: videos of the speaker and the audience from 4 fixed cameras, frontal close ups of the speaker, close talking and far-field microphone data of the speaker’s voice and background sounds.

    The database consists of:
    1) Contents of the CHIL 2004 Evaluation Package (see catalogue reference ELRA-E0009 for description).
    2) Audio and Video Recordings: 5 seminars recorded in November 2004).
    3) Stereo Video Recordings of 10 subjects that move in the camera’s field of view while performing pointing gestures.
    2) Video annotations.
    3) Transcriptions.
    Language(s) : English
    E0017 : The CHIL Seminars are scientific presentations given by students, faculty members or invited speakers in the field of multimodal interfaces and speech processing. The language is European English spoken by non native speakers. The recordings comprise the following: videos of the speaker and the audience from 4 fixed cameras, frontal close ups of the speaker, close talking and far-field microphone data of the speaker’s voice and background sounds.

    The CHIL 2006 Evaluation Package consists of:
    1) A set of audiovisual recordings of seminars, called non-interactive seminars and of highly-interactive small working groups’ seminars, called interactive seminars. The recordings were done between 2004 and 2005 according to the “CHIL Room Setup” specification.
    2) Video annotations.
    3) Orthographic transcriptions.
    Language(s) : English

    Special prices for a combined purchase of CHIL 2004 (ELRA-E0009), CHIL 2005 (ELRA-E0010) and CHIL 2006 (ELRA-E0017)

    Membres Academic org. Commercial org.
    Research Use 1700.00 EUR 7000.00 EUR
    Commercial Use 7000.00 EUR 7000.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 2050.00 EUR 9000.00 EUR
    Commercial Use 9000.00 EUR 9000.00 EUR


    B0012
    CHIL 2004 Evaluation Package (Available since 22/11/2006)
    CHIL 2005 Evaluation Package (Available since 22/11/2006)
    CHIL 2006 Evaluation Package (Available since 14/11/2008)


    E0009 : The CHIL Seminars are scientific presentations given by students, faculty members or invited speakers in the field of multimodal interfaces and speech processing. The language is European English spoken by non native speakers. The recordings comprise the following: videos of the speaker and the audience from 4 fixed cameras, frontal close ups of the speaker, close talking and far-field microphone data of the speaker’s voice and background sounds.

    The database consists of:
    1) Audio and Video Recordings of 10 seminars
    2) Video annotations done displaying 1 over 10 pictures in sequence, for the 4 cameras.
    3) Transcriptions using both TRS and STMUID formats.
    Language(s) : English
    E0010 : The CHIL Seminars are scientific presentations given by students, faculty members or invited speakers in the field of multimodal interfaces and speech processing. The language is European English spoken by non native speakers. The recordings comprise the following: videos of the speaker and the audience from 4 fixed cameras, frontal close ups of the speaker, close talking and far-field microphone data of the speaker’s voice and background sounds.

    The database consists of:
    1) Contents of the CHIL 2004 Evaluation Package (see catalogue reference ELRA-E0009 for description).
    2) Audio and Video Recordings: 5 seminars recorded in November 2004).
    3) Stereo Video Recordings of 10 subjects that move in the camera’s field of view while performing pointing gestures.
    2) Video annotations.
    3) Transcriptions.
    Language(s) : English
    E0017 : The CHIL Seminars are scientific presentations given by students, faculty members or invited speakers in the field of multimodal interfaces and speech processing. The language is European English spoken by non native speakers. The recordings comprise the following: videos of the speaker and the audience from 4 fixed cameras, frontal close ups of the speaker, close talking and far-field microphone data of the speaker’s voice and background sounds.

    The CHIL 2006 Evaluation Package consists of:
    1) A set of audiovisual recordings of seminars, called non-interactive seminars and of highly-interactive small working groups’ seminars, called interactive seminars. The recordings were done between 2004 and 2005 according to the “CHIL Room Setup” specification.
    2) Video annotations.
    3) Orthographic transcriptions.
    Language(s) : English

    Special prices for a combined purchase of CHIL 2004 (ELRA-E0009), CHIL 2005 (ELRA-E0010) and CHIL 2006 (ELRA-E0017)

    Membres Academic org. Commercial org.
    Research Use 1700.00 EUR 7000.00 EUR
    Commercial Use 7000.00 EUR 7000.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 2050.00 EUR 9000.00 EUR
    Commercial Use 9000.00 EUR 9000.00 EUR


    E0009
    CHIL 2004 Evaluation Package (Available since 22/11/2006)


    The CHIL Seminars are scientific presentations given by students, faculty members or invited speakers in the field of multimodal interfaces and speech processing. The language is European English spoken by non native speakers. The recordings comprise the following: videos of the speaker and the audience from 4 fixed cameras, frontal close ups of the speaker, close talking and far-field microphone data of the speaker’s voice and background sounds.

    The database consists of:
    1) Audio and Video Recordings of 10 seminars
    2) Video annotations done displaying 1 over 10 pictures in sequence, for the 4 cameras.
    3) Transcriptions using both TRS and STMUID formats.
    Language(s) : English

    Membres Academic org. Commercial org.
    Research Use 200.00 EUR 1500.00 EUR
    Commercial Use 1500.00 EUR 1500.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 240.00 EUR 1800.00 EUR
    Commercial Use 1800.00 EUR 1800.00 EUR


    E0010
    CHIL 2005 Evaluation Package (Available since 22/11/2006)


    The CHIL Seminars are scientific presentations given by students, faculty members or invited speakers in the field of multimodal interfaces and speech processing. The language is European English spoken by non native speakers. The recordings comprise the following: videos of the speaker and the audience from 4 fixed cameras, frontal close ups of the speaker, close talking and far-field microphone data of the speaker’s voice and background sounds.

    The database consists of:
    1) Contents of the CHIL 2004 Evaluation Package (see catalogue reference ELRA-E0009 for description).
    2) Audio and Video Recordings: 5 seminars recorded in November 2004).
    3) Stereo Video Recordings of 10 subjects that move in the camera’s field of view while performing pointing gestures.
    2) Video annotations.
    3) Transcriptions.
    Language(s) : English

    Membres Academic org. Commercial org.
    Research Use 700.00 EUR 3000.00 EUR
    Commercial Use 3000.00 EUR 3000.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 850.00 EUR 3600.00 EUR
    Commercial Use 3600.00 EUR 3600.00 EUR


    E0017
    CHIL 2006 Evaluation Package (Available since 14/11/2008)


    The CHIL Seminars are scientific presentations given by students, faculty members or invited speakers in the field of multimodal interfaces and speech processing. The language is European English spoken by non native speakers. The recordings comprise the following: videos of the speaker and the audience from 4 fixed cameras, frontal close ups of the speaker, close talking and far-field microphone data of the speaker’s voice and background sounds.

    The CHIL 2006 Evaluation Package consists of:
    1) A set of audiovisual recordings of seminars, called non-interactive seminars and of highly-interactive small working groups’ seminars, called interactive seminars. The recordings were done between 2004 and 2005 according to the “CHIL Room Setup” specification.
    2) Video annotations.
    3) Orthographic transcriptions.
    Language(s) : English

    Membres Academic org. Commercial org.
    Research Use 1200.00 EUR 4500.00 EUR
    Commercial Use 4500.00 EUR 4500.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 1450.00 EUR 6000.00 EUR
    Commercial Use 6000.00 EUR 6000.00 EUR


    E0033
    CHIL 2007 Evaluation Package (Available since 23/03/2009)


    The CHIL Seminars are scientific presentations given by students, faculty members or invited speakers in the field of multimodal interfaces and speech processing. The language is European English spoken by non native speakers. The recordings comprise the following: videos of the speaker and the audience from 4 fixed cameras, frontal close ups of the speaker, close talking and far-field microphone data of the speaker’s voice and background sounds.

    The CHIL 2007 Evaluation Package consists of:
    1) A set of audiovisual recordings of interactive seminars. The recordings were done between June and September 2006 according to the “CHIL Room Setup” specification.
    2) Video annotations.
    3) Orthographic transcriptions.
    Language(s) : English

    Membres Academic org. Commercial org.
    Research Use 1200.00 EUR 4500.00 EUR
    Commercial Use 4500.00 EUR 4500.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 1450.00 EUR 6000.00 EUR
    Commercial Use 6000.00 EUR 6000.00 EUR


    S0021
    M2VTS Speaker Verification Database (Available since 01/09/1996)


    Multi Modal Verification for Teleservices and Security applications project. Multilingual data base designed to facilitate access control using multimodal identification of human faces (speech & image).
    Language(s) : French

    Membres Academic org. Commercial org.
    Research Use 250.00 EUR 250.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 500.00 EUR 500.00 EUR


    S0136
    SmartKom multimodal corpus – SK-P 2.0 (Available since 04/06/2004)


    This multimodal corpus produced by BAS comprises the recordings of 96 speakers recorded in public places in the technical setup SmartKom Public, which is comparable to a traditional public phone booth, equipped with additional intelligent communication devices. A number of modalities have been recorded, including e.g. the video of the face, the video of the upper body from the left, etc. The corpus is structured into volumes that can be selected and purchased separately, and is available either on DVD or an IDE (hard disk).
    Language(s) : German

    Membres Academic org. Commercial org.
    Research Use 7700.00 EUR 7700.00 EUR
    Commercial Use 7700.00 EUR 7700.00 EUR
    * Distributed on IDE. Other media available: - total on DVD: 19500 € - single volume: 127.82 €

    Non Membres Academic org. Commercial org.
    Research Use 15400.00 EUR 15400.00 EUR
    Commercial Use 15400.00 EUR 15400.00 EUR
    * Distributed on IDE. Other media available: - total on DVD: 39100 € - single volume: 255.65 €


    S0174-05
    FASiL multimodal “fasil-mm” corpus (fasil-mm)(Available since 07/04/2005)


    This corpus was collected within the FASiL project. It contains wizard-of-oz sound and interaction recordings of 90 subjects (30 per language: English, Portuguese and Swedish). See also S0174-01, S0174-02, S0174-03, and S0174-04.
    Language(s) : English (United Kingdom) - Portuguese - Swedish

    Membres Academic org. Commercial org.
    Research Use 9000.00 EUR 30000.00 EUR
    Commercial Use 30000.00 EUR 30000.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 20000.00 EUR 30000.00 EUR
    Commercial Use 30000.00 EUR 30000.00 EUR


    S0280
    SmartWeb Video Corpus (SVC) (Available since 11/07/2008)


    This multimodal corpus contains 99 recordings each containing a human-human-machine dialogue: one speaker (which is being recorded) interacts with a human partner as well with a dialogue system via a smart phone (SmartWeb system).
    See also ELRA-S0278 and ELRA-S0279.
    Language(s) : German

    Membres Academic org. Commercial org.
    Research Use 635.00 EUR 1635.00 EUR
    Commercial Use 1635.00 EUR 1635.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 1275.00 EUR 2275.00 EUR
    Commercial Use 2275.00 EUR 2275.00 EUR


    S0283
    Laboratory Conditions Czech Audio-Visual Speech Corpus (UWB-05-LCAVC)(Available since 05/11/2008)


    This is an audio-visual speech database for training and testing of Czech audio-visual continuous speech recognition systems. The corpus consists of about 25 hours of audio-visual records of 65 speakers in laboratory conditions. Data collection was done with static illumination, and recorded subjects were instructed to remain static. The average speaker age was 22 years old. Speakers were asked to read 200 sentences each (50 common for all speakers and 150 specific to each speaker).
    Language(s) : Czech

    Membres Academic org. Commercial org.
    Research Use 550.00 EUR 550.00 EUR
    Commercial Use 2050.00 EUR 2050.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 1050.00 EUR 1050.00 EUR
    Commercial Use 3050.00 EUR 3050.00 EUR


    S0284
    Czech Audio-Visual Speech Corpus for Recognition with Impaired Conditions (UWB-07-ICAVR I)(Available since 05/11/2008)


    This is an audio-visual speech database for training and testing of Czech audio-visual continuous speech recognition systems collected with impaired illumination conditions. The corpus consists of about 20 hours of audio-visual records of 50 speakers in laboratory conditions. Recorded subjects were instructed to remain static. The illumination varied and chunks of each speaker were recorded with several different conditions, such as full illumination, or illumination from one side (left or right) only. These conditions make the database usable for training lip-/head-tracking systems under various illumination conditions independently of the language. Speakers were asked to read 200 sentences each (50 common for all speakers and 150 specific to each speaker).
    Language(s) : Czech

    Membres Academic org. Commercial org.
    Research Use 650.00 EUR 650.00 EUR
    Commercial Use 3050.00 EUR 3050.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 1250.00 EUR 1250.00 EUR
    Commercial Use 4550.00 EUR 4550.00 EUR


    S0285
    Czech Sign Language Corpus for Recognition – Amateur Signer (UWB-06-SLR-A)(Available since 05/11/2008)


    This is an amateur sign-language database comprising 25 signs from Czech sign language. 15 signers (4 women and 11 men) carried out 5 repetitions of each sign and were recorded from 3 different views. The first is a frontal view of the upper part of the body. The data contain 5685 avi files (one per sign performance), using up 7 GB of disk space, and are stored on DVDs.
    Language(s) :

    Membres Academic org. Commercial org.
    Research Use 125.00 EUR 125.00 EUR
    Commercial Use 275.00 EUR 275.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 175.00 EUR 175.00 EUR
    Commercial Use 425.00 EUR 425.00 EUR


    S0286
    Czech Sign Language Corpus for Recognition – Professional Signer (UWB-07-SLR-P)(Available since 05/11/2008)


    This database comprises 378 signs from Czech sign language as performed by 4 everyday sign-language users (4 women, 2 of them deaf). 5 repetitions of each sign were recorded from 3 different views. The data contain 21000 avi files (one per sign performance), using up 20 GB of disk space, and are stored on DVDs.
    Language(s) :

    Membres Academic org. Commercial org.
    Research Use 225.00 EUR 225.00 EUR
    Commercial Use 525.00 EUR 525.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 325.00 EUR 325.00 EUR
    Commercial Use 825.00 EUR 825.00 EUR


    S0300
    SIGNUM Database (Available since 20/05/2009)


    The SIGNUM Database contains both isolated and continuous utterances of various signers. The corpus was recorded on video. For quick random access to individual frames, each video clip is stored as a sequence of images. The vocabulary comprises 450 basic signs in German Sign Language (DGS) representing different word types. Based on this vocabulary, overall 780 sentences were constructed. Each sentence ranges from two to eleven signs in length. The entire corpus was performed once by 25 native signers of different sexes and ages. One of them was chosen to be the so-called reference signer. His performances were recorded three times.
    Language(s) :

    Membres Academic org. Commercial org.
    Research Use 600.00 EUR 600.00 EUR
    Commercial Use 600.00 EUR 600.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 1000.00 EUR 1000.00 EUR
    Commercial Use 1000.00 EUR 1000.00 EUR


    W0048
    TUNA Corpus (Available since 30/09/2008)


    The TUNA Corpus of Referring Expressions is built with the contributions from 50 native or fluent speakers of English and it contains about 2000 descriptions (referring expressions). Participants described objects (targets) in visual domains by typing and submitting referring expressions that distingued them from other objects that were shown simultaneously (distractors). Each description is annotated with semantic information.
    Language(s) : English

    Membres Academic org. Commercial org.
    Research Use 45.00 EUR 45.00 EUR
    Commercial Use 45.00 EUR 45.00 EUR

    Non Membres Academic org. Commercial org.
    Research Use 45.00 EUR 45.00 EUR
    Commercial Use 45.00 EUR 45.00 EUR


    Displaying 1 to 17 (of 17 products) Result Pages:  1 

    Copyright © 2008 ELRA
    ELRACatalogue 0.8.0