1681 Language Resources (Page 51 of 85)


 Pashto Conversational Speech Recognition Corpus (Telephone)    
  • Pushto; Pashto

ID: ELRA-S0228-59

ISLRN: 457-089-583-656-6

The corpus contains 26 paired conversations of spontaneous Afghanistan Southern Pashto speech from 52 speakers (27 males and 25 females). For this collection, the 2 speakers of each group were recorded in separate quiet rooms. The database covers 21 topics. The audio...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 16500.00 € | 16500.00 €
Licence: Commercial Use - ELRA VAR | 16500.00 € | 16500.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 16500.00 € | 16500.00 €
Licence: Commercial Use - ELRA VAR | 16500.00 € | 16500.00 €
 Pashto phonetic lexicon      
  • Pushto; Pashto

ID: ELRA-S0392

ISLRN: 186-827-325-462-6

This is a phonetic lexicon of 21,560 tokens in Pashto with their phonetic transcription in IPA. It covers the major dialect of the TRAD Pashto Broadcast News Speech Corpus (see ELRA Catalogue reference ELRA-S0381) from which the most frequent words were extracted. The pronunciation dictionary of ...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 600.00 € | 5000.00 €
Licence: Commercial Use - ELRA VAR | 5000.00 € | 5000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 700.00 € | 6000.00 €
Licence: Commercial Use - ELRA VAR | 6000.00 € | 6000.00 €
 Pedology database    
  • English
  • French

ID: ELRA-T0364

ISLRN: 506-710-931-377-3

453 terms; 358 definitions in French; 143 definitions in English. French-English pedology terminology, extracted from an INRA/CILF document and other sources (TERMIUM, Concise Oxford Dictionary of Earth Sciences, etc.). Records compiled using index file (from Mme BOUROCHE, INRA, corrections delivere...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 585.00 € | 819.00 €
Licence: Commercial Use - ELRA VAR | 819.00 € | 819.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 819.00 € | 1170.00 €
Licence: Commercial Use - ELRA VAR | 1170.00 € | 1170.00 €
 Persian 1984 corpus (Multext-East framework)    
  • Persian

ID: ELRA-W0054

ISLRN: 851-240-629-673-1

This corpus contains the Persian (Farsi) translation of a part of the novel “1984” (G. Orwell) annotated in the Multext-East framework (Multilingual Text Tools and Corpora for Eastern and Central European Languages). The aim of the Multext-East project was to develop standardized language resourc...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 45.00 € | 2000.00 €
Licence: Commercial Use - ELRA VAR | 2000.00 € | 2000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 100.00 € | 5000.00 €
Licence: Commercial Use - ELRA VAR | 5000.00 € | 5000.00 €
 Persian Audio Dictionary      
  • Persian

ID: ELRA-S0401

ISLRN: 133-181-128-420-9

This dictionary consists of more than 50,000 entries (along with almost all wordforms and proper names) with corresponding audio files in MP3 and English transliterations. The words have been recorded with standard Persian (Farsi) pronunciation (all by a single speaker). This dictionary is provid...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 900.00 € | 900.00 €
Licence: Commercial Use - ELRA VAR | 4500.00 € | 4500.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1100.00 € | 1100.00 €
Licence: Commercial Use - ELRA VAR | 5400.00 € | 5400.00 €
 Persian Ezafe Construction Dataset    
  • Persian

ID: ELRA-W0315

ISLRN: 663-014-610-121-2

The Persian Ezafe Construction Dataset includes gold Ezafe tags in almost 30 thousand Persian sentences. The sentences were manually annotated by six annotators who were all native Persian speakers and linguists. The inter-annotator agreement of a small portion of the data (one thousand sentence...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 500.00 € | 2500.00 €
Licence: Commercial Use - ELRA VAR | 2500.00 € | 2500.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 750.00 € | 3750.00 €
Licence: Commercial Use - ELRA VAR | 3750.00 € | 3750.00 €
 Persian Kids’ Speech Corpus    
  • Persian

ID: ELRA-S0487

ISLRN: 822-550-731-416-7

The Persian Kids’ Speech Corpus consists of speech signals recorded by 286 children (141 girls, 145 boys), from 6 to 9 years old, through an Andreas Mic Anti-Noise microphone and a Premium Speechmike headphone. The CoolEdit Pro2.1 software was utilized to record the speech at 16 kHz, single-chann...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 50.00 € | 1000.00 €
Licence: Commercial Use - ELRA VAR | 1000.00 € | 1000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 100.00 € | 1500.00 €
Licence: Commercial Use - ELRA VAR | 1500.00 € | 1500.00 €
 Persian Lexicon    
  • Persian

ID: ELRA-L0087

ISLRN: 547-614-436-004-7

This is a Persian (Farsi) lexicon of more than 40,000 entries of non-inflected forms of words. Each word is transliterated based on the proposed framework from MBROLA (Text-To-Speech synthesizer). The database includes a large variety of descriptors for each entry (plural, homograph, ...). This...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 500.00 € | 5000.00 €
Licence: Commercial Use - ELRA VAR | 5000.00 € | 5000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 700.00 € | 7000.00 €
Licence: Commercial Use - ELRA VAR | 7000.00 € | 7000.00 €
 Persian Multext-East framework lexicon    
  • Persian

ID: ELRA-L0086

ISLRN: 884-966-712-343-0

This is a Persian (Farsi) morphosyntactic lexicon derived from the Persian 1984 corpus (Multext-East framework) (see ELRA-W0054). It contains the full inflectional paradigms of a superset of lemmas that appear in the Persian 1984 corpus. Each entry gives the word-form, its lemma and morphosyntact...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 45.00 € | 500.00 €
Licence: Commercial Use - ELRA VAR | 500.00 € | 500.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 45.00 € | 2000.00 €
Licence: Commercial Use - ELRA VAR | 2000.00 € | 2000.00 €
 Persian Speech Corpus    
  • Persian

ID: ELRA-S0393

ISLRN: 068-845-898-304-0

This single-speaker speech corpus of about 2.5 hours was developed using the same methodologies as the PhD work carried out by Nawar Halabi at the University of Southampton. The corpus was recorded in Persian (Tehrani accent) by one male speaker in a professional studio, through a "Blubb...

MEMBER | academic | commercial
Licence: Attribution, Non Commercial Use, Share Alike - CC-BY-NC-SA | 0.00 € | 0.00 €
Licence: Commercial Use - ELRA VAR | 4000.00 € | 4000.00 €
NON MEMBER | academic | commercial
Licence: Attribution, Non Commercial Use, Share Alike - CC-BY-NC-SA | 0.00 € | 0.00 €
Licence: Commercial Use - ELRA VAR | 5000.00 € | 5000.00 €
 Persian Speech Corpus    
  • Persian

ID: ELRA-S0415

ISLRN: 058-406-130-314-1

This dataset contains more than 31 hours and 30 minutes of Persian scripted monologue and dialogue data, recorded from 89 Persian speakers (39 males and 50 females) between 17-80 years old in Iran (Tehrani dialect). Recordings were made between April and January 2022. Data consists of read and sp...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1000.00 € | 4000.00 €
Licence: Commercial Use - ELRA VAR | 4000.00 € | 4000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 2000.00 € | 6000.00 €
Licence: Commercial Use - ELRA VAR | 6000.00 € | 6000.00 €
 PHONDAT 1 - PD1 (2nd edition)    
  • German

ID: ELRA-S0023

ISLRN: 776-688-402-560-8

The corpus contains read speech of 201 different speakers. Each speaker has read a subcorpus of 450 different sentences (including alphanumericals and two short passages of prose text); 8 speakers have read the whole sentence corpus. The speakers were recorded at four different sites in Germany (...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 511.29 € | 7669.38 €
Licence: Commercial Use - ELRA VAR | 7669.38 € | 7669.38 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1022.58 € | 9203.25 €
Licence: Commercial Use - ELRA VAR | 9203.25 € | 9203.25 €
 PHONDAT 2 - PD2 (2nd edition)    
  • German

ID: ELRA-S0024

ISLRN: 937-744-173-899-5

The corpus contains read speech of 16 different speakers. Each speaker has read a corpus of 200 different sentences from a train inquiry task. The speakers were recorded at three different sites in Germany (University of Kiel, University of Bonn, University of Munich). The language is German. The...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 127.82 € | 127.82 €
Licence: Commercial Use - ELRA VAR | 127.82 € | 127.82 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 255.65 € | 255.65 €
Licence: Commercial Use - ELRA VAR | 255.65 € | 255.65 €
 Phonetically Balanced Sentences    
  • Korean

ID: ELRA-S0129

ISLRN: 134-396-214-473-8

Large acoustic corpus in Korean produced by Kaist Korterm. 20 native Korean speakers (males and females) read 539 sentences and a set of 50 common sentences once. Information such as the size and the level of studies of the speakers is provided. The recordings took place in a soundproof room. T...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 500.00 € | 2000.00 €
Licence: Commercial Use - ELRA VAR | 2000.00 € | 2000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 1000.00 € | 4000.00 €
Licence: Commercial Use - ELRA VAR | 4000.00 € | 4000.00 €
 Phonetically Balanced Words (1)    
  • Korean

ID: ELRA-S0124

ISLRN: 511-274-208-444-3

Large acoustic corpus of read text in Korean. 2 announcers and 70 native speakers were recorded (38 males, 32 females), distributed according to 4 age classes. They read 452 eojeols (Korean terms) twice, and 2 announcers read 2000 eojeols once. In these 2000 eojeols, the above 452 eo...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 250.00 € | 1000.00 €
Licence: Commercial Use - ELRA VAR | 1000.00 € | 1000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 500.00 € | 2000.00 €
Licence: Commercial Use - ELRA VAR | 2000.00 € | 2000.00 €
 Phonetically Balanced Words (2)    
  • Korean

ID: ELRA-S0125

ISLRN: 270-301-778-832-8

Large acoustic corpus of read text in Korean produced by Kaist Korterm. Native Korean speakers (males and females) uttered 36 geographical proper nouns. Information such as the size and the level of studies of the speakers is provided. The recordings took place in a soundproof room. The dat...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 50.00 € | 200.00 €
Licence: Commercial Use - ELRA VAR | 200.00 € | 200.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 100.00 € | 400.00 €
Licence: Commercial Use - ELRA VAR | 400.00 € | 400.00 €
 Phonetically Balanced Words (3)    
  • Korean

ID: ELRA-S0126

ISLRN: 327-664-548-453-3

Large acoustic corpus in Korean produced by Kaist Korterm. Two announcers and 70 native speakers (males and females) read one paragraph twice. Information such as the size and the level of studies of the speakers is provided. The recordings took place in a soundproof room. The data are store...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 63.00 € | 250.00 €
Licence: Commercial Use - ELRA VAR | 250.00 € | 250.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 125.00 € | 500.00 €
Licence: Commercial Use - ELRA VAR | 500.00 € | 500.00 €
 Phonetically Balanced Words (4)    
  • Korean

ID: ELRA-S0127

ISLRN: 081-051-013-524-4

Large acoustic corpus in Korean produced by Kaist Korterm. 70 native Korean speakers (males and females) read 4 times 32 cardinal numbers and 9 determinatives of one syllable. Two announcers read these only 2 times. Information such as the size and the level of studies of the speakers are provide...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 200.00 € | 800.00 €
Licence: Commercial Use - ELRA VAR | 800.00 € | 800.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 400.00 € | 1600.00 €
Licence: Commercial Use - ELRA VAR | 1600.00 € | 1600.00 €
 Phonetically Balanced Words (5)    
  • Korean

ID: ELRA-S0128

ISLRN: 605-115-604-193-3

Large acoustic corpus in Korean produced by Kaist Korterm. 70 native Korean speakers (males and females) read 35 cardinal numbers, each compounded of 4 single numbers, 4 times. Two announcers read these only twice. Information such as the size and the level of studies of the speakers is provided. T...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 250.00 € | 1000.00 €
Licence: Commercial Use - ELRA VAR | 1000.00 € | 1000.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 500.00 € | 2000.00 €
Licence: Commercial Use - ELRA VAR | 2000.00 € | 2000.00 €
 Phonetically Rich Words    
  • Korean

ID: ELRA-S0130

ISLRN: 222-999-434-634-2

Large acoustic corpus in Korean produced by Kaist Korterm. 500 native speakers have been recorded (250 males, 250 females). They have uttered 32 single cardinal numbers, 1620 cardinal numbers compounded of 4 single numbers and 3813 phonetically rich words. The recordings took place in natural env...

MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 313.00 € | 1250.00 €
Licence: Commercial Use - ELRA VAR | 1250.00 € | 1250.00 €
NON MEMBER | academic | commercial
Licence: Non Commercial Use - ELRA END USER | 625.00 € | 2500.00 €
Licence: Commercial Use - ELRA VAR | 2500.00 € | 2500.00 €
