Central and Eastern European Survey
Resources
Department of Computer Science
"Politehnica" University
NL and Speech Resources available: Speech, Lexical.
Name: TIMIT Nature: speech Language: American English Size: about 5 hours Format: NIST wav Coverage: read speech Medium: CD-ROM Availability: commercial product
Name: OGI Multilanguage Nature: speech Language: English, Farsi (Persian), French, German, Hindi, Japanese, Korean, Mandarin Chinese, Spanish, Tamil, Vietnamese Size: about 25 hours Format: NIST wav Coverage: telephone speech Medium: CD-ROM Availability: commercial product
Name: RODI Nature: speech Language: Romanian Size: about 50 minutes Format: RIFF WAV, NIST wav Coverage: isolated digits Medium: CD-ROM Availability: confidential
Name: BABEL - Romanianpart Nature: speech Language: Romanian Size: about 10 hours Format: raw speech Coverage: continuouslyread passages and sentences, numbers, CVC words (isolated and in
contexts), Romanian alphabet, names (spoken and spelled), telephone
numbers, dates, letters + digits strings, addresses Medium: CD-ROM (3) Availability: confidential
Name: pronunciationdictionary Nature: lexical Language: Romanian Size: about 24000 words Format: ASCII Coverage: prose Medium: diskette Availability: confidential
|