多媒體‎ > ‎音樂工具‎ > ‎

Multilingual Speech Database

張貼者:2010年8月19日 下午8:57未知的使用者   [ service orderble 已於 2012年8月8日 上午2:33 更新 ]
 image

適合講話材料的研究語言識別和語音識別性能評價終端。






---
開發商:NTT Advanced Technology Corporation
原廠網址:http://www.ntt-at.com/products_e/multilingual/index.html
更新日期:2011/05/11
採購正式版、大量授權報價、技術支援、軟體諮詢、委託採購、詢問報價請來電 02-29299388 分機16 , 
來信service@orderble.com,或點我
------------------------------------------------------------------------------


規範

健談的最大限度,168揚聲器 *闡明同一城市名稱根據自己的母語的發音。

他們的母語語言;
美國英語,阿拉伯語,中文(普通話),荷蘭語,英語(英國),芬蘭語,法語,德語,希臘語,匈牙利語,印地文,印尼文,意大利語,日語,韓語,波蘭語,葡萄牙語(巴西),俄語,西班牙語,瑞典語,泰國人。

對於每一種語言,四男四女發言人分配。


城市名稱200名國際知名的城市,位於空港。

發言者表達了同樣的一套200名城市根據自己的母語的發音。



*注: 由於省略了一些材料,由於搖搖欲墜的發音,其數量約20健談小於這個數字對某些城市。

錄音條件順應 ITU - T建議臨 80。
數碼錄音和編輯機,使用。 (同樣作為我們的產品“多語言語音數據庫 Telephonometry 1994”。)


媒體所有的語音信號記錄在CD - ROM磁盤作為個人電腦的二進制文件(小endian字節順序)按照ISO9660格式的格式。
採樣率:16千赫
振幅分辨率:16位
客戶可以檢索量化的語音信號通過 PC與 CD - ROM驅動器。


註釋: 這些文件在CD - ROM的磁盤不在Windows波格式;客戶無法重現他們由普通的PC板無聲音的格式轉換。 該文件還不能播放聲音信號作為一個商業 CD播放器。












Feature
Suitable speech materials for the research on language identification and performance evaluation for speech recognition terminals.




Price
400,000 JP Yen for one set of CD-ROM disks (4 disks) for an overseas mailing address.
Note: Clients are requested to pay their domestic tax or custom duty by themselves.

Page Top

Specification

TalkersMaximally, 168 speakers *articulate the same city names according to their native pronunciation.

Their native languages are;
American English, Arabic, Chinese (Mandarin), Dutch, English (British), Finnish, French, German, Greek, Hungarian, Hindi, Indonesian, Italian, Japanese, Korean, Polish, Portuguese (Brazilian), Russian, Spanish, Swedish, Thai.

For each language, 4 male and 4 female speakers were allocated.


City NamesTwo hundred famous cities where international air ports are located.

Speakers articulate the same set of 200 city names according to their own native pronunciation.



*Note: Since some material was omitted due to faltering pronunciation, the number of talkers is about 20 less than this number for some cities.

Recording conditionsConforming to the ITU-T Recommendation P.80.
Digital recording and editing machines were used. (The same as our product "Multilingual Speech Database for Telephonometry 1994".)


MediaAll speech signals are recorded on CD-ROM disks as the PC binary files (little endian Byte order) according to ISO9660 format.
Sampling Rate: 16 kHz
Amplitude resolution: 16 bits
Clients can retrieve the quantized speech signal by PC with a CD-ROM drive.
Comments