The Japanese Read Speech Recognition Corpus was developed by MAGICDATA TECHNOLOGY Co., Ltd. and totals 1,500 hours. A 30-hour subset of scripted read speech has been published free of charge for non-commercial use. Its 37 native speakers come from different areas of Japan, including Tokyo, Osaka, and Hokkaido. The corpus is a test set, recorded in a quiet indoor environment, and the audio is in PCM format. The recording texts are drawn from daily conversation.
|Recording Environment||:||Quiet Indoors|
|Audio Duration||:||30 Hours|
|Data Content||:||Common sentences from daily life|
|Speakers||:||37 native speakers from different areas of Japan (including Tokyo, Osaka, Hokkaido, etc.)|
|Application Fields||:||Speech Recognition|
|Copyright Ownership||:||Magic Data|
The user shall observe the following rules when browsing the website and using the data: