Japanese Read Speech Recognition Corpus
Japanese, Read Speech, Smartphone
The Japanese Read Speech Recognition Corpus was developed by MAGICDATA TECHNOLOGY Co., Ltd. and totals 1,500 hours of speech. A 30-hour subset of scripted read speech has been published free of charge for non-commercial use. The 37 native speakers come from different areas of Japan, including Tokyo, Osaka, and Hokkaido, among others. The corpus is a test set, recorded indoors, and the audio is in PCM format. The recording texts are drawn from daily conversation.
Common sentences in daily life
37 speakers from different areas of Japan (including Tokyo, Osaka, Hokkaido, etc.)
The user shall observe the following rules when browsing the website and using the data: