MAGICDATA Mandarin Chinese Read Speech Corpus was developed by MAGIC DATA TECHNOLOGY Co., Ltd. and freely published for non-commercial use. The corpus consists of 755 hours of scripted read speech data by 1000 native speakers of the Mandarin Chinese spoken in mainland China.
Language | : | Chinese |
Recording Environment | : | Quiet Indoors |
Audio duration | : | 755 Hours |
Data Content | : | Common sentence in daily life |
Speakers Intro | : | 1000 native speakers of the Mandarin Chinese from different areas |
File Format | : | PCM |
Recording Equipment | : | Smartphone |
Application Fields | : | Speech Recognition |
Sensitive Items | : | No |
Copyright Ownership | : | Magic Data |
The user shall observe the following rules when browsing the website and using the data: