Dataset

Providing valuable data for machine learning and helping improve performances of AI systems

Data Specification
Language

ZH-CN

Recording Environment

Quiet Indoors

Audio Duration

732 Hours

Data Content

Query, command, SMS

Speech parameter

16K/16bit

Recording Equipment

Smartphone

Application Fields

Speech Recognition

Sensitive Items

No

Copyright Ownership

Magic Data

Samples
打开净化空气
00:00/00:00
我要看情圣
00:00/00:00
请 帮 我 把温度 调 高 一点
00:00/00:00
接收 最近 的 短信
00:00/00:00
Enquiry
MDT-ASR-A003 Mandarin Chinese Speech Recognition Corpus
MDT-ASR-A011 Mandarin Chinese Speech Recognition Corpus
MDT-ASR-B002 Mandarin Chinese Speech Recognition Corpus
MDT-ASR-B016 Mandarin Chinese Speech Recognition Corpus

MDT-ASR-A003 Mandarin Chinese Speech Recognition Corpus

Mandarin Chinese Conversational Speech Recognition Corpus is transcribed with an accuracy of 97%, produced by Magic Data Tech. It is valuable to improve the recognition accuracy of your ASR models.
Name *
Phone Number
E-mail *
Company Name *
Message Content
Country
City

Please fill in your information before obtaining the date link.