Dataset

Providing valuable data for machine learning and helping improve performances of AI systems

Data Specification
Language

ZH-CN

Recording Environment

Quiet Indoors

Audio Duration

370 Hours

Data Content

Query, command, SMS

Speech parameter

44.1K/16bit

Recording Equipment

Microphone

File Format

WAV

Application Fields

Speech Recognition

Sensitive Items

No

Copyright Ownership

Magic Data

Samples
小薇你好给我播一首伤感的歌
00:00/00:00
嘿小薇播放元耳的直播
00:00/00:00
找一首打游戏的歌曲
00:00/00:00
十月十五号三点半提醒我睡觉
00:00/00:00
Enquiry
MDT-ASR-A003 Mandarin Chinese Speech Recognition Corpus
MDT-ASR-A011 Mandarin Chinese Speech Recognition Corpus
MDT-ASR-B002 Mandarin Chinese Speech Recognition Corpus
MDT-ASR-B016 Mandarin Chinese Speech Recognition Corpus

MDT-ASR-B016 Mandarin Chinese Speech Recognition Corpus

Mandarin Chinese Speech Recognition Corpus is transcribed with an accuracy of 97%, produced by Magic Data Tech. It is valuable to improve the recognition accuracy of your ASR models.
Name *
Phone Number
E-mail *
Company Name *
Message Content
Country
City

Please fill in your information before obtaining the date link.