
Industry:

Automotive
Financial Services
Social Networks
Smart Home
Smart Devices

Language:

Mandarin Chinese
English
Chinese Dialects
Chinese-English Code-Mixing
Filipino/Tagalog
Japanese
Korean
Thai
Spanish
Hindi
Bahasa Indonesia
Russian
Malay
Turkish
Arabic
German
Portuguese
Urdu
Italian
French
Residential Noise
In-Vehicle Noise
Swedish
Music

Application:

Consumer Robot Controls
Automotive Virtual Assistant
Voice Commerce and Consumer Service
Smart Home Controls
Security and Authentication
Healthcare
Smartphone/watch/wearable device

Type:

Training Set
Test Set

Region:

China
UAE
Egypt
Pakistan
Brazil
Germany
Russia
Kazakhstan
Ukraine
France
the Philippines
Korea
Malaysia
USA
Japan
Saudi Arabia
Thailand
Turkey
Spain
Singapore
Italy
India
Indonesia
Sweden

Topic:

Work
Art
Business and Economy
Career Development
Climate and Environment
Culture
Daily Life
Education and Healthcare
Entertainment
Family Life
Health
Humanities
Interpersonal Relations
Military
Personalities
Politics and Law
Science and Technology
Smart Devices
Sport
Study
Travel

Speech Style:

Conversational Speech
Read Speech
Noise

Recording Environment:

Indoor Environments
Outdoors
In-Vehicle Environments
Professional TTS Recording Studio
Residential
Public Space

Recording Device:

Telephony
Recorder
Microphone
Desktop
High Fidelity Mic
Smartphone
Bluetooth headset

336 datasets found

MDT-ASR-B012 Mandarin Chinese Conversational Speech Recognition Corpus

[Open-Source]

MDT-ASR-D014 Chinese English Scripted Speech Corpus—Daily Use Sentence


MDT-ASR-D020 American English Speech Corpus


MDT-ASR-A007 Mandarin Chinese Conversational Speech Recognition Corpus—Telephony

[Open-Source]

MDT-ASR-A001 Mandarin Chinese Conversational Speech Recognition Corpus


MDT-ASR-B011 American English Speech Recognition Corpus


MDT-ASR-C012 Mexican Spanish Speech Corpus—Daily Use Sentence

[Open-Source]

Multi-Turn Conversation Dataset for Speech Language Model


48kHz Multi-Speaker Speech Dataset for Voice Cloning--Chinese


Why MD Datasets

Full Compliance

ISO/IEC 27001 & ISO/IEC 27701:2019 compliant

Multiple Dimensions

Audio, text, image, and video multi-modal data

Extensive Scope

Conversational, scripted, and spontaneous data covering extensive domains

High Accuracy

Expert-assured quality results

No related data? Contact us

