MDT-AG030 Mandarin Heavy Accent (Hunan) Conversational Speech Corpus
Play Audio
MDT-AE032 Minnan Dialect Scripted Speech Corpus—Daily Use Sentence
Play Audio
Noise Audio Dataset
This dataset provides extensive coverage of various types of noisy environments, including but not limited to the bustling sounds of airports, the lively chatter at train stations, and the background noise in cafés.
Play Audio
MDT-AF060 Bahasa Indonesia Conversational Speech Corpus—Outdoors
Play Audio
MDT-BE011 Mandarin Chinese Speech Corpus for TTS
[Open-Source]
Play Audio
MDT-AE017 Filipino/Tagalog Scripted Speech Corpus—Daily Use Sentence
[Open-Source]
Play Audio
Why MD Datasets
Full Compliance
ISO/IEC 27001 & ISO/IEC 27701:2019 compliant
Multiple Dimension
Audio, text, image, and video multi-modal data
Extensive Scope
Conversational, scripted, and spontaneous data covering extensive domains