MDT-AE027 Turkish English Scripted Speech Corpus—Keyword Spotting
Play Audio
MDT-AE017 Filipino/Tagalog Scripted Speech Corpus—Daily Use Sentence
[Open-Source]
Play Audio
MDT-AF058 Mandarin Chinese Scripted Speech Corpus—Keyword Spotting
Play Audio
Noise Audio Dataset
This dataset provides extensive coverage of various types of noisy environments, including but not limited to the bustling sounds of airports, the lively chatter at train stations, and the background noise in cafés.
Play Audio
MDT-AF078 Spanish Conversational Speech Corpus
Play Audio
MDT-AE039 Hunan Dialect Scripted Speech Corpus—Daily Use Sentence
Play Audio
Why MD Datasets
Full Compliance
ISO/IEC 27001 & ISO/IEC 27701:2019 compliant
Multiple Dimension
Audio, text, image, and video multi-modal data
Extensive Scope
Conversational, scripted, and spontaneous data covering extensive domains