MDT-AF056 Mandarin Chinese Child Scripted Speech Corpus
Play Audio
MDT-AC015 Chinese English Speech Corpus for Evaluation
Play Audio
MDT-AE027 Turkish English Scripted Speech Corpus—Keyword Spotting
Play Audio
Multi-Turn Conversation Dataset for Speech Language Model
Play Audio
Noise Audio Dataset
This dataset provides extensive coverage of various types of noisy environments, including but not limited to the bustling sounds of airports, the lively chatter at train stations, and the background noise in cafés.
Play Audio
MDT-AE083 Peninsular Arabic Conversational Speech Corpus
Play Audio
Why MD Datasets
Full Compliance
ISO/IEC 27001 & ISO/IEC 27701:2019 compliant
Multiple Dimension
Audio, text, image, and video multi-modal data
Extensive Scope
Conversational, scripted, and spontaneous data covering extensive domains