Training Datasets
datasets
Authentic training data for your AI models

We provide valuable and reliable training data to empower your state-of-the-art AI models. You can find datasets in different languages, styles, and solutions. Our datasets can improve your AI models’ performance, thus accelerating the commercialization of AI initiatives.

Select >
Language>
All
Arabic
Bahasa Indonesia
Chinese Dialects
Chinese-English Code-Mixing
English
Filipino/Tagalog
French
German
Hindi
Italian
Japanese
Korean
Malay
Mandarin Chinese
Portuguese
Russian
Spanish
Thai
Turkish
Urdu
Style>
All
Conversational Speech
Noise
Read Speech
Spontaneous
Recording Environment>
All
In-Vehicle Environments
Indoor Environments
Outdoors
Professional TTS Recording Studio
Others
Recording Device>
All
Desktop
High Fidelity Microphone
Microphone
Mobile
Recorder
Telephony
Domain>
All
Automatic Speech Recognition (ASR)
Computer Vision (CV)
Natural Language Processing (NLP)
Pronunciation Lexicon (Lex.)
Text To Speech (TTS)
Application>
All
Automotive Virtual Assistant
Consumer Robot Controls
Healthcare
Security and Authentication
Smart Home Controls
Voice Commerce & Consumer Service
MDT-ASR-F060 Bahasa Indonesia Conversational Speech Corpus
MDT-ASR-F046 Egyptian Arabic Conversational Speech Corpus
MDT-ASR-E083 Peninsular Arabic Conversational Speech Corpus
MDT-ASR-F053 Mandarin Chinese Conversational Speech Corpus
MDT-ASR-F054 Tianjin Dialect Conversational Speech Corpus
MDT-ASR-F052 Saudi Arabic Conversational Speech Corpus
MDT-ASR-F051 Xi‘an Dialect Conversational Speech Corpus
MDT-ASR-F027 Brazilian Portuguese Conversational Speech Corpus
MDT-ASR-F021 Bahasa Indonesia Conversational Speech Corpus
1 2 3 4 >
Show More
>
Sales Department
Please fill in this form to purchase datasets or quote for
data collection/ annotation services.
Name
*
Company Name
*
Email
*
Phone Number
*
Detail
Country
City
Submit
Sales Department
Please fill in this form and we will contact you soon
Name
*
Company Name
*
Email
*
Phone Number
*
Detail
Country
City
Submit
Resources Department
If you want to be our data collection and annotation team
member, please fill in this form.
DATA COLLECTION PROJECTS
Language*
Location*
DATA ANNOTATION PROJECTS
Language*
CONTACT INFORMATION
Name*
Company Name*
E-mail*
Phone Number*
Experience*
Address*
Submit
Marketing Department
If you want to forward our article or tell us marketing
events, please fill in this form.
Name
*
Company Name
*
Email
*
Phone Number
*
Detail
Submit
Human Resources Department
Please fill in this form to be a member of Magic Data Tech.
Name
*
Email
*
Phone Number
*
Job
*
Upload Resume
Submit
Sample Download
Name*
E-mail*
Phone Number*
Company Name*
Job
Department
Company Product
I am also interested in the following data:
Languages
Style
Scenario

We will contact you via telephone to confirm your information and provide the method to download.
Submit
Submission Successful!
We will contact you as soon as possible.
This page would be
closed in 3 seconds automatically.
>
TOP