Training Datasets
datasets
Authentic training data for your AI models

We provide valuable and reliable training data to empower your state-of-the-art AI models. You can find datasets in different languages, styles, and solutions. Our datasets can improve your AI models’ performance, thus accelerating the commercialization of AI initiatives.

Select >
Language>
All
Arabic
Bahasa Indonesia
Chinese Dialects
Chinese-English Code-Mixing
English
Filipino/Tagalog
French
German
Hindi
Italian
Japanese
Korean
Malay
Mandarin Chinese
Portuguese
Russian
Spanish
Thai
Turkish
Urdu
Style>
All
Conversational Speech
Noise
Read Speech
Spontaneous
Recording Environment>
All
In-Vehicle Environments
Indoor Environments
Outdoors
Professional TTS Recording Studio
Recording Device>
All
Desktop
High Fidelity Microphone
Microphone
Mobile
Recorder
Telephony
Domain>
All
Automatic Speech Recognition (ASR)
Computer Vision (CV)
Natural Language Processing (NLP)
Pronunciation Lexicon (Lex.)
Text To Speech (TTS)
Application>
All
Automotive Virtual Assistant
Consumer Robot Controls
Healthcare
Security and Authentication
Smart Home Controls
Voice Commerce & Consumer Service
MDT-ASR-F064 Mandarin Chinese Scripted Speech Corpus
MDT-ASR-F063 Mandarin Chinese Scripted Speech Corpus
MDT-ASR-F062 Mandarin Chinese Colloquial Scripted Speech Corpus
MDT-ASR-F065 Shanghai Dialect Conversational Speech Corpus
MDT-ASR-F066 Filipino/Tagalog Scripted Speech Corpus
MDT-ASR-E084 French Scripted Speech Corpus
MDT-ASR-E086 Peninsular Spanish Scripted Speech Corpus
MDT-ASR-E085 Brazilian Portuguese Scripted Speech Corpus
MDT-ASR-F060 Bahasa Indonesia Conversational Speech Corpus
1 2 3 4 5 6 7 8 9 10 11 12 13 14 >
Show More
>
Sales Department
Please fill in this form to purchase datasets or quote for
data collection/ annotation services.
Name
*
Company Name
*
Email
*
Phone Number
*
Detail
Country
City
Submit
Sales Department
Please fill in this form and we will contact you soon
Name
*
Company Name
*
Email
*
Phone Number
*
Detail
Country
City
Submit
Resources Department
If you want to be our data collection and annotation team
member, please fill in this form.
DATA COLLECTION PROJECTS
Language*
Location*
DATA ANNOTATION PROJECTS
Language*
CONTACT INFORMATION
Name*
Company Name*
E-mail*
Phone Number*
Experience*
Address*
Submit
Marketing Department
If you want to forward our article or tell us marketing
events, please fill in this form.
Name
*
Company Name
*
Email
*
Phone Number
*
Detail
Submit
Human Resources Department
Please fill in this form to be a member of Magic Data Tech.
Name
*
Email
*
Phone Number
*
Job
*
Upload Resume
Submit
Sample Download
Name*
E-mail*
Phone Number*
Company Name*
Job
Department
Company Product
I am also interested in the following data:
Languages
Style
Scenario

We will contact you via telephone to confirm your information and provide the method to download.
Submit
Submission Successful!
We will contact you as soon as possible.
This page would be
closed in 3 seconds automatically.
>
TOP