Training Datasets
Authentic training data for your AI models


Language
Mandarin Chinese
English
Chinese Dialects
Chinese-English Code-Mixing
Filipino/Tagalog
Japanese
Korean
Thai
Spanish
Hindi
Bahasa Indonesia
Russian
Malay
Turkish
Arabic
German
Portuguese
Urdu
Italian
French
Domain
Automatic Speech Recognition (ASR)
Natural Language Processing (NLP)
Text To Speech (TTS)
Pronunciation Lexicon (Lex.)
Computer Vision (CV)
Style
Conversational Speech
Read Speech
Noise
Recording Environment
Indoor Environments
Outdoors
In-Vehicle Environments
Professional TTS Recording Studio
Recording Device
Mobile
Recorder
Microphone
Desktop
High Fidelity Mic
Telephony
Application
Consumer Robot Controls
Automotive Virtual Assistant
Voice Commerce and Consumer Service
Smart Home Controls
Security and Authentication
Healthcare
Region
China
UAE
Egypt
Pakistan
Brazil
Germany
Russia
Kazakhstan
Ukraine
France
the Philippines
Korea
Malaysia
USA
Japan
Saudi Arabia
Thailand
Turkey
Spain
Singapore
Italy
India
Indonesia
Type
Training Set
Test Set

We provide reliable training data for your state-of-the-art AI models. You can find datasets in different languages, styles, and application scenarios. Our datasets can improve your AI models’ performance and help accelerate the commercialization of AI initiatives.

MDT-ASR-F049 Pakistani English Scripted Speech Corpus
MDT-ASR-F024 Filipino English Scripted Speech Corpus—Keyword Spotting
MDT-ASR-F010 Singaporean English Scripted Speech Corpus—Keyword Spotting