Company
blog
Blog
blog
22
Feb
16
Browse: 387
Magic Data Launches Conversational AI Datasets for Machine Learning

Reprinted: yahoo finance

Magic Data, a global AI data service provider, has launched an accumulation of more than 200,000 hours of training datasets, including 140,000 hours of conversational AI training datasets and 60,000 hours of read speech datasets, covering Asian languages, English dialects, and European languages, boosting the rapid development of human-computer interaction in artificial intelligence.

Why conversational AI dataset?

Experiments show conversational data has better performance on ASR machine learning. Magic Data R&D Center works on conversational speech data and read speech data comparison, where 3,000 hours of conversational speech training data and read speech training data were respectively used to train Automatic Speech Recognition (ASR) models under customer service scenario, broadcasting, and navigation command. It shows that compared with read speech data, conversational speech data word accuracy is improved relatively up to 84%.

In addition, Magic Data R&D Center conduct experiment with 3,000 hours and 30,000 hours conversational training data. The result shows the more the conversational data is used, the higher the word accuracy comes.

Conversational AI data compliance and transparency

Magic Data applies a series of measures to ensure data compliance and transparency. The internal processes are in accordance with industry security standards, and are GDPR compliant, ISO 27001 and ISO/IEC 27701:2019 certified.

Contact Magic Data expert team for more information: business@magicdatatech.com

Share
Previous
Page
What Conversational Data Play in the Future of Online Conferencing?
Next
Page
How to Start Your Machine Learning Projects with MagicData-RAMC?
Latest Blogs
What Conversational Data Play in the Future of Online Conferencing?

Over two years into the pandemic, a lot of things have changed in the remote work landscape. As more jobs move to remote settings than ever before, the communication between coworkers and customers has shifted to that realm as well. With that shift comes a new set of trials and tribulations that didn’t exist in face-to-face meetings.

22
Feb
16
How to Start Your Machine Learning Projects with MagicData-RAMC?

As a collection of high quality and richly annotated training data, MagicData-RAMC is applicable to a series of research. This article will introduce 3 experiments related to speech recognition, speaker diarization and keyword search based on MagicData-RAMC conducted by Magic Data, together with the Institute of Acoustics, Chinese Academy of Sciences, Shanghai Jiao Tong University and Northwestern Polytechnic University.

22
Feb
16
Open-source MagicData-RAMC: 180-hour Conversational Speech Dataset in Mandarin Released

MagicHub, an open-source community for AI, releases 180-hour conversational speech dataset in Mandarin for free, enriching the open source speech corpus and promoting the development of spoken language processing technology and conversational AI.

22
Feb
16
Moving Toward the Globe | Magic Data Builds Partnership with AWS, Empowering AI Data Processing

Recently, Magic Data officially become one of AWS’s ISV partners after Annotator® 5.0, an AI-assisted data labeling platform passing the ASW foundation technology Review (FTR).

22
Feb
16
Data Security and Compliance in Deploying AI — Magic Data’s Data Security Commitment

The importance of data security has been increasingly realized, no matter it is in national or personal level. Always putting data security at the first priority, Magic Data designs and applies a strict data protection mechanism so as to provide sufficient trusted AI training data for the industry.

22
Feb
16
Sales Department
Please fill in this form to purchase datasets or quote for
data collection/ annotation services.
Name
*
Company Name
*
Title
*
Email
*
Phone Number
*

Country code + Phone Number

Phone number-e.g. +86 138315xxxxx

Detail
Country
City
Submit
Sales Department
Please fill in this form and we will contact you soon
Name
*
Company Name
*
Email
*
Phone Number
*

Country code + Phone Number

Phone number-e.g. +86 138315xxxxx

Detail
Country
City
Submit
Resources Department
If you want to be our data collection and annotation team
member, please fill in this form.
DATA COLLECTION PROJECTS
Language*
Location*
DATA ANNOTATION PROJECTS
Language*
CONTACT INFORMATION
Name*
Company Name*
E-mail*
Phone Number*

Country code + Phone Number

Phone number-e.g. +86 138315xxxxx

Experience*
Address*
Submit
Marketing Department
If you want to forward our article or tell us marketing
events, please fill in this form.
Name
*
Company Name
*
Email
*
Phone Number
*

Country code + Phone Number

Phone number-e.g. +86 138315xxxxx

Detail
Submit
Human Resources Department
Please fill in this form to be a member of Magic Data Tech.
Name
*
Email
*
Phone Number
*

Country code + Phone Number

Phone number-e.g. +86 138315xxxxx

Job
*
Upload Resume
Submit
Sample Download
Name*
E-mail*
Phone Number*

Country code + Phone Number

Phone number-e.g. +86 138315xxxxx

Company Name*
Job
Department
Company Product
I am also interested in the following data:
Languages
Style
Scenario

We will contact you via telephone to confirm your information and provide the method to download.
Submit
Submission Successful!
We will contact you as soon as possible.
This page would be
closed in 3 seconds automatically.
Talk to Magic Data
>
TOP