20 Sep 18
Magic Data Chinese Mandarin Conversational Speech Selected into the LDC Catalog (No. LDC2019S23)

We are honored to announce that our Chinese Mandarin Conversational Speech corpus has been selected into the LDC Catalog. The catalog number is LDC2019S23, and the entry can be found at https://catalog.ldc.upenn.edu/LDC2019S23.

New trends for conversational datasets

AI continues to expand into new use cases and new verticals. As leading companies such as Google and Amazon pay more attention to continuous conversation, conversational datasets are becoming more important. Moreover, while recognition accuracy on read speech reaches 97-98%, accuracy on conversational speech is only about 50% (according to results from the CHiME-5 Challenge). This large gap means that the challenge in automatic speech recognition (ASR) has entered a new phase.

This is an excellent testing dataset for conversational speech recognition models

This corpus has three main characteristics, covering data collection, annotation, and application. For data collection, the key word is diversity. The data were collected to cover conversations recorded in different accents and over different transmission channels, by speakers of different ages and genders, with background noise that matches each scenario.

1) Speakers: 60 speakers from different areas of China, aged 4 to 67.

2) Recording environment: 3 rooms with different reverberation.

3) Recording equipment: Android device (9 varieties); iOS device (8 varieties); recorder (2 varieties)

4) Recording channels: single-channel and multi-channel

5) The corpus contains both far-field and near-field speech.

For data annotation, the key word is accuracy. The annotation work complies with strict specifications and documentation and is completed by trained annotators. Our team has formulated a series of tagging rules to meet actual needs. What does this mean in practice? Spontaneous conversation produces overlapping speech, pauses, coughs, and clapping. These sounds are meaningful in some conditions, as they may indicate the speaker's state or mood and even hint at the speaker's mental activity. Under our annotation specifications, these sounds are tagged so that AI systems can recognize them.

For data application, the key word is variety. This corpus is valuable for at least three applications: conversational speech recognition, speaker separation, and robustness testing.

1) Accuracy testing of various speech recognition models. For example, in a typical family scenario, the people using voice interaction include the elderly, adult women, adult men, and children. These family members have different pronunciation patterns and habits. The age diversity of the corpus can be used to test how well a speech recognition model performs for different age groups.

2) Accuracy testing of speaker separation. Scene recognition based on a specific speaker has become an active research topic. The collection contains both single-speaker and multi-speaker recording channels, so this dataset can be used to test the accuracy of speaker separation.

3) Robustness testing of the model. Because far-field and near-field speech were recorded at the same time, different audio files contain different reverberation and background noise. The corpus is valuable for researchers who want to test the robustness of their systems.
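
The per-age-group accuracy test described in application 1) could be sketched roughly as follows. This is a minimal illustration, not part of the corpus tooling: the age buckets, the field names (`age`, `ref`, `hyp`), and the three sample utterances are all made up for the example, and character error rate (CER) is used here only because it is a common metric for Mandarin ASR.

```python
from collections import defaultdict


def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (here, strings of characters)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def age_group(age):
    """Hypothetical age buckets; adjust to the evaluation you actually need."""
    if age < 13:
        return "child"
    if age < 60:
        return "adult"
    return "elderly"


def cer_by_group(utterances):
    """Return the character error rate for each age group."""
    errors = defaultdict(int)
    chars = defaultdict(int)
    for utt in utterances:
        group = age_group(utt["age"])
        errors[group] += edit_distance(utt["ref"], utt["hyp"])
        chars[group] += len(utt["ref"])
    return {g: errors[g] / chars[g] for g in chars if chars[g] > 0}


# Made-up sample data: "ref" is the reference transcript, "hyp" the model output.
sample = [
    {"age": 5, "ref": "你好吗", "hyp": "你好"},
    {"age": 35, "ref": "今天天气好", "hyp": "今天天气好"},
    {"age": 66, "ref": "我要听歌", "hyp": "我要听哥"},
]

for group, cer in sorted(cer_by_group(sample).items()):
    print(f"{group}: CER = {cer:.2%}")
```

Comparing the per-group rates side by side shows whether a model is noticeably weaker for, say, children or elderly speakers than for adults.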

Benefits

1) Spontaneous conversational data that contains varied responses and matches real-life scenarios;

2) Annotation norms that can be customized to meet actual needs;

3) Strict quality management system, ensuring a continuous output of high-quality data products.

This corpus is one part of our full data catalog. Magic Data Technology holds the copyright to more than 100,000 hours of datasets, which can be used to improve model performance rapidly. If you are interested in our datasets or our data services, don’t hesitate to contact us at business@magicdatatech.com.
