Company
blog
Blog
blog
20
Sep
18
Browse: 1071
No. LDC2019S23, Magic Data Chinese Mandarin Conversational Speech was selected into LDC Catalog

We are honored to say that our Chinese Mandarin Conversational Speech was selected in LDC Catalog! The catalog No. is LDC2019S23 (You can search at https://catalog.ldc.upenn.edu/LDC2019S23 metadata standard.

New trends for conversational datasets AI continue to expand into new use cases and new verticals. As the leading companies such as Google, Amazon pay more attention to continuous conversation, the importance of conversational datasets increases. Besides, the accuracy of read speech data recognition is up to 97-98%, but in conversational speech recognition, the accuracy is nearly 50% (referred to results of the CHiME-5 Challenge). This large gap means the challenge in automatic speech recognition (ASR) extend to a new phase.

This is an excellent testing dataset for conversational speech recognition models There are three main characteristics in this corpus for data collection, annotation and application. For data collection, the key word is diversity. These data are collected to cover conversations recorded in different accents and transmission channels, with speakers in different ages and genders and with a background noise corresponding to the scenario.

1) Speakers: 60 speakers from different areas in China, with age range from 4 to 67.

2) Recording environment: 3 rooms with different reverberation

3) Recording equipment: Android device (9 varieties); iOS device (8 varieties); recorder (2 varieties)

4) Recording channels: single-channel and multi-channel

5) The corpus consists both far-field and near-field voice.

For data annotation, the key word is accuracy. The annotation work is compliance to strict specifications and documentations and completed by trained annotators. Our team has formulated a series of tagging rules to meet actual needs. What does it mean? Spontaneous conversation produces overlapping, pause, cough, and clapping. These sounds are meaningful in some conditions as they may indicate the speaker's state, mood, and even hint at the speaker's mental activities. According to the company’s advanced annotation specifications, these sounds could be recognizable by AI systems.

The last key word variety is for data application. This corpus is valuable for at least 3 applications: conversational speech recognition, speaker separation and robustness testing.

1) Accuracy testing of various speech recognition models. For example, in a typical family application scenario, the family members using voice interaction include the elderly, the wife (adult female), the husband (adult male), and the children. These family members have different pronunciation patterns and habits. In the speech recognition model, the age diversity of the corpus can be used to test the recognition effect of the model for different age groups.

2) Accuracy testing of speaker separation.Scene recognition based on specific speaker has become a research hotspot. In the collection, there are both single-player recording channel and multi-player recording channel. Therefore, this dataset can be used to test the accuracy for speaker separation tasks.

3) Robustness testing of the model. Since there are far-field and near-field voice recorded at the same time, different audio contains different reverberation and background noise. The corpus was valuable for researchers to test the robustness of their systems.

Benefits

1) Spontaneous conversational data generates various responses and accordance to real life scenarios;

2) Annotation norms that can be customized to meet actual needs;

3) Strict quality management system, ensuring a continuous output of high-quality data products.

This corpus is a part of our whole databases. Magic Data Technology owns 100,000+ hours self-owned copyright datasets which can be used to improve the performance of models rapidly. If you are interested in our datasets or our data services, don’t hesitate to contact us via business@magicdatatech.com.

Share
Previous
Page
Moving Toward the Globe | Magic Data Builds Partnership with AWS, Empowering AI Data Processing
Next
Page
Data Security and Compliance in Deploying AI — Magic Data’s Data Security Commitment
Popular Tags
Latest Blogs
Moving Toward the Globe | Magic Data Builds Partnership with AWS, Empowering AI Data Processing

Recently, Magic Data officially become one of AWS’s ISV partners after Annotator® 5.0, an AI-assisted data labeling platform passing the ASW foundation technology Review (FTR).

20
Sep
18
Data Security and Compliance in Deploying AI — Magic Data’s Data Security Commitment

The importance of data security has been increasingly realized, no matter it is in national or personal level. Always putting data security at the first priority, Magic Data designs and applies a strict data protection mechanism so as to provide sufficient trusted AI training data for the industry.

20
Sep
18
How to Improve Multilingual Speech Recognition Performance? In View of Acoustic Modeling

The code-mixing phenomenon brings much challenges to the automatic speech recognition system development. How to develop a reliable multilingual speech recognition system have become a heated topic within the industry.

20
Sep
18
What is Conversational AI? And the challenge

The demand for a quick, intelligent and natural-sounding conversation between human and machine is increasing.

20
Sep
18
Massive High-Quality AI Training Data Makes HMI More Intelligent, More Humanized and More Personalized

The 3rd China Automotive Intelligent Summit 2021, took place on 27-28, Sept. 2021, Shanghai, gathers about 120 experts and executives from the automotive industry to focus on the networked technology, software development, hardware innovation, business model and user insight of intelligent cockpit, and provide an in-depth comprehensive analysis of the opportunities and challenges of intelligent cockpit development.

20
Sep
18
Sales Department
Please fill in this form to purchase datasets or quote for
data collection/ annotation services.
Name
*
Company Name
*
Email
*
Phone Number
*
Detail
Country
City
Submit
Sales Department
Please fill in this form and we will contact you soon
Name
*
Company Name
*
Email
*
Phone Number
*
Detail
Country
City
Submit
Resources Department
If you want to be our data collection and annotation team
member, please fill in this form.
DATA COLLECTION PROJECTS
Language*
Location*
DATA ANNOTATION PROJECTS
Language*
CONTACT INFORMATION
Name*
Company Name*
E-mail*
Phone Number*
Experience*
Address*
Submit
Marketing Department
If you want to forward our article or tell us marketing
events, please fill in this form.
Name
*
Company Name
*
Email
*
Phone Number
*
Detail
Submit
Human Resources Department
Please fill in this form to be a member of Magic Data Tech.
Name
*
Email
*
Phone Number
*
Job
*
Upload Resume
Submit
Sample Download
Name*
E-mail*
Phone Number*
Company Name*
Job
Department
Company Product
I am also interested in the following data:
Languages
Style
Scenario

We will contact you via telephone to confirm your information and provide the method to download.
Submit
Submission Successful!
We will contact you as soon as possible.
This page would be
closed in 3 seconds automatically.
Contact us
>
TOP