20
Sep
18
MAGICDATA Kid Voice TTS Corpus in Mandarin Chinese

The MAGICDATA Kid Voice TTS Corpus in Mandarin Chinese was recorded by a four-year-old girl born in Beijing, China. We are now releasing 15 minutes of speech data from the corpus for non-commercial use. This is the first time this voice has been published!

Contents and description of the corpus:

(1) The corpus contains 15 minutes of speech data, recorded in an NC-20 acoustic studio.

(2) The speaker is a 4-year-old girl born in Beijing.

(3) Detailed information, such as speech data coding and speaker information, is preserved in the metadata file.

(4) The speaking style is natural child speech.

(5) Annotation includes four parts: pronunciation proofreading, prosody labeling, phone boundary labeling, and part-of-speech (POS) tagging.

(6) The annotation accuracy is higher than 99%.

(7) For phone labeling, the annotation marks not only the boundaries of phonemes but also the boundaries of silence segments.
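
To illustrate what phone boundary labeling with explicit silence segments (item 7) can look like in practice, here is a minimal Python sketch. It is not the corpus's actual file format, which this post does not describe. The "start end label" line layout, the "sil" silence label, the pinyin-style phone labels, and all names in the code (Segment, parse_segments, split_durations, is_contiguous) are assumptions made purely for illustration.

```python
from __future__ import annotations

import math
from dataclasses import dataclass

# Hypothetical label for silence segments; the real corpus may use another.
SILENCE = "sil"


@dataclass(frozen=True)
class Segment:
    """One labeled interval: a phone or a stretch of silence."""
    label: str
    start: float  # seconds
    end: float    # seconds

    @property
    def duration(self) -> float:
        return self.end - self.start


def parse_segments(text: str) -> list[Segment]:
    """Parse lines of the assumed form 'start end label'."""
    segments = []
    for line in text.strip().splitlines():
        start, end, label = line.split()
        segments.append(Segment(label, float(start), float(end)))
    return segments


def split_durations(segments: list[Segment]) -> tuple[float, float]:
    """Return (speech_seconds, silence_seconds)."""
    silence = sum(s.duration for s in segments if s.label == SILENCE)
    speech = sum(s.duration for s in segments if s.label != SILENCE)
    return speech, silence


def is_contiguous(segments: list[Segment]) -> bool:
    """Check that each segment starts where the previous one ends."""
    return all(
        math.isclose(prev.end, cur.start)
        for prev, cur in zip(segments, segments[1:])
    )


# Made-up alignment for illustration only, not taken from the corpus.
EXAMPLE = """
0.00 0.25 sil
0.25 0.33 n
0.33 0.50 i3
0.50 0.58 h
0.58 0.80 ao3
0.80 1.00 sil
"""

if __name__ == "__main__":
    segs = parse_segments(EXAMPLE)
    speech, silence = split_durations(segs)
    print(f"speech: {speech:.2f} s, silence: {silence:.2f} s")
    print("contiguous:", is_contiguous(segs))
```

Because silence boundaries are labeled alongside phoneme boundaries, leading and trailing pauses can be measured or trimmed without running a separate voice activity detector.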

The corpus is intended to help researchers in the TTS field. It is part of a much larger dataset (the 2.3-hour MAGICDATA Kid Voice TTS Corpus in Mandarin Chinese), which was recorded in the same environment.

Speaker intro: The speaker, NiuNiu, is lively and cheerful. When she first came to the studio, she couldn't wait to introduce herself: "My name is NiuNiu, I am 4 years old." An outgoing child, she gets along with others quickly. NiuNiu's favorite cartoons are "Frozen" and "My Little Pony".

Please note that this corpus is published with the authorization of the speaker and her parents.

For more details or for commercial use, please contact us by e-mail: business@magicdatatech.com
