MDT-TTS-E020 Mandarin Chinese Text Normalization Text Corpus for TTS

Language

zh

Number of Utterances

100,736

Data Content

Text Normalization

File Format

TXT

Field of Application

TTS

Data Sensitive Items

None

Copyright Owner

Magic Data

Sample

另一队中国组合由邵奕俊担任舵手,最终排名第十四,落后冠军组合1.63秒。
另一队中国组合由邵奕俊担任舵手,最终排名第十四,落后冠军组合一点六三秒。

第二局比赛中国队攻势不减,侯宇阳在23分33秒时将比分改写为3:0。
第二局比赛中国队攻势不减,侯宇阳在二十三分三十三秒时将比分改写为三比零。

我们将在接下去的24-48个小时内知道会发生什么。
我们将在接下去的二十四到四十八个小时内知道会发生什么。
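
The samples above pair each raw sentence with its normalized (spoken-form) version. A minimal sketch of how such pairs might be read from a TXT file follows. It assumes each record is two consecutive lines (raw, then normalized) separated by blank lines, a layout inferred only from the samples shown here and not confirmed for the delivered corpus. The names `parse_pairs` and `has_ascii_digits` are hypothetical helpers, not part of any Magic Data tooling.

```python
from typing import List, Tuple


def parse_pairs(text: str) -> List[Tuple[str, str]]:
    """Split text into (raw, normalized) pairs.

    Assumption: every record is exactly two non-empty lines, and records
    are separated by one or more blank lines.
    """
    pairs: List[Tuple[str, str]] = []
    block: List[str] = []
    # A trailing empty string flushes the last record.
    for line in text.splitlines() + [""]:
        line = line.strip()
        if line:
            block.append(line)
            continue
        if block:
            if len(block) != 2:
                raise ValueError(f"expected 2 lines per record, got {len(block)}")
            pairs.append((block[0], block[1]))
            block = []
    return pairs


def has_ascii_digits(s: str) -> bool:
    """Return True if the string still contains an ASCII digit 0-9."""
    return any(ch in "0123456789" for ch in s)


sample = """\
另一队中国组合由邵奕俊担任舵手,最终排名第十四,落后冠军组合1.63秒。
另一队中国组合由邵奕俊担任舵手,最终排名第十四,落后冠军组合一点六三秒。

我们将在接下去的24-48个小时内知道会发生什么。
我们将在接下去的二十四到四十八个小时内知道会发生什么。
"""

pairs = parse_pairs(sample)
for raw, norm in pairs:
    # In the published samples, the normalized line spells out every number.
    print(raw)
    print(norm)
    print("digits left in normalized line:", has_ascii_digits(norm))
```

A check like `has_ascii_digits` is only a rough sanity test for TTS front-end data: it flags normalized lines where a numeral was left unexpanded, but it says nothing about whether the chosen reading is correct.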

Related Datasets

MDT-TTS-E013 Sichuan Dialect Speech Corpus for TTS

MDT-TTS-F007 Changsha Dialect Speech Corpus for TTS

MDT-TTS-E019 Korean Speech Corpus for TTS

MDT-TTS-G007 Northeastern Mandarin Speech Corpus for TTS

MDT-TTS-E011 Mandarin Chinese Speech Corpus for TTS

[Open-Source]

Why MD Datasets

Full Compliance

ISO/IEC 27001 & ISO/IEC 27701:2019 compliant

Multiple Dimensions

Audio, text, image, and video multi-modal data

Extensive Scope

Conversational, scripted, and spontaneous data covering extensive domains

High Accuracy

Quality results backed by expertise
