See What's NEW
MDT-TTS-E020 Mandarin Chinese Text Normalization Text Corpus for TTS

MDT-TTS-E020 Mandarin Chinese Text Normalization Text Corpus for TTS

Language

zh

Number of Utterances

100,736

Data Content

Text Normalization

File Format

TXT

Field of Application

TTS

Data Sensitive Items

nil

Copyright Owner

Magic Data

Sample

另一队中国组合由邵奕俊担任舵手,最终排名第十四,落后冠军组合1.63秒。
另一队中国组合由邵奕俊担任舵手,最终排名第十四,落后冠军组合一点六三秒。

第二局比赛中国队攻势不减,侯宇阳在23分33秒时将比分改写为3:0。
第二局比赛中国队攻势不减,侯宇阳在二十三分三十三秒时将比分改写为三比零。

我们将在接下去的24-48个小时内知道会发生什么。
我们将在接下去的二十四到四十八个小时内知道会发生什么。

No related data? Contact us

Contact Us

Related Datasets

MDT-ASR-E043-A4 Mandarin Chinese Conversational Speech Corpus — Daily Life

Play Audio

MDT-ASR-E072&F029-A1 Italian Scripted Speech Corpus — Keyword Spotting

Play Audio

MDT-ASR-B015 Korean Speech Recognition Corpus—News

[Open-Source]
Play Audio

MDT-ASR-E067-A6 Korean Conversational Speech Corpus — Climate and Environment

Play Audio

MDT-ASR-E059 Turkish In-Vehicle Scripted Speech Corpus—Smart Mobility

Play Audio

MDT-ASR-E008 Bahasa Indonesia Scripted Speech Corpus—Daily Use Sentence

Play Audio

Why MD Datasets

Full Compliance

ISO/IEC 27001 & ISO/IEC 27701:2019 compliant

Multiple Dimension

Audio, text, image, and video multi-modal data

Extensive Scope

Conversational, scripted, and spontaneous data covering extensive domains

High Accuracy

Expertise secured quality result

TOP
Talk to Magic Data