MDT-TTS-E020 Mandarin Chinese Text Normalization Text Corpus for TTS

Language

Mandarin Chinese (zh)

Number of Utterances

100,736

Data Content

Text Normalization

File Format

TXT

Field of Application

TTS

Sensitive Data Items

None

Copyright Owner

Magic Data

Sample

另一队中国组合由邵奕俊担任舵手,最终排名第十四,落后冠军组合1.63秒。
另一队中国组合由邵奕俊担任舵手,最终排名第十四,落后冠军组合一点六三秒。

第二局比赛中国队攻势不减,侯宇阳在23分33秒时将比分改写为3:0。
第二局比赛中国队攻势不减,侯宇阳在二十三分三十三秒时将比分改写为三比零。

我们将在接下去的24-48个小时内知道会发生什么。
我们将在接下去的二十四到四十八个小时内知道会发生什么。
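Each sample above pairs a raw sentence with its spoken-form reading. The following is a minimal rule-based sketch, in Python, of the kind of rewriting those three pairs illustrate: decimals read digit by digit after 点, scores read with 比, ranges read with 到, and remaining integers read as cardinals. It is not the method used to build the corpus; the function names (`int_to_chinese`, `decimal_to_chinese`, `normalize`) are hypothetical, the cardinal reader is deliberately limited to 0-99, and it assumes every `a:b` is a score and every `a-b` is a range, which real text often violates (clock times, dates, phone numbers).

```python
import re

DIGITS = "零一二三四五六七八九"


def int_to_chinese(n: int) -> str:
    """Read an integer in 0-99 as a Mandarin cardinal (sketch only)."""
    if not 0 <= n < 100:
        raise ValueError("this sketch only handles 0-99")
    if n < 10:
        return DIGITS[n]
    tens, ones = divmod(n, 10)
    head = "十" if tens == 1 else DIGITS[tens] + "十"
    return head + (DIGITS[ones] if ones else "")


def decimal_to_chinese(text: str) -> str:
    """Read 'whole.frac' as cardinal + 点 + digit-by-digit fraction."""
    whole, frac = text.split(".")
    return int_to_chinese(int(whole)) + "点" + "".join(DIGITS[int(d)] for d in frac)


def normalize(sentence: str) -> str:
    """Apply the rules in a fixed order: decimals, scores, ranges, integers."""
    # Decimals must go first so "1.63" is not split into two integers.
    sentence = re.sub(r"\d+\.\d+", lambda m: decimal_to_chinese(m.group()), sentence)
    # Assumption: "a:b" (ASCII or full-width colon) is always a score.
    sentence = re.sub(
        r"(\d+)[::](\d+)",
        lambda m: int_to_chinese(int(m.group(1))) + "比" + int_to_chinese(int(m.group(2))),
        sentence,
    )
    # Assumption: "a-b" is always a numeric range.
    sentence = re.sub(
        r"(\d+)-(\d+)",
        lambda m: int_to_chinese(int(m.group(1))) + "到" + int_to_chinese(int(m.group(2))),
        sentence,
    )
    # Any integers left over are read as plain cardinals.
    return re.sub(r"\d+", lambda m: int_to_chinese(int(m.group())), sentence)


if __name__ == "__main__":
    for raw in ("落后冠军组合1.63秒。", "在23分33秒时将比分改写为3:0。", "接下去的24-48个小时内"):
        print(raw, "->", normalize(raw))
```

A corpus of annotated pairs like this one is useful precisely because such hand-written rules break down on ambiguous cases, where the correct reading depends on context.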

Related Datasets

MDT-ASR-E067-A11 Korean Conversational Speech Corpus — Sport

MDT-ASR-E067-A5 Korean Conversational Speech Corpus — Science and Technology

MDT-ASR-E065 Guangzhou Cantonese Conversational Speech Corpus

MDT-ASR-AA12 Mandarin Chinese Conversational Speech Corpus — Education and Healthcare

MDT-ASR-B007-A8 Residential Noise Dataset — from Kitchen Ventilator

MDT-ASR-E043-A11 Mandarin Chinese Conversational Speech Corpus — Climate and Environment

Why MD Datasets

Full Compliance

ISO/IEC 27001 & ISO/IEC 27701:2019 compliant

Multiple Dimensions

Multimodal data: audio, text, image, and video

Extensive Scope

Conversational, scripted, and spontaneous data covering extensive domains

High Accuracy

Quality results backed by expertise
