See What's NEW
MDT-NLP-F025 Mandarin Chinese Polyphone Text Corpus

MDT-NLP-F025 Mandarin Chinese Polyphone Text Corpus

Language

zh-CN

Number of Utterances

244,630

Data Content

Polyphony

Feature

Polyphonic character

File Format

TXT

Field of Application

NLP

Data Sensitive Items

nil

Copyright Owner

Magic Data

Sample

带钩子的这几个都是虚胖吧,连带爪(zhua3)子的铁男都不是真肉。
我看有找到四只二牛的但是我只找到两只全脸出镜的二牛狗爪(zhua3)子?
我晚点再盘一下牌吧,应该底牌的狼爪(zhao3)是miku的。

No related data? Contact us

Contact Us

Related Datasets

MDT-NLP-F022 English Medical Customer Service Text Corpus

MDT-NLP-A010 Turkish Chatting Corpus

MDT-NLP-A005 Filipino/Tagalog Chatting Corpus

MDT-NLP-F001 Chinese Hindi Parallel Corpus

MDT-NLP-F017 Guangzhou Cantonese Text Corpus

[Open-Source]

MDT-NLP-F006 Chinese Filipino Parallel Corpus

Why MD Datasets

Full Compliance

ISO/IEC 27001 & ISO/IEC 27701:2019 compliant

Multiple Dimension

Audio, text, image, and video multi-modal data

Extensive Scope

Conversational, scripted, and spontaneous data covering extensive domains

High Accuracy

Expertise secured quality result

TOP
Talk to Magic Data