MDT-NF018 Shanghai Text Corpus

Language: wuu-Shanghai
Number of Utterances: 745,550
Data Content: Daily-Use Sentence
Feature: Parallel corpus
File Format: TXT
Field of Application: NLP
Data Sensitive Items: nil
Copyright Owner: Magic Data

Sample

路上小心开车今天谢谢你啦
路高头小心开车今朝谢谢侬啦

你家怎么了请了几天假
侬屋里哪能了请了几天假

装糊涂你还在看周星驰
装糊涂侬还辣看周星驰
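
The sample above appears to pair each Mandarin sentence with its Shanghainese counterpart on the next line, with a blank line between pairs. A minimal sketch of reading that layout follows, assuming the delivered TXT files use the same arrangement as this sample (the page does not confirm it); the function name `parse_parallel_pairs` and the `SAMPLE` constant are hypothetical, introduced only for illustration.

```python
from typing import List, Tuple


def parse_parallel_pairs(text: str) -> List[Tuple[str, str]]:
    """Split text into (mandarin, shanghainese) pairs.

    Assumption: each pair is two consecutive non-empty lines and
    pairs are separated by one or more blank lines, as in the sample.
    """
    pairs: List[Tuple[str, str]] = []
    block: List[str] = []
    # The trailing empty string flushes the final block.
    for raw in text.splitlines() + [""]:
        line = raw.strip()
        if line:
            block.append(line)
            continue
        if len(block) == 2:
            pairs.append((block[0], block[1]))
        elif block:
            # Any other block size does not match the assumed layout.
            raise ValueError(f"expected 2 lines per pair, got {len(block)}")
        block = []
    return pairs


# The three pairs shown in the sample, copied verbatim.
SAMPLE = """路上小心开车今天谢谢你啦
路高头小心开车今朝谢谢侬啦

你家怎么了请了几天假
侬屋里哪能了请了几天假

装糊涂你还在看周星驰
装糊涂侬还辣看周星驰
"""

pairs = parse_parallel_pairs(SAMPLE)
for mandarin, shanghainese in pairs:
    print(mandarin, "->", shanghainese)
```

Raising on a malformed block, rather than skipping it silently, makes it easier to notice if the real files use a different layout (for example, tab-separated pairs on a single line).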

Related Datasets

MDT-NF013 Chinese Onboard Navigation Text Corpus

[Open-Source]

MDT-NF027 Chinese Financial Customer Service Text Corpus

MDT-NF019 Minnan Text Corpus

MDT-NB007 English In-Vehicle Command and Query Text Corpus

[Open-Source]

MDT-NF024 Mandarin Chinese Text Normalization Text Corpus

MDT-NB003 Chinese English Parallel Corpus

Why MD Datasets

Full Compliance

ISO/IEC 27001 & ISO/IEC 27701:2019 compliant

Multiple Dimensions

Audio, text, image, and video multi-modal data

Extensive Scope

Conversational, scripted, and spontaneous data covering extensive domains

High Accuracy

Quality results backed by expertise
