MDT-NLP-F017 Guangzhou Cantonese Text Corpus

Language: yue-Guangdong
Number of Utterances: 866,204
Data Content: Daily-use sentences
Feature: Parallel corpus
File Format: TXT
Field of Application: NLP
Data Sensitive Items: None
Copyright Owner: Magic Data

Sample

你漫画看多了吧
你漫画睇多咗啊

没问道怎么说
冇问到哦点讲啊

你能容忍我的一切吗
你能够容忍我嘅一切咩
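
The sample above can be read as sentence pairs. The following is a minimal sketch, not the vendor's loader: it assumes the TXT files follow the same layout as the sample (a Mandarin line, then its Cantonese counterpart, with a blank line between pairs). The function name `iter_pairs` is hypothetical, and the actual file layout should be checked against the delivered data.

```python
from typing import Iterator, List, Tuple


def iter_pairs(text: str) -> Iterator[Tuple[str, str]]:
    """Yield (mandarin, cantonese) pairs from blank-line-separated blocks.

    Assumption: each block holds exactly two non-empty lines, Mandarin first.
    Blocks with any other number of lines are skipped rather than guessed at.
    """
    block: List[str] = []
    # The trailing "" forces the final block to be flushed.
    for raw in text.splitlines() + [""]:
        line = raw.strip()
        if line:
            block.append(line)
            continue
        if len(block) == 2:
            yield block[0], block[1]
        block = []


# Two pairs copied from the sample on this page.
sample = (
    "你漫画看多了吧\n"
    "你漫画睇多咗啊\n"
    "\n"
    "你能容忍我的一切吗\n"
    "你能够容忍我嘅一切咩\n"
)
pairs = list(iter_pairs(sample))
```

Skipping malformed blocks keeps misaligned lines from silently shifting every later pair, which matters for a parallel corpus of this size.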

Open Source Utterances: 100

Related Datasets

MDT-NLP-G001 Chinese POI Text Corpus

MDT-NLP-F028 Chinese-English Winter Sports News Parallel Text Corpus

[Open-Source]

MDT-NLP-F018 Shanghai Text Corpus

MDT-NLP-F012 Chinese Smart Home C&C Text Corpus

MDT-NLP-F019 Minnan Text Corpus

MDT-NLP-B004 In-Vehicle Command and Query Corpus

[Open-Source]

Why MD Datasets

Full Compliance

ISO/IEC 27001 & ISO/IEC 27701:2019 compliant

Multiple Dimensions

Audio, text, image, and video multi-modal data

Extensive Scope

Conversational, scripted, and spontaneous data covering extensive domains

High Accuracy

Quality results secured by expertise
