Social Networks

Optimize AI models with Magic Data's total AI data solution.

Magic Data's AI data solution supports social network scenarios such as sentiment analysis, recommendation systems, and virtual hosts, making the platform more intelligent.

Scenarios

Video

Subtitle Generator / Review & Comment Analysis / Content Classification

Livestreaming

Voice Changer / Real-time Captioning / Review & Comment Analysis

Virtual Host

Anchor / E-commerce Livestreaming / Digital Singer

Challenges

Imprecise speech recognition of specialized words and phrases and of accented speech
Large volume and great variety of comments and reviews
Stiff, unnatural responses from virtual humans
Wide range of video categories and cross-domain topics

Annotator® AI-Assisted Annotation Platform

Audio Annotation
  • Live Show - Content annotation (ASR)
  • Virtual Host - Voice interaction annotation (ASR)
  • Virtual Host - Text segmentation, part-of-speech, and phoneme annotation (TTS)
  • Video - Video segmentation, content classification (Video)

Text Annotation
  • Live Show - Sentiment analysis of reviews (NLP)
  • Virtual Host - Sentiment analysis of reviews (NLP)

Image Annotation
  • Video - Video segmentation, content classification (Video)

MD Dataset Portfolio

Speech Recognition
Text-to-Speech
Natural Language Understanding
OCR

Contact us for data collection and annotation services

Related Datasets

MDT-AF066 Filipino/Tagalog Scripted Speech Corpus

MDT-NB005 Chinese German Parallel Corpus

MDT-AF008 Japanese Conversational Speech Corpus

Multi-Emotional Natural Speech Dataset

Magic Data has newly introduced the "Multi-Emotional Natural Speech Dataset", a collection of datasets designed to make speech technology more expressive and natural, so that intelligent devices can convey a wide range of emotions. The dataset is intended to improve the expressiveness and emotional authenticity of large speech models.

MDT-LE002 Italian Lexicon

[Open-Source]

MDT-NF026 Mandarin Chinese Prosody Text Corpus

Contact us for best practices

Get started today
