Social Networks

Optimize AI models with Magic Data's end-to-end AI data solution.

Magic Data's AI data solution supports social network scenarios such as sentiment analysis, recommendation systems, and virtual hosts, making platforms more intelligent.

Contact Sales

Scenarios

Video

Subtitle Generator / Review & Comment Analysis / Content Classification

Livestreaming

Voice Changer / Real-Time Captioning / Review & Comment Analysis

Virtual Host

Anchor / E-commerce Livestreaming / Digital Singer

Challenges

Imprecise recognition of specialized words and phrases and of accented speech
Large volume and wide variety of comments and reviews
Stiff, unnatural responses from virtual humans
Many categories and cross-domain topics in video content

Annotator® AI-Assisted Annotation Platform

Audio Annotation
  • Live Show - Content annotation (ASR)
  • Virtual Host - Voice interaction annotation (ASR)
  • Virtual Host - Text segmentation, part-of-speech, and phoneme annotation (TTS)
  • Video - Video segmentation, content classification (Video)

Text Annotation
  • Live Show - Sentiment analysis of reviews (NLP)
  • Virtual Host - Sentiment analysis of reviews (NLP)

Image Annotation
  • Video - Video segmentation, content classification (Video)

MD Dataset Portfolio

Speech Recognition
Text-to-Speech
Natural Language Understanding
OCR

Contact us for data collection and annotation services

Related Datasets

MDT-AG037 Swedish Spontaneous Speech Corpus

MDT-AF013 Singaporean English Scripted Speech Corpus

MDT-NF006 Chinese Filipino Parallel Corpus

MDT-AF070 Jinan Dialect Conversational Speech Corpus

MDT-AI101 Spanish Duplex Conversation Training Dataset

Preserving features such as tonal jumps, spontaneous interruptions, and collaborative speech in fast-paced native conversations, this dataset uses independent channel recording for precise voice separation. Combined with multi-speaker labeling and scenario classification, it provides a solid training foundation for AI models to manage diverse speech rates and regional linguistic variations in Spanish.

MDT-NF003 Chinese Thai Parallel Corpus

Contact us for best practices

Get started today
