See What's NEW

Industry Solutions

Financial Services

financial-img

Financial Services

Streamling your business process with enterprise-level security guaranteed.

Our top priority lays on compliance and security.

Revitalizing financial industry by providing one-stop AI data solutions for customer service, virtual counter, virtual assistant, targeting marketing, and other AI applications.

Contact Sales

Scenarios

0-img

Customer Service

Dial In/Out, Collect Accounts Receivable

1-img

Smart Meeting

Realtime Captioning,Translation,Meeting Minutes Generation

2-img

Automated Invoice Processing

Identity Certification, Warranty OCR, Medical Record OCR

3-img

Virtual Human

Smart Shopping Guidance, Marketing

Challenge

Imprecise voice recognition of customer service scenario
Unable to correctly understand commands and queries
Impersonal and unnatural communication
Different format between invoices, warranties, and medical records

Annotator® AI-Assisted Annotation Platform

Audio Annotation Text Annotation Image Annotation
  • Customer Service - Customer service annotation
  • Virtual Human - Command and query annotation
  • Smart Meeting - Meeting scenarios voice annotation
  • Virtual Human - Rhythm, Text segmentation, part-of-speech, and phoneme annotation
annotator-img
  • Customer Service - User queries relevance annotation
  • Virtual Human - User interaction content annotation
annotator-img
  • Automated Invoice Processing - Invoice OCR annotation
annotator-img

MD Dataset Portfolio

Speech Recognition
Text-to-Speech
Natural Language Understanding
OCR

Contact us for data collection and annotation service

annotator-serve-img

Related Datasets

MDT-AI101 Spanish Duplex Conversation Training Dataset

Preserving features such as tonal jumps, spontaneous interruptions, and collaborative speech in fast-paced native conversations, this dataset uses independent channel recording for precise voice separation. Combined with multi-speaker labeling and scenario classification, it provides a solid training foundation for AI models to manage diverse speech rates and regional linguistic variations in Spanish.

MDT-AG024 Peninsular Arabic Scripted Speech Corpus

Play Audio

MDT-LE003 Filipino/Tagalog Lexicon

MDT-LD005 Bahasa Indonesia Lexicon

[Open-Source]

MDT-AI090 Spontaneous Speech Datasets of Expressive Paralinguistics

This dataset is characterized by a high sampling rate of 48 kHz, recordings collected in controlled quiet environments, and contributions from a demographically diverse pool of speakers varying in region, age, and gender to ensure speech diversity. The dataset encompasses a wide array of topics across 20 domains, including daily life, leisure and entertainment, education and training, and healthcare.
Play Audio

MDT-NF002 Chinese Bahasa Indonesia Parallel Corpus

Contact us for the best practices

Get started today

TOP
Talk to Magic Data