We provide a wide range of training data, such as commands datasets and interaction datasets. These datasets were collected from real-world scenarios, covering different languages, dialects, genders, and ages. Specifically, we record domestic noise datasets as well, since it also impacts recognition. We ensure the consistency and stability through our strict criterion for speakers, recording environments, devices, annotation specification, acceptance inspection, etc.
We have delivered single language and multiple language mixed datasets to our customers. These customized datasets assist our customers’ to enhance their AI or machine learning models which help them operate smoothly even in adverse environments.
For intelligent speech-interaction customer service models, as for other similar human-AI interaction models, in order to recognize effectively a human customer’s speech content, to identify his intentions and then to react properly in a certain customer service scenario, models require rich conversational speech data and conversations’ content in this scenario for training.
At the end of the year 2019, we are honored to be featured as one of APAC CIO Outlook: Top 10 Big Data Solution Providers.Certification: APAC CIO Outlook TOP 10 BIG DATA SOLUTION PROVIDERS-2019
Daniel Povey, the founder of Kaldi, officially accepted the invitation of Magic Data Technology Co., Ltd. as the Principal Scientist Advisor (http://www.magicdatatech.com).