At the INTERSPEECH 2020 industrial forum, Magic Data, the platinum sponsor of the conference, will hold a live session at its virtual booth from 20:15 to 20:45 on Oct. 28th. The theme of the live broadcast is: “Data sets your model -- Which data strategy should be adopted to achieve better performance?”
Three speech processing experts will participate in the live discussion:
Daniel Povey, main developer and maintainer of Kaldi, internationally renowned professor in the field of speech recognition and AI, technical consultant of Magic Data, and chief speech scientist of Xiaomi Inc. He is currently working on the development of a new framework and tools for speech recognition.
Xuelu Zhang, linguistic/phonetic consultant and research & development manager at Magic Data, Ph.D. in Experimental Phonetics from the University of Strasbourg (France). Her research has focused on the recognition of prosodic components, automatic detection of specific linguistic phenomena, and TTS.
Gaofeng Cheng, researcher at the Institute of Acoustics of the Chinese Academy of Sciences. Most of his work is in the field of speech recognition technology and intelligent speech platforms, and his team often outperforms others in national speech technology evaluations. Some of his research and development results have been adopted by Kaldi and integrated into the framework.
During Magic Data's live session, the three speakers will hold an in-depth discussion on the theme "Data sets your model -- Which data strategy should be adopted to achieve better performance?" The discussion will cover, among others, the following topics:
Minimizing a model's dependency on data quantity, and the necessary attributes of datasets
Data augmentation and its challenges
Exploitation of raw data for training
Data selection for model pretraining and auto-adaptation
To allow open, real-time communication with Daniel Povey, an interactive session will be held at the end of the live broadcast. You are welcome to participate if you are interested in Daniel Povey's recent research, in Kaldi, in data selection and processing, or in other related topics.
Beijing time: Oct. 28th, 2020, 20:15 - 20:45
Virtual booth and Live session entrance:
About Magic Data
Beijing Magic Data Technology Company Limited (Magic Data) is an artificial intelligence data service provider. It owns data processing centers within China and a wholly owned subsidiary in Hong Kong.
Magic Data provides professional data services for enterprises and scientific research institutions engaged in research, development and application in AI fields such as Automatic Speech Recognition (ASR), Text-To-Speech generation (TTS), Computer Vision (CV) and Natural Language Processing (NLP). In the hope of marrying data and accurate AI with speech interfaces, inspiring enterprises' technological wisdom, and leading innovative ideas, Magic Data proposes a new brand slogan: Data set your mind. By providing new technology and data solutions, we are motivated to help AI learn to process and analyze the information it receives.
Magic Data has accumulated a wealth of experience in the data service field. Its products and services are forward-looking, of high quality, and diverse in scenario as well as in other dimensions. Currently, Magic Data has rich data processing experience, the capability to process data in 50+ languages, and dozens of leading IT and technology companies around the world as its partners.
Magic Data holds a reserve of 100,000+ hours of speech data, which is available for immediate use in various commercial applications. The datasets built by Magic Data represent various AI application scenarios, including smart home, intelligent customer service, intelligent assistant, smart car, smart education, machine translation, etc.
The MAGICDATA Kid Voice TTS Corpus in Mandarin Chinese was recorded by a four-year-old Chinese girl born in Beijing, China. We have now published 15 minutes of speech data from the corpus for non-commercial use. This is the first time this voice has been published!
We are honored to say that our Chinese Mandarin Conversational Speech corpus was selected for the LDC Catalog! The catalog No. is
At the end of 2019, we were honored to be featured as one of the APAC CIO Outlook Top 10 Big Data Solution Providers. Certification: APAC CIO Outlook TOP 10 BIG DATA SOLUTION PROVIDERS-2019
Intelligent speech-interaction customer service models, like other human-AI interaction models, must effectively recognize a customer's speech content, identify the customer's intentions, and then react properly in a given customer service scenario. To do so, they require rich conversational speech data and conversation content from that scenario for training.