At this year's WWDC22, Apple's Worldwide Developers Conference, Apple not only announced its core processor and other technologies but also demonstrated a newly upgraded version of CarPlay, its in-car connectivity system. With this upgrade, CarPlay is no longer a simple projection of the iOS interface; it can now fill the car's entire screen. HE Xiaopeng, chairman and CEO of XPeng, commented on the social media platform Weibo that CarPlay is a very good solution for the current generation of cars, while next-generation smart car solutions will require more comprehensive full-stack in-house development and ecosystem building.
With the development of the Internet and globalization, everyday speech is often mixed with words from other languages, for example: "我的IPAD不能下载APP了，可以陪我去APPLE store修理一下吗？" (My IPAD can't download APPs anymore; can you come with me to the APPLE store to get it repaired?), "明天就是deadline了，我的paper还没有ready。" (Tomorrow is the deadline, and my paper is not ready.), "老板的schedule需要调整，麻烦你check一下你的email。" (The boss's schedule needs to be adjusted; please check your email.) ...
Whether it is personalized recommendation of short videos, optimal route planning for takeaway delivery, or face recognition at payment, AI technology driven by algorithms is now widely applied across the consumer Internet industry.
In recent years, with the development of artificial intelligence technology, the performance of speech recognition applications has improved significantly. Many companies claim that the accuracy of their speech recognition technology exceeds 98%. Has speech recognition surpassed the human ear? There is more to discuss before we draw a final conclusion.
Since its launch on July 4, 2022, the ISCSLP 2022 Conversational Short-phrase Speaker Diarization Challenge has received more than 40 registrations. On July 24, the committee released the baseline and training datasets to all participants.
According to a recent report in The Washington Post, Blake Lemoine, a software engineer at Google, said that Google's artificial intelligence chatbot LaMDA (Language Model for Dialogue Applications) already has "consciousness" and even a "soul". One of the signs that distinguishes humans from other species is that people believe they are conscious, and consciousness is a choice humans make for the world. If AI really has consciousness, then humans could one day be taken over by AI robots.
Voice assistants play an increasingly important role in our lives. However, a voice assistant that is smart indoors may become "dumb" when it is used outdoors. Just like us humans, a voice assistant can panic when it arrives in a new environment. People tend to be more nervous and cautious when communicating in a non-native environment. If humans are uncomfortable with things they are not familiar with, artificial intelligence algorithms are even more so. A voice assistant needs to adapt to different domains. This is the task of transfer learning, an important but difficult problem in speech recognition, speech synthesis, speaker recognition and other speech fields.
On July 4, 2022, the ISCSLP 2022 Conversational Short-phrase Speaker Diarization Challenge (CSSD), jointly sponsored by the Institute of Acoustics CAS, Northwestern Polytechnical University, Singapore A*STAR Institute of Information and Communication, Shanghai Jiao Tong University and Magic Data (Beijing Aishu Smart Technology Co., Ltd.), officially opened for registration. Groups and individuals from academia and industry are welcome to register for the competition.
Magic Data's paper "Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational (RAMC) Speech Dataset" has been accepted by INTERSPEECH 2022, the world's largest and most comprehensive conference on the science and technology of spoken language processing. Themed "Human and Humanizing Speech Technology", INTERSPEECH 2022 will take place from September 18 to 22, both virtually and in Incheon, Korea.