Integrating ASR with Text Summarizer, Secure Your Leading Position in Web Conferencing Market with Magic Data Multi-Person Spontaneous Meetings Dataset

Date : 2023-01-05 View : 7716

Online meetings have become a frequently used tool for business and learning. How to meet the more diversifying online conferencing needs of users has brought great challenges to remote work applications, including captioning, real-time machine translation, smart meeting minutes and other artificial intelligence applications.

The core technology of smart web conferencing is ASR (Automatic Speech Recognition). For conference audio, the biggest challenge is how to continuously detect and recognize speaker switching in multi-person conference scenes. Machine learning training on big data is what will help make this technology a reality.

However, licensable multi-person meeting data in real scenarios is limited. For this, Magic Data specially releases a dataset of multi-person spontaneous meetings, including 400 meetings with a total of 200 hours. The topics of the meetings include company activities, employee benefits, employee training, product design, product optimization, marketing activities, business management, team management, and family life. etc., which are all conference content based on real business and life scenarios. No script is provided, the speakers express themselves spontaneously.

This data set is collected in conference rooms, subways, parks, airports, shopping malls, railway stations, airports and other scenes. More than 400 people participated in the collection. In addition to the built-in recording of mainstream conference software on the market, each speaker is also equipped with a separate recording device for the data collection, including headset, computers, recorder, mobile phones, tablets, etc. This can meet the research and development needs of speech recognition and speaker recognition under multi-channel and single-channel.

Data Specification

Transcription of the conference audio is attached in this dataset. More than that, for the research and development needs of smart meeting minutes, Magic Data has conducted natural language processing annotation on the transcription, extracting key conference information such as speakers, participants, topics, key points, conference conclusions, conference decisions, and to-do list. Consecutive speeches by the same person in the transcription are merged into one segment, which prevents the labeling result from being excessively split into multiple fragments. The tagged content in the transcription supports index, which can be traced back to the original text by character. An example of labeling is as follows:

[{"start": 3, "end": 19, "text": "We will hold a team building decision-making meeting online today", "labels": ["theme"]},{""start"": 20 , ""end"": 47, ""text"": ""Then this meeting is chaired by Sister Li, and the participants include Feifei, Chen Chen, and Xiaonan"", ""labels"": ["" participant""]}]" [{"start": 116, "end": 133, "text": "Then Chen Chen and Feifei will talk about their respective plans", "labels": ["speaker"]}]"

Magic Data hopes that this Multi-Person Spontaneous Meetings Dataset can help the web conference R&D team quickly realize product iterations, build leading smart conference system, and win more market opportunities.

Contact business@magicdatatech.com to learn more details and obtain a total smart web conferencing data solution.

Latest Press

Qingqing ZHANG: Conversation Data Promotes AIGC—Training Data of Large-Scale Models

"Training data is technology " .

That’s what OpenAI co-founder Ilya Sutskever said when taking interview with The Verge. ChatGPT amaze the world since its release. The stunning performance of GPT-4 makes us believe we have enter a new era in AI.

What makes large model so omniscient? In our opinion, the reason may lie in the data...

This article is a collection of Dr. Qingqing Zhang’s thoughts on data, large models and generative AI.

Open Dataset | Automobile Cabin Voice Interaction Data Solution

In recent years, with the development of artificial intelligence, chip technology, and new innovations in the automotive industry have been driven by the increase in smart car popularity. A smart car consists of three parts: The Internet of Vehicles, the smart cockpit, and the autonomous driving. The smart cockpit is equipped with intelligent and networked in-vehicle software, which can intelligently interact with people, roads, and vehicles. It is an important link and key node for the evolution of the human-vehicle relationship from a tool to a partner.

The Future of Virtual Companionship

Nowadays, more and more young people are buying chat services on e-commerce platforms to accompany them virtually and confiding in “chat buddy” to communicate and express their feelings. Prices for various degrees of companionship range from tens of yuan to the customized "virtual lover" for thousands of yuan. In recent years, virtual companionship services have become a fashionable self-healing way for young people to seek spiritual comfort and express their voices on the Internet. There are many stores on Taobao that provide this service, such as "gentle and cute little sweetheart", "overbearing dictatorial president fan", as long as you pay, you can find your favorite "buddy".

Will Humans Be Replaced by AI?

AI-generated art has experienced rapid growth in both popularity and accessibility over the past few months. With engines like DALL-E, Midjourney, and Stable Diffusion spurring an influx of AI-generated artwork on online platforms.

Visual Perception - The Eyes of Self-Driving Cars

Autonomous cars, or self-driving cars, has gradually entered the public eye from the original black technology. According to the degree of intelligence, autonomous driving is divided into 5 levels from L1 to L5: L1 refers to assisted driving, L2 refers to partial autonomous driving, L3 refers to conditional autonomous driving, L4 refers to highly autonomous driving, and L5 refers to fully autonomous driving- a true driverless vehicle.

News

Integrating ASR with Text Summarizer, Secure Your Leading Position in Web Conferencing Market with Magic Data Multi-Person Spontaneous Meetings Dataset

Get Started?