See What's NEW


press images

ISCSLP2022 Conversational Short-phrase Speaker Diarization Challenge Registration Open

Date : 2022-07-06     View : 409

On July 4, 2022, ISCSLP 2022 Conversational Short-phrase Speaker Diarization Challenge (CSSD) which is jointly sponsored by the Institute of Acoustics CAS, Northwestern Polytechnical University, Singapore A*STAR Institute of Information and Communication, Shanghai Jiaotong University and Magic Data (Beijing Aishu Smart Technology Co., Ltd.), is officially opened for registration. Groups and individuals from academia and industry are welcome to register for the competition.

CSSD Challenge Registration

Challenge Background

Dialogue scenarios are one of the most essential and challenging scenarios for speech processing technology. In daily conversations, people casually respond to each other and continue the conversation with coherent questions and comments rather than bluntly answering each other's questions. Accurately detecting the speech activity of each person in a conversation is critical for many downstream tasks such as natural language processing and machine translation. The evaluation metric for speaker classification systems, the classification error rate (DER), has long been used as a standard evaluation metric for speaker classification. However, it fails to pay enough attention to short dialogue phrases. These short dialogue phrases are short but play an essential role at the semantic level. The speech community also lacks evaluation metrics to effectively assess the accuracy of short speech classification in conversations.

To solve this problem, we open-sourced the MagicData-RAMC Chinese conversational speech dataset, which contains 180 hours of manually annotated conversational speech data.  For the CSSD evaluation, we also prepare 20 hours of dialogue data for testing purpose, and manually annotate the speaker's timestamps. For the CSSD challenge, we also design a new accuracy evaluation metric to calculate the accuracy of sentence-level speaker diarization. By advancing research on segmentation and clustering techniques for dialogue data, we aim to further promote reproducible research in this field.

Challenge Committee and Support Team

Questions related to the challenge could email or with the subject of the email titled "Question about the Conversational Short-phrase Speaker Diarization Challenge".


Scoring Method

Participants submit inference results, and competition committee will calculate the score. The file format and evaluation metric will be announced in the open stage of the competition.

Prize Settings

Three sets of competitors will be awarded first prize, second prize, and third prize. The winners will have the opportunity to participate in ISCSLP 2022 for presentation.


Registration website:

Number of participants: Less than 5 participants per team (including 5 people)

More details:

All challengers are welcome to sign up for the competition!

Get Started?

Contact Us

Talk to Magic Data