Language-locale
en-*(HK/CN/ID/MY/PH/SG/TH/TR)
Style
Conversational & Scripted
File Format
PCM
Sampling Rate
48kHz
Bit Rate
16 bits
Channel
1
Number of Speaker
17,392
Duration
10,758 hours
File Number
5,979,570
As a new trend of AIGC, zero-shot speech synthesis has wide-ranging applications, including voice assistants, audiobooks, video game character voices, creating podcast, and real-time voice changer, among others. Magic Data has proactively designed and developed the "High-Quality Dataset for Voice Replication," which boasts a 48kHz high sampling rate, tens of thousands of contributors with diverse content, making it an excellent resource for achieving zero-shot speech synthesis. In particular, the diverse content of this data can be beneficial for the model's coarticulation learning.
ISO/IEC 27001 & ISO/IEC 27701:2019 compliant
Audio, text, image, and video multi-modal data
Conversational, scripted, and spontaneous data covering extensive domains
Expertise secured quality result