Company
Blog
blog
21
Apr
15
Browse: 207
Open-source Data Community MagicHub Officially Launched!

In 1969, Unix released source code on Unix Community, initiating the first “open-source act” in human history.

In 1991, the Linux kernel was released.

In 1998, Netscape Communications released the source code for its Communicator suite, which defined the word “open source.”

In 2005, a source-code management system called Git came out, which gave rise to the managed Git code warehouse.

/Shetuwang

Open source had deeply rooted in the internet ecosystem, changed the patterns of the internet industry.And the evolution of open source itself is a magnificent part of internet history.

Since the conception of Artificial intelligence (AI) was put forward in the Dartmouth Summer Research Project, it suffered countless ups and downs. Internet, big data, cloud computing, 5G, numerous new technologies came out and played increasingly important roles.

AI opened a new era, and open source comes up. Platforms for machine learning have been emerging. Developers, generation by generation, contribute their intelligence to the evolution of AI in the spirit of openness, freedom, and cooperation.

An increasing number of governments, NGOs, companies, academic institutions, and individuals release their image, textual, and audio data to the public and formed platforms like Kaggle, UCI, OpenML, ImageNet, OpenSLR. Data has yet to become the core driver for AI development.

Launch of MagicHub open-source community.

MagicHub, according, was launched on April 15. As the founder, Magic Data lays the leading position in the amount of conversational speech data and becomes the first company to release open-source datasets on an independent website, which might change the way users get data.

The Father of Speech Recognition Toolkit Kaldi, Daniel Povey, together with more than ten AI developers, cheered for the launch of the MagicHub Community.

Massive, diversiform datasets are released on MagicHub.io. The datasets are subdivided into multiple dimensions, offering AI engineers a more efficient way to find datasets for their various AI models, thereby reserves more energy on algorithm optimization.

Magic Data welcomes all data producers of discoverers to join and release datasets on MagicHub. We, together, could build a better ecology for open source. Please contact us if interested.

Home page of MagicHub.io

MagicHub has released more than 30 open-source datasets, including Mandarin Chinese, English, and Shanghai Dialect (Wu Chinese) conversational speech, NLP textual corpus, TTS corpus, and lexicons. All datasets are divided by languages, scenes, and industries as possible.

We will be releasing high-quality datasets and more content on MagicHub, and we always appreciate your comments, sharing, or any form of support. Let’s together make MagicHub a better place for inspiration and the spirit of sharing.

Click here to visit MagicHub.

Share
Previous
Page
Good News! Magic Data Tech Wins “Best Supplier of Alibaba Cloud 2021”
Next
Page
Magic Data Tech Won Intel AI 100 Acceleration Program: Supporting AI Industry from the Basics
Latest Blogs
Good News! Magic Data Tech Wins “Best Supplier of Alibaba Cloud 2021”

We are proud to announce Magic Data Tech has been named the “Best Supplier of Alibaba Cloud 2021”.

21
Apr
15
Magic Data Tech Won Intel AI 100 Acceleration Program: Supporting AI Industry from the Basics

On May 20, 2021, Intel published the 5th issue of its AI 100 Acceleration Program list at the 2021 Shenzhen (International) Artificial Intelligence Exhibition, and Magic Data Tech was selected for the program by relying on its strong innovation strength.

21
Apr
15
New Arrival in MagicHub! Get Hundreds of Hours of Datasets for Free!

Nowadays, a new round of technological revolution and industrial transformation is rising, which promote the rapid evolution of digital technology represented by artificial intelligence, and human beings are marching into an intelligent society. According to the White Paper on the Core Technology Industry of Artificial Intelligence, released by the China Academy of Information and Communications in April this year, AI has fully covered the basic elements of social operation and improved overall operating efficiency. In the future, AI will be as ubiquitous as water and electricity, disrupting and transforming every industry.

21
Apr
15
New Update on MagicHub—Get Free Datasets for In-Vehicle Scene!

Recently, nearly one thousand car companies attended the Shanghai Auto Show, where electrification and intelligentization were the standard equipment for many car companies. The intelligent functions of cars, such as intelligent cabins, self-driving and cloud service, depict the future of intelligent cars.

21
Apr
15
GO ALL IN ON DIGITAL TRANSFORMATION - SEE HOW DATA EMPOWERS BANKS IN INDUSTRY 4.0

Recently, the third CIO Summit of the Bank of China (BOC) was held in Shanghai in 2021, bringing together executives and CIOs from the financial, technological and Internet sectors to discuss and share with them on the theme of "Banking Era 4.0: Go all in on Digital Transformation", so as to share ideas on the way of the digital transformation of banking.

21
Apr
15
Sales Department
Please fill in this form to purchase datasets or quote for
data collection/ annotation services.
Name
*
Company Name
*
Email
*
Phone Number
*
Detail
Country
City
Submit
Resources Department
If you want to be our data collection and annotation team
member, please fill in this form.
DATA COLLECTION PROJECTS
Language*
Location*
DATA ANNOTATION PROJECTS
Language*
CONTACT INFORMATION
Name*
Company Name*
E-mail*
Phone Number*
Experience*
Address*
Submit
Marketing Department
If you want to forward our article or tell us marketing
events, please fill in this form.
Name
*
Company Name
*
Email
*
Phone Number
*
Detail
Submit
Human Resources Department
Please fill in this form to be a member of Magic Data Tech.
Name
*
Email
*
Phone Number
*
Job
*
Upload Resume
Submit
Sample Download
Name*
E-mail*
Phone Number*
Company Name*
Job
Department
Company Product
I am also interested in the following data:
Languages
Style
Scenario

We will contact you via telephone to confirm your information and provide the method to download.
Submit
Submission Successful!
We will contact you as soon as possible.
This page would be
closed in 3 seconds automatically.
>
TOP