
Wu Dao 2.0 in 2024: China's Improved Version of GPT-3

Cem Dilmegani
Updated on Jan 11
3 min read

In June 2021, the Beijing Academy of Artificial Intelligence (BAAI) launched Wu Dao 2.0, the successor to Wu Dao 1.0 and China's first super-scale intelligent model system. Wu Dao is a multimodal AI model that aims to surpass OpenAI's GPT-3 and Google's LaMDA in approaching human-level thinking. Trained on 4.9 terabytes of images and text, and surpassing state-of-the-art (SOTA) levels on 9 benchmarks, Wu Dao is presented by its creators as closer than any of its peers to reaching artificial general intelligence (AGI) and achieving human-level thinking.

Training

Wu Dao was trained on 4.9 terabytes of high-quality image and text data in both English and Chinese:

  • 1.2 TB of Chinese text data from the Wu Dao Corpora.
  • 2.5 TB of Chinese graphic data.
  • 1.2 TB of English text data from the Pile dataset.

It was trained with FastMoE, an open-source Mixture of Experts (MoE) system. MoE is a machine learning technique that works by:

  • dividing predictive modeling tasks into sub-tasks
  • training an expert (learner) model on each sub-task
  • developing a gating model that learns which expert to consult based on the input to be predicted, and combines the predictions.

FastMoE enables Wu Dao to train and consult different expert models in parallel and to route each input to the expert best suited to it. For example, if the input is English text, Wu Dao will route it to an expert model that can generate a response in English.
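To make the idea concrete, here is a minimal sketch of a mixture-of-experts layer with a simple top-1 gating network, written in PyTorch. It illustrates the general MoE mechanism described above, not Wu Dao's or FastMoE's actual implementation; the class name, layer sizes, and routing scheme are illustrative assumptions.

```python
# Minimal Mixture-of-Experts sketch with top-1 gating (illustrative only;
# not Wu Dao's or FastMoE's implementation).
import torch
import torch.nn as nn
import torch.nn.functional as F

class SimpleMoE(nn.Module):
    def __init__(self, d_model: int = 64, d_hidden: int = 128, n_experts: int = 4):
        super().__init__()
        # Each "expert" is a small feed-forward network specialized on a sub-task.
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.ReLU(), nn.Linear(d_hidden, d_model))
            for _ in range(n_experts)
        )
        # The gating network scores every expert for each input.
        self.gate = nn.Linear(d_model, n_experts)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, d_model). The gate picks the single best expert per input (top-1 routing).
        gate_scores = F.softmax(self.gate(x), dim=-1)   # (batch, n_experts)
        top_score, top_idx = gate_scores.max(dim=-1)    # best expert index per input
        out = torch.zeros_like(x)
        for i, expert in enumerate(self.experts):
            mask = top_idx == i                         # inputs routed to expert i
            if mask.any():
                out[mask] = top_score[mask].unsqueeze(-1) * expert(x[mask])
        return out

moe = SimpleMoE()
tokens = torch.randn(8, 64)   # a toy batch of 8 token embeddings
print(moe(tokens).shape)      # torch.Size([8, 64])
```

In a production MoE system such as FastMoE, the experts are distributed across devices so that routing and expert computation run in parallel; the sketch above only shows the gating-and-combining logic in its simplest form.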

Capabilities

Wu Dao 2.0 is a multimodal AI model: it is not only a language model that generates text and speech, but it can also generate images, and it has self-improving learning capabilities.

At the Beijing Zhiyuan Conference, where Wu Dao first debuted, its creators displayed Chinese poems and drawings generated by the model.

Figure: Poems generated by the Wu Dao model

Following that event, Zhibing Hua, a virtual student based on Wu Dao's AI model, announced on her Weibo account that she intends to start her education at the Department of Computer Science and Technology at Tsinghua University. Because the virtual student is powered by Wu Dao, she can use its knowledge base and learning capabilities to write poems, draw, and compose music. Her creators claim that Zhibing Hua will learn faster and more efficiently than a typical student; however, she will not experience emotions for the time being.

Intelligence benchmarks

As reported by BAAI, Wu Dao 2.0 surpassed state-of-the-art (SOTA) levels on 9 benchmark tasks:

  1. ImageNet zero-shot classification: achieves SOTA, surpassing OpenAI CLIP (illustrated in the sketch after this list)
  2. LAMA knowledge probing: surpasses AutoPrompt
  3. LAMBADA cloze task: surpasses Microsoft Turing NLG
  4. SuperGLUE few-shot (FewGLUE): surpasses GPT-3 and obtains the current best few-shot learning results
  5. UC Merced Land-Use zero-shot classification: achieves SOTA, surpassing OpenAI CLIP
  6. MS COCO text-to-image generation: surpasses OpenAI DALL·E
  7. MS COCO English image-text retrieval: surpasses OpenAI CLIP and Google ALIGN
  8. MS COCO multilingual image-text retrieval: surpasses UC2 and M3P, the current best multilingual multimodal pre-training models
  9. Multi30K multilingual image-text retrieval: surpasses UC2 and M3P, the current best multilingual multimodal pre-training models
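As a rough illustration of what the zero-shot classification benchmarks above measure, here is a short sketch of zero-shot image classification using the openly available OpenAI CLIP model through the Hugging Face transformers library. Wu Dao itself is not distributed this way, so CLIP stands in; the image path and candidate labels below are placeholders.

```python
# Zero-shot image classification sketch with CLIP (stand-in for Wu Dao's
# zero-shot capability; file name and labels are placeholders).
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("example.jpg")  # placeholder image path
labels = ["a photo of a dog", "a photo of a cat", "a photo of a car"]

# The model scores the image against each text label without any task-specific training.
inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)
outputs = model(**inputs)
probs = outputs.logits_per_image.softmax(dim=-1)  # image-text similarities as probabilities
print(dict(zip(labels, probs[0].tolist())))
```

"Zero-shot" here means the model classifies images against arbitrary label descriptions it was never explicitly trained to distinguish, which is the setting in which Wu Dao is reported to surpass CLIP on ImageNet and UC Merced Land-Use.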

Wu Dao vs. GPT-3

Here is a comparison of Wu Dao and GPT-3:

                      Wu Dao                                 GPT-3
Parameters            1.75 trillion                          175 billion
Training data size    4.9 TB                                 570 GB
Code source           Open-source system based on PyTorch    Licensed exclusively to Microsoft
Multimodality         Tasks involving text or images         Text-only tasks
Languages             English and Chinese                    English only

Comparison between the Wu Dao and GPT-3 models

What is the future of human-level thinking AI?

Artificial general intelligence (AGI), sometimes called the singularity, refers to AI's capability for human-level thinking. Roughly 90% of AI experts expect AI to reach singularity by 2075; however, some consider it out of reach because modeling the human brain may be impossible. Learn more about artificial general intelligence in our in-depth article, Will AI reach singularity by 2060? 995 experts' opinions on AGI.

