Ai2 says its new AI model beats one of DeepSeek’s best – TechCrunch

Latest

Amazon

Apps

Biotech & Health

Climate

Cloud Computing

Commerce

Crypto

Enterprise

EVs

Fintech

Fundraising

Gadgets

Gaming

Google

Government & Policy

Hardware

Instagram

Layoffs

Media & Entertainment

Meta

Microsoft

Privacy

Robotics

Security

Social

Space

Startups

TikTok

Transportation

Venture

Events

Startup Battlefield

StrictlyVC

Newsletters

Podcasts

Videos

Partner Content

TechCrunch Brand Studio

Crunchboard

Contact Us
Move over, DeepSeek. There’s a new AI champion in town — and they’re American. On Thursday, Ai2, a nonprofit AI research institute based in Seattle, released a model that it claims outperforms DeepSeek V3, one of Chinese AI company DeepSeek’s leading systems.Ai2’s model, called Tulu 3 405B, also beats OpenAI’s GPT-4o on certain AI benchmarks, according to Ai2’s internal testing. Moreover, unlike GPT-4o (and even DeepSeek V3), Tulu 3 405B is open source, which means all of the components necessary to replicate it from scratch are freely available and permissively licensed.A spokesperson for Ai2 told TechCrunch that the lab believes Tulu 3 405B “underscores the U.S.’ potential to lead the global development of best-in-class generative AI models.”“This milestone is a key moment for the future of open AI, reinforcing the U.S.’ position as a leader in competitive, open source models,” the spokesperson said. “With this launch, Ai2 is introducing a powerful, U.S.-developed alternative to DeepSeek’s models — marking a pivotal moment not just in AI development, but in showcasing that the U.S. can lead with competitive, open source AI independent of the tech giants.”Tulu 3 405B is a rather large model. Containing 405 billion parameters, it required 256 GPUs running in parallel to train, according to Ai2. Parameters roughly correspond to a model’s problem-solving skills, and models with more parameters generally perform better than those with fewer parameters.According to Ai2, one of the keys to attaining competitive performance with Tulu 3 405B was a technique called reinforcement learning with verifiable rewards. Reinforcement learning with verifiable rewards, or RLVR, trains models on tasks with “verifiable” outcomes, like math problem solving and following instructions.Ai2 claims that on the benchmark PopQA, a set of 14,000 specialized knowledge questions sourced from Wikipedia, Tulu 3 405B beat not only DeepSeek V3 and GPT-4o, but also Meta’s Llama 3.1 405B model. Tulu 3 405B also had the highest performance of any model in its class on GSM8K, a test containing grade school-level math word problems.Tulu 3 405B is available to test via Ai2’s chatbot web app, and the code to train the model is on GitHub and the AI dev platform Hugging Face. Get it while it’s hot — and before the next benchmark-beating flagship AI model comes along.TechCrunch has an AI-focused newsletter! Sign up here to get it in your inbox every Wednesday.Topics
Senior Reporter, Enterprise
Apple CEO says DeepSeek shows ‘innovation that drives efficiency’
Google quietly announces its next flagship AI model
Elon Musk reveals Elon Musk was wrong about Full Self-Driving
Google issues ‘voluntary exit’ program for Android, Chrome, and Pixel employees
Mark Zuckerberg teases a 2025 return to ‘OG Facebook’
Ai2 says its new AI model beats one of DeepSeek’s best
Report: Majority of US teens have lost trust in Big Tech
Subscribe for the industry’s biggest tech newsEvery weekday and Sunday, you can get the best of TechCrunch’s coverage.TechCrunch’s AI experts cover the latest news in the fast-moving field.Every Monday, gets you up to speed on the latest advances in aerospace.Startups are the core of TechCrunch, so get our best coverage delivered weekly.By submitting your email, you agree to our Terms and Privacy Notice.© 2024 Yahoo.

Source: https://techcrunch.com/2025/01/30/ai2-says-its-new-ai-model-beats-one-of-deepseeks-best/

Ai2 says its new AI model beats one of DeepSeek’s best – TechCrunch

More Stories

Gears of War Remastered Trilogy Reportedly Coming to PS5 – ComicBook.com

GeForce RTX 40 series owners are getting a feature previously exclusive to RTX 50 cards – XDA Developers

Scientists make game-changing breakthrough on quest for next-gen battery: ‘Magic will happen when costs come down’ – The Cool Down

Leave a Reply Cancel reply

Gears of War Remastered Trilogy Reportedly Coming to PS5 – ComicBook.com

GeForce RTX 40 series owners are getting a feature previously exclusive to RTX 50 cards – XDA Developers

Scientists make game-changing breakthrough on quest for next-gen battery: ‘Magic will happen when costs come down’ – The Cool Down

DISCUSSION: Should Ferland Mendy Continue To Start Big Games? – Managing Madrid

More Stories

Gears of War Remastered Trilogy Reportedly Coming to PS5 – ComicBook.com

GeForce RTX 40 series owners are getting a feature previously exclusive to RTX 50 cards – XDA Developers

Scientists make game-changing breakthrough on quest for next-gen battery: ‘Magic will happen when costs come down’ – The Cool Down

Leave a Reply Cancel reply

You may have missed

Gears of War Remastered Trilogy Reportedly Coming to PS5 – ComicBook.com

GeForce RTX 40 series owners are getting a feature previously exclusive to RTX 50 cards – XDA Developers

Scientists make game-changing breakthrough on quest for next-gen battery: ‘Magic will happen when costs come down’ – The Cool Down

DISCUSSION: Should Ferland Mendy Continue To Start Big Games? – Managing Madrid