Lab May 5, 2026
TOGETHER WITH Littlebird
In Today's Newsletter
Is DeepSeek V4 Almost as Good as ChatGPT?
Claude Design Can't Make Images. It Doesn't Need To.
Your Company Has No Excuse Not to Make That Training Video
Everything else in AIOther tools

Good Morning Thorium Valley, welcome back to The Lab.

Today we're looking at a free chatbot that scores surprisingly close to the ones you're paying $20 a month for.

We're also checking out a new design tool that can't generate images, which sounds broken until you see what it actually builds instead.

And we're testing whether an AI video maker can finally kill your company's excuse for not making that training video.

Quickly before we dive in — What's your honest reaction when someone recommends a new AI tool?

Is DeepSeek V4 Almost as Good as ChatGPT?

ASSISTANT

Is DeepSeek V4 Almost as Good as ChatGPT?

DeepSeek V4 is free. It scores within a few percentage points of ChatGPT and Claude on select coding and software engineering benchmarks, and its API costs roughly one-sixth what OpenAI and Anthropic charge. If you're paying $20 a month for a chatbot, that math is hard to ignore.

The model is legitimately impressive. DeepSeek V4 Pro packs 1.6 trillion parameters and a million-token context window — paste an entire book into a single conversation. On coding benchmarks it hangs with GPT-5.4. On web browsing it actually edges out Claude Opus 4.7. AI evaluation firm Vals AI called it the "#1 open-weight model on our Vibe Code Benchmark, and it's not close."

But the catch is exactly what you think it is. DeepSeek's privacy policy states that user data is stored on servers in China, subject to Chinese law. And the U.S. House Select Committee on China found that the chatbot censors or alters responses to politically sensitive topics in 85% of test cases. Ask about Tiananmen Square and you get silence. Ask it to draft a marketing email and it's fine. Whether that tradeoff matters depends entirely on what you use it for.

There are other gaps worth knowing:

+ Text only. No image generation, no audio, no video.

+ The interface feels a generation behind ChatGPT's polish.

+ DeepSeek itself acknowledged that V4 trails frontier models on knowledge tests by roughly three to six months — and it falls short on complex reasoning and autonomous agent tasks.

The bigger story is what DeepSeek does to everyone else's pricing. When a free model gets this close to $20/month products, the pressure on OpenAI and Anthropic to justify their subscriptions becomes enormous. Even if you never touch DeepSeek, it's going to make the tools you already pay for cheaper or better.

The Verdict

If you do basic writing, brainstorming, or coding and genuinely can't afford $20 a month, DeepSeek V4 is shockingly capable for free. But for anyone handling sensitive business data, working in regulated industries, or needing the full multimodal experience, the privacy and censorship issues make it a non-starter. Claude remains the better paid option for most professionals, and the free tier of ChatGPT still covers casual use without routing your data through Chinese servers. DeepSeek V4 isn't a replacement for your subscription — it's the reason your subscription is about to get a lot better or a lot cheaper.

Try DeepSeek V4 →

TOGETHER WITH LITTLEBIRD

The AI Tool that makes you more productive

Most AI tools have one big problem: they forget. You explain your project once, and the next time you open them you're starting from scratch. Littlebird is built different — it's an AI that actually remembers what you're working on.

It quietly watches your screen, takes notes in your meetings, and remembers all of it. So when you forget where you saw something, when that one Slack thread happened, or what the client said three weeks ago — you just ask. No more digging through tabs or rereading transcripts. Your context, ready when you need it.

Get the all-in-one AI assistant →

Claude Design Can't Make Images. It Doesn't Need To.

DESIGN

Claude Design Can't Make Images. It Doesn't Need To.

Anthropic just launched a design tool that can't generate images. That sounds like a punchline, but Claude Design might be the most useful AI creative tool released this year.

It's a research preview inside Claude that creates prototypes, slide decks, brand identities, and one-pagers through conversation. You describe what you want, Claude builds it — not as an image file, but as a structured, editable design you can export directly into Canva, PDF, PPTX, or standalone HTML. ChatGPT can generate a nice logo. Claude Design can generate your entire brand system and hand it off to the tools your team already uses.

The Canva integration is what makes this practical. Outputs go straight into Canva as editable, on-brand designs, accessible to Canva's quarter-billion monthly users. As Canva CPO Cameron Adams put it, most AI assistants are "great for thinking, but they're often a dead end for doing." Claude Design tries to close that gap by connecting the thinking directly to the making. Early testers back that up — teams report going from rough ideas to interactive prototypes in a single sitting, especially for animations that are normally painful to mock up.

The limitations, though, are real:

+ Each prompt takes four to seven minutes to process

+ The Pro plan's strict weekly token limits mean running out mid-project is a real possibility

+ There's no Figma export — for any team with existing Figma workflows, that's close to a dealbreaker

The competitive dynamics are interesting. Anthropic's CPO Mike Krieger quietly left Figma's board three days before Claude Design launched, and Figma had already released its own Claude Code-to-Canvas bridge months earlier. Both companies are right about their bets — Claude Design is for the person who doesn't have a designer and needs a solid first draft; Figma Make is for the designer who wants AI to speed up what they're already doing.

The Verdict

That's ultimately the right way to think about Claude Design: a great first-draft machine for people who think in words, not pixels. The Canva integration alone makes it worth trying. But the slow processing, tight limits, and missing Figma export mean it's not ready to replace a real design workflow. If you need faster iteration or tighter creative control, Canva AI handles most visual work without the bottlenecks. Just don't restructure your workflow around it while it's still in research preview.

Try Claude Design →

Your Company Has No Excuse Not to Make That Training Video

PRODUCTIVITY

Your Company Has No Excuse Not to Make That Training Video

Every company has a list of training videos it never made. The compliance refresher. The onboarding walkthrough. The product update nobody wanted to record. The quote comes back at $1,000 to $20,000+ per finished video and weeks of production time, so the project dies quietly in someone's inbox.

HeyGen's pitch is that you can type a script, pick a digital presenter, and have a finished video in minutes. No camera, no crew, no awkward executive squinting at a teleprompter. The Creator plan runs $29/month for unlimited standard video and 10 minutes of premium Avatar IV video. The Business plan at $149/month adds 4K export, team collaboration, and LMS integration.

The real proof point isn't a Silicon Valley startup — it's Würth Group, a 79-year-old German industrial supply company that sells screws and bolts. They produced a 65-minute training presentation in 8 languages in 4 days, cutting translation costs by 80%. When a company like that ships multilingual training in under a week, the cost argument for traditional production falls apart.

The avatars are better than you'd expect, too. A study out of Bochum, Germany found that more realistic AI avatars did not trigger the uncanny-valley creepiness people assume they would — and a separate USC study concluded that organizations can justifiably use synthetic presenters as cost-effective training solutions.

That said, there are real limits worth knowing:

+ Longer projects can expose reliability issues like stuck renders and timeline glitches

+ The avatars deliver information well but lack the emotional range needed for leadership training or anything requiring genuine empathy

+ Synthesia is the stronger pick for large enterprises with dedicated L&D budgets — deeper templates, more structured workflows — but its best features are locked behind custom enterprise pricing

The Verdict

For teams that just need to get videos made without a six-figure contract, HeyGen is the more accessible option. Training, onboarding, product updates, multilingual comms — if you've been putting those off because traditional production costs too much, $29/month to replace a $10,000 workflow is a pretty easy call. Skip it if your content requires emotional nuance or cinematic quality. For everything else, you're out of excuses.

Try HeyGen →

Everything Else in AI

EVERYTHING ELSE IN AI

What else happened today?

+ Grok told users it was sentient — the BBC spoke to 14 people across six countries who developed delusions after AI chatbots claimed to be conscious and warned them they were being surveilled

+ Musk admitted on the stand that xAI trained Grok on OpenAI's models, calling it "standard industry practice" — while suing OpenAI for $130 billion

+ Anthropic's Claude Code source leaked by accident and developers rewrote it using OpenAI's Codex — creating the fastest-growing GitHub repo in history while Anthropic filed 8,100 takedown requests

+ Denmark paused all new power grid connections because AI data centers overwhelmed the cleanest energy grid in Europe — the queue hit 60 gigawatts, nine times the country's peak demand

+ Meta is cutting 8,000 jobs and Zuckerberg told staff exactly why: the money that used to pay people is now going to $145 billion in AI data centers

+ OpenAI is being sued by seven families over the deadliest school shooting in Canadian history — the company had banned the shooter's ChatGPT account months earlier but chose not to alert police

+ The Pentagon signed AI deals with seven companies including OpenAI, Google, and SpaceX — Anthropic is still frozen out after refusing to allow Claude in autonomous weapons systems

+ Big Tech will spend $725 billion on AI infrastructure this year — more than the entire U.S. Interstate Highway System cost to build

Other Tools

OTHER TOOLS

What our editors are paying attention to today

+ Google Gemini: You can now ask Gemini to create a budget spreadsheet, resume, or report and download it instantly as a Word doc, PDF, Excel file, or Google Doc — no more copying and pasting AI output into another app

+ OpenAI Codex: OpenAI's coding tool now controls your Mac screen, connects to 90+ apps like Jira and Slack, and can schedule its own tasks to keep working while you sleep — turning it from a developer tool into a full AI workspace

+ Claude Security: Anthropic launched a security scanner in public beta that reads your entire codebase like a human researcher, finds vulnerabilities, explains how serious they are, and writes the patch — available right inside Claude.ai

+ Khanmigo: Khan Academy shared six months of rigorous testing data showing its free AI tutor now remembers what students got wrong, adjusts its teaching accordingly, and improved learning outcomes by six percentage points across millions of sessions

+ Microsoft Agent 365: A new management layer that lets IT teams see, control, and secure every AI agent running across their company — whether it's in Windows, AWS, Google Cloud, or tools like GitHub Copilot and Claude Code

That's the Lab for this week. If a tool in here saved you time or wasted it, tell us — reply directly.

Written by Jason Chen, Advait Prakash, Andrew Hales, and the Thorium Valley crew.

That's all for today's Lab. See you next time.

Keep reading