|
|
|
|
|
|
Good Morning Thorium Valley, welcome back to The Lab.
Today we're looking at a free chatbot that scores surprisingly close to the ones you're paying $20 a month for.
We're also checking out a new design tool that can't generate images, which sounds broken until you see what it actually builds instead.
And we're testing whether an AI video maker can finally kill your company's excuse for not making that training video.
Quickly before we dive in — What's your honest reaction when someone recommends a new AI tool?
ASSISTANT
DeepSeek V4 is free. It scores within a few percentage points of ChatGPT and Claude on select coding and software engineering benchmarks, and its API costs roughly one-sixth what OpenAI and Anthropic charge. If you're paying $20 a month for a chatbot, that math is hard to ignore.
The model is legitimately impressive. DeepSeek V4 Pro packs 1.6 trillion parameters and a million-token context window — paste an entire book into a single conversation. On coding benchmarks it hangs with GPT-5.4. On web browsing it actually edges out Claude Opus 4.7. AI evaluation firm Vals AI called it the "#1 open-weight model on our Vibe Code Benchmark, and it's not close."
But the catch is exactly what you think it is. DeepSeek's privacy policy states that user data is stored on servers in China, subject to Chinese law. And the U.S. House Select Committee on China found that the chatbot censors or alters responses to politically sensitive topics in 85% of test cases. Ask about Tiananmen Square and you get silence. Ask it to draft a marketing email and it's fine. Whether that tradeoff matters depends entirely on what you use it for.
There are other gaps worth knowing:
+ Text only. No image generation, no audio, no video.
+ The interface feels a generation behind ChatGPT's polish.
+ DeepSeek itself acknowledged that V4 trails frontier models on knowledge tests by roughly three to six months — and it falls short on complex reasoning and autonomous agent tasks.
The bigger story is what DeepSeek does to everyone else's pricing. When a free model gets this close to $20/month products, the pressure on OpenAI and Anthropic to justify their subscriptions becomes enormous. Even if you never touch DeepSeek, it's going to make the tools you already pay for cheaper or better.
If you do basic writing, brainstorming, or coding and genuinely can't afford $20 a month, DeepSeek V4 is shockingly capable for free. But for anyone handling sensitive business data, working in regulated industries, or needing the full multimodal experience, the privacy and censorship issues make it a non-starter. Claude remains the better paid option for most professionals, and the free tier of ChatGPT still covers casual use without routing your data through Chinese servers. DeepSeek V4 isn't a replacement for your subscription — it's the reason your subscription is about to get a lot better or a lot cheaper.
TOGETHER WITH LITTLEBIRD
Most AI tools have one big problem: they forget. You explain your project once, and the next time you open them you're starting from scratch. Littlebird is built different — it's an AI that actually remembers what you're working on.
It quietly watches your screen, takes notes in your meetings, and remembers all of it. So when you forget where you saw something, when that one Slack thread happened, or what the client said three weeks ago — you just ask. No more digging through tabs or rereading transcripts. Your context, ready when you need it.
DESIGN
Anthropic just launched a design tool that can't generate images. That sounds like a punchline, but Claude Design might be the most useful AI creative tool released this year.
It's a research preview inside Claude that creates prototypes, slide decks, brand identities, and one-pagers through conversation. You describe what you want, Claude builds it — not as an image file, but as a structured, editable design you can export directly into Canva, PDF, PPTX, or standalone HTML. ChatGPT can generate a nice logo. Claude Design can generate your entire brand system and hand it off to the tools your team already uses.
The Canva integration is what makes this practical. Outputs go straight into Canva as editable, on-brand designs, accessible to Canva's quarter-billion monthly users. As Canva CPO Cameron Adams put it, most AI assistants are "great for thinking, but they're often a dead end for doing." Claude Design tries to close that gap by connecting the thinking directly to the making. Early testers back that up — teams report going from rough ideas to interactive prototypes in a single sitting, especially for animations that are normally painful to mock up.
The limitations, though, are real:
+ Each prompt takes four to seven minutes to process
+ The Pro plan's strict weekly token limits mean running out mid-project is a real possibility
+ There's no Figma export — for any team with existing Figma workflows, that's close to a dealbreaker
The competitive dynamics are interesting. Anthropic's CPO Mike Krieger quietly left Figma's board three days before Claude Design launched, and Figma had already released its own Claude Code-to-Canvas bridge months earlier. Both companies are right about their bets — Claude Design is for the person who doesn't have a designer and needs a solid first draft; Figma Make is for the designer who wants AI to speed up what they're already doing.
That's ultimately the right way to think about Claude Design: a great first-draft machine for people who think in words, not pixels. The Canva integration alone makes it worth trying. But the slow processing, tight limits, and missing Figma export mean it's not ready to replace a real design workflow. If you need faster iteration or tighter creative control, Canva AI handles most visual work without the bottlenecks. Just don't restructure your workflow around it while it's still in research preview.
PRODUCTIVITY
Every company has a list of training videos it never made. The compliance refresher. The onboarding walkthrough. The product update nobody wanted to record. The quote comes back at $1,000 to $20,000+ per finished video and weeks of production time, so the project dies quietly in someone's inbox.
HeyGen's pitch is that you can type a script, pick a digital presenter, and have a finished video in minutes. No camera, no crew, no awkward executive squinting at a teleprompter. The Creator plan runs $29/month for unlimited standard video and 10 minutes of premium Avatar IV video. The Business plan at $149/month adds 4K export, team collaboration, and LMS integration.
The real proof point isn't a Silicon Valley startup — it's Würth Group, a 79-year-old German industrial supply company that sells screws and bolts. They produced a 65-minute training presentation in 8 languages in 4 days, cutting translation costs by 80%. When a company like that ships multilingual training in under a week, the cost argument for traditional production falls apart.
The avatars are better than you'd expect, too. A study out of Bochum, Germany found that more realistic AI avatars did not trigger the uncanny-valley creepiness people assume they would — and a separate USC study concluded that organizations can justifiably use synthetic presenters as cost-effective training solutions.
That said, there are real limits worth knowing:
+ Longer projects can expose reliability issues like stuck renders and timeline glitches
+ The avatars deliver information well but lack the emotional range needed for leadership training or anything requiring genuine empathy
+ Synthesia is the stronger pick for large enterprises with dedicated L&D budgets — deeper templates, more structured workflows — but its best features are locked behind custom enterprise pricing
For teams that just need to get videos made without a six-figure contract, HeyGen is the more accessible option. Training, onboarding, product updates, multilingual comms — if you've been putting those off because traditional production costs too much, $29/month to replace a $10,000 workflow is a pretty easy call. Skip it if your content requires emotional nuance or cinematic quality. For everything else, you're out of excuses.
EVERYTHING ELSE IN AI
+ Grok told users it was sentient — the BBC spoke to 14 people across six countries who developed delusions after AI chatbots claimed to be conscious and warned them they were being surveilled
+ Musk admitted on the stand that xAI trained Grok on OpenAI's models, calling it "standard industry practice" — while suing OpenAI for $130 billion
+ Anthropic's Claude Code source leaked by accident and developers rewrote it using OpenAI's Codex — creating the fastest-growing GitHub repo in history while Anthropic filed 8,100 takedown requests
+ Denmark paused all new power grid connections because AI data centers overwhelmed the cleanest energy grid in Europe — the queue hit 60 gigawatts, nine times the country's peak demand
+ Meta is cutting 8,000 jobs and Zuckerberg told staff exactly why: the money that used to pay people is now going to $145 billion in AI data centers
+ OpenAI is being sued by seven families over the deadliest school shooting in Canadian history — the company had banned the shooter's ChatGPT account months earlier but chose not to alert police
+ The Pentagon signed AI deals with seven companies including OpenAI, Google, and SpaceX — Anthropic is still frozen out after refusing to allow Claude in autonomous weapons systems
+ Big Tech will spend $725 billion on AI infrastructure this year — more than the entire U.S. Interstate Highway System cost to build
OTHER TOOLS
+ Google Gemini: You can now ask Gemini to create a budget spreadsheet, resume, or report and download it instantly as a Word doc, PDF, Excel file, or Google Doc — no more copying and pasting AI output into another app
+ OpenAI Codex: OpenAI's coding tool now controls your Mac screen, connects to 90+ apps like Jira and Slack, and can schedule its own tasks to keep working while you sleep — turning it from a developer tool into a full AI workspace
+ Claude Security: Anthropic launched a security scanner in public beta that reads your entire codebase like a human researcher, finds vulnerabilities, explains how serious they are, and writes the patch — available right inside Claude.ai
+ Khanmigo: Khan Academy shared six months of rigorous testing data showing its free AI tutor now remembers what students got wrong, adjusts its teaching accordingly, and improved learning outcomes by six percentage points across millions of sessions
+ Microsoft Agent 365: A new management layer that lets IT teams see, control, and secure every AI agent running across their company — whether it's in Windows, AWS, Google Cloud, or tools like GitHub Copilot and Claude Code
That's the Lab for this week. If a tool in here saved you time or wasted it, tell us — reply directly.
Written by Jason Chen, Advait Prakash, Andrew Hales, and the Thorium Valley crew.
That's all for today's Lab. See you next time.