DeepSeek V4: Open-Weight Frontier at 1/7 the Cost

About this episode

Alex and Jamie unpack DeepSeek V4: Open-Weight Frontier at 1/7… — what shipped, why it matters, and how engineers can put it to work today. New episodes weekly.

Transcript

[Alex]: Welcome to another episode of "Nerd Level Tech AI Cast," where we dive deep into the heart of what's buzzing in the tech world. I'm Alex, here to unpack the techy stuff.

[Jamie]: And I’m Jamie, here to keep Alex from getting too lost in the tech sauce and ask the questions you're all thinking. Today, we're talking about something pretty exciting—the new DeepSeek V4. It's making waves for being a heavyweight in AI at a featherweight cost, right Alex?

[Alex]: Absolutely, Jamie. DeepSeek V4 is not just a new model; it's a game-changer in terms of efficiency and accessibility. They've launched two variants under this model: the V4-Pro and the V4-Flash. The Pro version is a beast with 1.6 trillion parameters, and even the Flash, the lighter version, packs a solid 284 billion.

[Jamie]: Trillion and billion parameters? That sounds...expensive. [CHUCKLES] How are they managing the costs?

[Alex]: Good question! That’s the headline here. They’ve implemented what they call a Hybrid Attention mechanism. It’s a fancy way of saying they’ve gotten really smart about which data the AI pays attention to at any given time, massively cutting down on necessary computing power.

[Jamie]: So it’s like having a super-focused brain that doesn’t get distracted by every little thing? I could use one of those during our team meetings! [LAUGHS]

[Alex]: [LAUGHS] Exactly, Jamie. This focus not only makes it cheaper to run but also incredibly efficient. They've managed to bring down the cost to about one-seventh of its closest competitor, Claude Opus 4.7, when we talk about output tokens.

[Jamie]: Wow, that’s like buying a coffee for the price of a sip! Now, they’re also scoring pretty well on benchmarks, right?

[Alex]: Right on the money. On the SWE-bench Verified, V4-Pro scores 80.6, which is pretty close to Opus 4.6 and not too far behind the latest GPT models. And it’s leading on the LiveCodeBench with a score of 93.5!

[Jamie]: So it’s cheaper and pretty darn smart. But Alex, what’s this I hear about a partnership with Huawei?

[Alex]: Ah, the plot thickens! Yes, Huawei has announced full support for V4 on their Ascend AI processors. This is big because it suggests you don't need the priciest, latest hardware to run top-tier AI models efficiently. Although, it’s worth noting that the model was likely still trained on NVIDIA hardware.

[Jamie]: So it’s like saying, "Hey, you can totally run this Ferrari engine on regular unleaded fuel!" Not that you’d want to... [PAUSE] Speaking of running things, isn’t there a promo price happening?

[Alex]: Sharp memory, Jamie! Until the end of May, there’s a hefty discount on API usage—nearly 75% off. After that, prices will normalize, so developers and companies using V4-Pro should plan their budgets accordingly.

[Jamie]: Budget planning, not the most exciting part of tech, but necessary. [CHUCKLES] Any final thoughts, Alex, before we wrap up today’s tech feast?

[Alex]: Just that it’s an exciting time in AI. Models like DeepSeek V4 are pushing boundaries not just in capability but in making high-level AI more accessible and affordable. It's a leap towards more innovation and maybe even some new startups getting a chance to play in the big leagues.

[Jamie]: Love that—democratizing AI, one parameter at a time. Thanks, Alex, for breaking all that down, and thank you all for tuning in.

[Alex]: Don't forget to subscribe for more deep dives and tech tidbits. Until next time, keep your tech levels nerdy and your questions ready! [OUTRO MUSIC PLAYS] [END OF SCRIPT]

Listen to this episode

About this episode

Transcript

Stay on the Nerd Track