🎙️ Episode 29406:57 • June 3, 2026
Microsoft MAI Models: In-House AI in Copilot (2026)
Listen to this episode
AI-generated discussion by Alex and Jamie
About this episode
Join hosts Alex and Jamie on this episode of Nerd Level Tech AI Cast as they dive into Microsoft's groundbreaking in-house AI models, the MAI family, and their implications for developers. Discover the exciting features of MAI-Code-1-Flash and MAI-Thinking-1, from coding to reasoning capabilities, and learn how these innovations are set to revolutionize tools like GitHub Copilot. Tune in for a fun, informative discussion that unpacks the latest in AI technology!
Transcript
[Alex]: Hey everyone, welcome back to Nerd Level Tech AI Cast—the show where we break down the latest in AI, coding, and, occasionally, why my smart fridge still refuses to buy oat milk. I’m Alex. [Jamie]: And I’m Jamie! And Alex, I still think your fridge is just trying to save you from a life of bland lattes. [PAUSE] Today, we’re getting nerdy with Microsoft’s brand-new MAI models—especially the ones powering GitHub Copilot and some wild new reasoning tricks. Seven models, zero distillation, and apparently a whole lot of “self-sufficiency.” What does that even mean? We’re about to find out. [Alex]: That’s right. If you’ve seen the headlines from Build 2026 and thought, “Wait, didn’t Microsoft already have Copilot and, you know, OpenAI?”—this episode is for you. Let’s dive in. [Jamie]: Okay, Alex, set the scene for us. Microsoft just dropped seven in-house AI models—what’s the big deal here? [Alex]: Big deal is right. For the first time, Microsoft built a whole family of AI models, called MAI, completely in-house. No borrowing, no “student-teacher” distillation from OpenAI or anyone else. They’re flexing those AI muscles and saying, “Hey, we can do this ourselves!” The two stars of the show? MAI-Code-1-Flash, their new coding model, and MAI-Thinking-1, a reasoning powerhouse. [Jamie]: Seven models—so it’s not just coding bots running wild in VS Code? [Alex]: Nope! The family includes models for coding, reasoning, image generation, transcription, and voice. There’s even a “Flash” variant for some of them. But most developers will first meet MAI-Code-1-Flash—now rolling out in GitHub Copilot for VS Code users. And then there’s MAI-Thinking-1, which is in private preview on Microsoft Foundry. [Jamie]: Flash... like, it’s fast, or are we talking superhero capes? [Alex]: [Chuckles] No capes—just speed. MAI-Code-1-Flash is lightweight and designed to be quick and efficient, a bit like Claude Haiku but—according to Microsoft—cheaper and smarter. It’s got 5 billion parameters, which is enough to impress your dev friends but not so big that your laptop starts making airplane noises. [Jamie]: All right, but does it actually code better? Or is it just another shiny model in the picker? [Alex]: Good question. Microsoft says MAI-Code-1-Flash outperforms Claude Haiku 4.5 in all four of their key coding benchmarks: SWE-Bench Pro, SWE-Bench Verified, Multilingual, and Terminal Bench 2. Especially on SWE-Bench Pro, it’s got a 16-point lead. [PAUSE] And, get this—it uses “adaptive solution length control.” So if you ask it to write fizzbuzz, it doesn’t over-explain; if you ask it to build a web server from scratch, it spends more “reasoning budget” on the hard stuff. [Jamie]: So, basically, it doesn’t write you a novel when you want a haiku, and vice versa. I wish my emails worked like that. [Alex]: Same! And for the security folks, Microsoft built a 186-question adversarial benchmark to make sure it’s reasoning, not just memorizing Stack Overflow. [Jamie]: Oof, 186 trick questions? That’s more than I got on my driving test. How did it do? [Alex]: It scored about 85.8 adjusted accuracy, but Microsoft admits there are still some categories—like “Einstellung traps”—where it’s not perfect. So, room to grow. But for devs, it’s a big leap in efficiency and reliability. [Jamie]: All right, what about MAI-Thinking-1? The name alone sounds like it should be solving world peace. [Alex]: If only! MAI-Thinking-1 is their flagship reasoning model. Think of it as the brains of the operation—built from scratch, no borrowed DNA from other labs. It’s a sparse Mixture-of-Experts model, which is a fancy way of saying it’s really good at picking the right “mini-brain” for each task. It runs 35 billion active parameters out of a trillion total, so it punches above its weight. [Jamie]: So how does it compare to the likes of Claude Opus or GPT? [Alex]: On Microsoft’s own benchmarks—so, grain of salt—it goes toe-to-toe with Claude Opus 4.6 on SWE-Bench Pro, and in blind human evaluations it was preferred over Claude Sonnet 4.6. Plus, it’s got a 256,000-token context window—enough for a 600-page document in one go. Try pasting your entire high school diary in there, Jamie. [Jamie]: [Laughs] Nobody needs that much context, trust me. But that’s genuinely impressive. And all of this runs on Microsoft’s own hardware too? [Alex]: Yes! Here’s where it gets sci-fi. The models were co-designed with Microsoft’s Maia 200 inference chips—the ones built on TSMC’s 3nm process. This vertical integration means Microsoft controls everything: the model, the data pipeline, and the hardware. It’s like building your own gaming PC from scratch, but for AI. [Jamie]: So, in short, Microsoft isn’t just a customer anymore—they’re baking their own AI cake and eating it too. [Alex]: Exactly! And the frosting is what they call “Frontier Tuning”—meaning businesses and big customers can customize these models on their own workflow data. Microsoft claims, for example, a custom Excel-tuned MAI matches GPT 5.4 but at a tenth of the cost. And the Mayo Clinic is even building a clinical model with them. [PAUSE] But, as always, these are Microsoft’s numbers. I’m waiting for the independent cook-off before betting my lunch money. [Jamie]: So MAI models aren’t about replacing OpenAI, but giving Microsoft more options—and a bit of leverage? [Alex]: Right. Microsoft’s official line is “optionality, not divorce.” On Foundry—their model platform—you can still use OpenAI, Anthropic, Meta, Mistral, and the rest. But now Microsoft isn’t limited by what OpenAI delivers—they can ship their own stuff, on their own terms. [Jamie]: All right, so for devs listening at home: If you open up VS Code today, you might already be using MAI-Code-1-Flash in Copilot. And if you’re in enterprise land, MAI-Thinking-1 is waiting for you on Foundry—if you can get an invite. [Alex]: That’s the gist. And if you’re wondering, “Was MAI trained on OpenAI data?”—nope. Microsoft says it’s all clean, licensed data, with no AI-generated content in the pre-training mix. [Jamie]: Okay, last question, Alex—how do I get my fridge to stop judging my oat milk choices? [Alex]: Sorry, Jamie, that’s going to require MAI-Overlord-1.0. Maybe next year. [Jamie]: [Laughs] All right, that’s all for this episode of Nerd Level Tech AI Cast. [Alex]: Thanks for listening, folks! If you enjoyed the show, leave us a review and share us with your favorite AI-powered appliance. [Jamie]: See you next time—same nerd time, same nerd channel. [OUTRO MUSIC FADES IN]