There’s tech that talks, and then there’s tech that actually listens. Sesame just blurred that line with style. The San Francisco-based startup, led by CEO Brendan Iribe and CTO Ankit Kumar, locked in a massive $250M Series B led by Sequoia Capital & Spark Capital, with Andreessen Horowitz (a16z), Matrix Partners, and a few quiet giants in the mix. Total raise: $307.6M. Valuation? Not public, but insiders peg it north of $1B. That’s heavyweight status for a company that’s barely been out of stealth since February ’25.
Sesame isn’t chasing AR gimmicks or chasing the next headset fad. They’re engineering something deeply human, a voice-first AI that doesn’t just hear commands, it feels the conversation. Iribe’s track record speaks volumes: co-founder & former CEO of Oculus VR, sold to Meta for $2B. Kumar co-founded AR startup Ubiquity6, led Discord’s Clyde AI team post-acquisition. Founding engineer Ryan Brown spent years running Reality Labs Research Engineering at Meta. Together, they’re building tech that sounds less robotic and more like the person you wish your phone assistant could be.
Their secret weapon is the Conversational Speech Model (CSM), a proprietary transformer that processes text & audio tokens at once, with lightning-fast 200–300 ms response time. That’s why Sesame’s AI voices, Maya & Miles, can laugh mid-sentence, interrupt naturally, or catch emotional cues in real-time. The Verge called it “genuinely fun” and “unlike anything we’d used before.” Within weeks of launch, over 1M users logged 5M+ minutes of dialogue. That’s not engagement, it’s validation.
And they’re not hoarding the sauce. In March ’25, Sesame open-sourced its CSM-1B model (1B parameters, Apache 2.0 license) for free commercial use. It’s already racked up 13K+ GitHub stars & 95K+ downloads on Hugging Face. They’re building not just a product, but a developer ecosystem, fueling innovation the same way open-source jumpstarted the AI boom. Sequoia even said Sesame’s speech felt “different,” capturing the rhythm & emotion of real dialogue. That’s not PR fluff, it’s investor conviction.
Now comes the next chapter: bringing it off-screen. The $250M infusion will accelerate hardware R&D for their sleek, audio-first smart glasses, lightweight, fashion-forward, built for all-day wear. Sesame’s betting that the next major computing interface isn’t in your hand, it’s on your face. With offices in SF, Bellevue & NYC, they’re scaling fast, hiring across AI, hardware, design, & product.
When your tech starts talking back with empathy, it’s not just progress, it’s presence. The future’s looking you right in the eyes.

