Founder Mode Episode 27 - From AI Prototype to Production with Ankur Goyal


Founder Mode Episode 27 - From AI Prototype to Production with Ankur Goyal

Welcome back to Founder Mode!

In this episode, we talked to Ankur Goyal, the founder of BrainTrust. He’s built AI systems across multiple generations—from structured data and search to modern AI agents.

We all love a slick AI demo. But many teams find that turning that demo into a real product is much harder than it seems.

Ankur showed us how to solve that. He talked about the missing systems, such as evals, observability, and feedback loops. These systems help AI function in the real world.

“The real trick is building something that matters—something the business actually needs. That’s how you stay in the 5% who succeed.”
— Kevin Henrikson

Why Most AI Projects Fail

An MIT report recently found that 95% of enterprise AI projects return zero ROI. That means most teams are building AI—but not getting results.

Ankur explained why: Teams don't have feedback loops. They ship an AI model once but don’t keep testing or improving it. Or they tune for one user and break it for everyone else.

That’s where evals (short for evaluations) come in. They help you track quality, compare models, and make sure your product stays good as it grows.

5 Key Takeaways

1. Evals Should Start as Soon as You Ship

Early AI projects often break in surprising ways. Evals help you catch regressions before users do.

2. Observability = Quality, Not Just Uptime

In traditional apps, observability is about keeping the site live. In AI apps, it’s about keeping the results useful and relevant.

3. Connect Feedback to Testing in One Click

Top teams make it easy to turn a user complaint into an eval. That helps the team learn and fix things fast.

4. Rethink Model Selection Monthly

Old infrastructure changed slowly. AI moves fast. The best teams re-test models every 1–2 months to stay ahead.

5. Models Can Now Improve Each Other

New models like Claude 3 and GPT-5 can review and improve the output of other models. This changes how we run evals and build agents.

Final Thoughts

Evals aren’t extra—they’re essential. They help you scale AI without breaking trust. As Ankur put it, evals should be a time-saver, not just a scoreboard.

Shipping AI that works at scale isn’t about just adding more tools. It’s about building the right feedback loops, staying close to real users, and checking your models regularly.

If you’re building with LLMs, think like a systems engineer. Logs, feedback, and evals are your new stack.

🎧 Listen to Episode 27 here:

show
From AI Prototype to Product...
Oct 2 · Founder Mode
30:29
Spotify Logo
 
video preview

This podcast builds on the Founder Mode newsletter.

Let’s build.

-kevin

2810 N Church St #87205, Wilmington, DE 19802
Unsubscribe · Preferences

Founder Mode

Founder Mode is a weekly newsletter for builders—whether it’s startups, systems, or personal growth. It’s about finding your flow, balancing health, wealth, and productivity, and tackling challenges with focus and curiosity. Each week, you’ll gain actionable insights and fresh perspectives to help you think like a founder and build what matters most.

Read more from Founder Mode
Learn how to turn research and deep tech into scalable products. Simplify complexity, reduce friction, focus your niche, and design for real-world use.

The Hidden Lessons from Building a Neurotech Startup Welcome back to Founder Mode! I recently talked about the gap between great science and great business. It made me think. It’s tempting to think that advanced research or deep tech will lead to business success. I've found that innovation only scales when it's easy to use, affordable, and relatable. It was clear that neurotechnology companies are turning hardware once seen as “impossible” into products that people can buy and use. What I...

Private equity meets AI: how Jason Friedrichs uses technology to drive smarter growth and real value creation in 2025 and beyond.

Private Equity + AI: Turning Capital into Value with Jason Friedrichs Welcome back to Founder Mode! In this episode, we chatted with Jason Friedrichs. We explored how private equity connects with AI and why this is so important right now. Jason has worked in consulting, operations, and investing. Today, he helps companies grow smarter with tech, not just faster with capital. We talked about how private equity is changing in a post-ZIRP world. We also discussed what AI means for value creation...

Learn how to master go-to-market in 2025—target better buyers, simplify outbound, sell credibility, and focus on big wins that scale.

How to Build a Smarter GTM Strategy for 2025 Welcome back to Founder Mode! This week's podcast is already our most popular ever. More than 10K views on YouTube. Give it a listen, and I would love your feedback. Should we do more like this? No Guests!Go-to-market strategies are shifting fast. What worked in 2020 doesn’t work today, and by 2025, the playbook will look even more different. The noise is louder, the inboxes are tighter, and attention is harder to win. I've been rethinking our GTM...