AI 行业热点

🎙️ 播客精选

How Anthropic Uses Claude Fable 5 With Mike Krieger

AI & I by Every · 2026-06-11

Speaker 1 | 00:04 - 00:07
Mike, welcome to the show. Great to be here, Dan. Good to see you.

Speaker 2 | 00:07 - 00:17
So for people who don’t know you, you’re the head of Anthropic Labs, and you’re the cofounder of Instagram. And today, I wanna talk to you about is Fable five. So Fable five is dropping tomorrow. We’re recording this the day before. It’s gonna come out after it drops.

Speaker 2 | 00:17 - 00:39
But what I really wanted to do is bring you on the show to tell me about what it’s like to use this model beyond the first day. I think when a model this powerful drops, it’s so useful…

🎧 收听完整节目


🐦 X/Twitter 热点

Swyx (@swyx)

  • wooh [8 ❤️ 1 🔄]

Josh Woodward (@joshwoodward)

  • Update: Everything is back up and running, sorry again! [429 ❤️ 27 🔄]
  • Heads up: Gemini is currently experiencing an outage. We’re on it and will get everything back up ASAP. Some of the fixes are in, the rest coming very soon. Stay tuned for updates, and thanks for bearing with us! [1553 ❤️ 137 🔄]

Boris Cherny (@bcherny)

  • Hello from Code with Claude Tokyo!! [1820 ❤️ 77 🔄]

Thibault Sottiaux (@thsottiaux)

  • Can confirm we saw a strong spike in growth of token consumption for Codex over last 48 hours. Unusual when we don’t launch something. [2460 ❤️ 105 🔄]
  • Simplify until there is nothing to simplify [580 ❤️ 12 🔄]
  • Welcome Clint and Michael! Incredibly excited to see what we do together to contribute to the cybersecurity field and accelerate defenders across the globe.

It’s time to build. [772 ❤️ 18 🔄]

Peter Yang (@petergyang)

  • Give yourself permission to build.

The traditional career ladder pushes everyone to become a leader, but I just want to be a builder.

As you climb the ladder at most companies, you’re expected to step away from building and fill your time with product reviews, cross-functional alignment, managing up, and performance calibrations.

I know a lot of builders who spent their best years climbing the wrong ladder.

The good news is that this is finally changing. Companies are rewarding builders and ICs more than ever, and even managers are increasingly expected to do IC work too.

But becoming a good builder takes reps, and it’s hard to put in those reps when you’re in back-to-back meetings all day.

So if you’re a builder at heart, embrace it. You don’t have to give up what you’re good at to be a “leader.”

📌 Watch now: [52 ❤️ 1 🔄]

  • This shit is actually working unbelievable [8 ❤️]
  • The more I use Codex the more ambitious my requests get. Or maybe this is not ambitious enough? [48 ❤️]

Nan Yu (@thenanyu)

  • gangprompting [10 ❤️]
  • There’s a thing they say about being a boat owner: the two best days of being a boat owner are the day you buy the boat and the day you sell the boat. [15 ❤️]

Madhu Guru (@realmadhuguru)

  • Right from the early days of Gemini, enterprises would get the quality/cost tradeoff wrong. They’d often start with the smallest, cheapest model.

The rule of thumb we gave customers:

Replacing a traditional ML model with an LLM - start small, cos you already know what good looks like.

Building something new - start with the most capable model. Think magically. Figure out what’s actually possible first.

Once they had a high-quality working application, we’d help them move to a smaller model while maintaining quality. [20 ❤️ 1 🔄]

Thariq (@trq212)

  • and the video for reference:

(I didnt get to use the updated designs in time) [92 ❤️ 1 🔄]

  • here’s the deck from this video if you want to go over it yourself:

lmk if you have any questions! [203 ❤️ 9 🔄]

  • Lots of people asked how I used Fable to edit its own launch video so I made a video about that!

TLDR it wrote a lot of code & tool calls to use transcription services, ffmpeg, do colorgrading, use the figma mcp, make remotion UI and render it.

I didn’t touch a video editor. [6036 ❤️ 367 🔄]

Google Labs (@GoogleLabs)

  • 🌍 Project Genie access is expanding even more! Starting today, Google AI Ultra 5X subscribers (our latest tier!) globally can access Project Genie.

Try it out here! [365 ❤️ 60 🔄]

Amjad Masad (@amasad)

  • Automate your job search with Replit! [164 ❤️ 5 🔄]
  • 🇺🇸🇺🇸🇺🇸 [802 ❤️ 40 🔄]
  • Super interesting approach to enterprise agents. Congrats on the launch @markiewagner [185 ❤️ 11 🔄]

Guillermo Rauch (@rauchg)

  • 🇬🇧 London calling
    Excited for Vercel Ship next week
    Some special announcements… [182 ❤️ 8 🔄]
  • What I love about Silicon Valley is that the future is up for grabs, ready for anyone to build.

I get intros for angel investing to all kinds of people. I take everyone equally seriously. 2 lads & a dog, or a 5-time award-winning entrepreneur.

No place more meritocratic. [815 ❤️ 20 🔄]

Aaron Levie (@levie)

  • Lots of evidence of huge jumps in capability for Fable across coding (and related) tasks. It’s also a major jump in accuracy and success in complex knowledge work tasks.

In our Box AI Complex Work Eval, we tested the model against Opus 4.8 and saw huge boosts across almost every industry. For our eval we give the Box AI Agent, using Fable, a set of hard real world knowledge work problems that deal with enterprise documents. Then score how the agent performs the tasks.

The main differentiators for Fable vs Opus 4.8 is that it doesn’t take shortcuts on complex reasoning, it gets multi-step calculations right, and it’s significantly more consistent across runs. We saw the biggest leaps in Media & Entertainment (78% vs 61%), Technology (81% vs 73%), Financial Services (89% vs 83%), and Healthcare (66% vs 60%).

Here are some specific examples:

  • Legal M&A due diligence: On a task reviewing NDA terms against a semiconductor company’s contracting policy, Fable correctly identified that a joint-ownership clause violates exclusivity requirements while a liability cap is permitted under a Super Cap exception. Fable scored 100% vs Opus’s 78%.

  • Healthcare: On a clinical radiology error audit across 12 reports, Fable precisely categorized each error by severity grade and correctly concluded no Grade 3 errors existed. Opus prematurely escalated a case to “major error requiring immediate departmental review” when the evidence didn’t support it — Fable 63% vs Opus 41%.

  • Media & Entertainment: On a genre profitability projection task, Fable correctly recognized that a 20% Argentine tax deduction was already embedded in the source spreadsheet figures and didn’t double-apply it. Opus applied it again on top — a compounding error across 4 genre calculations that took its score negative on the task vs Fable’s 74%.

  • Retail analytics: On a task analyzing high-growth product articles against an investment benchmark, Fable correctly computed each article’s growth rate individually and identified that only 2 of 5 exceeded the threshold. Opus confused “high growth relative to average” with “above the benchmark” — scoring 61% vs Fable’s 94%.

  • Financial Services: On a 5-year debt facility projection, Fable correctly applied interest to opening balances and used the right capex figure. Opus applied interest to the total facility amount and computed tax from the wrong base — two compounding errors. Fable scored 83% vs Opus’s 62%.

  • Technology: On a SaaS feature valuation requiring computation of a Feature Value Index across multiple regions, Fable applied the formula correctly and got exact values for the markets. Opus got the arithmetic wrong on multiple criteria — Fable scored 100% vs Opus’s 74%.

Overall, huge step change in complex analysis, work that requires analytical reasoning, and deep domain understanding. Fable will be available shortly in the Box AI Studio for customers to build agents with. [177 ❤️ 19 🔄]

Garry Tan (@garrytan)

  • May common sense reign in San Francisco for 100 years

Aaron Peskin, enjoy being a private citizen. [54 ❤️ 4 🔄]

  • Performative nonprofit industrial complex must be rooted out and defunded.

Their political cronies and grifter friends must not be allowed to squander the gifts of San Francisco any longer. [114 ❤️ 9 🔄]

  • Nessie just became the best way to get all your existing context, memory and history from ChatGPT, Perplexity, and Gemini into all the other places you have memory, and also get it into OpenClaw/Hermes Agent. Their OpenClaw and MCP servers are ace. [269 ❤️ 23 🔄]

Matt Turck (@mattturck)

  • 2026 is a BRUTAL grind in VC. You start in Davos, freeze in Aspen, hit Upfront, survive Milken, then it’s straight to Paris for the French Open. Briefly back in NYC for the Knicks. Then, total blur: SuperReturn in Berlin, Founders Forum in London, then back stateside for the World Cup, back to Paris for Raise AI, Idaho for Sun Valley, quick respite in Mykonos, then the Goldman tech gauntlet, Slush in Finland, NeurIPS in freakin’ Sydney… and boom, a productive year of thought leadership and adding value is over, and you’re a wreck. [488 ❤️ 22 🔄]

Zara Zhang (@zarazhangrui)

  • This is so good

Increasingly the output of an agency looks like a folder of files for agents, instead of one-off assets

“Get paid for your mind, not your hands” [104 ❤️ 7 🔄]

  • People should build agents/skills for their cross-functional teams.

For example, if a design team builds a design agent/skill for the marketing team (trained on all of the brand’s guidelines and design patterns), then the marketing team can produce more on-brand assets without having to bug designers every time

But the marketing team couldn’t have built this on their own; it takes the designer with their expertise, context, and knowledge

Same thing applies to every pair of teams that work closely with each other, often rely on each other, and complain about each other’s limited bandwidth

Building agents for your cross-functional teams ensures each team can be more self-sufficient, and moves us to a direction where teams can be organized by “loops” rather than “functions” [50 ❤️ 3 🔄]

  • It seems like most startups in San Francisco are selling products to each other

When I ask founders who their target audience is, 90% of the time it’s “engineering and product teams, AI-native startups”

Feels like the same small group of target audience is being bombarded with a million products, whereas very few people are building for the 99% of the world [203 ❤️ 17 🔄]

Nikunj Kothari (@nikunj)

  • TIL: You can just roast your way into getting some legit coffee at the Cognition office ☕️ [26 ❤️]

Dan Shipper (@danshipper)

  • absolutely insane game [91 ❤️]
  • I predicted this might happen on on @lennysan’s pod last year

Higher productivity from each individual employee with AI, makes it appealing to reshore certain jobs back to the US to be close to customers [164 ❤️ 15 🔄]

  • fable maxxing on the plane to SF [50 ❤️]

Claude (@claudeai)

  • From The Problem Solvers, our series featuring founders taking on hard problems with Claude: [137 ❤️ 11 🔄]
  • Michael Truell (@mntruell) fell in love with coding at 12. The company he co-founded, @cursor_ai, went from 15 people to 700 in two years.

Today, over 60% of the Fortune 500 build with its AI coding platform. [5258 ❤️ 246 🔄]

  • Scheduled deployments and environment variables in vaults are available today on the Claude Platform.

Read more: [119 ❤️ 10 🔄]


Follow Builders 自动生成 · 2026-06-11