When execution is free, judgment is expensive
What building at speed revealed
Cal shipped in 5 days. Claude Code produced functioning UI, wired-up components, and working API integrations faster than any developer handoff I've experienced. But the timeline isn't a boast. It's a data point about where the design effort went.
What took time was the evaluation loop. Running each generated plan through the same criteria a human trainer would: Does this progression make sense for someone at this fitness level? Are the rest periods appropriate for the intensity? Would a real athlete trust this enough to follow it for six weeks?
Zero visible AI scaffolding. The experience reads as a polished, intentional product, not a prototype. That's not because the AI was good enough on its own. It's because the evaluation criteria were specific enough to catch what "good enough" actually means.
The role stops being about making things and starts being about deciding what's worth making and whether what was made is good enough.
Browse the plan overview and tap into a day. The warmup sets, progressive overload, and rest periods are all AI-generated from one profile. Notice how injury accommodations (lower back) shape exercise selection across every session.