Discussion about this post

User's avatar
Vishal Kataria's avatar

Great insight, Elena.

AI Evals are especially important since they bridge the gap between what engineering prioritizes (MMLU scores) and what users expect (quality, reliability, safety, and performance). And AI PMs can decide the next step based on them: whether to train the model, pivot, or put it on hold.

Expand full comment

No posts