Discussion about this post

User's avatar
Neural Foundry's avatar

The Air Canada example really drives the point home. It's one thing to have a chatbot give a suboptimal answer, but when it starts making promises that contradict company policy, you're essentialy giving customers ammunition for legal actin. The three layer eval system makes a lot of sens for catching these kind of issues before they escallate.

Expand full comment
Benedikt Kantus's avatar

This article is right on spot between user focus and technical accuracy. Very helpful!

Expand full comment
7 more comments...

No posts

Ready for more?