Content
Tag
1 entry tagged Pre-Deployment Evaluation · 1 term.
A pre-release method of running a candidate AI model against stripped real-user conversation logs to predict the behaviors it will exhibit in production before any user sees it.