Adi's Digital Garden

Stages of Evaluation

Ideas are fragile things and evaluation is a crushingly heavy burden

Run a rigorous, full evaluation on every model switch, every prompt change, and every agent architecture change.
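One way to make this rule hard to skip is to fingerprint the three things that trigger a re-evaluation and compare against the fingerprint of the last evaluated configuration. The sketch below is only an illustration: `config_fingerprint` and `needs_full_evaluation` are hypothetical names, and it assumes the model, prompt, and architecture can each be described by a string.

```python
import hashlib
import json
from typing import Optional


def config_fingerprint(model: str, prompt: str, architecture: str) -> str:
    """Return a stable hash of the settings whose change should trigger evaluation."""
    payload = json.dumps(
        {"model": model, "prompt": prompt, "architecture": architecture},
        sort_keys=True,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def needs_full_evaluation(current: str, last_evaluated: Optional[str]) -> bool:
    """True when nothing was evaluated yet or any fingerprinted setting changed."""
    return current != last_evaluated
```

In a CI job, the fingerprint of the last fully evaluated configuration could be stored next to the evaluation report, and a mismatch would block the release until the evaluation is rerun.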

Embed the algo-judge in your app only if you have very high confidence in its precision and recall.
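A minimal sketch of how that confidence might be checked: compare the algo-judge's verdicts against human labels on the same examples, compute precision and recall, and gate on a threshold. The function names and the 0.95 thresholds are assumptions for illustration, not values from this post; here `True` means "flagged as a failure".

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class JudgeMetrics:
    precision: float
    recall: float


def judge_metrics(human: Sequence[bool], judge: Sequence[bool]) -> JudgeMetrics:
    """Score the judge's verdicts against human labels on the same examples."""
    if len(human) != len(judge):
        raise ValueError("human and judge label lists must have the same length")
    tp = sum(1 for h, j in zip(human, judge) if h and j)
    fp = sum(1 for h, j in zip(human, judge) if not h and j)
    fn = sum(1 for h, j in zip(human, judge) if h and not j)
    # Define a metric as 0.0 when its denominator is empty.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return JudgeMetrics(precision=precision, recall=recall)


def judge_is_trustworthy(
    metrics: JudgeMetrics,
    min_precision: float = 0.95,  # assumed threshold, tune per application
    min_recall: float = 0.95,  # assumed threshold, tune per application
) -> bool:
    """Gate: only embed the judge when both metrics clear their thresholds."""
    return metrics.precision >= min_precision and metrics.recall >= min_recall
```

The human-labeled set should be large and representative enough that the measured precision and recall mean something; a judge that clears the bar on a handful of examples has not earned a place in the app.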

