Testing the Untestable – four phases of automated evals for LLM-powered featuresallenpike.com1 pointthunderbong2 years ago