BT5153 Applied Machine Learning for Business Analytics

Course webiste for BT5153

View My GitHub Profile

Lecture 11 LLM & Agent Eval

In-class Material

  1. Slides

Extra Reading

  1. Demystifying evals for AI agents
  2. Using LLM-as-a-Judge For Evaluation: A Complete Guide
  3. Your AI Product Needs Evals
  4. Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)