Tag: evals
All the articles with the tag "evals".
-
Learnings from building an AI coding agent: How I build 'AI Evals' system for my product (2/3)
Published:• 5 min readDefining evaluation methodology, metrics & evals, and using LLM-as-a-judge for my product, a coding agent.
-
Learnings from building an AI coding agent: Key Learnings (1/3)
Published:• 3 min readLearnings from building AI/LLM coding agents highlighting value of AI evals, trustworthy workflows and the impact created.