Blog
Featured
Updates
Hello World
Shahul Es
Nov 19, 2025
How solving evaluation taught us we needed to solve training
Read more
AI
Evaluating the Evaluators
Shahul Es
Aug 18, 2025
Benchmarking Alignment Strategies for LLM-as-Judges
Read more
AI
Evals
OSS
Hard-Earned Lessons from 2 Years of Improving AI Applications
Shahul
May 7, 2025
A step-by-step guide to setup evaluations and improve AI systems
Read more
LLM
Evals
Aligning LLM as judge with human evaluators
Shahul Es
Dec 11, 2024
Aligning and Improving LLM based metrics using human feedback
Read more
LLM
Data
All about synthetic data generation
Shahul Es
Nov 19, 2024
An in-depth survey blog on synthetic data generation with LLMs
Read more
LLM
Evaluation
All about evaluating Large language models
Shahul Es
Jul 9, 2024
Deep survey blog on evaluating LLM applications
Read more





