Real-world evals are exactly what you need
Great conversation between Logan Kilpatrick of Google and Alex Atallah of OpenRouterAI covering a wide range of topics. Most interesting to me is their discussion of why evals based on real-world model use are what matter.