Why your AI evals keep breaking

atla-ai.com

5 points by capybarahi 6 hours ago

sanskarix 6 hours ago

[dead]

  • thelemonbot 6 hours ago

    Totally agree - that was our follow-up post, on drift/clustering :) https://www.atla-ai.com/post/automating-error-analysis