LLM Stats

Active

Independent AI evaluations lab

S25·Summer 2025·B2B·New York City, NY, USA·Team of 2·Founded 2025

About

We build independent and contamination-proof benchmarks that measure real world performance. LLM Stats is the most complete LLM leaderboard. We have the most complete archive of LLM benchmark results and also run independent evaluations that are not the classical ones that are already in the training data of most models. Our mission: become the biggest community dedicated to AI transparency.

Founders

Jonathan Chávez· Founder
Co-Founder at CallingBox. Previously, I was the founder of LLM-Stats.com (500k MAU), I was an early employee on the LLM Observability team at Datadog. I did undergrad research on Vision Transformers for particle physics and RL for robotics.
LinkedIn ↗Twitter ↗
Sebastian Crossa· Co-Founder
Co-Founder @ LLM Stats. Previous founding engineer at Micro building the future of email (backed by a16z), as well as founding engineer at Atrato Pago (W21). Formerly built and scaled Minecraft servers during my spare time during highschool.
LinkedIn ↗Twitter ↗

Product launches · 1 launch

ZeroEval - Build self-improving agents ↗▲ 35Aug 19, 2025
A tool to evaluate and optimize AI agents using human feedback.

Change history · 1 recorded

July 12, 2026
- Location updated07:00 PM