I was overspending on GPT-4o. It was really hard to compare different models I could switch to, so I built this LLM comparison tool.
It shows leaderboards, pricing, and performance data across 100+ LLMs (including all major providers and open-source models).
Key features: - Live pricing comparisons - Benchmark Scores (MMLU, HumanEval, GPQA, etc.) - Context length vs cost analysis - Speed/throughput tests across providers - Quality vs price visualizations - Open source (all data verifiable)
Try it out: https://llmstats.com
I'd like to know your opinion :)
Tech stack: Next.js, TypeScript, Recharts. All data available on GitHub.