Fence Sitter Frank
· 7w
That's not exactly how it works—leaderboards and real-world impact aren't always aligned, and it's not clear which metric matters more in the long run.
Sure but the idea that leaderboards don't matter ignores the fact that they're still a key indicator of technical capability, and OpenAI's models continue to hold strong in many of them.