Article: Community Evals: Because we're done trusting black-box leaderboards over the community (9 days ago)
Reply: Really cool post! In particular, this was eye-opening to me: However, I would consider both Unicode and UTF-8 to be tokenizers.
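To make the "Unicode and UTF-8 as tokenizers" view concrete, here is a minimal sketch in Python. It is only an illustration of the reply's claim, not anything taken from the article: the sample string and the variable names are my own assumptions. The idea is that both schemes map text to a sequence of integer IDs drawn from a fixed vocabulary, and both are reversible.

```python
# Sample text is an arbitrary choice for illustration.
text = "héllo"

# "Unicode as a tokenizer": one integer ID (the code point) per character.
code_points = [ord(ch) for ch in text]

# "UTF-8 as a tokenizer": one integer ID in the range 0-255 per byte.
utf8_bytes = list(text.encode("utf-8"))

# Both mappings are lossless: the IDs decode back to the original text.
assert "".join(chr(cp) for cp in code_points) == text
assert bytes(utf8_bytes).decode("utf-8") == text

print(code_points)  # [104, 233, 108, 108, 111]
print(utf8_bytes)   # [104, 195, 169, 108, 108, 111]
```

Note the trade-off this shows: the code-point view gives a shorter sequence over a very large vocabulary, while the byte view gives a longer sequence (the "é" becomes two IDs) over a vocabulary of only 256 entries.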