

Open Model Benchmarks
How do you know if your model is actually working?
We’re gathering builders and researchers to share the unique, weird, and highly specific ways they are testing Gemma 4 and other models.
Whether you’re trying to get a sense for a model's vibes for a niche application or stress-testing reasoning capabilities in ways the original researchers never intended, we want to see your methodology!
We’re excited to feature a special launch from Kaggle—who will be unveiling their brand-new Agent Benchmarks—alongside speakers from Artificial Analysis and LMArena sharing their firsthand experience and insights on evaluating Gemma 4. We'll also have Erica Zhang presenting on a new Benchmark she's been working on with Gemma 4.
Limited spaces available, sign up today! Dinner and drinks are provided, excited to see everyone :)
Hosted in partnership with Kernel Labs!