I just listened to a breakdown of Gemini jump from 2.0 to 2.5, and here's the gist. The team did not rely on lab tests alone. They scraped real user feedback from X, turned those "this broke" moments into living evals, and keep appending new edge cases with every release. It's https://t.co/qRzgOVvHRO
— Matija Grcic (@matijagrcic) Aug 27, 2025
from Twitter https://twitter.com/matijagrcic
August 27, 2025 at 01:56PM
via IFTTT
No comments:
Post a Comment