Thoughts: Favorite tweets

Saturday, April 5, 2025


Incredible paper from Stanford. They trained a reasoning model that matched and outperformed OpenAI’s o1 using just 1,000 examples. It uses a clever trick: if the model stopped thinking they added "Wait" to make it continue reasoning. https://t.co/QwHQbTXSmk
— Lior⚡ (@LiorOnAI) Apr 5, 2025
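
The "Wait" trick described in the tweet can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the `END_OF_THINKING` marker, the `think_with_nudges` helper, and the `toy_model` stand-in are all hypothetical names made up for this example, and a real setup would call an actual language model where `toy_model` returns canned text.

```python
from typing import Callable

# Hypothetical stop marker; the real delimiter depends on the model and is not given in the tweet.
END_OF_THINKING = "<end_of_thinking>"


def think_with_nudges(
    generate: Callable[[str], str],
    prompt: str,
    max_nudges: int = 2,
    nudge: str = "Wait",
) -> str:
    """Collect a reasoning trace, appending `nudge` when the model tries to stop.

    `generate` takes the text so far and returns the next chunk, ending with
    END_OF_THINKING if the model wants to stop reasoning.
    """
    trace = ""
    nudges_used = 0
    while True:
        chunk = generate(prompt + trace)
        stopped = chunk.endswith(END_OF_THINKING)
        if stopped:
            # Drop the stop marker so the model can be asked to keep going.
            chunk = chunk[: -len(END_OF_THINKING)]
        trace += chunk
        if not stopped or nudges_used >= max_nudges:
            return trace
        # The model tried to stop early: append the nudge and generate again.
        trace += f" {nudge},"
        nudges_used += 1


def toy_model(context: str) -> str:
    """Stand-in for a real model: picks a canned chunk by counting earlier nudges."""
    chunks = [
        "2 + 2 * 3 = 12.",
        " multiplication comes first, so 2 + 6 = 8.",
        " checking again: 2 * 3 = 6 and 2 + 6 = 8.",
    ]
    index = min(context.count("Wait"), len(chunks) - 1)
    return chunks[index] + END_OF_THINKING


trace = think_with_nudges(toy_model, "What is 2 + 2 * 3? ", max_nudges=2)
print(trace)
```

With `max_nudges=0` the helper returns the first attempt unchanged, so the parameter acts as a simple knob for how much extra reasoning is forced.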

from Twitter https://twitter.com/LiorOnAI

April 05, 2025 at 01:00PM
via IFTTT
