Incredible paper from Stanford. They trained a reasoning model that matched and outperformed OpenAI’s o1 using just 1,000 examples. It uses a clever trick: if the model stopped thinking they added "Wait" to make it continue reasoning. https://t.co/QwHQbTXSmk
— Lior⚡ (@LiorOnAI) Apr 5, 2025
from Twitter https://twitter.com/LiorOnAI
April 05, 2025 at 01:00PM
via IFTTT
No comments:
Post a Comment