Major reasoning models so far with technical reports (focused on those w RL): 2025-01-22 — DeepSeek R1 — https://t.co/FanOPm9oTF 2025-01-22 — Kimi 1.5 — https://t.co/NN8Nr1EAmQ 2025-03-31 — Open-Reasoner-Zero — https://t.co/H5ycSmAkwS 2025-04-10 — Seed 1.5-Thinking —
— Nathan Lambert (@natolambert) Jun 11, 2025
from Twitter https://twitter.com/natolambert
June 11, 2025 at 03:13AM
via IFTTT
No comments:
Post a Comment