Here are 7 reasoning datasets distilled from reasoning models like @deepseek_ai R1, @Alibaba_Qwen QwQ, or @GoogleDeepMind Flash Thinking: 1️⃣ ServiceNow-AI/R1-Distill-SFT: 1.7M samples distilled from DeepSeek-R1-Distill-Qwen-32B across 9 different source datasets (not yet filtered).… https://t.co/UFHfcCMsb8 https://t.co/xQ7ZPxuVUh
— Philipp Schmid (@_philschmid) Feb 2, 2025
from Twitter https://twitter.com/_philschmid
February 02, 2025 at 02:14PM