Qwen-14B (Alibaba) The most powerful open-source model for it's size. And the longest trained: 3T tokens. Comes in 5 different versions: Base, Chat, Code, Math and Vision. (And is even trained for tool usage!) Opinion: You should consider it as your new "go-to". --- Paper:… https://t.co/MNinyS70S7 https://t.co/ZkJVxBQdlW
— Yam Peleg (@Yampeleg) Sep 27, 2023
from Twitter https://twitter.com/Yampeleg
September 27, 2023 at 02:40AM
via IFTTT
No comments:
Post a Comment