You can fine-tune a base language model to the BitNet 1.58b architecture. You dont necessarily have to train a model from scratch. i.e. you can fine-tune an existing model to 1.58 bits! --- Now BitNet is a special transformers architecture that represents each parameter with… https://t.co/oSQLBufSdh https://t.co/YQhEAWE7YM https://t.co/d0PNpHEuQL
— Rohan Paul (@rohanpaul_ai) Oct 21, 2024
from Twitter https://twitter.com/rohanpaul_ai
October 21, 2024 at 01:20AM
via IFTTT
No comments:
Post a Comment