Thoughts: Favorite tweets

Sunday, October 20, 2024

Favorite tweets

You can fine-tune a base language model to the BitNet 1.58b architecture. You dont necessarily have to train a model from scratch. i.e. you can fine-tune an existing model to 1.58 bits! --- Now BitNet is a special transformers architecture that represents each parameter with… https://t.co/oSQLBufSdh https://t.co/YQhEAWE7YM https://t.co/d0PNpHEuQL
— Rohan Paul (@rohanpaul_ai) Oct 21, 2024

from Twitter https://twitter.com/rohanpaul_ai

October 21, 2024 at 01:20AM
via IFTTT

No comments:

Post a Comment

Subscribe to: Post Comments (Atom)