Private chat / QA over docs at ~25 tokens/s with 13B Llama-v2 (on a Mac M2 Max GPU). Using @trychroma vector DB, @nomic_ai GPT4All embeddings, and Llama-v2. Full recipe added to @LangChainAI docs: https://t.co/amzJ9ZcfeE https://t.co/KNgNdxRDqB
— Lance Martin (@RLanceMartin) Jul 19, 2023
from Twitter https://twitter.com/RLanceMartin
July 19, 2023 at 11:33AM
via IFTTT
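
For readers who want a feel for the retrieve-then-ask flow the tweet describes, here is a rough, standard-library-only sketch. It is not the recipe from the LangChain docs: the word-count `embed` function stands in for GPT4All embeddings, `ToyVectorStore` stands in for Chroma, and the prompt is only built, not sent to Llama-v2. Every name below (`embed`, `cosine`, `ToyVectorStore`, `build_prompt`, `generation_seconds`) is hypothetical and exists only for illustration.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding: lowercase word counts (assumed stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class ToyVectorStore:
    """In-memory stand-in for a vector DB: stores (document, embedding) pairs."""

    def __init__(self, docs):
        self.entries = [(doc, embed(doc)) for doc in docs]

    def search(self, query, k=2):
        """Return the k documents most similar to the query."""
        q = embed(query)
        ranked = sorted(self.entries, key=lambda e: cosine(q, e[1]), reverse=True)
        return [doc for doc, _ in ranked[:k]]


def build_prompt(question, context_docs):
    """Assemble the text a local LLM would receive: retrieved context plus the question."""
    context = "\n".join(f"- {d}" for d in context_docs)
    return (
        "Answer using only this context:\n"
        f"{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


def generation_seconds(num_tokens, tokens_per_second=25.0):
    """Rough generation time at a given throughput (default: the ~25 tokens/s from the tweet)."""
    return num_tokens / tokens_per_second


if __name__ == "__main__":
    store = ToyVectorStore([
        "Chroma stores document embeddings for similarity search.",
        "Llama 2 is a family of large language models.",
        "Bananas are rich in potassium.",
    ])
    question = "Which store keeps embeddings for search?"
    print(build_prompt(question, store.search(question, k=1)))
    print(generation_seconds(200), "seconds for a 200-token answer")
```

At the quoted ~25 tokens/s, a 200-token answer would take about 8 seconds to generate. In a real setup, the toy pieces would presumably be swapped for the actual embedding model, vector DB, and local LLM named in the tweet.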