I run alpaca.cpp on a laptop with 8 GB ram and the 7B model. Works pretty well. I would love to find a project that would enable me to go to the 13B model, but have not yet found one that enables me to run that on only 8 GB ram. On Wed, 5 Apr 2023, Undescribed Horrific Abuse, One Victim & Survivor of Many wrote:
https://python.langchain.com/en/latest/modules/models/llms/integrations/gpt4...
note: - llamacpp is cpu only. the huggingface backend can run the same model on gpu. it is not always faster - llamacpp is experiencing political targeting and a little upheaval in optimization code - there are newer and more powerful models that can be loaded just like gpt4all, such as vicuna
langchain is a powerful and noncomplex language model frontend library supporting openai that provides for coding autonomous agents that use tools and access datastores. see also llama-index .