v0.1.0

1b5d released this 23 Jul 21:21

928ec4a

Introduce a generic Hugging Face model that can run many popular models hosted on Hugging Face
Upgrade llama.cpp to support newer Llama models such as Llama 2
Streamline the Docker images down to two: a lightweight default image and a GPU-enabled image with NVIDIA CUDA support
General fixes and stability improvements
