v0.1.0
- Introduce a generic Hugging Face model, which can be used to run many popular models hosted on Hugging Face
- Upgrade llama.cpp to run newer Llama models such as Llama 2
- Streamline Docker images down to two: a lightweight default image and a GPU-enabled image with NVIDIA CUDA support
- General fixes and stability improvements