v0.1.9
- Updates the bindings to work with the new llama.cpp API from ggerganov/llama.cpp#370
- Adds two separate interfaces: `LlamaInference`, which is similar to the bindings in v0.1.8, and the lower-level `LlamaContext` (currently untested)
- The old bindings are still present in `PyLlama.cpp` but are currently not compiled and will be removed at a later date