diff --git a/models/llama-2-7b-chat.ggmlv3.q4_0.bin.md b/models/llama-2-7b-chat.ggmlv3.q4_0.bin.md index c382386..3d55f4c 100644 --- a/models/llama-2-7b-chat.ggmlv3.q4_0.bin.md +++ b/models/llama-2-7b-chat.ggmlv3.q4_0.bin.md @@ -9,3 +9,5 @@ Download the Llama 2 model file named `llama-2-7b-chat.ggmlv3.q4_0.bin` from the | Name | Quant method | Bits | Size | Max RAM required | |--------------------------------|--------------|------|---------|------------------| | llama-2-7b-chat.ggmlv3.q4_0.bin | q4_0 | 4 | 3.79 GB | 6.29 GB | + +**Note:** After downloading the model, add the model file to the `models` directory. The file should be located at `models\llama-2-7b-chat.ggmlv3.q4_0.bin`, in order to run the code.