
Why does a double BOS token get added to my InferenceResource.chat_completion calls and cause a check_double_bos_eos error? #55

Open
thaddavis opened this issue Dec 3, 2024 · 1 comment
Labels: question (Further information is requested)

Comments

@thaddavis

When I make this call, I get an error:

from llama_stack_client import LlamaStackClient
from llama_stack_client.types import UserMessage

# Point the client at your running Llama Stack server (URL is a placeholder).
client = LlamaStackClient(base_url="http://localhost:5000")

response = client.inference.chat_completion(
    messages=[
        UserMessage(
            content="hello world, write me a 3 word poem about the moon",
            role="user",
        ),
    ],
    model_id="meta-llama/Llama-3.2-1B-Instruct",
    stream=False,
)
[Screenshot attached (2024-12-02, 8:51:41 PM): error output showing the check_double_bos_eos failure]
@yanxi0830
Contributor

Wondering which inference provider you are using? It looks like Ollama from the logs. If you are using Ollama, which model did you pull with ollama run?
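For anyone checking the same thing: ollama list shows what the Ollama provider is serving, and you can compare that against the models the Llama Stack server has registered. A minimal sketch, assuming the llama_stack_client Python SDK exposes client.models.list() and reusing the client object from the snippet above:

# Print the models registered with the Llama Stack server so the model_id
# passed to chat_completion can be checked against what Ollama is serving.
# (client.models.list() is assumed to be available in this SDK version.)
for model in client.models.list():
    print(model)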

yanxi0830 added the question (Further information is requested) label on Jan 3, 2025