Run open-source generative AI models in a lightweight, reliable, and customizable Rust API.
This is a Rust project powered by Hugging Face's Candle and Tokio's Axum. It focuses on text generation and image generation models.
- Install Rust
- Clone this repo
- Set the `API_ADDRESS` and optionally the `API_PORT` environment variables to the desired address and port, e.g. `API_ADDRESS=127.0.0.1 API_PORT=8080`
- Run
cargo run --release
Disclaimer: this program is designed to run on limited hardware, so prompts often take upwards of a minute to finish generating. The API uses a prompt/polling strategy for standalone prompts so that the long-running async generation on the server does not cause HTTP requests to time out.
Generates text from a prompt; the result is stored on the server for a limited time, to be polled. Returns the id of your content.
Parameters:
- model: string
- prompt: string
- sample_len: integer
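For illustration, here is a minimal client sketch using the `reqwest` (blocking, `json` feature) and `serde_json` crates. The base URL, the `/prompt` route, and the model name string are assumptions; substitute the address you configured and the routes your build exposes:

```rust
use std::time::Duration;

/// Submit a prompt for generation and return the content id to poll with.
/// NOTE: the base URL, `/prompt` route, and model name are assumptions.
fn request_generation() -> anyhow::Result<String> {
    let client = reqwest::blocking::Client::builder()
        .timeout(Duration::from_secs(10))
        .build()?;
    let id = client
        .post("http://127.0.0.1:8080/prompt")
        // The body mirrors the parameter list above: model, prompt, sample_len.
        .json(&serde_json::json!({
            "model": "Mistral7bInstructQuantized",
            "prompt": "Why is the sky blue?",
            "sample_len": 200,
        }))
        .send()?
        .error_for_status()?
        .text()?;
    Ok(id)
}
```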
Polls the server for the generated text. Returns the generated text or an error if the id is invalid or the content has expired.
Parameters:
- id: string
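Continuing the sketch above, a client can poll until the text is ready, which matches the prompt/polling strategy described in the disclaimer. The `/poll` route, the `id` query parameter, and the assumption that a non-success status means "not ready yet" are all hypothetical:

```rust
use std::{thread, time::Duration};

/// Poll for generated text using the id returned by the prompt endpoint.
/// NOTE: the `/poll` route and `id` query parameter are assumptions, as is
/// how the server signals "still generating".
fn poll_generation(id: &str) -> anyhow::Result<String> {
    let client = reqwest::blocking::Client::new();
    // Generation can take upwards of a minute, so retry with a short backoff.
    for _ in 0..60 {
        let resp = client
            .get(format!("http://127.0.0.1:8080/poll?id={id}"))
            .send()?;
        if resp.status().is_success() {
            return Ok(resp.text()?);
        }
        thread::sleep(Duration::from_secs(2));
    }
    anyhow::bail!("id invalid, content expired, or generation still running")
}
```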
Initializes a model and sets up message history for a new streaming session. Returns the id of your content.
Parameters:
- model: string
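A hypothetical session setup, with the `/stream/new` route assumed:

```rust
/// Create a streaming session for the given model and return its id.
/// NOTE: the `/stream/new` route is an assumption.
fn new_session(model: &str) -> anyhow::Result<String> {
    let client = reqwest::blocking::Client::new();
    let id = client
        .post("http://127.0.0.1:8080/stream/new")
        .json(&serde_json::json!({ "model": model }))
        .send()?
        .error_for_status()?
        .text()?;
    Ok(id)
}
```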
Generates text from a prompt and streams it token by token to the consumer. Message history is stored on the server for a limited time.
Parameters:
- id: string
- prompt: string
- sample_len: integer
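Because blocking `reqwest` responses implement `std::io::Read`, the streamed body can be printed as it arrives. A sketch with the `/stream/prompt` route assumed, and the client timeout disabled so a long generation is not cut off:

```rust
use std::io::Read;

/// Send a prompt into an existing session and print tokens as they stream in.
/// NOTE: the `/stream/prompt` route is an assumption.
fn stream_prompt(id: &str, prompt: &str) -> anyhow::Result<()> {
    let client = reqwest::blocking::Client::builder()
        .timeout(None) // generation can outlast reqwest's 30s default timeout
        .build()?;
    let mut resp = client
        .post("http://127.0.0.1:8080/stream/prompt")
        .json(&serde_json::json!({
            "id": id,
            "prompt": prompt,
            "sample_len": 200,
        }))
        .send()?
        .error_for_status()?;
    // Blocking responses implement `std::io::Read`: consume chunks as they land.
    let mut buf = [0u8; 512];
    loop {
        let n = resp.read(&mut buf)?;
        if n == 0 {
            break; // stream finished
        }
        print!("{}", String::from_utf8_lossy(&buf[..n]));
    }
    Ok(())
}
```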
- Mistral7b
- Mistral7b Instruct
- Mistral7b Instruct V02
- Mixtral (needs a beefy GPU)
- Mixtral Instruct (needs a beefy GPU)
- Mistral7b Quantized
- Mistral7b Instruct Quantized
- Mistral7b Instruct V02 Quantized
- Mixtral Quantized
- Mixtral Instruct Quantized
- Zephyr Alpha Quantized (fine-tuned Mistral7b)
- Zephyr Beta Quantized (fine-tuned Mistral7b)
- Dolphin Mixtral Quantized (fine-tuned Mixtral)
- other LLMs coming soon...
- coming soon...
- Create a struct that implements the following trait, where generated tokens are streamed into the mpsc Sender:

// Sender is Tokio's mpsc sender, matching the Tokio/Axum server runtime.
use tokio::sync::mpsc::Sender;

pub trait TextGeneratorInner: Send + Sync {
    fn run(&mut self, prompt: &str, sample_len: u32, sender: Sender<String>) -> anyhow::Result<()>;
}
- Implement a factory function that loads the model into memory. This is not part of the trait because, by design, it may take custom arguments of your choice, or none at all:

impl YourModel {
    pub fn new(arguments_of_your_choice: Args, or_none_at_all: Option<Args>) -> anyhow::Result<Self> {
        ...
    }
}
- Add the model to the `TextGenerationModel` enum in src/text_generation/utils.rs; this is what HTTP requests will identify your model as:

pub enum TextGenerationModel {
    YourModelName,
    ...
}
- Add the model name and factory function to the `match` statement in src/text_generation/utils.rs:

impl TextGenerator {
    pub fn new(model: TextGenerationModel, args: &TextGenerationArgs) -> anyhow::Result<Self> {
        match model {
            TextGenerationModel::YourModelName => wrap(YourModel::new(args)?),
            ...
        }
    }
}
- Done! You can now use your model in the API.
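To tie the steps together, here is a minimal, hypothetical `EchoModel` that satisfies the trait without loading any weights; a real model would load Candle weights in `new` and run its sampling loop in `run`:

```rust
use tokio::sync::mpsc::Sender;

/// A do-nothing model that "generates" by echoing the prompt word by word.
/// Hypothetical, for illustration only; real implementations load weights in
/// `new` and run the token sampling loop in `run`.
pub struct EchoModel;

impl EchoModel {
    pub fn new() -> anyhow::Result<Self> {
        Ok(Self) // a real factory would load and validate model weights here
    }
}

impl TextGeneratorInner for EchoModel {
    fn run(&mut self, prompt: &str, sample_len: u32, sender: Sender<String>) -> anyhow::Result<()> {
        for token in prompt.split_whitespace().take(sample_len as usize) {
            // `blocking_send` bridges this synchronous loop to the async
            // consumer; it must run off the async runtime (e.g. in
            // spawn_blocking), which fits this trait's synchronous design.
            sender.blocking_send(format!("{token} "))?;
        }
        Ok(())
    }
}
```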
coming soon ...