Latte Model Integration with Gradio

This project integrates the Latte model from the Diffusers library to generate images from textual prompts using an easy-to-use Gradio interface. The Latte model, Latent Diffusion Transformer, is designed for advanced video and image generation, providing detailed and high-quality outputs.

Prerequisites

Make sure you have the following libraries installed before running the project:

pip install diffusers gradio torch

Diffusers: Provides access to pre-trained diffusion models, including the Latte model.
Gradio: Used to create an interactive interface for easy input and output.
Torch: Required for the Latte model to perform computations efficiently.

How to Use the Gradio Interface

Model Loading: The Latte model is loaded using the Diffusers library from a pre-trained checkpoint:

from diffusers import DiffusionPipeline

model_id = "maxin-cn/Latte-1"
pipe = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16).to("cuda" if torch.cuda.is_available() else "cpu")

Interactive Image Generation: The Gradio interface is designed to allow users to input a caption and generate an image based on that caption:

import gradio as gr

def generate_image(prompt):
    # Generate an image based on the prompt using the Latte pipeline
    with torch.autocast("cuda"):
        image = pipe(prompt).images[0]
    return image

# Create a Gradio interface
with gr.Blocks() as demo:
    gr.Markdown("# Generate Images with Latte Model (Latent Diffusion Transformer)")
    
    prompt_input = gr.Textbox(label="Enter Prompt", placeholder="Type a description for your image...")
    generate_button = gr.Button("Generate Image")
    image_output = gr.Image(label="Generated Image")

    generate_button.click(fn=generate_image, inputs=prompt_input, outputs=image_output)

# Launch the Gradio interface
demo.launch()

Running the Interface

Setup the Environment: Ensure you have GPU access for efficient generation.
Run the Script: Execute the above script in Colab, or your local Python setup.
Interactive Use:
- A link will be provided to access the Gradio interface.
- Input a prompt like "Astronaut in a jungle, cold color palette, muted colors" and click on "Generate Image" to get the result.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
Latte generated-20241127T145044Z-001.zip		Latte generated-20241127T145044Z-001.zip
README.md		README.md
TEAM2 - DOCUMENTATION REPORT.pdf		TEAM2 - DOCUMENTATION REPORT.pdf
TEAM2(1a).py		TEAM2(1a).py
run.ipynb		run.ipynb
video_captions.csv		video_captions.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Latte Model Integration with Gradio

Prerequisites

How to Use the Gradio Interface

Running the Interface

About

Releases

Packages

Contributors 2

Languages

CoNfIg7952/Project-1a---Rare-Human-Action-AI-video-Generator

Folders and files

Latest commit

History

Repository files navigation

Latte Model Integration with Gradio

Prerequisites

How to Use the Gradio Interface

Running the Interface

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages