This repository represents the official implementation of the paper titled "MangaNinja: Line Art Colorization with Precise Reference Following".
Zhiheng Liu*
·
Ka Leong Cheng*
·
Xi Chen
·
Jie Xiao
·
Hao Ouyang
·
Kai Zhu
·
Yu Liu
·
Yujun Shen
·
Qifeng Chen
·
Ping Luo
We propose MangaNinja, a reference-based line art colorization method. MangaNinja automatically aligns the reference with the line art for colorization, demonstrating remarkable consistency. Additionally, users can achieve more complex tasks using point control. We hope that MangaNinja can accelerate the colorization process in the anime industry.
- 2025-01-20: 🔥 MangaNinja is available on huggingface space, Thanks Sylvain Filoni ! You can find it here.
- 2025-01-16: 🔥 MangaNinja is available on windows, 6G VRAM need Auto install and Download Model. Thanks sdbds ! You can find it here.
- 2025-01-15: Inference code and paper are released.
- 🏃: We will open an issue area to investigate user needs and adjust the model accordingly. This includes more memory-efficient structures, data formats for line art (such as binary line art), and considering retraining MangaNinjia on a better foundation model (sd3,flux).
Clone the repository (requires git):
git clone https://github.com/ali-vilab/MangaNinjia.git
cd MangaNinjia
Install with conda
:
conda env create -f environment.yaml
conda activate MangaNinjia
- You could download them from HuggingFace: StableDiffusion, clip-vit-large-patch14, control_v11p_sd15_lineart and Annotators
- You could download our MangaNinjia model from HuggingFace
- The downloaded checkpoint directory should have the following structure:
-- checkpoints
|-- StableDiffusion
|-- models
|-- clip-vit-large-patch14
|-- control_v11p_sd15_lineart
|-- Annotators
|--sk_model.pth
|-- MangaNinjia
|-- denoising_unet.pth
|-- reference_unet.pth
|-- point_net.pth
|-- controlnet.pth
cd scripts
bash infer.sh
You can find all results in output/
. Enjoy!
The default settings are optimized for the best result. However, the behavior of the code can be customized:
--denoise_steps
: Number of denoising steps of each inference pass. For the original (DDIM) version, it's recommended to use 20-50 steps.--is_lineart
: If the user provides an image and the task is to color the line art within that image, this parameter is not needed. However, if the input is already a line art and no additional extraction is necessary, then this parameter should be included.--guidance_scale_ref
: Increasing makes the model more inclined to accept the guidance of the reference image.--guidance_scale_point
: Increasing makes the model more inclined to input point guidance to achieve more customized colorization.--point_ref_paths
and--point_lineart_paths
(optional): Two 512x512 matrices are used to represent the matching points between the corresponding reference and line art with continuously increasing integers. That is, the coordinates of the matching points in both matrices will have the same values: 1, 2, 3, etc., while the values in other positions will be 0 (you can refer to the provided samples). Of course, we recommend using Gradio for point guidance.
First, modify ./configs/inference.yaml
to set the path of model weights. Afterwards, run the script:
python run_gradio.py
The gradio demo would look like the UI shown below.
-
Upload the reference image and target image.
Note that for the target image, there are two modes: you can upload an RGB image, and the model will automatically extract the line art; or you can directly upload the line art by checking the 'input is lineart' option.
The line art images are single-channel grayscale images, where the input consists of floating-point values with the background set to 0 and the line art close to 1. Additionally, we would like to further communicate with our users: if the line art you commonly use is binarized, please let us know. We will fine-tune the model and release an updated version to better suit your needs. 😆
-
Click 'Process Images' to resize the images to 512*512 resolution.
-
(Optional) Starting from the reference image, alternately click on the reference and target images in sequence to define matching points. Use 'Undo' to revert the last action.
-
Click 'Generate' to produce the result.
This project is developped on the codebase of MagicAnimate. We appreciate this great work!
Please cite our paper:
@article{liu2025manganinja,
title={MangaNinja: Line Art Colorization with Precise Reference Following},
author={Liu, Zhiheng and Cheng, Ka Leong and Chen, Xi and Xiao, Jie and Ouyang, Hao and Zhu, Kai and Liu, Yu and Shen, Yujun and Chen, Qifeng and Luo, Ping},
journal={arXiv preprint arXiv:2501.08332},
year={2025}
}