ImageCaptioning-Attention-PyQt5

ImageCaptioning improved with attention. Also a PyQt5 applications

Welcome !

Hello guys, hope you are doing awesome these days !😄
In my previous ImageCaption repository, I implemented a ImageCaption algorithm and I promised to upload an attention based version latter. And here it is ！😄
Using the ResNet50 pretrained on ImageNet as the backbone(no finetune) and also some attention, the model can describe image like human(most of the time).
Moreover,Beam Search are also used during the inferrence part and this give another great improvment on the model's performence
Now, let's enjoy some funny stuff😎

1.Examples👍

①.doggy doggy, juicy doggy

③.You don't want to mess up with No. 1 shooter in the west

④.🌶④💉💧🐮🍺

⑤.Portland Timbers, Assemble!

⑥.mountchicken must has something to do with mountain

2.Requirements

skimage
spacy
PyQt5
Pip install them

3.Train😣

download flickr30k

Download the flickr30k dataset, unpack all the images into the folder flickr30k/flickr30k-images. I have already preprocessed the captions.txt, and you don't need to download that
flickr(提取码:hrf3)

download my checkpoint(if you don't want to train it with 14h on GeForce2080ti)

Put the downloaded checkpoint into the folder checkpoint
checkpoint(提取码:qny4)

change some parameters

train.py line20 - line26, set the dataset path
train.py line31 - line34, load_model:load my checkpoint or not.
Ok, you can train now

4.Inferrence😀

inferrence.py line245, choose your predict image path

5.APP

Run main_gui.py

if you run the .py file succesfully, it should look like this

Then, you need to push the initialize button to load the model, after that, just wait the `Finished` sign appers in the right.

Finally, load the image with `Load Image` button and press `Detect`

For more issue, contact me

Email Address mountchicken@outlook.com

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
flickr30k		flickr30k
github		github
vocabulary		vocabulary
README.md		README.md
Ui_main.py		Ui_main.py
get_loader.py		get_loader.py
inferrence.py		inferrence.py
itos_utils.py		itos_utils.py
main.ui		main.ui
main_gui.py		main_gui.py
model.py		model.py
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ImageCaptioning-Attention-PyQt5

Welcome !

1.Examples👍

①.doggy doggy, juicy doggy

③.You don't want to mess up with No. 1 shooter in the west

④.🌶④💉💧🐮🍺

⑤.Portland Timbers, Assemble!

⑥.mountchicken must has something to do with mountain

2.Requirements

3.Train😣

download flickr30k

download my checkpoint(if you don't want to train it with 14h on GeForce2080ti)

change some parameters

4.Inferrence😀

5.APP

Run main_gui.py

if you run the .py file succesfully, it should look like this

Then, you need to push the initialize button to load the model, after that, just wait the `Finished` sign appers in the right.

Finally, load the image with `Load Image` button and press `Detect`

For more issue, contact me

About

Releases

Packages

Languages

Mountchicken/ImageCaptioning-Attention-PyQt5

Folders and files

Latest commit

History

Repository files navigation

ImageCaptioning-Attention-PyQt5

Welcome !

1.Examples👍

①.doggy doggy, juicy doggy

③.You don't want to mess up with No. 1 shooter in the west

④.🌶④💉💧🐮🍺

⑤.Portland Timbers, Assemble!

⑥.mountchicken must has something to do with mountain

2.Requirements

3.Train😣

download flickr30k

download my checkpoint(if you don't want to train it with 14h on GeForce2080ti)

change some parameters

4.Inferrence😀

5.APP

Run main_gui.py

if you run the .py file succesfully, it should look like this

Then, you need to push the initialize button to load the model, after that, just wait the Finished sign appers in the right.

Finally, load the image with Load Image button and press Detect

For more issue, contact me

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Then, you need to push the initialize button to load the model, after that, just wait the `Finished` sign appers in the right.

Finally, load the image with `Load Image` button and press `Detect`

Packages