Skip to content

The analysis of several vision-based transformers is the main emphasis of this project, which also analyzes their distinctive properties and evaluates how well they work using a common dataset. The study intends to obtain insights into the strengths and shortcomings of various transformer designs created for computer vision tasks.

Notifications You must be signed in to change notification settings

asumandemireriden/vision_based_transfomer_architectures

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

vision_based_transfomer_architectures

The project plans to analyze various transformer models using vision-based approaches with two distinct methodologies. The first methodology is training from scratch, the second method is fine-tuning the models with the chosen dataset. Chosen dataset is Butterfly & Months Image Classification 100 species from Kaggle.[1]

The dataset contains 12594 trains, 500 tests, and 500 validation images. Each image’s size is 224 x 224. There are 100 different classes in the dataset. Determined models to analyze:

  1. ViT(Vision-based Transformer)
  2. Swin Transformers
  3. Data-efficient Image Transformers(DeIT)

3 different methods are applied to implement models:

  • From the Scratch
  • Fine-tuning
  • Feature Extraction

Results:

vit

Figure 1: Result of 3 different methods with ViT

swin

Figure 2: Result of 3 different methods with Swin Transformer

deit

Figure 3: Result of 2 different methods with Transformer

References

[1] Butterfly Image Classification 100 species, [Online].

Available: https://www.kaggle.com/datasets/gpiosenka/butterfly-images40-species

About

The analysis of several vision-based transformers is the main emphasis of this project, which also analyzes their distinctive properties and evaluates how well they work using a common dataset. The study intends to obtain insights into the strengths and shortcomings of various transformer designs created for computer vision tasks.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published