Skip to content

Latest commit

 

History

History
66 lines (51 loc) · 3.05 KB

README.md

File metadata and controls

66 lines (51 loc) · 3.05 KB

Object Detection with YOLOS


Banner

Object detection is a computer vision technique that identifies and classifies a particular object in a particular setting. The main goal of object detection is to scan digital images or real-life scenarios to locate instances of every object, separate them, and analyze their necessary features for real-time predictions. Object detection algorithms typically leverage machine learning or deep learning to produce meaningful results. Object detection is used in various domains, such as image annotation, vehicle counting, activity recognition, face detection, face recognition, video object co-segmentation, etc.

  • In this project, I aim to demonstrate how to use YOLOS models for object detection on images using Gradio, a Python library that allows us to build and share web applications for our machine learning models or data science workflows.

  • I also used some predefined YOLOS models from the Hugging Face hub as our data sources. We use gr.components.Dropdown() to create a dropdown component that can let users choose from different YOLOS models available on the hub.

  • The models used are:

    • yolos-tiny
    • yolos-small
    • yolos-base
    • yolos-small-300
    • yolos-small-dwr
  • These models have different sizes and performance trade-offs. You can find more details about these models on their respective pages on the hub.


Installation

  • To run this project, you need to install some dependencies:

    • Python 3
    • PyTorch
    • Transformers
    • Gradio
    • OpenCV
    • Numpy
  • You can install these dependencies using pip:

    pip install -r requirements.txt


Usage

  • To run this project, you need to execute the app.py script: python main.py

Examples

  • Here are some examples of object detection using different YOLOS models and inputs:

Girl in a jacket


Girl in a jacket

The application can be accessed here: MyApp


Report

  • The project report is available here. It provides a detailed explanation of the methodologies used, results obtained, and conclusions drawn from this project.

Future Work

  • Adding more features and functionalities to our web application, such as downloading, sharing, or editing the results.
  • Evaluating the performance and quality of our web application using metrics and feedback.
  • Adding more YOLOS models from the Hugging Face hub or other sources.
  • Adding more input and output components from Gradio or other libraries.

License

  • This project is licensed under the MIT License. 📝