This is active learning demo using Label Studio and LabelStudio ML backend. This demo trains a model for vegetable classification and also model is actively trained from label studio.
According to Human Signal:
"To create annotated training data for supervised machine learning models can be expensive and time-consuming. Active Learning is a branch of machine learning that seeks to minimize the total amount of data required for labeling by strategically sampling observations that provide new insight into the problem. In particular, Active Learning algorithms aim to select diverse and informative data for annotation, rather than random observations, from a pool of unlabeled data using prediction scores. For more about the practice of active learning, read this article written by Heartex CTO on Towards Data Science." ~Label Studio Docs
- Use Label Studio Enterprise Edition to build an automated active learning loop with a machine learning model backend.
- If you use the open source Community Edition of Label Studio, you can manually sort tasks and retrieve predictions to mimic an active learning process.
Tested with:
- Python 3.8
- Fedora 38
https://www.kaggle.com/datasets/misrakahmed/vegetable-image-dataset/download?datasetVersionNumber=1
https://www.kaggle.com/code/theeyeschico/vegetable-classification-using-transfer-learning
Download Training Data from Kaggle references above and extract the data under model-training/Vegetable Images
python3 -m venv venv
source venv/bin/activate
pip install -r requirements-devel.txt
python3 model_training.py
Use RHODS Project with MinIO Server and establish the data connection and launch Notebook.
Open Notebooks : notebooks
source env/bin/activate
label-studio-ml start serving