🎯 Ad Click Prediction MLOps Project

A robust end-to-end MLOps project that predicts whether a user will click on an advertisement based on various user behavioral and demographic features.

Training Pipeline Structure

Prediction Pipeline Structure

🌟 Project Overview

This project implements a complete MLOps pipeline for ad click prediction, following machine learning operations best practices: automated data ingestion, data validation, data transformation, model training, evaluation, and deployment. The system uses MongoDB for data storage and AWS for the model registry and deployment, supports data version control and performance tracking, and includes CI/CD pipelines.

🎲 Features Used for Prediction

Age
Gender
Device Type
Ad Position
Browsing History
Time of Day
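
As an illustration of how one input record built from the features above might be represented, here is a minimal sketch. The class name `AdClickFeatures`, the field names, and the field types are assumptions made for this example and are not taken from the project's code.

```python
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class AdClickFeatures:
    """One prediction request. Field names and types are assumed, not the project's."""

    age: float
    gender: str
    device_type: str
    ad_position: str
    browsing_history: str
    time_of_day: str

    def __post_init__(self) -> None:
        # Reject obviously invalid input before it reaches the model.
        if self.age < 0:
            raise ValueError("age must be non-negative")


def to_row(features: AdClickFeatures) -> dict:
    """Convert a record to a plain dict, keeping the field order declared above."""
    return asdict(features)
```

A record built this way can be passed to whatever preprocessing step expects a column-name-to-value mapping.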

🏗️ Project Architecture

The project follows a modular and scalable architecture with the following components:

Data Ingestion 📥
- MongoDB integration for data storage and retrieval
- Automated data extraction and transformation pipeline
- Data validation and quality checks
Data Validation ✅
- Schema validation using YAML configuration
- Data drift detection
- Automated validation reports
Data Transformation 🔄
- Feature engineering pipeline
- Data preprocessing and standardization
- Automated transformation artifacts
Model Training 🧠
- Automated model training pipeline
- Hyperparameter optimization
- Model performance logging
Model Evaluation 📊
- Automated performance metrics calculation
- Model comparison with existing production model
- AWS S3 integration for model registry
Model Deployment 🚀
- Containerized deployment using Docker
- AWS ECR for container registry
- CI/CD pipeline using GitHub Actions
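
The schema validation step listed under Data Validation can be sketched roughly as follows. The project keeps its real schema in a YAML file under `configs/`; the in-code `SCHEMA` dictionary, its column names, and the `validate_record` helper below are hypothetical stand-ins used only to show the idea.

```python
from typing import Any

# Hypothetical schema: column name -> expected Python type.
# In the project this information would come from the YAML schema file.
SCHEMA: dict[str, type] = {
    "age": int,
    "gender": str,
    "device_type": str,
    "ad_position": str,
    "browsing_history": str,
    "time_of_day": str,
}


def validate_record(record: dict[str, Any], schema: dict[str, type]) -> list[str]:
    """Return a list of human-readable problems; an empty list means the record is valid."""
    errors: list[str] = []
    for column, expected in schema.items():
        if column not in record:
            errors.append(f"missing column: {column}")
        elif not isinstance(record[column], expected):
            actual = type(record[column]).__name__
            errors.append(f"{column}: expected {expected.__name__}, got {actual}")
    for column in record:
        if column not in schema:
            errors.append(f"unexpected column: {column}")
    return errors
```

Returning a list of problems rather than raising on the first one makes it straightforward to write all findings into a validation report.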

🛠️ Tech Stack

Python 3.10
MongoDB Atlas - Data Storage
AWS Services:
- S3 (Model Registry)
- ECR (Container Registry)
- EC2 (Deployment)
Docker - Containerization
GitHub Actions - CI/CD Pipeline
FastAPI - Web Application

🚀 Getting Started

Clone the repository:

git clone https://github.com/bobinsingh/Ad-Click-Prediction-MLOps.git

Create and activate a conda environment:

conda create -n Ad python=3.10 -y
conda activate Ad

Install requirements:

pip install -r requirements.txt

Set up MongoDB connection:

export MONGODB_URL="your_mongodb_connection_string"

Set up AWS credentials:

export AWS_ACCESS_KEY_ID="your_access_key"
export AWS_SECRET_ACCESS_KEY="your_secret_key"
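
Before running any pipeline, it can help to confirm that the variables exported above are actually set. The following is a small sketch of such a check; the helper name `missing_env_vars` is hypothetical and not part of the project.

```python
import os
from collections.abc import Mapping

# The three variables exported in the setup steps above.
REQUIRED_ENV_VARS = ("MONGODB_URL", "AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY")


def missing_env_vars(environ: Mapping[str, str] = os.environ) -> list[str]:
    """Return the required variables that are unset or empty, in declaration order."""
    return [name for name in REQUIRED_ENV_VARS if not environ.get(name)]


if __name__ == "__main__":
    missing = missing_env_vars()
    if missing:
        raise SystemExit(f"Missing environment variables: {', '.join(missing)}")
    print("All required environment variables are set.")
```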

💻 Project Structure

├── artifacts/             # Training artifacts and model files
├── dataset/               # Local copy of the dataset used in this project
├── configs/               # Schema and model config files
├── src/
│   ├── cloud/             # Files for AWS connection & storage
│   ├── components/        # Core pipeline components
│   ├── config/            # Files related to the database
│   ├── constants/         # Central file for all constants used
│   ├── data/              # Project data handler
│   ├── docs/              # Documents related to the project
│   ├── entities/          # Artifact, config, and model-related entities
│   ├── exceptions/         # Custom exception handling
│   ├── logging/           # Logging configuration
│   ├── pipelines/         # Training & Prediction pipeline
│   ├── tests/             # Test pipeline
│   └── utils/            # Utility functions
├── static/                # Static files for web application
├── templates/             # HTML templates
├── app.py                # FastAPI application
├── Dockerfile            # Docker configuration
├── requirements.txt      # Project dependencies
└── setup.py             # Project setup configuration

🔄 MLOps Pipeline

Data Pipeline:
- Automated data ingestion from MongoDB
- Data validation and quality checks
- Feature engineering and transformation
Training Pipeline:
- Model training with latest data
- Performance evaluation
- Model versioning and registry
Deployment Pipeline:
- Automated Docker image creation
- Push to AWS ECR
- Deployment to EC2 instance
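
The step where a newly trained model is compared with the one in production can be reduced to a simple promotion rule. The sketch below is an assumption about how such a rule might look: the function name `should_promote`, the use of a single score, and the `min_improvement` threshold of 0.02 are illustrative and are not taken from the project.

```python
from typing import Optional


def should_promote(
    candidate_score: float,
    production_score: Optional[float],
    min_improvement: float = 0.02,
) -> bool:
    """Decide whether the candidate model should replace the production model.

    production_score is None when no production model exists yet, in which
    case the candidate is always promoted.
    """
    if production_score is None:
        return True
    # Require a clear margin so that noise in the metric does not trigger a swap.
    return candidate_score - production_score > min_improvement
```

For example, `should_promote(0.85, 0.80)` returns `True`, while `should_promote(0.81, 0.80)` returns `False` because the gain is below the assumed threshold.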

🌐 Web Application

The project includes a web interface for:

Making real-time ad click predictions
Triggering model training
Monitoring model performance

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.
