The Oral Cancer Detection project aims to develop a robust machine learning model that leverages advanced image processing techniques to accurately identify signs of oral cancer in images. This project integrates various technologies and frameworks, providing a seamless user experience for both medical professionals and patients.
- 📈 Accurate Detection: Utilizes state-of-the-art deep learning algorithms for precise identification of oral cancer.
- 🖼️ Image Processing: Implements OpenCV for enhanced image preprocessing, ensuring high-quality input for the model.
- 📊 User-Friendly Interface: Built with Streamlit to offer an intuitive interface for users to upload images and view results.
- 🚀 Fast Performance: Optimized for quick processing and real-time feedback.
- 📊 Data Visualization: Includes interactive visualizations using Matplotlib and Graphviz for better understanding of model predictions.
- 🔄 Model Training and Evaluation: Supports training with various datasets and evaluating model performance with scikit-learn.
- ☁️ Cloud Deployment: Enables deployment using Kubernetes for scalability and reliability.
- Introduction
- Problem Statement
- Solution
- Dataset
- Methodologies
- Models Used
- Model Comparisons
- Building Interface
- Deployment
- Results
- Conclusion
- Technologies Used
- Installation
- Future Works
- References
The Oral Cancer Detection project aims to harness the power of machine learning and image processing to accurately detect oral cancer at an early stage. 🦷 Oral cancer is a significant health concern, often leading to severe consequences if not identified promptly. This project seeks to provide a reliable tool that aids medical professionals in diagnosing oral cancer through advanced techniques. 🩺
Utilizing a comprehensive dataset of oral images, this project implements various algorithms to train models capable of distinguishing between healthy and cancerous tissues. 📊 The application features a user-friendly interface, enabling healthcare practitioners to upload images and receive instant feedback on potential cancer detection. With the integration of visualization tools, users can gain insights into the model's predictions and the underlying data. 🔍
By leveraging state-of-the-art technologies, this project not only aims to improve diagnostic accuracy but also to facilitate early intervention, ultimately contributing to better patient outcomes. 🌈
Oral cancer is a major global health issue, accounting for hundreds of thousands of cases annually. Despite medical advancements, early detection of oral cancer remains a challenge, often leading to late-stage diagnoses and poor patient outcomes. 🦷 The lack of accessible and reliable diagnostic tools, particularly in remote and underserved areas, exacerbates this problem.
The need for a solution that allows medical professionals to identify oral cancer at an early stage is critical. 📉 Early detection can significantly improve survival rates and reduce treatment costs. Therefore, this project focuses on building a machine learning-based tool that assists in the early detection of oral cancer through image analysis, addressing both accessibility and diagnostic accuracy. 📲
The Oral Cancer Detection project provides a machine learning-based solution to assist healthcare professionals in detecting oral cancer early. 🦷 By utilizing advanced deep learning techniques, this project processes and analyzes medical images to identify potential cancerous regions in the oral cavity.
This solution includes the following key components:
- 📸 Image Processing: Uses OpenCV to preprocess images, enhancing the clarity and quality of input data for more accurate detection.
- 🤖 Machine Learning Models: Employs cutting-edge deep learning models built using TensorFlow to classify images into cancerous and non-cancerous categories.
- 🖥️ User-Friendly Interface: Features an easy-to-use Streamlit-based interface, enabling users to upload images and receive instant diagnostic results with high accuracy.
- 📊 Data Visualization: Visualizes the prediction results and the areas of interest within the image, making it easier for healthcare professionals to interpret the results.
- ☁️ Cloud-Ready Deployment: The model is scalable and can be deployed using Kubernetes for real-time and widespread use in clinical settings.
This approach not only makes cancer detection faster and more accessible but also enhances diagnostic precision, leading to better patient outcomes and earlier interventions. 🌟
Introducing the Oral Cancer Image Dataset! This dataset comprises 500 oral cancer images and 450 non-cancer oral images, all meticulously labeled for seamless classification. 🦷 The dataset is designed to support research and development in the field of oral cancer detection using advanced machine learning algorithms.
With a balanced representation of cancer and non-cancer samples, it allows researchers to explore innovative approaches to enhance diagnostic accuracy. 🔬 This dataset serves as a valuable resource for the healthcare community, fostering advancements in early detection and intervention for oral cancer. 💡
You can access the dataset here 📂.
- 📥 Data Collection: Medical images are sourced from reliable and reputable datasets, ensuring a comprehensive mix of oral cancer and non-cancer samples. This gives the model high-quality, representative data to learn from.
- 🛠️ Preprocessing: All images are resized to 260x260 pixels with 3 color channels (RGB) and normalized to the range [0, 1] for smoother training. Image augmentation techniques, such as rotation and flipping, are applied to make the model robust to variations and to help prevent overfitting.
- 🧠 Model Selection: Several deep learning architectures are chosen for comparison:
  - CNN (Convolutional Neural Network): A standard deep learning model for image classification.
  - ResNet50: A deeper network that addresses the vanishing gradient problem using skip connections.
  - DenseNet121: A model that efficiently passes gradients between layers using dense connections.
  - EfficientNetB2: A model that balances accuracy and efficiency through compound scaling.
  - VGG19: A popular deep learning model with a simple, uniform architecture known for its performance in image tasks.
- 🎓 Training: Each model is trained on the preprocessed images. During training, the models adjust their weights through backpropagation to minimize the loss function. Training continues for several epochs until the models converge.
- 📊 Evaluation: After training, the models are evaluated based on:
  - Accuracy: The proportion of correctly predicted labels.
  - Speed: How quickly the model processes new data.
  - Memory usage: The amount of system resources required by the model.

  Precision, recall, and F1 score are also calculated to assess how well the models balance false positives and false negatives.
- 📈 Comparison: Once all models are trained and evaluated, their performances are compared. The model that strikes the best balance between accuracy, speed, and resource efficiency is selected for deployment, so that the deployed model is suited to real-world use.
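The preprocessing step described above (resize to 260x260x3, scale to [0, 1], then flip and rotate) can be sketched as follows. This is a minimal illustration, not the project's actual pipeline: the function names are hypothetical, the nearest-neighbour resize is a dependency-free stand-in for an OpenCV call such as `cv2.resize`, and rotation is restricted to multiples of 90 degrees for simplicity.

```python
import numpy as np

TARGET_SIZE = 260  # from the text: 260x260 pixels, 3 RGB channels


def resize_nearest(image, size=TARGET_SIZE):
    """Nearest-neighbour resize (assumed stand-in for cv2.resize)."""
    h, w = image.shape[:2]
    rows = np.arange(size) * h // size  # source row for each output row
    cols = np.arange(size) * w // size  # source column for each output column
    return image[rows[:, None], cols]


def preprocess(image):
    """Resize to 260x260x3 and scale uint8 pixel values into [0, 1]."""
    return resize_nearest(image).astype(np.float32) / 255.0


def augment(image, rng):
    """Random horizontal flip plus a random rotation by a multiple of 90 degrees."""
    if rng.random() < 0.5:
        image = image[:, ::-1]
    k = int(rng.integers(0, 4))
    return np.rot90(image, k)


# Synthetic image used only to exercise the functions; not project data.
rng = np.random.default_rng(42)
raw = rng.integers(0, 256, size=(480, 640, 3), dtype=np.uint8)
x = preprocess(raw)
x_aug = augment(x, rng)
```

Augmentation would normally be applied only to training images, so that validation and test images reflect unmodified inputs.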
The following deep learning models were utilized in this project to compare their performance and select the best model for oral cancer detection:
A Convolutional Neural Network (CNN) is the foundation for most image classification tasks. CNNs are composed of convolutional layers that automatically learn spatial hierarchies of features (such as edges, textures, and objects). In this project:
- Advantages: CNNs are relatively easy to train and excel in capturing local features in images.
- Use Case: It serves as a baseline model, offering a simpler but powerful approach for detecting cancerous tissues in oral images.
- Limitations: While CNNs work well on simpler problems, they may struggle with more complex patterns found in medical data.
ResNet50 (Residual Networks) is a deeper network with 50 layers that incorporates residual connections (or skip connections) to solve the vanishing gradient problem common in deep networks. This makes it highly effective for complex tasks like medical image classification.
- Advantages: The residual connections enable the network to learn much deeper representations without degrading performance.
- Use Case: ResNet50 excels in detecting intricate patterns in medical images, making it a great candidate for identifying cancerous tissues.
- Limitations: As the network becomes deeper, it requires more computation power, increasing the time required for training.
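The idea behind a residual (skip) connection can be sketched in a few lines of NumPy. This is a simplified illustration only: it uses two dense layers as a stand-in for the convolutional bottleneck blocks and batch normalization of the real ResNet50, and the names `residual_block`, `w1`, and `w2` are hypothetical.

```python
import numpy as np


def relu(x):
    return np.maximum(x, 0.0)


def residual_block(x, w1, w2):
    """Compute relu(F(x) + x), where F is two dense layers (stand-in for convolutions)."""
    h = relu(x @ w1)
    f = h @ w2
    return relu(f + x)  # the "+ x" term is the skip connection


rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))

# With all-zero weights F(x) is zero, so the block passes relu(x) straight through.
w1 = np.zeros((8, 8))
w2 = np.zeros((8, 8))
y = residual_block(x, w1, w2)
```

Because the input is added back to the block's output, the gradient always has a direct path through the addition, which is why stacking many such blocks remains trainable.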
DenseNet121 (Densely Connected Convolutional Networks) employs dense blocks, in which each layer receives the feature maps of all preceding layers in the block as input, allowing feature reuse and improving efficiency.
- Advantages: DenseNet121 captures detailed information by reusing features, which can help the model efficiently learn the critical features needed for cancer detection.
- Use Case: Its ability to learn complex features makes it well-suited for cancer detection, as it captures small yet significant features in oral images.
- Limitations: DenseNet can be computationally demanding, especially when dealing with large datasets.
EfficientNetB2 is part of the EfficientNet family, which scales model dimensions—width, depth, and resolution—in a balanced manner, optimizing performance while using fewer parameters.
- Advantages: EfficientNetB2 provides high accuracy with fewer parameters, which is beneficial for resource-constrained environments (e.g., mobile applications or cloud-based deployments).
- Use Case: Its efficiency and accuracy make it an ideal choice for real-time cancer detection tasks where computational resources may be limited.
- Limitations: While it uses fewer parameters, EfficientNetB2 may still require considerable tuning and experimentation to optimize performance on highly complex tasks.
VGG19 is a very deep network with 19 layers, known for its simplicity and high performance in transfer learning tasks. It’s frequently used in medical imaging tasks due to its ability to generalize well from pretrained weights.
- Advantages: VGG19 is straightforward in architecture and powerful when fine-tuned on specific tasks, such as detecting cancer in oral images.
- Use Case: It’s often used for transfer learning, leveraging pretrained weights to adapt quickly to the specific task of oral cancer detection.
- Limitations: VGG19 is resource-intensive and slower compared to other models, which can make training and inference more time-consuming.
Below is a visual representation of the differences in performance across the models used:
For this project, we utilized Streamlit to create an intuitive and user-friendly web interface for our oral cancer detection application. Streamlit is an open-source app framework specifically designed for machine learning and data science projects, allowing developers to quickly build and deploy interactive applications.
- Simplicity: Streamlit’s straightforward API enables rapid development without the need for complex web frameworks.
- Interactivity: It allows dynamic user input, such as uploading images, which the model can then analyze in real-time.
- Integration: Streamlit seamlessly integrates with popular Python libraries, making it ideal for deploying machine learning models and visualizing results.
- Deployment: With built-in support for deploying applications, Streamlit simplifies sharing our project with stakeholders and users, enhancing accessibility.
By leveraging Streamlit, we were able to focus on the model development and analysis while providing a polished, interactive interface for users to engage with the oral cancer detection system.
The project has been deployed on Streamlit Cloud to make the oral cancer detection application accessible to users. The deployment process involved the following steps:
- Pushed the Project to GitHub: The complete codebase was uploaded to a GitHub repository, allowing version control and collaboration.
- Deployed on Streamlit Cloud: Using the GitHub repository, the project was deployed directly to Streamlit Cloud. This allows users to run the application in their web browser without any local setup.
Streamlit Cloud provides a seamless way to host applications, ensuring that users can easily interact with the model and visualize results.
Check the live app here.
In addition to Streamlit Cloud, we also utilized Docker for deployment. The Docker deployment process involved:
- Creating a Dockerfile: We created a Dockerfile that contains the instructions for building the Docker image, including the application's dependencies and configurations. Here is a sample Dockerfile:
# Use the official Python image from the Docker Hub
FROM python:3.8-slim
# Set the working directory
WORKDIR /app
# Copy the requirements.txt file into the container
COPY requirements.txt .
# Install the required Python packages
RUN pip install --no-cache-dir -r requirements.txt
# Copy the rest of the application code into the container
COPY . .
# Specify the command to run the Streamlit app
CMD ["streamlit", "run", "your_app.py", "--server.port=8501", "--server.address=0.0.0.0"]
To build the Docker image and push it to Docker Hub, follow these steps:
- Build the Docker Image: Run the following command in your terminal from the directory containing your Dockerfile:
docker build -t jagadesh086/my_streamlit_app:latest .
After creating the Docker image for the application, we pushed it to Docker Hub to enable easy access and deployment from anywhere. The steps to push the image are as follows:
- Login to Docker Hub:
docker login
- Push the Image:
docker push jagadesh086/my_streamlit_app:latest
To deploy the application in a more scalable environment, we used Kubernetes. We created two files, deployment.yaml and service.yaml, which define how the Docker image is deployed and accessed.
Here’s the configuration for deployment.yaml:
apiVersion: apps/v1
kind: Deployment
metadata:
  name: streamlit-app
  labels:
    app: streamlit
spec:
  replicas: 1
  selector:
    matchLabels:
      app: streamlit
  template:
    metadata:
      labels:
        app: streamlit
    spec:
      containers:
        - name: streamlit-app
          image: jagadesh086/my_streamlit_app:latest
          ports:
            - containerPort: 8501
In order to expose our Streamlit application to the outside world, we defined a Kubernetes Service. This allows users to access the application via a specific port on the cluster.
Here’s the configuration for service.yaml:
apiVersion: v1
kind: Service
metadata:
  name: streamlit-service
spec:
  type: NodePort
  selector:
    app: streamlit
  ports:
    - protocol: TCP
      port: 8501
      targetPort: 8501
      nodePort: 30000  # You can choose any available port from 30000 to 32767
To run your application locally using Minikube, follow these steps:
- Start Minikube: Use the following command to start your Minikube cluster:
minikube start
- Deploy the Application: Apply your Kubernetes deployment and service configuration files:
kubectl apply -f deployment.yaml
kubectl apply -f service.yaml
- Get Service URL: Retrieve the URL to access your Streamlit application:
minikube service streamlit-service --url
The following table summarizes the performance of various deep learning models used for oral cancer detection based on accuracy and probability metrics:
- Each model outputs a probability: a value below 0.5 indicates cancer, while a value above 0.5 indicates non-cancer.
- ResNet50 showed the highest accuracy at 98.2%, with a balanced probability between cancer and non-cancer predictions.
- VGG19 performed well, achieving 96.8% accuracy, while maintaining a similar average probability for both cancer and non-cancer predictions.
- DenseNet121 also showed strong results, with 94% accuracy and relatively even probabilities.
- CNN displayed a solid accuracy of 93.2%, though its average cancer probability was lower compared to other models.
- EfficientNetB2 achieved the lowest accuracy at 85.4%, but its non-cancer probability was the highest, making it more conservative in detecting non-cancer cases.
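The 0.5 threshold convention and the precision, recall, and F1 metrics used in the evaluation can be sketched as below. This is an illustrative sketch: the function names are hypothetical, treating a probability of exactly 0.5 as non-cancer is an assumption (the text does not define that case), and the sample probabilities are made up for demonstration rather than taken from the project's results.

```python
def classify(prob, threshold=0.5):
    """Below the threshold means cancer; otherwise non-cancer (0.5 itself is an assumed tie-break)."""
    return "cancer" if prob < threshold else "non-cancer"


def cancer_metrics(probs, labels, threshold=0.5):
    """Precision, recall, and F1 with 'cancer' treated as the positive class."""
    preds = [classify(p, threshold) for p in probs]
    pairs = list(zip(preds, labels))
    tp = sum(1 for p, y in pairs if p == "cancer" and y == "cancer")
    fp = sum(1 for p, y in pairs if p == "cancer" and y == "non-cancer")
    fn = sum(1 for p, y in pairs if p == "non-cancer" and y == "cancer")
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Made-up values for illustration only; not outputs of the trained models.
probs = [0.10, 0.40, 0.70, 0.90, 0.30, 0.60]
labels = ["cancer", "cancer", "cancer", "non-cancer", "non-cancer", "non-cancer"]
metrics = cancer_metrics(probs, labels)
```

In a screening setting, recall on the cancer class is usually the metric to watch most closely, since a false negative means a missed cancer case.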
In this project, we explored the use of deep learning models for detecting oral cancer from medical images. By evaluating models such as CNN, ResNet50, DenseNet121, EfficientNetB2, and VGG19, we found that:
- ResNet50 achieved the highest accuracy of 98.2%, making it the most reliable model for oral cancer detection in our experiments.
- VGG19 also performed exceptionally well with an accuracy of 96.8%, making it another strong candidate for deployment in real-world applications.
- Models like DenseNet121 and CNN demonstrated solid performance, balancing accuracy with computational efficiency.
- EfficientNetB2 was the most resource-efficient but had the lowest accuracy, indicating that it may be more suitable for cases where computational resources are limited and non-cancer detection is prioritized.
Overall, the models show great potential in assisting healthcare professionals with early detection of oral cancer, which is critical for improving patient outcomes. Moving forward, further fine-tuning and testing on more diverse datasets will help enhance model robustness and reliability.
In this project, we utilized the following technologies:
To run the project locally, follow these steps:
Clone the repository to your local machine.
git clone https://github.com/jagadeshchilla/oral_cancer_detection
cd oral_cancer_detection
Ensure you have Python and pip installed, then install the required dependencies from the requirements.txt file.
pip install -r requirements.txt
Launch the Streamlit app locally using the following command:
streamlit run app.py
This will start the application, which you can access in your browser at http://localhost:8501.
You can also run the application using Docker for a more containerized solution.
- Pull the Docker Image: First, pull the pre-built image from Docker Hub:
docker pull jagadesh086/my_streamlit_app:latest
- Run the Docker Container:
docker run -p 8501:8501 jagadesh086/my_streamlit_app:latest
Access the app in your browser at http://localhost:8501.
If you prefer to deploy the application using Kubernetes, follow these steps:
- Start Minikube: Use the following command to start your Minikube cluster:
minikube start
- Deploy the Application: Apply your Kubernetes deployment and service configuration files:
kubectl apply -f deployment.yaml
kubectl apply -f service.yaml
- Get Service URL: Retrieve the URL to access your Streamlit application:
minikube service streamlit-service --url
As we continue to advance our oral cancer detection project, there are several exciting areas to explore for future improvements and development:
- Integration with Real-time Data Sources: We aim to connect the system to real-time medical databases to automatically update the dataset with the latest oral cancer images, allowing continuous model learning and enhancement.
- Enhancing Model Accuracy: Future work will focus on implementing advanced machine learning techniques such as attention mechanisms and transfer learning from larger medical datasets to further increase model accuracy and reduce false positives and false negatives.
- Support for Other Medical Conditions: Expanding the model to detect other forms of cancer or diseases by incorporating diverse medical datasets. This will broaden the system’s impact and utility in the medical community.
- Mobile Application: Building a mobile-friendly interface to allow easy access for healthcare professionals and patients to use the system on the go.
- Deploying to Multi-cloud Environment: Moving towards a more scalable and distributed deployment strategy using a multi-cloud environment such as AWS, Google Cloud, or Azure for enhanced scalability and fault tolerance.
- Incorporating Explainable AI (XAI): Adding interpretability to the model through explainable AI techniques will allow clinicians to understand how predictions are made and increase trust in AI-powered diagnostics.
- Optimizing for Edge Devices: The model could be optimized for edge computing, enabling deployment on devices with limited computing resources, such as smartphones or portable medical devices, to bring AI-based diagnosis closer to patients in rural areas.
- Improved Augmentation and Preprocessing: Experimenting with advanced image preprocessing techniques and augmentation methods to improve the model's ability to generalize across varied real-world scenarios.
By addressing these areas, we hope to make our solution more robust, accessible, and valuable to the healthcare community, contributing to the fight against oral cancer.
- Oral Cancer Image Dataset
  Source of the dataset used for training and testing our model. Available at: Oral Cancer Dataset
- Convolutional Neural Networks for Visual Recognition
  A foundational paper explaining the use of CNNs in image classification tasks:
  Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). "ImageNet Classification with Deep Convolutional Neural Networks." Advances in Neural Information Processing Systems, 25.
- ResNet: Deep Residual Learning for Image Recognition
  This paper covers the architecture of ResNet, which we used in our comparison.
  He, K., Zhang, X., Ren, S., & Sun, J. (2016). "Deep Residual Learning for Image Recognition." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
- DenseNet: Densely Connected Convolutional Networks
  Explanation of the DenseNet architecture, which is used for feature reuse:
  Huang, G., Liu, Z., Van Der Maaten, L., & Weinberger, K. Q. (2017). "Densely Connected Convolutional Networks." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
- EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
  Description of the EfficientNet architecture used in our project.
  Tan, M., & Le, Q. (2019). "EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks." Proceedings of the International Conference on Machine Learning (ICML).
- VGGNet: Very Deep Convolutional Networks for Large-Scale Image Recognition
  This research focuses on VGGNet, another model we compared.
  Simonyan, K., & Zisserman, A. (2015). "Very Deep Convolutional Networks for Large-Scale Image Recognition." International Conference on Learning Representations (ICLR).
- Streamlit Documentation
  Official documentation for Streamlit, which was used to build the interactive UI: Streamlit Docs
- Docker Documentation
  Information on how to build and run containers for our project using Docker: Docker Docs
- Kubernetes Documentation
  Official Kubernetes documentation for container orchestration: Kubernetes Docs
- TensorFlow Documentation
  Reference for TensorFlow, used in model training: TensorFlow
- Scikit-Learn Documentation
  Documentation for Scikit-Learn, used for various ML-related tasks in this project: Scikit-Learn Docs
You can explore the live version of our Oral Cancer Detection project by visiting the following link:
- Upload an image of an oral scan.
- The model will classify whether the image indicates cancer or non-cancer.
- Probability below 0.5 indicates cancer, while above 0.5 suggests non-cancer.
Feel free to try the app and provide feedback!