How close a computer is to being attacked by malware (Kaggle)
This repo can also serve as an example of how to divide a huge dataset into smaller pieces and process them using incremental learning.
Making_train_sets and Making_test_sets split the data into many small datasets, for cases where there is not enough RAM to process the entire dataset at once.
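The splitting idea can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the code from Making_train_sets or Making_test_sets; the function name `split_csv`, the `part_NNNN.csv` file naming, and the default `rows_per_file` are assumptions made for this example.

```python
import csv
from pathlib import Path


def split_csv(source, out_dir, rows_per_file=100_000):
    """Stream `source` row by row and write it out as numbered CSV parts.

    Hypothetical helper: only one row is held in memory at a time, so the
    source file can be much larger than the available RAM. Every part
    starts with a copy of the header row.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    parts = []
    with open(source, newline="") as src:
        reader = csv.reader(src)
        header = next(reader)
        out, writer, count = None, None, 0
        for row in reader:
            # Start a new part for the first row and whenever the current one is full.
            if out is None or count == rows_per_file:
                if out is not None:
                    out.close()
                part = out_dir / f"part_{len(parts):04d}.csv"
                out = open(part, "w", newline="")
                writer = csv.writer(out)
                writer.writerow(header)
                parts.append(part)
                count = 0
            writer.writerow(row)
            count += 1
        if out is not None:
            out.close()
    return parts
```

Calling something like `split_csv("train.csv", "train_parts")` would then leave a folder of small files that can each be loaded on their own. The input file name here is only a placeholder.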
Please download the datasets from Kaggle:
https://www.kaggle.com/c/microsoft-malware-prediction
The other files show several ways to process the data using different machine learning techniques.
The best-performing approach is the neural network.
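For readers new to incremental learning, the general pattern of training on one small dataset at a time can be sketched as below. This is an illustration only and not the model used in this repo: it assumes scikit-learn and NumPy are installed, swaps in a linear `SGDClassifier` (chosen because it supports `partial_fit`) in place of the neural network, and uses synthetic batches generated by a hypothetical helper in place of the small files.

```python
import numpy as np
from sklearn.linear_model import SGDClassifier


def train_incrementally(batches, classes):
    """Fit a linear classifier one batch at a time using partial_fit.

    Only the current batch needs to be in memory; the model keeps its
    weights between calls, so the full dataset is never loaded at once.
    """
    model = SGDClassifier(random_state=0)
    for X, y in batches:
        # All class labels must be declared, since a single batch may not contain every class.
        model.partial_fit(X, y, classes=classes)
    return model


def synthetic_batches(n_batches=20, batch_size=500, seed=0):
    """Hypothetical stand-in for loading the small datasets from disk."""
    rng = np.random.default_rng(seed)
    for _ in range(n_batches):
        X = rng.normal(size=(batch_size, 5))
        y = (X[:, 0] + X[:, 1] > 0).astype(int)
        yield X, y


model = train_incrementally(synthetic_batches(), classes=np.array([0, 1]))
```

In practice each batch would come from loading one of the small files, extracting its feature matrix and labels, and discarding it before the next file is read.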