This repository has been archived by the owner on Dec 9, 2018. It is now read-only.

This is the native Python implementation of CPT (Compact Prediction Tree).

NeerajSarwan/CPT

CPT (Compact Prediction Tree)

This is a Python implementation of the CPT algorithm for sequence prediction. The library was written from scratch in Python and, as far as I know, is the first Python implementation of the algorithm.

The repository is also an exercise on my part in coding up a research paper. The library is not perfect: I have intentionally left out some optimisations, such as CFS (compression of frequent sequences). These features will be added to the library later as an ongoing effort.

The library is based on the following two research papers.

  1. Compact Prediction Tree: A Lossless Model for Accurate Sequence Prediction

  2. CPT+: Decreasing the time/space complexity of the Compact Prediction Tree

  • How to use the library?

Nothing needs to be compiled, but make sure Pandas and tqdm are installed in your environment. The specific versions are listed in the file requirements.txt.
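A quick way to confirm that both dependencies are importable before running the library is sketched below. The helper name `missing_modules` is hypothetical and not part of the CPT library. It only checks that the modules can be found, not that their versions match requirements.txt (for that, `pip install -r requirements.txt` is the usual route).

```python
from importlib.util import find_spec


def missing_modules(names):
    """Return the module names from `names` that cannot be found.

    Hypothetical helper for illustration; not part of the CPT library.
    """
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    # pandas and tqdm are the two dependencies named in this README
    missing = missing_modules(["pandas", "tqdm"])
    if missing:
        print("Missing dependencies:", ", ".join(missing))
    else:
        print("All dependencies found.")
```

If anything is reported as missing, install it before importing CPT.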

  • Sample code for training and getting predictions.
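# When inside the CPT folder

from CPT import CPT

model = CPT()

# Load the training and test sequences from CSV files
train, test = model.load_files("./data/train.csv", "./data/test.csv", merge=True)

# Build the model from the training sequences
model.train(train)

# k and n must be set before this call; see the predict method in the CPT module for their meaning
predictions = model.predict(train, test, k, n)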
