This is a modified version of the Webpage-Similarity project. With the addition of 190 more wikipedia pages, a more efficient method of data store is required. The main focus of this project is to integrate persistent data stores and switch the similarity metric to TF-IDF.