NLP

NLP Projects

This repository collects corpora for a phishing corpus project. A .zip of the merged EnronSent corpus (Styler, 2011) is available here: https://mega.nz/#!dWAhWKpB!sRKfTG0_GL8JXLzoZwzLFn0SJhnCRX3yqtl9-3uBtQc

The corpora are attributed to:

Ocampo, D. (2019). This project will determine which of the five supervised classification machine learning algorithms performs best in detecting phishy emails: diegoocampoh/MachineLearningPhishing. Retrieved from https://github.com/diegoocampoh/MachineLearningPhishing (Original work published 2017)

Radev, D. (2008). CLAIR collection of fraud email. ACL Data and Code Repository, ADCR2008T001. http://aclweb.org/aclwiki

Styler, W. (2011). The EnronSent Corpus. Technical Report 01-2011, University of Colorado at Boulder Institute of Cognitive Science, Boulder, CO. http://wstyler.ucsd.edu/enronsent.html

Repository files:

CLAIR.txt
LIWC2015 Results (enronsent-merged).csv
LIWC2015 Results (phishing and enronsent).csv
Nigerian_emails_sample-CohMetrixOutput-tsv.txt
Nigerian_emails_sample.txt
README.md
Schriner-FinalPaper-LING7800-Detecting Deception using Natural Language Processing Techniques.pdf
emails-enron.mbox
emails-phishing-clean.txt
emails-phishing.mbox
python-readability.ipynb
schriner-homework03.md
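
The two .mbox files in this repository can be read with Python's standard-library mailbox module. The sketch below is one possible way to pull plain-text bodies out of such a file. It is not taken from the project's notebook; the helper name iter_plain_text_bodies is hypothetical, and the fallback to UTF-8 when a message declares no charset (or an unknown one) is an assumption.

```python
import mailbox


def iter_plain_text_bodies(mbox_path):
    """Yield the first text/plain body of each message in an mbox file.

    Hypothetical helper for illustration; not part of the repository.
    """
    # create=False raises an error instead of silently creating an empty file.
    box = mailbox.mbox(mbox_path, create=False)
    try:
        for message in box:
            for part in message.walk():
                if part.get_content_type() != "text/plain":
                    continue
                payload = part.get_payload(decode=True)  # bytes, or None
                if payload is None:
                    continue
                # Assumption: fall back to UTF-8 for missing or unknown charsets.
                charset = part.get_content_charset() or "utf-8"
                try:
                    yield payload.decode(charset, errors="replace")
                except LookupError:
                    yield payload.decode("utf-8", errors="replace")
                break  # keep only the first plain-text part per message
    finally:
        box.close()


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    bodies = list(iter_plain_text_bodies("emails-phishing.mbox"))
    print(len(bodies), "plain-text bodies read")
```

Messages without a text/plain part (for example, HTML-only mail) are skipped, so the number of bodies may be lower than the number of messages in the file.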

johnschriner/NLP