Releases: AndyTheFactory/FakeNewsDataset
Releases · AndyTheFactory/FakeNewsDataset
Initial release
a consolidated and cleaned up version of the opensources Fake News dataset, classified into 12 classes: reliable, unreliable, political, bias, fake, conspiracy, rumor clickbait, junk science, satire, hate and unknown. The articles were scraped between the end of 2017 and the beginning of 2018 from various news websites, totaling 647 distinct sources
The extracted file is 20 GB large
Label | Nr Records |
---|---|
reliable | 1,807,323 |
political | 96,8205 |
bias | 769,874 |
fake | 762,178 |
conspiracy | 494,184 |
rumor | 375,963 |
unknown | 230,532 |
clickbait | 174,176 |
unreliable | 104,537 |
satire | 84,735 |
junksci | 79,099 |
hate | 64,763 |
--- | ---- |
total | 5,915,569 |