When running the scraper from scratch, a directory articles/https/ appears. Some articles end up under this directory, and I don't think they are matched up in the browse view with the articles outside it. E.g. articles under articles/https//www.nytimes.com/ don't appear alongside those under articles/www.nytimes.com.
This is due to a legacy artifact in models.Article#filename:
    elif ans.startswith('https://'):
        # Terrible hack for backwards compatibility from when https was stored incorrectly,
        # perpetuating the problem
        return 'https:/' + ans[len('https://'):]
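To make the effect of that branch concrete, here is a minimal sketch, not the project's actual code. `legacy_filename` reproduces the quoted https branch; its http branch and the `ValueError` are assumptions about the rest of the method. `normalized_filename` is a hypothetical alternative that drops the scheme entirely, so http and https URLs for the same site would land under the same directory.

```python
def legacy_filename(url: str) -> str:
    """Mimic the quoted https branch. The http branch is an assumption."""
    ans = url.rstrip('/')
    if ans.startswith('http://'):
        # Assumed behavior: plain http URLs are stored without the scheme.
        return ans[len('http://'):]
    elif ans.startswith('https://'):
        # Quoted behavior: a mangled 'https:/' prefix is kept in the path.
        return 'https:/' + ans[len('https://'):]
    raise ValueError('unsupported URL scheme: %r' % url)


def normalized_filename(url: str) -> str:
    """Hypothetical fix: strip either scheme so both map to the same path."""
    ans = url.rstrip('/')
    for scheme in ('https://', 'http://'):
        if ans.startswith(scheme):
            return ans[len(scheme):]
    raise ValueError('unsupported URL scheme: %r' % url)


if __name__ == '__main__':
    for u in ('http://www.nytimes.com/a.html', 'https://www.nytimes.com/a.html'):
        print(u, '->', legacy_filename(u), '|', normalized_filename(u))
```

Switching to something like `normalized_filename` would change where https articles are stored, so any articles already saved under the legacy prefix would need to be moved or looked up under both paths.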