Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Week 03 2025 routine #352

Closed
21 tasks done
kiwixbot opened this issue Jan 20, 2025 · 5 comments
Closed
21 tasks done

Week 03 2025 routine #352

kiwixbot opened this issue Jan 20, 2025 · 5 comments
Assignees
Labels
maint Maintenance tasks

Comments

@kiwixbot
Copy link

kiwixbot commented Jan 20, 2025

Check nodes free space

df -h / && df -h /data
  • create a report in issue comment

Nodes system upgrades

apt update && apt upgrade
  • run systematically the upgrade on bastion, stats, services, storage, demo, mirrors-qa nodes
  • check for and apply important security upgrade on worker nodes asap (imager-worker, ondemand, sisyphus)

(regular workers updates are done separately on a monthly basis for worker nodes to not impact production)

Backups

k8s cluster

  • Check Pod errors or in CrashLoopBackoff
k get pods -A -o wide|grep -E 'Error|Crash'
  • Check Pod restarts
k get pods -A -o wide | pyp -i 'print("\n".join([line for line in l if re.split(r"\s+", line)[4] != "0"]))'
  • Check if k8s should/could be upgraded
curl -s -H "X-Auth-Token: $SCW_SECRET_KEY" https://api.scaleway.com/k8s/v1/regions/fr-par/clusters/$KIWIX_PROD_CLUSTER | jq ".version,.upgrade_available"
curl -s -H "X-Auth-Token: $SCW_SECRET_KEY" https://api.scaleway.com/k8s/v1/regions/fr-par/versions | jq ".versions[].name"

Stats

matomo - stats.kiwix.org

  • Ensure download.kiwix.org stats are being recorded
  • Check whether matomo should be upgraded

Grafana

Projects

Security

Note: this is an automatic reminder intended for the assignee(s).

@kiwixbot kiwixbot added the maint Maintenance tasks label Jan 20, 2025
@rgaudin
Copy link
Member

rgaudin commented Jan 20, 2025

Storage

Machine Filesystem Size Used Avail Use% Use change
bastion / 37G 17G 19G 49% -
stats / 233G 144G 77G 66% +13G
services / 456G 226G 207G 53% +5G
storage / 33T 9.7T 22T 32% +0.2T
imager-worker / 1.9T 281G 1.5T 16% don't care
sisyphus / 233G 20G 201G - don't care
ondemand / 25G 14G 11G 56% -
ondemand /data 216G 202M 205G 1% don't care
demo / 40G 9.5G 28G 26% -2G
demo /data 1.8T 920G 743G 56% don't care

storage node has a lot of packages pending upgrade ; since does include k8s-related ones, I'm leaving it for next monthly routine.

I did upgrade rsync in regard to the recent security issue but there's no running rsyncd on the host.

misc

zimit

We're getting more of those HTTP Protocol error. Is it what it looks like? ie: the crawler not able to work with HTTP/2 ? If so, we should maybe raise the priority of this to browsertrix.

@rgaudin rgaudin closed this as completed Jan 20, 2025
@benoit74
Copy link
Collaborator

Regarding HTTP2, we already confirmed that HTTP2 is supported. To me it is just that more websites are upgrading to HTTP2 (often behind a proxy like Cloudflare).

@rgaudin
Copy link
Member

rgaudin commented Jan 20, 2025

OK ben if its supported, what is this issue reporting?

@benoit74
Copy link
Collaborator

From what I find on the web, this error reports that there has been an error inside the HTTP2 protocol (connection broken, dropped by server, ...). Probably because browser has been blocked by some kind of protection.

@rgaudin
Copy link
Member

rgaudin commented Jan 20, 2025

OK

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
maint Maintenance tasks
Projects
None yet
Development

No branches or pull requests

3 participants