fix: add retries for .taskcluster.yml fetches #30

Merged
merged 2 commits into from
Jul 17, 2024

@bhearsum bhearsum commented Jul 16, 2024

Fetching .taskcluster.yml intermittently fails with a 404 for reasons we haven't identified, and it keeps breaking deployments.

I tried to debug the 404s a bit but didn't get anywhere; I don't think it's worth digging into more unless we see it with retries.

@bhearsum bhearsum force-pushed the pr-policy-failure branch 4 times, most recently from 5e68a86 to db1cd30 Compare July 16, 2024 19:16
@bhearsum bhearsum changed the title wip: deal with pull request policy failures fix: add retries for .taskcluster.yml fetches Jul 16, 2024
@bhearsum bhearsum force-pushed the pr-policy-failure branch from db1cd30 to abe6f8e Compare July 16, 2024 19:21
@bhearsum bhearsum marked this pull request as ready for review July 16, 2024 19:24
@bhearsum bhearsum requested a review from a team as a code owner July 16, 2024 19:24

@hneiva hneiva left a comment

LGTM

raise e
client = RetryClient(
    client_session=aiohttp_session(),
    retry_options=ExponentialRetry(attempts=5, statuses={404}),

It seems odd to me that a 404 would be intermittent; are we sure that's what's happening?

We should probably retry on 5xx though, if we're adding retries...
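
For readers unfamiliar with the pattern in the snippet under review, here is a minimal standard-library sketch of retrying a fetch with exponential backoff. It is not the PR's code: `fetch_with_retries`, `make_flaky_fetch`, and the `(status, body)` fetcher shape are hypothetical names chosen for illustration, and the defaults (five attempts, retry on 404) only mirror the values visible in the diff.

```python
import asyncio
from typing import Awaitable, Callable, Optional, Set, Tuple

# Hypothetical fetcher shape: an async callable returning (HTTP status, body).
Fetch = Callable[[], Awaitable[Tuple[int, str]]]


async def fetch_with_retries(
    fetch: Fetch,
    attempts: int = 5,
    retry_statuses: Optional[Set[int]] = None,
    start_delay: float = 0.1,
    factor: float = 2.0,
) -> Tuple[int, str]:
    """Call `fetch` up to `attempts` times, sleeping longer after each retryable status."""
    if retry_statuses is None:
        retry_statuses = {404}  # assumption: mirrors statuses={404} in the diff
    delay = start_delay
    status, body = await fetch()
    for _ in range(attempts - 1):
        if status not in retry_statuses:
            break  # success or a non-retryable status: stop retrying
        await asyncio.sleep(delay)
        delay *= factor  # exponential backoff: each wait is `factor` times the last
        status, body = await fetch()
    return status, body


def make_flaky_fetch(failures: int) -> Fetch:
    """Build a fake fetcher that returns 404 `failures` times, then 200."""
    calls = {"n": 0}

    async def fetch() -> Tuple[int, str]:
        calls["n"] += 1
        if calls["n"] <= failures:
            return 404, ""
        return 200, "version: 1"

    return fetch
```

With a fetcher that fails twice, the third call succeeds and its result is returned. With a fetcher that keeps failing, the caller gets the last 404 back after the fifth attempt.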

Contributor Author

> It seems odd to me that a 404 would be intermittent; are we sure that's what's happening?

As far as I can tell, the only way we can ever end up returning None from tcyml.get is by hitting this line, which can only happen on a 404.

> We should probably retry on 5xx though, if we're adding retries...

Somewhat confusingly, 5xx responses are retried even when statuses are passed:

> Important: by default all 5xx responses are retried + statuses you specified as statuses param. If you will pass retry_all_server_errors=False than you can manually set what 5xx errors to retry.

I'll add a comment to this effect.
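
The rule quoted above can be modelled in a few lines. This is a sketch of the documented behaviour only, not the library's implementation: `should_retry` is a hypothetical helper, and treating "5xx" as the range 500 to 599 is an assumption.

```python
from typing import Iterable


def should_retry(
    status: int,
    statuses: Iterable[int] = (),
    retry_all_server_errors: bool = True,
) -> bool:
    """Model the documented rule: retry listed statuses, plus every 5xx unless opted out."""
    if status in set(statuses):
        return True  # explicitly requested, e.g. statuses={404}
    # Assumption: "5xx" means 500 through 599 inclusive.
    return retry_all_server_errors and 500 <= status <= 599
```

Under this model, passing `statuses={404}` still retries a 503, which is why the diff does not need to list 5xx codes. Passing `retry_all_server_errors=False` restricts retries to the listed statuses.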

@bhearsum bhearsum force-pushed the pr-policy-failure branch 2 times, most recently from d9e3fd2 to abfab52 Compare July 17, 2024 14:06
@bhearsum bhearsum requested a review from jcristau July 17, 2024 14:12

@jcristau jcristau left a comment

Intermittent 404s still seem rather wonky, but I don't have a better idea so let's go with this.

bhearsum added 2 commits July 17, 2024 10:51
We get this intermittently for whatever reason, and it's constantly busting deployments.
@bhearsum bhearsum force-pushed the pr-policy-failure branch from abfab52 to 6b078bd Compare July 17, 2024 14:54
@bhearsum bhearsum merged commit 5f3f923 into mozilla-releng:main Jul 17, 2024
11 checks passed