Adding prow job config for gcsfuse pytorch dino model for test #1950

Open
wants to merge 4 commits into
base: master

sethiay
Contributor

@sethiay sethiay commented May 17, 2023

This commit adds:

  • A periodic job for the PyTorch DINO model.

@google-oss-prow
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: sethiay
Once this PR has been reviewed and has the lgtm label, please assign mpherman2 for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@google-oss-prow
Contributor

Hi @sethiay. Thanks for your PR.

I'm waiting for a GoogleCloudPlatform member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@mpherman2
Contributor

/ok-to-test

@sethiay
Contributor Author

sethiay commented May 18, 2023

/retest

@sethiay
Contributor Author

sethiay commented May 18, 2023

/retest

@google-oss-prow
Contributor

@sethiay: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

| Test name | Commit | Details | Required | Rerun command |
| --- | --- | --- | --- | --- |
| pull-oss-test-infra-check-create-testgrid-config | e4498c6 | link | true | /test pull-oss-test-infra-check-create-testgrid-config |
| pull-prow-config-validate | e4498c6 | link | true | /test pull-prow-config-validate |

@chases2
Collaborator

chases2 commented May 18, 2023

/assign

@chases2
Collaborator

chases2 commented May 19, 2023

2023/05/18 19:00:48 1 error occurred:
* configuration error for (TestGroup) gcsfuse-pytorch-dino-periodic: Each Test Group must be referenced by at least 1 Dashboard Tab.

Prow jobs in this repository must contain TestGrid annotations or be explicitly opted-out. Try removing the commit that deletes the TestGrid annotations.
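
For reference, a minimal sketch of what a job with TestGrid annotations might look like. The dashboard and tab names below are hypothetical and not taken from this PR; the dashboard would have to exist in the TestGrid config before the job can reference it.

```yaml
periodics:
- name: gcsfuse-pytorch-dino-periodic
  # cluster, schedule, and spec omitted; only the TestGrid part is shown
  annotations:
    testgrid-dashboards: gcsfuse-example   # hypothetical dashboard name
    testgrid-tab-name: pytorch-dino        # hypothetical tab name
  # Alternative: explicitly opt the job out of TestGrid instead
  # annotations:
  #   testgrid-create-test-group: "false"
```

With either form the validation check should no longer see a job that has neither an annotation nor an opt-out, assuming the rest of the config is valid.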

@@ -0,0 +1,33 @@
periodics:
- name: gcsfuse-pytorch-dino-periodic
  cluster: gcsfuse-prow-test
Collaborator

{"component":"checkconfig","file":"k8s.io/test-infra/prow/cmd/checkconfig/main.go:91","func":"main.reportWarning","level":"warning","msg":"invalid periodic job: job configuration for \"gcsfuse-pytorch-dino-periodic\" specifies unknown 'cluster' value \"gcsfuse-prow-test\"","severity":"warning","time":"2023-05-18T19:00:49Z"}

Is this a new cluster? Prow doesn't seem to recognize it. Can you point to where it was configured?

Contributor Author

Yes, this is a new cluster. I just ran https://github.com/GoogleCloudPlatform/oss-test-infra/blob/master/prow/oss/create-build-cluster.sh. Are there any other configuration steps I need to complete? If so, please point me to them.

> Prow jobs in this repository must contain TestGrid annotations or be explicitly opted-out. Try removing the commit that deletes the TestGrid annotations.

I removed the TestGrid annotations because they were giving me the error "No dashboard found". Do I have to create the TestGrid dashboard first? If so, is it enough to add an entry here: https://github.com/GoogleCloudPlatform/oss-test-infra/blob/master/testgrid/config.yaml ?

Collaborator

As far as TestGrid dashboards go, that's correct; add your new dashboard there and then you can use it in an annotation.
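
A rough sketch of such an entry, assuming the file follows the usual TestGrid config layout with a top-level `dashboards` list. The dashboard name is hypothetical, and the existing entries in that file are the pattern to follow (for example, a new dashboard may also need to be listed under a dashboard group).

```yaml
# testgrid/config.yaml (fragment)
dashboards:
- name: gcsfuse-example   # hypothetical; use the same name in the job's testgrid-dashboards annotation
```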

As for the steps you need to take after running that script so that Prow recognizes the new build cluster, I'll look into that.

@mpherman2
Contributor

Ignore my last comment (I deleted it). I misread the PR. Still looking into this.

@@ -0,0 +1,33 @@
periodics:
- name: gcsfuse-pytorch-dino-periodic
  cluster: gcsfuse-prow-test
Contributor

I don't think this is the correct name for the cluster you created. The script typically creates a cluster named build-{TEAM}, so I think yours is build-gcsfuse.

Contributor Author

I changed the name in the script to gcsfuse-prow-test, and I can confirm that a cluster with this name is present in my project as well.

Contributor

I think you may not have edited the script correctly.
It looks like the cluster is configured as build-gcsfuse: https://github.com/GoogleCloudPlatform/oss-test-infra/blob/master/prow/oss/gencred-config/gencred-config.yaml

build-gcsfuse also appears in a lot of the config: https://grep.app/search?q=build-gcsfuse&filter[repo][0]=GoogleCloudPlatform/oss-test-infra

I am not seeing any config for gcsfuse-prow-test
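
If build-gcsfuse is indeed the cluster Prow knows about, the job would reference it roughly like this. This is only a sketch: the name and cluster values come from this thread, and everything else in the job is left out.

```yaml
periodics:
- name: gcsfuse-pytorch-dino-periodic
  cluster: build-gcsfuse   # the cluster name that appears in gencred-config.yaml
  # schedule, annotations, and spec omitted
```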

Contributor Author

Okay, I will create another cluster.

3 participants