Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BUILD: fix GPU arch flags in amdclang build #914

Merged

Conversation

nileshnegi
Copy link
Contributor

What

  • Modified GPU architecture flags for compatibility with AMD GPUs.
  • Add -O3 for improving performance.

Why ?

Incompatibility between explicitly specified GPU arch flags (like --target-offload=gfx90a:xnack+) and --target-offload=native led to build errors on some AMD GPUs.

Signed-off-by: nileshnegi <Nilesh.Negi@amd.com>
@swx-jenkins3
Copy link

Can one of the admins verify this patch?

@Sergei-Lebedev
Copy link
Contributor

ok to test

@Sergei-Lebedev Sergei-Lebedev merged commit c1734db into openucx:master Feb 5, 2024
9 of 11 checks passed
nileshnegi added a commit to nileshnegi/ucc that referenced this pull request Feb 5, 2024
Signed-off-by: nileshnegi <Nilesh.Negi@amd.com>
(cherry picked from commit c1734db)
manjugv pushed a commit that referenced this pull request Feb 16, 2024
Signed-off-by: nileshnegi <Nilesh.Negi@amd.com>
(cherry picked from commit c1734db)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants