Issues: NVIDIA/gpu-operator
NOTICE: Containers losing access to GPUs with error: "Failed ...
#485
opened Feb 7, 2023 by
cdesiniotis
Open
Issues list
Anyway to find if gpu operator has completed node discovery
#1216
opened Jan 21, 2025 by
SSushmitha8
Is the current GPU Operator also affected with Security Bulletin January 2025, Bulletin ID 5614 and 5599?
#1214
opened Jan 20, 2025 by
tb914
Failed to pull image "nvcr.io/nvidia/driver:550.90.07-debian12" in nvidia-driver-daemonset Pod
#1212
opened Jan 20, 2025 by
utsumi-fj
Operator fails to render a valid daemonset on OCP when using 64K kernel page size.
#1207
opened Jan 16, 2025 by
mvazquezc
Master node has no GPU but worker nodes have gpu, helm install fails
#1204
opened Jan 16, 2025 by
rogersaloo
Precompiled Driver Container for Linux Kernel other than 5.15 does not exist
#1203
opened Jan 16, 2025 by
utsumi-fj
what is the correct way to enable MIG for the GPU card via Gpu operator
#1197
opened Jan 10, 2025 by
okyspace
Why is the operator is designed to run on master/control-plane, and the devicePlugin with toleration?
#1194
opened Jan 5, 2025 by
amir-bialek
Unable to run nsys or CUPTI profiling on K8 cluster with gpu-operator
#1158
opened Dec 9, 2024 by
manepallirajesh
The resource requests and limits are not being applied to the pod as expected.
#1145
opened Nov 28, 2024 by
IndhumithaR
Vulnerability on libssl.so.1.1.1k on gpu-operator-certified.v24.9.0
#1138
opened Nov 25, 2024 by
dario-lab
bug: operator anti-pattern, validator pod deployments cause CrashBackLoop behaviour
#1114
opened Nov 13, 2024 by
justinthelaw
container-toolkit fails to start after upgrading to v24.9.0 on k3s cluster
bug
#1109
opened Nov 7, 2024 by
logan2211