Skip to content

Commit

Permalink
run make manifests
Browse files Browse the repository at this point in the history
  • Loading branch information
samos123 committed Nov 2, 2024
1 parent da2f09c commit e11ebcf
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions manifests/models/llama-3.1-70b-instruct-awq-int4-gh200.yaml
Original file line number Diff line number Diff line change
@@ -1,3 +1,4 @@
# Source: models/templates/models.yaml
apiVersion: kubeai.org/v1
kind: Model
metadata:
Expand All @@ -13,5 +14,4 @@ spec:
- --enable-prefix-caching
- --disable-log-requests
targetRequests: 50
minReplicas: 1
resourceProfile: nvidia-gpu-gh200:1
resourceProfile: nvidia-gpu-gh200:1

0 comments on commit e11ebcf

Please sign in to comment.