Skip to content

examples: add gpu-operator kind cluster walkthrough#71

Open
shivakunv wants to merge 1 commit into
mainfrom
kind_demo
Open

examples: add gpu-operator kind cluster walkthrough#71
shivakunv wants to merge 1 commit into
mainfrom
kind_demo

Conversation

@shivakunv

@shivakunv shivakunv commented May 4, 2026

Copy link
Copy Markdown

@shivakunv shivakunv force-pushed the kind_demo branch 3 times, most recently from 6cc8709 to 56f4c3f Compare May 6, 2026 06:02
Signed-off-by: Shiva Kumar (SW-CLOUD) <shivaku@nvidia.com>
@shivakunv shivakunv self-assigned this May 6, 2026
@shivakunv shivakunv marked this pull request as ready for review May 6, 2026 06:18
@shivakunv shivakunv removed their assignment Jun 9, 2026
--wait \
--timeout "${HELM_TIMEOUT}" \
--set cdi.enabled=true \
--set driver.enabled=false \

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does this also need --toolkit.enabled=false? My understanding is nvkind cluster create already configures toolkit.


log "Installing Go ${GO_VERSION}"
ensure_base_packages
curl -fsSL -o /tmp/go.tgz "https://go.dev/dl/go${GO_VERSION}.linux-amd64.tar.gz"

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We're hardcoding to amd64. What about arm64 hosts?

restartPolicy: Never
containers:
- name: cuda-container
image: nvcr.io/nvidia/k8s/cuda-sample:vectoradd-cuda10.2

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we use a newer vectoradd image?

Comment thread README.md

And install the `gpu-operator` as follows:
```bash
helm upgrade -i --kube-context=kind-${KIND_CLUSTER_NAME} --namespace gpu-operator --create-namespace --wait nvidia-gpu-operator nvidia/gpu-operator --set cdi.enabled=true --set driver.enabled=false --set operator.runtimeClass=nvidia

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: let's use multi-line here.


The script will install or configure these tools if they are missing:

- Go `1.24.3` by default, or `NVKIND_GO_VERSION`

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why 1.24.3 by default? Why not newer Go?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Running the GPU Operator on a kind cluster

3 participants