Skip to content

Commit 2456c45

Browse files
helm/deploy: add GKE instructions
Signed-off-by: Jordi Massaguer Pla <[email protected]>
1 parent 69604ce commit 2456c45

File tree

1 file changed

+50
-0
lines changed

1 file changed

+50
-0
lines changed

deploy/helm/README.md

Lines changed: 50 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -440,6 +440,56 @@ More info at:
440440
https://github.com/Project-MONAI/monai-deploy-informatics-gateway/blob/develop/docs/api/rest/config.md
441441
442442
443+
## Deploying in Google Kubernetes Engine
444+
445+
Check you have enough GPU quota in the zone you want to deploy. I have
446+
tested this in europe-west4-a and I had to request to increase the quota
447+
to 1 nvidia-tesla-a100 GPU.
448+
449+
Create a standard cluster and name it monai-deploy-1
450+
451+
> Note you could create an autopilot cluster, so that GPU nodes are
452+
deployed automatically. Be aware that autopilot needs as much quota
453+
as nodes you have multplied by the GPUs you request. With the standard
454+
autopilot cluster configuration, I was getting 11 nodes, and so it was
455+
requesting 11 GPUs, which was way higher than my quota.
456+
457+
Configure your cluster:
458+
459+
```gcloud container clusters get-credentials monai-deploy-1```
460+
461+
Check your cluster:
462+
463+
```kubectl cluster-info```
464+
465+
Edit the argo template and add the node selector:
466+
467+
```
468+
metadata: {}
469+
+ nodeSelector:
470+
+ cloud.google.com/gke-accelerator: nvidia-tesla-a100
471+
container:
472+
```
473+
474+
Add a node pool and inside that pool a node with 1 GPU.
475+
476+
Install drivers with:
477+
478+
```kubectl apply -f https://raw.githubusercontent.com/GoogleCloudPlatform/container-engine-accelerators/master/nvidia-driver-installer/cos/daemonset-preloaded-latest.yaml```
479+
480+
More info at https://cloud.google.com/kubernetes-engine/docs/how-to/gpus#console
481+
482+
Then, you will need a bigger machine for the argo workflows. You need
483+
to create another pool with one node and this time you can select type
484+
e2-standard-2.
485+
486+
Now, you can run all the helm and kubectl commands above.
487+
488+
```Warning: Using GKE with GPUs will cost you money, be careful with
489+
that. ```
490+
491+
492+
443493
[0] https://drive.google.com/file/d/1d8Scm3q-kHTqr_-KfnXH0rPnCgKld2Iy/view?usp=sharing
444494
a DICOM dataset that was converted to DICOM from Medical Decathlon
445495
training and validation images (see https://github.com/Project-MONAI/monai-deploy/tree/main/deploy/monai-deploy-express#running-a-monai-deploy-workflow)

0 commit comments

Comments
 (0)