Christian Osendorfer is considering making 20 desktop machines, each with a single 4090, usable from SkyPilot by joining them into one Kubernetes cluster with 20 nodes. He wants to know whether this is a reasonable approach and whether users can schedule distributed training tasks that use 4 GPUs in this setup, which would mean spanning four nodes since each node has only one GPU.
Christian Osendorfer
Asked on Apr 09, 2024