skypilot-users

How to specify the amount of CPU and memory to use for each job/exec in a cluster?

I have launched a cluster with 3 nodes on AWS. When I run sky exec, the resources section shows 2x [CPU:0.5]. Does this mean it is only using 0.5 CPUs? I tried passing the --cpus parameter directly in the command, but it was ignored. Zongheng mentioned setting OMP_NUM_THREADS, but I'm not sure how to do that. Can you provide guidance on how to specify CPU and memory usage for each job/exec in a cluster?


Kenady Inampudi

Asked on Feb 26, 2024

  • The [CPU:0.5] shown by sky exec does not cap the task at half a CPU; the task should be able to use all CPUs on its node in the cluster.
  • The --cpus parameter is currently ignored as a scheduling constraint.
  • Setting the OMP_NUM_THREADS environment variable might help the task use all CPUs on the node.
  • To specify CPU and memory usage for each job/exec, you may need to explore other configuration options or wait for the issue tracking --cpus to be resolved.
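
The OMP_NUM_THREADS suggestion above can be sketched as follows. This is a minimal illustration, not SkyPilot's own mechanism: the helper name configure_threads and its requested argument are hypothetical, and it assumes the job is a Python script whose numeric libraries use OpenMP and read OMP_NUM_THREADS when they are first loaded.

```python
import os


def configure_threads(requested=None):
    """Set OMP_NUM_THREADS for this process and return the value chosen.

    Hypothetical helper: `requested` is the thread count you want; if it is
    None, all CPUs visible on the node are used. The value is clamped to the
    range [1, number of CPUs on the node].
    """
    available = os.cpu_count() or 1  # os.cpu_count() may return None
    if requested is None:
        n = available
    else:
        n = max(1, min(requested, available))
    # OpenMP-based libraries typically read this variable once, when they
    # initialize, so set it before importing them.
    os.environ["OMP_NUM_THREADS"] = str(n)
    return n


# Call this at the very top of the job script, before importing libraries
# such as numpy or torch.
n_threads = configure_threads()
print(f"OMP_NUM_THREADS set to {n_threads}")
```

The variable can also be exported in the shell before the command runs, or declared in the envs section of a SkyPilot task YAML. Either way it only controls how many threads OpenMP-based code starts; it does not reserve CPUs or limit memory.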
Feb 28, 2024 (edited)