skypilot-users

How to fix streaming logs not displaying anything in SkyPilot?

My streaming logs in SkyPilot don't display anything. Even a simple echo command takes a few minutes to run and then fails. I have tried reinstalling SkyPilot, but the issue persists. Any suggestions on how to fix this?


Jason Krone

Asked on Feb 03, 2024

The issue might be caused by a ray version mismatch. SkyPilot relies on ray==2.4.1 to schedule jobs on the remote cluster, so having ray 2.6.3 installed instead can break the SkyPilot runtime. To fix this, you can create a new conda environment on the remote machine and install ray==2.4.1 in that environment. Here's an example of how to do it:

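setup: |
  # Reuse the environment if it already exists
  conda activate my-env
  if [ $? -eq 0 ]; then
    echo 'conda env exists'
  else
    # Set up the environment
    conda create -n my-env python=3.10 -y
    conda activate my-env
  fi
  # Install the project's dependencies into my-env
  pip install -r requirements/requirements.txt

run: |
  # Run the job inside the same environment
  conda activate my-env
  echo "hello"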
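
If you want to confirm which ray version a given environment actually has, a small check like the one below may help. It is only a sketch: the helper names `installed_version` and `describe_ray` are made up for this example, the expected version is simply the one mentioned above, and you would need to run it with the Python interpreter of whichever environment you want to inspect on the remote machine.

```python
from importlib import metadata

# Version named in the answer above; adjust if your SkyPilot release differs.
EXPECTED_RAY = "2.4.1"


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def describe_ray(found, expected=EXPECTED_RAY):
    """Summarize whether the version that was found matches the expected one."""
    if found is None:
        return "ray is not installed in this environment"
    if found == expected:
        return f"ray=={found} matches the expected version"
    return f"ray=={found} found, but ray=={expected} is expected"


if __name__ == "__main__":
    print(describe_ray(installed_version("ray")))
```

Running it once in the base environment and once after `conda activate my-env` should show whether the two environments carry different ray versions.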

Additionally, it is recommended to install torch<2.2 rather than the latest torch==2.2, because the latest version can cause a segmentation fault on the VM image.
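
To check whether an installed torch satisfies the torch<2.2 bound, something along these lines could work. It is a rough sketch, not a full PEP 440 comparison: `release_tuple` and `is_below` are hypothetical helpers that only look at the leading numeric part of the version, so a local suffix such as `+cu121` is ignored.

```python
import re
from importlib import metadata


def release_tuple(version):
    """Return the leading numeric release segment as a tuple of ints."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def is_below(version, upper="2.2"):
    """Return True if the release segment of `version` is strictly below `upper`."""
    return release_tuple(version) < release_tuple(upper)


if __name__ == "__main__":
    try:
        found = metadata.version("torch")
    except metadata.PackageNotFoundError:
        print("torch is not installed in this environment")
    else:
        status = "satisfies" if is_below(found) else "does not satisfy"
        print(f"torch=={found} {status} torch<2.2")
```

If the check fails, pinning `torch<2.2` in the requirements file used by the setup step is one way to keep pip from picking the newer release.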

Feb 05, 2024