Skip to content

Unable to contact slurm controller

Hi @gmontane

I cannot cancel these two jobs.
What could be the issue?

[cns44815@dtransfer1 ~]$ squeue 
             JOBID PARTITION     NAME     USER ST       TIME  NODES NODELIST(REASON)
           6233376   archive    dt_cp cns44815  R    1:03:10      1 hsmmover3
           6233064   archive    dt_cp cns44815  R    3:20:41      1 hsmmover2
[cns44815@dtransfer1 ~]$ scancel
scancel: error: No job identification provided
[cns44815@dtransfer1 ~]$ scancel -u cns44815
slurm_load_jobs error: Unable to contact slurm controller (connect failure)
[cns44815@dtransfer1 ~]$ scancel 6233376
scancel: error: Kill job error on job id 6233376: Unable to contact slurm controller (connect failure)
[cns44815@dtransfer1 ~]$ scancel 6233064
scancel: error: Kill job error on job id 6233064: Unable to contact slurm controller (connect failure)
[cns44815@dtransfer1 ~]$