condor_vacate_job
Stop running job(s) and relinquish resources back to the host machine(s) executing the job(s).
Synopsis
condor_vacate_job [-help | -version]
condor_vacate_job [OPTIONS] [cluster… | cluster.proc… | user…]
condor_vacate_job [-debug] [-long] [-totals] [-all] [-constraint expression] [-pool hostname[:portnumber] | -name scheddname | -addr “<a.b.c.d:port>”] [-fast]
Description
Vacate job(s) from the host machine(s) where they are currently running; relinquishing resources back to the host machine(s) and returning the job(s) back to idle to restart resource matchmaking and execution elsewhere. Vacating the job causes the condor accountant to stop charging the user in terms of priority. For any given job, only the owner of the job or one of the queue super users (defined by the QUEUE_SUPER_USERS macro) can vacate the job.
Options
- -help
Display usage information.
- -version
Display version information.
- -long
Display result ClassAd.
- -totals
Display success/failure totals.
- -fast
Perform a fast vacate and hard kill the jobs.
- -pool hostname[:portnumber]
Specify a pool by giving the central manager’s host name and an optional port number.
- -name scheddname
Send the command to a machine identified by scheddname.
- -addr “<a.b.c.d:port>”
Send the command to a machine located at “<a.b.c.d:port>”.
- -constraint expression
Vacate all jobs which match the job ClassAd expression constraint.
- -all
Vacate all the jobs in the queue.
- cluster
Vacate all jobs in the specified cluster.
- cluster.process
Vacate the specific job in the cluster.
- user
Vacate jobs belonging to specified user.
General Remarks
If the -name option is specified, the named condor_schedd is targeted for processing. Otherwise, the local condor_schedd is targeted.
Vacated jobs will be sent a soft kill signal (SIGTERM by default or
the value of KillSig in the job’s ClassAd) unless -fast is used.
Using condor_vacate_job on jobs which are not currently running has no effect.
Jobs running in the local or scheduler universe can be vacated. Given these
types of jobs run directly on the Access Point, they will always restart on the same
resources and may begin executing again immediately if there is a low load or number
of jobs submitted to the Access Point.
DAGMan can also be vacated and will enter recovery mode upon restarting. However, all jobs submitted and managed by DAGMan will continue running uninterrupted.
Warning
Do not confuse this tool with condor_vacate. condor_vacate is intended for use by the owners/administrators of an Execution Point to evict jobs from their host machine.
Exit Status
0 - Success
1 - Failure has occurred
Examples
To vacate a specific job:
$ condor_vacate_job 23.0
To vacate a job fast:
$ condor_vacate_job -fast 23.0
To vacate all jobs owned by user Mary:
# condor_vacate_job mary
To vacate all vanilla universe jobs owned by Mary:
# condor_vacate_job -constraint 'JobUniverse == 5 && Owner == "mary"'
See Also
condor_vacate, condor_rm, condor_continue, condor_suspend, condor_hold, condor_release
Availability
Linux, MacOS, Windows