Users’ Manual¶
- HTCondor Quick Start Guide
- Welcome and Introduction to HTCondor
- Running a Job: the Steps To Take
- Submitting a Job
- Sample submit description files
- Submitting many similar jobs with one queue command
- Variables in the Submit Description File
- Including Submit Commands Defined Elsewhere
- Using Conditionals in the Submit Description File
- Function Macros in the Submit Description File
- About Requirements and Rank
- Submitting Jobs Using a Shared File System
- Jobs That Require Credentials
- Jobs That Require GPUs
- Interactive Jobs
- Submitting Lots of Jobs
- Submitting Jobs Without a Shared File System: HTCondor’s File Transfer Mechanism
- Managing a Job
- Checking on the progress of jobs
- Peeking in on a running job’s output files
- Starting an interactive shell next to a running job on a remote machine
- Removing a job from the queue
- Placing a job on hold
- Changing the priority of jobs
- Why is the job not running?
- Job in the Hold State
- In the Job Event Log File
- Job Termination
- Job Completion
- Summary of all HTCondor users and their jobs
- Automatically managing a job
- Services for Running Jobs
- Priorities and Preemption
- DAGMan Workflows
- Describing Workflows with DAGMan
- Node Job Submit File Contents
- DAG Submission
- File Paths in DAGs
- DAG Monitoring
- Editing a Running DAG
- Removing a DAG
- Suspending a Running DAG
- Advanced Features of DAGMan
- The Rescue DAG
- DAG Recovery
- Visualizing DAGs with dot
- Capturing the Status of Nodes in a File
- A Machine-Readable Event History, the jobstate.log File
- Status Information for the DAG in a ClassAd
- Managing Large Numbers of Jobs with DAGMan
- Workflow Metrics
- DAGMan and Accounting Groups
- Matchmaking with ClassAds
- Choosing an HTCondor Universe
- Java Applications
- Parallel Applications (Including MPI Applications)
- Virtual Machine Applications
- Docker Universe Applications
- Self-Checkpointing Applications
- Time Scheduling for Job Execution
- Special Environment Considerations
- Potential Problems