condor_annex

Add cloud resources to the pool.

Synopsis

condor_annex -help

condor_annex [-aws-region <region>] -setup [FROM INSTANCE|[/full/path/to/access/key/file [/full/path/to/secret/key/file]]]

condor_annex [-aws-on-demand ] -annex-name <name of the annex> -count <integer number of instances> [-aws-on-demand-* ] [common options ]

condor_annex [-aws-spot-fleet ] -annex-name <name of the annex> -slots <integer weight> [-aws-spot-fleet-* ] [common options ]

condor_annex -annex-name <name of the annex> -duration hours

condor_annex [-annex-name <name of the annex>] -status [-classad ]

condor_annex -check-setup

condor_annex <condor_annex options> status <condor_status options>

Description

condor_annex adds clouds resources to the pool. (“The pool” is determined in the usual manner for HTCondor daemons and tools.) This version supports only Amazon Web Services (‘AWS’). To add “on-demand” instances, use the third form listed above; to add “spot” instances, use the fourth. For an explanation of terms, consult either the HTCondor manual in the Cloud Computing chapter or the AWS documentation.

Using condor_annex with AWS requires a one-time setup procedure performed by invoking condor_annex with the -setup flag (the second form listed above). You may check if this procedure has been performed with the -check-setup flag (the seventh form listed above). If you use the setup flag on an instance whose role gives it sufficient privileges, you may, instead of specifying your API keys, pass FROM INSTANCE to -setup to ask condor_annex to use the instance’s role credentials.

To reset the lease on an existing annex, invoke condor_annex with only the -annex-name option and -duration flag (the fifth form listed above).

To determine which of the instances previously requested for a particular annex are not currently in the pool, invoke condor_annex with the -status flag and the -annex-name option (the sixth form listed above). The output of this command is intended to be human-readable; specifying the -classad flag will produce the same information in ClassAd format. If you omit -annex-name, information for all annexes will be returned.

Starting in 8.7.3, you may instead invoke condor_annex with status as a command argument (the eighth form listed above). This will cause condor_annex to use condor_status to present annex instance data. Arguments and options on the command line after status will be passed unmodified to condor_status, but not all arguments and options will behave as expected. (See below.) condor_annex will construct an ad for each annex instance and pass that information to condor_status; condor_status will (unless you specify otherwise using its command line) query the collector for more information about the instances. Information from the collector will be presented as usual; instances which did not have ads in the collector will be presented last, in their own table. These instances can not be presented in the usual way because the annex instance ads generated by condor_annex do not (and can not) have the same information in them as ads generated by a condor_startd running in the instance. See the condor_status manual page for details about the “merge” mode of condor_status used by this command argument. Note that both condor_annex and condor_status have -annex-name options; if you’re interested in a particular annex, put this flag on the command line before the status command argument to avoid confusing results.

Common options are listed first, followed by options specific to AWS, followed by options specific to AWS’ on-demand instances, followed by options specific to AWS’ spot instances, followed by options intended for use by experts.

Options

-help
Print a usage reminder.
-setup [/full/path/to/access/key/file/full/path/to/secret/key/file]
Do the first-time setup.
-duration hours
Set the maximum lease duration in decimal hours. After this amount of time, all instances will terminated, regardless of their idleness. Defaults to 50 minutes.
-idle hours
Set the maximum idle duration in decimal hours. An instance idle for longer than this duration will terminate itself. Defaults to 15 minutes.
-config-dir /full/path/to/directory
Copy the contents of /full/path/to/directory to each instance’s configuration directory.
-owner owner[, owner]*
Configure the annex so that only owner may start jobs there. By default, configure the annex so that only the user running condor_annex may start jobs there.
-no-owner
Configure the annex so that anyone in the pool may use the annex.
-aws-region region
Specify the region in which to create the annex.
-aws-user-data user-data
Set the instance user data to user-data.
-aws-user-data-file /full/path/to/file
Set the instance user data to the contents of the file /full/path/to/file.
-aws-default-user-data user-data
Set the instance user data to user-data, if it’s not already set. Only applies to spot fleet requests.
-aws-default-user-data-file /full/path/to/file
Set the instance user data to the contents of the file /full/path/to/file, if it’s not already set. Only applies to spot fleet requests.
-aws-on-demand-instance-type instance-type
This annex will requests instances of type instance-type. The default for v8.7.1 is ‘m4.large’.
-aws-on-demand-ami-id ami-id
This annex will start instances of the AMI ami-id. The default for v8.7.1 is ‘ami-35b13223’, a GPU-compatible Amazon Linux image with HTCondor pre-installed.
-aws-on-demand-security-group-ids group-id[,group-id]
This annex will start instances with the listed security group IDs. The default is the security group created by -setup.
-aws-on-demand-key-name key-name
This annex will start instances with the key pair named key-name. The default is the key pair created by -setup.
-aws-spot-fleet-config-file /full/path/to/file
Use the JSON blob in /full/path/to/file for the spot fleet request.
-aws-access-key-file /full/path/to/access-key-file
Experts only.
-aws-secret-key-file /full/path/to/secret-key-file
Experts only.
-aws-ec2-url https://ec2.<region>.amazonaws.com
Experts only.
-aws-events-url https://events.<region>.amazonaws.com
Experts only.
-aws-lambda-url https://lambda.<region>.amazonaws.com
Experts only.
-aws-s3-url https://s3.<region>.amazonaws.com
Experts only.
-aws-spot-fleet-lease-function-arn sfr-lease-function-arn
Developers only.
-aws-on-demand-lease-function-arn odi-lease-function-arn
Developers only.
-aws-on-demand-instance-profile-arn instance-profile-arn
Developers only.

General Remarks

Currently, only AWS is supported. The AMI configured by setup runs HTCondor v8.6.10 on Amazon Linux 2016.09, and the default instance type is “m4.large”. The default AMI has the appropriate drivers for AWS’ GPU instance types.

Examples

To start an on-demand annex named ‘MyFirstAnnex’ with one core, using the default AMI and instance type, run

condor_annex -count 1 -annex-name MyFirstAnnex

You will be asked to confirm that the defaults are what you want.

As of 2017-04-17, the following example will cost a minimum of $90.

To start an on-demand annex with 100 GPUs that job owners ‘big’ and ‘little’ may use (be sure to include yourself!), run

condor_annex -count 100 -annex-name MySecondAnnex \
  -aws-on-demand-instance-type p2.xlarge -owner "big, little"

Exit Status

condor_annex will exit with a status value of 0 (zero) on success.