condor_top
Display status and runtime statistics of a HTCondor daemon
Synopsis
condor_top [-h ]
condor_top [-l ] [-H ] [-p centralmanagerhostname[:portname]] [-n name] [-d delay] [-c columnset] [-s sortcolumn] [–attrs=<attr1,attr2,…>] [daemon options ]
condor_top [-c columnset] [-s sortcolumn] [–attrs=<attr1,attr2,…>] [classad-filename classad-filename ]
Description
condor_top displays the status (e.g. memory usage and duty cycle) of a HTCondor daemon and calculates and displays runtime statistics for the daemon’s subprocesses.
When no arguments are specified, condor_top displays the status for the primary daemon based on the role of the current machine by scanning the DAEMON_LIST configuration setting. If multiple daemons are listed, condor_top will monitor one of (in decreasing priority): condor_schedd, condor_startd, condor_collector, condor_negotiator, condor_master.
If the condor_collector returns multiple ClassAds for the chosen
daemon type, condor_top will display stats from the first ClassAd
returned. Results can be constrained by passing the NAME of a
specific daemon with -n.
The default delay is STATISTICS_WINDOW_QUANTUM, which is 4 minutes (240 seconds) in a default HTCondor configuration. Setting the delay smaller can be helpful for finding spikes of activity, but setting the delay too small will lead to poor measurements of the duty cycle and of the runtime statistics.
condor_top can run in a top-like “live” mode by passing -l. The live mode is similar to the *nix top command, with stats updating every delay seconds. Redirecting stdout will disable live mode even if -l is set. To exit condor_top while in live mode, issue Ctrl-C.
condor_top can be passed two files containing ClassAds from the same HTCondor daemon, in which case the condor_collector will not be queried but rather the statistics will be computed and displayed immediately from the two ClassAds. Only -c, -s, and -attrs options are considered when passing ClassAds via files.
When monitoring a condor_schedd, passing -H (or -history)
reads the two most recent daemon ClassAds from the
$(SPOOL)/schedd_daemon_history file instead of querying the
schedd twice. This mode disables live output, cannot be combined with
file input, and ignores the -p option because the data are sourced
locally.
The following subprocess stat columns may be displayed (*default).
Each column is printed with a three-row header followed by a horizontal
rule separating the header from the data. The first header row names the
time scope (Recent = between the two ClassAds, Lifetime/Total
= since daemon start), the second names the metric, and the third gives
the units (s = seconds, # = count, /s = count per second,
% = percentage). The displayed header is shown in quotes below; the
identifier following it is the name to pass to the -s option to sort
by that column.
- “Name” (
Item)*Name of the subprocess
- “Recent Runtime” (
InstRt)*Total runtime between the two ClassAds
- “Recent Avg” (
InstAvg)*Mean runtime per execution between the two ClassAds
- “Total Runtime” (
TotalRt)Total runtime since daemon start
- “Lifetime Avg” (
TotAvg)*Mean runtime per execution since daemon start
- “Lifetime Max” (
TotMax)*Max runtime per execution since daemon start
- “Lifetime Min” (
TotMin)Min runtime per execution since daemon start
- “Pct of Avg” (
RtPctAvg)*Percent of mean runtime per execution. The ratio of InstAvg to TotAvg, expressed as a percentage
- “Pct of Max” (
RtPctMax)Percent of max runtime per execution. The ratio of (InstAvg - TotMin) to (TotMax - TotMin), expressed as a percentage
- “Std Devs” (
RtSigmas)Standard deviations from mean runtime. The ratio of (InstAvg - TotAvg) to the standard deviation in runtime per execution since daemon start
- “Recent Count” (
InstCt)Executions between the two ClassAds
- “Recent Rate” (
InstRate)*Executions per second between the two ClassAds
- “Total Count” (
TotalCt)Total executions (counts) since daemon start
- “Lifetime Rate” (
AvgRate)*Mean count rate. Executions per second since daemon start
- “Pct of Rate” (
CtPctAvg)Percent of mean count rate. The ratio of InstRate to AvgRate, expressed as a percentage.
Options
- -h
Displays the list of options.
- -l
Puts condor_top in to a live, continually updating mode.
- -H, -history, –history
When monitoring a condor_schedd, read the two most recent daemon ClassAds from
$(SPOOL)/schedd_daemon_historyinstead of querying the daemon twice. This mode disables live updates, ignores -p, and cannot be combined with file input.- -p centralmanagerhostname[:portname]
Query the daemon via the specified central manager. If omitted, the value of the configuration variable COLLECTOR_HOST is used.
- -n name
Query the daemon named name. If omitted, the value used will depend on the type of daemon queried (see Daemon Options).
- -d delay
Specifies the delay between ClassAd updates, in integer seconds. If omitted, the value of the configuration variable STATISTICS_WINDOW_QUANTUM is used.
- -c columnset
Display columnset set of columns. Valid columnset s are: default, runtime, count, all.
- -s sortcolumn
Sort table by sortcolumn. Defaults to InstRt.
- -attrs=<attr1,attr2,…>
Comma-delimited list of additional ClassAd attributes to monitor.Daemon Options
- -collector
Monitor condor_collector ClassAds. If -n is not set, the constraint “Machine == COLLECTOR_HOST“ will be used.
- -negotiator
Monitor condor_negotiator ClassAds. If -n is not set, the constraint “Machine == COLLECTOR_HOST“ will be used.
- -master
Monitor condor_master ClassAds. If -n is not set, the constraint “Machine == COLLECTOR_HOST“ will be used.
- -schedd
Monitor condor_schedd ClassAds. If -n is not set, the constraint “Machine ==
FULL_HOSTNAME“ will be tried, otherwise the first condor_schedd ClassAd returned from the condor_collector will be used.- -startd
Monitor condor_startd ClassAds. If -n is not set, the constraint “Machine ==
FULL_HOSTNAME“ will be tried, otherwise the first condor_startd ClassAd returned from the condor_collector will be used.