Job selection

Selecting the Resource Manager (Job Launcher)

Padb supports many resource managers and should select the appropriate one for your machine, if you have more than one resource manager installed or padb can't detect the correct one use the rmgr configuration option to set machine-wide defaults.

If your resource manager or scheduler is not supported you can also use local and process identifiers (pids) will be used instead of job ids.

Resource manager Name Notes
Any mpirun Works with any resource manager or software stack that is compliant with the MPI debugger interface. It is preferable to use support for your specific resource manager if it exists.
Slurm slurm Fully supported
RMS rms Fully supported
OpenMPI orte orte Fully supported
MPICH2 mpd mpd Fully supported in 3.0 and above
MPICH2 hydra mpirun Use the mpirun resource mangager in version 3.2 and above.
PBS and PBS pro pbs Fully supported in 3.2 and above
None local allows process identifiers (pids) to be specified as jobids and treats that process as a single process job.
None local-fd as local but only selects processes which have a certain file open, use the "local-fd-name" configuration option to set the filename.
None local-qsnet as local-fd with local-fd-name set to /proc/qsnet/elan/user to automatically select network jobs on the local node.

The --list-rmgrs option can be used to show a list of detected resource managers and their active jobs.

Selecting the job(s) to target

Padb provides several ways for the user to select which jobs to target depending on requirements. All jobs detected by padb are identified by a unique numeric "jobid", typically this maps directly only a identifier provided by the underlying resource manager. The default is to target jobs of the current user, this can be over-ridden with the --user flag.

To target a specific job

To target a specific job specify the numeric jobid for the job on the command line, after all other options.

Showing list of current jobs

To show a list of currently running jobs for a given user use the --show-jobs option. Alternatively the --list-rmgrs option shows a list of detected resource managers along with a list of currently active jobs for each of them.

To target all jobs

To target all jobs currently running for a given user use the --all (-a) flag.

To target any jobs

To target any job currently running for a given user use the --any (-A) flag. This differs from targeting all jobs as it will exit with an error if more than one job is running.

Selecting ranks (Processes)

In modes where data for each process is reported separately it is possible to restrict which ranks are queried, this is done via the --rank option. Multiple ranks can be selected by specifying --rank multiple times or by specifying a rank list using the [<low>-<high>,<value>] notation. Eg, to specify ranks 0,2 and 3 use --rank [0,2-3]

Selecting which mode to run in.

Padb can present an array of different information about your select jobs and it can present it in a number of different ways. With the exception of full Report only one mode can be selected, if you need more information about the program padb has to be run more than once. A list of available modes and their descriptions can be found on the modes page.