11 getting information about jobs, 1 getting job allocation information – HP XC System 3.x Software User Manual

Page 107

Advertising
background image

10.11 Getting Information About Jobs

There are several ways you can get information about a specific job after it has been submitted
to LSF-HPC integrated with SLURM. This section briefly describes some of the commands that
are available under LSF-HPC integrated with SLURM to gather information about a job. This
section is only intended to give you an idea of the commonly used commands and to describe
any differences there may be in their operation in the HP XC environment, not as a complete
reference on this topic. See the LSF manpages for full information about the commands described
in this section.

The following LSF commands are described in this section:

bjobs

“Examining the Status of a Job”

bhist

“Viewing the Historical Information for a Job”

10.11.1 Getting Job Allocation Information

Before a job runs, LSF-HPC integrated with SLURM allocates SLURM compute nodes based on
job resource requirements.

After LSF-HPC integrated with SLURM allocates nodes for a job, it attaches allocation information
to the job.

The bjobs -l command provides job allocation information on running jobs. The bhist -l
command provides job allocation information for a finished job. For details about using these
commands, see the LSF manpages .

A job allocation information string resembles the following:

slurm_id=slurm_jobid;ncpus=slurm_nprocs;slurm_alloc=node_list

This allocation string has the following values:

slurm_id

SLURM_JOBID

environment variable. This is SLURM allocation ID (Associates

LSF-HPC job with SLURM allocated resources.)

ncpus

SLURM_NPROCS

environment variable. This the actual number of allocated

cores. Under node-level allocation scheduling, this number may be bigger
than what the job requests.)

slurm_alloc

A comma separated list of allocated nodes.

LSF-HPC integrated with SLURM sets the SLURM_JOBID and SLURM_NPROCS environment
variables, when it starts a job.

Example 10-3

illustrates how to use the the bjobs -l command to obtain job allocation

information about a running job:

10.11 Getting Information About Jobs

107

Advertising