Submitting jobs, Summary of the lsf bsub command format – HP XC System 3.x Software User Manual

Page 77

Advertising
background image

$ sinfo -p lsf

PARTITION AVAIL TIMELIMIT NODES STATE NODELIST

lsf up infinite 128 idle n[1-128]

Use the following command to obtain more information on the nodes in the lsf partition:

$ sinfo -p lsf -lNe

NODELIST NODES PARTITION STATE CPUS MEMORY TMP_DISK WEIGHT FEATURES REASON

n[1-128] 128 lsf idle 2 3456 1 1 (null) none

Refer to

"Getting System Information with the sinfo Command"

and the sinfo(1) manpage and for further

information about using the sinfo command.

Submitting Jobs

The bsub command submits jobs to the LSF-HPC system. This section focuses on enhancements to the bsub
command from the LSF-HPC integration with SLURM on the HP XC system; this section does not discuss
standard bsub functionality or flexibility. See the Platform LSF documentation and the bsub(1) manpage for
more information on this important command.

This topic of submitting jobs is explored in detail in Chapter

Chapter 5. Submitting Jobs

and this section

refers to some of these examples.

The bsub command and its options, including the external SLURM scheduler, is used to request a set of
resources on which to launch a job. See

"HP XCCompute Node Resource Support"

for an introduction

and

"Summary of the LSF bsub Command Format"

for additional information. The arguments to the bsub

command consist of the user job and its arguments. The bsub options allow you to provide information on
the amount and type of resources needed by the job.

The basic synopsis of the bsub command is:

bsub [

bsub-options] jobname jobname [job-options]

The HP XC system has several features that make it optimal for running parallel applications, particularly
(but not exclusively) MPI applications. You can use the bsub command's -n to request more than one core
for a job. This option, coupled with the external SLURM scheduler, discussed in

"LSF-SLURM External

Scheduler"

, gives you much flexibility in selecting resources and shaping how the job is executed on those

resources.

LSF-HPC, like standard LSF, reserves the requested number of nodes and executes one instance of the job
on the first reserved node, when you request multiple nodes. Use the srun command or the mpirun command
with the -srun option in your jobs to launch parallel applications. The -srun can be set implicitly for the
mpirun

command; see

"Submitting a Parallel Job That Uses the HP-MPI Message Passing Interface"

for

more information on using the mpirun -srun command.

Most parallel applications rely on rsh or ssh to "launch" remote tasks. The ssh utility is installed on the
HP XC system by default. If you configured the ssh keys to allow unprompted access to other nodes in the
HP XC system, the parallel applications can use ssh. See

"Enabling Remote Execution with OpenSSH"

for more information on ssh.

Summary of the LSF bsub Command Format

This section provides a summary of the format LSF bsub command on the HP XC system. The bsub command
can have the following formats:

bsub

When you invoke the bsub command without any arguments, you are prompted for a command from
standard input.

bsub [

bsub-options] jobname [job-arguments]

This is the bsub command format to submit a serial job. The srun command is required to run parallel jobs
on the allocated compute node. Refer to

"Submitting a Serial Job Using LSF-HPC"

.

bsub -n

num-procs [bsub-options] jobname [job-arguments]

This is the standard bsub command format to submit a parallel job to the LSF execution host. The jobname
parameter can be name of an executable or a batch script. If jobname is executable, job is launched on
LSF execution host node. If jobname is batch script (containing srun commands), job is launched on

Using LSF-HPC

77

Advertising