sge_intro (1) - Linux Manuals

sge_intro: xxQS_NAMExx, a facility for executing jobs on remote machines

NAME

sge_intro - xxQS_NAMExx, a facility for executing jobs on remote machines

DESCRIPTION

xxQS_NAMExx is a facility for executing Unix-like batch jobs (shell scripts or binaries) on a pool of cooperating CPUs. Jobs are queued and executed remotely according to defined policies. The CPUs may be in desktop systems, with jobs run on them at times when they would otherwise be idle or only lightly loaded. The work load may be distributed according to the (generalized) load situation of each machine and the resource requirements of the jobs.

User level checkpointing programs are supported and a transparent checkpointing mechanism is provided (see Checkpointing jobs migrate from system to system without user intervention on load demand. In addition to simple batch jobs, interactive jobs and parallel jobs can also be submitted to xxQS_NAMExx.

USER INTERFACE

The xxQS_NAMExx user interface consists of several programs which are described separately.
qacct extracts arbitrary accounting information from the cluster logfile.
qalter changes the characteristics of already submitted jobs.
qconf provides the user interface for configuring, modifying, deleting and querying queues and the cluster configuration.
qdel provides the means for a user/operator/manager to cancel jobs.
qevent provides a means of watching xxQS_NAMExx events and acting on jobs finishing.
qhold holds back submitted jobs from execution.
qhost displays status information about xxQS_NAMExx execution hosts.
qlogin initiates a telnet or similar login session with automatic selection of a suitable host.
qmake is a replacement for the standard Unix make facility. It extends make with an ability to distribute independent make steps across a cluster of suitable machines.
qmod allows the owner(s) of a queue to suspend and enable queues, e.g. all queues associated with his machine (all currently active processes in this queue are also signaled) or to suspend and enable jobs executing in the queues.
qmon provides a Motif command interface to all xxQS_NAMExx functions. The status of all, or a private selection of, the configured queues is displayed on-line by changing colors at corresponding queue icons.
qping can be used to check the status of xxQS_NAMExx daemons.
qquota provides a status listing of all currently used resource quotas (see
qresub creates new jobs by copying currently running or pending jobs.
qrls releases holds from jobs previously assigned to them e.g. via (see above).
qrdel provides the means to cancel advance reservations.
qrsh can be used for various purposes such as providing remote execution of interactive applications via xxQS_NAMExx comparable to the standard Unix facility rsh, to allow for the submission of batch jobs which, upon execution, support terminal I/O (standard/error output and standard input) and terminal control, to provide a batch job submission client which remains active until the job has finished or to allow for the xxQS_NAMExx-controlled remote execution of the tasks of parallel jobs.
qrstat provides a status listing of all advance reservations in the cluster.
qrsub is the user interface for submitting an advance reservation to xxQS_NAMExx.
qselect prints a list of queue names corresponding to specified selection criteria. The output of qselect is usually fed into other xxQS_NAMExx commands to apply actions on a selected set of queues.
qsh opens an interactive shell (in an on a low loaded host. Any kind of interactive job can be run in this shell.
qstat provides a status listing of all jobs and queues associated with the cluster.
qtcsh is a fully compatible replacement for the widely known and used Unix C-Shell (csh) derivative tcsh. It provides a command-shell with the extension of transparently distributing execution of designated applications to suitable and lightly loaded hosts via xxQS_NAMExx.
qsub is the user interface for submitting a job to xxQS_NAMExx.

COPYRIGHT

Copyright: 2008 by Sun Microsystems, Inc. Parts of the manual page texts are Copyright 2011 Univa Corporation, and various dates by other contributors. The manual pages are licensed under the Sun Industry Standards Source License, like the source, except where noted in some files by other contributors, which are under other free licences noted in their source.

SEE ALSO