- Posted
- Filed under Computer/HPC
Download license daemon & PBS file
license daemon : altair_licensing_11.0.linux_x64.bin
PBS : PBSPro_11.0.0-RHEL5_x86_64.tar.gz
Install
1. Create license :
type : LM-X
2. Install license daemon to license server:
# sh altair_licensing_11.0.linux_x64.bin
start daemon
# chkconfig --level 35 altairlmxd on
# /etc/init.d/altairlmxd start
check license
# ps -ef |grep lm
root 3863 1 0 09:57 ? 00:00:00 /opt/pbs/licensing11.0/bin/lmx-serv-altair -b -c /opt/pbs/licensing11.0/altair-serv.cfg
debug
# tail -f /opt/pbs/licensing11.0/logs/<hostname>.log
3. Install PBS server in front-end server
requirement daemon : pbs_sched, pbs_server.bin, postgres
# useradd altair
# tar zxvf PBSPro_11.0.0-RHEL5_x86_64.tar.gz
# cd PBSPro_XXXX
# ./INSTALL
***
Execution directory? [/opt/pbs/11.0.0.103450] <enter>
***
Home directory? [/var/spool/PBS] <enter>
***
PBS Installation:
1. Server, execution and commands <= front end server
2. Execution only <= compute node
3. Commands only <= Just run command node (not submit)
(1|2|3)?1 <enter>
PBS Professional version 9.0 and later is licensed
via the Altair License Manager.
The Altair License Manager can be downloaded from:
http://www.pbspro.com/UserArea/Software/
For more information, please refer to the PBS
Professional Administrator's Guide, or contact pbssupport@altair.com.
Continue with the installation ([y]|n)? <enter>
Please enter the list of Altair License file location(s)
in a colon-separated list of entries of the form
<port>@<host>
@<host>
<license file path>
Examples:
7788@fest
7788@tokyo:7788@madrid:7788@rio
@perikles:27000@aspasia
@127.3.4.5
/usr/local/altair/security/altair_lic.dat
Enter License File Location(s):@pbs_license_server <enter>
***
Switch to the new version of PBS (y/n)?y <enter>
***
Would you like to start PBS now (y|[n])?n <enter>
***
# vi /etc/pbs.conf
-------------------------------------
PBS_EXEC=/opt/pbs/default
PBS_HOME=/var/spool/PBS
PBS_START_SERVER=1
PBS_START_MOM=0 <== change from 1 to 0
PBS_START_SCHED=1
PBS_SERVER=home
PBS_DATA_SERVICE_USER=altair
-------------------------------------
start daemon
# chkconfig --level 35 pbs on
# /etc/init.d/pbs start
check log
# tail -f /var/spool/PBS/server_logs/<date>
4. install PBS in compute node
requirement daemon : pbs_mom
requirement remote shell : default (rsh, rcp), avail (ssh, scp)
* using ssh for remote shell
# vi quick
---------------------------------------------
<enter>
<enter>
2 <enter>
y
<server host name> <enter>
y <enter>
y <enter>
n <enter>
----------------------------------------------
# ./INSTALL < quick
# vi /var/spool/PBS/pbs_environment
------------------------------------------------
TZ=America/Chicago
PATH=/bin:/usr/bin
PBS_RSHCOMMAND=ssh <== add this line but not must.
------------------------------------------------
or
------------------------------------------------
TZ=America/Chicago
PATH=/bin:/usr/bin
------------------------------------------------
# vi /opt/pbs.conf
------------------------------------------------
PBS_EXEC=/opt/pbs/default
PBS_HOME=/var/spool/PBS
PBS_START_SERVER=0
PBS_START_MOM=1
PBS_START_SCHED=0
PBS_SERVER=<pbs server hostname>
PBS_SCP=/usr/bin/scp <== add this line (must)
------------------------------------------------
# chkconfig --level 35 pbs on
# /etc/init.d/pbs start
debug
# tail -f /var/spool/PBS/mom_logs/<date>
6. test
$ echo "sleep 60; hostname; pwd; date" | qsub
$ qstat -an
$ cat STDIN.o<job id>
7. useful commands
PBS node list
$ pbsnodes -a
Trace Job
$ tracejob <job id>
Queue state
$ qstat -an
Queue del
$ qdel <job id>
$ qdel -W force <job id>
Show Queue configuration, license infomation, ...
$ qstat -fB
$ qmgr -c "list server"
Add compute node to server
$ qmgr -c "create node <host name>"
$ qmgr -c "create node <host name> resoures_available.ncpu=2"
Delete compute node
$ qmgr -c "delete node <host name>"
change License information
License server
$ qmgr -c "set server pbs_license_info=<port1>@<host1>"
$ qmgr -c "set server pbs_license_info=<port1>@<host1>:...:<port#>@<host#>"
File
$ qmgr -c "set server pbs_license_info=<path license file>"
$ qmgr -c "set server pbs_license_info=<path license file1>:..:<path license file2>"
unset
$ qmgr -c "unset server pbs_license_info"
Server configuration
$ qmgr -c "print nodes @default"
move all jobs within a queue
$<new pbs path>/qmove <queue name>@<new server host name>:15001 <old pbs path>/qselect -q <queue name>@<old server host name>:13001
8. remove PBS
# rpm -qa |grep pbs
# rpm -e pbs-xxxx
# rm -fr /var/spool/PBS
# rm -f /etc/pbs.conf
# rm -fr /opt/pbs/11.0XXXXX
If you find ulimit problem in PBS queue then modify /etc/init.d/pbs in compute node.
add "ulimit -l unlimited" before run pbs_mom daemon.