slpolice will determine the top cpu users across a cluster of hosts.
It will send mail if a process has over a specified amount of cpu time.
It will also mail if a user has a reservation for a long period of time.
Usually slpolice is run with a crontab entry similar to:
5 8-21 * * * /usr/local/bin/slpolice --cpu_min 120 --reserved_min 120 long=999 >/dev/null 2>&1
This sends warnings each hour after 2 hours of CPU time. It does not check
at night so that long overnight jobs will not receive warnings.
Additional non-parameter arguments specify specific command regular
expressions. When a process command matches that regexp, the specified
number of minutes will be used to determine when to send mail instead of
This program is most valuable when used with the nicercizerd program, or
a operating system where nice 19 processes get only leftover cpu resources.
It requires a program called nice19 which is a version of nice that is
setgid root and renices a job to 19. This comes with nicercizerd.