Node maintenance window enhancement
It is a common use case that nodes need some maintenance and the admin knows the maintenance window in advance. It is difficult to plan such a maintenance window(s) in the PBS now. This new feature enhances reservations in order to provide proper maintenance windows.
Interface: New '--hosts' option to PBS command 'pbs_rsub'
Visibility: public
Change Control: Stable
Synopsis: The new option allows submitting a special reservation and this reservation is allowed to 'run' on unavailable nodes.
Details: pbs_rsub with '--hosts' option is allowed to be run only by managers and operators. The resources 'place' and 'select' are generated automatically and they are forbidden to combine with '--hosts'. Combining these resources with '--hosts' results in printing 'usage' help.
The syntax of pbs_rsub with '--hosts' option requires list of hosts: ' <host1> <host2> <host3> ...'
The placement of this reservation is always: '-l place=exclhost'
The select is generated by the hosts like this: '-l select=host=<host1>:ncpus=<ncpus_host1>+host=<host2>:ncpus=<ncpus_host2>+host=<host3>:ncpus=<ncpus_host3>+...'
The resv_nodes of this reservation is created in order to request all the ncpus of all vnodes on requested hosts.
This reservation is confirmed immediately after submission by the pbs_rsub command and overlapping reservations are degraded and will be reconfirmed in the next scheduler iteration.
The resv_nodes of overlapping reservations is modified and the requested vnodes are removed from the resv_nodes. This means that for running reservations no new job will start on overlapping nodes.
Overlapping running jobs are ignored and it is up to the administrators to deal with these jobs.
Reservation submitted with '--hosts' ignores resv_enable attribute on nodes.
The reservation prefix is 'M', which stands for maintenance.
Submitting this reservation will not invoke the scheduler iteration.
Interface: New extend parameter 'm' to IFL function 'char *pbs_submit_resv(int connect, struct attropl *attrib, char *extend)'
Visibility: public
Change Control: Stable
Synopsis: The new extend parameter modifies reservation id prefix.
Details: When the extend parameter includes 'm', the returned reservation id is prefixed with 'M'. This is available only for managers and operators and for others the PBSE_PERM (15007) is returned.
Interface: New reservation substate 'RESV_IN_CONFLICT'
Visibility: public
Change Control: Stable
Synopsis: The substate means that reservation is overlapping the 'M'-reservation.
Details: The RESV_IN_CONFLICT (shortcut is 'IC') has a similar impact as substate RESV_DEGRADED. The difference is that the reservation with the new substate is reconfirmed even if all the nodes of the reservation are up.