PBSPro Community Discussion:
- Interface 1: vnode attribute: resources_available.vntype
- Visibility: Public
- Change Control: Stable
- Details:
- The vntype attribute, for vnodes corresponding to Cray KNL nodes, will have the same value as non knl compute node i.e "cray_compute".
- Node selection for bootable processor node will happen on the basis of aoe.
- Interface 2: vnode attribute: resources_available.PBScrayseg
- Visibility: Public
- Change Control: Stable
- Details:
- KNL vnodes, corresponding to KNL nodes returned as part of the System (BASIL 1.7) Query XML response, will have PBScrayseg set to 0, when vnode_per_numa node is true. There will only be one KNL vnode per KNL node, regardless of the number of segments/NUMA nodes, per KNL node, specified in the XML response.
- This is a change from existing behavior.
- Interface 3: vnode attribute: current_aoe
- Visibility: Public
- Change Control: Stable
- Details:
- KNL vnodes, corresponding to KNL nodes returned as part of the System (BASIL 1.7) Query XML response, will have current_aoe set to the numa_cfg value concatenated with the hbm_cache_pct value e.g. a2a_0.
- Each KNL vnode’s current_aoe attribute shows that vnode’s current AOE.
- The valid values are: a2a, snc2, snc4, hemi, quad for numa_cfg and 0, 25, 50, 100 for hbm_cache_pct.
- Interface 4: vnode attribute: resources_available.hbmem
- Visibility: Public
- Change Control: Stable
- Details:
- KNL vnodes, corresponding to KNL nodes returned as part of the System (BASIL 1.7) Query XML response, will have hbmem set to the hbm_size_mb value.
- KNL vnodes, corresponding to KNL nodes returned as part of the System (BASIL 1.7) Query XML response, will have hbmem set to the hbm_size_mb value.
- Interface 5: System Query (BASIL 1.7)
- Visibility: PBS Private
- Change Control: Stable
- Details:
- The System Query (BASIL 1.7) reports inventory information in a much more compact form than the Inventory Query. Attribute value pairs in the XML response
apply to a group of Nodes.
- The System Query (BASIL 1.7) reports inventory information in a much more compact form than the Inventory Query. Attribute value pairs in the XML response
- Interface 6: sched config
- Visibility: Public
- Change Control: Stable
- Details:
- New default resource 'hbmem' will be added in scheduler config for cray platform.
- This will allow scheduler to schedule jobs based on hbmem resource request.
- Interface 7: Log/Error messages.
- Visibility: Public
- Change Control: Stable
Details: The following table lists the log/error messages introduced in the KNL (BASIL 1.7) project.
No.
Level
Log/Error message
Visibility
Classification
1 PBSEVENT_SYSTEM In MoM logs: Memory allocation for XML request buffer failed. Public Stable 2 PBSEVENT_DEBUG2 In MoM logs: This Cray system supports the BASIL 1.7 protocol. Public Stable 3 PBSEVENT_DEBUG2 In MoM logs: This Cray system does not support the BASIL 1.7 protocol. Public Unstable 4 PBSEVENT_SYSTEM In MoM logs: ALPS System Query request failed. Public Stable 5 PBSEVENT_SYSTEM In MoM logs: Failed to save System Query response. Public Stable 6 PBSEVENT_DEBUG In MoM logs: Creation of Cray KNL vnodes failed with name <vnode name> Public Stable 7 PBSEVENT_DEBUG3 In MoM logs: No KNL nodes. Public Stable 8 PBSEVENT_ERROR In MoM logs: Bad KNL Rangelist: <rangelist> Public Unstable 9 PBSEVENT_ERROR In MoM logs: malloc failure Public Unstable 10 PBSEVENT_ERROR In MoM logs: realloc failure Public Unstable .
- Interface 8: PBS hook PBS_knl_provision
- Visibility: Public
- Change Control: Stable
- Synopsis: KNL provisioning
- Details:
- This provisioning hook script runs on the server to provision the node with requested aoe.
- This will be invoked whenever aoe resource is requested in the job.
- The hook is disabled by default, can be enabled by Administrator.
- Time out for the hook is 1800 seconds.