Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Interfaces job exit codes:

    • Values:

      • JOB_EXEC_KILL_NCPUS_BURST -24

      • JOB_EXEC_KILL_NCPUS_SUM -25

      • JOB_EXEC_KILL_VMEM -26

      • JOB_EXEC_KILL_MEM -27

      • JOB_EXEC_KILL_CPUT -28

      • JOB_EXEC_KILL_WALLTIME -29

      • JOB_EXEC_KILL_MPPE -30

      • JOB_EXEC_KILL_MPPSSP -40

    • Visibility: public

    • Synopsis: job exit code

    • Details: Exit code is a value sent by a mom to the server as information on how the job ended. The new exit codes convey the information of exceeding resource kills to the server. E.g.: Job is killed due to exceeding walltime, the exit code JOB_EXEC_KILL_WALLTIME is returned to the server.

  • Interfaces job substates:

    • Values:

      • JOB_SUBSTATE_KILL_NCPUS_BURST 95

      • JOB_SUBSTATE_KILL_NCPUS_SUM 95

      • JOB_SUBSTATE_KILL_VMEM 96

      • JOB_SUBSTATE_KILL_MEM 97

      • JOB_SUBSTATE_KILL_CPUT 98

      • JOB_SUBSTATE_KILL_WALLTIME 99

      • JOB_SUBSTATE_KILL_MPPE 100

      • JOB_SUBSTATE_KILL_MPPSSP 101

    • Visibility: public

    • Synopsis: job substate

    • Details: On the server, a corresponding job substate is set based on the job exit code.

    Interfaces job comments:

    • Values:

      • Job run at … on … and exceeded resource ncpus (burst)

      • Job run at … on … and exceeded resource ncpus (sum)

      • Job run at … on … and exceeded resource vmem

      • Job run at … on … and exceeded resource mem

      • Job run at … on … and exceeded resource cput

      • Job run at … on … and exceeded resource walltime

      • Job run at … on … and exceeded resource mppe

      • Job run at … on … and exceeded resource mppssp

    • Visibility: public

    • Synopsis: job comment

    • Details: If the job is killed due to exceeding resources, the job comment is set to a corresponding value. E.g.: Job exceeds the walltime and the job comment is set to: “Job run at … on … and exceeded resource walltime“.

  • Interfaces email abort messages:

    • Values:

      • "Job exceeded resource ncpus (burst)\nSee job standard error file"

      • "Job exceeded resource ncpus (sum)\nSee job standard error file"

      • "Job exceeded resource vmem\nSee job standard error file"

      • "Job exceeded resource mem\nSee job standard error file"

      • "Job exceeded resource cput\nSee job standard error file"

      • "Job exceeded resource walltime\nSee job standard error file"

      • "Job exceeded resource mppe\nSee job standard error file"

      • "Job exceeded resource mppssp\nSee job standard error file"

    • Visibility: public

    • Synopsis: email message on job abort

    • Details: An appropriate abort email is sent once the job is killed due to exceeding resources. E.g.: Job exceeds the walltime and the abort email with a message "Job exceeded resource walltime\nSee job standard error file" is sent.

...