Simple Linux Utility for Resource Management

SLURM is an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for Linux clusters
containing up to 65,536 nodes. Components include machine status, partition management, job management, scheduling and accounting modules.

Source Files
Filename  Size (bytes)  Size
Fix-test-21.41.patch  2334  2.28 KB
Fix-test-3.13.patch  1026  1 KB
Fix-test-32.8.patch  740  740 Bytes
Fix-test-38.11.patch  1164  1.14 KB
Fix-test7.2-to-find-libpmix-under-lib64-as-well.patch  948  948 Bytes
Keep-logs-of-skipped-test-when-running-test-cases-sequentially.patch  1113  1.09 KB
README_Testsuite.md  4813  4.7 KB
Remove-rpath-from-build.patch  2941  2.87 KB
_service  133  133 Bytes
pam_slurm-Initialize-arrays-and-pass-sizes.patch  2549  2.49 KB
slurm-22.05.5.tar.bz2  7098772  6.77 MB
slurm-rpmlintrc  2684  2.62 KB
slurm.changes  153244  150 KB
slurm.spec  46445  45.4 KB
slurmctld.xml  256  256 Bytes
slurmd.xml  262  262 Bytes
slurmdbd.xml  255  255 Bytes
test_setup.tar.gz  3104  3.03 KB
Revision 79 (latest revision is 108)
Dominique Leuenberger (dimstar_suse) accepted request 1030432 from Egbert Eich (eeich) (revision 79)
- updated to 22.05.5
- NOTE: Slurm validates that libraries are of the same version. Unfortunately,
  due to an oversight, we failed to notice that the slurmstepd loads the
  hash_k12 library only after a job has completed. This means that if the
  hash_k12 library is upgraded before a job finishes, the slurmstepd will load
  the new library when the job finishes, and will fail due to a mismatch of
  versions. This results in nodes with slurmstepd processes stuck
  indefinitely. These processes require manual intervention to clean up. There
  is no clean way to resolve these hung slurmstepd processes.
  The only recommended way to upgrade between minor versions of 22.05 with
  RPMs, or with any upgrade that replaces the current binaries and libraries,
  is to drain the nodes of running jobs first.
- Fixes a number of moderate-severity issues. Notable ones are:
  * Load hash plugin at slurmstepd launch time to prevent issues loading the
    plugin at step completion if the Slurm installation is upgraded.
  * Update nvml plugin to match the unique id format for MIG devices in new
    Nvidia drivers.
  * Fix multi-node step launch failure when nodes in the controller aren't in
    natural order. This can happen with inconsistent node naming (such as
    node15 and node052) or with dynamic nodes which can register in any order.
  * job_container/tmpfs - cleanup containers even when the .ns file isn't
    mounted anymore.
  * Wait up to PrologEpilogTimeout before shutting down slurmd to allow prolog
    and epilog scripts to complete or time out. Previously, slurmd waited 120
    seconds before timing out and killing prolog and epilog scripts.
  (forwarded request 1010642 from mslacken)
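
The drain step recommended in the NOTE above can be sketched in Python as follows. This is a minimal sketch and not part of the package: the helper names `drain_command` and `running_jobs_command` and the node range `node[01-04]` are hypothetical, and the script only builds the `scontrol` and `squeue` command lines instead of running them.

```python
import shlex
from typing import List


def drain_command(nodes: str, reason: str) -> List[str]:
    """Build the scontrol call that drains `nodes` (hypothetical helper)."""
    # A draining node lets its running jobs finish but accepts no new ones.
    return [
        "scontrol",
        "update",
        f"NodeName={nodes}",
        "State=DRAIN",
        f"Reason={reason}",
    ]


def running_jobs_command(nodes: str) -> List[str]:
    """Build the squeue call that lists jobs still running on `nodes`."""
    # -h omits the header, -w restricts to the node list, -t filters by state.
    return ["squeue", "-h", "-w", nodes, "-t", "RUNNING"]


if __name__ == "__main__":
    # Hypothetical node range; substitute the nodes that are to be upgraded.
    nodes = "node[01-04]"
    print(shlex.join(drain_command(nodes, "upgrade to 22.05.5")))
    print(shlex.join(running_jobs_command(nodes)))
```

Under these assumptions, the packages would be upgraded only after the `squeue` call reports no remaining jobs on the drained nodes, and the nodes would then be returned to service with `State=RESUME`.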