Simple Linux Utility for Resource Management
SLURM is an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for Linux clusters
containing up to 65,536 nodes. Components include machine status, partition management, job management, scheduling and accounting modules.
- Developed at network:cluster
- Sources inherited from project openSUSE:Factory
-
4
derived packages
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout openSUSE:Factory:Rebuild/slurm && cd $_
- Create Badge
Refresh
Refresh
Source Files
Filename | Size | Changed |
---|---|---|
Fix-test-21.41.patch | 0000002334 2.28 KB | |
Fix-test-3.13.patch | 0000001026 1 KB | |
Fix-test-32.8.patch | 0000000740 740 Bytes | |
Fix-test-38.11.patch | 0000001164 1.14 KB | |
Fix-test7.2-to-find-libpmix-under-lib64-as-well.pa |
0000000948 948 Bytes | |
Keep-logs-of-skipped-test-when-running-test-cases- |
0000001113 1.09 KB | |
README_Testsuite.md | 0000004813 4.7 KB | |
Remove-rpath-from-build.patch | 0000002941 2.87 KB | |
_service | 0000000133 133 Bytes | |
pam_slurm-Initialize-arrays-and-pass-sizes.patch | 0000002549 2.49 KB | |
slurm-22.05.5.tar.bz2 | 0007098772 6.77 MB | |
slurm-rpmlintrc | 0000002684 2.62 KB | |
slurm.changes | 0000153244 150 KB | |
slurm.spec | 0000046445 45.4 KB | |
slurmctld.xml | 0000000256 256 Bytes | |
slurmd.xml | 0000000262 262 Bytes | |
slurmdbd.xml | 0000000255 255 Bytes | |
test_setup.tar.gz | 0000003104 3.03 KB |
Revision 79 (latest revision is 108)
Dominique Leuenberger (dimstar_suse)
accepted
request 1030432
from
Egbert Eich (eeich)
(revision 79)
- updated to 22.05.5 - NOTE: Slurm validates that libraries are of the same version. Unfortunately, due to an oversight, we failed to notice that the slurmstepd loads the hash_k12 library only after a job has completed. This means that if the hash_k12 library is upgraded before a job finishes, the slurmstepd will load the new library when the job finishes, and will fail due to a mismatch of versions. This results in nodes with slurmstepd processes stuck indefinitely. These processes require manual intervention to clean up. There is no clean way to resolve these hung slurmstepd processes. The only recommended way to upgrade between minor versions of 22.05 with RPM’s or upgrades that replace current binaries and libraries is to drain the nodes of running jobs first. - Fixes a number of moderate severity issues, noteable are: * Load hash plugin at slurmstepd launch time to prevent issues loading the plugin at step completion if the Slurm installation is upgraded. * Update nvml plugin to match the unique id format for MIG devices in new Nvidia drivers. * Fix multi-node step launch failure when nodes in the controller aren't in natural order. This can happen with inconsistent node naming (such as node15 and node052) or with dynamic nodes which can register in any order. * job_container/tmpfs - cleanup containers even when the .ns file isn't mounted anymore. * Wait up to PrologEpilogTimeout before shutting down slurmd to allow prolog and epilog scripts to complete or timeout. Previously, slurmd waited 120 seconds before timing out and killing prolog and epilog scripts. (forwarded request 1010642 from mslacken)
Comments 1
Slurm 18.08 has been released: https://www.schedmd.com/downloads.php