Simple Linux Utility for Resource Management
SLURM is an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for Linux clusters
containing up to 65,536 nodes. Components include machine status, partition management, job management, scheduling and accounting modules.
- Developed at network:cluster
-
4
derived packages
- Download package
-
Checkout Package
osc -A https://api.opensuse.org checkout openSUSE:Factory/slurm && cd $_
- Create Badge
Refresh
Refresh
Source Files
Filename | Size | Changed |
---|---|---|
Fix-test-21.41.patch | 0000002334 2.28 KB | |
Fix-test-32.8.patch | 0000000740 740 Bytes | |
Fix-test7.2-to-find-libpmix-under-lib64-as-well.pa |
0000000948 948 Bytes | |
Keep-logs-of-skipped-test-when-running-test-cases- |
0000001113 1.09 KB | |
README_Testsuite.md | 0000004747 4.64 KB | |
Remove-rpath-from-build.patch | 0000002941 2.87 KB | |
_service | 0000000133 133 Bytes | |
pam_slurm-Initialize-arrays-and-pass-sizes.patch | 0000002851 2.78 KB | |
slurm-23.02.4.tar.bz2 | 0007259848 6.92 MB | |
slurm-rpmlintrc | 0000002979 2.91 KB | |
slurm.changes | 0000184367 180 KB | |
slurm.spec | 0000046719 45.6 KB | |
slurmctld.xml | 0000000256 256 Bytes | |
slurmd.xml | 0000000262 262 Bytes | |
slurmdbd.xml | 0000000255 255 Bytes | |
test_setup.tar.gz | 0000003165 3.09 KB |
Revision 92 (latest revision is 108)
Ana Guerrero (anag+factory)
accepted
request 1109308
from
Egbert Eich (eeich)
(revision 92)
- Fixes since 23.02.03: Highlights: * Fix main scheduler loop not starting after a failover to backup controller. * Avoid slurmctld segfault when specifying `AccountingStorageExternalHost` (bsc#1214983). Other: * Fix sbatch return code when `--wait` is requested on a job array. * Fix collected `GPUUtilization` values for `acct_gather_profile` plugins. * Fix `slurmrestd` handling of job hold/release operations. * Make spank `S_JOB_ARGV` item value hold the requested command `argv` instead of the `srun --bcast` value when `--bcast` requested (only in local context). * Fix step running indefinitely when slurmctld takes more than `MessageTimeout` to respond. Now, slurmctld will cancel the step when detected, preventing following steps from getting stuck waiting for resources to be released. * Fix regression to make `job_desc.min_cpus` accurate again in job_submit when requesting a job with `--ntasks-per-node`. * Fix handling of `ArrayTaskThrottle` in backfill. * Fix regression in 23.02.2 when checking gres state on `slurmctld` startup or reconfigure. Gres changes in the configuration were not updated on slurmctld startup. On startup or reconfigure, these messages were present in the log: `"error: Attempt to change gres/gpu Count`". * Fix potential double count of gres when dealing with limits. * Fix slurmstepd segfault when ContainerPath is not set in `oci.conf` * Fixed an issue where jobs requesting licenses were incorrectly rejected. * `scrontab` - Fix cutting off the final character of quoted variables. * `smail` - Fix issues where e-mails at job completion were not being sent. * `scontrol/slurmctld` - fix comma parsing when updating a reservation's nodes.
Comments 1
Slurm 18.08 has been released: https://www.schedmd.com/downloads.php