-
Notifications
You must be signed in to change notification settings - Fork 7
/
Copy pathCHANGELOG
194 lines (145 loc) · 5.67 KB
/
CHANGELOG
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
=========
Changelog
=========
This file keeps track of all notable changes to the Slurm Charms.
Unreleased
----------
1.1.4 - 2024-06-26
------------------
- added actions to set node weight and gres
- updated slurm-ops-manager to version 0.8.18
1.1.3 - 2023-12-11
------------------
- added a custom python installation to run the charm code
- removed CentOS 8 support
1.1.2 - 2023-09-07
------------------
- added support for Rocky Linux
- removed the nhc resource and packed the nhc tar file along with slurmd charm
1.1.1 - 2023-06-27
------------------
- added support for Ubuntu 22.04
1.1.0 - 2023-05-22
------------------
- set setuptools to be lower than 55
- set Slurm 23.02.1 as default on slurm-ops-manager 0.8.13
- set urllib3 as a requirement for slurmd and slurmctld and fix the version to 1.26.9
- changed license to Apache-2.0
- removed the action in slurmd to install InfiniBand drivers
- removed the action in slurmd to install Nvidia drivers
- removed the action in slurmd to install Singularity
- removed the action in slurmd to install MPI (mpich)
1.0.0 - 2022-10-17
------------------
0.10.1 - 2022-10-04
-------------------
- added action in slurmd to install MPI (mpich)
0.10.0 - 2022-09-15
-------------------
- updated NHC to version 1.4.3
- changed how NHC is installed to be via Juju resources
- added action in slurmd to install singularity
0.9.2 - 2022-07-04
------------------
- added slurmctld configuration options to use TLS certificates for etcd
0.9.1 - 2022-06-02
------------------
- added munge key to etcd database
- added action in slurmctld to create etcd user to query mungekey
- added action in slurmctld to get etcd root password
- added action in slurmd to get etcd slurmd password
- added set-node-inventory slurmd action
- updated default Mellanox Infiniband drivers to version 5.4
- fix get-node-inventory action
- fix influxdb-info slurmctld action
- fix etcd not being configured after an upgrade
0.9.0 - 2022-04-12
------------------
- fix: do not install Nvidia repository by default
- add support for custom NHC parameters
- improve checks when nodes need to be rebooted
- add Nvidia GPU support: install drivers via an action on compute nodes
- update operator framework to 1.3.0
0.8.5 - 2022-01-13
------------------
- add support for ElasticSearch Slurm addon
- pin Operator Framework to 1.2.0
0.8.4 - 2021-12-10
------------------
- use the cluster name as the database name for InfluxDB
- improve logrotate profiles for Slurm and NHC logs
- add epilog-prolog relation
0.8.3 - 2021-11-05
------------------
- add labels to Fluentbit logs: cluster-name, partition-name, hostname, service
- add user-group relation to create user and group on the system
- fix typos in unit status
0.8.2 - 2021-10-13
------------------
- fix changing infiniband repos on Ubuntu
- fix fluentbit parser for slurm logs
- fix creation of user and group when those already exist
0.8.1 - 2021-10-07
------------------
- use Omnivector Solutions' RPM repository to install Slurm
- add Fluentbit relation to forward logs from all the Slurm Charms
0.8.0 - 2021-09-13
------------------
- fix slurmd crashing when etcd in slurmctld is down/not-started/unreachable
- allow installing Slurm from different repositories on Ubuntu
- use Omnivector's PPA for installing Slurm on Ubuntu
0.7.0 - 2021-08-09
------------------
- fix: reduce number of events when a slurmd unit is added/removed
- fix #111: install bullseye gpg keys from files
- add support for Ubuntu partitions on CentOS deploys
- add influxdb-info action to slurmctld
- add grafana relation for slurmctld
- add influxdb relation for accounting and profiling of jobs with the SLURM
InfluxDB plugin
- add acct gather configuration options
- handle update-status hook to constantly give feedback about charms' status
- add .jujuignore to reduce size of final charm files
- fix cache of partition_name in slurmd charm
- improve description of charm's state in juju status
- fix race condition on relation data exchange between slurmctld and slurmdbd
- change default partition name from juju-compute-random to osd-appname
- fix slurmctld crashing when removing a slurmd application
0.6.6 - 2021-07-01
------------------
- fix checks for munge when the code can't run subprocess as another UID
0.6.5 - 2021-06-28
------------------
- fix race condition on inventory synchronization between slurmd and slurmctld
before starting slurmd
- fix breakage in charms when the slurmd leader is removed
- remove unused code in slurmd peer relation
- fix slurmd action to query infiniband version
- add slurmrestd to CentOS7
- fix slurmrestd Systemd environment variables to enable jwt auth
0.6.4 - 2021-06-11
------------------
- improve checks on Systemd commands to start/restart daemons, as to guarantee
correct Charm initialization
- check for munge keys before starting Slurm daemons, to avoid a race condition
- fix possible breakage in Slurm configuration due to spaces in partition names
- improve description of configurations and actions.
- fix slurmrestd version in Juju status breaking the line.
0.6.3 - 2021-06-02
------------------
- fix systemd command to restart munge
- improve slurmdbd logs when restarting munge
0.6.2 - 2021-06-02
------------------
- handle charm upgrade
- improve juju status' message to display failure when munge/slurm does not
start correctly
- fix slurmd initialization when slurm.conf is present
- enable munge system service, so it always start when the machine boots
0.6.1 - 2021-05-31
------------------
- changed charm's status message to have consistent capitalization.
- fixed initialization order of the charms, to ensure database and controller
start before compute nodes and REST server.
0.6.0 - 2021-05-28
------------------