-
Notifications
You must be signed in to change notification settings - Fork 1
/
Copy pathVERSIONING
223 lines (169 loc) · 9.1 KB
/
VERSIONING
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
-------------------
Written by Ted T'so
-------------------
> https://www.gnu.org/software/libtool/manual/html_node/Updating-version-info.html
>
> I understood that, if there is no interface change but some implementation
> changes, I need to bump revision. If new interface is added, for example, I
> need to bump current while revision=0 and age++.
So part of the problem here is that libtool is doing something really
strange because they are trying to use some abstract concept that is
OS-independent. I don't use libtool because I find it horribly
complex and doesn't add enough value to be worth the complexity.
So I'll tell you how things work with respect to Linux's ELF version
numbering system. Translating this to libtool's wierd "current,
revision, age" terminology is left as an exercise to the reader. I've
looked at the libtool documentation, and it confuses me horribly.
Reading it, I suspect it's wrong, but I don't have the time to
experiment to confirm that the documentation is wrong and how it
diverges from the libtool implementation.
So let me explain things using the ELF shared library terminology,
which is "major version, minor version, patchlevel". This shows up in
the library name:
libudev.so.1.6.11
So in this example, the major version number is 1, the minor version
is 6, and the patchlevel is 11. The patchlevel is entirely optional,
and many packages don't use it at all. The minor number is also
mostly useless on Linux, but it's still there for historical reasons.
The patchlevel and minor version numbers were useful back for SunOS
(and Linux a.out shared library), back when there weren't rpm and dpkg
as package managers.
So many modern Linux shared libraries will only use the major and
minor version numbers, e.g:
libext2fs.so.2.4
The only thing you really need to worry about is the major version
number, really. The minor version is *supposed* to change when new
interfaces has changed (but I and most other people don't do that any
more). But the big deal is that the major number *must* get bumped if
an existing interface has *changed*.
So let's talk about the major version number, and then we'll talk
about why the minor version number isn't really a big deal for Linux.
So if you change any of the library's function signatures --- and this
includes changing a type from a 32-bit integer to a 64-bit integer,
that's an ABI breakage, and so you must bump the major version number
so that a program that was linked against libfoo.so.4 doesn't try to
use libfoo.so.5. That's really the key --- will a program linked
against the previous version library break if it links against the
newer version. If it does, then you need to bump the version number.
So for structures, if you change any of the existing fields, or if the
application program allocates the structure --- either by declaring it
on the stack, or via malloc() --- and you expand the structure,
obviously that will cause problem, and so that's an ABI break.
If however, you arrange to have structures allocated by the library,
and struct members are always added at the end, then an older program
won't have any problems. You can guarantee this by simply only using
a pointer to the struct in your public header files, and defining the
struct in a private header file that is not available to userspace
programs.
Similarly, adding new functions never breaks the ABI. That's because
older program won't try to use the newer interfaces. So if I need to
change an interface to a function, what I'll generally do is to define
a new function, and then implement the older function in terms of the
newer one. For example:
extern errcode_t ext2fs_open(const char *name, int flags, int superblock,
unsigned int block_size, io_manager manager,
ext2_filsys *ret_fs);
extern errcode_t ext2fs_open2(const char *name, const char *io_options,
int flags, int superblock,
unsigned int block_size, io_manager manager,
ext2_filsys *hret_fs);
As far as the minor version numbers are concerned, the dynamic linker
doesn't use it. In SunOS 4, if you have a DT_NEEDED for libfoo.so.4,
and the dynamic linker finds in its search path:
libfoo.so.4.8
libfoo.so.4.9
It will preferentially use libfoo.so.4.9.
That's not how it works in Linux, though. In Linux there will be a
symlink that points libfoo.so.4 to libfoo.so.4.9, and the linker just
looks for libfoo.so.4. One could imagine a package manager which
adjusts the symlink to point at the library with the highest version,
but given that libfoo.so.4.9 is supposed to contain a superset of
libfoo.so.4.8, there's no point. So we just in practice handle all of
this in the package manager, or via an ELF symbol map. Or, we just
assume that since vast majority of software comes from the
distribution, the distro package manager will just update libraries to
the newer version as a matter of course, and nothing special needs to
be done.
So in practice I don't bump the minor version number for e2fsprogs
each time I add new interfaces, because in practice it really doesn't
matter for Linux. We have a much better system that gets used for
Debian.
For example in Debian there is a file that contains when each symbol
was first introduced into a library, by its package version number.
See:
https://git.kernel.org/pub/scm/fs/ext2/e2fsprogs.git/tree/debian/libext2fs2.symbols
This file contains a version number for each symbol in libext2fs2, and
it tells us what version of libext2fs you need to guarantee that a
particular symbol is present in the library. Then when *other*
packages are built that depend on libext2fs2, the minimum version of
libext2fs can be calculated based on which symbols they use.
So for example the libf2fs-format4 package has a Debian dependency of:
Depends: libblkid1 (>= 2.17.2), libc6 (>= 2.14), libf2fs5, libuuid1 (>= 2.16)
The minimum version numbers needed for libblkid1 and libuuid1 are
determined by figuring out all of the symbols used by the
libf2fs-format4 package, and determining the minimum version number of
libblkid1 that supports all of those blkid functions.
This gets done automatically, so I didn't have to figure this out.
All I have in the debian/control file is:
Depends: ${misc:Depends}, ${shlibs:Depends}
Sorry this got so long, but hopefully you'll find this useful. How
you bend libtool to your will is something you'll have to figure out,
because I don't use libtool in my packages.[1]
Cheers,
- Ted
[1] If you are interested in how I do things in e2fsprogs, take a look
at the Makefile.elf-lib, Makefile.solaris-lib, Makefile.darwin-lib,
etc. here:
https://git.kernel.org/pub/scm/fs/ext2/e2fsprogs.git/tree/lib
This these Makefile fragments are then pulled into the generated
makefile using autoconf's substitution rules, here:
https://git.kernel.org/pub/scm/fs/ext2/e2fsprogs.git/tree/lib/ext2fs/Makefile.in
(Search for "@MAKEFILE_ELF@" in the above Makefile.in).
So when someone runs "configure --enable-elf-shlibs", they get the ELF
shared libraries built. On BSD and MacOS systems they just have to
run "configure --enable-bsd-shlibs", and so on.
Personally, since most people don't bother to write truly portable
programs, as their C code is full of Linux'isms, using libtool is just
overkill, because they probably can't build on any other OS *anyway*
so libtool's slow and complex abstraction layer is totally wasted.
Might as well not use autoconf, automake, and libtool at all.
On the other hand, if you really *do* worry about portability on other
OS's (e2fsprogs builds on MacOS, NetBSD, Hurd, Solaris, etc.) then
using autoconf makes sense --- but I *still* don't think the
complexity of libtool is worth it.
= Add-on =
If you are going to be making one less major update, this is the
perfect time to make sure that data structures are allocated by the
library, and are (ideally) opaque to the calling application (so they
only manipulate structure poitners). That is, the structure
definition is not exposed in the public header file, and you use
accessor functions to set and get fields in the structure.
If you can't do that for all data structures, if you can do that with
your primary data structure that's going to make your life much easier
in the long term. For ext2fs, that's the file systme handle. It's
created by ext2fs_open(), and it's passed to all other library
functions as the first argument.
The other thing you might want to consider doing is adding a magic
number to the beginning of each structure. That way you can tell if
the wrong structure gets passed to a library. It's also helpful for
doing the equivalent of subclassing in C.
This is how we do it in libext2fs --- we use com_err to define the
magic numbers:
error_table ext2
ec EXT2_ET_BASE,
"EXT2FS Library version @E2FSPROGS_VERSION@"
ec EXT2_ET_MAGIC_EXT2FS_FILSYS,
"Wrong magic number for ext2_filsys structure"
ec EXT2_ET_MAGIC_BADBLOCKS_LIST,
"Wrong magic number for badblocks_list structure"
...
And then every single structure starts like so:
struct struct_ext2_filsys {
errcode_t magic;
...
struct ext2_struct_inode_scan {
errcode_t magic;
...
And then before we use any pointer we do this:
if (file->magic != EXT2_ET_MAGIC_EXT2_FILE)
return EXT2_ET_MAGIC_EXT2_FILE;