Description
This is an issue extracted from #47066 (comment) which is caused by an issue that any OSX compilation with debuginfo ends up being nondeterministic. Specifically (currently known at least) the source of nondeterminism is that an mtime for an object file winds up in the final binary.
It turns out this isn't really our fault (unfortunately that makes it harder to fix!). This can be reproduced with just C and a linker:
# Compile an object file with a symbol in it
$ echo 'void foo() {}' > foo.c
$ cc -g foo.c -o foo.o -c
# Link that object to a shared library, take a look at the output
$ cc foo.o -m64 -dynamiclib -o libfoo.dylib -Wl,-dylib
$ md5 libfoo.dylib
MD5 (libfoo.dylib) = e60e735b7c919c19259daddd04a625c8
# update the timestamp on the object file
$ sleep 1
$ touch foo.o
# now link the same way we did above
$ cc foo.o -m64 -dynamiclib -o libfoo.dylib -Wl,-dylib
$ md5 libfoo.dylib
MD5 (libfoo.dylib) = 9754a78562696bbe5912efd9fc892a83
Here we're using the exact same object file (with two timestamps) and we're seeing different linked artifacts.
This is a source of bugs in programs that expect rustc to be deterministic (aka #47066 as was originally stated) and is something that we as rustc should probably fix.
Unfortunately I don't really know of a fix for this myself. I'd be tempted to take a big hammer to the problem and deterministically set all mtime fields for objects going into the linker to a known fixed value, but that unfortunately doesn't fix the determinism for C code (whose objects we don't control) and also is probably too big of a hammer (if a build system uses the mtime of the object to control rebuilds it'd get mixed up).
We could also use something like goblin
and reach in to the specific field and remove the actual data. I found it in a symbol section with the N_OSO
type (described in various documents online too apparently). We may be able to postprocess all output artifacts on OSX to maybe just zero out these fields unconditionally (or set them to something like May 15, 2015), although I'm not actually sure if this would be easy to do.
Activity
alexcrichton commentedon Dec 31, 2017
cc @michaelwoerister, @johnklai1
cc @luser (you're probably interested in this for the sccache ramifications like @johnklai1 is)
ranma42 commentedon Dec 31, 2017
If we used LLD for linking (#39915), it would be possible to fix this in the linker (by providing a flag to ignore mtime).
This would also fix (this part of) deterministic compilation for other languages as well.
est31 commentedon Dec 31, 2017
Source code for the darwin linker seems to be available, but I have no idea whether they take patches. Maybe LLVM develpers know more. Most likely, Apple will switch to LLD eventually.
est31 commentedon Dec 31, 2017
I wonder what experts on deterministic builds (@infinity0 ) can say about this.
luser commentedon Jan 3, 2018
Hm. I wonder why we haven't noticed this for Firefox builds? Maybe the version of the linker we're using has a patch to work around this? We're using https://github.com/tpoechtrager/cctools-port for our builds.
alexcrichton commentedon Jan 3, 2018
@luser that is indeed surprising! The source code there also slurps in the mtime, but that may be getting postprocessed somewhere else perhaps.
luser commentedon Jan 3, 2018
We discussed this on IRC, and I suspect the reason is that nobody has actually tried to do unstripped reproducible Firefox builds for macOS (although I'm not 100% sure). The info in question are STABS entries used by dsymutil to link the debug info from the object files into the dSYM.
This isn't critical for sccache currently, since it doesn't cache linker outputs.
luser commentedon Jan 3, 2018
Related: @metajack noticed that static archives are not reproducible on macOS because Apple's
ar
tool puts timestamps in the archive (mozilla/sccache#169).johnklai1 commentedon Jan 4, 2018
Right, I think the impact on sccache is similar to mozilla/sccache#169.
What I am seeing is that the .dylib non-determinism is causing unexpected cache misses in sccache since the dylibs end up being passed to rustc.
Here is an example for
cargo_metadata
:luser commentedon Jan 4, 2018
Ah, right, proc macro crates!
20 remaining items