Try to make random sampling for unbinding reproducible #16

jchelly · 2024-02-15T11:53:26Z

This makes the seed for the random shuffle used in unbinding depend on the particle IDs so that the random number sequence no longer depends on the (unpredictable) order in which threads process halos. It also adds a parameter that can be used to change the seed so that we can run multiple realizations. Particle IDs are xor'd with the supplied parameter and then used to initialize the random number generator. I'm not sure if some other operation would make more sense.

I think this change also fixes two problems with the original code:

The random number generator was never seeded so we always got the same, default seed on all MPI ranks
Calling std::random_shuffle from multiple threads without specifying the random number generator may have used a non thread-safe default generator

There might still be some non-deterministic behaviour due to rounding error in reduction operations depending on the order of execution.

jchelly · 2024-02-15T15:37:13Z

This still doesn't seem to be enough to make runs deterministic with >1 threads (tested by running on L1000N0900/DMO_FIDUCIAL twice.)

jchelly · 2024-04-29T10:49:32Z

It looks like TrackIds somehow get assigned in a non-deterministic way when we use multiple threads. In the small Colibre test I get the same halo propertes in snapshot 000 but the TrackIds differ.

jchelly · 2024-04-29T11:14:53Z

FeedCentrals() assigns subhalo indexes in a non-deterministic way using a critical section. It could be fixed using an ordered section or just removing the openmp directives.

jchelly · 2024-04-29T12:51:38Z

With the ordered section fix two colibre test runs now generate identical output for snasphots 0-10 (of 15) but diverge after that.

jchelly added 3 commits February 14, 2024 14:39

Seed RNG for each halo using first particle ID

2ddda76

Use multiple particle IDs to seed the random number generator

6e7fd48

Add parameter to modify random number seed (via xor with particle IDs)

83ad894

Merge branch 'master' into random_numbers

45af48e

Force deterministic ordering of new halo creation

52efebc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Try to make random sampling for unbinding reproducible #16

Try to make random sampling for unbinding reproducible #16

jchelly commented Feb 15, 2024

jchelly commented Feb 15, 2024

jchelly commented Apr 29, 2024

jchelly commented Apr 29, 2024

jchelly commented Apr 29, 2024

Try to make random sampling for unbinding reproducible #16

Are you sure you want to change the base?

Try to make random sampling for unbinding reproducible #16

Conversation

jchelly commented Feb 15, 2024

jchelly commented Feb 15, 2024

jchelly commented Apr 29, 2024

jchelly commented Apr 29, 2024

jchelly commented Apr 29, 2024