Possible shuffle improvement? #6

siiky · 2023-03-26T01:20:05Z

Kind of related to #5: I believe it may be possible to improve the performance of the shuffle operation in gochan-select*.

Here's my proposed alternative:

(define (shuffle l)
  (if (null? l)
      '()
      (let ((x (car l))
            (l (shuffle (cdr l))))
        (if (zero? (pseudo-random-integer 2))
            (cons x l)
            (append l `(,x))))))

Thinking about it (though not too much, admittedly), I believe using this rather than (map ... (sort ... (map ... l))) would improve that operation from 2 traversals+sorting of the original list (which should be roughly 3 traversals total?¹) to roughly 2 traversals (assuming 50% chance of (pseudo-random-integer 2) returning either 0 or 1²).
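For reference, the (map ... (sort ... (map ... l))) shape mentioned above is the usual decorate-sort-undecorate shuffle. Here is a minimal sketch, assuming CHICKEN 5's (chicken random) and (chicken sort) modules. The name shuffle/sort and the use of pseudo-random-real as the sort key are assumptions for illustration, not necessarily what gochan does:

```scheme
(import (chicken random)  ; pseudo-random-real
        (chicken sort))   ; sort

;; Hypothetical name, for illustration only; not gochan's actual code.
(define (shuffle/sort l)
  (map cdr                                   ; 3. strip the random keys
       (sort
        (map (lambda (x)                     ; 1. tag each item with a random key
               (cons (pseudo-random-real) x))
             l)
        (lambda (a b)                        ; 2. order the pairs by key
          (< (car a) (car b))))))

(shuffle/sort '(a b c d e))
```

Every element gets an independent key, so no input position is favoured over another; the cost is two maps plus one sort.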

I have no idea how I could benchmark this, however. You're certainly more familiar with the codebase, so if you think this is worth exploring I would appreciate some pointers/ideas. :)

One question I have is whether it would be a problem for shuffle not to be tail-recursive.

And another question is whether the result would be random enough to work well as a load-balancer.

¹ Depends on sorting algorithm! And assuming that map is implemented as a tail-recursive procedure with a reverse at the end, as usual, the total number of traversals is actually 5+.
² Worst case scenario, putting the first half of the elements to the left and then the second half all to the right, total number of traversals would be roughly 3.


kristianlm · 2023-07-27T20:46:00Z

Hi @siiky, and sorry for this embarrassingly long response time. I really appreciate when people use software I've written and I really need to get better at showing that ...

Anyhow, I had the same idea when I was implementing this. The problem with this way of shuffling is that it isn't fair: the first element in a 10-element list has a 50% chance of becoming the first element in the resulting list, but it should be 10%. As for the last element's chance of becoming the first: I'm no statistician, but is that 0.5^10?

(list-tabulate 10 (lambda (x) (car (shuffle '(0 1 2 3 4 5 6 7 8 9)))))
#;=> (0 0 1 1 0 4 1 1 3 0)
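The bias can be measured directly by tallying which element lands first over many runs. Working through the recursion, element k (counting from 0) of an n-element list ends up first only if every earlier element was appended and k itself was consed, which gives 0.5^(k+1). The exception is the last element, which is alone when it is placed and so gets 0.5^(n-1), that is 0.5^9 for ten elements. Here is a minimal sketch, assuming CHICKEN 5's (chicken random); first-position-counts is a hypothetical helper, not part of gochan:

```scheme
(import (chicken random))  ; pseudo-random-integer

;; The shuffle proposed at the top of this issue, copied unchanged.
(define (shuffle l)
  (if (null? l)
      '()
      (let ((x (car l))
            (l (shuffle (cdr l))))
        (if (zero? (pseudo-random-integer 2))
            (cons x l)
            (append l `(,x))))))

;; Hypothetical helper: shuffle the list (0 1 ... n-1) `trials` times and
;; return a list whose k-th entry counts how often k came out first.
(define (first-position-counts n trials)
  (let ((counts (make-vector n 0))
        (input (let loop ((i (- n 1)) (acc '()))
                 (if (< i 0) acc (loop (- i 1) (cons i acc))))))
    (do ((i 0 (+ i 1)))
        ((= i trials) (vector->list counts))
      (let ((winner (car (shuffle input))))
        (vector-set! counts winner (+ 1 (vector-ref counts winner)))))))

(first-position-counts 10 100000)
```

A fair shuffle would put every count near trials/n (about 10000 each here). With this shuffle the counts should instead roughly halve from one element to the next.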

I don't know if this would actually be a problem in practice. But I'd rather take the performance hit than take chances. Also, I think the list of channels involved is usually bound to around 3-5 items. Or does your experience contradict that?

It would be interesting to benchmark gochan, though. I suspect it'd perform painfully slowly.

siiky · 2023-07-28T14:03:17Z

Makes sense, I understand fairness is important.

> Also, I think the list of channels involved is usually bound to around 3-5 items. Or does your experience contradict that?

Yeah, I agree, I think it should be the most common case to select on only a few channels at a time. Unless someone is using it in some crazy way with macros that generate tons of channels.

Unfortunately I ended up dropping the gochan experiment because of tight deadlines and all. I was hitting deadlocks, with no clue why, and was forced to move on.

BTW (shameless plug) some weeks ago I started working on something you might be interested in, since you seem to be interested in message-passing. It's an egg wrapping the Erl_Interface library, the C library used to read/write Erlang's External Term Format (a binary format): https://git.sr.ht/~siiky/experiments/tree/main/item/erlang-interface.scm

It's not done yet, though: some reader functions segfault, the usual C stuff :)

kristianlm · 2023-07-29T13:52:17Z

Do you reckon the deadlocks were caused by "user error", or some bug in gochan? If it's the latter, I'd like to investigate.

siiky mentioned this issue Mar 26, 2023

How to do prioritize select? #7

Closed
