Key #95 in 10 1/2 hours on a single machine? That's impressive.
Why so many - 256 - kangaroos? Everyone says it is not optimal. Do you have some new insights on this?
Who says it's not optimal?
Almost 38 million kangaroos spread across 4 Tesla V100.
Only 86% of the expected run time was needed.
Each kangaroo has to produce at least a few distinguished points.
All these points have to be stored in a hash table, for efficient collision detection.
Experiments show that a collision happens at 3-12 DP per kangaroo.
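For anyone who wants to see the distinguished-point idea in code, here is a minimal sketch. It is not the GPU program discussed in this thread: it works in a toy multiplicative group mod a prime instead of on secp256k1, and the function name kangaroo_dp, the parameters n_per_herd and dp_bits, the jump table and the restart rule are all assumptions made for illustration. Every kangaroo hops by a step that depends only on its current position, and only positions with dp_bits low zero bits are written to the hash table. A tame and a wild kangaroo landing on the same stored point gives the key.

```python
import random

def kangaroo_dp(g, h, p, lo, hi, n_per_herd=4, dp_bits=5,
                max_steps=2_000_000, seed=1):
    """Return x with pow(g, x, p) == h, assuming the key lies in [lo, hi].

    Toy sketch: p must be prime and g coprime to p. Returns None if
    max_steps is exhausted without a tame/wild collision.
    """
    rng = random.Random(seed)
    width = hi - lo + 1
    k = max(1, width.bit_length() // 2 + 1)      # number of jump sizes
    jumps = [1 << i for i in range(k)]           # 1, 2, 4, ...
    jump_pts = [pow(g, s, p) for s in jumps]     # precomputed g^s
    dp_mask = (1 << dp_bits) - 1

    def spawn(tame):
        if tame:
            e = rng.randrange(lo, hi + 1)        # exponent fully known
            return [pow(g, e, p), e, True]
        off = rng.randrange(width)               # exponent is key + off
        return [h * pow(g, off, p) % p, off, False]

    herd = [spawn(i % 2 == 0) for i in range(2 * n_per_herd)]
    table = {}                                   # DP -> (is_tame, distance)

    for step in range(max_steps):
        roo = herd[step % len(herd)]
        y, dist, tame = roo
        if (y & dp_mask) == 0:                   # distinguished point
            seen = table.get(y)
            if seen is None:
                table[y] = (tame, dist)
            elif seen[0] != tame:                # tame meets wild
                t, w = (dist, seen[1]) if tame else (seen[1], dist)
                x = (t - w) % (p - 1)            # valid by Fermat's little theorem
                if pow(g, x, p) == h:
                    return x
            else:
                # Same herd: this kangaroo now follows a known path,
                # so restart it from a fresh random position.
                roo[:] = spawn(tame)
                continue
        i = y % k                                # jump depends only on y
        roo[0] = y * jump_pts[i] % p
        roo[1] = dist + jumps[i]
    return None

# Small demo with arbitrary illustrative parameters.
p = (1 << 61) - 1
g = 37
h = pow(g, 123456, p)
x = kangaroo_dp(g, h, p, 0, (1 << 20) - 1)
print(x is not None and pow(g, x, p) == h)
```

The table only grows by one entry per distinguished point, so raising dp_bits saves memory but makes each kangaroo walk longer before a collision can be noticed. That trade-off is why the number of DP per kangaroo matters when the herd is large.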
And since no one bothered to post the key from the image:
#95 - 0000000000000000000000000000000000000000527a792b183c7f64a0e8b1f4