But since it’s AMP and not SMP, sharing work across cores doesn’t necessarily work how you expect it to.
128 bytes is perfect 2 x 64! So even if the risk of cache invalidation goes up even if two cores are not writing to the exact same structure the alignment still works!
Good job Apple!