You can get decent savings (~90-95% of the optimized size) with some brute force at the encoder level, i.e. spending more time testing different filters and rearranging the palette, but it takes a lot of CPU for little gain. At some point, you end up spending minutes to save a few bytes.
This service is much faster than that, so they probably don't use brute force at all.
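To give a rough idea of what "brute force at the encoder level" means, here is a minimal Python sketch. It applies each of the five standard PNG filter types to the whole image, deflates the result with `zlib`, and reports the sizes. The function names (`filter_row`, `brute_force_filters`) and the test image are made up for illustration. This is not how any particular optimizer or this service works, and real tools typically go further (per-row filter choice, several deflate settings, palette reordering), which is where the CPU time goes.

```python
import zlib

def paeth(a, b, c):
    # Paeth predictor from the PNG spec: pick the neighbour closest to a + b - c.
    p = a + b - c
    pa, pb, pc = abs(p - a), abs(p - b), abs(p - c)
    if pa <= pb and pa <= pc:
        return a
    return b if pb <= pc else c

def filter_row(ftype, row, prev, bpp):
    # Apply PNG filter `ftype` (0=None, 1=Sub, 2=Up, 3=Average, 4=Paeth) to one row.
    # `prev` is the unfiltered previous row (all zeros for the first row),
    # `bpp` is the number of bytes per pixel.
    out = bytearray([ftype])
    for i, x in enumerate(row):
        a = row[i - bpp] if i >= bpp else 0   # byte to the left
        b = prev[i]                            # byte above
        c = prev[i - bpp] if i >= bpp else 0  # byte above-left
        if ftype == 0:
            pred = 0
        elif ftype == 1:
            pred = a
        elif ftype == 2:
            pred = b
        elif ftype == 3:
            pred = (a + b) // 2
        else:
            pred = paeth(a, b, c)
        out.append((x - pred) & 0xFF)
    return bytes(out)

def brute_force_filters(rows, bpp):
    # Try every filter type on the whole image; return {filter_type: deflated_size}.
    sizes = {}
    for ftype in range(5):
        prev = bytes(len(rows[0]))
        stream = bytearray()
        for row in rows:
            stream += filter_row(ftype, row, prev, bpp)
            prev = row
        sizes[ftype] = len(zlib.compress(bytes(stream), 9))
    return sizes

# Hypothetical 256x64 8-bit grayscale gradient, just to have something to compress.
rows = [bytes((x + y) % 256 for x in range(256)) for y in range(64)]
sizes = brute_force_filters(rows, bpp=1)
best = min(sizes, key=sizes.get)
print(sizes, "-> best filter type:", best)
```

Even this naive version compresses the image five times over. Picking the filter per row, or trying multiple deflate strategies on top, multiplies the work again, which is the "minutes to save bytes" trade-off.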