
In this article, Joel says he "decided to do everything internally in UCS-2 (two byte) Unicode". He fell into the same trap that MySQL did and the software he describes would also fail on Emoji.

It is still a very informative piece and Joel was way ahead of the curve by supporting and evangelizing Unicode at all in 2003. But it is not the best article to point the OP at, as it does not mention the BMP or discuss proper handling of characters beyond the BMP.
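To illustrate the point about emoji (a hypothetical sketch, not from Joel's article): emoji live beyond the BMP, above U+FFFF, so they do not fit in a single fixed-width 16-bit UCS-2 unit. In UTF-16 they become a surrogate pair of two 16-bit code units, which is exactly the case a UCS-2 assumption mishandles.

```python
# Sketch: why a UCS-2 (fixed 16-bit) assumption breaks on emoji.
emoji = "\U0001F600"  # 😀, U+1F600

# The code point is above the 0xFFFF ceiling of UCS-2:
print(hex(ord(emoji)))  # 0x1f600

# In UTF-16 it takes a surrogate pair: two 16-bit code units, not one.
utf16 = emoji.encode("utf-16-be")
print(len(utf16) // 2)  # 2
```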



I never got that. He wrote a great article about the issue and then ended with using UCS-2. I always wondered if there was something I missed that made him choose UCS-2 over UTF-8, since UTF-8 can represent every Unicode code point.
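A quick sketch of that property: UTF-8 is variable-width, spending between one and four bytes per character depending on the code point, and it covers the entire Unicode range.

```python
# Sketch: UTF-8 byte lengths grow with the code point,
# covering everything from ASCII to emoji.
for ch in ["A", "é", "€", "\U0001F600"]:
    print(f"U+{ord(ch):04X} -> {len(ch.encode('utf-8'))} byte(s)")
# A is 1 byte, é is 2, € is 3, 😀 is 4.
```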


I imagine it's linked to Win32 and .NET using UCS-2 internally: that way there's no need to convert strings before and after API calls.


Actually, Joel's article is itself outdated in places. For example, he says code points 128 and above are stored using 2, 3, and in fact up to 6 bytes; since RFC 3629 (2003), UTF-8 is limited to at most 4 bytes per character, which is enough to reach the Unicode ceiling of U+10FFFF.

See:

What is the maximum number of bytes for a UTF-8 encoded character?

http://stackoverflow.com/questions/9533258/what-is-the-maxim...
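As a small check of the 4-byte limit discussed there (a sketch, not from the linked answer): the highest valid code point, U+10FFFF, encodes to exactly four bytes in UTF-8.

```python
# Sketch: the top of the Unicode range still fits in 4 UTF-8 bytes.
highest = chr(0x10FFFF)
print(len(highest.encode("utf-8")))  # 4
```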



