
> Unicode is a variable width abstract encoding;

To be a bit more explicit: Unicode is a character encoding to 20-and-a-half-bit 'bytes' (code points, U+0000 through U+10FFFF), and it is variable-width in those 'bytes' even before considering how the 'bytes' are encoded to actual bytes. E.g. "ψ̊" (Greek small letter psi with combining ring above) is U+03C8 U+030A (two 'bytes').
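A quick way to check this is a minimal Python sketch, using only the standard library. The variable names are mine, and the string is written with escapes so the result does not depend on how the source file is saved. It counts the code points first, then the actual bytes under three common encodings:

```python
import unicodedata

# One visible character, written as escapes: psi followed by a combining ring.
s = "\u03c8\u030a"

# Layer 1: code points (the 'bytes' in the comment above).
code_points = [f"U+{ord(c):04X}" for c in s]
names = [unicodedata.name(c) for c in s]

# Layer 2: actual bytes, which depend on the encoding form chosen.
utf8_len = len(s.encode("utf-8"))
utf16_len = len(s.encode("utf-16-le"))  # "-le" avoids the byte-order mark
utf32_len = len(s.encode("utf-32-le"))

print(len(s), code_points, names)
print(utf8_len, utf16_len, utf32_len)
```

So a single on-screen character is two code points here, and its byte count then varies again with the encoding.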


