We should get rid of UTF-8 and UTF-32 and use CESU-8 and CESU-32 instead. Why? Because then we get to treat U+10000 to U+10FFFF as a COMPLETELY DIFFERENT ENCODING from U+0000 to U+FFFF!
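For anyone who hasn't met CESU-8: here is a rough sketch of what "completely different encoding" means in practice. The function name `cesu8_encode` is made up for this post, and it is a minimal illustration rather than a vetted codec. It goes through UTF-16 first, then writes each 16-bit unit (surrogates included) as its own UTF-8-style sequence.

```python
import struct


def cesu8_encode(text: str) -> bytes:
    """Sketch of CESU-8 encoding (hypothetical helper, not a library API)."""
    # Step 1: encode to UTF-16 so supplementary codepoints become surrogate pairs.
    utf16 = text.encode("utf-16-be")
    units = struct.unpack(f">{len(utf16) // 2}H", utf16)
    # Step 2: encode every 16-bit unit on its own; "surrogatepass" lets
    # Python emit the 3-byte form for lone surrogates instead of raising.
    return b"".join(chr(u).encode("utf-8", "surrogatepass") for u in units)


# BMP text is byte-for-byte the same as UTF-8.
print(cesu8_encode("A\u00e9").hex())       # c3a9 preceded by 41
# U+10000 takes 4 bytes in UTF-8 but 6 bytes (two 3-byte surrogates) here.
print("\U00010000".encode("utf-8").hex())
print(cesu8_encode("\U00010000").hex())
```

So everything up to U+FFFF looks like plain UTF-8, and everything above it takes the surrogate detour.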
The switch also sets a precedent for expanding Unicode through the upper planes: say, U+100000 to U+10FFFF repurposed as surrogate pairs for 2^30 new codepoints!
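Where the 2^30 comes from, as a quick sanity check. This assumes the range is split into equal "high" and "low" halves, the same way the existing surrogate block is; that split is my assumption, and `plane16_pair_count` is just a name invented for the sketch.

```python
def plane16_pair_count() -> int:
    """Count codepoints addressable by pairing halves of U+100000..U+10FFFF."""
    start, end = 0x100000, 0x10FFFF
    size = end - start + 1   # 65,536 codepoints in the range
    half = size // 2         # assumed split: 32,768 "high" + 32,768 "low"
    return half * half       # every high/low pair names one new codepoint


print(plane16_pair_count() == 2**30)
```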
#unicode #programming #utf8 #utf16 #utf32 #nesting #recursion #encoding