A few highlights:
U+200B ZERO WIDTH SPACE
This isn't a "space". It is an invisible character that can be used to provide line break opportunities.
U+2118 SCRIPT CAPITAL P
Should have been called calligraphic small p or Weierstrass elliptic function symbol, which is what it is used for. It is not a capital "P" at all.
These two are my favourite:
U+FE18 PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR BRAKCET
A spelling error: "brakcet" should be "bracket". A formal alias correcting this error has been defined.
U+FEFF ZERO WIDTH NO-BREAK SPACE
Byte Order Mark (Naming it ZWNBSP was a mistake from the start.)
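For anyone who wants to poke at these names, here is a minimal sketch using Python's standard `unicodedata` module. The constant names and the `describe` helper are made up for illustration, and the exact output depends on the Unicode database bundled with your Python version.

```python
import unicodedata

# Code points taken from the list above (constant names are illustrative).
ZWSP = "\u200b"
SCRIPT_P = "\u2118"
BRAKCET = "\ufe18"
BOM = "\ufeff"


def describe(ch: str) -> str:
    """Return 'U+XXXX <formal name> [<general category>]' for one character."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch)} [{unicodedata.category(ch)}]"


for ch in (ZWSP, SCRIPT_P, BRAKCET, BOM):
    print(describe(ch))

# The formal name keeps the typo, because character names are never changed.
misspelled = unicodedata.name(BRAKCET)

# Python 3.3+ also resolves formal aliases, so the corrected spelling
# should find the same character.
corrected = "PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR BRACKET"
same_char = unicodedata.lookup(corrected) == BRAKCET

# ZERO WIDTH SPACE is a format character, so str.isspace() rejects it.
zwsp_is_space = ZWSP.isspace()
```

Looking characters up through the alias is probably the friendlier option in real code, since nobody wants to remember to type "BRAKCET" on purpose.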
This one is interesting:
U+262B FARSI SYMBOL
This symbol is so named because, as a symbol of Iran, it cannot be encoded in ISO standards.
I'm not sure what prevents it from being named that way. Maybe the ISO standards don't let you have countries in code point names unless it's something like a flag or a country code?
@ionchy I assume that, because they're so unique, the weirdness with Iran comes from UN sanctions?? No clue how that would change standards, though.
@pounce Apparently it's because that would make it a logo, not a character: http://archives.miloush.net/michkap/archive/2005/01/29/363208.html