Skip to main content
3 of 4
Improved capitalization.

The first 128 Unicode code points are the same as ASCII. Then they have a 100,000 or so more:

There are two common formats for Unicode, UTF-8 which uses 1-4 bytes for each value (so for the first 128 characters, UTF-8 is exactly the same as ASCII) and UTF-16, which uses 2 or 4 bytes.

wholerabbit
  • 11.6k
  • 2
  • 40
  • 73