Explore the Unicode character space
Unicode assigns every character a unique number called a code point, written in
hexadecimal with a U+ prefix. Code points are organised into named blocks like
Basic Latin, Greek, Currency Symbols and Emoticons. This browser lets you scan a
whole block as a grid of glyphs, or jump straight to a single code point by its
hex value to read its decimal number, HTML entity and owning block.
How it works
When you pick a block, the tool iterates from its start code point to its end and
renders each one with String.fromCodePoint(cp). The U+ label is produced by
converting the code point to base 16 and zero-padding to four digits. The hex
search parses your input after stripping an optional U+ or 0x prefix, then
finds which block contains it by checking each block’s inclusive start and end
boundaries. The HTML entity shown is the numeric form &#decimal;, which is
universally supported.
Notes and examples
- The euro sign is
U+20AC, decimal 8364, entity€, in the Currency Symbols block. - Emoji such as the grinning face
U+1F600live in supplementary planes aboveU+FFFFand are stored internally as surrogate pairs. - Surrogate code points
U+D800-U+DFFFare reserved for that pairing mechanism and are never valid characters on their own, so they are skipped. - Glyph rendering depends entirely on the fonts installed on your device; the numbers and entities are always accurate even when a box appears.