Unicode Code Points to UTF-8

Compose Unicode code point sequences back into readable text

Ad placeholder (leaderboard)

From code points back to text

This tool is the inverse of a code-point exploder: give it the numeric Unicode code points and it reassembles the original characters. It is handy when a log, API response or escape sequence gives you raw code-point numbers and you want to see what they actually spell.

How it works

The input is split on spaces and commas into tokens, and each token is classified:

U+, 0x, \u, \U prefix      -> hexadecimal
contains a-f letters       -> hexadecimal
all decimal digits         -> decimal

The token is parsed to an integer in the right base, validated against the Unicode range (U+0000 to U+10FFFF) and against the surrogate gap (U+D800 to U+DFFF, which are not real characters), then passed to String.fromCodePoint. That builtin correctly emits a single character even for astral code points that JavaScript must store internally as a UTF-16 surrogate pair.

Tips and notes

You can mix notations in a single list — U+0047 0x65 114 61 is perfectly valid and resolves to “Gera” minus the last letter, for instance. If you have an ambiguous all-digit value that is meant to be hex, add a 0x or U+ prefix so it is not misread as decimal. Surrogate or over-range values are rejected with a clear message rather than silently producing the replacement character.

Ad placeholder (rectangle)