Unicode Block Reference

Browse Unicode blocks and their code-point ranges

Search every named Unicode block by name or code point and see its start, end, and total code-point span. A fast reference for developers working with multilingual text and emoji.

What is a Unicode block?

A Unicode block is a contiguous, named range of code points reserved for a related group of characters, such as Basic Latin or Cyrillic. Blocks help organise the code space and are defined in the Unicode Character Database.

Find any Unicode block fast

Unicode organises its more than a million possible code points into named blocks — contiguous ranges set aside for a related family of characters. Knowing which block a character lives in helps when you build fonts, validate input, filter by script, or debug mojibake. This tool lets you search the common blocks by name or by a code point and instantly see the range it occupies.

How it works

Each block has a fixed start and end code point, defined in the Unicode Character Database file Blocks.txt. When you type a name, the tool matches it against block names. When you type a code point such as U+1F600, it parses the hexadecimal value and finds the block whose start and end span contains it — a simple range check. The point count column is the size of that span, computed as end - start + 1. Note that some code points inside a block may be unassigned, so the span is an upper bound on assigned characters.

Tips and notes

Code points are written in hexadecimal with a U+ prefix, so U+0041 is the capital letter A and U+1F680 is the rocket emoji. The largest single block shown here is CJK Unified Ideographs (U+4E00 to U+9FFF), which holds the core set of Chinese, Japanese, and Korean characters. Emoji are spread across several blocks, including Emoticons, Transport and Map Symbols, and Miscellaneous Symbols and Pictographs. All searching happens locally in your browser, so nothing you type is uploaded.