Traditional Chinese Unique Character Counter

List all unique Traditional Chinese characters with occurrence counts

Extracts every unique Traditional Chinese (Hanzi) character from a passage and ranks them by frequency, giving total and unique counts for vocabulary-coverage and study planning. Runs entirely in your browser.

What counts as a character here?

The tool counts CJK Unified Ideographs — the Hanzi block used by both Traditional and Simplified Chinese. Latin letters, digits, spaces, and punctuation are excluded, so the totals reflect only Chinese characters.

The Traditional Chinese Unique Character Counter pulls every distinct Hanzi from a passage and ranks them by how often they occur. It reports total and unique character counts, which are the core figures behind vocabulary coverage and study planning.

How it works

The tool scans the text codepoint by codepoint and keeps only CJK Unified Ideographs, tallying each into a frequency map:

total chars   = every Hanzi occurrence
unique chars  = number of distinct Hanzi
coverage rank = characters sorted by descending count

Because Chinese frequency is steeply skewed, the top of the ranked list accounts for a large share of the running text. The unique-to-total ratio then summarises how varied the vocabulary is: a low ratio means heavy repetition, a high ratio means dense, diverse vocabulary.

Tips and notes

To maximise reading coverage per character studied, work down the frequency list from the top. The unique list doubles as a ready-made flashcard set for exactly the characters in your text. Punctuation, Latin letters, and digits are excluded so the counts are not inflated by non-Chinese content. The counter does not convert between Traditional and Simplified forms — it reports the characters precisely as they appear, so paste the script you intend to study.