The Traditional Chinese Unique Character Counter pulls every distinct Hanzi from a passage and ranks them by how often they occur. It reports total and unique character counts, which are the core figures behind vocabulary coverage and study planning.
How it works
The tool scans the text codepoint by codepoint and keeps only CJK Unified Ideographs, tallying each into a frequency map:
total chars = every Hanzi occurrence
unique chars = number of distinct Hanzi
coverage rank = characters sorted by descending count
Because Chinese frequency is steeply skewed, the top of the ranked list accounts for a large share of the running text. The unique-to-total ratio then summarises how varied the vocabulary is: a low ratio means heavy repetition, a high ratio means dense, diverse vocabulary.
Tips and notes
To maximise reading coverage per character studied, work down the frequency list from the top. The unique list doubles as a ready-made flashcard set for exactly the characters in your text. Punctuation, Latin letters, and digits are excluded so the counts are not inflated by non-Chinese content. The counter does not convert between Traditional and Simplified forms — it reports the characters precisely as they appear, so paste the script you intend to study.