Counting Simplified Chinese text needs more than a naive length check, because a single Han character takes one position on screen but three bytes in storage, and there are no spaces to mark word boundaries. This tool reports characters, UTF-8 bytes, and an estimated word count so you can match whatever limit you are working against.
How it works
Characters are counted as Unicode code points, which means each Han ideograph counts once. The tool classifies a character as Han if it falls in the CJK Unified Ideographs blocks or the compatibility range, and reports the Han total separately from other characters.
The byte count is produced by encoding the text as UTF-8, where each Chinese character is three bytes and each ASCII character is one. The word estimate divides the Han character count by about 1.5, the commonly used average number of characters per Chinese word.
Example and notes
A sentence mixing Chinese, English, and digits will show more bytes than characters because the Chinese portion triples in storage. Use the byte figure when you face a strict database column width or an SMS segment limit, and the character or estimated word figure for editorial length targets. Everything runs locally, so private documents stay on your device.