Count Persian text accurately
Counting Persian characters naively misses the details that matter: the script includes invisible joiners, optional diacritics, and four letters that Arabic does not have. This counter handles the Perso-Arabic script properly and breaks the total down so you know exactly what is in your text.
How it works
The tool iterates over your text codepoint by codepoint and classifies each character:
- Persian-script letters fall in the Perso-Arabic Unicode ranges. Among them, the Persian-specific letters پ چ ژ گ ک ی are tracked separately.
- Digits include Persian (۰–۹), Arabic-Indic, and ASCII numerals.
- Diacritics (harakat) are combining marks in U+064B–U+065F and related ranges; you can include or exclude them from the total.
- ZWNJ (U+200C) is counted on its own because it is invisible.
Example and notes
The phrase پژواک (“echo”) contains the Persian-specific letters پ and ژ, which
the counter flags. If you paste fully vowelised text with harakat, toggle
“Exclude diacritics” to get a count that matches the visible letters.
All processing is local — your text never leaves the browser.