Hindi is written in Devanagari, an abugida where the natural unit is the
akshara — a consonant (or consonant cluster) carrying one vowel. Counting
raw Unicode code points overcounts badly, because a single akshara such as
क्ष is made of three code points. This tool counts the aksharas the way a
reader actually perceives them.
How it works
The counter walks through your text one code point at a time and groups them into aksharas using these rules:
- A consonant (
U+0915–U+0939, plus the nukta formsU+0958–U+095F) starts a new akshara. - A virama / halant (
्,U+094D) immediately after a consonant means the next consonant joins the same akshara as a conjunct, so it is not counted again. - An independent vowel (
अ,आ,इ…) is an akshara on its own. - Matras (
ा,ि,े…), anusvara/visarga/candrabindu, and nukta attach to the current akshara and never add to the count.
Everything else — spaces, punctuation, Latin letters, digits — simply ends the current akshara without adding a syllable.
Example
The word क्षत्रिय (kṣatriya) looks long in code points but is read as the
aksharas क्ष · त्रि · य — three syllabic units. A code-point counter would
report seven or more; this tool reports the three aksharas a Hindi reader sees.
Notes
- This is an orthographic count, very close to spoken syllables, and ideal for comparing text length, validating SMS-style limits, or teaching scripts.
- It is not a prosody tool: Hindi poetry uses a separate laghu/guru matra-weight system based on vowel length, which is outside this tool’s scope.