Vietnamese Word Counter

Count words and syllable-words in space-delimited Vietnamese text

Count Vietnamese text two ways: raw space-delimited syllable tokens and an estimate of multi-syllable compound words using a common-word lookup. Also shows characters and sentences. Runs entirely in your browser.

Why are there two different word counts?

Written Vietnamese separates every syllable with a space, so a simple split counts syllables (tiếng), not words. Many real words are compounds of two or more syllables, like máy bay (airplane). The tool reports both so you can pick the count that fits your need.

This counter is built for the way Vietnamese is actually written. Because every syllable is separated by a space, an ordinary word counter reports syllables rather than words. This tool gives you both numbers: the exact count of space-delimited syllable tokens and an estimate of true lexical words.

How it works

Vietnamese orthography separates each syllable, called a tiếng, with a space. A lexical word, or từ, may be one syllable or a compound of several. The tool first counts every space-delimited token. It then walks through adjacent pairs and, when a pair appears in a built-in list of common compounds such as xe đạp or gia đình, merges them into a single lexical word. Characters and sentences are counted directly from the text.

Tips and notes

Use the syllable count when you care about layout, typesetting, or character budgets. Use the lexical-word estimate when comparing against an English word count or pricing a translation, since one English word often maps to a Vietnamese compound. The estimate is deliberately conservative: only well-known compounds are merged, so unusual or technical phrases will still count as separate syllables.