Thai is unusual: it has no sentence-ending punctuation. Within a clause the words run together with no spaces at all, and then a single space is used to mark the boundary between phrases or sentences. A line break is the strongest boundary. This makes counting Thai sentences a segmentation problem rather than a punctuation-splitting one.
How it works
The counter uses the standard Thai segmentation heuristic:
- A line break is a sentence boundary.
- A run of whitespace between Thai chunks is a phrase or sentence boundary, so each whitespace-separated chunk is counted as one unit.
- The repetition mark
ๆ(mai yamok) and the abbreviation markฯ(paiyannoi) stay inside a sentence and never end one.
If you paste mixed-script text that includes ., !, ?, or the full-width
。!?, those are honoured as explicit terminators, and the tool reports the
larger of the two heuristics so it never under-counts.
Example
The text:
ภาษาไทยสวยงามมาก ฉันชอบเรียนภาษาไทย
has one space in the middle, separating two phrases — so it counts as two sentence units. Each chunk of joined Thai characters is one unit.
Notes
- Thai has no period; spaces and line breaks carry sentence boundaries.
- The repetition mark
ๆand abbreviation markฯare never treated as terminators. - Counts are approximate by nature because Thai sentence boundaries are context-dependent.
- Everything runs locally; your text never leaves the browser.