Urdu does not end sentences with a dot. Its full stop is the khari pai ۔
(U+06D4), a short vertical bar, and its question mark is the mirrored Arabic
؟ (U+061F). A counter that only looks for the ASCII period would treat an
entire Urdu paragraph as a single sentence. This tool recognises the genuine
Urdu terminators.
How it works
The text is split on runs of sentence-ending marks: the Urdu full stop ۔, the
Arabic question mark ؟, the exclamation mark !, the ASCII period . (for
digitised text), and the ellipsis …. Consecutive terminators collapse into one
break.
Each segment that still contains Urdu or alphanumeric content counts as one
sentence. The Urdu comma ، (U+060C) and the Urdu semicolon ؛ (U+061B) are
not split on — they mark clause boundaries inside a sentence.
Example
The passage:
اردو ایک خوبصورت زبان ہے۔ کیا آپ اردو بولتے ہیں؟
is two sentences: a statement closed by the khari pai ۔ and a question closed
by the Arabic question mark ؟. A comma inside either clause would not raise the
count.
Notes
- The khari pai
۔is the default Urdu sentence ending — not the dot. - Clause separators
،and؛are ignored on purpose. - The average-words-per-sentence figure is a quick readability gauge.
- Everything runs locally; your text never leaves the browser.