How are words counted in Hebrew?

By default a word is a whitespace-separated token, the same convention used by most word processors. Leading and trailing punctuation is stripped before counting, so quotation marks, the maqaf, and sentence punctuation do not create phantom words.

What are attached prefixes and why do they matter?

Hebrew attaches several one-letter particles directly to the following word without a space: the conjunction ו, the definite article ה, the prepositions ב, כ, ל, and מ, and the relativizer ש. Grammatically these are separate words, so the optional prefix mode adds one to the count for each word that carries such a particle.

Why separate Hebrew words from Latin words?

Mixed documents often interleave Hebrew with English brand names, citations, or numbers. Splitting the totals lets you see how much of the text is genuinely Hebrew versus Latin-script content, which is useful for translation quotes and localisation.
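As a rough illustration of the split, the sketch below classifies each whitespace token by checking for a character in the Hebrew Unicode block (U+0590 to U+05FF). The function names `is_hebrew` and `split_totals` are made up for this example, and punctuation stripping is left out to keep it short; it is not the tool's actual code.

```python
def is_hebrew(token: str) -> bool:
    """True if the token contains any character in the Hebrew block (U+0590-U+05FF)."""
    return any("\u0590" <= ch <= "\u05FF" for ch in token)


def split_totals(text: str) -> dict:
    """Count whitespace tokens, split into Hebrew and other-script totals."""
    tokens = text.split()  # split on runs of whitespace
    hebrew = sum(1 for t in tokens if is_hebrew(t))
    return {"hebrew": hebrew, "other": len(tokens) - hebrew}


# Two Hebrew words around one Latin-script brand name.
print(split_totals("קניתי iPhone חדש"))  # {'hebrew': 2, 'other': 1}
```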

Does the maqaf affect the count?

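The maqaf (Hebrew hyphen, ־) joins words much as a hyphen does in English. It is stripped as punctuation when it appears at the start or end of a token, so a stray maqaf never counts as a word of its own. Because tokens are split only on whitespace, a maqaf-joined pair written without spaces counts as one token, while the same pair written with spaces around the maqaf counts as two.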
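The following sketch shows how edge stripping could treat the maqaf, assuming tokens are split on whitespace and only leading and trailing punctuation is removed. The `EDGE_PUNCT` set and the `tokenize` name are assumptions chosen for illustration, not the tool's real punctuation list.

```python
# Assumed punctuation set: ASCII quotes, brackets, sentence marks, hyphen,
# en/em dashes, curly quotes, and the maqaf (U+05BE).
EDGE_PUNCT = "\"'()[]{}.,;:!?-\u2013\u2014\u201C\u201D\u2018\u2019\u05BE"


def tokenize(text: str) -> list:
    """Split on whitespace, strip edge punctuation, drop tokens left empty."""
    stripped = (t.strip(EDGE_PUNCT) for t in text.split())
    return [t for t in stripped if t]


# Maqaf inside a token: it is not at an edge, so the pair stays one token.
print(len(tokenize("בית\u05BEספר")))    # 1
# Maqaf surrounded by spaces: it strips to nothing and is dropped.
print(len(tokenize("בית \u05BE ספר")))  # 2
```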

Is my text sent to a server?

No. Counting runs entirely in your browser. Nothing is uploaded, so private manuscripts, legal text, or unpublished work stay on your device.

Hebrew Word Counter — Gera Tools

This counter gives a reliable word count for Hebrew, handling right-to-left input, stripping punctuation correctly, and offering a grammar-aware mode that counts Hebrew’s attached one-letter prefixes as separate words.

How it works

Text is split on runs of whitespace into tokens, and each token has its leading and trailing punctuation removed (quotation marks, parentheses, the maqaf ־, dashes, and sentence marks). A token is classified as Hebrew if it contains any character in the Hebrew Unicode block (U+0590 to U+05FF); otherwise it counts as a Latin or other-script word. When the prefix mode is on, any Hebrew word that begins with one of the inseparable particles — ו ה ב כ ל מ ש — and is long enough to have a stem is counted as carrying an extra grammatical word:

words = tokens.length
if countPrefixes: words += (number of Hebrew tokens that begin with ו/ה/ב/כ/ל/מ/ש and are long enough to have a stem)
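The steps above can be sketched in runnable form as follows. This is a minimal illustration, not the tool's source: the names `count_words`, `EDGE_PUNCT`, and `MIN_LETTERS` are hypothetical, and the threshold for "long enough to have a stem" is assumed here to be three Hebrew letters (one prefix plus a two-letter stem), since the exact rule is not stated.

```python
# Assumed edge punctuation: quotes, brackets, sentence marks, dashes, maqaf.
EDGE_PUNCT = "\"'()[]{}.,;:!?-\u2013\u2014\u201C\u201D\u2018\u2019\u05BE"
PREFIXES = "והבכלמש"  # the seven inseparable one-letter particles
MIN_LETTERS = 3        # assumption: prefix + at least a two-letter stem


def is_hebrew(token: str) -> bool:
    """True if the token contains any character in U+0590-U+05FF."""
    return any("\u0590" <= ch <= "\u05FF" for ch in token)


def has_attached_prefix(token: str) -> bool:
    """Hebrew token that starts with a particle and is long enough for a stem."""
    # Count only consonant letters (U+05D0-U+05EA) so vowel points are ignored.
    letters = sum(1 for ch in token if "\u05D0" <= ch <= "\u05EA")
    return is_hebrew(token) and token[0] in PREFIXES and letters >= MIN_LETTERS


def count_words(text: str, count_prefixes: bool = False) -> int:
    """Whitespace-token count, optionally adding one per attached prefix."""
    tokens = [s for s in (t.strip(EDGE_PUNCT) for t in text.split()) if s]
    words = len(tokens)
    if count_prefixes:
        words += sum(1 for t in tokens if has_attached_prefix(t))
    return words


print(count_words("הספר בבית"))                       # 2
print(count_words("הספר בבית", count_prefixes=True))  # 4
```

Like the pseudocode, this check looks only at the first letter, so a word whose root happens to begin with one of the seven letters is also counted as carrying a prefix.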

Example and notes

The phrase הספר בבית is two whitespace tokens. Grammatically it is four words (“the” + “book” + “in” + “house”) because ה and ב are attached particles. With the prefix mode on, each of the two tokens adds one extra word, so the count rises to four; with it off, you get the literal two-token count that matches a standard word processor. Use the plain count for length limits and the prefix-aware count when a teacher or editor counts grammatical words.