Why does E-Mail-Adresse count as one word?

In German a hyphen inside a compound joins parts of a single word, so E-Mail-Adresse is one lexical unit. The counter keeps hyphen-joined tokens together rather than splitting on every hyphen.

How are the em-dash and en-dash treated?

The em-dash (—) and en-dash (–) are punctuation used to separate clauses or words, not to join them. The tool replaces them with spaces before counting, so Berlin—München counts as two words.

Are ä, ö, ü and ß counted as letters?

Yes. The umlauts ä ö ü, the eszett ß, and the capital ẞ are all treated as ordinary word characters, so they never break a word into pieces or get dropped from the character count.

How are sentences detected?

Sentences are counted from terminal punctuation: full stops, question marks, exclamation marks, and the ellipsis. If text has words but no terminal punctuation, it counts as one sentence.

Does any text leave my browser?

No. All counting happens locally in JavaScript in your browser. Nothing is uploaded, logged, or sent to a server, so it is safe for confidential documents.

German Word Counter — Gera Tools

Email me this result

Get this tool's output sent to your inbox, plus one useful tool a week. No spam, unsubscribe any time.

German word counting is trickier than English because the language builds long compound nouns and uses both the hyphen (as a joiner) and the dash (as a separator). This counter applies the correct rules so your totals match how a German editor would count.

How it works

A token is recognised as a sequence of German word characters — the Latin letters plus ä ö ü ß ẞ and digits — optionally joined by hyphens or apostrophes. Because the hyphen joins, E-Mail-Adresse and Donau-Dampfschiff each count as a single word.

Before tokenising, every em-dash — and en-dash – is replaced by a space, so these punctuation dashes act as word boundaries. That means Berlin—München splits into two words, while E-Mail-Adresse stays as one. Sentences are counted from terminal punctuation (. ! ? …), and the longest token is tracked so you can see your biggest compound.

Example

For the text Die Donaudampfschifffahrtsgesellschaft schickt eine E-Mail-Adresse. Berlin—München ist weit. the counter reports five words — Donaudampfschifffahrtsgesellschaft is one long word, E-Mail-Adresse is one hyphenated compound, and Berlin—München is split into two. Two sentences are detected, and the longest word is the 34-letter Danube-steamship compound.

Notes

Use this when you localise English copy into German and need accurate counts for layout, subtitles, or character limits — German runs roughly 10-30% longer than English, and compound handling materially changes the totals.