How is Hungarian alphabetical order different?

Hungarian has 8 digraphs (cs, dz, gy, ly, ny, sz, ty, zs) and one trigraph (dzs) that each count as a single letter. So 'csak' sorts after 'cukor' because 'cs' comes after 'c' in the alphabet, even though plain string comparison would put it before.

Where do cs and sz sort?

Each digraph sorts immediately after its base letter: cs after c, dz after d, gy after g, ly after l, ny after n, sz after s, ty after t, zs after z. The trigraph dzs sorts after dz.

How are accented vowels ordered?

Accented vowels sort right after their base: a before á, e before é, i before í, o before ó before ö before ő, and u before ú before ü before ű. Each accented vowel has its own position; it is not merged with the plain vowel.
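
As a rough illustration, the letter order described in the answers above can be written as a list and turned into a rank lookup. This is a Python sketch, not the tool's actual code: the names ALPHABET and RANK are made up for the example, and including q, w, x and y (which occur only in loanwords and names) is an assumption about how the table is built.

```python
# Hungarian letters in alphabetical order. Digraphs and the trigraph dzs
# follow their base letter; accented vowels follow the plain vowel.
# Assumption: q, w, x, y are given their extended-alphabet slots.
ALPHABET = [
    "a", "á", "b", "c", "cs", "d", "dz", "dzs", "e", "é", "f", "g", "gy",
    "h", "i", "í", "j", "k", "l", "ly", "m", "n", "ny", "o", "ó", "ö", "ő",
    "p", "q", "r", "s", "sz", "t", "ty", "u", "ú", "ü", "ű", "v", "w", "x",
    "y", "z", "zs",
]

# Map each letter to its position so letters can be compared by rank.
RANK = {letter: index for index, letter in enumerate(ALPHABET)}

print(RANK["a"] < RANK["á"] < RANK["b"])              # True
print(RANK["o"] < RANK["ó"] < RANK["ö"] < RANK["ő"])  # True

# By contrast, raw code points place á (225) after z (122).
print("á" > "z")                                      # True
```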

Does it handle the ambiguous double-digraph cases?

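Only partly. Hungarian normally writes a doubled digraph in short form (ssz stands for sz+sz, not s+sz). This tool does not expand such forms: it applies standard greedy digraph matching, so ssz is read left to right as s followed by sz, and a compound such as házsor (ház + sor) is read with zs. This matches the common dictionary convention in most cases, and truly ambiguous compounds are rare.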
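
To make the greedy behaviour concrete, here is a small Python sketch of longest-match tokenisation. It is an assumption about how such matching could be written, not the tool's source; MULTI and tokenise are hypothetical names.

```python
# Multi-character letters, longest first so that dzs is tried before dz.
MULTI = ("dzs", "cs", "dz", "gy", "ly", "ny", "sz", "ty", "zs")

def tokenise(word: str) -> list[str]:
    """Split a lower-case word into Hungarian letters, left to right."""
    tokens = []
    i = 0
    while i < len(word):
        for letter in MULTI:
            if word.startswith(letter, i):
                tokens.append(letter)
                i += len(letter)
                break
        else:
            # No multi-character letter starts here: take one character.
            tokens.append(word[i])
            i += 1
    return tokens

print(tokenise("hosszú"))  # ['h', 'o', 's', 'sz', 'ú']
print(tokenise("házsor"))  # ['h', 'á', 'zs', 'o', 'r']
print(tokenise("dzsem"))   # ['dzs', 'e', 'm']
```

The first two outputs show the limits of greedy matching: the short form ssz is not expanded to sz+sz, and the z and s of the compound ház + sor are joined into zs.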

Is the sort case-insensitive?

Yes. Comparison is done case-insensitively so 'Alma' and 'alma' sort together. Within equal keys the original order is preserved (stable sort).
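
The two properties in this answer can be illustrated with a short Python sketch. It uses plain case-folded strings as keys rather than the Hungarian letter ranks, purely to show case-insensitivity and stability; it is not the tool's code.

```python
words = ["Banán", "alma", "banán", "Alma"]

# casefold() is applied only to the comparison key, so the original
# spelling is kept in the output. sorted() is a stable sort: entries
# with equal keys stay in their input order.
result = sorted(words, key=str.casefold)
print(result)  # ['alma', 'Alma', 'Banán', 'banán']
```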

Hungarian Alphabetical Sort

Sorting Hungarian correctly is harder than a plain string compare because the Hungarian alphabet treats eight digraphs and one trigraph as single letters. This tool tokenises each word into Hungarian letters and orders the words by the alphabet positions of those letters.

How it works

The Hungarian alphabet contains these multi-character letters: cs, dz, dzs, gy, ly, ny, sz, ty, zs. Each one occupies its own slot in the alphabet, sorting right after its base letter (so cs comes after c, sz after s, and the trigraph dzs after dz).

The algorithm builds a rank table for every Hungarian letter, including accented vowels (a < á, e < é, i < í, o < ó < ö < ő, u < ú < ü < ű). It then tokenises each word greedily, preferring the longest match first (so dzs beats dz, which beats d). Each token is mapped to its rank, producing a key array for the word. Comparing two words means comparing their key arrays element by element, with a word that is a prefix of another sorting first, in the same letter-by-letter way a Hungarian dictionary orders entries.
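
The steps above can be sketched in Python roughly as follows. This is a minimal illustration under stated assumptions, not the tool's implementation: ALPHABET, RANK, MULTI, sort_key and hungarian_sort are hypothetical names, q, w, x and y are assumed to have slots in the table, and characters outside the alphabet are assumed to sort after all letters.

```python
ALPHABET = [
    "a", "á", "b", "c", "cs", "d", "dz", "dzs", "e", "é", "f", "g", "gy",
    "h", "i", "í", "j", "k", "l", "ly", "m", "n", "ny", "o", "ó", "ö", "ő",
    "p", "q", "r", "s", "sz", "t", "ty", "u", "ú", "ü", "ű", "v", "w", "x",
    "y", "z", "zs",
]
RANK = {letter: index for index, letter in enumerate(ALPHABET)}

# Multi-character letters, longest first, so dzs is tried before dz.
MULTI = sorted((l for l in ALPHABET if len(l) > 1), key=len, reverse=True)

def sort_key(word: str) -> list[int]:
    """Tokenise greedily and map each Hungarian letter to its rank."""
    text = word.casefold()  # case-insensitive comparison
    key = []
    i = 0
    while i < len(text):
        for letter in MULTI:
            if text.startswith(letter, i):
                break
        else:
            letter = text[i]  # no digraph or trigraph starts here
        # Assumption: unknown characters rank after every letter,
        # ordered among themselves by code point.
        key.append(RANK.get(letter, len(ALPHABET) + ord(letter[0])))
        i += len(letter)
    return key

def hungarian_sort(words: list[str]) -> list[str]:
    # Lists of ints compare element by element, and sorted() is stable.
    return sorted(words, key=sort_key)

print(hungarian_sort(["cukor", "csak", "alma", "ánizs", "szép", "sajt"]))
# ['alma', 'ánizs', 'cukor', 'csak', 'sajt', 'szép']
```

Using a list of integer ranks as the key lets the language's built-in list comparison do the element-by-element work, including placing a word before any longer word that starts with it.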

Why plain sorting fails

A naive Unicode sort would put csak before cukor because the two words first differ at the second character, where s (code point 115) is below u (117). But in Hungarian, cs is a single letter that comes after c, so the correct order is cukor, then csak. This is the classic trap that this tool fixes.
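
The failure is easy to reproduce. The following Python snippet, given only as an illustration, sorts by raw code points the way a plain string compare does:

```python
# Code points of the second characters that decide the comparison.
print(ord("s"), ord("u"))  # 115 117

# Plain string sorting compares code points, so csak lands first.
naive = sorted(["cukor", "csak"])
print(naive)  # ['csak', 'cukor']

# Accented letters fare no better: á (225) sorts after z (122).
print(sorted(["zebra", "ánizs"]))  # ['zebra', 'ánizs']
```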

Example

Input:

cukor
csak
alma
ánizs
szar
sajt

Correct Hungarian order:

alma
ánizs
cukor
csak
sajt
szar

Note that csak follows cukor (cs after c) and szar follows sajt (sz after s). All sorting runs locally in your browser.