Unicode NFKD Decomposer

Compatibility + decomposition — exposes every combining element

Ad placeholder (leaderboard)

NFKD (Normalization Form KD) is the most thorough Unicode normalisation: it folds compatibility variants like ligatures and full-width letters and then fully decomposes precomposed characters into base letters plus combining marks. The result exposes every individual element, which is ideal for analysis, matching, and accent stripping. This tool applies NFKD and lists the code points before and after.

How it works

NFKD is a compatibility decomposition with no recomposition step:

fi  ->  f i
A  ->  A
é   ->  e + ◌́   (U+0065 U+0301)
²   ->  2

Combining marks are placed in canonical order so equivalent inputs always produce the same output. Because compatibility variants are folded and precomposed characters are split, the code point count usually grows. The tool uses the engine’s native String.prototype.normalize("NFKD").

Notes and example

NFKD pairs well with accent stripping: after decomposition, drop everything in the combining diacritical marks block (U+0300 to U+036F) to turn café into cafe and fiancé into fiance. Like NFKC the folding is lossy, so use NFKD for comparison and search rather than for storing display text. For the same folding with marks recombined, use the NFKC normaliser.

Ad placeholder (rectangle)