Turkish Diacritic Remover

Removes ç, ğ, ı, ö, ş, ü for ASCII with dotless-i awareness

Strip Turkish-specific characters to plain ASCII with correct mappings: ç to c, ğ to g, ı to i, İ to I, ö to o, ş to s, ü to u. Avoids the default that leaves dotless ı unmapped.

Why does ı map to i and not stay unchanged?

The dotless ı is a distinct Turkish letter. Its natural ASCII fold is i. Many generic tools leave it untouched or get it wrong because Unicode normalization does not treat ı as a decorated i, so we map it explicitly.

Fold Turkish text to clean ASCII

Removing diacritics from Turkish is harder than it looks. Standard Unicode normalization (NFD) decomposes letters like é into e plus an accent, but Turkish letters such as ı, ğ, and ş are not decorated base letters — they are distinct code points. As a result, naive ASCII-folding code silently leaves them in place. This tool uses an explicit Turkish-aware map so every special character is converted correctly.

How it works

Each character is looked up in a fixed mapping table and replaced if it matches:

ç -> c    Ç -> C
ğ -> g    Ğ -> G
ı -> i    İ -> I
ö -> o    Ö -> O
ş -> s    Ş -> S
ü -> u    Ü -> U

Everything else — ordinary Latin letters, digits, punctuation, spaces — passes through unchanged. The two trickiest cases are the dotless ı, which folds to i, and the dotted capital İ, which folds to I.

Example

input:  Çağrı İstanbul'da güneşli bir öğle yaşıyor
output: Cagri Istanbul'da gunesli bir ogle yasiyor

Tips and notes

  • Use this before generating URL slugs, filenames, or legacy-system identifiers that must be ASCII-only.
  • Because the mapping is explicit, it does not depend on locale or normalization quirks across browsers.
  • All conversion happens locally in your browser, so the text stays private.