Romanian Diacritic Remover

Removes ă, â, î, ș, ț — handles ș/ț comma vs cedilla traps

Strip Romanian diacritics to plain ASCII while correctly normalising both the proper comma-below letters ș and ț and the legacy cedilla variants ş and ţ, preserving letter case.

Which letters does it strip?

It maps ă to a, â to a, î to i, ș to s and ț to t, in both upper and lower case, leaving every other character untouched.

This tool converts Romanian text to plain ASCII by stripping the five diacritic letters — ă, â, î, ș, ț — to their base Latin forms. It is built to handle the most common Romanian Unicode pitfall: the difference between the correct comma-below letters and the legacy cedilla look-alikes.

How it works

Each character is looked up in a fixed table and replaced if it is a Romanian diacritic, otherwise left as-is. Case is preserved.

ă â → a      î → i      ș → s      ț → t
Ă Â → A      Î → I      Ș → S      Ț → T

The table also includes the legacy cedilla characters so messy input is cleaned regardless of source:

ş (U+015F, s-cedilla) → s      ţ (U+0163, t-cedilla) → t

Proper Romanian uses ș (U+0219) and ț (U+021B) with a comma below, not a cedilla. When the tool sees a cedilla form it still removes it correctly, and it tells you how many it found so you can fix the source encoding.

Tips

Use the ASCII output for URL slugs, file names, email-local parts, or any legacy system that cannot store the full Romanian character set. If you need to keep the diacritics but fix wrong cedilla forms, look for the legacy-character count in the result — a non-zero value means the original text mixed comma and cedilla glyphs.