This tool converts Romanian text to plain ASCII by stripping the five diacritic letters — ă, â, î, ș, ț — to their base Latin forms. It is built to handle the most common Romanian Unicode pitfall: the difference between the correct comma-below letters and the legacy cedilla look-alikes.
How it works
Each character is looked up in a fixed table and replaced if it is a Romanian diacritic, otherwise left as-is. Case is preserved.
ă â → a î → i ș → s ț → t
Ă Â → A Î → I Ș → S Ț → T
The table also includes the legacy cedilla characters so messy input is cleaned regardless of source:
ş (U+015F, s-cedilla) → s ţ (U+0163, t-cedilla) → t
Proper Romanian uses ș (U+0219) and ț (U+021B) with a comma below, not a cedilla. When the tool sees a cedilla form it still removes it correctly, and it tells you how many it found so you can fix the source encoding.
Tips
Use the ASCII output for URL slugs, file names, email-local parts, or any legacy system that cannot store the full Romanian character set. If you need to keep the diacritics but fix wrong cedilla forms, look for the legacy-character count in the result — a non-zero value means the original text mixed comma and cedilla glyphs.