Which letters does it strip?

It maps ă to a, â to a, î to i, ș to s and ț to t, in both upper and lower case, leaving every other character untouched.

What is the ș/ş and ț/ţ difference?

Correct Romanian uses comma-below letters: ș (U+0219) and ț (U+021B). Older fonts and encodings used the visually similar cedilla letters ş (U+015F) and ţ (U+0163). The tool normalises both to s and t.

Does it report legacy cedilla characters?

Yes. If your input contains cedilla forms ş or ţ, the tool counts them and notes that proper Romanian text should use the comma forms ș and ț instead.

Is letter case preserved?

Yes. Uppercase Ă, Â, Î, Ș, Ț become uppercase A, A, I, S, T, and lowercase forms stay lowercase.

Does my text get uploaded?

No. The replacement runs entirely in your browser using a fixed character table, so nothing is sent to a server.

Romanian Diacritic Remover

Email me this result

Get this tool's output sent to your inbox, plus one useful tool a week. No spam, unsubscribe any time.

This tool converts Romanian text to plain ASCII by stripping the five diacritic letters — ă, â, î, ș, ț — to their base Latin forms. It is built to handle the most common Romanian Unicode pitfall: the difference between the correct comma-below letters and the legacy cedilla look-alikes.

How it works

Each character is looked up in a fixed table and replaced if it is a Romanian diacritic, otherwise left as-is. Case is preserved.

ă â → a      î → i      ș → s      ț → t
Ă Â → A      Î → I      Ș → S      Ț → T

The table also includes the legacy cedilla characters so messy input is cleaned regardless of source:

ş (U+015F, s-cedilla) → s      ţ (U+0163, t-cedilla) → t

Proper Romanian uses ș (U+0219) and ț (U+021B) with a comma below, not a cedilla. When the tool sees a cedilla form it still removes it correctly, and it tells you how many it found so you can fix the source encoding.

Tips

Use the ASCII output for URL slugs, file names, email-local parts, or any legacy system that cannot store the full Romanian character set. If you need to keep the diacritics but fix wrong cedilla forms, look for the legacy-character count in the result — a non-zero value means the original text mixed comma and cedilla glyphs.