Clean, unvowelled Arabic in one step
Vocalised Arabic packs short vowels and other marks above and below the letters. That is essential for teaching, recitation, and disambiguation, but it gets in the way when you need plain text for search, matching, or a tidy layout. This tool deletes every diacritic and hands back the bare consonantal spelling.
How it works
Arabic diacritics live in well-defined Unicode ranges, separate from the base letters. The tool runs your text through a character filter that matches those combining marks and removes them:
U+064B..U+065F fatha, damma, kasra, tanwin, shadda, sukun, …
U+0610..U+061A Quranic annotation signs
U+0670 superscript (dagger) alef
U+06D6..U+06ED Quranic small high/low marks
Because only these combining code points are targeted, the base letters and their joining behaviour are preserved exactly. A separate toggle removes the tatweel character ـ, which is a justification glyph rather than a vowel mark.
Tips and notes
If you are normalising text for fuzzy matching or de-duplication, stripping harakat is usually the right first step, and you may also want to remove tatweel so that stretched and unstretched spellings collapse to the same form. The removed-marks counter tells you at a glance how heavily vocalised the source was. For the reverse workflow — checking whether a fully vowelled document has any gaps — use the Harakat Completeness Checker.