The Malay Character Counter counts characters in Bahasa Melayu and is aware that Malay can be written in two scripts: Rumi (the everyday Latin alphabet) and Jawi (an Arabic-based script). It detects which script your text uses and adjusts the counts it shows accordingly.
How it works
Every character is examined by its Unicode code point. Jawi uses the Arabic script blocks — the main Arabic block plus the Arabic Supplement and Arabic Extended-A ranges that hold Malay-specific letters such as ڠ (nga), ڤ (pa), چ (ca), and ݢ (ga). When most characters in your text fall into these ranges, the tool labels the text Jawi and reports the Jawi character total. Otherwise it labels the text Rumi and reports the Latin letter count.
Total characters and characters-without-spaces are counted by spreading the string into code points, which keeps surrogate pairs (and any emoji) counted as one unit. Words are counted by whitespace, since modern Jawi still separates words with spaces.
Example
Selamat datang is recognised as Rumi with 14 characters. The Jawi equivalent سلامت داتڠ is recognised as Jawi, and the tool reports its Arabic-script character total instead of a Latin letter count.
Notes
- Mixed-script text is classified by whichever script dominates.
- The text area uses automatic direction, so Jawi displays right-to-left as expected.