When Urdu has to appear in a system that cannot render Nastaliq — a citation, an SMS gateway, or a Latin-only database — it is romanized into Latin script. Doing that consistently requires a standard scheme rather than guesswork. This tool applies the National Language Authority romanization.
How it works
Each Urdu character is looked up in a mapping table built from the NLA scheme. Consonants, long vowels, and the vowel signs (harakat) each have a Latin equivalent, with diacritics preserving the distinctions Latin would otherwise lose:
ٹ → ṭ ڈ → ḍ ڑ → ṛ ں → ñ
غ → gh خ → kh ش → sh چ → ch
Aspirated consonants are handled by a look-ahead: whenever a consonant is
followed by the do-chashmi he (ھ), the pair is emitted as a digraph such as bh
or kh. Zero-width non-joiners are dropped, and unknown characters pass through
unchanged.
Example and notes
پاکستان زندہ باد romanizes to a readable Latin string using p for پ, k for ک,
and so on. Bear in mind that Urdu orthography does not always write short vowels,
so the output reflects what is spelled rather than every spoken vowel — this is
inherent to romanizing an Arabic-derived script and is true of every scheme.
Treat the result as a faithful reading aid, not a phonetic transcription.