Japanese Mixed-Script Ratio Analyzer

Show the hiragana / katakana / kanji / romaji ratio in Japanese text

Calculates the percentage breakdown of hiragana, katakana, kanji, and romaji in a Japanese passage so you can gauge reading difficulty and audience. Runs entirely in your browser.

Which scripts does it count?

It counts hiragana (U+3040–U+309F), katakana (U+30A0–U+30FF and half-width forms), kanji (the CJK Unified Ideographs block), and romaji (basic Latin letters). Punctuation, digits, and whitespace are excluded from the percentages so they reflect only script characters.

The Japanese Mixed-Script Ratio Analyzer breaks a passage into its four writing systems and reports the percentage of each. Because Japanese blends scripts, the mix is a fast proxy for reading difficulty and intended audience.

How it works

Every character is classified by its Unicode block, and non-script characters are excluded from the totals:

hiragana  U+3040–U+309F        grammar, particles, easy words
katakana  U+30A0–U+30FF + ハンカク loanwords, onomatopoeia, emphasis
kanji     U+4E00–U+9FFF (CJK)  content words, highest learning load
romaji    A–Z a–z              acronyms, brand names, foreign terms

Percentages are computed over the script-character total only, so spaces, punctuation, and digits do not dilute the figures. The kanji share, in particular, is the headline indicator of difficulty.

Tips and notes

A passage that is mostly hiragana with little kanji usually targets children or beginning learners, often paired with furigana. A high kanji ratio points to newspapers, official documents, or literary adult prose. A notable katakana share suggests technical, scientific, or marketing material full of loanwords. Use the breakdown together: two texts with the same kanji percentage can still differ in tone if one leans on katakana loanwords and the other on native hiragana grammar.