The world’s languages, grouped by ancestry
This reference organises the world’s languages into their major language families — groups descended from a shared ancestral language. For each family it shows representative member languages, an approximate first-language speaker total, and the region where the family is concentrated. It is a quick way to see how the roughly 7,000 living languages cluster into a few dozen related groups.
How it works
Linguists assign languages to families using the comparative method: they look for regular, systematic sound correspondences and shared inherited grammar across languages, which reveal descent from a common parent rather than mere contact or borrowing. The result is a family tree — Proto-Indo-European, for example, split into branches that became the Germanic, Romance, Slavic, Indo-Iranian and other languages. A handful of languages, like Basque, resist any grouping and are called isolates. Speaker totals here are rounded L1 estimates, so they show relative scale rather than precise headcounts.
Tips and notes
- Search a specific language such as
TamilorHausato find which family it belongs to. - The same family can span continents — Austronesian reaches from Madagascar to the Pacific, and Indo-European covers Europe, Iran, and South Asia.
- Numbers are first-language speakers only; widely learned second languages like English have far larger total speaker counts than the L1 figure shown.