OBDILCI
WWW STATE OF MULTILINGUALISM

By combining the three main sources of data OBDILCI has been working on, we can now offer figures for the state of the art of web multilingualism. The figures were updated in light of the latest studies in April 2026.
| | WHOLE WEB (3) | ONE MILLION MOST VISITED SITES (TRANCO) (2) | COMMENT |
| --- | --- | --- | --- |
| Rate of multilingualism | 1.6 | 3 | Number of web linguistic versions divided by number of web sites |
| Percentage of multilingual web sites | 15% | 33.7% | Huge variance depending on country, language or type of web site. |
| Average number of languages per multilingual web site | 5 | 7 | |
Note 1: Figures in bold are measured; figures in italic are estimated or computed from an estimated figure.
Note 2: The rate of web multilingualism should be compared with the equivalent figure for humanity, computed as the number of first- and second-language speakers divided by the number of first-language speakers, which is 1.44 according to Ethnologue’s data for 2024. The World Wide Web is therefore much more multilingual than humanity…
Note 3: It is entirely consistent that the one million most visited web sites are 2.3 times more multilingual than the worldwide average.
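The three rows of the table above can be cross-checked against one another. As a minimal sketch, assuming that every monolingual site counts as exactly one linguistic version (an assumption not stated in the source), the rate of multilingualism is the weighted average of one version for monolingual sites and the average number of languages for multilingual sites. The function name `multilingualism_rate` is hypothetical.

```python
def multilingualism_rate(share_multilingual: float, avg_languages: float) -> float:
    """Linguistic versions per web site, derived from the share of multilingual sites.

    Assumption (not stated in the source): each monolingual site contributes
    exactly one linguistic version.
    """
    monolingual_part = (1.0 - share_multilingual) * 1.0
    multilingual_part = share_multilingual * avg_languages
    return monolingual_part + multilingual_part


# Inputs taken from the table: share of multilingual sites, average languages per such site.
whole_web = multilingualism_rate(0.15, 5)
tranco = multilingualism_rate(0.337, 7)

print(f"Whole web: {whole_web:.2f}")  # the table reports 1.6
print(f"Tranco:    {tranco:.2f}")     # the table reports 3
```

Under that assumption the computed values (about 1.60 and 3.02) agree with the 1.6 and 3 reported in the table.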
| PERCENTAGE OF WEBPAGES PER LANGUAGE | WHOLE WEB (1) | TRANCO (3) |
| --- | --- | --- |
| English | 20.08% | 21.77% |
| Chinese | 19.04% | 2.77% |
| Spanish | 7.70% | 6.36% |
| Hindi | 3.77% | 0.75% |
| Russian | 3.75% | 3.86% |
| Arabic | 3.65% | 1.60% |
| French | 3.42% | 6.38% |
| Portuguese | 3.16% | 3.86% |
| Japanese | 2.23% | 2.93% |
| German | 2.15% | 6.93% |
| Indonesian | 2.02% | 1.79% |
| Bengali | 1.48% | 0.33% |
| Turkish | 1.14% | 1.76% |
| Italian | 0.99% | 4.13% |
| Vietnamese | 0.95% | 1.09% |
| Korean | 0.87% | 1.62% |
| Persian | 0.86% | 0.28% |
| Urdu | 0.83% | 0.20% |
| Tagalog | 0.76% | 0.22% |
| Thai | 0.67% | 1.03% |
| Marathi | 0.58% | 0.14% |
| Telugu | 0.57% | 0.12% |
| Polish | 0.56% | 2.57% |
The large differences between the two columns are explained by the fact that the most visited web sites are strongly biased in favor of the dominant European languages, both by design (a natural bias from traffic) and by construction (a geographic bias in the ranking sources, Majestic, QuantCast and Cisco Umbrella, which leads to a bias toward European languages).
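The size of this bias can be read off the table by dividing each language's Tranco share by its whole-web share. The sketch below does this for a few rows copied from the table. The ratio is only an illustrative indicator chosen here, not a measure defined by OBDILCI, and the variable names are hypothetical.

```python
# (whole-web share in %, Tranco share in %) for a few rows copied from the table.
shares = {
    "Chinese": (19.04, 2.77),
    "Hindi": (3.77, 0.75),
    "French": (3.42, 6.38),
    "German": (2.15, 6.93),
    "Italian": (0.99, 4.13),
    "Polish": (0.56, 2.57),
}

# Ratio > 1: over-represented among the most visited sites; ratio < 1: under-represented.
ratios = {lang: tranco / whole for lang, (whole, tranco) in shares.items()}

# Print from most over-represented to most under-represented.
for lang, ratio in sorted(ratios.items(), key=lambda item: item[1], reverse=True):
    print(f"{lang:<8} {ratio:5.2f}")
```

For these rows, German, Italian and Polish come out more than three times over-represented in Tranco, while Chinese and Hindi fall to roughly a fifth of their whole-web share or less.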
For all other existing sources of comparable figures, see this document published for UNESCO LT4ALL2025.


