לתוכן העניינים

Characters in URLs

של carbsrule, 2 באוקטובר 2016

הודעות: 4

שפה: English

carbsrule (הצגת פרופיל) 2 באוקטובר 2016, 12:41:31

When using Esperanto words in (SEO-friendly) URLs, what's recommended for the characters with circumflexes (i.e. ĉ, ĝ, ĥ, ĵ, ŝ, ŭ)? I can see 4 possibilities:

1) Use the characters as they are (may not work for all browsers etc. although I haven't looked too deeply into this yet)
2) Convert using the x-system
3) Convert using the h-system
4) Just remove the circumflex; e.g. convert ĉ to c

Roch (הצגת פרופיל) 2 באוקטובר 2016, 19:24:05

nornen (הצגת פרופיל) 3 באוקטובר 2016, 02:35:22

Anything newer than Mozilla 1.4, Netscape 7.1, Opera 7.11 should be OK with unicode URLs. source

The "percent-encoding" actually doesn't change a thing, because it is just an encoding, a way of representing the characters. This encoding generally happens on-the-fly, invisible to the user. For instance, when I hover my mouse over your second link, the tooltip says "https://eo.wikipedia.org/wiki/Ĉeĥio".

See RFC 3986 for more details.

To answer OP's question: I would suggest to use the actual characters.

carbsrule (הצגת פרופיל) 3 באוקטובר 2016, 10:20:12

Thanks!

Yeah I played around and came to the same conclusion - if I use percent encoding (via the urlencode method in PHP), it should be fine. I'll still test all the browsers anyway. I'll bet IE will find some way of giving me trouble.

לראש הדף