K vsebini

Characters in URLs

od carbsrule, 02. oktober 2016

Sporočila: 4

Jezik: English

carbsrule (Prikaži profil) 02. oktober 2016 12:41:31

When using Esperanto words in (SEO-friendly) URLs, what's recommended for the characters with circumflexes (i.e. ĉ, ĝ, ĥ, ĵ, ŝ, ŭ)? I can see 4 possibilities:

1) Use the characters as they are (may not work for all browsers etc. although I haven't looked too deeply into this yet)
2) Convert using the x-system
3) Convert using the h-system
4) Just remove the circumflex; e.g. convert ĉ to c

Roch (Prikaži profil) 02. oktober 2016 19:24:05

nornen (Prikaži profil) 03. oktober 2016 02:35:22

Anything newer than Mozilla 1.4, Netscape 7.1, Opera 7.11 should be OK with unicode URLs. source

The "percent-encoding" actually doesn't change a thing, because it is just an encoding, a way of representing the characters. This encoding generally happens on-the-fly, invisible to the user. For instance, when I hover my mouse over your second link, the tooltip says "https://eo.wikipedia.org/wiki/Ĉeĥio".

See RFC 3986 for more details.

To answer OP's question: I would suggest to use the actual characters.

carbsrule (Prikaži profil) 03. oktober 2016 10:20:12

Thanks!

Yeah I played around and came to the same conclusion - if I use percent encoding (via the urlencode method in PHP), it should be fine. I'll still test all the browsers anyway. I'll bet IE will find some way of giving me trouble.

Nazaj na začetek