Till sidans innehåll

Characters in URLs

av carbsrule, 2 oktober 2016

Meddelanden: 4

Språk: English

carbsrule (Visa profilen) 2 oktober 2016 12:41:31

When using Esperanto words in (SEO-friendly) URLs, what's recommended for the characters with circumflexes (i.e. ĉ, ĝ, ĥ, ĵ, ŝ, ŭ)? I can see 4 possibilities:

1) Use the characters as they are (may not work for all browsers etc. although I haven't looked too deeply into this yet)
2) Convert using the x-system
3) Convert using the h-system
4) Just remove the circumflex; e.g. convert ĉ to c

Roch (Visa profilen) 2 oktober 2016 19:24:05

nornen (Visa profilen) 3 oktober 2016 02:35:22

Anything newer than Mozilla 1.4, Netscape 7.1, Opera 7.11 should be OK with unicode URLs. source

The "percent-encoding" actually doesn't change a thing, because it is just an encoding, a way of representing the characters. This encoding generally happens on-the-fly, invisible to the user. For instance, when I hover my mouse over your second link, the tooltip says "https://eo.wikipedia.org/wiki/Ĉeĥio".

See RFC 3986 for more details.

To answer OP's question: I would suggest to use the actual characters.

carbsrule (Visa profilen) 3 oktober 2016 10:20:12

Thanks!

Yeah I played around and came to the same conclusion - if I use percent encoding (via the urlencode method in PHP), it should be fine. I'll still test all the browsers anyway. I'll bet IE will find some way of giving me trouble.

Tillbaka till toppen