前往目錄

Characters in URLs

貼文者: carbsrule, 2016年10月2日

訊息: 4

語言: English

carbsrule (顯示個人資料) 2016年10月2日下午12:41:31

When using Esperanto words in (SEO-friendly) URLs, what's recommended for the characters with circumflexes (i.e. ĉ, ĝ, ĥ, ĵ, ŝ, ŭ)? I can see 4 possibilities:

1) Use the characters as they are (may not work for all browsers etc. although I haven't looked too deeply into this yet)
2) Convert using the x-system
3) Convert using the h-system
4) Just remove the circumflex; e.g. convert ĉ to c

Roch (顯示個人資料) 2016年10月2日下午7:24:05

nornen (顯示個人資料) 2016年10月3日上午2:35:22

Anything newer than Mozilla 1.4, Netscape 7.1, Opera 7.11 should be OK with unicode URLs. source

The "percent-encoding" actually doesn't change a thing, because it is just an encoding, a way of representing the characters. This encoding generally happens on-the-fly, invisible to the user. For instance, when I hover my mouse over your second link, the tooltip says "https://eo.wikipedia.org/wiki/Ĉeĥio".

See RFC 3986 for more details.

To answer OP's question: I would suggest to use the actual characters.

carbsrule (顯示個人資料) 2016年10月3日上午10:20:12

Thanks!

Yeah I played around and came to the same conclusion - if I use percent encoding (via the urlencode method in PHP), it should be fine. I'll still test all the browsers anyway. I'll bet IE will find some way of giving me trouble.

回到上端