url_alias not supporting UTF8 .. or what ?!

Author Message

Selmah Maxim

Wednesday 18 June 2003 5:16:27 am

Hi ..

my site have 4 language (English,Hungarian,Hebrew,Arabic) , the url_alias work fine with iso type fonts, but I get parent path for the link if I switch to hebrew, or arabic, or hungarian !

Is this my fault or url_alias not supporting not supporting it yet ?

Bård Farstad

Wednesday 18 June 2003 5:29:06 am

The url alias does not support other characters than 0-9 a-z and _. We need extend this to support unicode type url's.

BTW: do you have a link to the site. It would be interesting to see that kind of internationalized site. Also: are you using MySQL 4.1 or PostgreSQL?

--bård

Documentation: http://ez.no/doc

Selmah Maxim

Wednesday 18 June 2003 5:35:57 am

I see ..

Then just the english site will use url_alias, else node_id !

Not yet, still developing it in localhost.

thx.

Bård Farstad

Wednesday 18 June 2003 5:50:14 am

What we need to do is make a solution for this that works. We could of course just encode the characters like the norwegian characters æøå can be used in an URL by encoding it as %E6%F8%E5, but I think that this looks very dirty and is not very user friendly - which was the Idèa with the url_alias.

The second solution which we can implement is a translation table for every character set which will convert o->oe, æ->ae and å->aa. But this will only work for character based languages.

For chinese and japanese I do not have a good suggestion.

A fallback solution would be to use the node id if the name is empty. E.g. /home/news/120/420/.

Any input?

--bård

Documentation: http://ez.no/doc

Selmah Maxim

Wednesday 18 June 2003 6:13:00 am

I agree with you .. will be to long and ugly link :)

But .. I have some sites which use friendly url, and it`s can grap the UTF8 fonts normal, but the i don`t use utf-8 charset, am using windows-1256, and it`s work fine, and geting the artical from DB by the artical name which is in utf8 , but will be little bit difficult to write the link manually for hebrew and arabic language, beciz is starting from right to left :) !

btw .. the site will use Mysql.

Tamas Takacs

Tuesday 25 November 2003 12:08:07 pm

Hi,

We are about to set up a Hungarian language site, where I'd like to use a translation table like solution for converting accented vowels:

á -> a
ó,ö,ő -> o
ü,ű -> u
í -> i

Since the existing url_translator simply leaves these above mentioned letters out from the string, the practical result is something "very difficult to understand" urls.

I understand, that implementing of this charactertable conversion module is not top priority for eZ, so I am happy to do it (or at least try :), also I am not very familiar with the source yet, so I'd be glad for some assistance of locating the script responsible for URL translations.

Tamas

Mikhail Chekanov

Monday 29 December 2003 9:28:01 am

Does anybody know how to solve this problem?

Is there any suggestion how editors can assign url_alias manually for each object?
For exapmple:
Directory "Economy and business" has url_alias "business" only and an article with title "McDonald's, Wendy's shares beef up" has url_alias "sharesbeefup".

Lazy editors can simply leave this field blank to generate it automatically.

And, finally, I don't think that translation table ф,Ф->F should be good enough for non-latin languages (russian, chinese, bulgarian, etc) because such translation kills the sense constituent of "url_alias"

BTW: Nice urls are important not only for users, but can bring some additional pageranking for search engines.

--
mike
#6595551

liu spider

Tuesday 30 December 2003 2:17:34 pm

I agree with eoNormalMutation, as for non-latin based Language, manually alias_url is the best choice (in fact by fat the only solution I can come up with)

I did patch eZ 3.1 to support this, although with some limitations( only main location's url can be assigned).

Unfortunately, I lost all the source code :(

Hope this would be a new feature in eZ 3.4 series.

http://liucougar.scim-im.org
SCIM Input Method Platform
http://scim.sf.net
SJSD Online Editor
http://sf.net/projects/sjsd

Powered by eZ Publish™ CMS Open Source Web Content Management. Copyright © 1999-2014 eZ Systems AS (except where otherwise noted). All rights reserved.

eZ debug

Timing: Jan 18 2025 15:09:52
Script start
Timing: Jan 18 2025 15:09:52
Module start 'layout'
Timing: Jan 18 2025 15:09:52
Module start 'content'
Timing: Jan 18 2025 15:09:52
Module end 'content'
Timing: Jan 18 2025 15:09:52
Script end

Main resources:

Total runtime0.0171 sec
Peak memory usage2,048.0000 KB
Database Queries3

Timing points:

CheckpointStart (sec)Duration (sec)Memory at start (KB)Memory used (KB)
Script start 0.00000.0080 588.0156152.6250
Module start 'layout' 0.00800.0023 740.640639.4453
Module start 'content' 0.01030.0047 780.085997.3672
Module end 'content' 0.01500.0020 877.453142.3047
Script end 0.0170  919.7578 

Time accumulators:

 Accumulator Duration (sec) Duration (%) Count Average (sec)
Ini load
Load cache0.002313.2383140.0002
Check MTime0.00105.7988140.0001
Mysql Total
Database connection0.00073.940410.0007
Mysqli_queries0.002112.503830.0007
Looping result0.00000.064210.0000
Template Total0.001710.010.0017
Template load0.00095.258510.0009
Template processing0.00084.702710.0008
Override
Cache load0.00063.677810.0006
General
dbfile0.00021.327980.0000
String conversion0.00000.037740.0000
Note: percentages do not add up to 100% because some accumulators overlap

Templates used to render the page:

UsageRequested templateTemplateTemplate loadedEditOverride
1print_pagelayout.tpl<No override>extension/community/design/community/templates/print_pagelayout.tplEdit templateOverride template
 Number of times templates used: 1
 Number of unique templates used: 1

Time used to render debug report: 0.0001 secs