List Info

Thread: Re: About Language.




Re: About Language.
country flaguser name
Greece
2007-06-01 01:39:41
On Thu, May 31, at 01:25 Matthias Feichtinger wrote:
> 
> Thinking about UTF-16 could be fine too, couldn't it?

You can do it even today if you like it, glibc supports it
with
converted functions, although Ulrich Drepper stated that
native support 
in glibc ( and for pango/gtk+ application(s) I would add)
will never 
happen for UTF-16.

But, is there any reason why this switch will ever happen?
because I
am under the impression that UTF-16 carries many of the
drawbacks of UTF-8
and UTF-32 and few of their advantages.

But also is true that for text intensive applications like
database
software with high memory load and single code unit access
to characters [1],
UTF-16 is much more sufficient than UTF-8 and still uses
less than 50% of 
space of UTF-32.

Right now windows uses it (since NT? if am not mistaken) and
Java uses it
and some like Python and Mozilla ECMASscript uses it
internally, although
Python can be compiled also for UTF-32* (there is a switch
while you are
configuring it).

* UTF-32 seems like a natural choice for the future and
eventually will
  become the default mode (compatibility is no issue in that
case),
  but today it's seems just wasteful.

For the moment and for quite sometime UTF-8 will be the
standard in
Linux, and UTF-16 will stay only for compatibility with
those which already use
it.

For a lot more detailed and more accurate information than
my amateur approach,
there are archives also available for download in:
http://mail.nl.l
inux.org/linux-utf8/

where it's quite fascinated to watch all the evolution to
UTF-8 (how various
'well known' application get ported to UTF-8 from plain
ascii); definitely 
worths a reading, but you have to clean the archive from the
spam. 
I had to filter through formail to catch most of it (about
500 spam
messages in a total of 7200 emails).

1. http://w
ww.unicode.org/unicode/reports/tr17/
-- 
http://linuxfromscratch.org/mailman/listinfo/alfs-discu
ss
FAQ: http://www.linux
fromscratch.org/faq/
Unsubscribe: See the above information page

Re: About Language.
country flaguser name
Greece
2007-06-01 02:43:29
Jesus, how I will ever learn to double check my message. 

On Fri, Jun 01, at 09:39 Ag. D. Hatzimanikas wrote:
> UTF-16 is much more sufficient than UTF-8 and still
uses less than 50% of 
                  /su/e/      		
-- 
http://linuxfromscratch.org/mailman/listinfo/alfs-discu
ss
FAQ: http://www.linux
fromscratch.org/faq/
Unsubscribe: See the above information page

[1-2]

about | contact  Other archives ( Real Estate discussion Medical topics )