List Info

Thread: Re: the utf8 flag




Re: the utf8 flag
user name
2007-03-31 05:10:20
Marc Lehmann schreef:

> Juerd:

> Perl only supported bytes 0..255 in earlier versiosn,
and
> now the perl byte can be up to 64 bits (or maybe a bit
less, I
forgot).

Aren't you mixing up bytes and words here?

A byte is a unit of memory addressing. One or more bytes
make a word. A
word is a unit of transporting memory data.


> In C, a single byte is a character, even if it happens
to have a value
> higher than 255 (although very few compilers allow
that, usually, a
> byte is an octet, although it is common on DSPs to have
32 bit bytes).

C chars don't have to be machine bytes. The DSPs you mention
don't have
to have 32 bit machine bytes, to be used best with 32 bit C
chars.
C has effectively 3 types of char: signed, unsigned and
default.

-- 
Affijn, Ruud

"Gewoon is een tijger."


[1]

about | contact  Other archives ( Real Estate discussion Medical topics )