Originally Posted by MaxOff
|
lol someone thinks he knows something.
Okay, ASCII has 7-bit encoding, so it has 128 different characters.
Unicode has 64-bit encoding and has over 100,000 characters (from Wikipedia), and still has a lot of room for more.
|
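ASCII is a 7-bit code, but each character is stored in 1 byte.
Unicode characters are 2 bytes in UTF-16, the encoding Windows calls "Unicode" (characters outside the Basic Multilingual Plane take two 2-byte units).
1 byte = 8 bits.
The maximum unsigned value with K bits is 2^K-1.
For 8 bits that is 255, so 256 possible values (0 to 255).
The signed value range is -2^(K-1) to 2^(K-1)-1.
A signed byte can still represent 256 values, but if you only take the non-negative ones the maximum is 127, which gives the 128 ASCII characters.
A 16-bit Unicode code unit has a maximum value of 2^16-1 = 65535.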
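If you want to check those ranges yourself, here is a quick sketch in Python. The helper names unsigned_range and signed_range are just ones I made up for illustration, and the signed version assumes two's complement.

```python
def unsigned_range(bits):
    """Return (min, max) for an unsigned integer that is `bits` wide."""
    return 0, 2**bits - 1


def signed_range(bits):
    """Return (min, max) for a two's-complement signed integer that is `bits` wide."""
    return -(2**(bits - 1)), 2**(bits - 1) - 1


if __name__ == "__main__":
    # 7 bits = ASCII, 8 bits = one byte, 16 bits = one UTF-16 code unit
    for bits in (7, 8, 16):
        lo_u, hi_u = unsigned_range(bits)
        lo_s, hi_s = signed_range(bits)
        print(f"{bits:2d} bits: unsigned {lo_u}..{hi_u} "
              f"({hi_u - lo_u + 1} values), signed {lo_s}..{hi_s}")
```

Note that the count of values is always one more than the maximum, since zero counts too.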
Having taken computer-related courses at university, I would be a little more credible than a Wikipedia article that's telling you a Unicode character is 64 bits (8 bytes).
Originally Posted by MaxOff
|
In fact the game uses UTF-8 (if you don't know it, look it up),
and the in-game messages are built using 2 terminators: a length byte at the beginning and a zero byte at the end. Actually the length byte is ignored, but the message is stored in a much bigger buffer, so it is guaranteed to find its zero a little bit later.
|
UTF-N is a family of encodings that store each Unicode character as one or more N-bit units. In UTF-8 the number of bytes grouped into a character depends on the value range of the initial byte, which is how it covers the different regional scripts. A zero byte only ever encodes the NUL character, so a null-terminated string is never cut short in the middle of a multi-byte character.
UTF-N still does not prevent misprinting, though, or improperly implemented string-manipulation algorithms.
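To make the byte grouping concrete, here is a rough Python sketch of how the initial byte of a UTF-8 sequence tells you how many bytes belong to the character. The function name utf8_seq_len and the sample characters are my own picks for illustration, and it does not validate the continuation bytes, so treat it as a demo and not as a real decoder.

```python
def utf8_seq_len(lead):
    """Return how many bytes a UTF-8 sequence has, judging by its first byte."""
    if lead < 0x80:
        return 1          # 0xxxxxxx: plain ASCII
    if 0xC0 <= lead < 0xE0:
        return 2          # 110xxxxx
    if 0xE0 <= lead < 0xF0:
        return 3          # 1110xxxx
    if 0xF0 <= lead < 0xF8:
        return 4          # 11110xxx
    raise ValueError("continuation byte or invalid lead byte")


# A, e-acute, euro sign, and an emoji outside the Basic Multilingual Plane
samples = ["A", "\u00e9", "\u20ac", "\U0001F600"]

if __name__ == "__main__":
    for ch in samples:
        utf8 = ch.encode("utf-8")
        utf16 = ch.encode("utf-16-le")
        print(f"U+{ord(ch):04X}: UTF-8 {len(utf8)} bytes "
              f"(lead byte says {utf8_seq_len(utf8[0])}), UTF-16 {len(utf16)} bytes")
    # Non-NUL text never contains a zero byte in UTF-8, but it can in UTF-16.
    text = "".join(samples)
    print(b"\x00" in text.encode("utf-8"), b"\x00" in text.encode("utf-16-le"))
```

The last line is the part that matters for zero-terminated buffers: UTF-8 text can be scanned for a terminating zero byte safely, while the same scan on UTF-16 bytes would stop after the first ASCII letter.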
Rather than opening Wikipedia, I suggest you open something like Microsoft's Developer Network website and read the information there. The difference is that one is published by real-world developers, and the other is posted by some individual who may or may not have accredited knowledge on the subject.
Here, I'll even point you to it:
http://msdn.microsoft.com