Search This Blog

Thursday, January 3, 2008

Converting russian characters too URL hexadecimal characters

The binary code should be of the following format

// The position in the Unicode table tells us how many bytes are needed.
// Note that if we talk about first, second, etc. in the following, we are
// counting from left to right:
//
// Position in | Bytes needed | Binary representation
// Unicode table | for UTF-8 | of UTF-8
// ----------------------------------------------------------
// 0 - 127 | 1 byte | 0XXX.XXXX
// 128 - 2047 | 2 bytes | 110X.XXXX 10XX.XXXX
// 2048 - 65535 | 3 bytes | 1110.XXXX 10XX.XXXX 10XX.XXXX
// 65536 - 2097151 | 4 bytes | 1111.0XXX 10XX.XXXX 10XX.XXXX 10XX.XXXX

1 comment:

Anonymous said...

http://lumerkoz.edu I bookmarked this guestbook., http://www.comicspace.com/buy_avapro/ wariness boast http://soundcloud.com/propecias tapir http://barborazychova.com/members/Buy-Nolvadex.aspx compelled http://barborazychova.com/members/Buy-Lipitor.aspx rowan http://www.comicspace.com/celexa/ dire proness