Wrox Programmer Forums
|
BOOK: Beginning C# 3.0 : An Introduction to Object Oriented Programming ISBN: 978-0-470-26129-3
This is the forum to discuss the Wrox book Beginning C# 3.0 : An Introduction to Object Oriented Programming by Jack Purdum; ISBN: 9780470261293
Welcome to the p2p.wrox.com Forums.

You are currently viewing the BOOK: Beginning C# 3.0 : An Introduction to Object Oriented Programming ISBN: 978-0-470-26129-3 section of the Wrox Programmer to Programmer discussions. This is a community of software programmers and website developers including Wrox book authors and readers. New member registration was closed in 2019. New posts were shut off and the site was archived into this static format as of October 1, 2020. If you require technical support for a Wrox book please contact http://hub.wiley.com
 
Old February 18th, 2010, 11:21 PM
Authorized User
 
Join Date: Feb 2010
Posts: 13
Thanks: 0
Thanked 0 Times in 0 Posts
Default chapter 5 : unicode versus ascii

hi ...

ive got a question

page 103
unicode uses two bytes per character and permits over 65,000 individual characters.


what does it mean ??? if i were to display an entire newspaper which contains multiple pages say in japanese will that mean each character takes up 2 bytes? .. wont the application just crash if say there were 4-5million characters?

and also , what is UTF-8 in relation to unicode? is one better than the other
 
Old February 19th, 2010, 01:08 AM
Friend of Wrox
 
Join Date: Sep 2008
Posts: 234
Thanks: 0
Thanked 32 Times in 30 Posts
Default Unicode

Unlike English, many languages are written using a non-ASCII character format (e.g., Kangi) and cannot be represented with the ASCII character set. By expanding the number of bytes from one to two bytes, these special language characters can be represented. Yes, if 8 there are 4 million characters in a newpaper, it would take 8 megabytes to represent it in Unicode. However, with today's gigabyte computers, it's not a problem.

UTF-8 is an ASCII-preserving encoding method that is defined in the Unicode specs. Simply stated, it allows an 8-bit version of Unicode that is consistent with ASCII. If you want more details, just Google it.

If you're going to write web apps or write apps that might be read in countries that don't use ASCII as their native character set, I'd use Unicode. If you're writing code for embedded systems where you're lucky to have 32K of memory, you've got to economize on each byte. The circumstances more or less dictate which you will use.
__________________
Jack Purdum, Ph.D.
Author: Beginning C# 3.0: Introduction to Object Oriented Programming (and 14 other programming texts)





Similar Threads
Thread Thread Starter Forum Replies Last Post
Comma Versus Dot Wenggo Javascript 6 March 1st, 2011 03:37 AM
Converting ASCII Text file to Unicode i_shahid .NET Framework 2.0 2 July 16th, 2009 07:45 AM
unicode and ascii characters sudhanshu631 BOOK: Ivor Horton's Beginning Visual C++ 2008 ISBN: 978-0-470-22590-5 0 May 24th, 2009 12:15 PM
Get UNICODE or ASCII Value of a character Eyob_the_pro C# 0 January 10th, 2007 03:42 AM





Powered by vBulletin®
Copyright ©2000 - 2020, Jelsoft Enterprises Ltd.
Copyright (c) 2020 John Wiley & Sons, Inc.