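What is the difference between ASCII and Unicode?

What exactly is the difference between Unicode and ASCII?

ASCII has 128 characters in total (256 in the extended set).

Is there any size specification for Unicode characters?

Current answer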

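ASCII defines 128 characters, which map to the numbers 0-127. Unicode defines (fewer than) 2^21 characters, which similarly map to the numbers 0-2^21 (though not all numbers are currently assigned, and some are reserved).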
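As a rough check of that range, here is a small Python sketch. It assumes Python 3, where chr() accepts exactly the code points Unicode allows (0 through 0x10FFFF, which is below 2^21); the helper name is_valid_code_point is made up for illustration.

```python
import sys

# Highest code point Unicode allows; note that it is below 2**21.
MAX_CODE_POINT = 0x10FFFF


def is_valid_code_point(n: int) -> bool:
    """Return True if chr() accepts n as a code point (hypothetical helper)."""
    try:
        chr(n)
        return True
    except ValueError:
        return False


print(MAX_CODE_POINT < 2**21)             # the whole range fits in 21 bits
print(sys.maxunicode == MAX_CODE_POINT)   # Python's own upper limit
print(is_valid_code_point(0x10FFFF))      # last allowed value
print(is_valid_code_point(0x110000))      # one past the end
```

Being accepted by chr() only means the number is inside the range; it does not mean a character has been assigned to it.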

Unicode is a superset of ASCII, and the numbers 0-127 have the same meaning in ASCII as they have in Unicode. For example, the number 65 means "Latin capital letter 'A'".
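The superset relationship is easy to check for yourself. A minimal Python 3 sketch, using only the standard library (the variable names are just for illustration):

```python
import unicodedata

# Number 65 is the same character in ASCII and in Unicode.
letter = chr(65)
letter_name = unicodedata.name(letter)

# The ASCII and UTF-8 encodings of that character are the same single byte.
ascii_bytes = letter.encode("ascii")
utf8_bytes = letter.encode("utf-8")

# Every ASCII value 0..127 decodes to the Unicode code point with the same number.
all_match = all(bytes([n]).decode("ascii") == chr(n) for n in range(128))

print(letter, letter_name)
print(ascii_bytes == utf8_bytes)
print(all_match)
```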

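Because Unicode characters don't generally fit into one 8-bit byte, there are numerous ways of storing Unicode characters in byte sequences, such as UTF-32 and UTF-8.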

2013-10-06 18:29:09

Other answers

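Java provides support for Unicode, so it can represent the scripts of the world. For that reason a char in Java is 2 bytes wide, with a range of 0 to 65535. Note that a char is one UTF-16 code unit, so a code point above 65535 takes two chars.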
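To see where a 16-bit unit stops being enough, here is a sketch in Python rather than Java. It counts 16-bit UTF-16 code units, which is the same thing a Java char holds; the function name utf16_code_units is hypothetical.

```python
def utf16_code_units(text: str) -> int:
    """Count the 16-bit units needed to store text as UTF-16 (hypothetical helper)."""
    # "utf-16-le" writes no byte order mark, so every 2 bytes is one code unit.
    return len(text.encode("utf-16-le")) // 2


print(utf16_code_units("A"))           # U+0041, fits in one unit
print(utf16_code_units("\u4e2d"))      # U+4E2D, still below 65536
print(utf16_code_units("\U0001F600"))  # U+1F600, above 65535
```

The last character needs two units, which is why a single 16-bit value cannot hold every Unicode character.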

2017-11-04 06:32:12

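ASCII has 128 code points, 0 through 127. It fits in a single 8-bit byte, and the values 128 through 255 tended to be used for other characters. The choices were incompatible, which caused the code page disaster: text encoded in one code page cannot be read correctly by a program that assumes or guesses at another code page.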
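A quick way to see the code page problem is to decode one byte above 127 under several code pages. This is a Python 3 sketch; the three code pages are just examples chosen from the codecs that ship with Python.

```python
# One byte with a value above 127; ASCII says nothing about what it means.
raw = bytes([0xE9])

# The same byte decoded under three different legacy code pages.
decoded = {cp: raw.decode(cp) for cp in ("cp1252", "cp437", "cp1251")}

for code_page, char in decoded.items():
    print(code_page, hex(ord(char)), char)

# Strict ASCII refuses the byte outright.
try:
    raw.decode("ascii")
    ascii_accepts = True
except UnicodeDecodeError:
    ascii_accepts = False
print(ascii_accepts)
```

The byte comes out as a different character under each code page, so a reader that guesses the wrong one shows the wrong text.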

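Unicode came about to solve this disaster. Version 1 started out with 65,536 code points, commonly encoded in 16 bits. Version 2 later extended this to about 1.1 million code points. The current version is 6.3, which uses 110,187 of the roughly 1.1 million available code points. That no longer fits in 16 bits.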

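When v2 came around, 16-bit encoding was already common, for example in the Microsoft and Apple operating systems and in language runtimes like Java. The v2 spec came up with a way to map those 1.1 million code points into 16 bits: an encoding called UTF-16, a variable-length encoding in which one code point takes either 2 or 4 bytes. The original v1 code points take 2 bytes; the added ones take 4.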
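The 4-byte case works by splitting the code point into two 16-bit "surrogate" values. The answer does not spell out the arithmetic, so the following Python 3 sketch fills it in from the standard UTF-16 rules; the function name to_surrogate_pair is made up for illustration.

```python
from typing import Tuple


def to_surrogate_pair(code_point: int) -> Tuple[int, int]:
    """Split a code point above U+FFFF into its two UTF-16 units (illustrative)."""
    if not 0x10000 <= code_point <= 0x10FFFF:
        raise ValueError("only code points above U+FFFF need a surrogate pair")
    offset = code_point - 0x10000       # 20 bits remain
    high = 0xD800 + (offset >> 10)      # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)     # bottom 10 bits
    return high, low


high, low = to_surrogate_pair(0x1F600)
print(hex(high), hex(low))

# Cross-check against Python's own UTF-16 encoder (big-endian, no BOM).
manual = high.to_bytes(2, "big") + low.to_bytes(2, "big")
print(manual == "\U0001F600".encode("utf-16-be"))
```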

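Another very common variable-length encoding, used in *nix operating systems and tools, is UTF-8. A code point takes between 1 and 4 bytes: the original ASCII codes take 1 byte and the rest take more. The only fixed-length encoding is UTF-32, which takes 4 bytes for every code point. It is not often used because it is quite wasteful. There are others, such as UTF-1 and UTF-7, which are widely ignored.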
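The byte counts are easy to confirm. A minimal Python 3 sketch with four sample characters picked for illustration, one for each UTF-8 length:

```python
# One sample character per UTF-8 length: U+0041, U+00E9, U+20AC, U+1F600.
samples = ["A", "\u00e9", "\u20ac", "\U0001F600"]

# Bytes per character in UTF-8 (variable) and UTF-32 (fixed).
# The "-le" codec names avoid the byte order mark that plain "utf-32" prepends.
utf8_lengths = [len(ch.encode("utf-8")) for ch in samples]
utf32_lengths = [len(ch.encode("utf-32-le")) for ch in samples]

for ch, n8, n32 in zip(samples, utf8_lengths, utf32_lengths):
    print(f"U+{ord(ch):04X}: UTF-8 {n8} bytes, UTF-32 {n32} bytes")
```

For mostly-ASCII text this is the waste mentioned above: UTF-32 spends 4 bytes where UTF-8 spends 1.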

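One problem with the UTF-16 and UTF-32 encodings is that the order of the bytes depends on the endianness of the machine that created the text stream. So add to the mix UTF-16BE, UTF-16LE, UTF-32BE, and UTF-32LE.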
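Here is what the byte order difference looks like for a single letter, as a short Python 3 sketch using the explicit BE/LE codecs:

```python
# The same character, 'A' (U+0041), under each byte order.
utf16_be = "A".encode("utf-16-be")
utf16_le = "A".encode("utf-16-le")
utf32_be = "A".encode("utf-32-be")
utf32_le = "A".encode("utf-32-le")

for name, data in [("UTF-16BE", utf16_be), ("UTF-16LE", utf16_le),
                   ("UTF-32BE", utf32_be), ("UTF-32LE", utf32_le)]:
    print(name, data.hex())

# Reading with the wrong byte order silently yields a different character.
misread = utf16_be.decode("utf-16-le")
print(hex(ord(misread)))
```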

Having these different encoding choices brings back the code page disaster to some degree, along with heated debates among programmers about which UTF choice is "best". Their association with operating system defaults pretty much draws the lines. One counter-measure is the definition of a BOM, the Byte Order Mark, a special code point (U+FEFF, zero width no-break space) at the beginning of a text stream that indicates how the rest of the stream is encoded. It indicates both the UTF encoding and the endianness, and it is neutral to a text rendering engine. Unfortunately it is optional, and many programmers claim their right to omit it, so accidents are still pretty common.
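A reader can look for the BOM with a few prefix checks. This is only a sketch under stated assumptions: sniff_bom is a hypothetical helper, the BOM constants come from Python's standard codecs module, and a stream without a BOM is simply reported as unknown rather than guessed.

```python
import codecs
from typing import Optional

# Order matters: the UTF-32-LE BOM (FF FE 00 00) starts with the
# UTF-16-LE BOM (FF FE), so the longer marks are tested first.
_BOMS = [
    (codecs.BOM_UTF32_LE, "utf-32-le"),
    (codecs.BOM_UTF32_BE, "utf-32-be"),
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
]


def sniff_bom(data: bytes) -> Optional[str]:
    """Return the encoding named by a leading BOM, or None (hypothetical helper)."""
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name
    return None


print(sniff_bom("\ufeffhi".encode("utf-16-le")))
print(sniff_bom(codecs.BOM_UTF8 + b"hi"))
print(sniff_bom(b"hi"))  # no BOM: the reader has to find out some other way
```

The check is not airtight: UTF-16-LE text that begins with a BOM followed by U+0000 has the same first four bytes as a UTF-32-LE BOM.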

2013-10-06 19:12:30

ASCII and Unicode are two character encodings. Basically, they are standards for representing different characters in binary so that they can be written, stored, transmitted, and read in digital media. The main difference between the two is in the way they encode characters and the number of bits they use for each. ASCII originally used seven bits to encode each character. This was later increased to eight with Extended ASCII to address the apparent inadequacy of the original. In contrast, Unicode offers several encodings, built on 32-bit, 16-bit, and 8-bit units (UTF-32, UTF-16, and UTF-8). Using more bits per unit gives simpler, fixed-size characters at the expense of larger files, while fewer bits save a lot of space for text that is mostly ASCII. Using fewer bits (i.e. UTF-8 or ASCII) would probably be best if you are encoding a large document in English.

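One of the main reasons Unicode was needed is the many non-standard extended ASCII code pages. Unless you are using the prevalent code page, the one used by Microsoft and most other software companies, you are likely to see characters displayed as boxes. Unicode virtually eliminates this problem because all character code points are standardized.

Another major advantage of Unicode is that, at its maximum, it can accommodate a huge number of characters. Because of this, Unicode currently contains most written languages and still has room for more. This includes typical left-to-right scripts like English and even right-to-left scripts like Arabic. Chinese, Japanese, and many other variants are also represented in Unicode. So Unicode will not be replaced anytime soon.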

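To stay compatible with the older ASCII, which was already in widespread use at the time, Unicode was designed so that its first 256 code points (the values that fit in eight bits) match the most popular extended ASCII code page, ISO 8859-1 (Latin-1). So if you open an ASCII-encoded file as Unicode, for example a plain 7-bit ASCII file read as UTF-8, you still get the correct characters. This helped the adoption of Unicode, because it lessened the impact of a new encoding standard on those who were already using ASCII.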
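Both halves of that compatibility claim can be checked in a few lines of Python 3. The sketch assumes the "most popular ASCII page" is ISO 8859-1 (Latin-1) and uses made-up sample bytes in place of a real file.

```python
# Stand-in for the contents of an old ASCII file (made-up sample data).
ascii_file_contents = b"Hello, ASCII!"

# Decoding the same bytes as ASCII or as UTF-8 gives the same text.
as_ascii = ascii_file_contents.decode("ascii")
as_utf8 = ascii_file_contents.decode("utf-8")
print(as_ascii == as_utf8)

# Each Latin-1 byte value 0..255 maps to the Unicode code point with the same number.
latin1_matches = all(bytes([n]).decode("latin-1") == chr(n) for n in range(256))
print(latin1_matches)

# Caveat: Latin-1 bytes above 127 are not valid UTF-8 on their own.
try:
    bytes([0xE9]).decode("utf-8")
    high_byte_is_utf8 = True
except UnicodeDecodeError:
    high_byte_is_utf8 = False
print(high_byte_is_utf8)
```

So the code point numbers line up for all 256 values, but byte-for-byte file compatibility with UTF-8 holds only for the 7-bit ASCII range.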

Summary:

1. ASCII uses a 7-bit encoding (8 bits in its extended forms), while Unicode uses variable-length encodings.
2. Unicode is a single standard, while the extended ASCII code pages are many and mutually incompatible.
3. Unicode represents most written languages in the world, while ASCII does not.
4. ASCII has its equivalent within Unicode.

Source: http://www.differencebetween.net/technology/software-technology/difference-between-unicode-and-ascii/#ixzz4zEjnxPhs

2017-11-23 07:14:57

ASCII defines 128 characters, whereas Unicode contains more than 120,000 characters.

2015-08-16 03:33:54
