最新的Unicode文字编码标准v 6.1.0正式发布。主要变化包括:新增732个新字符,7种全新文字,支持通过变量选择符区分绘文字(emoji)风格和文本风格的符号和表情符号;更新断行算法以更准确反映日本和希伯来文文本,等等。
上述文字来自:http://it.solidot.org/article.pl?sid=12/02/02/1124247&from=rss
来自官方:http://www.unicode.org/versions/Unicode6.1.0/
Unicode 6.1.0 is a minor version of the Unicode Standard and supersedes all previous versions. This page summarizes the important changes for the Unicode Standard, Version 6.1.0. In the discussion below, Version 6.1.0 may be abbreviated as "Unicode 6.1" or "Version 6.1."
Version 6.1 of the Unicode Standard continues the Unicode Consortium's long-term commitment to support the full diversity of languages around the world. This latest version adds characters to support additional languages of China, other Asian countries, and Africa. It also addresses educational needs in the Arabic-speaking world. A total of 732 new characters have been added.
This version of the Standard also brings technical improvements to support implementers. Improved changes to property values and their aliases mean that properties now have easy-to-specify labels. The new labels combined with a new script extensions property means that regular expressions can be more straightforward and are easier to validate. Hangul algorithms were consolidated and restructured. Before, one had to examine four separate documents. Now, the information is consolidated in the core specification in Chapter 3, Conformance.
Over 200 new Standardized Variants have been added for emoji characters, allowing implementations to distinguish preferred display styles between text and emoji styles.
Among the notable property changes and additions in Unicode 6.1 are two new line break property values, which improve the line-breaking behavior of Hebrew and Japanese text. Segmentation behavior was also improved for Thai, Lao, and similar languages. The processing of Chinese data has been augmented by more fully specified information on mapping between Simplified and Traditional Chinese characters, in addition to other improved Unihan data that supports the processing of Chinese data.
For detailed property changes see Section F. Unicode Character Database Changes.
Version 6.1 has minor conformance updates, including the determination of grapheme cluster boundaries and the processing of combining canonical class and decomposition mapping. There are documentation improvements throughout.
Two other important Unicode specifications are maintained in synchrony with the Unicode Standard, and have updates for Version 6.1:
This latest version of the Unicode Standard is synchronized in repertoire with the forthcoming third edition of 10646: ISO/IEC 10646:2012.