The Tru64 UNIX operating system supports different Chinese locales for different countries and areas. These include Taiwan, People's Republic of China (PRC), and Hong Kong. Table 3-1 shows the valid Chinese locales with different countries, codesets, and collating sequences.
Note
zh_TW is an alias of zh_TW.eucTW, and zh_CN is an alias of zh_CN.dechanzi.
Codeset | Locale | Collation Sequence |
---|---|---|
DEC Hanyu | zh_TW zh_TW.dechanyu zh_TW.dechanyu@radical zh_TW.dechanyu@stroke zh_TW.dechanyu@chuyin zh_HK.dechanyu | Internal code Internal code Radical Stroke Chuyin (Phonetic) Internal code |
Taiwanese EUC | zh_TW.eucTW zh_TW.eucTW@radical zh_TW.eucTW@stroke zh_TW.eucTW@chuyin zh_HK.eucTW | Internal code Radical Stroke Chuyin (Phonetic) Internal code |
Big-5 | zh_TW.big5 | Internal
code |
DEC Hanzi | zh_CN zh_CN.dechanzi zh_CN.dechanzi@radical zh_CN.dechanzi@stroke zh_CN.dechanzi@pinyin zh_HK.dechanzi | Internal code Internal code Radical Stroke Pinyin (Phonetic) Internal code |
GBK | zh_CN.GBK | Internal code (however, all characters in GB 2312 character set are ordered first, followed by GBK extension characters) |
GB18030 | zh_CN.GB18030 | Internal code (however, all characters in GB 2312 character set are ordered first, followed by GBK characters not already ordered, followed by GB18030 characters not already ordered) |
UTF-8 | zh_CN.UTF-8 | zh_CN.GBK |
The collation sequence for the GBK locale is further refined so that characters in the GB 2312 character set are ordered first followed by GBK extension characters.
The collation sequence for the GB18030 locale is further refined so that characters in the GB 2312 character set are ordered first, followed by GBK characters not already ordered, followed by GB18030 characters not already ordered.
Locales that support the same country and codeset are basically the same. The radical, stroke, pinyin, and chuyin modifiers after the at (@) sign specify different criterion for collation and sorting. Moreover, the characters defined in character set standards have collating precedence over user-defined characters, which, in turn, have precedence over undefined or reserved characters.
The dechanyu, eucTW, and dechanzi locales support locale names with @ucs4 modifiers. The @ucs4 modifier indicates that UCS-4 is used as the internal processing code. However, these locales are the same as the locales without the @ucs4 modifier. The @ucs4 subset is provided for backward compatibility and may be removed in the future. See l10n_intro(5) for more information.
If you are using DECwindows Motif, you can select the locale through the Language Menu of Session Manager. If you are using CDE, you can select the locale using the language menu on the CDE login screen. If you are superuser, you can use the SYSMAN I18NCONFIG utility to select a locale and define it as the system default.
The applicable locales are shown in Table 3-2.
Locale | Language Name |
---|---|
zh_TW | Traditional Chinese Taiwan |
zh_TW.dechanyu | Traditional Chinese Taiwan (DEC Hanyu) |
zh_TW.eucTW | Traditional Chinese Taiwan (EUC) |
zh_TW.big5 | Traditional Chinese Taiwan (Big5) |
zh_TW.UTF-8 | Traditional Chinese Taiwan (Unicode) |
zh_HK.dechanyu | Traditional Chinese Hong Kong (DEC Hanyu) |
zh_HK.dechanzi | Simplified Chinese Hong Kong (DEC Hanzi) |
zh_HK.eucTW | Traditional Chinese Hong Kong (EUC) |
zh_HK.big5 | Traditional Chinese Hong Kong (Big5) |
zh_HK.UTF-8 | Traditional Chinese Hong Kong (Unicode) |
zh_CN | Simplified Chinese PRC |
zh_CN.dechanzi | Simplified Chinese PRC (DEC Hanzi) |
zh_CN.GBK | Simplified Chinese PRC (GBK) |
zh_CN.GB18030 | Simplified Chinese PRC (GB18030) |
zh_CN.UTF-8 | Simplified Chinese PRC (Unicode) |
Note
For DECwindows Motif and CDE, the locale modifier is ignored.