Asian System Environment Unicode Release Notes

This document describes the features of Asian System Environment (ASE)
Unicode support.


Japanese System Environment (JSE)
=================================

Locale and Code set
-------------------

o New locale

  JSE supports the ja_JP.utf8 locale as the Japanese UTF-8 locale.

o Code range

  * ja_JP.utf8

    JSE supports the standard characters of the following code sets:

      JIS X 0201 (1976)
      JIS X 0208 (1990)
      JIS X 0212 (1990)

  Note:
  - UDC (User Defined Characters, GAIJI) and VDC (Vendor Defined
    Characters) are not supported.
  - This code range applies to:
      Code Conversion
      Display
      Input Method
      Printing

Code Conversion
---------------

o Supported code sets

  iconv supports code conversions between the following code sets
  related to Unicode:

    ucs2
    ucs4
    utf8

  Example:
    iconv -f utf8 -t sjis UTF8.txt > SJIS.txt

o Conversion rule

  iconv supports the code conversion rule "JIS X 0221 Type Conversion
  (when used in conjunction with ASCII)" defined by TOG/JVC (The Open
  Group/Japanese Vendors Council). See the following document for
  reference:

    http://www.opengroup.or.jp/jvc/cde/cde.html

Display
-------

o ISO 8859 fonts

  The following characters, which overlap between the JIS X 0208/0212
  code sets and the ISO 8859 code sets, are displayed as the characters
  in ISO 8859 fonts.

  - Some symbol characters:
      ACUTE ACCENT
      DIAERESIS
      REVERSE SOLIDUS
      PLUS-MINUS SIGN
      MULTIPLICATION SIGN
      DIVISION SIGN
      DEGREE SIGN
      CENT SIGN
      POUND SIGN
      SECTION SIGN
      NOT SIGN
      PILCROW SIGN
      BREVE
      CARON (Mandarin Chinese third tone)
      CEDILLA
      DOT ABOVE (Mandarin Chinese light tone)
      DOUBLE ACUTE ACCENT
      MACRON
      OGONEK
      RING ABOVE
      GREEK TONOS
      GREEK DIALYTIKA TONOS
      INVERTED EXCLAMATION MARK
      BROKEN BAR
      INVERTED QUESTION MARK
      MASCULINE ORDINAL INDICATOR
      FEMININE ORDINAL INDICATOR
      COPYRIGHT SIGN
      REGISTERED SIGN
      CURRENCY SIGN
  - Greek Letters
  - Cyrillic Letters
  - Latin Letters (not including Basic Latin)
  - Latin Ligatures

o Back Slash

  The Back Slash character is displayed as the Yen Sign
  (0x5c: JIS X 0201) character.
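
  Because the Back Slash glyph is drawn as a Yen Sign, it can be hard to
  tell from the screen alone which character a file actually contains.
  The following is a minimal sketch for checking the stored bytes,
  assuming a POSIX shell with printf, od, and tr available, and assuming
  that the substitution affects only the displayed glyph and not the
  stored data. The helper name hex_bytes is hypothetical and not part of
  ASE. In UTF-8, REVERSE SOLIDUS (U+005C) is the single byte 0x5c, while
  YEN SIGN (U+00A5) is the two-byte sequence 0xc2 0xa5.

```shell
#!/bin/sh
# Hypothetical helper: print the bytes of its argument as lowercase hex.
hex_bytes() {
    printf '%s' "$1" | od -An -tx1 | tr -d ' \n'
}

# REVERSE SOLIDUS (U+005C): a single backslash inside single quotes.
backslash_hex=$(hex_bytes '\')

# YEN SIGN (U+00A5): built from its UTF-8 bytes using octal escapes.
yen_hex=$(hex_bytes "$(printf '\302\245')")

echo "REVERSE SOLIDUS (U+005C): $backslash_hex"
echo "YEN SIGN        (U+00A5): $yen_hex"
```

  The same pipeline can be pointed at a file (for example
  "od -An -tx1 UTF8.txt") to see whether a character that looks like a
  Yen Sign is stored as 5c or as c2 a5.
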

Input Method
------------

o Japanese Input Methods (common)

  Code input of Unicode is not supported.

o XJIM

  XJIM supports input for ja_JP.utf8 applications with characters in
  the following code sets:

    JIS X 0201
    JIS X 0208

  You can also input JIS X 0212 characters if you set the resource
  optionCharsets to true and then invoke xjim with LANG=ja_JP.utf8 or
  LANG=ja_JP.eucJP in a ja_JP.utf8 environment. In other words, you
  cannot use JIS X 0212 characters if you invoke either the application
  program or xjim with LANG=ja_JP.SJIS.
  See "HP XJIM Japanese Input Guide" for reference.

o ATOK8

  ATOK8 supports input for ja_JP.utf8 applications with characters in
  the following code sets:

    JIS X 0201
    JIS X 0208

o EGBridge

  EGBridge supports input for ja_JP.utf8 applications with characters
  in the following code sets:

    JIS X 0201
    JIS X 0208

o VJE-gamma

  VJE-gamma supports input for ja_JP.utf8 applications with characters
  in the following code sets:

    JIS X 0201
    JIS X 0208

o wdutil

  wdutil does not work if the current locale is ja_JP.utf8.
  To use wdutil, create the source file in SJIS or EUC encoding and
  invoke wdutil with LANG=ja_JP.SJIS or LANG=ja_JP.eucJP.

Printing
--------

o LP

  The -oja_JP.utf8 option is supported on the following model files.
  This option designates that the input file is UTF-8 encoded.

  Model Files:
    ESCP
    LIPS3
    LIPS4
    PCL4.nloo
    PCL5.nloo
    PCL5.asian
    PS.nlio
    hpc1200aj
    hpc1208a

o HP DPS

  The -x printer-pass-through=ja_JP.utf8 document attribute is
  supported on the printer models which use the following interface
  programs. This document attribute designates that the input file is
  UTF-8 encoded.

  Interface Programs:
    ESCP
    LIPS3
    LIPS4
    PCL4.nloo
    PCL5.nloo
    PCL5.asian
    PS.nlio
    hpc1200aj
    hpc1208a
    DP300

Problem Description and Fixes
-----------------------------

o VJE-gamma

  The two tools "KANKYOU SETTEI" and "JISYO utility" of VJE-gamma do
  not work if you invoke VJE-gamma in the ja_JP.utf8 locale.
  To invoke these functions, execute the following from the command
  line.

  To invoke "KANKYOU SETTEI":
    $ LANG=ja_JP.SJIS /opt/vje/lbin/vjeenv &

  To invoke "JISYO utility":
    $ LANG=ja_JP.SJIS /opt/vje/lbin/vuty &


Korean System Environment (KSE)
===============================

Locale and Code set
-------------------

o New locale

  KSE supports the ko_KR.utf8 locale as the Korean UTF-8 locale.

o Code range

  * ko_KR.utf8

    KSE supports the standard characters of the following code sets:

      KSC 5636 (1989)
      KSC 5601 (1987)

  Note:
  - UDC (User Defined Characters) are not supported.
  - This code range applies to:
      Code Conversion
      Display
      Input Method
      Printing

Code Conversion
---------------

o Supported code sets

  iconv supports code conversions between the following code sets
  related to Unicode:

    ucs2
    ucs4
    utf8

  Example:
    iconv -f utf8 -t eucKR UTF8.txt > EUC.txt

Display
-------

o ISO 8859 fonts

  The following characters, which overlap between the KSC 5601 code
  set and the ISO 8859 code sets, are displayed as the characters in
  ISO 8859 fonts.

  - Some symbol characters:
      MIDDLE DOT
      DIAERESIS
      SOFT HYPHEN
      PLUS-MINUS SIGN
      MULTIPLICATION SIGN
      DIVISION SIGN
      DEGREE SIGN
      SECTION SIGN
      ACUTE ACCENT
      CARON
      BREVE
      DOUBLE ACUTE ACCENT
      RING ABOVE
      DOT ABOVE
      CEDILLA
      OGONEK MARK
      MODIFIER LETTER TRIANGULAR COLON
      CURRENCY SIGN
      PILCROW SIGN
      FEMININE ORDINAL INDICATOR
      MASCULINE ORDINAL INDICATOR
      VULGAR FRACTION ONE HALF
      VULGAR FRACTION ONE QUARTER
      VULGAR FRACTION THREE QUARTERS
      SUPERSCRIPT ONE
      SUPERSCRIPT TWO
      SUPERSCRIPT THREE
  - Greek Letters
  - Latin Letters (not including Basic Latin)
  - Latin Ligatures
  - Cyrillic Letters

Input Method
------------

o XKIM

  XKIM supports input for ko_KR.utf8 applications.
  Code input of Unicode is not supported.

o wdutil

  wdutil does not work if the current locale is ko_KR.utf8.
  To use wdutil, create the source file in EUC encoding and invoke
  wdutil with LANG=ko_KR.eucKR.

Printing
--------

o LP

  The -oko_KR.utf8 option is supported on the following model files.
  This option designates that the input file is UTF-8 encoded.

  Model Files:
    ESCP
    PCL4.nloo
    PCL5.nloo
    hpc1200ak
    hpc1208a

o HP DPS

  The -x printer-pass-through=ko_KR.utf8 document attribute is
  supported on the printer models which use the following interface
  programs. This document attribute designates that the input file is
  UTF-8 encoded.

  Interface Programs:
    ESCP
    PCL4.nloo
    PCL5.nloo
    hpc1200ak
    hpc1208a


Simplified Chinese System Environment (SSE)
===========================================

Locale and Code set
-------------------

o New locale

  SSE supports the zh_CN.utf8 locale as the Simplified Chinese UTF-8
  locale.

o Code range

  * zh_CN.utf8

    SSE supports the standard characters of the following code sets:

      ISO 646 (1991)
      GB 2312 (1980)

  Note:
  - UDC (User Defined Characters) are not supported.
  - This code range applies to:
      Code Conversion
      Display
      Input Method
      Printing

Code Conversion
---------------

o Supported code sets

  iconv supports code conversions between the following code sets
  related to Unicode:

    ucs2
    ucs4
    utf8

  Example:
    iconv -f utf8 -t hp15CN UTF8.txt > HP15.txt

Display
-------

o ISO 8859 fonts

  The following characters, which overlap between the GB 2312 code set
  and the ISO 8859 code sets, are displayed as the characters in
  ISO 8859 fonts.

  - Some symbol characters:
      MODIFIER LETTER MACRON (Mandarin Chinese first tone)
      CARON (Mandarin Chinese third tone)
      DIAERESIS
      PLUS-MINUS SIGN
      MULTIPLICATION SIGN
      DIVISION SIGN
      DEGREE SIGN
      CURRENCY SIGN
      SECTION SIGN
  - Greek Letters
  - Cyrillic Letters
  - Latin Letters (not including Basic Latin)

Input Method
------------

o XSIM

  XSIM supports input for zh_CN.utf8 applications.
  Code input of Unicode is not supported.

Printing
--------

o LP

  The -ozh_CN.utf8 option is supported on the following model files.
  This option designates that the input file is UTF-8 encoded.

  Model Files:
    ESCP
    PCL4.nloo
    PCL5.nloo
    hpc1200ac
    hpc1208a

o HP DPS

  The -x printer-pass-through=zh_CN.utf8 document attribute is
  supported on the printer models which use the following interface
  programs.
  This document attribute designates that the input file is UTF-8
  encoded.

  Interface Programs:
    ESCP
    PCL4.nloo
    PCL5.nloo
    hpc1200ac
    hpc1208a


Traditional Chinese System Environment (TSE)
============================================

Locale and Code set
-------------------

o New locale

  TSE supports the zh_TW.utf8 and zh_HK.utf8 locales as the
  Traditional Chinese UTF-8 locales.

o Code range

  * zh_TW.utf8 and zh_HK.utf8

    TSE supports the standard characters of the following code sets:

      ISO 646 (1991)
      CNS 11643 plane 1 (1992)
      CNS 11643 plane 2 (1992)
      CNS 11643 plane 3 (1992) *1
      CNS 11643 plane 4 (1992) *1

  Note:
  - The full range of BIG5 is supported through CNS 11643 planes 1
    and 2.
  - *1: Except for some characters which are not supported in
    Unicode 2.0.
  - UDC (User Defined Characters) are not supported.
  - This code range applies to:
      Code Conversion
      Display
      Input Method
      Printing

Code Conversion
---------------

o Supported code sets

  iconv supports code conversions between the following code sets
  related to Unicode:

    ucs2
    ucs4
    utf8

  Example:
    iconv -f utf8 -t eucTW UTF8.txt > EUC.txt

Display
-------

o ISO 8859 fonts

  The following characters, which overlap between the CNS 11643 code
  sets and the ISO 8859 code sets, are displayed as the characters in
  ISO 8859 fonts.

  - Some symbol characters:
      MIDDLE DOT
      SECTION SIGN
      MULTIPLICATION SIGN
      DIVISION SIGN
      PLUS-MINUS SIGN
      DEGREE SIGN
      DOT ABOVE
      MODIFIER LETTER MACRON
      MODIFIER LETTER ACUTE ACCENT
      CARON
      MODIFIER LETTER GRAVE ACCENT
  - Greek Letters

Input Method
------------

o XTIM

  XTIM supports input for zh_TW.utf8 and zh_HK.utf8 applications.
  Code input of Unicode is not supported.

Printing
--------

o LP

  The -ozh_TW.utf8 option is supported on the following model files.
  This option designates that the input file is UTF-8 encoded.

  Model Files:
    ESCP
    PCL4.nloo
    PCL5.nloo
    hpc1200at
    hpc1205at
    hpc1208a

o HP DPS

  The -x printer-pass-through=zh_TW.utf8 document attribute is
  supported on the printer models which use the following interface
  programs.
  This document attribute designates that the input file is UTF-8
  encoded.

  Interface Programs:
    ESCP
    PCL4.nloo
    PCL5.nloo
    hpc1200at
    hpc1205at
    hpc1208a

-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-