Extended Unix Coding

Extended Unix Coding (EUC) is an 8-bit character encoding used primarily for Japanese and Korean.

In Japan, the encoding is heavily used by Unix or Unix-like operating systems, but seldom used elsewhere. It is consequently the least-used of the big 3 Japanese encodings, behind both ISO-2022-JP (JIS) and Shift-JIS.

Contents

EUC-CN

EUC-CN is equivalent to GB2312 standard for simplified Chinese characters.

EUC-JP

In the EUC-JP coding for Japanese, the 7-bit values used in ISO-2022-JP are simply incremented by 128. This allows the easy mixing of 7-bit ASCII and 8-bit Japanese without the need for the escape characters employed by ISO-2022-JP.

EUC-KR

EUC-KR is a similar 8-bit coding of ISO-2022-KR (KS X 1001), implemented with the same principle of simply adding 128 to each byte.

EUC-TW

EUC-TW is a rarely used encoding for traditional Chinese characters as used on Taiwan. Big5 is much more common.

External links

See also: Extended Unix Coding, 8-bit, Big5, Character encoding, GB2312, ISO-2022-JP, Japan, Japanese language, Korean language, Operating system