machinery in place, and support all the underlying character
sets.
- - ISO-2022-CN and ISO-2022-CN-EXT (RFC 1922), and EUC-TW. These
- encodings depend on the CNS 11643-1992 character set. Mapping
- table data for this set is available from unicode.org, but only
- in the Unihan database
- ftp://ftp.unicode.org/Public/UNIDATA/Unihan.zip
+ - ISO-2022-CN and ISO-2022-CN-EXT (RFC 1922). These are a little tricky
+ as they allow use of both GB2312 (simplified Chinese) and CNS 11643
+ (traditional Chinese), so we may need some way to specify which to
+ prefer.
- The Hong Kong (HKSCS) extension to Big5. Again, mapping tables
are available in the Unihan database.