X-Git-Url: https://git.distorted.org.uk/~mdw/sgt/charset/blobdiff_plain/707b88108c7530c06670e021df76cadfe74ba345..53163a60cc595558b83e22af71bf1ec3b1488323:/README diff --git a/README b/README index 456dfd8..8eb7c25 100644 --- a/README +++ b/README @@ -25,8 +25,10 @@ not currently support. Those that I know of are: machinery in place, and support all the underlying character sets. - - ISO-2022-CN and ISO-2022-CN-EXT (RFC 1922), and EUC-TW. These - encodings depend on the CNS 11643-1992 character set. + - ISO-2022-CN and ISO-2022-CN-EXT (RFC 1922). These are a little tricky + as they allow use of both GB2312 (simplified Chinese) and CNS 11643 + (traditional Chinese), so we may need some way to specify which to + prefer. - The Hong Kong (HKSCS) extension to Big5. Again, mapping tables are available in the Unihan database.