Fontforge support cjk extension b6/2/2023 ![]() $ vim '+e ++enc=eucJP ju-file.txt'Īn old Japanese MS-Windows text file, " jw-file.txt", stored in the so called shift-JIS encoding (more precisely: CP932) can be edited with vim by the following. $ vim '+e ++enc=latin2 pu-file.txt'Īn old Japanese unix text file, " ju-file.txt", stored in the eucJP encoding can be edited with vim by the following. This is possible since the auto detection mechanism of the file encoding in vim assumes the UTF-8 encoding first and, if it fails, assumes it to be latin1.Īn old Polish Unix text file, " pu-file.txt", stored in the latin2 (iso-8859-2) encoding can be edited with vim by the following. You should use these editors under the UTF-8 locale in the UTF-8 capable console for the best compatibility.Īn old western European Unix text file, " u-file.txt", stored in the latin1 (iso-8859-1) encoding can be edited simply with vim by the following. Intelligent modern editors such as the vim program are quite smart and copes well with any encoding systems and any file formats. ![]() JIS X 0208 Appendix 1 standard for Japanese (see CP932) Old Japanese UNIX standard 8 bit code and completely different from Shift-JIS Standard encoding for Japanese email which uses only 7 bit codes Old Russian UNIX standard for the Cyrillic alphabet Old standard for western European languages, ISO-8859-1 with euro signĬode page 850, Microsoft DOS characters with graphics for western European languages, ISO-8859-1 variantĬode page 932, Microsoft Windows style Shift-JIS variant for JapaneseĬode page 936, Microsoft Windows style GB2312, GBK or GB18030 variant for Simplified ChineseĬode page 949, Microsoft Windows style EUC-KR or Unified Hangul Code variant for KoreanĬode page 950, Microsoft Windows style Big5 variant for Traditional ChineseĬode page 1251, Microsoft Windows style encoding for the Cyrillic alphabetĬode page 1252, Microsoft Windows style ISO-8859-15 variant for western European languages Old standard for eastern European languages, ASCII + accented characters Old standard for western European languages, ASCII + accented characters Text format converter between Macintosh and Unix: frommac(1) and tomac(1)Īmerican Standard Code for Information Interchange, 7 bit code w/o accented charactersĬurrent multilingual standard for all modern OSs Text format converter between DOS and Unix: fromdos(1) and todos(1) Replace accented letters by their unaccented equivalent Text encoding converter between locales (fancy) Text encoding converter between locales (versatile, more aliases and features) Text encoding converter between locales by iconv(1) (fundamental)
0 Comments
Leave a Reply. |