nkf - Network Kanji code conversion Filter v2.0.4


SUBMITTED BY: Guest

DATE: June 6, 2014, 9:02 a.m.

FORMAT: Text only

SIZE: 4.3 kB

HITS: 3311

  1. NAME
  2. nkf - Network Kanji code conversion Filter v2.0.4
  3. SYNOPSIS
  4. nkf [ options ] [ file ]
  5. DESCRIPTION
  6. Nkf is a yet another kanji code converter among networks, hosts and terminals. It converts input kanji code to designated kanji code such as 7-bit JIS, MS-kanji (shifted-JIS), utf-8 or EUC.
  7. One of the most unique facicility of nkf is the guess of the input kanji code. It currently recognizes 7-bit JIS, MS-kanji (shifted-JIS),utf-8 and EUC. So users needn't the input kanji code specification.
  8. By default X0201 kana is converted into X0208 kana. For X0201 kana, SO/SI, SSO and ESC-(-I methods are supported. For automatic code detection, nkf assumes no X0201 kana in MS-Kanji. To accept X0201 in MS-Kanji, use -X, -x or -S.
  9. Options are bellow:
  10. -j
  11. output 7-bit JIS code. This is a default.
  12. -s
  13. output MS-kanji (shifted-JIS) code.
  14. -e
  15. output EUC (AT&T) code.
  16. -w
  17. output UTF-8 (Unicode 8bit form).
  18. -S
  19. Assume MS-Kanji and X0201 kana input. It also accpet JIS. AT&T EUC is recognized as X0201 kana. Without -x flag, X0201 kana is converted into X0208.
  20. -J
  21. Assume JIS input. It also accepts Japanese EUC. This is the default. This flag does not excludde MS-Kanji.
  22. -E
  23. Assume AT&T EUC input. It also accpet JIS. Same as -J.
  24. -B
  25. Assume broken JIS-Kanji, which lost ESC. Usefull when your site is using old B-News Nihongo patch. -B1 allows any char after ESC-( or ESC-$. -B2 forces ASCII after NL.
  26. -W
  27. Assume UTF-8 input.
  28. -m
  29. MIME ISO-2022-JP/ISO8859-1 decode. (default) To see ISO8859-1 (Latin-1) -l is necessary. -mN does loose encoding. It allows line break in the middle of the basr64 encoding.
  30. -mB
  31. Decode MIME base64 encoded stream. Remove header or other part before conversion.
  32. -mQ
  33. Decode MIME quoted stream. '_' in quoted is converted to space.
  34. -m0
  35. No MIME decode.
  36. -M
  37. MIME encode. Header style. All ASCII code and control characters are intact.
  38. -MB
  39. MIME encode. Base64 stream. Kanji conversion is performed before encoding, so this cannot be used as a picture encoder. MQ perfome quoted encoding.
  40. -l
  41. Input and output code is ISO8859-1 (Latin-1) and ISO-2022-JP. -s, -e and -x are not compatible with this option.
  42. -fn
  43. Folding on n length in a line. Default 60. -f40-0 forces 0 margin folding.
  44. -X
  45. Allow X0201 kana in MS-Kanji. X0201 is converted into X0208 by default. This is default in MSDOS.
  46. -x
  47. Try to preseve X0208 kana. Assume X0201 kana in MS-Kanji. And do not convert X0201 kana to X0208. In JIS output, ESC-(-I is used. In EUC output, SSO is used.
  48. -Z
  49. Convert X0208 alphabet to ASCII. -Z1 converts X0208 kankaku to one ASCII space. -Z2 converts X0208 kankaku to two ASCII spaces.
  50. -Z3
  51. Replacing "><&" into '>', '<', '"', '&' as in HTML.
  52. -I
  53. Replacing Non iso-2022-jp char into the strage geta character.
  54. -b
  55. bufferd output. This is a default.
  56. -u
  57. unbufferd output.
  58. -t
  59. no operations.
  60. -O
  61. Output result to file. The first name in arguments becomes output. Please be careful. If there are no file arguments, nkf.out is chosen. --overwrite does rewriting. Original listed files are replaced by filtered result.
  62. -ic
  63. output ESC-$-c as sequence to designate JIS-kanji (Default is B.)
  64. -oc
  65. output ESC-(-c as sequence to designate single-byte roman characters (Default is B.)
  66. -r
  67. {de/en}crypt ROT13/47
  68. -v
  69. display Version
  70. -T
  71. Text mode output (MS-DOS)
  72. -c
  73. add CR after NL.
  74. -d
  75. delete CR after NL.
  76. -L[wmu] new line mode
  77. -Lu unix (LF)
  78. -Lw windows (CRLF)
  79. -Lm mac (CR)
  80. default no conversion
  81. -F New line preserving line folding. hiragana/katakana translation -h1 --katakana -h2 --hirakana -h3 --hirakana-katakana
  82. --
  83. long options --fj,--unix,--mac,--msdos, --windows convert for these system --jis,--euc,--sjis,--mime,--base64 convert for named code --jis-input,--euc-input,--sjis-input,--mime-input,--base64-input assume input system -- ignore rest of -option --help --version
  84. FILES
  85. nkf - binary

comments powered by Disqus