A CLI utility to get information for Unicode characters.
chmap is a little utility to get information on Unicode characters,
such as its description, hexadecimal/octal representation, Unicode
category, UTF-8 encoding, and more.
$ chmap -r 9-10,13935,255-258
codepoint  glyph  encoded     case   description
        9         9           other  character tabulation
       10         A           other  line feed (lf)
    13935  㙯     E3 99 AF    other  modifier letter chinese tone yin ping
      255  ÿ      C3 BF       lower  latin small letter y with diaeresis
      256  Ā      C4 80       upper  latin capital letter a with macron
      257  ā      C4 81       lower  latin small letter a with macron
      258  Ă      C4 82       upper  latin capital letter a with breve
chmap was inspired by the charmap.exe tool present in Windows XP and
later.
Tarballs are available from GitHub releases for Linux-x86_64 and
Linux-armv6l. Extract, cd, and run:
$ sudo ./install.sh /usr/local
- a C99 compiler, GNU Make
- scdoc (manpage)
Download the latest source tarball from GitHub releases
(chmap-v$VERSION.tar.xz), extract, and build:
# make clean install
Run chmap with the -r flag (for range) to get info for a range of
Unicode runes:
$ chmap -r 0-5
codepoint  glyph  encoded      case   description
        0                      other  null
        1         1            other  start of heading
        2         2            other  start of text
        3         3            other  end of text
        4         4            other  end of transmission
        5         5            other  enquiry
Example ranges: 0-1, 355-369, 34, 189-192,12,45-49.
You may also list info for a list of given characters with the -c flag:
$ chmap -c Lovecraft
codepoint  glyph  encoded      case   description
       76  L      4C           upper  latin capital letter l
      111  o      6F           lower  latin small letter o
      118  v      76           lower  latin small letter v
      101  e      65           lower  latin small letter e
       99  c      63           lower  latin small letter c
      114  r      72           lower  latin small letter r
       97  a      61           lower  latin small letter a
      102  f      66           lower  latin small letter f
      116  t      74           lower  latin small letter t
You can search for characters that match a regex, too, with the -s flag:
$ chmap -s '^latin.*capital letter z'                                                       
codepoint  glyph  encoded      case   description
       90  Z      5A           upper  latin capital letter z
      377  Ź      C5 B9        upper  latin capital letter z with acute
      379  Ż      C5 BB        upper  latin capital letter z with dot above
      381  Ž      C5 BD        upper  latin capital letter z with caron
      437  Ƶ      C6 B5        upper  latin capital letter z with stroke
      548  Ȥ      C8 A4        upper  latin capital letter z with hook
     7824  Ẑ      E1 BA 90     upper  latin capital letter z with circumflex
     7826  Ẓ      E1 BA 92     upper  latin capital letter z with dot below
     7828  Ẕ      E1 BA 94     upper  latin capital letter z with line below
    11371  Ⱬ      E2 B1 AB     upper  latin capital letter z with descender
    11391  Ɀ      E2 B1 BF     upper  latin capital letter z with swash tail
    42950  Ᶎ      EA 9F 86     upper  latin capital letter z with palatal hook
Adding the -l flag causes chmap to print more information:
$ chmap -l -r 13000                                                                         
codepoint    13000 0x32C8  0o31310
encoding     UTF8(E3 8B 88)
glyph        ㋈ (2 columns)
description  ideographic telegraph symbol for september
case         other
category     Symbol (other)
See man chmap for more.
I miss some Windows utilities.
I found chars, but it lacked
several features which I'd like:
- A table-like output mode (which chmaphas by default). This makes viewing many entries more comfortable in a short terminal.
- A categoryfield to specify the Unicode category that the rune falls under.
- Window's charmap.exe
- Eric Raymond's asciiutility
- chars(https://github.com/antifuchs/chars)
- uninamefrom uniutils
This lame little utility is licensed under the MIT License. See
the LICENSE.md file for more information.