Man page - unicode-show(1)

Packages contas this manual

Manual

UNICODE-SHOW(1) helper-scripts Manual UNICODE-SHOW(1)

unicode-show - Scan and annotate suspicious Unicode characters

unicode-show - detect and annotate non-ASCII or suspicious Unicode characters in text or files

unicode-show [FILE]...

unicode-show is a utility that reads text input (from standard input or files) and highlights suspicious Unicode characters, such as those outside the safe ASCII range. This tool is useful for identifying potentially malicious or misleading Unicode characters in source code, logs, or user input.

  • The line it appears on, annotated with [U+XXXX] markers
  • A description including the character (if visible), Unicode codepoint, character name, and category

  • Characters outside the printable ASCII range (0x20-0x7E)
  • Control characters (excluding \n and \t)
  • Carriage returns (\r), even when used in CRLF pairs
  • Any character not in the standard set of ASCII letters, digits, punctuation, and trailing whitespace

  • Annotations are colorized using ANSI escape codes if stdout is a terminal and the environment is color-friendly
  • Red for inline [U+XXXX] markers
  • Cyan for character metadata descriptions

  • The environment variable $NOCOLOR is set
  • The environment variable $NO_COLOR is set to 1
  • $TERM is set to dumb
  • Output is redirected (non-interactive terminal)

This tool takes no options. Any arguments are treated as file paths. If no arguments are given, input is read from standard input.

  • 0 - No suspicious Unicode characters found
  • 1 - Suspicious characters were detected
  • 2 - An error occurred (e.g., file I/O or decoding failure)

unicode-show suspicious.txt

unicode-show file1.txt file2.md

cat file1.txt | unicode-show

NOCOLOR=1 unicode-show example.txt

  • NOCOLOR - disables color output if set
  • NO_COLOR - disables color output if set to 1
  • TERM - if set to dumb, disables color output

This man page has been written by Patrick Schleizer (adrelanos@whonix.org).

January 2020 helper-scripts