Man page - mknmz(1)

Packages contains this manual

Manual

MKNMZ

NAME
SYNOPSIS
DESCRIPTION
Target files:
Morphological Analysis:
Text Operations:
Summarization:
Index Construction:
Miscellaneous:
REPORTING BUGS
COPYRIGHT

NAME

mknmz - an indexer of Namazu

SYNOPSIS

mknmz [ options ] <target> ...

DESCRIPTION

mknmz 2.0.21, an indexer of Namazu.

Target files:

-a , --all

target all files.

-t , --media-type = MTYPE

set the media type for all target files to MTYPE.

-h , --mailnews

same as --media-type= ’message/rfc822’

--mhonarc

same as --media-type= ’text/html; x-type=mhonarc’

-F , --target-list = FILE

load FILE which contains a list of target files.

--allow = PATTERN

set PATTERN for file names which should be allowed.

--deny = PATTERN

set PATTERN for file names which should be denied.

--exclude = PATTERN

set PATTERN for pathnames which should be excluded.

-e , --robots

exclude HTML files containing <meta name="ROBOTS" content="NOINDEX">

-M , --meta

handle HTML meta tags for field-specified search.

-r , --replace = CODE

set CODE for replacing URI.

--html-split

split an HTML file with <a name="..."> anchors.

--mtime = NUM

limit by mtime just like find(1)’s -mtime option. e.g., -50 for recent 50 days, +50 for older than 50.

Morphological Analysis:

-b , --use-mecab

use MeCab for analyzing Japanese.

-c , --use-chasen

use ChaSen for analyzing Japanese.

-k , --use-kakasi

use KAKASI for analyzing Japanese.

-m , --use-chasen-noun

use ChaSen for extracting only nouns.

-L , --indexing-lang = LANG index with language specific processing.

Text Operations:

-E , --no-edge-symbol

remove symbols on edge of word.

-G , --no-okurigana

remove Okurigana in word.

-H , --no-hiragana

ignore words consist of Hiragana only.

-K , --no-symbol

remove symbols.

--decode-base64

decode base64 bodies within multipart entities.

Summarization:

-U , --no-encode-uri

do not encode URI.

-x , --no-heading-summary do not make summary with HTML’s headings.

Index Construction:

--update = INDEX

set INDEX for updating.

-z , --check-filesize

detect file size changed.

-Y , --no-delete

do not detect removed documents.

-Z , --no-update

do not detect update and deleted documents.

Miscellaneous:

-s , --checkpoint

turn on the checkpoint mechanism.

-C , --show-config

show the current configuration.

-f , --config = FILE

use FILE as a config file.

-I , --include = FILE

include your customization FILE.

-O , --output-dir = DIR

set DIR to output the index.

-T , --template-dir = DIR

set DIR having NMZ.{head,foot,body}.*.

-q , --quiet

suppress status messages during execution.

-v , --version

show the version of namazu and exit.

-V , --verbose

be verbose.

-d , --debug

be debug mode.

--help

show this help and exit.

--norc

do not read the personal initialization files.

--

Terminate option list.

REPORTING BUGS

Report bugs to <http://www.namazu.org/trac-namazu/trac.cgi> or <bug-namazu@namazu.org>.

COPYRIGHT

Copyright © 1997-1999 Satoru Takabayashi All rights reserved.
Copyright © 2000-2009 Namazu Project All rights reserved.

This is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.