Man page - mknmz(1)
Packages contains this manual
Manual
MKNMZ
NAMESYNOPSIS
DESCRIPTION
Target files:
Morphological Analysis:
Text Operations:
Summarization:
Index Construction:
Miscellaneous:
REPORTING BUGS
COPYRIGHT
NAME
mknmz - an indexer of Namazu
SYNOPSIS
mknmz [ options ] <target> ...
DESCRIPTION
mknmz 2.0.21, an indexer of Namazu.
Target files:
-a , --all
target all files.
-t , --media-type = MTYPE
set the media type for all target files to MTYPE.
-h , --mailnews
same as --media-type= ’message/rfc822’
--mhonarc
same as --media-type= ’text/html; x-type=mhonarc’
-F , --target-list = FILE
load FILE which contains a list of target files.
--allow = PATTERN
set PATTERN for file names which should be allowed.
--deny = PATTERN
set PATTERN for file names which should be denied.
--exclude = PATTERN
set PATTERN for pathnames which should be excluded.
-e , --robots
exclude HTML files containing <meta name="ROBOTS" content="NOINDEX">
-M , --meta
handle HTML meta tags for field-specified search.
-r , --replace = CODE
set CODE for replacing URI.
--html-split
split an HTML file with <a name="..."> anchors.
--mtime = NUM
limit by mtime just like find(1)’s -mtime option. e.g., -50 for recent 50 days, +50 for older than 50.
Morphological Analysis:
-b , --use-mecab
use MeCab for analyzing Japanese.
-c , --use-chasen
use ChaSen for analyzing Japanese.
-k , --use-kakasi
use KAKASI for analyzing Japanese.
-m , --use-chasen-noun
use ChaSen for extracting only nouns.
|
-L , --indexing-lang = LANG index with language specific processing. |
Text Operations:
-E , --no-edge-symbol
remove symbols on edge of word.
-G , --no-okurigana
remove Okurigana in word.
-H , --no-hiragana
ignore words consist of Hiragana only.
-K , --no-symbol
remove symbols.
--decode-base64
decode base64 bodies within multipart entities.
Summarization:
-U , --no-encode-uri
do not encode URI.
|
-x , --no-heading-summary do not make summary with HTML’s headings. |
Index Construction:
--update = INDEX
set INDEX for updating.
-z , --check-filesize
detect file size changed.
-Y , --no-delete
do not detect removed documents.
-Z , --no-update
do not detect update and deleted documents.
Miscellaneous:
-s , --checkpoint
turn on the checkpoint mechanism.
-C , --show-config
show the current configuration.
-f , --config = FILE
use FILE as a config file.
-I , --include = FILE
include your customization FILE.
-O , --output-dir = DIR
set DIR to output the index.
-T , --template-dir = DIR
set DIR having NMZ.{head,foot,body}.*.
-q , --quiet
suppress status messages during execution.
-v , --version
show the version of namazu and exit.
-V , --verbose
be verbose.
-d , --debug
be debug mode.
|
--help |
show this help and exit. |
|||
|
--norc |
do not read the personal initialization files. |
|||
|
-- |
Terminate option list. |
REPORTING BUGS
Report bugs to <http://www.namazu.org/trac-namazu/trac.cgi> or <bug-namazu@namazu.org>.
COPYRIGHT
Copyright ©
1997-1999 Satoru Takabayashi All rights reserved.
Copyright © 2000-2009 Namazu Project All rights
reserved.
This is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.