Man page - lastal(1)

Packages contains this manual

Manual

LASTAL

NAME
SYNOPSIS
DESCRIPTION
Cosmetic options:
E-value options (default settings):
Score options (default settings):
Initial-match options (default settings):
Miscellaneous options (default settings):
Split options:

NAME

lastal - genome-scale comparison of biological sequences

SYNOPSIS

lastal-plain [ options ] lastdb-name fasta-sequence-file(s)

DESCRIPTION

Find and align similar sequences.

Cosmetic options:

-h , --help

show all options and their default settings, and exit

-V , --version

show version information, and exit

-v

be verbose: write messages about what lastal is doing

-2

paired query sequences

-f

output format: TAB, MAF, BlastTab, BlastTab+ (default: MAF)

E-value options (default settings):

-D

query letters per random alignment (default: 1e+06)

-H

expected total number of random alignments for all the sequences

-E

max EG2: expected number of random alignments per square giga

Score options (default settings):

-r

match score (2 if -M , else 1)

-q

mismatch cost (3 if -M , else 1)

-p

match/mismatch score matrix (DNA-DNA: HUMSUM, protein-protein: BLOSUM62, DNA-protein: BLOSUM80)

-X

N/X is ambiguous in: 0=neither sequence, 1=reference, 2=query, 3=both (0)

-a

gap existence cost (24 if HUMSUM, else 11 if protein reference, else 7)

-b

gap extension cost ( 1 if HUMSUM, else 2 if protein reference, else 1)

-A

insertion existence cost (a)

-B

insertion extension cost (b)

-c

unaligned residue pair cost (off)

-F

frameshift cost(s) (off)

-x

maximum score drop for preliminary gapped alignments (z)

-y

maximum score drop for gapless alignments (min[t*10, x])

-z

maximum score drop for final gapped alignments (e-1)

-d

minimum score for gapless alignments (min[e, 2500/n query letters per hit])

-e

minimum score for gapped alignments

Initial-match options (default settings):

-m

maximum initial matches per query position (10)

-l

minimum length for initial matches (1)

-L

maximum length for initial matches (infinity)

-k

use initial matches starting at every k-th position in each query (1)

-W

use "minimum" positions in sliding windows of W consecutive positions

Miscellaneous options (default settings):

-P

number of parallel threads (default: 1)

-K

omit alignments whose query range lies in >= K others with > score (off)

-C

omit gapless alignments in >= C others with > score-per-length (off)

-s

strand: 0=reverse, 1=forward, 2=both (2 if DNA and not lastdb -S2 , else 1)

--reverse

reverse the query sequences

-S

use score matrix: 0=as-is, 1=on query forward strands (0)

-i

query batch size (64M if multi-volume, else off)

-M

find minimum-difference alignments (faster but cruder)

-T

type of alignment: 0=local, 1=overlap (default: 0)

-n

maximum gapless alignments per query position (infinity if m=0, else m)

-N

stop after the first N alignments per query strand

-R

lowercase & simple-sequence options (the same as was used by lastdb)

-U

maximum tandem repeat unit length (100 if --codon else same as lastdb)

-u

mask lowercase during extensions: 0=never, 1=gapless, 2=gapless+postmask, 3=always (2 if lastdb -c and Q!=pssm, else 0)

-w

suppress repeats inside exact matches, offset by <= this distance (1000)

-G

genetic code (default: 1)

-t

’temperature’ for calculating probabilities (1/lambda)

-g

’gamma’ parameter for gamma-centroid and LAMA (default: 1)

-j

output type: 0=match counts, 1=gapless, 2=redundant gapped, 3=gapped,

4=column ambiguity estimates, 5=gamma-centroid, 6=LAMA, 7=expected counts (default: 3)

-J

score type: 0=ordinary, 1=full (1 for new-style frameshifts, else 0)

-Q

input format: fastx, keep, sanger, solexa, illumina, prb, pssm

(default: fasta)

Split options:

--split

do split alignment

--splice

do spliced alignment

--split-f = FMT

output format: MAF, MAF+

--split-d = D

RNA direction: 0=reverse, 1=forward, 2=mixed (default: 1)

--split-c = PROB

cis-splice probability per base (default: 0.004)

--split-t = PROB

trans-splice probability per base (default: 1e-05)

--split-M = MEAN

mean of ln[intron length] (default: 7.0)

--split-S = SDEV

standard deviation of ln[intron length] (default: 1.7)

--split-m = PROB

maximum mismap probability (default: 1.0)

--split-s = INT

minimum alignment score (default: e OR e+t*ln[100])

--split-n

write original, not split, alignments

--split-b = B

maximum memory (default: 8T for split, 8G for spliced)