Man page - alignment-thin(1)
Packages contains this manual
- alignment-consensus(1)
- cut-range(1)
- alignment-thin(1)
- alignment-max(1)
- bali-phy-pkg(1)
- bp-analyze(1)
- alignment-draw(1)
- summarize-ancestors(1)
- extract-ancestors(1)
- stats-select(1)
- mctree-mean-lengths(1)
- trees-consensus(1)
- alignment-find(1)
- bali-phy(1)
- statreport(1)
- alignment-indices(1)
- tree-mean-lengths(1)
- pickout(1)
- draw-tree(1)
- alignment-gild(1)
- alignment-distances(1)
- alignment-info(1)
- tree-tool(1)
- alignments-diff(1)
- trees-bootstrap(1)
- mcon-tool(1)
- alignment-cat(1)
- alignment-chop-internal(1)
- alignment-translate(1)
- trees-distances(1)
- bali-subsample(1)
apt-get install bali-phy
Manual
alignment-thin
NAMESYNOPSIS
DESCRIPTION
GENERAL OPTIONS:
SEQUENCE FILTERING OPTIONS:
COLUMN FILTERING OPTIONS:
OUTPUT OPTIONS:
EXAMPLES:
REPORTING BUGS:
AUTHORS
NAME
alignment-thin - Remove sequences or columns from an alignment.
SYNOPSIS
alignment-thin alignment-file [OPTIONS]
DESCRIPTION
Remove sequences or columns from an alignment.
GENERAL OPTIONS:
-h , âhelp
Print usage information.
-V , âverbose
Output more log messages on stderr.
SEQUENCE FILTERING OPTIONS:
-p arg , âprotect arg
Sequences that cannot be removed (comma-separated).
-k arg , âkeep arg
Remove sequences not in comma-separated list arg .
-r arg , âremove arg
Remove sequences in comma-separated list arg .
-l arg , âlonger-than arg
Remove sequences not longer than arg .
-s arg , âshorter-than arg
Remove sequences not shorter than arg .
-c arg , âcutoff arg
Remove similar sequences with #mismatches < cutoff.
-d arg , âdown-to arg
Remove similar sequences down to arg sequences.
âremove-gappy arg
Remove arg outlier sequences â defined as sequences that are missing too many conserved sites.
âconserved arg (=0.75)
Fraction of sequences that must contain a letter for it to be considered conserved.
COLUMN FILTERING OPTIONS:
-K arg , âkeep-columns arg
Keep columns from this sequence
-m arg , âmin-letters arg
Remove columns with fewer than arg letters.
-u arg , âremove-unique arg
Remove insertions in a single sequence if longer than arg letters
-e , âerase-empty-columns
Remove columns with no characters (all gaps).
OUTPUT OPTIONS:
-S , âsort
Sort partially ordered columns to group similar gaps.
-L , âshow-lengths
Just print out sequence lengths.
-N , âshow-names
Just print out sequence lengths.
-F arg , âfind-dups arg
For each sequence, find the closest other sequence.
EXAMPLES:
Remove columns without a minimum number of letters:
% alignment-thin --min-letters=5 file.fasta > file-thinned.fasta
Remove sequences by name:
% alignment-thin --remove=seq1,seq2 file.fasta > file2.fasta
% alignment-thin --keep=seq1,seq2 file.fasta > file2.fasta
Remove short sequences:
% alignment-thin --longer-than=250 file.fasta > file-long.fasta
Remove similar sequences with <= 5 differences from the closest other sequence:
% alignment-thin --cutoff=5 file.fasta > more-than-5-differences.fasta
Remove similar sequences until we have the right number of sequences:
% alignment-thin --down-to=30 file.fasta > file-30taxa.fasta
Remove dissimilar sequences that are missing conserved columns:
% alignment-thin --remove-gappy=10 file.fasta > file2.fasta
Protect some sequences from being removed:
% alignment-thin --down-to=30 file.fasta --protect=seq1,seq2 > file2.fasta
% alignment-thin --down-to=30 file.fasta --protect=@filename > file2.fasta
REPORTING BUGS:
BAli-Phy online help: http://www.bali-phy.org/docs.php .
Please send bug reports to bali-phy-users@googlegroups.com .
AUTHORS
Benjamin Redelings.