Man page - gt-extractseq(1)
Packages contas this manual
- gt-scriptfilter(1)
- gt-mmapandread(1)
- gt-gff3(1)
- gt-ltrdigest(1)
- gt-encseq(1)
- gt-extractfeat(1)
- gt-seed_extend(1)
- gt-dupfeat(1)
- gt-csa(1)
- gt-clean(1)
- gt-packedindex(1)
- gt-genomediff(1)
- gt-sketch_page(1)
- gt-compreads-refcompress(1)
- gt-splicesiteinfo(1)
- gt-extractseq(1)
- gt-compreads(1)
- gt-encseq-decode(1)
- gt-cds(1)
- gt-encseq-bitextract(1)
- gt-readjoiner(1)
- gt-tallymer-occratio(1)
- gt-tallymer(1)
- gt-sequniq(1)
- gt-id_to_md5(1)
- gt-readjoiner-prefilter(1)
- gt-seqtranslate(1)
- gt-tallymer-search(1)
- gt-condenseq(1)
- gt-seqmutate(1)
- gt-seqorder(1)
- gt-seq(1)
- gt-inlineseq_add(1)
- gt-repfind(1)
- gt-seqfilter(1)
- gt-sketch(1)
- gt-hop(1)
- gt-seqids(1)
- gt-fastq_sample(1)
- gt-compreads-refdecompress(1)
- gt-readjoiner-assembly(1)
- gt-readjoiner-overlap(1)
- gt-splitfasta(1)
- gt-seqtransform(1)
- gt-tallymer-mkindex(1)
- gt-wtree(1)
- gt-ltrharvest(1)
- gt-chseqids(1)
- gt-compreads-decompress(1)
- gt-orffinder(1)
- gt-encseq-sample(1)
- gt-encseq-md5(1)
- gt-merge(1)
- gt-gff3validator(1)
- gt-matchtool(1)
- gt-congruence(1)
- gt-tagerator(1)
- gt-gff3_to_gtf(1)
- gt-featureindex(1)
- gt-md5_to_id(1)
- gt-mkfeatureindex(1)
- gt-tirvish(1)
- gt-snpper(1)
- gt-prebwt(1)
- gt-stat(1)
- gt-speck(1)
- gt-convertseq(1)
- gt-compreads-compress(1)
- gt-interfeat(1)
- gt-chain2dim(1)
- gt-encseq-bench(1)
- gt-shulengthdist(1)
- gt-encseq-encode(1)
- gt-select(1)
- gt-uniq(1)
- gt-shredder(1)
- gt-fingerprint(1)
- gt-matstat(1)
- gt-encseq-info(1)
- gt-congruence-spacedseed(1)
- gt-encseq2spm(1)
- gt-simreads(1)
- gt(1)
- gt-dot(1)
- gt-ltrclustering(1)
- gt-seqstat(1)
- gt-mergefeat(1)
- gt-bed_to_gff3(1)
- gt-uniquesub(1)
- gt-gtf_to_gff3(1)
- gt-eval(1)
- gt-encseq-check(1)
- gt-loccheck(1)
- gt-inlineseq_split(1)
apt-get install genometools
Manual
| GT-EXTRACTSEQ(1) | GenomeTools Manual | GT-EXTRACTSEQ(1) |
NAME
gt-extractseq - Extract sequences from given sequence file(s) or fastaindex.
SYNOPSIS
gt extractseq [option ...] [sequence_file(s)] | fastaindex
DESCRIPTION
-frompos [value]
-topos [value]
-match [string]
-keys [filename]
-width [value]
-o [filename]
-gzip [yes|no]
-bzip2 [yes|no]
-force [yes|no]
-help
-version
The option -keys allows one to extract substrings or sequences from the given sequence file or from a fasta index. The substrings to be extracted are specified in a key file given as argument to this option. The key file must contain lines of the form
k
or
k i j
where k is a string (the key) and the optional i and j are positive integers such that i⇐j. k is the key and the optional numbers i and j specify the first position of the substring and the last position of the substring to be extracted. The positions are counted from 1. If k is identical to the string between the first first and second occurrence of the symbol | in a fasta header, then the fasta header and the corresponding sequence is output. For example in the fasta header
>tr|A0AQI4|A0AQI4_9ARCH Putative ammonia monooxygenase (Fragment)
the fasta key is A0AQI4. If i and j are both specified, then the corresponding substring is shown in fasta format. In the latter case the header of the fasta formatted sequence in the output begins with
>k i j
followed by the original original fasta header.
If the sequence input are fasta files, then the following holds:
If the sequence input comes from a fasta index (see below), the following holds:
If the end of the argument list only contains one filename, say fastaindex, then it is checked if there is a file fastaindex.kys. This makes up part of the fasta index, which is constructed by calling the suffixerator tool as follows:
gt suffixerator -protein -ssp -tis -des -sds -kys -indexname fastaindex \
-db inputfile1 [inputfile2 ..]
This reads the protein sequence files given to the option -db and creates several files:
For the suffixerator command to work, the keys of the form |key| in the fasta header must satisfy the following constraints:
REPORTING BUGS
Report bugs to https://github.com/genometools/genometools/issues.
| 04/27/2024 | GenomeTools 1.6.5 |