Man page - cleanasn(1)
Packages contains this manual
- asn2ff(1)
- asn2gb(1)
- asntool(1)
- vecscreen(1)
- sortbyquote(1)
- trna2tbl(1)
- asn2xml(1)
- getpub(1)
- gil2bin(1)
- gene2xml(1)
- asn2all(1)
- checksub(1)
- subfuse(1)
- getmesh(1)
- taxblast(1)
- asn2asn(1)
- spidey(1)
- tbl2asn(1)
- asnval(1)
- asnmacro(1)
- idfetch(1)
- asndhuff(1)
- insdseqget(1)
- nps2gps(1)
- indexpub(1)
- findspl(1)
- asn2fsa(1)
- gbseqget(1)
- errhdr(1)
- asn2idx(1)
- trna2sap(1)
- cleanasn(1)
- asndisc(1)
- makeset(1)
- fa2htgs(1)
- debruijn(1)
apt-get install ncbi-tools-bin
Manual
CLEANASN
NAMESYNOPSIS
DESCRIPTION
OPTIONS
AUTHOR
SEE ALSO
NAME
cleanasn - clean up irregularities in NCBI ASN.1 objects
SYNOPSIS
cleanasn [ - ] [ -A filename ] [ -B str ] [ -C str ] [ -D str ] [ -F str ] [ -K str ] [ -L filename ] [ -M filename ] [ -N str ] [ -O str ] [ -P str ] [ -Q str ] [ -R ] [ -S str ] [ -T ] [ -U str ] [ -V str ] [ -X str ] [ -Z str ] [ -a str ] [ -b ] [ -c ] [ -d str ] [ -f str ] [ -i filename ] [ -j filename ] [ -k filename ] [ -m str ] [ -n path ] [ -o filename ] [ -p path ] [ -q path ] [ -r path ] [ -v path ] [ -x ext ]
DESCRIPTION
cleanasn is a utility program to clean up irregularities in NCBI ASN.1 objects.
OPTIONS
A summary of options is included below.
|
- |
Print usage message |
-A filename
Accession list file
|
-B str |
Branch, per the flags in str: |
c
|
Has coding regions |
||||
|
d |
No coding regions |
|||
|
p |
Passes validation |
|||
|
q |
Validator errors or rejects |
|||
|
r |
Only pop/phy/mut/eco/WGS sets |
|||
|
s |
Exclude pop/phy/mut/eco/WGS sets |
|||
|
t |
Only nuc-prot sets |
|||
|
u |
Exclude nuc-prot sets |
|||
|
v |
Only segmented sequences |
|||
|
w |
Exclude segmented sequences |
|||
|
x |
Only segmented proteins |
|||
|
y |
Exclude segmented proteins |
|||
|
-C str |
Sequence operations, per the flags in str:
|
c |
Compress |
|||
|
d |
Decompress |
|||
|
l |
Recalculated segmented sequence length |
|||
|
v |
Virtual gaps inside segmented sequence |
|||
|
s |
Convert segmented set to delta sequence |
|||
|
t |
Non-NucProt segmented set to delta sequence |
|||
|
u |
Improved non-NucProt segmented set to delta sequence |
|||
|
g |
Raw to delta by assembly gap |
|||
|
m |
Merge assembly gap features |
|||
|
-D str |
Clean up descriptors, per the flags in str:
|
t |
Remove Title |
|||
|
c |
Remove Comment |
|||
|
n |
Remove Nuc-Prot Set title |
|||
|
e |
Remove Pop/Phy/Mut/Eco Set title |
|||
|
m |
Remove mRNA title |
|||
|
p |
Remove Protein title |
|||
|
a |
Title to name |
|||
|
b |
AutoDef title or name |
|||
|
x |
Prefix title with organism name |
|||
|
-F str |
Clean up features, per the flags in str:
|
u |
Remove User-objects |
|||
|
d |
Remove db_xrefs |
|||
|
e |
Remove /evidence and /inference |
|||
|
g |
Fuse multi-interval genes |
|||
|
i |
Fuse adjacent-interval imported features |
|||
|
r |
Remove redundant gene xrefs |
|||
|
f |
Fuse duplicate features |
|||
|
s |
Package features on referenced Bioseq |
|||
|
k |
Package coding-region or parts features |
|||
|
z |
Delete or update EC numbers |
|||
|
b |
Set Best coding-region reading frame |
|||
|
x |
Retranslate coding regions |
|||
|
a |
Adjust for missing stop codon |
|||
|
-K str |
Perform a general cleanup, per the flags in str:
|
b |
BasicSeqEntryCleanup |
|||
|
p |
C++ BasicCleanup (via an external utility) |
|||
|
v |
AdvancedSeqEntryCleanup |
|||
|
s |
SeriousSeqEntryCleanup |
|||
|
x |
ExtendedSeqEntryCleanup |
|||
|
g |
GpipeSeqEntryCleanup |
|||
|
n |
Normalize descriptor order |
|||
|
u |
Remove NcbiCleanup User Objects |
|||
|
c |
Synchronize genetic Codes |
|||
|
f |
CDS partial from translation |
|||
|
e |
Impose CDS partials |
|||
|
d |
Resynchronize CDS partials |
|||
|
m |
Resynchronize mRNA partials |
|||
|
t |
Resynchronize Peptide partials |
|||
|
a |
Adjust consensus splice |
|||
|
i |
Promote to "worst" Seq-ID |
|||
|
r |
Reassign local IDs |
|||
|
l |
Remove locus |
-L filename
Log file
-M filename
Macro file
|
-N str |
Clean up links, per the flags in str: |
o
|
Link CDS mRNA by Overlap |
||||
|
p |
Link CDS mRNA by Product |
|||
|
l |
Link CDS mRNA by Label and Location |
|||
|
r |
Reassign feature IDs |
|||
|
m |
Merge colliding feature IDs |
|||
|
f |
Fix missing reciprocal feature IDs |
|||
|
c |
Clear feature IDs |
|||
|
-O str |
Missing prot-ref name
|
-P str |
Publication options: |
a
|
Remove All publications |
||||
|
s |
Remove Serial number |
|||
|
f |
Remove Figure, numbering, and name |
|||
|
r |
Remove Remark |
|||
|
u |
Update PMID-only publication |
|||
|
j |
Lookup ISO Journal title abbreviation |
|||
|
m |
Merge identical publication features |
|||
|
# |
Replace unpublished with PMID |
|||
|
-Q str |
Report:
|
c |
Record count |
|||
|
r |
ASN.1 BSEC report |
|||
|
s |
ASN.1 SSEC report |
|||
|
n |
NORM vs. SSEC report |
|||
|
e |
PopPhyMutEco AutoDef report |
|||
|
o |
Overlap report |
|||
|
l |
Latitude-longitude country diff |
|||
|
d |
Log SSEC differences |
|||
|
g |
GenBank SSEC diff |
|||
|
f |
asn2gb/asn2flat diff |
|||
|
h |
Seg-to-delta GenBank diff |
|||
|
v |
Validator SSEC diff |
|||
|
m |
Modernize Gene/RNA/PCR |
|||
|
u |
Unpublished Pub lookup |
|||
|
p |
Published Pub lookup |
|||
|
j |
Unindexed Journal report |
|||
|
t |
tRNA anticodon report |
|||
|
w |
Component offset report |
|||
|
x |
Custom scan |
|||
|
-R |
Remote fetching from ID (NCBI sequence databases)
|
-S str |
Selective difference filter (capital letters skip) |
s
|
SSEC |
||||
|
b |
BSEC |
|||
|
A |
Author |
|||
|
p |
Publication |
|||
|
l |
Location |
|||
|
r |
RNA |
|||
|
q |
Qualifier sort order |
|||
|
g |
Genbank block |
|||
|
k |
Package CdRegion or parts features |
|||
|
m |
Move publication |
|||
|
o |
Leave duplicate Bioseq publication |
|||
|
d |
Automatic definition line |
|||
|
e |
Pop/Phy/Mut/Eco Set definition line |
|||
|
-T |
Taxonomy Lookup
|
-U str |
Modernize, per the flags in str: |
g
|
Genes |
||||
|
r |
RNA |
|||
|
p |
PCR Primers |
|||
|
-V str |
Remove features by validator severity:
|
r |
Reject |
|||
|
e |
Error |
|||
|
w |
Warning |
|||
|
i |
Info |
|||
|
-X str |
Miscellaneous options, per str:
|
d |
Automatic definition line |
|||
|
s |
Automatic definition line with Source qualifiers |
|||
|
e |
Pop/Phy/Mut/Eco Set definition line |
|||
|
n |
Instantiate NC title |
|||
|
m |
Instantiate NM titles |
|||
|
x |
Special XM titles |
|||
|
p |
Instantiate Protein titles |
|||
|
g |
GPipe instantiate titles |
|||
|
c |
Create mRNAs for coding sequences |
|||
|
f |
Fix reciprocal protein_id/transcript_id |
|||
|
v |
Revert preRNA or ncRNA transcript_id |
|||
|
t |
Parse anticodon from Sequence |
|||
|
b |
Batch cleanup of multireader output |
|||
|
z |
Wrap SegSet with NucProt set |
|||
|
w |
GFF/WGS genome cleanup |
|||
|
-Z str |
Remove indicated User-object
|
-a str |
ASN.1 type |
a
|
Any (default) |
||||
|
e |
Seq-entry |
|||
|
b |
Bioseq |
|||
|
s |
Bioseq-set |
|||
|
m |
Seq-submit |
|||
|
t |
Batch Bioseq-set |
|||
|
u |
Batch Seq-submit |
|||
|
-b |
Input ASN.1 is Binary
|
-c |
Input ASN.1 is Compressed |
|||
|
-d str |
Source database |
a
|
Any (default) |
||||
|
g |
GenBank |
|||
|
e |
EMBL |
|||
|
d |
DDBJ |
|||
|
b |
EMBL or DDBJ |
|||
|
i |
INSD |
|||
|
r |
RefSeq |
|||
|
n |
NCBI |
|||
|
x |
Exclude EMBL/DDBJ |
|||
|
y |
Exclude gbcon, gbest, gbgss, gbhtg, gbpat, gbsts |
|||
|
-f str |
Substring filter
-i filename
Single input file (defaults to stdin)
-j filename
First filename
-k filename
Last filename
|
-m str |
Flatfile mode: |
r
|
Release |
||||
|
e |
Entrez |
|||
|
s |
Sequin |
|||
|
d |
Dump |
-n path
asn2flat executable (default is /netopt/ncbi_tools/bin/asn2flat )
-o filename
Single output file (defaults to stdout)
-p path
Process all matching files in path
-q path
ffdiff executable (default is /netopt/genbank/subtool/bin/ffdiff )
-r path
Path for results
-v path
asnval executable (default is /netopt/ncbi_tools/bin/asnval )
|
-x ext |
File selection suffix for use with -p (defaults to .ent ) |
AUTHOR
The National Center for Biotechnology Information.
SEE ALSO
asndisc (1), asnval (1), sequin (1).