Limiting search by taxonomy
To limit a BLAST search by taxonomy, one needs to provide the NCBI taxonomy ID(s) (taxid) for given taxonomic group(s). A taxid is simply a number that specifies a node in the taxonomic tree.
Taxids can be provided to ElasticBLAST using BLAST command-line options (see BLAST options):
-taxids with a comma-separated list of taxids, or
-taxidlist with a path to a file that contains a list of taxids, one per line. This file must be present in the local file system where ElasticBLAST is run. Cloud storage locations, like s3:// or gs://, will not work.
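The file format expected by -taxidlist (one taxid per line, stored locally) can be sketched with a small Python helper. This is only an illustration: the function names write_taxid_list and blast_options, the file name taxids.txt, and the validation rules are assumptions made for this example and are not part of ElasticBLAST or BLAST+.

```python
from pathlib import Path

# Cloud storage prefixes that the documentation says will not work for -taxidlist.
REMOTE_PREFIXES = ("s3://", "gs://")


def write_taxid_list(taxids, path):
    """Write taxids to a local file, one per line (hypothetical helper).

    `path` is expected to be a plain string so that remote prefixes can be detected.
    Returns the path that was written.
    """
    path_str = str(path)
    if path_str.startswith(REMOTE_PREFIXES):
        raise ValueError("taxid list file must be on the local file system")
    cleaned = []
    for taxid in taxids:
        value = int(taxid)  # raises ValueError for non-numeric input
        if value <= 0:
            raise ValueError(f"taxid must be a positive integer: {taxid!r}")
        cleaned.append(str(value))
    Path(path_str).write_text("\n".join(cleaned) + "\n")
    return path_str


def blast_options(taxid_file, base="-outfmt 7"):
    """Build a value for the `options` setting that uses -taxidlist (hypothetical helper)."""
    return f"{base} -taxidlist {taxid_file}"


if __name__ == "__main__":
    taxid_file = write_taxid_list([1866885, 1804623], "taxids.txt")
    print(blast_options(taxid_file))
```

The string built by blast_options could then be used as the value of the options setting in the [blast] section of the configuration file, in place of a -taxids list.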
ElasticBLAST accepts taxids of any rank.
You can read more about limiting search by taxonomy in the documentation for the command-line BLAST+ tools.
Example:
Below is an example ElasticBLAST configuration file that limits search results by taxonomy.
[cloud-provider]
aws-region = us-east-1

[cluster]
num-nodes = 1

[blast]
program = blastn
db = pdbnt
queries = s3://elasticblast-test/queries/RFQT01.1.fsa_nt.gz
results = ${YOUR_RESULTS_BUCKET}
options = -outfmt 7 -taxids 1866885,1804623