U.S. flag

An official website of the United States government

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Translated BLAST: blastx
BLASTX search protein databases using a translated nucleotide query. more...
Enter Query Sequence
Enter query sequence(s) in the text area. It automatically determines the format of the input. To allow this feature, certain conventions are required with regard to the input of identifiers. more...

Enter coordinates for a subrange of the query sequence. The BLAST search will apply only to the residues in the range. Sequence coordinates are from 1 to the sequence length.The range includes the residue at the To coordinate. more...

Help

Use the browse button to upload a file from your local disk. The file may contain a single sequence or a list of sequences. The data may be either a list of database accession numbers, NCBI gi numbers, or sequences in FASTA format.


Enter a descriptive title for your BLAST search Help

This title appears on all BLAST results and saved searches.

Help

Enter one or more queries in the top text box and one or more subject sequences in the lower text box. Then use the BLAST button at the bottom of the page to align your sequences.
To get the CDS annotation in the output, use only the NCBI accession or gi number for either the query or subject. Reformat the results and check 'CDS feature' to display that annotation.

Enter Subject Sequence
Subject sequence(s) to be used for a BLAST search should be pasted in the text area. It automatically determines the format or the input. To allow this feature there are certain conventions required with regard to the input of identifiers. more...

Enter coordinates for a subrange of the subject sequence. The BLAST search will apply only to the residues in the range. Sequence coordinates are from 1 to the sequence length.The range includes the residue at the To coordinate. more...

 
Help

Use the browse button to upload a file from your local disk. The file may contain a single sequence or a list of sequences. The data may be either a list of database accession numbers, NCBI gi numbers, or sequences in FASTA format.

Choose Search Set
New
Try experimental clustered nr database
For more info see What is clustered nr?

Clustered nr is the standard NCBI nr database clustered with each sequence within 90% identity and 90% length to other members of the cluster. Your BLAST search runs against a single representative sequence for each cluster.

The representative is used as a title for the cluster and can be used to fetch all the other members.

Clustered nr is smaller and more compact for searching. Results for a clustered nr search have more taxonomic depth than standard nr results.

Clustered nr uses the MMseqs2 software https://github.com/soedinglab/MMseqs2

1. Steinegger, M., Söding, J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat Biotechnol 35, 1026–1028 (2017) https://doi.org/10.1038/nbt.3988 PMID: 29035372

Help
Select a Standard Database to compare to an Experimental Database.
Standard
Database sequences non-default value Help

Select the sequence database to run searches against. No BLAST database contains all the sequences at NCBI. BLAST databases are organized by informational content (nr, RefSeq, etc.) or by sequencing technique (WGS, EST, etc.). more...

Enter organism common name, binomial, or tax id. Only 20 top taxa will be shown. Help

Start typing in the text box, then select your taxid. Use the "plus" button to add another organism or group, and the "exclude" checkbox to narrow the subset. The search will be restricted to the sequences in the database that correspond to your subset.

Create custom database
Enter an Entrez query to limit search Help

You can use Entrez query syntax to search a subset of the selected BLAST database. This can be helpful to limit searches to molecule types, sequence lengths or to exclude organisms. more...

Experimental
Help

Select the sequence database to run searches against. No BLAST database contains all the sequences at NCBI. BLAST databases are organized by informational content (nr, RefSeq, etc.) or by sequencing technique (WGS, EST, etc.). more...

Enter organism common name, binomial, or tax id. Only 20 top taxa will be shown. Help

Start typing in the text box, then select your taxid. Use the "plus" button to add another organism or group, and the "exclude" checkbox to narrow the subset. The search will be restricted to the sequences in the database that correspond to your subset.

Program Selection
Choose a BLAST algorithm Help
  • Megablast is intended for comparing a query to closely related sequences and works best if the target percent identity is 95% or more but is very fast.
  • Discontiguous megablast uses an initial seed that ignores some bases (allowing mismatches) and is intended for cross-species comparisons.
  • BlastN is slow, but allows a word-size down to seven bases.
more...

Help

Enter a PHI pattern to start the search. PHI-BLAST may perform better than simple pattern searching because it filters out false positives (pattern matches that are probably random and not indicative of homology).

Choose a BLAST algorithm Help
  • QuickBLASTP is an accelerated version of BLASTP that is very fast and works best if the target percent identity is 50% or more.
  • BlastP simply compares a protein query to a protein database.
  • PSI-BLAST allows the user to build a PSSM (position-specific scoring matrix) using the results of the first BlastP run.
  • PHI-BLAST performs the search but limits alignments to those that match a pattern in the query.
  • DELTA-BLAST constructs a PSSM using the results of a Conserved Domain Database search and searches a sequence database.
Search using Blastx (search protein databases using a translated nucleotide query)
Note: Parameter values that differ from the default are highlighted in yellow and marked with sign