The five traditional BLAST programs are: BLASTN, BLASTP, BLASTX, TBLASTN, and TBLASTX. BLASTN compares nucleotide sequences to one another (hence the N). All other programs compare protein sequences (see Table 5-1).
Program |
Database |
Query |
Typical uses |
---|---|---|---|
BLASTN |
Nucleotide |
Nucleotide |
Mapping oligonucleotides, cDNAs, and PCR products to a genome; screening repetitive elements; cross-species sequence exploration; annotating genomic DNA; clustering sequencing reads; vector clipping |
BLASTP |
Protein |
Protein |
Identifying common regions between proteins; collecting related proteins for phylogenetic analyses |
BLASTX |
Protein |
Nucleotide translated into protein |
Finding protein-coding genes in genomic DNA; determining if a cDNA corresponds to a known protein |
TBLASTN |
Nucleotide translated into protein |
Protein |
Identifying transcripts, potentially from multiple organisms, similar to a given protein; mapping a protein to genomic DNA |
TBLASTX |
Nucleotide translated into protein |
Nucleotide translated into protein |
Cross-species gene prediction at the genome or transcript level; searching for genes missed by traditional methods or not yet in protein databases |
You'll also find several BLAST derivatives and BLAST wrappers (scripts that run BLAST in a specialized way) with names such as PSI-BLAST, PHI-BLAST, MegaBLAST, BLASTZ, XBLAST, MPBLAST, HT-BLAST, and GENE-BLAST. This book?and especially this chapter?deals primarily with the five traditional programs. If you are familiar with the details of these algorithms, you will have a solid foundation for understanding the variants.