I have some promoter motifs which range from 4 to 10 bp. I want to perform blastn search with these short motifs using 2000bp gene upstream sequences as query. the problem is that I get zero hits in local blast. I've tried using blastn-short, decreasing the word size to 4 but still i am not getting any hits. How do i proceed with this? Thank You
How about using e.g. Bowtie2 instead of blast?
this is really nice idea, i'll download bowtie2 and try this
in fact for very short sequences botwtie (v1) is the best option rather than bowtie2 because it allows mismatches :).
I agree. Bowtie1 is a lot more sensitive than bowtie2 for such as short query sequences.
Notice that bowtie1 will give you matched sequences that are not fully identical to your subject. You will need to play with grep looking for the NM tag to control the number of truly identical annealed bases (e.g, a NM:i:0 means 100% identity, whereas NM:i:2 means that two bases are different. The concept of matching sequences and identity of these matched sequences are two different concepts in this mapping environment
I think as suggested by Jean-Karim Heriche, building HMM profiles will be a better idea than using bowtie.
Thanks for the information Buffo