Starting from a nucleotide sequence, I'm trying to find all genes in the human genome that contain this sequence. However, whenever I run BLAST, I never get any hits. I tried increasing the Expect value because I saw documentation saying that's a good parameter to change if you're BLASTing a short sequence. If anybody can help me with this, I'd be most appreciative. Below is a screenshot of what I'm entering.
7 bp is too short for a search using BLAST. You are better off getting the sequences of the genes you are interested in and then using a pattern search program (e.g. fuzznuc from EMBOSS) or just grep as a quick way.
Thank you for the reply, genomax2. What I'm trying to do is see which genes in the entire human genome contain this sequence, so I don't have any particular genes of interest. Do you have any recommendations for how I may go about doing this?
You can get the RefSeq mRNA sequences from UCSC, or FASTA sequences from GENCODE, and then run fuzznuc on them.
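If you would rather script the search than install EMBOSS, here is a minimal Python sketch of the same idea: read a transcript FASTA file and report every record that contains the short sequence exactly. It is not fuzznuc and does not handle mismatches or the reverse strand. The function names and the motif GATTACA are placeholders I made up for illustration, since the actual 7 bp sequence is not given in this thread.

```python
import re
from typing import Dict, Iterable, Iterator, List, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts: emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def find_motif(seq: str, motif: str) -> List[int]:
    """Return 1-based start positions of all exact matches, overlaps included."""
    # The lookahead lets the regex engine report overlapping matches.
    pattern = re.compile("(?=" + re.escape(motif.upper()) + ")")
    return [m.start() + 1 for m in pattern.finditer(seq.upper())]


def search_fasta(lines: Iterable[str], motif: str) -> Dict[str, List[int]]:
    """Map each FASTA header that contains the motif to its match positions."""
    hits = {}
    for header, seq in read_fasta(lines):
        positions = find_motif(seq, motif)
        if positions:
            hits[header] = positions
    return hits
```

To use it on a downloaded file, open the file and pass the handle in, e.g. `search_fasta(open("transcripts.fa"), "GATTACA")`, where `transcripts.fa` is a hypothetical file name. Expect a lot of hits: a specific 7-mer turns up by chance about once every 4^7 = 16,384 bases, so many transcripts will contain it.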