1) I performed a blastn for a list of mouse genes against the gene database and obtained an output consisting of several thousand hit, I needed only the best hit , which was first on the tabular output
so I tried restricting the result using max_target_seq 1
and max_hsps 1
I got a single hit result. but when I compared it to the unrestricted files I find that the results or the hit shown have different table values such as align.lemgth and %identity although the transcript itself is the same
Can anyone suggest a reason for this and any solutions
2) I used the original to get the first hits for the above
So I have a list of transcript that I obtained from performing a blastn execution. and I have to pull out the corresponding sequences from a source file of 3utr/5utr. but there are a few hundred transcripts and performing it manually is time consuming and error prone
Can anyone suggest a way or command line to perform it using terminal in linux
In summary a method to selectively pull out certain sequence from the gen database file using the ensembl transcript as identifying aspect