The title says it all, I was given multiples contigs with high E values in order to organize them and I just simply don't know where to start.
The title says it all, I was given multiples contigs with high E values in order to organize them and I just simply don't know where to start.
UPDATE: I just asked a graduate student in the lab.
He told me:
1e-7 < 1e-3. 1e-m = 1 * 10^-m, which mean 1e-3 is 0.001 and 1e-7 is 0.0000001
1e-7 is a respectable e-value, so you're better off picking the one with the least e-value.
Wait, so when you first said high e-values, you were looking at the 1e-X part? If so, the e-values are low, not high.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
What Geek_y said. Also, please define "organize". And high e-values are not worth looking at, low e-values are.
Homework? Do you have a reference genome of closely related species?
Yes a homework. I was just given multiple contigs and I have to search proteins that have those contigs and organize it in order.
You mean proteins that contain your translated contigs? Use BLAST (try and figure out the variant of BLAST and database that fit your case).
Again, "organize in order" is really vague. You're looking to organize in a specific order by the values of a specific attribute, such as match-score, highest to lowest or e-value, lowest to highest. Each attribute has its own quirks (Example: e-values are database-specific).
You'll need to define your problem statement better before trying to solve it.
why don't you just post like 2 contigs
I wonder why OP calls them contigs and not just sequences in the first place. Are these assembled sequences that have also undergone a BLAST search (e-values)? Who worked upstream from OP and why is a beginner being given a task that doesn't have a well defined start and end point? Too many questions, too few answers!