Hi!
I am an informatics student from Germany attending a practical course on bioinformatics. I am new to the field of bioinformatics but I am willing to learn a lot. My first task is the following: I have some protein sequences from one organism provided. These proteins build a class of similar proteins. Additionally I have one database for this organism and a second database for another organism.
The goal is to find homologies to the protein sequences in the second database. So what I have to do is to blast the protein sequences against the second database and then "reblast" the results against the the first database to verify the hits. To automatize this task I write a program in python, which locally blasts given sequences against databases.
My questions are not about the programming but the way to find homologies: is this the normal way to do this? What exactly am I verifying with a reblast? And how do I know which blast results are significant? What output is interesting for a bioinformatic? The course focuses on programming, which is no problem for me, but I'd like to understand what I am doing and why.
Thanks in advance!
For a bioinformatics "newbie", you asked this question very well.