The script should take 2 input vcf files(normal & tumor) and a output file.If a specific variant is found at a position in the tumor sample vcf file but there is no corresponding variant in the normal sample vcf the results will be written to the output file. Please help or guide for a clue how to work on this.
Hi. I suggest you to involve a bioinformatician in your project and then ask biostar for tips, not for the whole work...
Just a note that what you are describing is not likely to be a very effective somatic variant caller. There are MANY tools for somatic variant calling and none of them use the approach you are suggesting, for pretty good reason.
It may not be very effective way but still it is used as I am asked to do in my class.
There are often many approaches to answer a question. "As long as we don't ever evaluate the answers critically, all methods are equivalent as long as they generate results." Many of us experience this sentiment--resist it!
I think it is also problematic that he wants us to do his homework.
Agreed. It is generally poor practice to just post your homework on a forum. But, at least the poster has admitted it and made some attempt to solve. Although, as it is, the question is more about very beginner Perl scripting and not really a bioinformatic question. I wonder if you wouldn't have more luck reading some forums and tutorials specifically on Perl. Once you have figured out how to make a perl script, understand conditional statements, hash data structures, etc come back for help with the components of this problem that are really bioinformatics (e.g., understanding VCF format).
Hello guys thanks for your comments, finally I got my perl script working, and it is working fine. Now I am able to compare 2 files and get somatic variants. I came to this forum thinking people will help me, unfortunately I got few tips only and much of discouragement. I want to thank members who commented sarcastically too, as your negative comments helped me to work more hard and get desired output.
I think the correct guidance for getting a clue how to work is to google for "vcf diff", or "vcf variant" or similar, or beginning by studying the available documentation of the vcf format and the available tools, e.g. vcf-tools. Even thought about reading wikipedia articles? In conclusion, the right way to go is to improve ones general web/tech literacy first (learn to search), to make optimal use of the available resources.
I agree with Michael. Must of the things you need, are usually available online. If the work you are doing is part of a HW, you will need to spend a bit more time to understand things.