Hello everyone, this is my problem:
I am very new in this analysis. I have done bowtie2 for my genome sequence data and it was successfully run and got the output file in SAM format, and i converted it into BAM and VCF format, I have ~5600 scaffolds in reference genome file. If i want to know that which scaffold sequence consist maximum no. of SNPs and also want to know that how can i map these SNPs with the gene sequences.
Simply i want to know the gene with highest number of high quality SNPs in reference genome.
Please help me out.
Thanks in Advance !!
Thank you for your reply,but can you please help me that how can i get exon.bed file. I am totally confused . :(
The dexseq_prepare_annotation.py takes a GTF file and gives coordinates of exons. Be careful that you dont have duplicates in your bed file.