Dear all,
I was wondering if there is any tool to calculate PSI from the BAM files. Basically what I want is-
I have gtf file with all the exons and I want to calculate for each exon (lets take exon filled with grey for example) how many reads mapped to junction of that exon (a and b in figure) and how many reads skip that exon (c in figure). So if we have, let's say, 10k exons in the gtf, the output should contain 10k rows with a,b and c as columns.
I used MATS and bam2ssj but did not get what I was expecting. I have gtf file with approx. 300k exons but MATS gives the output only for 3k exons. bam2ssj is calculating the values for introns rather than exons.
Any suggestions?
Thanks in advance.
have a look at MISO: http://genes.mit.edu/burgelab/miso/docs/
Thanks for the reply. I read about MISO but it says
Two general kinds of analyses are possible:
I think it include the reads mapped on exons but I am interested only in junction reads.
in the documentation there is also a paragraph about psi estimation.
the definition given in their paper is: 'percentage spliced in' (PSI or Ψ)11 denotes the fraction of mRNAs that represent the inclusion isoform. Reads aligning to the alternative exon or to its junctions with adjacent constitutive exons provide support for the inclusion isoform, whereas reads aligning to the junction between the adjacent constitutive exons support the exclusion isoform; the relative read density of these two sets forms the standard estimate of Ψ.
the paper is here
Hi Vikas,
we have just written a small protocol to do this:
http://onlinelibrary.wiley.com/doi/10.1002/0471142905.hg1116s87/abstract