Entering edit mode
24 months ago
PK
▴
130
Hi,
I have WT and KO and I did the differential exon usage (Dexseq). I saw a pattern which means in most the gene the first exon seems to be affected with Padj value. I want to show that in a plot but I don't know how to do that. Assume that I have 300 gene that has the evidence for differential exon usage. In such scenario most of the genes first exon seems to be affected (or first few exons).
I have the final matrix of DEXseq and I added certain information. This is just a example I have ~ 300 rows
GENE_ID. TOTAL-EXON AFFECTED_EXON. Log2foldchange
ENSG00000142145 61 E008 2.19717286971528
I have like this. Also I have extra information as well.
Are you looking for suggestions how such a dataset can be visually represented or do you already have a particular plot in mind, but don't know how to create it? In the latter case, it would be helpful, if you could include a scrabbled version of the plot.
In the former case, it is helpful to decide first what you want to show. What is your message? If your message is for example "The first exon is affected in most genes", that would e.g. suggest an aggregate plot. If you would rather like to show details for particular genes and subtle differences or an effect depending on the distance from the TSS, a per gene representation with accurate distances may be needed. You can e.g. use colour, position or size to encode a fold change.
To how many genes do your 300 rows correspond? What about genes with multiple transcripts?
yeah. I have in my mind density plot but the each gene has different length right so i don't know how to do that. I'm looking at the gene level not at the transcript level. Here the exons are separated if they overlap at transcript level and you give the fraction number for the each exon based on the gene (E001,E002....).
Something like this picture. using the log2foldchage in the y-axis and x gene red and green for up and down regulated exons. It is not first exon, first few exons.
Maybe some inspiration: