how to use DEXSeq with raw exon count matrix downloaded from TCGA
0
0
Entering edit mode
7.3 years ago

Hi, I want to use DEXSeq to identify differential exon usage between ~800 cancer vs 90 normal samples from TCGA. I have a matrix of raw exon counts downloaded from TCGA in the format below:

Exon    TCGA-66-2742-01A        TCGA-L4-A4E5-01A        TCGA-86-8671-01A        TCGA-77-8008-01A    
chr1:11874-12227:+      2       7       15      1  
chr1:12595-12721:+      1       1       3       0      
chr1:12613-12721:+      1       1       3       0

The problem is that DEXSeq requires to use dexseq_count.py to perform exon counts for each sample using bam/sam as input, and then combine these files into an object "countFiles", which will then be used in the DEXSeq function "DEXSeqDataSetFromHTSeq". Since I only have a matrix of raw exon counts and don't have individual BAM/SAM, I wonder how I can still use DEXSeq. Any suggestions are appreciated! Thank you!

RNA-Seq exon R dexseq • 3.3k views
ADD COMMENT
0
Entering edit mode

Why don't you use dexseq_count.py then ?

ADD REPLY
0
Entering edit mode

thank you for the comments. I cannot use dexseq_count.py because I don't have the BAM files for the ~890 samples. I only have a matrix of raw exon counts downloaded from TCGA.

ADD REPLY
0
Entering edit mode

To identify differential exon usage, you need to know which is the parent gene name for different exons. If you have that information, you can convert your matrix to the DEXSeq count format.

ADD REPLY
0
Entering edit mode

Did you ever figure out how to do it? I'm trying to do the same thing

ADD REPLY

Login before adding your answer.

Traffic: 1645 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6