I used rem-calculate-expression as the following:
rsem-calculate-expression -p 8 --paired-end PE1.fastq PE2.fastq hg19_ref my_test_sample
But it only generated a collective sum for each of the chrs like the following:
gene_id transcript_id(s) length effective_length expected_count TPM FPKM
chr1 chr1 249250621.00 249250465.90 1323595.26 3.82 0.12
chr10 chr10 135534747.00 135534591.90 869923.40 4.62 0.14
chr11 chr11 135006516.00 135006360.90 1454485.33 7.76 0.24
chr12 chr12 133851895.00 133851739.90 728502.18 3.92 0.12
While what I want is the expression level for each gene (or each isoform).
Should I use another rsem program or different parameters for rem-calculate-expression?
Thank you!
-- m.x.
rsem-prepare-reference --bowtie reference_hg19.fa hg19_ref
Thanks!
When you ran
rsem-prepare-reference
, you did not set the--gtf
option:If you don't specify the GTF, RSEM does not know what your genes are. The hg19 FASTA file is just the sequence of the entire human genome.
Thanks a lot. I have tried your suggestion and waiting for the expression results -- it looks like it's now taking much longer to process. Is this expected?
Thanks for the --gtf suggestion. It now worked.