Entering edit mode
7.4 years ago
bioinfo_ga
▴
70
I have done reference alignment using tophat pipeline on 150*2 reads. For expression calculation i am using Cuffdiff , however i am getting 0 expression (as given below) for control as well as treated samples.
test_id : TCONS_00000001
gene_id : XLOC_000001
gene -
locus 1:5256-11783
sample_1:1A
sample_2:1B
status :NOTEST
value_1:0
value_2 :0
log2(fold_change): 0
test_stat: 0
p_value: 1
q_value: 1
significant:no
it is really hard to read this, could you edit this dataframe?
Is this the case for all genes?
yes this is in case of all the genes
I would recommend you post the command-lines used to help troubleshoot your issue.
Hisat was used for alignment, and further sorted sam was used for cufflinks
Cufflinks:
Cuffdiff:
1) What about cufflinks? Did you also have zero expression in cufflinks output file?
2) Why you used Reference.gff for cufflinks and Cuffmerge/merged.gtf for cuffdiff?
3) I suspect that you mapped on a reference and run cuffdiff on a different gff: can you send us the first five lines of the sam file you used?
1) Cufflinks is giving me non zero expression values. 2) I have checked with Reference.gff as well as converted it to gtf format, both gave the same result. 3) Reference and gff are same. Sam file first five lines are as follows:
Cool... moving forward! I can see that in the .sam file, you are aligning (in the first lines) against sequences whose name is "10". Do you have any entry named "10" in the Cuffmerge/merged.gtf file? If not, this is the problem. What you could do is to extract all the content of the third column of the bam file (or also read the bam header, which could actually be faster) to see what are the names of the sequences you are aligning against, and see if exactly the same names are present in the Cuffmerge/merged.gtf file. BTW: why did you use the -G in cufflinks (i.e. completely rely on the existing annotation) and then perform cuffdiff on a merged gtf?