I am trying to use Cuffdiff to analyze differential expression between two experimental groups. I used cufflinks and cuffmerge to produce the gtf assembly. When I run cuffdiff though I keep on getting empty output files. No error message comes up either.
This is the command I am running:
cuffdiff path_to_gtf path_to_data1 path_to_data2
How should I modify this command to get this to run properly?
I am only using one file for each sample group until I get the program to work. How do I incorporate the rest of the files as well?
The gene annotation file used in Cuffmerge has the p_id and tss_id. Do i have to include the gene annotation file in Cufflinks as well (as opposed to only cuffmerge)?
whats your exact command ?
cuffdiff c:/research/Cuffmerge/merged_asm/merged.gtf $T1 $N1
T1=c:/research/alignment_data/TB1_alignment_sorted.sam
N1=c:/research/alignment_data/NT1_alignment_sorted.sam
Where do you specify the output directory? such as this...
I just added cuffdiff_out and it produces empty output files within that folder. It seems like the data files aren't being picked up. How can I fix the command?
Can you clarify please: 1) you are getting output files but they contain no data or...2) You are getting no output files?
Also, can you copy and paste here the resulting terminal output.
Thanks.
I am getting output files that contain no data. The terminal output is one of the following (most often the first):
You are using Cufflinks v2.2.1, which is the most recent release.
OR
You are using Cufflinks v2.2.1, which is the most recent release. Error: cuffdiff requires at least 2 SAM files c:/research/alignment_data/TB1_alignment_sorted.sam: line 1: @HD: command not found c:/research/alignment_data/TB1_alignment_sorted.sam: line 2: @SQ: command not found c:/research/alignment_data/TB1_alignment_sorted.sam: line 3: @SQ: command not found
...
What can be going wrong?
Thank you.
Are you using
transcript.gtf
file andgenome.fasta
file for the same version of the genome? Is GTF file yourmerged.gtf
? Did you use same version of thegenome.fasta
to generatesam
files? Do you see output in cufflinks and cuffmerge?Both the transcript.gtf file and genome.fa file were included in the same reference genome download (mm10) so I am assuming it is the same version of the genome. I have run the script including both the merged.gtf and the reference transcript.gtf, but I also ran it with just the prior. I used the same fasta file in Hisat2 to create the reference index.
I am able to get an output from cufflinks and cuffmerge.
Does it make a difference that I didn't include any of the reference information while making the assemblies in cufflinks and that I only included the reference index (as opposed to including the gtf file as well) in the alignment?