Hello, all
I am still working on the DEG analysis of miRNA.
I have successfully trimmed out the adapter and index sequences and mapped them using bowtie2.
Then, I have used samtools to convert sam file to bam file.. and finally got the .bam files.
Here, I mapped our fastq file to mouse whole genome reference file instead of mirBase hairpin. fa file..
(mapping rate is around 90%) (Note I have 6 samples including 3 controls)
For DEG analysis of miRNA( differentally expressed miRNA), I am trying to use cufflink-cuffdiff pipeline.
By accident, I did not put the -g genes.gtf
option.
cufflinks -p 4 -g genes.gtf -o xxx_clout xxx.bam
when, I put the -g genes.gtf
option (since I want to annotate our result), my genes.fpkm_tracking
file have 25000 genes with annotation.
When, I put the command without -g genes.gtf option, my genes.fpkm.tracking
file has 1829 lines. (annotation.. e.g. CUFF.1 CUFF.2 ..... CUFF.1829)
Could you please someone let me know why these two files are so different even the different is only -g genes.gtf
option?
Thanks
Thank you!
Then, for the miRNA annotation, do I need to use
-g
option?It's unlikely to hurt and might increase the quality of the results a bit, at least if the GTF file is decent.
Thank you Devon,
BTW, I got some weird result. Before I got your answer, I have run two analyses with
-g
option for cufflinks and without-g
option for cufflinks.Based on the description, the results should not have much difference.
BUT, the DEG results were different. Could you please give some comments?
I understand the little difference for fpkm values for each samples however, the status of significant is also different. Please see the result.
It means I will have different DEGs.. How can I handle this?
This is the part of output of
gene_exp.diff
file (from cuffdiff) with-g
option for cufflinks.This is the part of output of
gene_exp.diff
file (from cuffdiff) without-g
option for cufflinks.