Hi,
I am running cuffmerge to merge a few gtfs that I have created using cufflinks. This is the command I use:
cuffmerge -o merged -s genome.fa list.txt
list.txt
contains the list of the gtf files that I want to merge with the relative path.
Here is the stdout (no error)
[Fri Oct 3 16:15:22 2014] Beginning transcriptome assembly merge
-------------------------------------------
[Fri Oct 3 16:15:22 2014] Preparing output location merged/
Warning: no reference GTF provided!
[Fri Oct 3 16:15:22 2014] Converting GTF files to SAM
[16:15:22] Loading reference annotation.
....
[16:15:24] Loading reference annotation.
[Fri Oct 3 16:15:24 2014] Assembling transcripts
But there are only these 2 directories in merged:
logs tmp
There is no merged.gtf
Has anybody seen and solved this error? There is nothing alarming in run.log file either. This is how log file ends:
cufflinks -o merged/ -F 0.05 -q --overhang-tolerance 200 --library-type=transfrags -A 0.0 --min-frags-per-transfrag 0 --no-5-extend -p 1 merged/tmp/mergeSam_fileoCJV2Y
Any help will be appreciated.
Thanks
Hi, please see my follow-up question below.
I am facing the same issue. Please help
Even if it appears to be the same issue it always helps to post the command line you are using and the exact error you are encountering. There is always something slightly different in each of these cases.
I am having the same problem- Cuffmerge isn't producing the merged gtf. I don't know what is going wrong. I am pretty sure I inputted everything correctly. What did you do to fix this?
Are you getting an error?
No, I am not getting an error. It runs for hours and then there is no gtf file- just the run log and empty temp file.
I have another question as well. When do I need to include the reference annotation and the reference sequence fasta? There is that option in Cufflinks too and I do not know when I should input that information.
Thanks.
See this for the answer to your reference question. Do you have a lot of samples? Are the GTF files large?
Thank you so much.
I have ten files in total but I have been running two just to test it out. One is about 15,000 KB and the other is 6,000 KB.
I am running this command:
file 1 path
file 2 path
I was reading something and it said there had to be one file per line but I don't exactly know how to format it that way. I tried it a bunch of times, just written a little differently, and it didn't work at all.
How could I make an assemblies.txt file in an editor that I can add to my script? Maybe it will work that way. But based on how things have been going, I am not too sure.
What could be going wrong?
Thanks again for the help.
You make the assemblies.txt file in a text editor (use any you like, just save the file in text format).
So
assemblies.txt
file will have these contents.Command would be
Thank you. I just ran it and it worked!. The gtf file is in my folder.
What do these outputs mean though? There is quite a long list of these:
SAM error on line 104: found spliced alignment without XS attribute
Warning: couldn't find fasta record for 'chr1_GL456221_random'!
I just have a few more questions:
What is the difference between including the reference annotation and sequence in both cufflinks and cuffmerge as opposed to just including that information in cuffmerge? I am looking to find novel genes in my study, so will preexisting reference information be a hindrance?
How does an annotation file made by cufflinks differ from one included in a reference genome download.
Also, I plan to input this merged gtf file into Cuffdiff. Do I make one large file that includes the two different experimental groups I am comparing?
I am having trouble running Cuffdiff as well. I have been trying for a long time now. When I enter the command, this comes up:
You are using Cufflinks v2.2.1, which is the most recent release.
And then it produces a bunch of empty output files.
I don't know if I am formatting the command correctly or what. I have two groups of files (each group has 5 files in it). How do I format that command? I tried another text file but it said it doesn't recognize that file type.
Thank you for all of the help. I am very new at all of this.