Entering edit mode
4.1 years ago
newbie
▴
130
I'm trying to use bedtools intersect on gencode annotation
gtf and another gtf which is output from stringtie merge
option.
bedtools intersect -v -b gencode.v27.annotation.gtf -a stringtiemerge.gtf > final.gtf
This is how gencode gtf looks:
And this is how the stringtie merged gtf looks:
But when I used bedtools intersect, I see there is warning. I can understand that it is because of the contigs
which are not found in gencode gtf. So, the below warning, should I consider that or no need to care?
bedtools intersect -v -b gencode.v27.annotation.gtf -a stringtiemerge.gtf > final.gtf
***** WARNING: File stringtiemerge.gtf has inconsistent naming convention for record:
GL000008.2 StringTie transcript 1577 8168 . + . transcript_id "MSTRG.4.1"; gene_id "MSTRG.4"; xloc "XLOC_000001"; class_code "u"; tss_id "TSS1";
***** WARNING: File stringtiemerge.gtf has inconsistent naming convention for record:
GL000008.2 StringTie transcript 1577 8168 . + . transcript_id "MSTRG.4.1"; gene_id "MSTRG.4"; xloc "XLOC_000001"; class_code "u"; tss_id "TSS1";
perhaps because the order of the attributes is unexpected? in the file you have
gene_id
followed bytranscript_id
, the warning shows it flipped. It shouldn't matter but it might.(BTW including the a few lines from the data instead of images would make more sense)