Hello,
I am trying to quantify transposable elements, but TEToolkit exits with an error, as seen below. Perhaps it does not like the transposable element GTF file that I created manually for the yeast? I know the syntax of this GTF file is correct since I have been able to build successful analogous GTF files for both the rabbit and mouse papilloma viruses which resulted in successful tophat and cufflink runs. I'm wondering about the contents of this GTF file however, that is, how should the transposon be specified? For example, in NCBI there is one full transposon available for the yeast, but the others are listed only as pieces flanking a gene. I was wondering if anyone has been able to run TEToolkit successfully who could shed some light on this question? Thanks a lot ! Anna
-bash-4.1$ ./TEtranscripts --format BAM --mode multi -t ../../../../HuiLing_4567_030716/HLC1.trim.bam -c ../../../../HuiLing_4567_030716/HLC2.trim.bam --project TE_2v1 --GTF ../../../../ref/sacCerR64.gtf --TE ../../../../ref/sacCerR64_virusesalltransposons_only.gtf
INFO @ Mon, 16 May 2016 15:01:03:
# ARGUMENTS LIST:
# name = TE_2v1
# treatment files = ['../../../../HuiLing_4567_030716/HLC1.trim.bam']
# control files = ['../../../../HuiLing_4567_030716/HLC2.trim.bam']
# GTF file = ../../../../ref/sacCerR64.gtf
# TE file = ../../../../ref/sacCerR64_virusesalltransposons_only.gtf
# multi-mapper mode = multi
# stranded = yes
# normalization = DESeq_default (rpm: Reads Per Million mapped; quant: Quantile normalization)
# FDR cutoff = 5.00e-02
# fold-change cutoff = 1.00
# read count cutoff = 1
# number of iteration = 10
# Alignments grouped by read ID = True
INFO @ Mon, 16 May 2016 15:01:03: Processing GTF files ...
INFO @ Mon, 16 May 2016 15:01:03: Building gene index .......
INFO @ Mon, 16 May 2016 15:01:04: Done building gene index ......
INFO @ Mon, 16 May 2016 15:01:04:
Building TE index .......
Error in building gene/TE index
The github issue tracker would be a more appropriate place to ask for help.