RNA-SeQC v1.1.7 05/14/12
Creating rRNA Interval List based on given GTF annotations
java.lang.ArrayIndexOutOfBoundsException: 1
at org.broadinstitute.cga.rnaseq.RNASeqMetrics$MetricSample.readInSamples(RNASeqMetrics.java:1369)
at org.broadinstitute.cga.rnaseq.RNASeqMetrics.prepareFiles(RNASeqMetrics.java:182)
at org.broadinstitute.cga.rnaseq.RNASeqMetrics.execute(RNASeqMetrics.java:165)
at org.broadinstitute.cga.rnaseq.RNASeqMetrics.main(RNASeqMetrics.java:135)
RNA-SeQC Total Runtime: 0 min
-s <arg>
Sample File: tab-delimited description of samples and their bams. This file header is:
Sample ID Bam File Notes
When running on just one sample, this argument can be a string of the form
"Sample ID|Bam File|Notes", where Bam File is the path to the input file.
It appears that I have very similar/the same problem as the person about. I tried all the answers above and I still can't get to work (my RNA-SeQC run). Would you be able to elaborate a bit more on the string under -s flag?
This is my actual bam file name cfDMG2_ACTGAT_sorted_reordered_removed_dups.bam. And this is now I'm inputting it in RNA-SeQC run
The required transcript_id attribute was not found on line 1
Here is verbose look of what I did
rna-seqc -t HomoSapiensH38.gtf -r HomoSapiensH38.fa -o outDir -s "TestID|cfDMG2_ACTGAT_sorted_reordered_removed_dups.bam|NA"
RNA-SeQC v1.1.8.1 07/11/14
Creating rRNA Interval List based on given GTF annotations
Retriving contig names from reference
contig names in reference: 194
Loading GTF for Read Counting
The required transcript_id attribute was not found on line 1 havana gene 11869 14409 . + . gene_id "ENSG00000223972"; gene_version "5"; gene_name "DDX11L1"; gene_source "havana"; gene_biotype "transcribed_unprocessed_pseudogene";
I got it all working by reverting back to assembly and annotation Ensembl 37. I did source my fasta and gtf files from GENCODE, but I think it didn't matter in the end, as long as I used other assembly, not the latest one. I'm pretty sure RNA-SeQC breaks when the latest assembly - Ensembl 38 used, as it was build when Ensembl 37 was the latest, but the gtf format has changed in Ensembl 38. I raised an issue on github RNA-SeQC error no output , but it doesn't look they check they page. I wanted to write to them directly, but couldn't find the best email to contact.
If you or anyone else knows the best contact email regarding RNA-SeQC please let me know. And also if you or anyone else have any thoughts on RNA-SeQC not working with the latest assembly - Ensembl 38, please comment on github issue page or here.
Also resolved my issue by making sure my Sample File was in the correct format and referred to actually extant .bam files. According to the input spec:
-s <arg>
Sample File: tab-delimited description of samples and their bams.
This file header is:
Sample ID[tab]Bam File[tab]Notes
I have the same issue how did you resolve?
I found it does work if you use the previous release (21) of the annotation file. It is not ideal, but it is still GRCh38
sorry, this is the answer to another issue below