Question

dedup STAR transcriptome file using umi_tools

0

Entering edit mode

5 months ago

ATRX ★ 1.2k

Hi,

I am interested in dedup the transcriptome output file (bulk RNA-seq) from STAR using umi_tools. I am using the following dedup function from umi_tools. Here is the command:

umi_tools dedup --paired
--stdin=B1-Cond1_Aligned.toTranscriptome.out.bam --log=B1-Cond1_dedup.txt --umi-separator=":" --output-stats=B1-Cond1_ > .dedup.bam

However, I am getting the following error:

ValueError: fetch called on bamfile without index

I don't think we can index transcriptome file from STAR. I read the umi_tools document and didn't much options for bulk RNA-seq libraries. Do you know what is the best way to dedup using umi_tools? Thanks!

rna-seq umi transcriptome umi_tools • 1.9k views

ADD COMMENT • link 5 months ago by ATRX ★ 1.2k

1

Entering edit mode

How about deduping based on the genomic bam, then filtering the transcriptome bam based on the reads leftover. Maybe some extra work, but might be faster if you can't figure out how to index/dedupe the transcriptome bam directly.

ADD REPLY • link 5 months ago by rfran010 ★ 1.4k

0

Entering edit mode

Yes, you are right. I plan to do this. Thanks!

ADD REPLY • link 5 months ago by ATRX ★ 1.2k

0

Entering edit mode

Very helpful, thanks a lot!

ADD REPLY • link 5 months ago by ATRX ★ 1.2k

score 6 · Accepted Answer · 2024-11-02

6

Entering edit mode

5 months ago

i.sudbery 21k

I don't know if any reason you shouldn't be able to index the bam produced by STAR, although you will need to sort them first.

ADD COMMENT • link 5 months ago by i.sudbery 21k

0

Entering edit mode

Thanks for the reply. STAR generates two different bam files. One is the genome based and the other one is the transcriptome based. I am able to dedup of genome based bam file but not for transcriptome based.

ADD REPLY • link 5 months ago by ATRX ★ 1.2k

3

Entering edit mode

You should definitely be able to use samtools ti sort and then index the transcriptome bam. We've done this many times.

ADD REPLY • link 5 months ago by i.sudbery 21k

0

Entering edit mode

It worked. thanks a lot!

ADD REPLY • link 5 months ago by ATRX ★ 1.2k

0

Entering edit mode

You can go ahead and accept @Ian answer to provide closure to this thread (green check mark).