Question

Ribosomal Depletion

2.8 years ago

kcarey • 0

Hello!

I was wondering if someone could help me. I am trying to check that my ribosomal depletion step worked, using the STAR aligner. My plan was to align the .fastq file to a Homo sapiens ribosomal reference and then write out a file containing all the unaligned reads (this would be the file I do my analysis on, since it would not contain the rRNA-contaminated reads). I then planned to align that file to my GRCh38 reference genome. However, I keep getting an error and the run does not complete.

Can someone help? I tried reading the manual, and it says I must build an index first. I tried, but I don't have a GTF file for my rRNA-only reference.

CODE:

STAR
--runThreadN 20
--genomeDir ~/directory/to my indexed/ribosomal/fasta/file
--readFilesIn patient.fastq
--outReadsUnmapped Fastx
--genomeFastaFiles Hsapiens rRNA only reference file.fasta 
--sjdbOverhang max(ReadLength)-1

Also, does anyone know how to get STAR to output only the unmapped reads as a FASTQ file?
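For readers following the plan described above, here is a minimal sketch of how it might be split into two separate STAR runs. In STAR, index generation (`--runMode genomeGenerate`, which is where `--genomeFastaFiles` belongs) is a different run from alignment, and a GTF file is optional when building an index. All file and directory names are hypothetical placeholders, the `--genomeSAindexNbases 6` value is an assumption for a very small reference, and the script only prints the commands unless `RUN=1` is set.

```shell
#!/bin/sh
set -eu

# All names below are hypothetical placeholders; adjust to your own files.
RRNA_FASTA="rRNA_only.fasta"    # rRNA-only reference sequences
RRNA_INDEX="rRNA_star_index"    # directory that will hold the STAR index
READS="patient.fastq"           # single-end reads
THREADS=20

# Dry run by default: print each command. Set RUN=1 to actually execute.
run() {
    if [ "${RUN:-0}" = "1" ]; then "$@"; else echo "$*"; fi
}

# Step 1: build the index from the FASTA alone (no GTF supplied).
# --genomeSAindexNbases is lowered because the reference is tiny; 6 is an
# assumed value, see the STAR manual for how to scale it to genome length.
build_rrna_index() {
    run mkdir -p "$RRNA_INDEX"
    run STAR --runMode genomeGenerate \
        --runThreadN "$THREADS" \
        --genomeDir "$RRNA_INDEX" \
        --genomeFastaFiles "$RRNA_FASTA" \
        --genomeSAindexNbases 6
}

# Step 2: align to the rRNA index and keep the reads that did not map.
# With this prefix the unmapped reads land in rRNA_Unmapped.out.mate1.
align_to_rrna() {
    run STAR --runThreadN "$THREADS" \
        --genomeDir "$RRNA_INDEX" \
        --readFilesIn "$READS" \
        --outReadsUnmapped Fastx \
        --outSAMtype BAM Unsorted \
        --outFileNamePrefix rRNA_
}

build_rrna_index
align_to_rrna
```

The `rRNA_Unmapped.out.mate1` file written by the second step contains the reads that did not align to the rRNA reference, and it could then be used as `--readFilesIn` for a further STAR run against the full genome index.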

rRNA STAR sequencing contamination RNA • 1.5k views

updated 2.8 years ago by andres.firrincieli 3.9k • written 2.8 years ago by kcarey • 0

Instead of STAR, use bbduk.sh from BBMap.
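A minimal sketch of what such a bbduk.sh call might look like, assuming single-end reads and an rRNA FASTA file. The file names are hypothetical placeholders and `k=31` is an assumed k-mer length, not a value given in this thread. The script only prints the command unless `RUN=1` is set.

```shell
#!/bin/sh
set -eu

# Hypothetical placeholder names; adjust to your own files.
READS="patient.fastq"
RRNA_FASTA="rRNA_only.fasta"

# Dry run by default: print the command. Set RUN=1 to actually execute.
run() {
    if [ "${RUN:-0}" = "1" ]; then "$@"; else echo "$*"; fi
}

# Reads sharing k-mers with the reference go to outm= (rRNA-like reads);
# all other reads go to out= (the file to carry forward).
# stats= writes a per-reference summary of how many reads matched.
filter_rrna() {
    run bbduk.sh \
        in="$READS" \
        ref="$RRNA_FASTA" \
        out=patient.non_rRNA.fastq \
        outm=patient.rRNA.fastq \
        stats=rRNA_stats.txt \
        k=31
}

filter_rrna
```

The fraction of reads ending up in the `outm=` file gives a quick estimate of how well the depletion worked.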

2.8 years ago by andres.firrincieli 3.9k

Why do you recommend it? Is it easier? From my understanding, STAR is the best aligner.

2.8 years ago by kcarey • 0

It is easier and does the job. But if you are planning to do differential expression, then follow @grant.hovhannisyan's suggestion.

2.8 years ago by andres.firrincieli 3.9k

What downstream analysis are you planning to do after getting the non-rRNA BAM file?

2.8 years ago by grant.hovhannisyan ★ 2.6k

I am planning to generate counts for differential expression and pathway analysis.

2.8 years ago by kcarey • 0

Then I recommend not bothering to remove the rRNA reads during mapping: just map all your data to the entire reference, do the counting (I guess you know that STAR can also do the counting), and then see how many counts you have for rRNA genes. In your DE analysis, you can simply remove those genes before the analysis.
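A minimal sketch of this suggestion, assuming a STAR index for the full genome that was built with a GTF annotation (needed for `--quantMode GeneCounts`) and a GTF that labels rRNA genes with `gene_type "rRNA"` or `gene_biotype "rRNA"` (also matching `Mt_rRNA`). The file names and the two helper functions are hypothetical, and STAR itself is only printed, not run, unless `RUN=1` is set.

```shell
#!/bin/sh
set -eu

# Hypothetical placeholder names; adjust to your own files.
GENOME_INDEX="GRCh38_star_index"   # index built with --sjdbGTFfile
GTF="annotation.gtf"
READS="patient.fastq"

# Dry run by default: print the command. Set RUN=1 to actually execute.
run() {
    if [ "${RUN:-0}" = "1" ]; then "$@"; else echo "$*"; fi
}

# Map everything and let STAR count reads per gene.
# With this prefix the counts land in patient_ReadsPerGene.out.tab.
map_and_count() {
    run STAR --runThreadN 20 \
        --genomeDir "$GENOME_INDEX" \
        --readFilesIn "$READS" \
        --quantMode GeneCounts \
        --outSAMtype BAM SortedByCoordinate \
        --outFileNamePrefix patient_
}

# Print the gene_id of every gene record annotated as rRNA in a GTF ($1).
rrna_gene_ids() {
    awk -F '\t' '$3 == "gene" && $9 ~ /gene_(bio)?type "(Mt_)?rRNA"/ {
        if (match($9, /gene_id "[^"]+"/))
            print substr($9, RSTART + 9, RLENGTH - 10)
    }' "$1"
}

# Drop rows of a counts table ($2) whose first column is listed in $1.
drop_rrna_counts() {
    awk -F '\t' 'FILENAME == ARGV[1] { drop[$1] = 1; next }
                 !($1 in drop)' "$1" "$2"
}

map_and_count

# The filtering only makes sense once STAR has really produced the counts.
if [ "${RUN:-0}" = "1" ]; then
    rrna_gene_ids "$GTF" > rrna_ids.txt
    drop_rrna_counts rrna_ids.txt patient_ReadsPerGene.out.tab \
        > patient_counts_no_rRNA.tab
fi
```

Summing the counts of the dropped rows before discarding them shows how much of the library was rRNA, which is the depletion check the original question was after.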

2.8 years ago by grant.hovhannisyan ★ 2.6k