Question

How to tell is sequencing data is unstranded, firststrand, or secondstrand

0

Entering edit mode

6.8 years ago

williamsbrian5064 ▴ 530

Hi

I have some RNA-Seq data that I received from a colleague. I am trying to figure out if the data is unstranded, firststrand, or secondstrand. Is there an easy way to tell? I am running the data through salmon and this is what the lib_format_counts.json file is giving me. I think it is secondstranded?

read_files  "Sample1"
expected_format "SR"
compatible_fragment_ratio   0.9945605200682789
num_compatible_fragments    18794420
num_assigned_fragments  18897211
num_consistent_mappings 56858584
num_inconsistent_mappings   328289
MSF 0
OSF 0
ISF 0
MSR 0
OSR 0
ISR 0
SF  328289
SR  56858584
MU  0
OU  0
IU  0
U   0

Any ideas?

RNA-Seq Assembly R sequence • 3.8k views

ADD COMMENT • link updated 6.8 years ago by swbarnes2 14k • written 6.8 years ago by williamsbrian5064 ▴ 530

2

Entering edit mode

Maybe ask the colleague?:)

ADD REPLY • link 6.8 years ago by grant.hovhannisyan ★ 2.6k

2

Entering edit mode

What fun is that!

ADD REPLY • link 6.8 years ago by Devon Ryan 104k

1

Entering edit mode

Sometimes colleagues forget what they did or remember incorrectly. The data, on the other hand, doesn't lie.

ADD REPLY • link 6.8 years ago by igor 13k

0

Entering edit mode

haha I would be they haven't been the easiest to get in contact with. You know how it goes.

ADD REPLY • link 6.8 years ago by williamsbrian5064 ▴ 530

1

Entering edit mode

There are some excellent answers posted already, but just in case you want to learn more about strandness, check this previous post: Read pair orientation : Illumina TruSeq Stranded mRNA library

ADD REPLY • link 6.8 years ago by igor 13k

0

Entering edit mode

Great link! Very helpful info! I appreciate it!

ADD REPLY • link 6.8 years ago by williamsbrian5064 ▴ 530

score 1 · Answer 1 · 2018-02-28

1

Entering edit mode

6.8 years ago

Devon Ryan 104k

Yes, this appears to be the standard dUTP/secondstrand/ISR library. As an aside, this is the most likely library prep type these days.

ADD COMMENT • link 6.8 years ago by Devon Ryan 104k

0

Entering edit mode

Awesome! So the sequencing data is single end. Would this then be considered SR instead ISR?

ADD REPLY • link 6.8 years ago by williamsbrian5064 ▴ 530

1

Entering edit mode

Yes, SR rather than ISR.

ADD REPLY • link 6.8 years ago by Devon Ryan 104k

0

Entering edit mode

Thanks you so much!

ADD REPLY • link 6.8 years ago by williamsbrian5064 ▴ 530

score 1 · Answer 2 · 2018-02-28

If you have no clue, take a look at RSeQC ( infer_experiment.py )

This program is used to “guess” how RNA-seq sequencing were configured, particulary how reads were stranded for strand-specific RNA-seq data, through comparing the “strandness of reads” with the “standness of transcripts”.

score 0 · Answer 3 · 2018-02-28

0

Entering edit mode

6.8 years ago

swbarnes2 14k

You can always eyeball it in IGV. It should be pretty easy to spotcheck half a dozen genes and see which way the reads go.

ADD COMMENT • link 6.8 years ago by swbarnes2 14k