Question

Should I generate a new Index genome in STAR for 50 base reads instead of 100 base which I had before?

0

Entering edit mode

5.8 years ago

Rimma ▴ 30

When I was generating Ggnome Indexes file in STAR I used the parameter --sjdbOverhang 100 since my reads are 100 bp length. However, now I have 50 base reads and I wondered if I can use the same index genome for it or not?

Thanks a lot!

rna-seq alignment • 2.7k views

ADD COMMENT • link updated 5.8 years ago by caggtaagtat ★ 1.9k • written 5.8 years ago by Rimma ▴ 30

score 0 · Answer 1 · 2019-07-18

0

Entering edit mode

5.8 years ago

caggtaagtat ★ 1.9k

As far as I know, it could be beneficial to adjust the parameter you stated accordingly.

On the other hand, I also read, that it doesn't completly disrupts your data.

Edit: let me cite the STAR manual

In case of reads of varying length, the ideal value is max(ReadLength)-1 . In most cases, a generic value of 100 will work as well as the ideal value.

So I would recalcualted the STAR index.

ADD COMMENT • link 5.8 years ago by caggtaagtat ★ 1.9k

1

Entering edit mode

The quote from the manual contradicts that recalculating the STAR index is likely to be very useful.

ADD REPLY • link 5.8 years ago by Devon Ryan 105k

0

Entering edit mode

It also states, that the ideal step, however, would be to recalculate. So, since it doesn't take that long, it could be slightly beneficial. But you are right, it seems to make no huge difference.

ADD REPLY • link 5.8 years ago by caggtaagtat ★ 1.9k