When I was generating Ggnome Indexes file in STAR I used the parameter --sjdbOverhang 100 since my reads are 100 bp length. However, now I have 50 base reads and I wondered if I can use the same index genome for it or not?
Thanks a lot!
When I was generating Ggnome Indexes file in STAR I used the parameter --sjdbOverhang 100 since my reads are 100 bp length. However, now I have 50 base reads and I wondered if I can use the same index genome for it or not?
Thanks a lot!
As far as I know, it could be beneficial to adjust the parameter you stated accordingly.
On the other hand, I also read, that it doesn't completly disrupts your data.
Edit: let me cite the STAR manual
In case of reads of varying length, the ideal value is max(ReadLength)-1 . In most cases, a generic value of 100 will work as well as the ideal value.
So I would recalcualted the STAR index.
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
The quote from the manual contradicts that recalculating the STAR index is likely to be very useful.
It also states, that the ideal step, however, would be to recalculate. So, since it doesn't take that long, it could be slightly beneficial. But you are right, it seems to make no huge difference.