Hi, When I tried to match the insertion/deletion (INDEL) events between 1) RNA-seq reads aligned by STAR against human reference genome hg38. 2) RNA-seq reads aligned by minimap against transcript sequences (gencode v36), I noticed that it's hard to match those indel events by genomic position. After inspected the aligned reads, I noticed that STAR and minimap2 introduced the gap for INDEL at different positions:
Alignment: (by minimap2)
Refer : GCTACATGGC -TTTTTTGACGTATGACGAAGTGAAAGC
Query: GCTACATGGCTTTTTTTGACGTATGACGAAGTGAAAGC
Alignment: (by STAR2)
Refer : GCTACATGGCTTTTTT- GACGTATGACGAAGTGAAAGC
Query: GCTACATGGCTTTTTTTGACGTATGACGAAGTGAAAGC
The results affect the downstream analysis, for example, after called the variants follower by annotation, it would report different positions of the INDEL:
Are there any way to solve this issue? Thanks!
These programs are meant for different types of data/applications. By
STAR2
I assume you meanSTAR v.2.x
?Thanks! I am using STAR 2.7.11a.