Entering edit mode
7.8 years ago
yuxinghai
▴
10
I'm doing RNAseq analysis, and I did mapping with hasit2. It give me a high total mapping rate(94%), but also high multimapping rate(20%). I use hisat2 mapping another RNAseq data which has been tophat aligned(total mapping 81.66%,unique mapping (78.95%), similarly, I get high total mapping rate(95.29%), but also high multimapping rate(38%). It make me confused. I build index with genome and transcriptome , I'm wondering if this cause this ouput? someone can give me a hint?
Which parameters are you using in the mapping? how long are your reads? does the quality control look fine?
hisat2 -p 8 --dta --no-mixed --no-discordant -x /data3/zhoulab/yuxinghai/anotiation/grch38_tran/genome_tran --un G1/G1.tran_unmap.sam -1 G1/G1_R1.fq -2 G1/G1_R2.fq -S G1/G1.tran_map.sam
read length? does your QC look file?
PE 150, and I did fasqc, the base quality all almost >28
G+C, overrepresented sequences?
through fastqc, I can see two peaks,a little differnent with other RNAseq. but It's just a normal RNAseq. sorry, I don't know how to upload the picture.
Sorry for my late answer. I have been away. I would find suspicious to see two peaks in the G+C distribution. Have you blast the overerepresented sequences?
I am also wondering why so high multiple alignment rate by HISAT2. I am using "NH:i:1" tag to extract uniquely mapped read, but this kind of unique reads seems much more than that number told by HISAT2.