Entering edit mode
8.9 years ago
dina.hesham139
▴
170
Hey,
Is it normal that I have a drop of ~10% mapping using hg38 compared to hg19?
I mean I mapped the same set of samples with the same tool and under the same condition. First, alignment against hg19 gives an average of 80s % while alignment against hg38 dropped to an average of 70s %.
Why would that tend to happen?
Exact command was? This could explain it if you only look at unambiguously mapping reads.
I used STAR, the only difference is that while building the index for hg38 I included the annotation gtf file in to the command. I didn't do that with hg19. the alignment command was the same for both!
Would that have an effect?
I also used both genome.fa and annotation file from ensembl in case of hg38, while from UCSC in case of hg19.
I don't know STAR. What was the exact alignment command you used? How does it report unambiguously mapping reads?
the command I used was:
This is the summary for a samlpe mapped against hg38:
This is the summary for the same sample mapped against hg19:
And what does the manual of STAR say about mapping of unambiguous reads? What does the manual of STAR say about the use of a GTF file in reference to mapping? You have read the manual, right?
Nothing about mapping of unambiguous reads!!
The use of a GTF file in reference to mapping is Highly recommended!!
It also says something about use of GTF file affecting alignments. Also, unambiguous reads are discussed in the manual (e.g. under multimappers). Not my job to read the manual. If you go through it and compare your reference genomes, unmapped reads, where they map in the other reference, etc. I'm sure you'll figure out what's happening. Good luck!
STAR paper in the Current Protocols in Bioinformatics says "The gene annotations allow STAR to identify and correctly map spliced alignments across known splice junctions. While it is possible to run the mapping jobs without annotations, it is not recommended. When gene annotations are not available, use the 2-pass mapping "
You could map against hg19 without the annotations and see if the percentage drops accordingly but that would be an academic exercise.