Entering edit mode
5.0 years ago
boymin2020
▴
80
Hi,
Recently I have been evaluating gene expression levels using STAR software. Regarding the counts in the ReadsPerGene.out.tab table, I am confused about the equation of N_unmapped (12935190), N_multimapping(6553338), N_noFeature(4320607), N_ambiguous(1013776) and the sum of readspergene (12473764).
The additional file is the Log.final.out as shown below. Could you help to correlate all of the reads information? Better in equations.
Started job on | Dec 03 17:38:54
Started mapping on | Dec 03 19:50:36
Finished on | Dec 03 21:04:20
Mapping speed, Million of reads per hour | 20.07
Number of input reads | 24660208
Average input read length | 200
UNIQUE READS:
Uniquely mapped reads number | 17808147
Uniquely mapped reads % | 72.21%
Average mapped length | 110.21
Number of splices: Total | 4853399
Number of splices: Annotated (sjdb) | 4841181
Number of splices: GT/AG | 4792927
Number of splices: GC/AG | 40247
Number of splices: AT/AC | 7008
Number of splices: Non-canonical | 13217
Mismatch rate per base, % | 10.54%
Deletion rate per base | 0.04%
Deletion average length | 2.35
Insertion rate per base | 0.03%
Insertion average length | 2.28
MULTI-MAPPING READS:
Number of reads mapped to multiple loci | 6553338
% of reads mapped to multiple loci | 26.57%
Number of reads mapped to too many loci | 291711
% of reads mapped to too many loci | 1.18%
UNMAPPED READS:
Number of reads unmapped: too many mismatches | 4348
% of reads unmapped: too many mismatches | 0.02%
Number of reads unmapped: too short | 0
% of reads unmapped: too short | 0.00%
Number of reads unmapped: other | 2664
% of reads unmapped: other | 0.01%
CHIMERIC READS:
Number of chimeric reads | 0
% of chimeric reads | 0.00%