Multimapping Reads to be counted or not
0
0
Entering edit mode
15 days ago

Should i include Multimapping reads in my RNA-seq data analysis? My successfully assigned alignments is coming as low as 45.1% . I have used this command featureCounts -p -T 8 -a /data/sata_data/home/rajdeep/GRCm39/gencode.vM36.chr_patch_hapl_scaff.annotation.gtf -s 2 -O --countReadPairs -o counts_CC2.txt sorted_39aligned_CC1.bam

Is there any way i can improve my reads bt chnging any parameter in featureCounts?

|| Process BAM file sorted_CC2.bam...                                        
||    Strand specific : reversely stranded                                   
||    Paired-end reads are included.                                         
||    Total alignments : 76891146                                            
||    Successfully assigned alignments : 34688363 (45.1%)       

Assigned    34688363
Unassigned_Unmapped 1274794
Unassigned_Read_Type    0
Unassigned_Singleton    0
Unassigned_MappingQuality   0
Unassigned_Chimera  0
Unassigned_FragmentLength   0
Unassigned_Duplicate    0
Unassigned_MultiMapping 12674576
Unassigned_Secondary    0
Unassigned_NonSplit 0
Unassigned_NoFeatures   28253413
Unassigned_Overlapping_Length   0
Unassigned_Ambiguity    0
lowcounts featureCounts RNASeq Multimapping • 466 views
ADD COMMENT
0
Entering edit mode

You don't gain anything by counting them since it is not reliable to where they map. Still, you have 35M successful assignments, that's good in most situations. See how downstream analysis goes first. Also, any reason you use the GTF with the haplotypes, that is unusual.

ADD REPLY
0
Entering edit mode

What gtf file i should use then, i dont know much of this.I found the gtf file in gencode .Kindly reply.

ADD REPLY
0
Entering edit mode

How about the one that says "This is the main annotation file for most users" which is the Basic Annotation at https://www.gencodegenes.org/human/

ADD REPLY
0
Entering edit mode

Looks like this is mouse data but your point stands. Use: https://www.gencodegenes.org/mouse/

ADD REPLY
0
Entering edit mode
featureCounts -p -T 8 --countReadPairs -a /data/sata_data/home/rajdeep/gencode.vM36.basic.annotation.gtf.gz -s 2  -o counts_CC2.txt sorted_CC2.bam

This is the result when i used the basic gtf file that you suggested. Still the assigned reads are quite low, even lower when i used the previous gtf file. Which one i should use then? |

|| Load annotation file gencode.vM36.basic.annotation.gtf.gz ...              
||    Features : 673003                                                       
||    Meta-features : 78239                                                   
||    Chromosomes/contigs : 22                                                
||                                                                            
|| Process BAM file sorted_CC2.bam...                                         
||    Strand specific : reversely stranded                                    
||    Paired-end reads are included.                                          
||    Total alignments : 76891146                                             
||    Successfully assigned alignments : 32901649 (42.8%)  
Assigned        32901649
Unassigned_Unmapped     1274794
Unassigned_Read_Type    0
Unassigned_Singleton    0
Unassigned_MappingQuality       0
Unassigned_Chimera      0
Unassigned_FragmentLength       0
Unassigned_Duplicate    0
Unassigned_MultiMapping 12674576
Unassigned_Secondary    0
Unassigned_NonSplit     0
Unassigned_NoFeatures   29595347
Unassigned_Overlapping_Length   0
Unassigned_Ambiguity    444780
ADD REPLY

Login before adding your answer.

Traffic: 1757 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6