Entering edit mode
9.9 years ago
biolab
★
1.4k
HI everyone
I am using RSeQC to determine strandness. When running the command infer_experiment.py -r TAIR10.bed -i WT_accepted_hits.bam.sort.bam
, I faced the following failure. Could anyone help? THANKS a lot.
Failure message:
This is PairEnd Data
Fraction of reads failed to determine: 1.0000
Fraction of reads explained by "1++,1--,2+-,2-+": 0.0000
Fraction of reads explained by "1+-,1-+,2++,2--": 0.0000
head of TAIR10.bed
chr01 0 30427671 Chr1 . . TAIR10 chromosome .ID=Chr1;Name=Chr1
chr01 3630 3759 AT1G01010.1-Protein . + TAIR10 five_prime_UTR . Parent=AT1G01010.1
chr01 3630 3913 AT1G01010.1-Protein . + TAIR10 exon .Parent=AT1G01010.1
head of bam file
FC42T26AAXX:7:104:1142:897#0 73 chr01 3636 50 75M * 00 ATTAGATATACCAAACCAGAGAAAACAAATACATAATCGGAGAAATACAGATTACAGAGAGCGAGAGAGATCGAC abbbabbbbb`^abaaaababaaaaabababaabbaaabaaaaaaaaaaaaaaaaa`aaaaaaaaaaaaaa^a__ AS:i:0 XN:i:0 XM:i:0 XO:i:0 XG:i:0 NM:i:0 MD:Z:75 YT:Z:UU NH:i:1
Hi Devon Thanks for your reply. I have changed the bed file (all "Chr"s are converted to "chr0"s, please see below). Unfortunately, the problem still exists.
The alignment you posted is of a singleton (i.e., one read from a pair aligned alone). Are they all like that? If so, then that's your problem. BTW, a simple say to check this is to just run
samtools view foo.bam | cut -f 2 | sort | uniq -c
.Hi Devon, thanks a lot! However, I still have the problem.
samtools view bam-file
, thensort
anduniq
produce following output. I think I probably need to check other sequencing datasets.You must have used
cut -f 3
instead ofcut -f 2
. We need to know the count of alignments for each flag value.The FLAGs are as follows. Anything strange? THANKS.
So most of the alignments are informative. I haven't a clue then why the script is having problems. Just look at a few genes in IGV and you should be able to discern the strandedness.
Hi Devon, Thanks a lot anyway! I will manually check some genes after separating XS strands.