Hi,
please tell me according to the following shared red circle in picture, can we say this as a insert size of our RNA-seq data.[1]: https://ibb.co/jf4keF
Hi,
please tell me according to the following shared red circle in picture, can we say this as a insert size of our RNA-seq data.[1]: https://ibb.co/jf4keF
The easiest way to find it programmatically is to convert the SRA to fastq on the fly (if you are unable to get it from their documents) and proceed with alignment to produce the bam file. After that just use bamtools with the below command and it will give the average and the median insert size. Good luck!
bamtools stats -i foo.bam -insert
changes in argument: -in
here is an example output:
$ bamtools stats -in aaaaaa.bam -insert
**********************************************
Stats for BAM file(s):
**********************************************
Total reads: 1367054
Mapped reads: 1367054 (100%)
Forward strand: 683527 (50%)
Reverse strand: 683527 (50%)
Failed QC: 0 (0%)
Duplicates: 0 (0%)
Paired-end reads: 1367054 (100%)
'Proper-pairs': 1367054 (100%)
Both pairs mapped: 1367054 (100%)
Read 1: 683527
Read 2: 683527
Singletons: 0 (0%)
Average insert size (absolute value): 104.995
Median insert size (absolute value): 80
$ bamtools --version
bamtools 2.2.2
Part of BamTools API and toolkit
Primary authors: Derek Barnett, Erik Garrison, Michael Stromberg
(c) 2009-2012 Marth Lab, Biology Dept., Boston College
No, that's the run ID, see the wikipedia article. If you have paired-end reads, you'll need to align them as such for an insert size to be reported.
If the data comes from SRA database, then you can search that info there. If it is a newly sequenced data, than maybe easiest would be to ask sequencing provider.
also samtools
has a very similar command to bamtools
:
samtools stats --insert-size foo.bam
you can additionally speed upt the process by using --threads 10
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
As Chris suggested, you need to align your reads to get a SAM/BAM file and then run 'samtools stats' to get information about insert sizes..it will also give you an insert size distribution graph..