Extract Raw Counts From Sam File
2
0
Entering edit mode
11.3 years ago
mad.cichlids ▴ 140

Hi, All

I was performing a de novo assembly of RNA seq data. I aligned my reads to the assembly for each of my samples and generated sam file for each, my question is how am i to extract the raw reads from the sam files? I know couple of tools such as htseq can extract reads with the reference genome, but how about the tools without any reference genome to perform the task of extracting raw reads?

Thanks a lot for any suggestions.

Best,

sam rnaseq counts • 5.4k views
ADD COMMENT
1
Entering edit mode
11.3 years ago
Gabriel R. ★ 2.9k

Sam or bam ?

Do you just want to see your reads + quality ?

If sam, just use

cut -f 10,11 file.sam

if bam

samtools view file.bam | cut -f 10,11

Now if you want your data fasta or fastq, use this hacked version of samtools which produces fastq/a : https://github.com/udo-stenzel/samtools-patched

ADD COMMENT
0
Entering edit mode

Thank you so much! Sorry, i could have stated it more clearly. I meant to ask how many raw reads aligned to each of my assembled contigs, so that i can compare the gene expression differences later. the 10, 11 are respective sequence and quality score right?

ADD REPLY
0
Entering edit mode
11.3 years ago
Rm 8.3k

use eXpress pipeline

ADD COMMENT
0
Entering edit mode

awesome, really appreciate it!

ADD REPLY

Login before adding your answer.

Traffic: 2033 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6