Entering edit mode
6.5 years ago
pinn
▴
210
Hi I tried to align almost more than 10 human genome datasets against the hg38. gsnap able to align, generating a truncated sam file. Can any way, suggest me how to sort it out ?
Command:
./gsnap -d hg38 -D /data/Likith/gmap-2018-05-30/share/hg38 -t 10 /data/shayantan/cancer_samples/sample1/fixed1_normal.fq /data/shayantan/cancer_samples/sample1/fixed2_normal.fq > S1.gsnap.sam
Input file sizes:
R1 & R2- 97Gb , 97 Gb
hg38 - 3.1 GB
output file:
S1.gsnap.sam - 410 GB
Viewing the sam file:
$ samtools view S1.testgsnap.sam | head
[W::sam_read1] Parse error at line 1
[main_samview] truncated file
Try just
head S1.testgsnap.sam
.Also, are you sure you want all of the datasets merged together like this? Usually you want them as separate files.
Hi, i'm able to view the SAM file with head that is not a problem. when converting from SAM to sort.bam its showing truncated sam file ? Is their any other way, can i convert to sort.bam ?
Then with samtools:
Ah, it's not actually writing a SAM file! Maybe there's an option to change that?