*EDIT* Thank you every one. It was user error. Please see the information below.
Hello everyone,
I have encountered an problem I have never had before. I have salmonella sequencing data(ion torrent) that will not map and I have no idea why.
First we tried to use DNAStar, it run thoughts without error but then at the end says there is no sequencing data and fails to align.
Next I tried using TMAP and get a segmentation error [core dump] after the initial load. (alright I lied. That is an error)
Next I tried bwa mem and it loads all the reads and finishes without error and samtools flagstats says 0 reads map.
Lastly I tried using SPAdes for a de novo build, which worked and I was able to make salmonella contigs.
Fastqc is able to read the data without issue and it seems to be OK data.
Has anyone encountered this type of issue before?
Any advice, help, or pointing me in the right direction will be greatly appreciated.
Thank you,
Sean
Can you try BBMap?
Can you try these options
minratio=0.15 ignorequality slow ordered
which are meant for Nanopore/PacBio reads with errors/indels.In what format is your Ion Torrent data? And would you paste the first 10-12 lines of data?
it is unmapped bam, but in fastq it looks like this
That first read is definitely from
Salmonella enterica
.Just to be sure it's not a hidden formatting issue, what happens if you convert the BAM to FASTQ and align with BBMap?
As @genomax2 noted, the reads are from S. eneterica so they should align.
You both are correct. It is fine. I apparently have a truncated genome file.
The genbank summary file is 4.0KB in size The genbank full (truncated) I was using is 3.4MB in size the genbank full (I just reloaded) is 11.4MB
Right now mapping seems to be moving along just fine with the correct file. The truncated file had all the genes listed but was missing the actual sequence.
Thank you both for your quick help!
Also for the record :)
It works so much better with something to actual align. ------------------ Results ------------------