2.5 years ago
pearl2070
I'm running Salmon with this line:
*./salmon-1.8.0_linux_x86_64/bin/salmon quant -p 12 -t microbial_all_cds.fasta -l A -a Sample1_megahit.annotation_bwa.sam -o Sample1_salmon*
And I encounter these errors:
*"Transcript appears twice in the transcript FASTA file"* and *"Transcript appears in the reference but did not appear in the BAM."*
Is there a way, on the command line, to
- process my transcript FASTA file so that there are no duplicates and
- process my reference file so that it's filtered, containing only transcripts that are also in my SAM file?
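Both steps can be sketched in a short Python script run from the command line. This is a minimal sketch, not a Salmon feature, and it rests on two assumptions: that "duplicate" means two FASTA records sharing the same name (the text after `>` up to the first whitespace), and that the set of transcripts "in the SAM file" is the set of `@SQ` entries in its header. The function names and the output file name in the usage comment are hypothetical.

```python
import sys


def read_fasta(lines):
    """Yield (name, header_line, sequence_lines) for each FASTA record."""
    name, header, seq = None, None, []
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            if header is not None:
                yield name, header, seq
            header = line
            fields = line[1:].split()
            # Assumption: the record name is the first whitespace-delimited token.
            name = fields[0] if fields else ""
            seq = []
        elif header is not None and line:
            seq.append(line)
    if header is not None:
        yield name, header, seq


def sam_reference_names(lines):
    """Collect reference names from the @SQ lines of a SAM header."""
    names = set()
    for line in lines:
        if not line.startswith("@"):
            break  # header is over; alignment records follow
        if line.startswith("@SQ"):
            for field in line.rstrip("\n").split("\t")[1:]:
                if field.startswith("SN:"):
                    names.add(field[3:])
    return names


def filter_fasta(fasta_lines, keep_names=None):
    """Keep the first record per name; optionally keep only names in keep_names."""
    seen = set()
    out = []
    for name, header, seq in read_fasta(fasta_lines):
        if name in seen:
            continue  # drop later records that reuse a name
        if keep_names is not None and name not in keep_names:
            continue  # drop records absent from the SAM header
        seen.add(name)
        out.append(header)
        out.extend(seq)
    return out


# Hypothetical usage:
#   python filter_fasta.py microbial_all_cds.fasta \
#       Sample1_megahit.annotation_bwa.sam filtered_cds.fasta
if __name__ == "__main__" and len(sys.argv) == 4:
    fasta_path, sam_path, out_path = sys.argv[1:4]
    with open(sam_path) as sam:
        keep = sam_reference_names(sam)
    with open(fasta_path) as fasta:
        kept = filter_fasta(fasta, keep)
    with open(out_path, "w") as out:
        out.write("\n".join(kept) + "\n")
```

The script keeps the first record for each name and silently drops later ones, so it is worth checking that the dropped records really are redundant. If the reference used for alignment had duplicate names itself, re-aligning against the cleaned FASTA may be the safer route.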
When you generated your BAM, did you align your reads to the genome or to the transcriptome? Salmon requires alignments to the transcriptome. You may also want to include your code for that part.
Ah, I aligned the reads to a reference database (microbial_all_cds.fasta). For BWA, I ran:
Followed by:
Should I instead be doing:
And then:
If Sample1_megahit.contigs.fa is your assembled transcriptome, then yes.
Got it, thank you!
Prior reference: Running salmon in alignment mode?