Entering edit mode
2.4 years ago
el97004
▴
80
Hi all!
Anyone know of a software tool that can be used to get reads from a de-novo assembly if I have the raw reads? the raw reads went through several steps of preprocessing and filtering and I no longer have the final reads that were used to make the assembly.
Thank you!
Thanks GenoMax ! Once aligned and obtained bam file could I not then just convert that to fastq?
Not sure then what your question is. What format is your query data in? That must already be in fastq format? Are you simply looking to extract reads that align to the assembly then (sometimes all reads may not align)?
Yeah I was just trying to find the set of reads that were used to make up the final assembly. Some of those raw reads will definitely not align since they were filtered out (e.g. contaminants).
Format information: assembly: fasta; raw reads: fastq
In that case use an aligner of your choice and extract reads that map: How To Filter Mapped Reads With Samtools
Be sure to get primary alignments (otherwise) there may be multiple copies of reads in extract.