Question

Merging mapping results

0


10.4 years ago

bpz ▴ 60

Hello

I think this is a silly question, but I need help with it nonetheless :p

I have several BAM files from mapping runs I did with bwa, and I have extracted the FASTQ sequences from these files.

So, I need to put all the FASTQ sequences in one file, but without repetitions. (If one read was a hit in all mappings, I need it only once in the final file; if another appeared in just two mappings, again, I need it only once... you get the picture.)

Thanks in advance.

Mapping Assembly • 1.6k views

updated 3.2 years ago by Ram 45k • written 10.4 years ago by bpz ▴ 60

0


Grep the read names from fastq_file_1 into file_list, then grep the read names from fastq_file_2 and append them to the same file_list. Do this for all your samples, remove duplicates from the list, and extract the reads with seqtk.
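The name-collection step could also be done in a few lines of Python instead of grep. This is only a rough sketch, assuming plain 4-line FASTQ records (no wrapped sequences) and that the first whitespace-delimited token of each header is the read name; the function name `unique_read_names` is made up for illustration.

```python
from itertools import islice


def unique_read_names(handles):
    """Return read names from several FASTQ handles, each name only once.

    Assumes 4-line FASTQ records, so every fourth line (starting at the
    first) is a header of the form "@name optional description".
    """
    seen = set()
    names = []
    for handle in handles:
        # Take lines 0, 4, 8, ... which are the header lines.
        for header in islice(handle, 0, None, 4):
            if not header.startswith("@"):
                continue  # trailing blank line or malformed record
            fields = header[1:].split()
            if not fields:
                continue  # header with no name after "@"
            name = fields[0]
            if name not in seen:
                seen.add(name)
                names.append(name)  # keep first-seen order
    return names
```

Writing the returned names one per line gives a list that `seqtk subseq` accepts as its second argument. Note that paired-end names ending in /1 and /2 are treated here as different reads.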

10.4 years ago by apelin20 ▴ 490

score 0 · Answer 1 · 2015-02-17

0


10.4 years ago

Biomonika (Noolean) 3.2k

Just put them all in one file and filter out duplicates; see this thread:

How To Remove The Same Sequences In The Fasta Files?
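For FASTQ reads, the "one file, then filter duplicates" idea might look like the sketch below. It is not taken from the linked thread: it assumes 4-line FASTQ records and treats two records as duplicates when their read names (the first token after `@`) match, keeping the first one seen. The names `fastq_records` and `merge_unique_reads` are hypothetical.

```python
def fastq_records(handle):
    """Yield (header, sequence, plus, quality) from a 4-line FASTQ handle."""
    while True:
        header = handle.readline()
        if not header.strip():
            return  # end of file or trailing blank line
        seq = handle.readline()
        plus = handle.readline()
        qual = handle.readline()
        yield tuple(s.rstrip("\n") for s in (header, seq, plus, qual))


def merge_unique_reads(inputs, output):
    """Write each read once to `output`; return the number of reads written.

    Duplicates are detected by read name only (an assumption), so the
    first record seen for a given name wins.
    """
    seen = set()
    written = 0
    for handle in inputs:
        for record in fastq_records(handle):
            fields = record[0][1:].split()
            name = fields[0] if fields else ""
            if name in seen:
                continue
            seen.add(name)
            output.write("\n".join(record) + "\n")
            written += 1
    return written
```

Opening each per-mapping FASTQ file and passing the handles as `inputs`, with the merged file as `output`, would produce the combined file in one pass. The set of names is held in memory, which may matter for very large read sets.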
