NGS bam file reference-guided assembly
1
0
Entering edit mode
4.2 years ago

hi,

I have some samples, where each has PE reads(150 bp length) of NGS in fastq format.

I already aligned them to the refseq with BWA.

I want to take N bases and do some kind of manipulation on them.

I thought to assemble the bam file, with the mutations, and in the gaps(where I don't have reads) fill it with the refseq.

do you know any reference-guided assembly that creates genomes to each sample, so I can take n bases from wherever I want?

thanks

next-gen assembly alignment • 996 views
ADD COMMENT
1
Entering edit mode
4.2 years ago

Do you mean something like consensus generation ?

Generating consensus sequence from bam file

I think you are also mixing alignment and assembly up. Please read basic NGS tutorials to understand this key difference.

ADD COMMENT
0
Entering edit mode

what is the diffrence from :

gatk4 FastaAlternateReferenceMaker -R $REFERENCE -O $CONSENSUS_FASTA -V $VCF

https://medium.com/brown-compbiocore/building-a-consensus-sequence-with-vcf-files-db7407f3f86f

ADD REPLY
1
Entering edit mode

When you say reference guided assembly, a professional bioinformatician will think you are trying to do an assembly using tools such as Canu, Spades, etc, but using a reference genome to guide you.

What you are trying to do is use an alignment (BAM, VCF etc) and an existing Fasta to generate a consensus reference. Note : this is _not_ assembly.

But yes, GATK or samtools mutfa will likely do a good job on that. Beware insertions and deletions are inserted correctly.

ADD REPLY

Login before adding your answer.

Traffic: 2605 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6