Question

Miseq Genomic DNA analysis

0

Entering edit mode

7.5 years ago

novicebioinforesearcher ▴ 70

We have sequenced genomic dna using miseq 2x250, and have received fastq files, i was wondering how to go about with respect to alignment of these reads and check for indels and snps is there a specific work flow?

genomic dna alignment • 1.8k views

ADD COMMENT • link updated 7.5 years ago by Brian Bushnell 20k • written 7.5 years ago by novicebioinforesearcher ▴ 70

0

Entering edit mode

which species are you sequencing?

ADD REPLY • link 7.5 years ago by dyollluap ▴ 310

0

Entering edit mode

we are working on mouse

ADD REPLY • link 7.5 years ago by novicebioinforesearcher ▴ 70

score 4 · Accepted Answer · 2017-07-13

4

Entering edit mode

7.5 years ago

Brian Bushnell 20k

Well, you could use the BBMap package and do something like this:

#Remove duplicates
clumpify.sh in=reads.fq.gz out=clumped.fq.gz dedupe optical

#Remove low-quality regions
filterbytile.sh in=clumped.fq.gz out=filtered_by_tile.fq.gz

#Trim adapters
bbduk.sh in=filtered_by_tile.fq.gz out=trimmed.fq.gz ktrim=r k=23 mink=11 hdist=1 tbo tpe minlen=100 ref=bbmap/resources/adapters.fa ftm=5 ordered

#Remove synthetic artifacts and spike-ins.
bbduk.sh in=trimmed.fq.gz out=filtered.fq.gz k=27 ref=bbmap/resources/sequencing_artifacts.fa.gz,bbmap/resources/phix174_ill.ref.fa.gz ordered qrtim=r trimq=6

#Map to reference
bbmap.sh in=filtered.fq.gz out=mapped.sam.gz bs=bs.sh pigz unpigz ref=reference.fa

#Call variants
callvariants.sh in=mapped.sam.gz out=vars.txt vcf=vars.vcf.gz ref=reference.fa ploidy=1 prefilter

ADD COMMENT • link 7.5 years ago by Brian Bushnell 20k

1

Entering edit mode

For WGS MiSeq runs first two steps should not be needed.

ADD REPLY • link 7.5 years ago by GenoMax 148k

1

Entering edit mode

I'll have to look into that. I don't see much MiSeq 2x250bp WGS data, mainly 2x300 amplicon data, which gets pretty ragged toward the ends. I have noticed, though, that MiSeq appears to have a pretty consistent positional quality component (at least in the sample I analyzed for that purpose) that appears to be due to focus (the number of mismatches increased radially out from the center).

ADD REPLY • link 7.5 years ago by Brian Bushnell 20k

0

Entering edit mode

Thank you,I will ask my admin to install bbMap meanwhile can this also be done using bwa mem?