Question

Aligning fasta to fastq

0

8.2 years ago

ammarsabir15 ▴ 70

I want to align FASTA sequences for the 28 exons of the SCN5A gene against FASTQ reads from whole-exome sequencing data from the 1000 Genomes Project.

By aligning the two, I want to find mutations within the exons of the SCN5A gene.

Should I use conventional NGS alignment methods for this, i.e. build an index from the FASTA and then align the FASTQ reads to it, or is there another strategy?

sequencing ngs alignment • 2.5k views

2

Irrespective of the method you settle on, keep in mind that when you align data to an abbreviated version of a genome (in this case just the SCN5A gene), aligners may place reads there that actually originate from elsewhere in the genome. If that is a concern, aligning to the whole genome/exome and then retrieving the alignments in the regions of interest may be a safer option.

8.2 years ago by GenoMax 151k

0

And in your case, if a short read comes from SCN10A, which is homologous to SCN5A, it might produce a false positive when mapped only to SCN5A.

8.2 years ago by Pierre Lindenbaum 166k

0

OK, I will try this. But let's say SCN5A is on chromosome 3: how do I get only the chromosome 3 exons from the whole-exome data?

8.2 years ago by ammarsabir15 ▴ 70

0

You could align to the whole genome/exome and then extract the regions you need afterwards; see the thread "Extracting reads from multiple regions".
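
A rough sketch of that workflow, for illustration only. The thread does not name an aligner, so `bwa mem` is an assumption here, as are paired-end reads and all file names (`ref.fa`, `reads_1.fastq`, `reads_2.fastq`, the `sample` prefix). The helper `build_pipeline` is hypothetical and only assembles the command lines, so they can be inspected before anything is run.

```python
import shlex


def build_pipeline(reference, fastq1, fastq2, region, prefix="sample", threads=4):
    """Return shell command strings: index reference, align + sort, index BAM, extract region.

    Assumptions (not from the thread): bwa mem as the aligner, paired-end FASTQ input.
    Nothing is executed here; the commands are only assembled as text.
    """
    q = shlex.quote  # protect paths that contain spaces or shell metacharacters
    sorted_bam = f"{prefix}.sorted.bam"
    region_bam = f"{prefix}.region.bam"
    return [
        # 1. Index the whole reference, not just the gene of interest
        f"bwa index {q(reference)}",
        # 2. Align the reads and coordinate-sort the output in one pipe
        f"bwa mem -t {threads} {q(reference)} {q(fastq1)} {q(fastq2)}"
        f" | samtools sort -o {q(sorted_bam)} -",
        # 3. Region queries need an index on the sorted BAM
        f"samtools index {q(sorted_bam)}",
        # 4. Keep only the alignments overlapping the region of interest
        f"samtools view -b -o {q(region_bam)} {q(sorted_bam)} {q(region)}",
    ]


if __name__ == "__main__":
    # Placeholder file names and interval; substitute your own
    for command in build_pipeline("ref.fa", "reads_1.fastq", "reads_2.fastq", "chr3:1-450000"):
        print(command)
```

The contig name in the region has to match the naming in the reference you aligned against (for example `3` versus `chr3`), otherwise the last step will not find the region.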

8.2 years ago by GenoMax 151k

0

After alignment I got a BAM file. What should the next step be to get the sequence of the chromosome where my gene of interest is located? Should I extract regions from the BAM file by chromosome using samtools view and then convert them to FASTQ? Or should I convert the BAM file to a BED file and then extract the required regions?

8.2 years ago by ammarsabir15 ▴ 70

0

Things may be a bit tricky depending on what you want.

Are you looking to get a consensus? Take a look at this. If you only want to restrict to a chromosome or to the SCN5A gene, you could limit the region that you feed to samtools mpileup by using samtools view sorted.bam chr3:1-450000 (use the interval you want; the BAM must be coordinate-sorted and indexed for region queries to work).
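
If the exon coordinates are available as BED lines, one way to turn them into region strings for samtools is sketched below. BED intervals are 0-based and half-open, whereas samtools regions are 1-based and inclusive, so the start needs +1 while the end stays the same. The function names and the exon coordinates in the example are made up for illustration; they are not real SCN5A coordinates.

```python
def bed_to_region(chrom, start, end):
    """Convert one BED interval (0-based, half-open) to a samtools region (1-based, inclusive)."""
    if start < 0 or end <= start:
        raise ValueError(f"invalid BED interval: {chrom}\t{start}\t{end}")
    return f"{chrom}:{start + 1}-{end}"


def regions_from_bed_lines(lines):
    """Parse BED lines (first three columns) into samtools region strings."""
    regions = []
    for line in lines:
        line = line.strip()
        # Skip blank lines, comments, and UCSC-style header lines
        if not line or line.startswith(("#", "track", "browser")):
            continue
        chrom, start, end = line.split()[:3]
        regions.append(bed_to_region(chrom, int(start), int(end)))
    return regions


if __name__ == "__main__":
    # Hypothetical exon intervals, for illustration only
    example_bed = [
        "chr3\t999\t1200\texon_a",
        "chr3\t4999\t5300\texon_b",
    ]
    regions = regions_from_bed_lines(example_bed)
    # Print the command rather than running it
    print("samtools view -b sorted.bam " + " ".join(regions))
```

samtools view accepts several regions after the BAM name, and samtools mpileup has a `-r` option for a single region, so either could be used to restrict the pileup. `samtools view -L exons.bed` is another option that takes the BED file directly.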

8.2 years ago by GenoMax 151k