Ways To Detect Bias In Dna Sampling For Genomic Sequencing
2
1
Entering edit mode
12.1 years ago
Lee Katz ★ 3.2k

Hi all, I need to detect DNA sampling bias. We extracted DNA using two different library preps and then multiplexed the samples.

I think that sampling bias will basically be something where each read will be more likely to start with a certain GC content or sub sequence. Therefore I think the best way to analyze any bias will be to assemble with an assembly output file (in my case, I'll do velvet, with the output afg file). After that, I am not exactly sure what to do. I could look places with low depth, e.g., at contig breaks, and see what the first 10 bp are at each place.

Another way might be to look at the first 10 bp at each read and see if there is any bias too. Gibbs sampling, %GC, some kind of kmer analysis?

But anyway, what is a good standard or common-sense way to do this? Thank you BioStar community!

genome sequencing • 4.2k views
ADD COMMENT
5
Entering edit mode
12.1 years ago

FastQC will give you over-represented subsequences, as well as kmers and GC content.

ADD COMMENT
0
Entering edit mode

as well as nucleotide distributions along each position of the reads in your sample ...

ADD REPLY
1
Entering edit mode
12.1 years ago

I would sugest trying MEME.

You can convert your fastq to fasta and sample some reads (for speed).

If you have a motif (rather than just elevated gc content) then you will get a nice motif plot.

http://tools.genouest.org/tools/meme/doc/examples/meme_example_output_files/meme.html

ADD COMMENT

Login before adding your answer.

Traffic: 1802 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6