Metagenome Read Simulators
0
0
Entering edit mode
6.6 years ago
alesssia ▴ 580

Hello everyone,

I need to simulate a few metagenomics samples and I was thinking to use BBMap's simulator, randomreads.sh, in "metagenome" mode.

The problem is that I need to simulate the effect of PCR amplification, and I thought of including identical duplicates but with different quality scores.

Does anyone have any suggestion on how to do this? Or knows if BBMap randomreads.sh already simulates PCR duplicates (and if yes, how do I modify their proportion)?

Thanks a lot!

metagenomics simulation WGS BBmap • 2.0k views
ADD COMMENT
0
Entering edit mode

It probably does not simulate PCR dups. You could go the route you mention above.

coverage=X will automatically set "reads" to a level that will give X average coverage (decimal point is allowed).

metagenome will assign each scaffold a random exponential variable, which decides the probability that a read be generated from that scaffold. So, if you concatenate together 20 bacterial genomes, you can run randomreads and get a metagenomic-like distribution. It could also be used for RNA-seq when using a transcriptome reference.

The coverage is decided on a per-reference-sequence level, so if a bacterial assembly has more than one contig, you may want to glue them together first with fuse.sh before concatenating them with the other references.

ADD REPLY
0
Entering edit mode

Thanks, I think I will just generate the metagenomics samples, then randomly select varying percentage of reads, duplicate them, and finally shuffle their quality score. Will this work in your opinion?

ADD REPLY
1
Entering edit mode

You could check if there are PCR dups after generating the metagenome (using A: Introducing Clumpify: Create 30% Smaller, Faster Gzipped Fastq Files ). If not then proceed with your plan as stated above.

ADD REPLY
0
Entering edit mode

Good point. Since I need to keep the percentage of duplicates equal to fixed values I should first check if there are any and add/remove if needed! Thanks!

ADD REPLY

Login before adding your answer.

Traffic: 2058 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6