freebayes for variant calling too slow
1
0
Entering edit mode
4.2 years ago
evafinegan • 0

Hi All,

I am using freebayes to call variants mainly SNPs from 200 samples. I have created bam files by aligning RNA-seq fastq files to the transcriptome. I want to make a vcf file with SNPs from all the samples. However, it seems to run fairly slow and job is not complete after three days. I am not sure if there is something wrong or is it normal speed? Thank you

freebayes  -f  transcriptome.fasta bam1 bam2-------bam200 > result_file.vcf
rna-seq snp • 2.8k views
ADD COMMENT
3
Entering edit mode
4.2 years ago
JC 13k

Freebayes is running in a single core, in general, you can split the task per chromosome, check the parallel script https://github.com/ekg/freebayes/blob/master/scripts/freebayes-parallel

ADD COMMENT
0
Entering edit mode

Is it a python script? Is there a way to split the regions in linux and use multi-threading. Thank you!

ADD REPLY
0
Entering edit mode

That is pure BASH using GNU parallel

ADD REPLY
0
Entering edit mode

I meant the script to generate chunks of regions.

ADD REPLY
1
Entering edit mode

it doesn't need multithread, it simply reads the Fasta and define chunks https://github.com/ekg/freebayes/blob/master/scripts/fasta_generate_regions.py

ADD REPLY
0
Entering edit mode

Got it, thanks!

ADD REPLY

Login before adding your answer.

Traffic: 2796 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6