What's the best way to serve VCF data on web?
2
0
Entering edit mode
4.6 years ago
magnolia ▴ 30

I would like to know what is the best strategy to store VCF data and serve on the web. So far, it seems like I have two options but I don't know the limitations and negative sides of these options:

  1. Stream from VCF
  2. Store data in DB such as Postgres

Considering there will be thousands of VCFs, what should be done?

Thanks!

vcf web database postgres • 1.8k views
ADD COMMENT
0
Entering edit mode
ADD REPLY
0
Entering edit mode

and another that is a web based front end for GEMINI database

https://academic.oup.com/bioinformatics/article/35/1/122/5047758

ADD REPLY
0
Entering edit mode
4.6 years ago

I don't know if it will fullfill your needs, I wrote a basic bam+vcf file server. It's currently in the dev branch of jvarkit: https://github.com/lindenb/jvarkit/blob/dev/docs/HtsFileServer.md

java -jar dist/htsfileserver.jar -R src/test/resources/rotavirus_rf.fa src/test/resources/S*.bam  src/test/resources/rotavirus_rf.*.vcf.gz

enter image description here

ADD COMMENT
0
Entering edit mode

Considering there will be thousands of VCFs,

Will your solution scale to that level?

ADD REPLY
0
Entering edit mode

as I said, it's pretty basic.

pro: should work out of the box (no sql...)

con: there will be thousands of lines in the html page

ADD REPLY
0
Entering edit mode

let me be clear: it's a FILE server, not a tool to query VCFs...

ADD REPLY
0
Entering edit mode
4.6 years ago
d-cameron ★ 2.9k

htsget is the API for serving SAM/BAM/VCF over HTTP, but support is currently limited as it is a relatively new standard.

ADD COMMENT

Login before adding your answer.

Traffic: 1296 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6