I am new to bioinformatics. so this may be silly question.
We would like to do Pharmacogenomics clinical decision support. Instead of using genotyping, we want to use whole genome sequencing. I am assuming we will send 8 patient samples for analysis. (ideally we wound sent just one, but the pricing is such that we have to send a batch). There are lots of format and it is hard to know where to start, what we will have to do in-house and what can we get our of the box (going only to sequencing vendor, not interpretation vendor)
My questions are:
in what format we can expect to get the raw data? (would that be fastaq or BAM/SAM)
does one get right away a *.vcf file from a vendor or we will have to process it ourselves? (against which reference genome it would be done - would that be Homo sapiens GRCh37 Primary Assembly?
Definitely spend time looking at different vendors and sequencing centres to see what they give you and what support services they provide. If you are new to Bioinformatics you aren't going to want to go with the cheapest provider, because trust me.... the level of support just won't be there. WGS is a pretty fast moving field even for an experienced bioinformatician and figuring out good ways of approaching your data takes a bit of work and you need to address and lay out a plan before you send anything out for sequencing. Good sequencing centres have tremendous experience in experimental design and often provide good "first pass" bioinformatics support. I've done a lot of work with Genome Quebec's Innovation Centre in Montreal (University of McGill) and they are fantastic. As part of my PhD I worked in a lab doing lots of de novo genome sequencing of microbial eukaryotes and the cheaper providers may be more trouble than they are worth, and you will probably not get any bioinformatics support from them