Hello
I am returning to bioinformatics after a long absence and the landscape has certainly changed and there are a few (well, a lot of) things I'm not clear about.
Taking the human genome as an example, what is the difference between a SNP and an entry in a the HGMD? I've been using the ensembl genome browser to look at sequence variations and I'm unsure as to why some are classed as SNPs and some mutations. My understanding of a SNP is that it is a mutation that happened a long time ago and has now become established as a polymorphism in the population, so you refer to it as a polymorphism not a mutation. Using that logic a mutation would be a a variation from the reference genome which has recently occured, but I don't think that's the right answer. I think a mutation is a SNP associated with a disease phenotype. However SNPs and HGMD mutations that cause missense mutations seem to be treated differently by ensembl and I don't understand why and it is confusing me.
Take this example gene and transcript Gene: ENSG00000163914 Transcript: ENST00000296271
There is a 'mutation' at the 4th codon which gives rise to a missense mutation and is linked to the HGMD via this ID: CM930647. However this mutation is not listed among the protein variants for this transcript. Other variants from SNPs and some other mutations are listed. Perhaps this is nothing more than an error in the database.
Thanks in advance for your help
Why can't i add a comment?
I'm trying to add a reply but the website won't let me add a link: nothing happens when i click 'add comment.' I've also tried wrapping the link up in [?] tags too. How did you add a link to you reply? Thanks
Links will be parsed automatically in comments, there is a link button in the text editor for posts.
Hi. I've got 8 variations but i was looking at an archive version if that makes a difference
If you then look at the cdna you can see a mutation at the 4th codon which causes a missense mutation but this isn't listed in the protein variants.
Do you have any idea why this wouldn't be listed as a variant?
I tried posting links to be helpful but the page wouldn't let me. I was looking at the May 2010 archive
the point of blindly trusting what the database fields mean is what in fact they contain may not be always the best idea. as we all know, database fields are meant to handle information, and the exact meaning of that information does not always suit the database field names. databases help to handle very high percentages of the whole, but when it comes to diseases and particular issues then they are not so straight-forward. I'm expanding my point on a proper answer, as I'm running out of characters here ;)