I have a set of ~11k SNPs for Saccharomyces cerevisiae, baker's yeast and I would like to identify which ones of these sit in transcription factor binding sites and if they do information on the relevant TFBS.
I've scoured this site and the internet and I couldn't find a downloadable database that would give me locations of all known/verified TFBS for yeast. I've studied the YEASTRACT website thoroughly but didn't find such database to download information about TFBS for all the ORFs/genes in one go.
Moreover, when I tried getting the information manually through YEASTRACT search on-line I found results confusing. for example: searching for TF for ORF "YOL166W-A" returns a list of 4 TFs. clicking on one of them, say Sok2p, takes you to another page which says, amongst other things, that the corresponding TFBS is "acMTGCAKg"... what does it mean? (i know the ACTG alphabet but what are the 'a', 'c', 'K' and 'g' symbols?) and what does this tell me about the actual location of the binding site? do I have to BLAST the whole genome to identify it? (shouldn't there be a database with this info for yeast already?) if so, how do I BLAST for symbols like 'K' and 'g'?
I have background in stats but currently work in genetics applications, hence extracting relevant bioinformatics data is very confusing for me on occasions. any help will be appreciated.
thank you for this! i shall try this in the coming week