Entering edit mode
4.7 years ago
Luther_Blisset
▴
30
I have a list of SNP positions:
761155
4402247
472711
4027247
2867693
I have a gff reference annotation file and a reference fasta file.
How do I simply find the gene name and locus (e.g. "ctpJ", "Rv3743c", respectively; or "Rv3760", "Rv3760" if there is no gene name) where these SNPs occur?
Position data should include both chromosome and position within chromosome. Once you have that, you can use biomaRt to get gene data. Please search the site - that topic has been addressed multiple times.