getting gene symbol from probe sequence
1
0
Entering edit mode
8.7 years ago
vahideh ▴ 20

hello I have a platform(GPL8708 for GSE74930) with this data:

ID     SEQUENCE        RANGE_START         RANGE_END            RANGE_GB 
ECOLIK12_F_00000001    GCTTTTCATTCTGACTGCAACGGGCAATATGTCTCTGTGTGGATTAAAAAAAGAGTGTCTG 2   62   U00096.2 
ECOLIK12_R_00000002 GAAGCTGCTATCAGACACTCTTTTTTTAATCCACACAGAGACATATTGCCCGTTGCAGTCAG   12   73   U00096.2 
ECOLIK12_F_00000003    CGGGCAATATGTCTCTGTGTGGATTAAAAAAAGAGTGTCTGATAGCAGCTTCTGAACTGG 22   81   U00096.2

... and so on!

I want to get gene names with those sequences. which packages help me(annotate, AnnotationDbi or annotationTools or ...)? Note that this platform isn't AFFY! Thank you!

probe sequence gene symbol • 1.6k views
ADD COMMENT
1
Entering edit mode
8.7 years ago
GenoMax 147k

Those probes are from E. coli K12 MG1655 genome and also here. Feature table for that genome is here. You have the start and end coordinates in your file. An intersectBed type strategy should give you the names you need. This appears to be a tiling array so all entries in your file would likely not get a gene name.

ADD COMMENT

Login before adding your answer.

Traffic: 2661 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6