extract gene coordinates and TSS from gene list
3
0
Entering edit mode
20 months ago
Chironex ▴ 50

Hi, I have a list of 50 genes like this:

"Tshz2"   "Ebf1"    "Col26a1" "Ebf2"    "Unc5c"   "Prkg1"   "Eya1"    "Ldb2"    "Rbms3"   "Arhgap6"

And I need the coordinates of gene body and the TSS +1000/-1000 bp. How can I get it on R? The genome is mm10.

Thanks

r genomicranges • 1.8k views
ADD COMMENT
3
Entering edit mode
20 months ago
Gordon Smyth ★ 7.7k

To get gene bodies:

library(Rsubread)
GeneBody <- promoterRegions("mm10", upstream=0, downstream=Inf)

To get TSS +/- 1000 bp:

TSS <- promoterRegions("mm10", upstream=1000, downstream=1000)

This gives results for all genes by Entrez Gene ID.

ADD COMMENT
0
Entering edit mode

Thank you, this is perfect!!

ADD REPLY
1
Entering edit mode
20 months ago
Trivas ★ 1.8k

I always like using the UCSC Table Browser for things like these. You might need to convert to Refseq (NM_*) for it to work well, which can also be done easily with biomaRt.

ADD COMMENT
0
Entering edit mode
20 months ago

I'm not going to give you a complete answer, but you can start looking at this posts: how to retrieve mouse (mm10) gene information from Ensemble using Biomart inR

And use the R interface for biomart. Its also possible to access biomart through the Ensemb website.

ADD COMMENT
0
Entering edit mode

thank you, I was able to get the coordinates of the gene, but I need to extract the TSS and extend it to 1000 bp.

ADD REPLY

Login before adding your answer.

Traffic: 2554 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6