I have a list of ncbi refseq_ncrna id, for example NR_046233, NR_015516, NR_102721...I need the position of the transcripts on chromosome.
I have tried to convert the refseq id to ensembl id, and use R package ensembldb to get the chromosome location of the transcript, but a lot of refseq_ncrna ids don't have corresponding ensembl id.
Is there any way to directly get the chromosome location of the transcript use refseq_ncrna id?
Use BioMart from ENSEMBL.
1. Go to the ENSEMBL BioMART tool.
2.Choose database- usually I choose ENSEMBL genes
3.On the right hand, you will find Filters, click on it and then choose genes.
4. In the second drop down you will find - Input External references. Tick this one. On the left, there is a drop down box, select RefSeq ncRNA ID (there you have it! :)), and paste all the RefSeq IDs of nc RNAs of your interest.
5. Then on the left you will see attributes, click on that.
6. Then select Gene- here you can select chromosome/scaffold name, strand, transcription start site or length of transcript.
And hopefully now you will have almost all information for your ncRNAs.
Thanks a lot! although it still doesn't work for ncRNAs without ensembl transcript id
Did you choose Refseq ncRNA ID from the dropbox menu?
Yes I did, and it only gives results for those with ensemble transcript ids