Retrieve amino acid sequences from old locusID?
2
0
Entering edit mode
7.0 years ago
taylrtam • 0

Hi,

I have a list of discontinued locusID tags that I want to retrieve the amino acid sequences for.

The list is as follows:

Bphyt_0466

Bphyt_4414

...

Bphyt_4439

Is there any way to batch retrieve the amino acid sequences?

blast genome sequencing sequence gene • 1.7k views
ADD COMMENT
1
Entering edit mode
7.0 years ago
Sparrow_kop ▴ 260

Hi, you can use R 'KEGGREST' library to extract the amino seq from the KEGG database: The below is the demo script:

library(KEGGREST)
library(Biostrings)

aaseq_query <- keggGet("bpy:Bphyt_2906", "aaseq")
aaseq <- as.character(unlist(aaseq))
aaseq

[1] "MSEKEIDQVLVERVQKGDKAAFELLVSKYHRKILRLISRLVRDPAEVEDVAQDAFIKAYRALPQFRGESAFYTWLYRIAVNTAKNYLATQGRRAPTSTEADAEEAETFSDADQLRDINTPESMLMSKQIAETVNAAMAVLPEELRTAITLREIEGLSYEEIAEMMGCPIGTVRSRIFRAREAIAAKLRPLLDTPEGKRW"

The other way is to use web scrapping by python or others, use the below url :

http://www.genome.jp/dbget-bin/www_bget?bpy:Bphyt_2906

and you can replace the gene id at the last position in the url , and the write a script to parse the web to extract the seq.

ADD COMMENT
0
Entering edit mode
7.0 years ago
Sparrow_kop ▴ 260

Hi, you can use R 'KEGGREST' library to extract the amino seq from the KEGG database: The below is the demo script:

library(KEGGREST)
library(Biostrings)

aaseq_query <- keggGet("bpy:Bphyt_2906", "aaseq")
aaseq <- as.character(unlist(aaseq))
aaseq

[1] "MSEKEIDQVLVERVQKGDKAAFELLVSKYHRKILRLISRLVRDPAEVEDVAQDAFIKAYRALPQFRGESAFYTWLYRIAVNTAKNYLATQGRRAPTSTEADAEEAETFSDADQLRDINTPESMLMSKQIAETVNAAMAVLPEELRTAITLREIEGLSYEEIAEMMGCPIGTVRSRIFRAREAIAAKLRPLLDTPEGKRW"

The other way is to use web scrapping by python or others, use the below url :

http://www.genome.jp/dbget-bin/www_bget?bpy:Bphyt_2906

and you can replace the gene id at the last position in the url , and the write a script to parse the web to extract the seq.

ADD COMMENT

Login before adding your answer.

Traffic: 1806 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6