Find Corresponding GenBank ID
0
0
Entering edit mode
6.7 years ago
Penny Liu ▴ 30

This is what I have.

Is it possible to get corresponding GenBank ID via BioSample or BioProject Accession? It is great if you guys could give me some valuable suggestions and comments how to solve.

Thanks

gene BioSample BioProject GenBank • 2.6k views
ADD COMMENT
0
Entering edit mode

Are you referring to NZ* accession numbers or WP* Protein ID's as described on this summary annotation page?

ADD REPLY
0
Entering edit mode

Thanks, but it's not what I expected. For example, https://www.ncbi.nlm.nih.gov/protein/AAC76128.2

BioProject: PRJNA225, BioSample: SAMN02604091. Both of these should refer to the GenBank accession AAC76128.

ADD REPLY
0
Entering edit mode

I am not sure I understand your intent. You want to associate a biosample (which is whole genome sequence) to a specific protein?

ADD REPLY
0
Entering edit mode

I was assigned BioProject and BioSample accession number when I take WGS submission. Will I get the GenBank ID when the annotation process is finished? Perhaps I misunderstood and thanks again!

ADD REPLY
0
Entering edit mode

Referring back to the original example isn't GCA_002318995.1the accession that has been assigned to your genome? Looks like yours is one of the 8 assemblies available for this organism.

ADD REPLY
0
Entering edit mode

In my submission, the strain is KCT instead of YHL.

ADD REPLY
0
Entering edit mode

Classification is at the taxid level so it looks like all strains of the species are collected under main taxid (Shewanella algae (taxid:38313)).

ADD REPLY
0
Entering edit mode

As far as I know, Shewanella haliotis and Shewanella algae are the same genus but different species.

ADD REPLY
0
Entering edit mode

I did this as a part of one of my project. This could be helpful

library("genomes")
proks <- reports("prokaryotes.txt")

# BioProject Accession: PRJNA312015
biopro <- proks[grep("312015",proks$`BioProject ID`),]
ftp_biopro <- biopro$`FTP Path`


file_type <- "feature_table.txt"
GCA <- tail(unlist(strsplit(ftp_biopro, "/")),1)
file_type <- paste0(GCA,"_",paste0(file_type, ".gz"))
down_file <- paste(ftp_biopro,file_type, sep ="/")
dest <- "weather.op.gz" 
download.file(url = down_file, destfile = dest) 
my_data <- read.table(dest, header=F, sep = "\t")
ADD REPLY

Login before adding your answer.

Traffic: 2010 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6