Possible to get NCBI assembly and read set of several genomes?
5.2 years ago

Hello!

I am trying to get a set of assemblies from one species, let's say Bacillus cereus, for which the corresponding read sets are also available.

Is there a way to get that direct link between an assembly and its reads in NCBI?

So far I have tried searching the "Assembly" database for B. cereus, but when I open an assembly there is no link to the read set it was created from. Does someone know how to do the trick?

Thanks

ADD COMMENT

This question has been answered multiple times in the past. See the answers and the links I posted in this thread: more elegant way to bulk download genomes from the NCBI

In short, the NCBI genome download tool mentioned in @jrj.healey's answer should do the trick.

You will need to look through the BioSample accessions associated with the assemblies to get the read data. Use the sra-explorer tool by Phil Ewels for that.
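
One way to collect those BioSample accessions in bulk is to read them from an NCBI assembly summary table. The sketch below is only an illustration, not the method from the linked thread. It assumes a tab-separated summary file whose commented header line names the columns `assembly_accession` and `biosample`, and that missing values appear as `na` or empty. The function name `assembly_to_biosample` is hypothetical.

```python
import csv
import io


def assembly_to_biosample(summary_text):
    """Map assembly accession -> BioSample accession.

    Assumes tab-separated text whose header is a '#' comment line that
    contains the column names 'assembly_accession' and 'biosample'.
    """
    header = None
    mapping = {}
    reader = csv.reader(io.StringIO(summary_text), delimiter="\t",
                        quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row:
            continue
        if row[0].startswith("#"):
            # Comment line: treat it as the header only if it names the
            # assembly accession column; otherwise skip it.
            cols = [c.lstrip("# ").strip() for c in row]
            if "assembly_accession" in cols:
                header = cols
            continue
        if header is None:
            continue  # data before any header line cannot be interpreted
        record = dict(zip(header, row))
        biosample = record.get("biosample", "")
        # Assumption: 'na' or an empty field means no BioSample is recorded.
        if biosample and biosample != "na":
            mapping[record["assembly_accession"]] = biosample
    return mapping
```

The resulting BioSample accessions can then be pasted into sra-explorer or looked up in SRA. An assembly having a BioSample does not guarantee that reads were deposited, so some lookups will come back empty.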

ADD REPLY

Thank you for the answer, but which answer by @jrj.healey do you mean? I did not see anyone with that name.

And which accessions do you mean? I tried using the BioSample IDs and the assembly accessions, but that did not work.

ADD REPLY

That user changed his screen name to @Joe. So that would be the answer to look for.

The second answer in the thread I linked above can serve as an example. If you run this search at NCBI, you will see some assemblies for Lactobacillus. Sort by RefSeq assembly release date (the sort option is at the top of the page; newer assemblies are more likely to have NGS data) and you will see this first result. Clicking on the associated BioSample gives you the SRA accession.

You can probably use Entrez Direct (EDirect) to get some of this information. I may look it up later today.
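
Entrez Direct is a command-line front end to the NCBI E-utilities, so the same chain (Assembly, then BioSample, then SRA) can be sketched in Python against the E-utilities HTTP endpoints. This is a rough, untested-against-live-data sketch, not a worked-out solution. It assumes that `elink` links from `assembly` to `biosample` and from `biosample` to `sra` exist for the records in question, and every function name here is made up for illustration.

```python
import time
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

EUTILS = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def eutils_url(tool, **params):
    """Build an E-utilities URL, e.g. tool='esearch' or tool='elink'."""
    return f"{EUTILS}/{tool}.fcgi?{urllib.parse.urlencode(params)}"


def parse_esearch_ids(xml_text):
    """Return the UIDs listed in an esearch XML response."""
    root = ET.fromstring(xml_text)
    return [e.text for e in root.findall("./IdList/Id")]


def parse_elink_ids(xml_text):
    """Return the linked UIDs listed in an elink XML response."""
    root = ET.fromstring(xml_text)
    return [e.text for e in root.findall("./LinkSet/LinkSetDb/Link/Id")]


def fetch(url, pause=0.4):
    """GET a URL and return the body as text, pausing to stay under rate limits."""
    time.sleep(pause)
    with urllib.request.urlopen(url) as resp:
        return resp.read().decode("utf-8")


def sra_uids_for_organism(organism, retmax=20):
    """Map assembly UID -> list of linked SRA UIDs (only assemblies with reads)."""
    search = fetch(eutils_url("esearch", db="assembly",
                              term=f"{organism}[Organism]", retmax=retmax))
    result = {}
    for asm_uid in parse_esearch_ids(search):
        biosamples = parse_elink_ids(
            fetch(eutils_url("elink", dbfrom="assembly", db="biosample", id=asm_uid)))
        sra = []
        for bs_uid in biosamples:
            sra += parse_elink_ids(
                fetch(eutils_url("elink", dbfrom="biosample", db="sra", id=bs_uid)))
        if sra:
            result[asm_uid] = sra
    return result

# Example call (needs network access):
# sra_uids_for_organism("Bacillus cereus", retmax=5)
```

Note that these are numeric Entrez UIDs rather than run accessions; a further `esummary` or `efetch` request on `db=sra` would be needed to turn them into accessions you can download.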

ADD REPLY
