Custom search and download of sequences for a given rotavirus strain from databases
9.1 years ago

Hi everybody,

I am working on rotavirus. Its genome is not a single long molecule, as is usual, but 11 separate segments. Each segment corresponds to a gene, so there are 11 genes, and every viral strain has all 11. These sequences can be found in the biological databases, but for a given strain it often happens that not all genes were sequenced, and/or only partial sequences are available for some genes. So my question is: how can I retrieve, with a script or some other tool, only the full sequences of the 11 genes for the same viral strain? For example:

Strain 1: gene 1 (full sequence), gene 2 (full sequence), gene 3 (full sequence), gene 4 (full sequence), gene 5 (full sequence), gene 6 (full sequence), gene 7 (full sequence), gene 8 (full sequence), gene 9 (full sequence), gene 10 (full sequence), gene 11 (full sequence).
Strain 2: gene 1 (full sequence), gene 2 (full sequence), gene 3 (full sequence), gene 4 (full sequence), gene 5 (full sequence), gene 6 (full sequence), gene 7 (full sequence), gene 8 (full sequence), gene 9 (full sequence), gene 10 (full sequence), gene 11 (full sequence).
...

and so on.

Thank you for your help

gene virus genome sequence • 2.2k views

I worked on rotavirus during my thesis. Apart from the few classical strains (RF..), I'm afraid there is no solution to your question.


Do you think so?

I think there is a solution for that using scripts.

Maybe we can create a personal database linked to the international databases and use a script to pull out what we are looking for.


As far as I remember, most sequences are partial or poorly annotated, or only a few segments have been sequenced.


No, I have collected more than 130 full genome sequences so far.


Great! So things have changed. What was your method?


Just manually, from GenBank.


That is why I want a faster method to retrieve the information automatically.


Again, what was your method? If "manually" means you went through the articles and picked out the accession numbers, then this cannot be automated. If you found a way (e.g. a feature in GenBank) to get all the sequences for a given strain, then we might be able to help you.


Hi,

Yes, manually, through articles.

I think we can retrieve the rotavirus A sequences from GenBank and then keep only the strains that appear 11 times. The strain name is given in the title of each sequence, so we can collect all the gene sequences that belong to the same strain. We then filter the results again to keep only the sequences marked as complete. That way we get all the complete sequences for the same strain.
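The filtering idea above can be sketched in plain Python. This is only a rough illustration: it assumes each record is an (accession, title) pair and that titles look like "Rotavirus A strain <name> segment <n> ..., complete cds". That title layout, the regular expression, and the function name `complete_genomes` are assumptions made for the example; real GenBank titles vary a lot and would need a more tolerant parser.

```python
import re
from collections import defaultdict

# Hypothetical title layout (an assumption, real titles vary):
#   "Rotavirus A strain <name> segment <n> ..., complete cds"
TITLE_RE = re.compile(
    r"strain\s+(?P<strain>\S+)\s+segment\s+(?P<segment>\d+)", re.IGNORECASE
)
COMPLETE_RE = re.compile(r"\bcomplete\b", re.IGNORECASE)
N_SEGMENTS = 11  # rotavirus has 11 genome segments


def complete_genomes(records):
    """Return {strain: {segment: accession}} for strains whose
    11 segments are all present and all marked complete.

    records: iterable of (accession, title) pairs.
    """
    by_strain = defaultdict(dict)
    for accession, title in records:
        # Skip partial sequences: the title must contain the word "complete".
        if not COMPLETE_RE.search(title):
            continue
        match = TITLE_RE.search(title)
        if match is None:
            continue  # title does not follow the assumed layout
        segment = int(match.group("segment"))
        if 1 <= segment <= N_SEGMENTS:
            # Keep the first accession seen for each segment of a strain.
            by_strain[match.group("strain")].setdefault(segment, accession)
    # Keep only strains with all 11 distinct segments.
    return {
        strain: segments
        for strain, segments in by_strain.items()
        if len(segments) == N_SEGMENTS
    }
```

A strain with ten complete segments and one partial segment is dropped entirely, which matches the goal of keeping only strains whose 11 genes are all fully sequenced.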


Does that mean you want to retrieve a sequence that has not been sequenced or not been uploaded?


No. I just want to get from the database (NCBI) only the full sequences of the 11 genes that belong to the same strain or isolate.


Thank you, I will try that.

9.1 years ago
skbrimer ▴ 740

I'm working with reovirus and am in a very similar spot. I just did it the long way, like you ended up doing, but, like you, if I could have figured out a script I would have. I'm not a strong coder, but I know you can do something similar in Biopython, see http://biopython.org/DIST/docs/tutorial/Tutorial.html#chapter:entrez There are also some threads on Biostars:

Retrieving Fasta Sequences From Ncbi Using Biopython

Biopython Entrez.Efetch Problem

Also, you could try contacting Peter, as he is one of the developers of Biopython. He may be able to help you with this as well.
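As a starting point, here is a minimal sketch of what the Biopython route might look like. It searches the NCBI nucleotide database, downloads the hits in GenBank format, and reads the strain (or isolate) and segment from each record's source feature. The search term, the `retmax` value, and the helper names `build_query` and `fetch_records` are assumptions made for illustration, not a tested recipe, and many records will lack one or both qualifiers.

```python
def build_query(organism="Rotavirus A", complete_only=True):
    """Build an Entrez search term.

    [Organism] and [Title] are Entrez field tags; restricting the
    title to "complete" is a guess at a useful filter.
    """
    term = f'"{organism}"[Organism]'
    if complete_only:
        term += " AND complete[Title]"
    return term


def fetch_records(email, term, retmax=200):
    """Yield (accession, strain, segment, description) for each hit.

    Requires Biopython and network access to NCBI.
    """
    # Imported here so build_query can be used without Biopython installed.
    from Bio import Entrez, SeqIO

    Entrez.email = email  # NCBI asks for a contact address
    handle = Entrez.esearch(db="nucleotide", term=term, retmax=retmax)
    ids = Entrez.read(handle)["IdList"]
    handle.close()
    if not ids:
        return

    handle = Entrez.efetch(
        db="nucleotide", id=",".join(ids), rettype="gb", retmode="text"
    )
    for record in SeqIO.parse(handle, "genbank"):
        # The "source" feature carries qualifiers such as strain and segment.
        source = next((f for f in record.features if f.type == "source"), None)
        qualifiers = source.qualifiers if source is not None else {}
        strain = (qualifiers.get("strain") or qualifiers.get("isolate") or [None])[0]
        segment = qualifiers.get("segment", [None])[0]
        yield record.id, strain, segment, record.description
    handle.close()
```

Calling `fetch_records("you@example.org", build_query())` (with your own address) would give one tuple per GenBank record, which can then be grouped by strain to see which strains have all 11 segments. For large result sets, the requests would need to be split into batches.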


Thank you, I will try that.
