Fasta Sequence Construction
0
0
Entering edit mode
12.6 years ago
Jessica • 0

I'm trying to finish up an assignment (writing a software tool that will extract gene sequences from the chromosome) and this is the only question I can't get. Can anyone walk me through this? THANKS!

For each (non-header) line in the gene file you need to create an entry for the FastA file as follows. For a sequence like:

NM_00100344 chr11 + 5925152 5926098 5925152 5926098 2 5925152,5925652, 5925404,5926098,

I want an information line that looks like the following (unspliced version):

>NM_00100343|chr11(+):5925152Z5926098

or (spliced&version):

>NM_00100343|chr11(+):5925152Z5926098|5925151Z5925404,5925652Z5926098

Following the information line should be the actual sequence (extracted from the chromosome file, and reverse-complemented if necessary), using a column width of 70.

python gene sequence • 2.0k views
ADD COMMENT
2
Entering edit mode

is this homework? you should tag it with the homework tag if it is. Also, what have you tried and why is it not working? Are you running into errors or do you have no idea whatsover how to start addressing the problem?

ADD REPLY
0
Entering edit mode

It looks like a homework assignment.. Anyway, is it limited to solved this problem by python?

ADD REPLY
0
Entering edit mode

It's okay to ask questions about homework here.

But it will be better if you share your opinion and what you have tried to solve this problem ^^

ADD REPLY
0
Entering edit mode

It is okay, but you should tag it with a homework tag. The answer given can differ between homework and non-homework questions (hints how to solve it instead of giving the answer outright)

ADD REPLY

Login before adding your answer.

Traffic: 2699 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6