FASTA of an arbitrary genome: Is the first base the 5' end?
3
2
Entering edit mode
9.4 years ago
Marvin ▴ 220

Hello fellows,

I know that it doesn't matter which one of the 2 strands you declare the reference. But it should matter whether you store the strand (whichever you have chosen) in 5' -> 3' or 3' -> 5 orientation in a fasta file. When I download a sequence, is there sort of a convention that the first base is always the 5' (or 3') end? When I have a read that overhangs at the beginning of the sequence in the fasta file, is it a correct to say "The read overhangs the 5' end" ?

5' 3' fasta • 9.0k views
ADD COMMENT
2
Entering edit mode
9.4 years ago

Nucleotide fasta is always 5' to 3', as are all other formats.

The only exception I am aware of is Solid Colorspace, but it's not in nucleotide space, and has the interesting property that the reverse is the same as the reverse-complement, and as a result, it doesn't matter. But it's notable because their chemistry includes an enzyme that reads 3' to 5' for read 2.

Even so, when the colorspace reads were translated to nucleotide space, they were always represented 5' to 3'.

ADD COMMENT
0
Entering edit mode

If I recall correctly that technology produced reads from the same strand - pointing the same direction, but still right orientation. So it may have read the same fragment backwards from the end but it reversed it during reporting it. It was simple to do that since another property of the color space was that colors in reverse decode to a reverse sequence.

ADD REPLY
0
Entering edit mode
9.4 years ago

Depends on if you find the ATG codon at the beginning of a gene that is annotated on the + strand :). I'm not sure that there is a convention but it's surely more confusing to distribute sequences as 3' > 5' and so probably uncommon.

ADD COMMENT
0
Entering edit mode
9.4 years ago

This is an interesting question because my first answer is of course it is in 5' to 3' direction but then I don't know of a rule that requires this - other than pandemonium would break out otherwise.

ADD COMMENT

Login before adding your answer.

Traffic: 1913 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6