How can you sort variants based on their positions from upstream to downstream?
1
0
Entering edit mode
6.7 years ago
bioinfo89 ▴ 60

Hello!

I have a set of variants with coding sequence positions for eg:

c.-3A>C

c.134+1G>A

c.*5362T>C

~10,000 of such variants in a single file.

Aim: I want to sort them based on their position from upstream to downstream of the gene (5'UTR variants followed by coding region and then followed by 3' UTR variants).

Is there any method or a tool available which can do this?

Any help will be appreciated!

snp • 1.5k views
ADD COMMENT
0
Entering edit mode

Hello bioinfo89,

does your file realy just contains this variant description? What about the gene and transcript informations?

fin swimmer

ADD REPLY
1
Entering edit mode
6.7 years ago

You really need to provide more information on what kind of file format you have, but for shits and giggles, let's assume it's a bed file with the first three columns: chromsome, start, stop. Sorting is then as easy as:

sort -k 1,1 -k 2,2n myfile >outfile
ADD COMMENT
0
Entering edit mode

Oh yes I should have mentioned this before, I have coding sequence positions for the variants eg. c.-3A>C, c.134+1G>A, c.*5362T>C.

ADD REPLY
0
Entering edit mode

Please edit your original question with this additional information.

ADD REPLY
0
Entering edit mode

These are still not enough informations. Please post the first lines of you file.

fin swimmer

ADD REPLY

Login before adding your answer.

Traffic: 1947 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6