Extract each annotation in 'info' column in snpEff output
0
1
Entering edit mode
9.2 years ago
jan ▴ 170

Hi,

I have used snpEff to annotate my vcf file. I previously use Annovar to annotate my vcf file and the fields are nicely separated in tabular forms, hence making it easy to extract information for further analysis.

snpEff annotation is written in 'info' column in one line separated by a pipe |, making it difficult to extract certain information . Is there a tool to separate each annotation into separate tab ? I tried to write a python script (with a very limited knowledge about programming ) but it gets messy .

snpEff sequencing • 4.4k views
ADD COMMENT
1
Entering edit mode

awk would probably be able to do it efficiently. Exactly which information do you want? Do you need the tab-delimited stuff at the beginning, or the semi-colon-delimited values, etc?

A simple way would be to just to specify your field separators at the start of an awk command, and split everything into tab-delimited columns.

ADD REPLY
2
Entering edit mode

I just found out about snpSift and the tool suits my work

ADD REPLY
1
Entering edit mode

That looks like a useful tool, thank you for making me aware of snpSift!

ADD REPLY

Login before adding your answer.

Traffic: 1975 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6