Remove duplicate lines in a vcf file
1
0
Entering edit mode
10 months ago
Patrick ▴ 10

I have a vcf file that I created from 23andme raw data results.

In this vcf file, there are lines that have the exact same chromosome number and position, however they have a different rsID. Here is a screenshot:

Screenshot of what I have

They are causing errors in a program I am trying to run for PRS calculation. Should I remove them and how can I do that? If I remove them, which one do I keep?

I searched for similar questions online, but couldn't find anything that resembles my exact situation.

variants vcf • 667 views
ADD COMMENT
0
Entering edit mode

Sry I forgot to mark them.

ADD REPLY
1
Entering edit mode
10 months ago
bcftools norm --remove-duplicates

If I remove them, which one do I keep?

you just have a different ID, so you don't really care...

ADD COMMENT

Login before adding your answer.

Traffic: 1531 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6