Pandas Data Frame

0

Entering edit mode

2.1 years ago

priya.bmg ▴ 60

Hello

I have a column with many semicolon separated values and it is not same number of values in each row. For example, in row 1, there are five headings separated by semicolon and in row 2 there are seven headings separated by semicolon and so on

Row1 SVTYPE=DEL;END=42784444;SVLEN=-2206;BCSQ=sv:cds|TTBK2||protein_coding|-||;hetalt=002-020_dedup

Row2 IMPRECISE;SVTYPE=DEL;END=63493240;SVLEN=-2060;SVANN=TANDEM;BCSQ=sv:cds|EEF1A2||protein_coding|-||;hetalt=002-016,002-051

Is there a way to organize the data in Pandas dataframe, such a way that when I divide the semicolon separated values into columns, these values don't get interchanged or misplaced in the separate columns

Thanks

vcf GATK-SV pandas annotation • 638 views

ADD COMMENT • link updated 2.1 years ago by husensofteng ▴ 410 • written 2.1 years ago by priya.bmg ▴ 60

0

Entering edit mode

maybe you want to consider vcfpy in case you are handling a VCF file!

ADD REPLY • link 2.1 years ago by husensofteng ▴ 410

Login before adding your answer.