7.4 years ago
wangshx
The chromosome_strand column should contain "-" or "+". It looks right when I run zcat simple_somatic_mutation.open.tsv.gz | head -5 to view the head of the data (an ICGC TSV file). But when I use awk to extract the columns I am interested in, the chromosome_strand values all change to 1. Does anybody know why?
$ zcat simple_somatic_mutation.open.tsv.gz | head -5 | awk 'BEGIN{FS="\t";OFS="\t";} {print $1,$2,$3,$12}'
icgc_mutation_id icgc_donor_id project_code chromosome_strand
MU28469596 DO50633 BOCA-FR 1
MU28469596 DO50633 BOCA-FR 1
MU28469596 DO50633 BOCA-FR 1
MU28469596 DO50633 BOCA-FR 1
Please paste the output of zcat simple_somatic_mutation.open.tsv.gz | head -5
............. too many columns to paste. Thanks for the reminder. I was wrong~
So your 12th column is not the strand, as you may have noticed too.
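One way to avoid counting columns by hand is to list the header fields with their positions, or to let awk look columns up by header name. This is only a minimal sketch: the three-column sample below is made up for illustration and stands in for the real ICGC file, which has many more columns.

```shell
# Hypothetical tab-separated sample standing in for the real ICGC file
sample=$(printf 'icgc_mutation_id\tchromosome\tchromosome_strand\nMU1\t1\t+\nMU2\t2\t-\n')

# Print each header field with its column number, so the right index can be read off
header_index=$(printf '%s\n' "$sample" |
  awk -F'\t' 'NR==1 { for (i = 1; i <= NF; i++) print i, $i }')
printf '%s\n' "$header_index"

# Select columns by header name instead of a hard-coded position
selected=$(printf '%s\n' "$sample" |
  awk 'BEGIN { FS = OFS = "\t" }
       NR==1 { for (i = 1; i <= NF; i++) col[$i] = i }   # map header name to index
       { print $(col["icgc_mutation_id"]), $(col["chromosome_strand"]) }')
printf '%s\n' "$selected"
```

For the real data, replacing the printf on the first line of each pipeline with zcat simple_somatic_mutation.open.tsv.gz should work the same way, assuming the header names match those shown in the question.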