I have a dataset with 61 lists, I want to compare each list with remaining 60, and grep the difference in each list with remaining others
This is how list look like
C 00010 Glycolysis / Gluconeogenesis [PATH:hca00010]
C 00020 Citrate cycle (TCA cycle) [PATH:hca00020]
C 00030 Pentose phosphate pathway [PATH:hca00030]
C 00040 Pentose and glucuronate interconversions [PATH:hca00040]
C 00051 Fructose and mannose metabolism [PATH:hca00051]
C 00052 Galactose metabolism [PATH:hca00052]
C 00500 Starch and sucrose metabolism [PATH:hca00500]
C 00520 Amino sugar and nucleotide sugar metabolism [PATH:hca00520]
lists contain near about similar data, I want output only of dissimilar lines among all lists with list name.
what about just:
?
uniq -c
oruniq -cu
? OP wants dissimilar lines. As dissimilar ones occurs only once,c
is redundant.uniq -u
should be sufficient