Dear all,
i have a two columns table of 20k lines.
1st column: list of gene IDs (there can be duplicated IDs)
2nd column: a value
What i want is to rank my list leaving with only unique gene IDs. For the duplicated gene IDs i want to leave only the ones with the highest score.
here an example, Thanks in advance
TMCS09g1008699 6.4
TMCS09g1008671 6.4
TMCS09g1008672 6.5
TMCS09g1008673 6
TMCS09g1008674 5.4
TMCS09g1008675 5.4
TMCS09g1008676 4.9
TMCS09g1008677 4.6
TMCS09g1008677 4.4
TMCS09g1008679 4.3
TMCS09g1008680 3.9
TMCS09g1008681 3.8
TMCS09g1008682 3.6
TMCS09g1008683 3.5
TMCS09g1008684 3.5
TMCS09g1008685 3.4
TMCS09g1008686 3.4
TMCS09g1008687 3.4
TMCS09g1008688 3
TMCS09g1008689 2.6
TMCS09g1008690 2
TMCS09g1008699 5.9
there's a mistake here, it takes the lowest
Can you try it again ?
when i put it into a script in this way:
i get a partial output and then this error:
do you know what that means?
Can you use
test["V1"]
instead oftest$V1
? You get this error becausetest$V1
is non-recursive object. You can find more info here.i did, now i get this:
Very useful code. Any idea how to make this working for a list of data frames???
https://pasteboard.co/JlDwYx4.png