Entering edit mode
2.9 years ago
adnan.lahuf
•
0
Dear Colleagues, I have conducted a BLAST analysis in Galaxy and obtained a table with 3,400,000 lines. How can I summarize it and remove repeating or similar information using galaxy platform or another tool suggested. Thanks in advance A very happy new year Adnan
With large tabular data you generally want to use a programming language like R or Python. I personally recommend R using either the base functions or the tidyverse libraries (see here). For Python Pandas would be the go-to library.