I have a Diamond BLASTx tsv file that has the Gene_Descriptions:
I have a csv file of the RNAseq expression data:
I want to match the gene names (column E) and descriptions from the tsv file to the rows of the csv file, using the IDs in column B of the tsv file and the IDs in column A of the csv file, and insert them into column H of the csv file, so that I have everything in one file and don't have to cross-reference. Can anybody point me to a command-line script that can do this? Thanks in advance!!
Column E from the .tsv file and column A from the .csv file are nothing alike. Did you mean the respective columns B and A? I have a Python script that can merge the two files according to column B from the first file and column A from the second. That won't be exactly what you wanted, but everything will be in one file and you can later delete or move the columns as needed. Let me know if you want to try that.
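For anyone following along, here is a minimal sketch of that kind of merge, not necessarily the exact script mentioned above. It assumes the IDs sit in column B of the .tsv (index 1) and column A of the .csv (index 0), that the .csv is comma-separated, and that only the first hit per ID in the .tsv should be kept. The function names `load_annotations` and `merge_files` and the file names in the usage note are made up for illustration.

```python
import csv


def load_annotations(tsv_path, id_col=1):
    """Map each ID in the TSV to its full row (assumption: first hit per ID wins)."""
    annotations = {}
    with open(tsv_path, newline="") as handle:
        for row in csv.reader(handle, delimiter="\t"):
            if len(row) > id_col:
                # setdefault keeps the first row seen for an ID and ignores later ones
                annotations.setdefault(row[id_col], row)
    return annotations


def merge_files(tsv_path, csv_path, out_path, tsv_id_col=1, csv_id_col=0):
    """Append the matching TSV row to every CSV row that shares an ID.

    CSV rows without a match (including a header line, if present) are
    written unchanged. Returns the number of CSV rows that were matched.
    """
    annotations = load_annotations(tsv_path, tsv_id_col)
    matched = 0
    with open(csv_path, newline="") as src, open(out_path, "w", newline="") as dst:
        writer = csv.writer(dst)
        for row in csv.reader(src):
            hit = annotations.get(row[csv_id_col]) if len(row) > csv_id_col else None
            if hit is not None:
                matched += 1
                writer.writerow(row + hit)  # TSV columns go after the CSV columns
            else:
                writer.writerow(row)
    return matched
```

You would call it as something like `merge_files("diamond_hits.tsv", "expression.csv", "merged.csv")`. Since the expression file apparently has data in columns A to G, the appended .tsv columns would start at column H; the columns you don't need can be deleted afterwards.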
ok yes! So it will just merge the files but rows will be matched based on the IDs? I'll try it out!